The HDFS threshold has been reached. What is our approach to resolve this issue?

Question

Rishi · Answer

DataNodes fill their disks unevenly. Often some disks are full while others still have free space. You can use the DiskBalancer tool to solve this: it lets administrators rebalance data across the multiple disks of a single DataNode. To know more, refer to https://issues.apache.org/jira/browse/HDFS-1312
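A minimal sketch of the usual DiskBalancer workflow, written as a small Python helper that only builds the command lines and does not run them. The `-plan`, `-execute` and `-query` subcommands belong to the `hdfs diskbalancer` CLI; the hostname and plan file path are hypothetical placeholders (the plan step reports where it actually wrote the plan), and this assumes the feature is enabled through `dfs.disk.balancer.enabled` in hdfs-site.xml.

```python
def diskbalancer_commands(datanode, plan_file):
    """Return the hdfs diskbalancer command lines for one DataNode, in order."""
    return [
        # 1. Compute a plan describing which data to move between disks.
        ["hdfs", "diskbalancer", "-plan", datanode],
        # 2. Ask the DataNode to execute the generated plan file.
        ["hdfs", "diskbalancer", "-execute", plan_file],
        # 3. Check the status of the running or finished plan.
        ["hdfs", "diskbalancer", "-query", datanode],
    ]


# Hypothetical hostname and plan path, for illustration only.
for cmd in diskbalancer_commands("dn1.example.com", "/path/to/dn1.plan.json"):
    print(" ".join(cmd))
```

Printing the commands first makes it easy to review them before running each one by hand on the cluster.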

Gitika · Answer

This was a fundamental issue in HDFS for a long time, but there is a new tool called DiskBalancer. It essentially lets you create a plan file that describes how data will be moved from disk to disk, and then you ask a DataNode to execute it. If one disk is over-utilized, writes will fail whenever the DataNode picks that disk, so you need to make sure data is distributed evenly across the disks. That is what DiskBalancer does for you: it computes how much to move for each disk type.
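To illustrate the idea of "how much to move", here is a simplified sketch, not DiskBalancer's actual planner. It assumes all disks share one storage type and that the goal is for every disk to reach the node-wide used/capacity ratio; the function name and the sample figures are made up for illustration.

```python
def bytes_to_move(disks):
    """Map each disk to the amount it should shed (+) or absorb (-).

    disks: dict of disk name -> (used, capacity), all in the same unit.
    Simplifying assumption: a single storage type, and every disk is
    brought to the same used/capacity ratio as the node as a whole.
    """
    total_used = sum(used for used, _ in disks.values())
    total_capacity = sum(cap for _, cap in disks.values())
    ideal_ratio = total_used / total_capacity
    return {
        name: used - ideal_ratio * cap
        for name, (used, cap) in disks.items()
    }


# Hypothetical DataNode with two equally sized disks (values in GB).
moves = bytes_to_move({"/data1": (800, 1000), "/data2": (200, 1000)})
for disk, amount in moves.items():
    print(disk, amount)
```

A positive value means the disk is above the node-wide ratio and should give up data; a negative value means it has room to receive data.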


