Why do we need Hadoop framework?

0 votes

What is the need of Hadoop framework? Was not Distributed File System enough to handle the large volume of data?

Apr 9, 2019 in Big Data Hadoop by sunny

edited Apr 12, 2019 by Gitika 258 views

1 answer to this question.

0 votes

The function of Distributes File System is to partitions the data, store and manage the data across different machines. DFS can handle the large volume of data, but Hadoop framework will help to process the large amount of data.

Large data is divided into several blocks and stored in different commodity hardware (storing data into distributed way). 

Lets take an example in general:

If we want to process those data, first we will go to commodity hardware and will copy the data to the processing unit and finally, we will do the processing.

This process has some complications, for large data. 

  • When transferring large data some data may lose and make trouble.
  •  But if we use HADOOP, we don't need to take the data to the processing unit. 
  • Instead, we will take our processing unit to the commodity hardware, where our data is stored.
  • Processing can be done over there and we can take the output.

This is how Hadoop framework is more beneficial then DFS.

 

answered Apr 9, 2019 by Gitika
• 42,520 points

edited Apr 12, 2019 by Gitika

Related Questions In Big Data Hadoop

0 votes
1 answer

why do we need MaPReduce in BigData Hadoop?

Hi, As we know Hadoop provides Hdfs as ...READ MORE

answered Feb 4 in Big Data Hadoop by MD
• 68,280 points
107 views
0 votes
1 answer

What do we exactly mean by “Hadoop” – the definition of Hadoop?

The official definition of Apache Hadoop given ...READ MORE

answered Mar 16, 2018 in Big Data Hadoop by Shubham
810 views
0 votes
1 answer

What skills do I need to become Hadoop Admin?

You have heard it right. There is ...READ MORE

answered Nov 8, 2018 in Big Data Hadoop by anonymous
76 views
0 votes
2 answers

Do we need to format datanode to store data

That was not important. If you can ...READ MORE

answered Jan 15, 2019 in Big Data Hadoop by Sam
• 160 points
656 views
+1 vote
1 answer

Hadoop Mapreduce word count Program

Firstly you need to understand the concept ...READ MORE

answered Mar 16, 2018 in Data Analytics by nitinrawat895
• 10,950 points
6,145 views
0 votes
1 answer

hadoop.mapred vs hadoop.mapreduce?

org.apache.hadoop.mapred is the Old API  org.apache.hadoop.mapreduce is the ...READ MORE

answered Mar 16, 2018 in Data Analytics by nitinrawat895
• 10,950 points
941 views
+1 vote
11 answers

hadoop fs -put command?

put syntax: put <localSrc> <dest> copy syntax: copyF ...READ MORE

answered Dec 7, 2018 in Big Data Hadoop by Aditya
39,862 views
–1 vote
1 answer

Hadoop dfs -ls command?

In your case there is no difference ...READ MORE

answered Mar 16, 2018 in Big Data Hadoop by kurt_cobain
• 9,320 points
2,306 views
0 votes
1 answer

Why do we need the FOR EACH operation in Pig Scripts?

The operation FOREACH in Apache Pig is ...READ MORE

answered Apr 30, 2019 in Big Data Hadoop by Gitika
• 42,520 points
54 views
0 votes
1 answer

Why we need to install "ant -Dhadoopversion=23" ? What is the use of this?

Hi, We install ant to build pig, If the ...READ MORE

answered Apr 26, 2019 in Big Data Hadoop by Gitika
• 42,520 points
82 views