Explain to me the difference between Hadoop HDFS and HBase.

May 13 in Big Data Hadoop by nitinrawat895

1 answer to this question.

Let me explain the major differences between the two.

Hadoop is made up of three core components:

  • Hadoop Distributed File System (HDFS)
  • MapReduce
  • Yet Another Resource Negotiator (YARN)

HDFS is the storage layer of Hadoop; you can also think of it as Hadoop's file system.

MapReduce is the processing layer: a programming model, with jobs typically written in Java, that performs computations on the stored data in parallel across the cluster.

YARN sits between HDFS and MapReduce as the resource manager: it allocates cluster resources and schedules the jobs that process the data stored in HDFS.
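To make the MapReduce idea more concrete, here is a minimal single-machine sketch of the map, shuffle, and reduce steps in plain Python. It is only an illustration of the programming model, not the Hadoop API: the function names (`map_phase`, `shuffle`, `reduce_phase`, `word_count`) are made up for this example, and a real job would run these steps in parallel across the cluster.

```python
from collections import defaultdict


def map_phase(line):
    """Map step: emit a (word, 1) pair for every word in one input line."""
    return [(word.lower(), 1) for word in line.split()]


def shuffle(pairs):
    """Shuffle step: group all emitted values by their key."""
    grouped = defaultdict(list)
    for key, value in pairs:
        grouped[key].append(value)
    return grouped


def reduce_phase(key, values):
    """Reduce step: combine all values for one key into a single result."""
    return key, sum(values)


def word_count(lines):
    """Run the three steps in sequence (hypothetical helper for this sketch)."""
    pairs = [pair for line in lines for pair in map_phase(line)]
    grouped = shuffle(pairs)
    return dict(reduce_phase(key, values) for key, values in grouped.items())


if __name__ == "__main__":
    print(word_count(["HDFS stores files", "HBase stores rows"]))
```

The point to notice is that every step works on whole batches of input, which is why this style of processing suits large files rather than single-record lookups.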

HDFS is designed to store very large amounts of data, distributed across commodity hardware.

HDFS provides high-throughput sequential access to data, but it is not suited to low-latency random reads and does not allow random (in-place) writes.

This is where HBase comes in: it is a NoSQL database that runs on top of a Hadoop cluster and gives you real-time random read and write access.

Both HBase and HDFS support a range of operations on data, but they organize that data differently.

HDFS stores data in the form of files and HBase stores data in the form of key-value pairs.
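The practical effect of "files versus key-value pairs" can be sketched with two toy classes. This is a rough illustration under simplifying assumptions, not how HDFS or HBase are implemented: `AppendOnlyFile` and `SortedKeyValueStore` are hypothetical names, everything lives in memory, and nothing is distributed.

```python
import bisect


class AppendOnlyFile:
    """Toy stand-in for a write-once file: append records, read by scanning."""

    def __init__(self):
        self._records = []

    def append(self, key, value):
        # Records are only ever added at the end, never changed in place.
        self._records.append((key, value))

    def find(self, key):
        """Scan the whole file; return (latest value for key, records read)."""
        result = None
        steps = 0
        for record_key, value in self._records:
            steps += 1
            if record_key == key:
                result = value
        return result, steps


class SortedKeyValueStore:
    """Toy stand-in for a key-value store with keys kept in sorted order."""

    def __init__(self):
        self._keys = []
        self._values = {}

    def put(self, key, value):
        # A new key is inserted in sorted position; an existing key is updated.
        if key not in self._values:
            bisect.insort(self._keys, key)
        self._values[key] = value

    def get(self, key):
        """Random read: look up one key directly, without scanning."""
        return self._values.get(key)

    def scan(self, start, stop):
        """Return (key, value) pairs with start <= key < stop, in key order."""
        lo = bisect.bisect_left(self._keys, start)
        hi = bisect.bisect_left(self._keys, stop)
        return [(key, self._values[key]) for key in self._keys[lo:hi]]
```

With the file-style class, finding one record means reading everything, and an update can only be expressed as a newly appended record. With the key-value class, a single `get` or `put` touches just the key you ask for, which is the kind of access HBase is meant to provide.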

Some of the important differences are as follows.

Hadoop (HDFS)

  • Optimized for streaming access to large files.
  • Follows a write-once, read-many model.
  • Doesn't support low-latency random reads or in-place writes.

HBase

  • Stores key/value pairs in a column-oriented fashion (columns are grouped together into column families).
  • Provides low-latency access to small amounts of data from within a large data set.
  • Provides a flexible data model.
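The column-family layout mentioned above can be pictured as nested maps: row key, then column family, then column qualifier, then value. Below is a small in-memory sketch of that shape. `ToyTable` and its methods are hypothetical names for illustration only and are not the HBase client API; the sketch also assumes families are fixed when the table is created, while qualifiers can be added freely per row.

```python
from collections import defaultdict


class ToyTable:
    """Toy model of a table: row key -> column family -> qualifier -> value."""

    def __init__(self, families):
        # Assumption for this sketch: families are declared up front.
        self.families = set(families)
        self.rows = defaultdict(lambda: defaultdict(dict))

    def put(self, row_key, family, qualifier, value):
        """Write one cell; any qualifier is allowed inside a known family."""
        if family not in self.families:
            raise KeyError("unknown column family: %s" % family)
        self.rows[row_key][family][qualifier] = value

    def get(self, row_key, family=None):
        """Read one row, optionally restricted to a single column family."""
        row = self.rows.get(row_key, {})
        if family is None:
            return {name: dict(columns) for name, columns in row.items()}
        return dict(row.get(family, {}))


if __name__ == "__main__":
    table = ToyTable(["info", "stats"])
    table.put("user1", "info", "name", "Asha")
    table.put("user1", "stats", "logins", 3)
    table.put("user2", "info", "email", "b@example.com")
    print(table.get("user1"))
    print(table.get("user2", "info"))
```

Note that `user1` and `user2` carry different columns inside the same `info` family. That per-row freedom is what "flexible data model" refers to here.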

Hadoop (HDFS with MapReduce) is geared toward batch processing, whereas HBase is used in real-time data processing environments.

answered May 13 by ravikiran

edited May 14 by Omkar
