Explain to me the difference between HadoopHDFS and HBase.

0 votes
May 13, 2019 in Big Data Hadoop by nitinrawat895
• 10,950 points
70 views

1 answer to this question.

0 votes

Let me explain the major differences between both.

Hadoop is basically made up of three components. 

  • Hadoop Distributed File-System
  • MapReduce
  • Yet-Another-Resource-Negotiator

HDFS is the storage unit of Hadoop or you can also call it as a file system of Hadoop.

MapReduce is considered as a Java Program which is designed to perform the computational operations on data in order to process it.

YARN is like a mediator in between the HDFS and MapReduce.

HDFS is generally designed to store data in huge amounts in a distributed manner amongst commodity hardware.

HDFS provides faster data accessing but lacks random read and write capabilities.

This is where the HBase is required as it is capable to provide a NoSQL database on top of Hadoop cluster and provides you real-time random read and write options.

Both HBase and HDFS can provide many operations that can be performed on data.

HDFS stores data in the form of files and HBase stores data in the form of key-value pairs.

Some of the important differences are as follows.

Hadoop

Optimized for streaming access to large files.
Follows write-once read-many ideology.
Doesn't support random read/write.

HBase

Stores key/value pairs in columnar fashion (columns are clubbed together as column families).
Provides low latency access to small amounts of data from within a large data set.
Provides flexible data model.

Hadoop is limited to Batch-Processing while on the other hand HBase is used in real-time data processing environment.

answered May 13, 2019 by ravikiran
• 4,600 points

edited May 14, 2019 by Omkar

Related Questions In Big Data Hadoop

+1 vote
1 answer

Explain to me the difference between HBase and HDFS

Let's start from scratch. Hadoop basically consists of three ...READ MORE

answered Mar 19, 2019 in Big Data Hadoop by nitinrawat895
• 10,950 points
219 views
+1 vote
1 answer

Explain to me the difference between Name node and Secondary name node

First of all, let understand what is ...READ MORE

answered May 14, 2019 in Big Data Hadoop by ravikiran
• 4,600 points
402 views
0 votes
2 answers

Explain to me the difference between Name Node and Secondary Name Node

Secondary namenode is just a helper for ...READ MORE

answered Aug 6, 2019 in Big Data Hadoop by Dhiman
469 views
0 votes
1 answer

Explain to me the method to transfer data between Azure tables and Hadoop on Azure

this article on HiveStorageHandler will let you create ...READ MORE

answered May 2, 2019 in Big Data Hadoop by ravikiran
• 4,600 points
76 views
0 votes
1 answer

Explain to me the relationship between Hadoop and Databases.

Okay, that's a lot of queries together, ...READ MORE

answered May 15, 2019 in Big Data Hadoop by ravikiran
• 4,600 points
381 views
0 votes
1 answer

Explain to me the Elasticsearch and Hadoop in a much better manner

I understand your problem, I suggest you download ...READ MORE

answered May 10, 2019 in Big Data Hadoop by ravikiran
• 4,600 points
81 views
0 votes
1 answer

What is the difference between the disable and disable-all keyword in HBase?

Hey, disable:- This command will start disabling the ...READ MORE

answered May 24, 2019 in Big Data Hadoop by Gitika
• 50,980 points
214 views
0 votes
1 answer

what is the difference between drop and drop_all command in Hbase?

Hey, drop:- This command is used to drop ...READ MORE

answered May 24, 2019 in Big Data Hadoop by Gitika
• 50,980 points
157 views
0 votes
1 answer

Explain to me the difference between HBase and HDFS.

Hadoop generally consists of three major components: HDFS It ...READ MORE

answered Apr 12, 2019 in Big Data Hadoop by ravikiran
• 4,600 points
414 views
0 votes
1 answer

Explain to me the difference between name node and secondary name node.

Firstly, You need to understand the major ...READ MORE

answered Apr 30, 2019 in Big Data Hadoop by ravikiran
• 4,600 points
197 views