Explain to me the difference between HadoopHDFS and HBase.

0 votes
May 13 in Big Data Hadoop by nitinrawat895
• 9,030 points
9 views

1 answer to this question.

Your answer

Your name to display (optional):
Privacy: Your email address will only be used for sending these notifications.
0 votes

Let me explain the major differences between both.

Hadoop is basically made up of three components. 

  • Hadoop Distributed File-System
  • MapReduce
  • Yet-Another-Resource-Negotiator

HDFS is the storage unit of Hadoop or you can also call it as a file system of Hadoop.

MapReduce is considered as a Java Program which is designed to perform the computational operations on data in order to process it.

YARN is like a mediator in between the HDFS and MapReduce.

HDFS is generally designed to store data in huge amounts in a distributed manner amongst commodity hardware.

HDFS provides faster data accessing but lacks random read and write capabilities.

This is where the HBase is required as it is capable to provide a NoSQL database on top of Hadoop cluster and provides you real-time random read and write options.

Both HBase and HDFS can provide many operations that can be performed on data.

HDFS stores data in the form of files and HBase stores data in the form of key-value pairs.

Some of the important differences are as follows.

Hadoop

Optimized for streaming access to large files.
Follows write-once read-many ideology.
Doesn't support random read/write.

HBase

Stores key/value pairs in columnar fashion (columns are clubbed together as column families).
Provides low latency access to small amounts of data from within a large data set.
Provides flexible data model.

Hadoop is limited to Batch-Processing while on the other hand HBase is used in real-time data processing environment.

answered May 13 by ravikiran
• 1,460 points

edited May 14 by Omkar

Related Questions In Big Data Hadoop

+1 vote
1 answer

Explain to me the difference between HBase and HDFS

Let's start from scratch. Hadoop basically consists of three ...READ MORE

answered Mar 19 in Big Data Hadoop by nitinrawat895
• 9,030 points
36 views
+1 vote
1 answer
0 votes
1 answer

Explain to me the method to transfer data between Azure tables and Hadoop on Azure

this article on HiveStorageHandler will let you create ...READ MORE

answered May 2 in Big Data Hadoop by ravikiran
• 1,460 points
14 views
0 votes
1 answer

Explain to me the relationship between Hadoop and Databases.

Okay, that's a lot of queries together, ...READ MORE

answered May 15 in Big Data Hadoop by ravikiran
• 1,460 points
8 views
0 votes
1 answer

Explain to me the Elasticsearch and Hadoop in a much better manner

I understand your problem, I suggest you download ...READ MORE

answered May 10 in Big Data Hadoop by ravikiran
• 1,460 points
11 views
0 votes
10 answers

What is the difference between Mongodb and Hadoop?

Apart from the similarity that they are ...READ MORE

answered Dec 6, 2018 in Big Data Hadoop by Deeraj
1,822 views
0 votes
12 answers

What is the difference between Hadoop/HDFS & HBase?

HDFS is a distributed file system whereas ...READ MORE

answered Apr 26 in Big Data Hadoop by Arihar
• 160 points
4,129 views
0 votes
1 answer
0 votes
1 answer

Explain to me the difference between HBase and HDFS.

Hadoop generally consists of three major components: HDFS It ...READ MORE

answered Apr 12 in Big Data Hadoop by ravikiran
• 1,460 points
66 views
0 votes
1 answer

Explain to me the difference between name node and secondary name node.

Firstly, You need to understand the major ...READ MORE

answered Apr 30 in Big Data Hadoop by ravikiran
• 1,460 points
15 views

© 2018 Brain4ce Education Solutions Pvt. Ltd. All rights Reserved.
"PMP®","PMI®", "PMI-ACP®" and "PMBOK®" are registered marks of the Project Management Institute, Inc. MongoDB®, Mongo and the leaf logo are the registered trademarks of MongoDB, Inc.