Explain to me the difference between HBase and HDFS.

0 votes
I need to learn the difference between HDFS and HBase in more daetail
Apr 12, 2019 in Big Data Hadoop by nitinrawat895
• 10,920 points
322 views

1 answer to this question.

0 votes

Hadoop generally consists of three major components:

HDFS

  • It is a file system in Hadoop that allows you to store big-data in it and apply your business logic. 

MapReduce

  • MapReduce is a java program which allows you to apply your business logic through mapper and reducer and process the data in a distributed fashion.

Yarn

  • It acts as an intermediate manager between HDFS and Yarn when it comes to the requirement for resources like memory and processor units.

The only issue with the HDFS was it could not process data when it is not in a sequential manner and it lacks Random read-write option is unavailable in the HDFS. This when the HBase got into the picture.

Hadoop

  • Real-time streaming is impossible since it needs time to load the data to HDFS to carry out further operations.
  • Follows write once and read many times principle.
  • Designed to gain access for streaming data.

HBase

  • Stores key/value pairs in columnar fashion (columns are clubbed together as column families).
  • Provides low latency access to small amounts of data from within a large data set.
  • Provides flexible data model.
Hadoop is a batch processing tool hence it can never be used for real-time data Processing.
Image result for difference between hdfs and hbase
answered Apr 12, 2019 by ravikiran
• 4,600 points

Related Questions In Big Data Hadoop

+1 vote
1 answer

Explain to me the difference between HBase and HDFS

Let's start from scratch. Hadoop basically consists of three ...READ MORE

answered Mar 19, 2019 in Big Data Hadoop by nitinrawat895
• 10,920 points
165 views
+1 vote
1 answer

Explain to me the difference between Name node and Secondary name node

First of all, let understand what is ...READ MORE

answered May 14, 2019 in Big Data Hadoop by ravikiran
• 4,600 points
216 views
0 votes
2 answers

Explain to me the difference between Name Node and Secondary Name Node

Secondary namenode is just a helper for ...READ MORE

answered Aug 6, 2019 in Big Data Hadoop by Dhiman
244 views
0 votes
1 answer

Explain to me the method to transfer data between Azure tables and Hadoop on Azure

this article on HiveStorageHandler will let you create ...READ MORE

answered May 2, 2019 in Big Data Hadoop by ravikiran
• 4,600 points
61 views
0 votes
1 answer
+1 vote
1 answer

Hadoop Mapreduce word count Program

Firstly you need to understand the concept ...READ MORE

answered Mar 16, 2018 in Data Analytics by nitinrawat895
• 10,920 points
5,338 views
+1 vote
11 answers

hadoop fs -put command?

put syntax: put <localSrc> <dest> copy syntax: copyF ...READ MORE

answered Dec 7, 2018 in Big Data Hadoop by Aditya
32,624 views
–1 vote
1 answer

Hadoop dfs -ls command?

In your case there is no difference ...READ MORE

answered Mar 16, 2018 in Big Data Hadoop by kurt_cobain
• 9,310 points
2,008 views
0 votes
1 answer

Explain to me the difference between HadoopHDFS and HBase.

Let me explain the major differences between ...READ MORE

answered May 13, 2019 in Big Data Hadoop by ravikiran
• 4,600 points

edited May 14, 2019 by Omkar 56 views
0 votes
1 answer

Explain to me the difference between name node and secondary name node.

Firstly, You need to understand the major ...READ MORE

answered Apr 30, 2019 in Big Data Hadoop by ravikiran
• 4,600 points
147 views