Explain to me the difference between HBase and HDFS

+1 vote
Explain to me the difference between HBase and HDFS. Since the functionality of both looks to be almost the same, I am getting confused between both. The only difference I found in my research is that HDFS can work with any kind of data but HBase need structured data to work with. help me to better understand this concept
Mar 19, 2019 in Big Data Hadoop by ravikiran
• 4,620 points
1,093 views

1 answer to this question.

+1 vote

Let's start from scratch.

Hadoop basically consists of three components

  1. HDFS(Hadoop Distributed File System)
  2. MapReduce
  3. YARN(Yet Another Resource Negotiator)

HDFS, the name explains it all. It is a distributed file system which stores data in commodity hardware. HDFS can store any type of data regardless of Structured, Unstructured and Semi-Structured data. It provides data in a better manner but ends up replicating the data. Being just a File-System it stores data in flat files and HDFS also lacks random Read-Write capabilities.

  • It can boost up the speed for accessing Big-Data
  • It follows the slogan of "Write once, Read Many"
  • Lacks random Read-Write capabilities
MapReduce is a framework used to compute and process Big-Data. Unlike HDFS, MapReduce can access data randomly but HDFS was proven to be good for sequential data accessing. so, this when HBase comes into the picture.
  • HBase stores data in terms of Key-Value pair
  • Low latency in data accessing regardless of the size of the data file in which it needs to search the needed data
  • Flexibility in Data Model
YARN acts like a manager between HDFS and MapReduce.
Hadoop is used for Batch-Processing and HBase is used in Real-Time needs.
answered Mar 19, 2019 by nitinrawat895
• 11,380 points

Related Questions In Big Data Hadoop

0 votes
1 answer

Explain to me the difference between HBase and HDFS.

Hadoop generally consists of three major components: HDFS It ...READ MORE

answered Apr 12, 2019 in Big Data Hadoop by ravikiran
• 4,620 points
2,424 views
0 votes
1 answer

Explain to me the difference between HadoopHDFS and HBase.

Let me explain the major differences between ...READ MORE

answered May 13, 2019 in Big Data Hadoop by ravikiran
• 4,620 points

edited May 14, 2019 by Omkar 577 views
0 votes
1 answer

Explain to me the difference between name node and secondary name node.

Firstly, You need to understand the major ...READ MORE

answered Apr 30, 2019 in Big Data Hadoop by ravikiran
• 4,620 points
1,619 views
+1 vote
1 answer

Explain to me the difference between Name node and Secondary name node

First of all, let understand what is ...READ MORE

answered May 15, 2019 in Big Data Hadoop by ravikiran
• 4,620 points
1,788 views
0 votes
1 answer
+1 vote
1 answer

Hadoop Mapreduce word count Program

Firstly you need to understand the concept ...READ MORE

answered Mar 16, 2018 in Data Analytics by nitinrawat895
• 11,380 points
11,059 views
0 votes
1 answer

hadoop.mapred vs hadoop.mapreduce?

org.apache.hadoop.mapred is the Old API  org.apache.hadoop.mapreduce is the ...READ MORE

answered Mar 16, 2018 in Data Analytics by nitinrawat895
• 11,380 points
2,562 views
+2 votes
11 answers

hadoop fs -put command?

Hi, You can create one directory in HDFS ...READ MORE

answered Mar 16, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
108,991 views
0 votes
1 answer
0 votes
2 answers

Difference between HDFS and NFS

In NFS, the data is stored only ...READ MORE

answered Feb 4, 2019 in Big Data Hadoop by Kumar
15,039 views
webinar REGISTER FOR FREE WEBINAR X
REGISTER NOW
webinar_success Thank you for registering Join Edureka Meetup community for 100+ Free Webinars each month JOIN MEETUP GROUP