What is the difference between Hadoop/HDFS & HBase?

0 votes

I am new to Hadoop as well as NoSQL, and I am very confused about HDFS and HBase. Can anyone help me figure out the differences between HBase and Hadoop?
 

Mar 21, 2018 in Big Data Hadoop by nitinrawat895
• 9,070 points
4,209 views

12 answers to this question.

0 votes

HDFS is a Java-based distributed file system that lets you store large data sets across multiple nodes in a Hadoop cluster, whereas HBase is a NoSQL database that runs on top of HDFS. The relationship is similar to that between NTFS and MySQL: one is a file system, the other is a database that keeps its data on a file system.

Both HDFS and HBase can store all kinds of data (structured, semi-structured and unstructured) in a distributed environment.

Differences between HDFS & HBase

  • HBase provides low-latency access to small amounts of data within large data sets, while HDFS is built for high-latency batch operations.
  • HBase supports random reads and writes, while HDFS follows a WORM model (write once, read many times).
  • HDFS is primarily accessed through MapReduce jobs, while HBase is accessed through shell commands or through the Java, REST, Avro or Thrift APIs.
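
To make the access-pattern difference concrete, here is a minimal sketch in plain Python. It does not use any Hadoop or HBase API; `WriteOnceFile` and `KeyValueTable` are hypothetical toy classes invented for illustration, and real HDFS and HBase are far more involved.

```python
class WriteOnceFile:
    """Toy stand-in for an HDFS-style file: written once, then read-only."""

    def __init__(self):
        self._records = []
        self._closed = False

    def append(self, record):
        if self._closed:
            raise PermissionError("file is closed; contents cannot be modified")
        self._records.append(record)

    def close(self):
        self._closed = True

    def scan(self):
        # Reads are sequential: finding one record means walking the file.
        return iter(self._records)


class KeyValueTable:
    """Toy stand-in for an HBase-style table: random reads and writes by row key."""

    def __init__(self):
        self._rows = {}

    def put(self, row_key, value):
        self._rows[row_key] = value  # overwrites any earlier value for this key

    def get(self, row_key):
        return self._rows.get(row_key)


f = WriteOnceFile()
f.append(("user1", "alice"))
f.append(("user2", "bob"))
f.close()                    # from here on the file can only be read

t = KeyValueTable()
t.put("user1", "alice")
t.put("user1", "alicia")     # update a single row by key
```

Here `t.get("user1")` returns `"alicia"`, while calling `f.append(...)` after `f.close()` raises `PermissionError`; looking up one user in `f` requires scanning the records in order.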

HDFS stores large data sets in a distributed environment and relies on batch processing of that data.

HBase, by contrast, stores data in a column-oriented manner: the values of each column family are stored together, which makes reads faster and suits real-time processing.
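
As a rough illustration of column-oriented storage, the sketch below keeps one in-memory store per column family, so reading one family never touches the others. This is a simplified model in plain Python, not the HBase API; `stores`, `put` and `read_family` are hypothetical names, and the sketch ignores timestamps, versions and on-disk files.

```python
from collections import defaultdict

# One store per column family; each maps row key -> {qualifier: value}.
stores = defaultdict(dict)


def put(row_key, family, qualifier, value):
    """Write one cell into the store that belongs to its column family."""
    stores[family].setdefault(row_key, {})[qualifier] = value


def read_family(family):
    """Return every row's cells for one column family, sorted by row key."""
    return sorted(stores[family].items())


put("row2", "info", "name", "bob")
put("row1", "info", "name", "alice")
put("row1", "stats", "logins", 7)
```

With this data, `read_family("info")` returns `[("row1", {"name": "alice"}), ("row2", {"name": "bob"})]` and never reads the `stats` cells.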

answered Mar 21, 2018 by Shubham
• 12,270 points

edited Mar 21, 2018 by Shubham
0 votes
answered Aug 24, 2018 by anonymous
0 votes
HDFS is a storage system and HBase is a relational database.
answered Dec 7, 2018 by Anu
HBase is not an RDBMS.
Is it a non-relational database?
0 votes
Hadoop is a set of components/tools. HBase is a part of the Hadoop ecosystem.
answered Dec 7, 2018 by Toni
0 votes
Hadoop is a collection of tools and frameworks used for Big Data. Its ecosystem includes Hive, Pig, Sqoop, etc. HBase is also a component of the Hadoop ecosystem, and it is a database.
answered Dec 10, 2018 by Tori
0 votes
Hadoop is open-source software used for Big Data storage, computation and other Big Data related tasks. It is a collection of tools, where each tool or a combination of them is used to solve tasks related to storing, retrieving or computing on big data.

HBase is a non-relational database used to store and retrieve distributed data on Hadoop. It is one of the most widely used databases for Big Data and is also part of the Hadoop ecosystem.
answered Dec 10, 2018 by Ashwath
0 votes
Hadoop and HBase are both used to store massive amounts of data. The difference is that in the Hadoop Distributed File System (HDFS), data is stored as files distributed across different nodes on the network, whereas HBase is a database that stores data in the form of rows and columns in a table.
answered Dec 10, 2018 by Bheesh
0 votes
Hadoop is a software collection that is mainly used by people or companies who deal with Big Data. It handles big data and is responsible for efficient storage and fast computation. One of the main concepts of Hadoop is MapReduce (Mapping + Reducing), which is used to distribute the processing of the data stored in Hadoop storage across the cluster.
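
The map and reduce steps can be sketched with a word count in plain Python. This runs in a single process and does not use the Hadoop MapReduce API; `map_phase`, `shuffle` and `reduce_phase` are hypothetical names chosen for illustration. In a real cluster, the map and reduce calls would run in parallel on different nodes.

```python
from collections import defaultdict


def map_phase(line):
    """Emit a (word, 1) pair for every word in one input line."""
    return [(word.lower(), 1) for word in line.split()]


def shuffle(pairs):
    """Group values by key, as the framework does between map and reduce."""
    grouped = defaultdict(list)
    for key, value in pairs:
        grouped[key].append(value)
    return grouped


def reduce_phase(key, values):
    """Combine all values for one key into a single count."""
    return key, sum(values)


lines = ["big data on Hadoop", "Hadoop stores big files"]
pairs = [pair for line in lines for pair in map_phase(line)]
counts = dict(reduce_phase(k, v) for k, v in shuffle(pairs).items())
```

For these two lines, `counts["hadoop"]` and `counts["big"]` are both `2`, and every other word has a count of `1`.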

HBase is a NoSQL database, which means it stores data in tables without requiring relations between those tables.
answered Dec 10, 2018 by Karna
0 votes
Hadoop is a framework with two main components: HDFS (Hadoop Distributed File System) for storage and MapReduce for processing. In HDFS, data is stored in a distributed fashion across different nodes. HBase (Hadoop Database) is a NoSQL database where data is stored in a structured format.
answered Dec 10, 2018 by Shekara
0 votes
HDFS does not support in-place changes to stored data, but HBase does. In HDFS, a file is not modified once it has been written, whereas in HBase individual rows can be inserted, updated or deleted at any time.
answered Dec 10, 2018 by Bhujan
+1 vote
Useful information here....
answered Feb 22 by Mayur Gupta
+1 vote
HDFS is a distributed file system, whereas HBase is a NoSQL database. A distributed file system means that the data is split up and stored and processed on different machines (a cluster).
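
A rough sketch of what "split up across machines" means, in plain Python: a file is cut into fixed-size blocks and each block is assigned to several nodes. The function names, node names and round-robin placement are assumptions for illustration only; real HDFS uses much larger blocks (128 MB by default in recent versions), three replicas by default, and a rack-aware placement policy.

```python
def split_into_blocks(data: bytes, block_size: int):
    """Cut the data into consecutive blocks of at most block_size bytes."""
    return [data[i:i + block_size] for i in range(0, len(data), block_size)]


def place_blocks(blocks, nodes, replication=2):
    """Assign each block to `replication` nodes, round-robin.

    Assumes replication <= len(nodes), so the nodes for a block are distinct.
    """
    placement = {}
    for index in range(len(blocks)):
        placement[index] = [nodes[(index + r) % len(nodes)] for r in range(replication)]
    return placement


blocks = split_into_blocks(b"0123456789", block_size=4)
placement = place_blocks(blocks, nodes=["node-a", "node-b", "node-c"])
```

Here the ten bytes become three blocks (`b"0123"`, `b"4567"`, `b"89"`), and block 0 is placed on `node-a` and `node-b`, so losing one node does not lose the block.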
answered Apr 26 by Arihar
• 160 points
Hey @Arihar. Can you tell me which is better for big data, HBase or HDFS? I want it for a project.
