What is the difference between Hadoop/HDFS & HBase?

0 votes

I am new to Hadoop as well as NoSQL, and I am confused about HDFS and HBase. Can anyone help me figure out the differences between HBase and Hadoop?
 

Mar 21, 2018 in Big Data Hadoop by nitinrawat895
• 10,730 points
10,952 views

13 answers to this question.

0 votes

HDFS is a Java-based distributed file system that lets you store large data sets across multiple nodes in a Hadoop cluster, whereas HBase is a NoSQL database that runs on top of HDFS. The relationship is loosely similar to that of NTFS and MySQL: one is a file system, the other is a database that keeps its files on a file system.

Both HDFS and HBase store all kinds of data (structured, semi-structured and unstructured) in a distributed environment.

Differences between HDFS & HBase

  • HBase provides low-latency access to small amounts of data within large data sets, while HDFS provides high-latency, batch-oriented operations.
  • HBase supports random reads and writes, while HDFS follows a WORM (write once, read many times) model.
  • HDFS is primarily accessed through MapReduce jobs, while HBase is accessed through shell commands, the Java API, REST, Avro or Thrift.
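
The second point (WORM versus random reads and writes) can be sketched with a small in-memory toy. This is not the HDFS or HBase API: the class names `WriteOnceStore` and `RandomAccessTable`, the path, and the row key are all made up for illustration, and real HDFS additionally allows appending to a file.

```python
class WriteOnceStore:
    """Toy stand-in for an HDFS-like store: a file is written once, then only read."""

    def __init__(self):
        self._files = {}

    def create(self, path, data):
        # Refuse to overwrite: this models the write-once part of WORM.
        if path in self._files:
            raise FileExistsError(f"{path} already exists and cannot be overwritten")
        self._files[path] = bytes(data)

    def read(self, path):
        # Reading is allowed any number of times.
        return self._files[path]


class RandomAccessTable:
    """Toy stand-in for an HBase-like table: cells are read and updated by row key."""

    def __init__(self):
        self._rows = {}

    def put(self, row_key, column, value):
        # Writing the same cell again simply replaces the old value.
        self._rows.setdefault(row_key, {})[column] = value

    def get(self, row_key, column):
        return self._rows[row_key][column]


fs = WriteOnceStore()
fs.create("/logs/2018-03-21.log", b"line 1\nline 2\n")
try:
    fs.create("/logs/2018-03-21.log", b"changed")
    overwrite_allowed = True
except FileExistsError:
    overwrite_allowed = False

table = RandomAccessTable()
table.put("user42", "info:city", "Pune")
table.put("user42", "info:city", "Delhi")  # update a single cell in place
current_city = table.get("user42", "info:city")
```

In the toy, the second `create` call is rejected while the second `put` succeeds, which is the behavioural difference the bullet describes.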

HDFS stores large data sets in a distributed environment and relies on batch processing of that data.

HBase, by contrast, stores data in a column-oriented manner: the values of a column (in HBase terms, of a column family) are stored together, so reads touch less data and are fast enough for real-time processing.
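
As a rough illustration of why a column-oriented layout helps reads, the sketch below pivots a few records from a row layout into a column layout. The sample records and the helper `to_columns` are hypothetical, and the sketch ignores how HBase actually groups columns into column families on disk.

```python
# Row-oriented layout: each record keeps all of its fields together.
rows = [
    {"id": 1, "name": "Asha", "city": "Pune"},
    {"id": 2, "name": "Ravi", "city": "Delhi"},
    {"id": 3, "name": "Mina", "city": "Pune"},
]


def to_columns(records):
    """Pivot a list of records into one list of values per column."""
    columns = {}
    for record in records:
        for name, value in record.items():
            columns.setdefault(name, []).append(value)
    return columns


columns = to_columns(rows)

# Reading one column from the row layout walks every full record ...
cities_from_rows = [record["city"] for record in rows]
# ... while the column layout already holds those values side by side.
cities_from_columns = columns["city"]
```

Both reads return the same values; the difference is how much unrelated data has to be scanned to get them.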

answered Mar 21, 2018 by Shubham
• 13,310 points

edited Mar 21, 2018 by Shubham
0 votes
answered Aug 24, 2018 by anonymous
0 votes
HDFS is a storage system and HBase is a relational database.
answered Dec 7, 2018 by Anu
HBase is not an RDBMS.
Is it a non-relational database?
HBase is a non-relational, column-oriented DBMS.
0 votes
Hadoop is a set of components/tools. HBase is a part of the Hadoop ecosystem.
answered Dec 7, 2018 by Toni
0 votes
Hadoop is a collection of tools and frameworks used for big data. Its ecosystem includes Hive, Pig, Sqoop, etc. HBase is also a component of the Hadoop ecosystem, and it is a database.
answered Dec 10, 2018 by Tori
0 votes
Hadoop is open-source software used for big data storage, computation and related tasks. It is a collection of tools, where each tool, or a combination of them, is used to store, retrieve or process big data.

HBase is a non-relational database used to store and retrieve distributed data on Hadoop. It is one of the most widely used databases for big data and is part of the Hadoop ecosystem.
answered Dec 10, 2018 by Ashwath
0 votes
Hadoop and HBase are both used to store massive amounts of data. The difference is that in the Hadoop Distributed File System (HDFS), data is stored as files distributed across the nodes of the cluster, whereas HBase is a database that stores data in the rows and columns of a table.
answered Dec 10, 2018 by Bheesh
0 votes
Hadoop is a software collection mainly used by people or companies who deal with big data. It handles big data and is responsible for efficient storage and fast computation. One of the main concepts of Hadoop is MapReduce (mapping + reducing), which is used to process the data held in Hadoop storage in a distributed way.
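
To make the map and reduce steps concrete, here is a single-process word count written in the MapReduce style. It is only a sketch of the idea: the function names are made up, nothing here uses the Hadoop API, and a real job would run the map and reduce phases on many nodes in parallel.

```python
from collections import defaultdict


def map_phase(line):
    """Map: emit a (word, 1) pair for every word in one input line."""
    return [(word.lower(), 1) for word in line.split()]


def shuffle(pairs):
    """Shuffle: group all emitted values by their key."""
    grouped = defaultdict(list)
    for key, value in pairs:
        grouped[key].append(value)
    return grouped


def reduce_phase(key, values):
    """Reduce: collapse the values of one key into a single count."""
    return key, sum(values)


def word_count(lines):
    mapped = [pair for line in lines for pair in map_phase(line)]
    grouped = shuffle(mapped)
    return dict(reduce_phase(key, values) for key, values in grouped.items())


counts = word_count(["HBase runs on HDFS", "HDFS stores blocks"])
```

Because each line is mapped independently and each key is reduced independently, both phases can be spread across machines, which is what Hadoop does at scale.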

HBase is a NoSQL database, which means it stores data in a tabular format even when there are no relations between the tables.
answered Dec 10, 2018 by Karna
0 votes
Hadoop is a framework with two main components: HDFS (Hadoop Distributed File System) for storage and MapReduce for processing. In HDFS, data is stored in a distributed fashion across different nodes. HBase (Hadoop Database) is a NoSQL database where data is stored in a structured format.
answered Dec 10, 2018 by Shekara
0 votes
Hadoop storage (HDFS) does not support dynamic storage, but HBase does. In HDFS the architecture of the storage system is fixed, whereas in HBase dynamic changes can be made.
answered Dec 10, 2018 by Bhujan
+1 vote
Useful information here....
answered Feb 22 by Mayur Gupta
+1 vote
HDFS is a distributed file system, whereas HBase is a NoSQL database. A distributed file system means the data is split up and stored and processed on different machines (a cluster).
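
The idea of splitting data across machines can be sketched as follows. This is a toy with assumed names (`split_into_blocks`, `place_blocks`, `reassemble`, and the node labels), a tiny block size, and simple round-robin placement; real HDFS uses much larger blocks, replicates each block, and lets the NameNode decide where blocks go.

```python
def split_into_blocks(data, block_size):
    """Cut a byte string into fixed-size blocks (the last one may be shorter)."""
    return [data[i:i + block_size] for i in range(0, len(data), block_size)]


def place_blocks(blocks, nodes):
    """Assign blocks to nodes round-robin, remembering each block's index."""
    placement = {node: [] for node in nodes}
    for index, block in enumerate(blocks):
        placement[nodes[index % len(nodes)]].append((index, block))
    return placement


def reassemble(placement):
    """Collect blocks from all nodes and join them in index order."""
    indexed = [item for node_blocks in placement.values() for item in node_blocks]
    return b"".join(block for _, block in sorted(indexed))


data = b"0123456789" * 3  # 30 bytes of sample data
blocks = split_into_blocks(data, block_size=8)
placement = place_blocks(blocks, ["node-a", "node-b", "node-c"])
restored = reassemble(placement)
```

No single node holds the whole file, yet the original bytes can be rebuilt from the block indexes.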
answered Apr 26 by Arihar
• 160 points
Hey @Arihar. Can you tell me which is better for big data, HBase or HDFS? I want it for a project.
0 votes
HDFS is a file system; HBase is a non-relational DBMS (SGBD).
answered Jun 10 by Lowe
• 140 points
Hey Lowe, agree with you.

Are you French by any chance? I believe SGBD in English translates to DBMS. :)
Hello Gitika,

I speak French fluently. Thanks for the correction (SGBD = DBMS); sometimes I mix up the two languages :)
Hey Lowe,

Glad to see your efforts.
