What is Hive Is Hive a database

0 votes
I am new to Hive. I found it similar to RDBMS like tables, joins, partitions. According to my understanding Hive uses HDFS for storing data and it provides SQL abstraction over HDFS. Is Hive a database over HDFS like HBase, or is it a querying tool over HDFS.

But I doubt that Hive is a query language, as it has tables, joins & partitions.
Mar 16, 2018 in Big Data Hadoop by Shubham
• 13,490 points
22,890 views

3 answers to this question.

+1 vote

No, we cannot call Apache Hive a relational database, as it is a data warehouse which is built on top of Apache Hadoop for providing data summarization, query and, analysis. It differs from a relational database in a way that it stores schema in a database and processed data into HDFS. 

For processing, Hive provides a SQL-like interface to query data stored in various databases and file systems that integrate with Hadoop. It supports queries expressed in a language called HiveQL, which automatically translates SQL-like queries into MapReduce jobs executed on Hadoop. 

Hive is read-based and therefore not support transaction processing that typically involves a high percentage of write operations. It is best suited for batch jobs like weblog processing and is designed for OLAP workloads.

answered Mar 16, 2018 by nitinrawat895
• 11,380 points

Hi here you mentioned "stores schema in a database", what the database can be like SQL server etc..,?

Hi @Sai.

By default, the schema is stored in Derby. But it is possible to change it to MySql or PostgreSql.
0 votes

Hey,

HIVE:- Hive is an ETL (extract, transform, load) and data warehouse tool developed on the top of the Hadoop Distributed File System. In Hive, tables and databases are created first and then the data is loaded into these tables. Hive as data warehouse is designed only for managing and querying only the structured data that is stored in the table.

The main difference in HiveQL and SQL is the hive query executes on Hadoop's infrastructure rather than the traditional database. The Hive query execution is like a series of automatically generated Map Reduce jobs

By using Hive, we can achieve some peculiar functionality that is not achieved in the relational database. For a huge amount of data that is in peta-bytes, querying it and getting results in seconds is important, and hive does is quite efficient, it processes the query fast and produce results in seconds.

answered May 8, 2019 by Gitika
• 65,910 points
+1 vote
Hive is a data Warehouse infrastructure/system built on top of Hadoop for querying and analyzing structured data residing in HDFS.

Hope this answers your question.
answered Jul 1, 2019 by Ved Gupta

Related Questions In Big Data Hadoop

0 votes
1 answer

What is a importance of Hive ODBC Connector

The Cloudera ODBC Driver for Hive enables ...READ MORE

answered Apr 10, 2018 in Big Data Hadoop by kurt_cobain
• 9,390 points
646 views
0 votes
1 answer

What is the command to know the details of your data created in a table in Hive?

Hey, Yes, there is a way to check ...READ MORE

answered May 15, 2019 in Big Data Hadoop by Gitika
• 65,910 points
1,258 views
0 votes
1 answer

What is the significance of ‘IF EXISTS” clause while dropping a table in Hive?

Hi, Since the table being dropped does not ...READ MORE

answered Jun 26, 2019 in Big Data Hadoop by Gitika
• 65,910 points
2,645 views
0 votes
0 answers

What is the difference between partitioning and bucketing a table in Hive ?

Hi Team, I am new to Hive. I ...READ MORE

Dec 20, 2020 in Big Data Hadoop by akhtar
• 38,230 points
1,211 views
+1 vote
1 answer

Hadoop Mapreduce word count Program

Firstly you need to understand the concept ...READ MORE

answered Mar 16, 2018 in Data Analytics by nitinrawat895
• 11,380 points
10,557 views
+2 votes
11 answers

hadoop fs -put command?

Hi, You can create one directory in HDFS ...READ MORE

answered Mar 16, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
104,207 views
–1 vote
1 answer

Hadoop dfs -ls command?

In your case there is no difference ...READ MORE

answered Mar 16, 2018 in Big Data Hadoop by kurt_cobain
• 9,390 points
4,260 views
0 votes
1 answer
0 votes
1 answer
0 votes
1 answer

What metadata is stored on a DataNode when a block is written to it?

Let me explain you step by step.  Each ...READ MORE

answered Jul 23, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
2,725 views
webinar REGISTER FOR FREE WEBINAR X
REGISTER NOW
webinar_success Thank you for registering Join Edureka Meetup community for 100+ Free Webinars each month JOIN MEETUP GROUP