What is Hive? Is Hive a database?

0 votes
I am new to Hive. It looks similar to an RDBMS: it has tables, joins, and partitions. As I understand it, Hive stores its data in HDFS and provides a SQL abstraction over it. Is Hive a database on top of HDFS, like HBase, or is it a querying tool over HDFS?

But I doubt that Hive is just a query language, as it has tables, joins & partitions.
Mar 15, 2018 in Big Data Hadoop by Shubham
• 13,350 points
7,705 views

3 answers to this question.

+1 vote

No, we cannot call Apache Hive a relational database. It is a data warehouse built on top of Apache Hadoop that provides data summarization, query, and analysis. It differs from a relational database in that it stores the schema in a separate database (the metastore) and the processed data in HDFS.

For processing, Hive provides a SQL-like interface to query data stored in various databases and file systems that integrate with Hadoop. Queries are written in a language called HiveQL, and Hive automatically translates these SQL-like queries into MapReduce jobs that run on Hadoop.
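To make the translation idea concrete, here is a rough Python sketch of how a query such as `SELECT page, COUNT(*) FROM weblogs GROUP BY page` could be broken into map, shuffle, and reduce steps. This is only a conceptual illustration, not Hive's actual planner; the `weblogs` table, the sample rows, and all function names are made up for the example.

```python
from collections import defaultdict

# Hypothetical weblog rows: (ip, page). Not real data.
ROWS = [
    ("10.0.0.1", "/home"),
    ("10.0.0.2", "/cart"),
    ("10.0.0.1", "/home"),
    ("10.0.0.3", "/home"),
]


def map_phase(rows):
    """Emit one (page, 1) pair per row, as a mapper would for GROUP BY page."""
    for _ip, page in rows:
        yield page, 1


def shuffle_phase(pairs):
    """Group values by key, which the framework does between map and reduce."""
    grouped = defaultdict(list)
    for key, value in pairs:
        grouped[key].append(value)
    return grouped


def reduce_phase(grouped):
    """Sum the values for each key, giving COUNT(*) per page."""
    return {key: sum(values) for key, values in grouped.items()}


def count_by_page(rows):
    """Run the three phases in order on an in-memory list of rows."""
    return reduce_phase(shuffle_phase(map_phase(rows)))


if __name__ == "__main__":
    print(count_by_page(ROWS))
```

In a real cluster the map and reduce steps run in parallel on many nodes over files in HDFS; here everything runs in one process just to show the shape of the job.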

Hive is read-oriented and therefore is not suited to transaction processing, which typically involves a high percentage of write operations. It is best suited for batch jobs like weblog processing and is designed for OLAP workloads.

answered Mar 15, 2018 by nitinrawat895
• 10,800 points

Hi, you mentioned that Hive "stores schema in a database". Which database can that be? Something like SQL Server?

Hi @Sai.

By default, the schema is stored in Derby, but it can be changed to MySQL or PostgreSQL.
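
As a rough illustration of this split between schema and data, the Python sketch below keeps table definitions in a small SQL database and the rows in a plain file, and only combines the two at read time. It is a toy model under stated assumptions: SQLite stands in for the metastore database (Derby, MySQL, etc.), a temporary local CSV file stands in for a file in HDFS, and the table layout and function names are invented for the example and do not match Hive's real metastore schema.

```python
import csv
import os
import sqlite3
import tempfile


def create_metastore():
    """In-memory SQLite database standing in for the metastore (assumption)."""
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE columns (tbl TEXT, pos INTEGER, name TEXT, type TEXT)")
    conn.execute("CREATE TABLE locations (tbl TEXT, path TEXT)")
    return conn


def register_table(conn, tbl, columns, path):
    """Record only the schema and the file location; the data file is not touched."""
    conn.executemany(
        "INSERT INTO columns VALUES (?, ?, ?, ?)",
        [(tbl, i, name, typ) for i, (name, typ) in enumerate(columns)],
    )
    conn.execute("INSERT INTO locations VALUES (?, ?)", (tbl, path))


def read_table(conn, tbl):
    """Look up the schema, then apply it to the raw file while reading."""
    cols = conn.execute(
        "SELECT name, type FROM columns WHERE tbl = ? ORDER BY pos", (tbl,)
    ).fetchall()
    (path,) = conn.execute(
        "SELECT path FROM locations WHERE tbl = ?", (tbl,)
    ).fetchone()
    casts = {"int": int, "string": str}
    with open(path, newline="") as f:
        return [
            {name: casts[typ](value) for (name, typ), value in zip(cols, row)}
            for row in csv.reader(f)
        ]


def demo():
    """Write a small data file, register it, and read it back through the schema."""
    # A plain local file playing the role of data stored in HDFS.
    with tempfile.NamedTemporaryFile(
        "w", suffix=".csv", delete=False, newline=""
    ) as f:
        f.write("1,alice\n2,bob\n")
        path = f.name
    conn = create_metastore()
    try:
        register_table(conn, "users", [("id", "int"), ("name", "string")], path)
        return read_table(conn, "users")
    finally:
        conn.close()
        os.remove(path)


if __name__ == "__main__":
    print(demo())
```

The point of the sketch is that swapping the metastore database does not move or change the data files; only the place where the table definitions live changes.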
0 votes

Hey,

Hive: Hive is an ETL (extract, transform, load) and data warehouse tool built on top of the Hadoop Distributed File System. In Hive, tables and databases are created first, and then the data is loaded into these tables. As a data warehouse, Hive is designed for managing and querying only structured data stored in tables.

The main difference between HiveQL and SQL is that a Hive query executes on Hadoop's infrastructure rather than on a traditional database. A Hive query runs as a series of automatically generated MapReduce jobs.

By using Hive, we can achieve functionality that is hard to achieve in a relational database. For very large amounts of data, on the order of petabytes, getting query results quickly is important, and Hive handles this efficiently by running the query in parallel across the Hadoop cluster.

answered May 8 by Gitika
• 25,420 points
+1 vote
Hive is a data warehouse infrastructure/system built on top of Hadoop for querying and analyzing structured data residing in HDFS.

Hope this answers your question.
answered Jul 1 by Ved Gupta
