What is Hive? Is Hive a database?

0 votes
I am new to Hive. I found it similar to RDBMS like tables, joins, partitions. According to my understanding Hive uses HDFS for storing data and it provides SQL abstraction over HDFS. Is Hive a database over HDFS like HBase, or is it a querying tool over HDFS.

But I doubt that Hive is a query language, as it has tables, joins & partitions.
Mar 15, 2018 in Big Data Hadoop by Shubham
• 12,270 points
3,300 views

2 answers to this question.

Your answer

Your name to display (optional):
Privacy: Your email address will only be used for sending these notifications.
0 votes

No, we cannot call Apache Hive a relational database, as it is a data warehouse which is built on top of Apache Hadoop for providing data summarization, query and, analysis. It differs from a relational database in a way that it stores schema in a database and processed data into HDFS. 

For processing, Hive provides a SQL-like interface to query data stored in various databases and file systems that integrate with Hadoop. It supports queries expressed in a language called HiveQL, which automatically translates SQL-like queries into MapReduce jobs executed on Hadoop. 

Hive is read-based and therefore not support transaction processing that typically involves a high percentage of write operations. It is best suited for batch jobs like weblog processing and is designed for OLAP workloads.

answered Mar 15, 2018 by nitinrawat895
• 9,070 points

Hi here you mentioned "stores schema in a database", what the database can be like SQL server etc..,?

Hi @Sai.

By default, the schema is stored in Derby. But it is possible to change it to MySql or PostgreSql.
0 votes

Hey,

HIVE:- Hive is an ETL (extract, transform, load) and data warehouse tool developed on the top of the Hadoop Distributed File System. In Hive, tables and databases are created first and then the data is loaded into these tables. Hive as data warehouse is designed only for managing and querying only the structured data that is stored in the table.

The main difference in HiveQL and SQL is the hive query executes on Hadoop's infrastructure rather than the traditional database. The Hive query execution is like a series of automatically generated Map Reduce jobs

By using Hive, we can achieve some peculiar functionality that is not achieved in the relational database. For a huge amount of data that is in peta-bytes, querying it and getting results in seconds is important, and hive does is quite efficient, it processes the query fast and produce results in seconds.

answered May 8 by Gitika
• 8,620 points

Related Questions In Big Data Hadoop

0 votes
1 answer

What is a importance of Hive ODBC Connector

The Cloudera ODBC Driver for Hive enables ...READ MORE

answered Apr 10, 2018 in Big Data Hadoop by kurt_cobain
• 9,260 points
55 views
0 votes
1 answer
0 votes
1 answer

What is a container in YARN?

A container basically represents a resource on ...READ MORE

answered Apr 9, 2018 in Big Data Hadoop by kurt_cobain
• 9,260 points
590 views
0 votes
1 answer
0 votes
1 answer

Hadoop Mapreduce word count Program

Firstly you need to understand the concept ...READ MORE

answered Mar 16, 2018 in Data Analytics by nitinrawat895
• 9,070 points
1,684 views
0 votes
10 answers

hadoop fs -put command?

copy command can be used to copy files ...READ MORE

answered Dec 7, 2018 in Big Data Hadoop by Sujay
8,183 views
0 votes
1 answer

Hadoop dfs -ls command?

In your case there is no difference ...READ MORE

answered Mar 16, 2018 in Big Data Hadoop by kurt_cobain
• 9,260 points
578 views
0 votes
1 answer
0 votes
1 answer
0 votes
1 answer

What metadata is stored on a DataNode when a block is written to it?

Let me explain you step by step.  Each ...READ MORE

answered Jul 23, 2018 in Big Data Hadoop by nitinrawat895
• 9,070 points
98 views

© 2018 Brain4ce Education Solutions Pvt. Ltd. All rights Reserved.
"PMP®","PMI®", "PMI-ACP®" and "PMBOK®" are registered marks of the Project Management Institute, Inc. MongoDB®, Mongo and the leaf logo are the registered trademarks of MongoDB, Inc.