How to run Hive scripts

Apache Hive, a data warehousing package built on top of Hadoop, is increasingly used for data analysis, data mining and predictive modeling. Organizations are looking for professionals with a firm grasp of Hive and Hadoop. In this post, let’s look at how to run Hive scripts. In general, scripts are used to execute a set of statements at once, and Hive scripts work in much the same way: they save the time and effort of writing and executing each command manually.

Hive scripts are supported in Hive 0.10.0 and later versions. Because CDH3 ships with Hive 0.9.0, Hive scripts cannot be run in CDH3. You can try the steps below in CDH4, which has Hive 0.10.0 installed.

Now, let us see how to write the scripts in Hive and run them in CDH4:

Step 1: Writing a Hive script.

By convention, a Hive script is saved in a file with the .sql extension. Open a terminal in your Cloudera CDH4 distribution and run the following command to create a Hive script.
Command: sudo gedit sample.sql

Executing the above command opens the file in an editor, where you enter the list of Hive commands that need to be executed.

In this script, a table will be created and described, and data will be loaded into it and then retrieved from it.

1. Creating the Table in Hive:

Command: create table product (productid int, productname string, price float, category string) row format delimited fields terminated by ',';

Here, product is the table name and { productid, productname, price, category} are the columns of this table.

The clause fields terminated by ',' indicates that the columns in the input file are separated by commas.

By default the records in the input file are separated by a new line.

2. Describing the Table:

Command: describe product;

3. Loading the Data into the Table.

To load the data into the table, we first need to create an input file containing the records to be inserted into the table.

Let us create an input file.

Command: sudo gedit input.txt

Edit the contents in the file as shown in the figure.
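The figure with the file contents and the load statement itself are not reproduced in the text, so the following is only a minimal sketch under assumptions. The four sample records are made up, input.txt is written to the current directory, and the load statement assumes the file ends up at /home/cloudera/input.txt (the same directory this post later uses for sample.sql). The local keyword tells Hive to read the file from the local file system rather than from HDFS.

```shell
# Create a comma-separated input file. Each line is one record in the
# column order productid, productname, price, category.
# These four records are hypothetical sample data, not from the original post.
cat > input.txt <<'EOF'
1,Laptop,45000.00,Electronics
2,Chair,1500.50,Furniture
3,Notebook,40.00,Stationery
4,Headphones,2200.75,Electronics
EOF

# Append the load statement to the Hive script.
# Assumption: the input file is stored at /home/cloudera/input.txt;
# adjust the path to wherever the file actually lives.
cat >> sample.sql <<'EOF'
load data local inpath '/home/cloudera/input.txt' into table product;
EOF
```

Each line of the file must have its fields in the same order as the table's columns, since Hive maps delimited fields to columns by position.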

4. Retrieving the Data:

To retrieve the data, the select command is used.

Command: Select * from product;

The above command retrieves the values of all the columns in the table. The complete script should look like the one shown in the image below.

Now, we are done with writing the Hive script. The file sample.sql can now be saved.
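For reference, here is a sketch of what the complete sample.sql might contain, written out from the shell with a here-document instead of an editor. It collects the four statements from Step 1. The load data line and its /home/cloudera/input.txt path are assumptions, because the original text does not state them.

```shell
# Write the whole Hive script in one go (this overwrites any existing sample.sql).
# The quoted 'EOF' stops the shell from expanding anything inside the script.
# Statements, in order:
#   1. create the table with comma-separated fields
#   2. describe the table
#   3. load the data (assumed path; adjust to where input.txt is stored)
#   4. retrieve all rows
cat > sample.sql <<'EOF'
create table product (productid int, productname string, price float, category string)
row format delimited fields terminated by ',';

describe product;

load data local inpath '/home/cloudera/input.txt' into table product;

select * from product;
EOF
```

Every statement ends with a semicolon, which is how Hive separates the statements in a script file.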

Step 2: Running the Hive Script

The following is the command to run the Hive script:

Command: hive -f /home/cloudera/sample.sql

When executing the script, make sure to give the full path to the script file.
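A mistyped path or a missing hive binary is a common reason for the run to fail, so the command can be wrapped in a small check. This is a sketch only: the function name run_hive_script is hypothetical, and the path is the one used in this post. Note that the option is -f with a plain ASCII hyphen.

```shell
# Run a Hive script after checking that the file and the hive CLI exist.
# run_hive_script is a hypothetical helper name used for illustration.
run_hive_script() {
  script="$1"
  # Fail early if the script path is wrong.
  if [ ! -f "$script" ]; then
    echo "Script not found: $script" >&2
    return 1
  fi
  # Fail early if the hive command is not on the PATH.
  if ! command -v hive >/dev/null 2>&1; then
    echo "hive CLI not found on PATH" >&2
    return 2
  fi
  # -f tells Hive to execute the statements in the given file.
  hive -f "$script"
}

run_hive_script /home/cloudera/sample.sql || echo "Hive script did not run (exit code $?)"
```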

We can see that all the commands are executed successfully.

This is how Hive scripts are run and executed in CDH4.
