Hive and Yarn Examples on Spark

Apache Spark and Scala (25 Blogs) Become a Certified Professional

We have learnt how to Build Hive and Yarn on Spark. Now let us try out Hive and Yarn examples on Spark.

Hive Example on Spark

We will run an example of Hive on Spark. We will create a table, load data in that table and execute a simple query. When working with Hive, one must construct a HiveContext which inherits from SQLContext.

Command: cd spark-1.1.1

Command: ./bin/spark-shell

Create an input file ‘sample’ in your home directory as below snapshot (tab separated).

Command: val sqlContext = new org.apache.spark.sql.hive.HiveContext(sc)

Command: sqlContext.sql(“CREATE TABLE IF NOT EXISTS test (name STRING, rank INT) ROW FORMAT DELIMITED FIELDS TERMINATED BY ‘ ‘ LINES TERMINATED BY ‘
‘”)

Command: sqlContext.sql(“LOAD DATA LOCAL INPATH ‘/home/edureka/sample’ INTO TABLE test”)

Command: sqlContext.sql(“SELECT * FROM test WHERE rank < 5”).collect().foreach(println)

Yarn Example on Spark

We will run SparkPi example on Yarn. We can deploy Yarn on Spark in two modes : cluster mode and client mode. In yarn-cluster mode, the Spark driver runs inside an application master process which is managed by Yarn on the cluster, and the client can go away after initiating the application. In yarn-client mode, the driver runs in the client process, and the application master is only used for requesting resources from Yarn.

Command: cd spark-1.1.1

Command: SPARK_JAR=./assembly/target/scala-2.10/spark-assembly-1.1.1-hadoop2.2.0.jar ./bin/spark-submit –master yarn –deploy-mode cluster –class org.apache.spark.examples.SparkPi –num-executors 1 –driver-memory 2g –executor-memory 1g –executor-cores 1 examples/target/scala-2.10/spark-examples-1.1.1-hadoop2.2.0.jar

After you execute the above command, please wait for sometime till you get SUCCEEDED message.

Browse localhost:8088/cluster and click on the Spark application.

Click on logs.

Click on stdout to check the output.

For deploying Yarn on Spark in client mode, just make –deploy-mode as “client”. Now, you know how to build Hive and Yarn on Spark. We also did practicals on them.

Got a question for us? Please mention them in the comments section and we will get back to you.

Apache Spark with Hadoop-Why it matters?

Hive and Yarn Examples on Spark

Hive Example on Spark

Yarn Example on Spark

Recommended videos for you

MapReduce Design Patterns – Application of Join Pattern

5 Scenarios: When To Use & When Not to Use Hadoop

What Is Hadoop – All You Need To Know About Hadoop

HBase Tutorial – A Complete Guide On Apache HBase

Hadoop Architecture – Hadoop Tutorial on HDFS Architecture

Hadoop Tutorial – A Complete Tutorial For Hadoop

Hadoop Cluster With High Availability

Secure Your Hadoop Cluster With Kerberos

Big Data Tutorial – Get Started With Big Data And Hadoop

Reduce Side Joins With MapReduce

Tailored Big Data Solutions Using MapReduce Design Patterns

Introduction to Apache Solr-1

When not to use Hadoop

Hive Tutorial – Understanding Hive In Depth

Hadoop for Java Professionals

Is Hadoop A Necessity For Data Science?

MapReduce Tutorial – All You Need To Know About MapReduce

Hadoop-A Highly Available And Secure Enterprise Data Warehousing Solution

Big Data Processing with Spark and Scala

What is Apache Storm all about?

Recommended blogs for you

DynamoDB vs MongoDB: Which One Meets Your Business Needs Better?

Running Scala Application In Eclipse IDE Using Sbteclipse

Scala Functional Programming

Setting Up A Multi Node Cluster In Hadoop 2.X

HDFS Tutorial: Introduction to HDFS & its Features

How Predictive Analysis can Help you Combat Employee Attrition

Introduction to Hadoop 2.0 and Advantages of Hadoop 2.0 over 1.0

Most Important Scala Interview Questions to Prepare in 2025

Jupyter Notebook Cheat Sheet : A Beginner’s Guide to Jupyter Notebook

Pig Programming: Apache Pig Script with UDF in HDFS Mode

Why Should a Mainframe Professional Move to Big Data and Hadoop?

What is CCA-175 Spark and Hadoop Developer Certification?

Hadoop Components that you Need to know about

Transfer files from Windows to Cloudera Demo VM

Azure Data Factory Vs Databricks

DBInputFormat to Transfer Data From SQL to NoSQL Database

What is integration runtime in Azure data factory?

Demystifying Partitioning in Spark

Hadoop Learners’ Profile

Commissioning and Decommissioning Nodes in a Hadoop Cluster

Join the discussionCancel reply

Trending Courses in Big Data

PySpark Certification Training Course

Applied Data Engineering on Azure Cloud Cours ...

Apache Kafka Certification Training Course

Big Data Hadoop Certification Training Course

Apache Spark and Scala Certification Training ...

Big Data Hadoop Administration Certification ...

Splunk Certification Training: Power User and ...

ELK Stack Training & Certification

Apache Storm Certification Training

Comprehensive MapReduce Certification Trainin ...

Browse Categories

Subscribe to our Newsletter, and get personalized recommendations.

Hive and Yarn Examples on Spark