Drilling Down On Apache Drill, the New-Age Query Engine Part 2

In this second Apache Drill blog post, we will learn how to integrate Hive and HBase with Apache Drill. Apache Drill provides built-in storage plugins for Hive and HBase. We just need to edit the configuration of each storage plugin and enable it in Drill.

Check out the first Apache Drill blog in the series here.

First, let us see how to connect Apache Drill to Hive using the Hive storage plugin.

I have a Hive table named customer that contains some data. We will now connect Hive with Drill and access the same table from Drill.

Start Drillbit service

Command: ./bin/drillbit.sh start

Start Drill shell

Command: ./bin/drill-conf

Next, open the Drill Web UI in your browser at localhost:8047 and go to the Storage tab.

You will see Hive listed under the disabled storage plugins. Click on Update for Hive.

Edit the configuration to match your existing Hive settings, which are available in hive-site.xml.
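As a rough illustration of that edit, the Python sketch below reads a few properties from a hive-site.xml document and places them into the JSON shape used by Drill's Hive storage plugin (type, enabled, configProps). The sample XML, the metastore URI, the choice of properties, and the helper name hive_plugin_config are all assumptions made for illustration; use the values from your own hive-site.xml.

```python
import json
import xml.etree.ElementTree as ET

# Hypothetical hive-site.xml content; replace it with the contents of your own file.
SAMPLE_HIVE_SITE = """
<configuration>
  <property>
    <name>hive.metastore.uris</name>
    <value>thrift://localhost:9083</value>
  </property>
  <property>
    <name>hive.metastore.warehouse.dir</name>
    <value>/user/hive/warehouse</value>
  </property>
</configuration>
"""

# Properties copied into the plugin configuration when present (an assumed selection).
WANTED = ("hive.metastore.uris", "hive.metastore.warehouse.dir", "fs.default.name")


def hive_plugin_config(hive_site_xml, wanted=WANTED):
    """Build a Hive storage plugin configuration dict from hive-site.xml text."""
    root = ET.fromstring(hive_site_xml.strip())
    props = {}
    for prop in root.findall("property"):
        name = prop.findtext("name")
        if name in wanted:
            props[name] = prop.findtext("value")
    return {"type": "hive", "enabled": True, "configProps": props}


config = hive_plugin_config(SAMPLE_HIVE_SITE)
print(json.dumps(config, indent=2))
```

The printed JSON can be compared with, or pasted into, the configuration box that opens when you click Update for Hive.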

After editing, click on Enable.

You will now find Hive under the enabled storage plugins.

Run the command below in the Drill shell to access Hive from Drill.

Command: use hive;

You can now access the customer table from Drill. You have successfully integrated Hive with Drill.

Command: select * from customer;
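The same query can also be submitted without the shell, through the REST endpoint that the Drill web server exposes on the same port as the UI. The sketch below is a minimal example under assumptions: it assumes the default address localhost:8047, a Hive plugin named hive, and no authentication; the helper names build_query_request and run_query are hypothetical. It only builds and prints the request; call run_query when a Drillbit is actually running.

```python
import json
import urllib.request

# Assumes the default web port used for the Drill UI above.
DRILL_URL = "http://localhost:8047/query.json"


def build_query_request(sql, url=DRILL_URL):
    """Create (but do not send) a POST request for Drill's REST query endpoint."""
    payload = json.dumps({"queryType": "SQL", "query": sql}).encode("utf-8")
    return urllib.request.Request(
        url,
        data=payload,
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def run_query(sql, url=DRILL_URL):
    """Send the query and return the result rows; requires a running Drillbit."""
    with urllib.request.urlopen(build_query_request(sql, url)) as response:
        return json.load(response)["rows"]


# Qualify the table with the plugin name, since a REST call has no prior "use hive;".
request = build_query_request("SELECT * FROM hive.customer LIMIT 10")
print(request.get_method(), request.full_url)
print(request.data.decode("utf-8"))
```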

Next, to enable the HBase storage plugin, we follow the same steps as for Hive. Click on Update for HBase and edit the configuration to match your HBase setup.

Click on Enable after editing the configuration.
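For reference, the HBase plugin configuration is mostly a matter of pointing Drill at the ZooKeeper quorum that HBase uses. The sketch below prints a configuration of that shape; the quorum host localhost, the client port 2181, and the helper name hbase_plugin_config are assumptions for a single-node setup, so substitute the values from your own hbase-site.xml.

```python
import json


def hbase_plugin_config(zookeeper_quorum, client_port=2181):
    """Build an HBase storage plugin configuration dict for the given ZooKeeper quorum."""
    return {
        "type": "hbase",
        "enabled": True,
        "config": {
            # Comma-separated ZooKeeper hosts that the HBase cluster registers with.
            "hbase.zookeeper.quorum": zookeeper_quorum,
            # Stored as a string, like other Hadoop-style property values.
            "hbase.zookeeper.property.clientPort": str(client_port),
        },
    }


hbase_config = hbase_plugin_config("localhost")
print(json.dumps(hbase_config, indent=2))
```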

I have a “students” table in HBase that contains some data. Let us see whether we can now access it through Drill.

Run the command below in Drill to use the HBase storage plugin.

Command: use hbase;

Run a query against the HBase students table.

Command: select * from students;

The above query returns results that are not usable, because HBase stores its values as byte arrays. In the next step, we convert the data from byte arrays to meaningful UTF8 strings.

Issue the following query, which uses the CONVERT_FROM function to convert the students table to typed data:

Command: SELECT CONVERT_FROM(row_key, 'UTF8') AS studentid,
CONVERT_FROM(students.account.name, 'UTF8') AS name,
CONVERT_FROM(students.address.state, 'UTF8') AS state,
CONVERT_FROM(students.address.street, 'UTF8') AS street,
CONVERT_FROM(students.address.zipcode, 'UTF8') AS zipcode
FROM students;
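To see what this conversion amounts to, the short Python sketch below mimics the idea outside Drill: each cell starts out as raw bytes and is decoded as UTF-8 text. It is only an analogy under assumptions, not Drill's implementation; the sample row and the helper name convert_from_utf8 are made up for illustration.

```python
# Hypothetical cell values in the form HBase stores them: raw byte arrays.
raw_row = {
    "studentid": b"student1",
    "name": b"Alice",
    "state": b"CA",
}


def convert_from_utf8(value):
    """Decode a byte array as UTF-8 text, analogous to CONVERT_FROM(col, 'UTF8')."""
    return value.decode("utf-8")


# Decode every column of the row, as the query above does column by column.
typed_row = {column: convert_from_utf8(value) for column, value in raw_row.items()}
print(typed_row)
```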

You can now successfully access HBase through Apache Drill.

Got a question for us? Mention it in the comment section and we will get back to you.
