Drilling Down On Apache Drill, the New-Age Query Engine Part 2

Become a Certified Professional

In this second Apache Drill blog post, we will learn how integrate Hive and HBase with Apache Drill. Apache Drill provides inbuilt storage plugins for Hive and HBase integration. We just need to edit the configurations of these storage plugins and connect Drill.

Check out the first Apache Drill blog in the series here.

First, let us see how to connect Apache Drill with Hive storage plugin.

In the snapshot below, you can see that I have a table customer which has some data. Now we will connect Hive with Drill and access the same table from Drill.

table-Apache-Drill

Start Drillbit service

Command: ./bin/Drillbit.sh start

Start Drill shell

Command: Drill-conf

Next, open the Drill UI in the browser localhost:8047

You will see Hive in the disabled storage plugins. Click on Update for Hive.

Edit the configurations according to your existing Hive settings that are available in hive-site.xml

After editing, click on Enable.

You will now find Hive now in the Enabled Storage Plugin.

Run the command below command in the Drill shell to access Hive from Drill.

Command: use hive;

You can now access the customer table from Drill. You have successfully integrated Hive with Drill.

Command: select * from customer;

Next, to enable the HBase storage plugin, we will follow the same steps as Hive. Click on update for HBase and edit the configurations accordingly.

Click on enable after editing the configurations:

Now, I have the “students” table in my HBase which has some data. Let us see if we can access it now through Drill.

Run the command below in Drill to use the HBase storage plugin.

Command: use HBase;

Run a query to the HBase student table.

Command: select * from students;

The above query returns results that are not useable. In the next step, we have to convert the data from byte arrays to UTF8 types that are meaningful.

Issue the following query, that includes the CONVERT_FROM function, to convert the students table to typed data:

Command: SELECT CONVERT_FROM(row_key, ‘UTF8’) AS studentid,

CONVERT_FROM(students.account.name, ‘UTF8’) AS name,

CONVERT_FROM(students.address.state, ‘UTF8’) AS state,

CONVERT_FROM(students.address.street, ‘UTF8’) AS street,

CONVERT_FROM(students.address.zipcode, ‘UTF8’) AS zipcode

From students;

You can now successfully access HBase through Apache Drill.

Got a question for us? Mention them in the comment section and we will get back to you.

Related Posts:

Get Started with Apache Spark and Scala

Get Started with Big Data and Hadoop

Drilling Down On Apache Drill, the New-Age Query Engine Part 1

Drilling Down On Apache Drill, The New-Age Query Engine (Part 2)

Recommended videos for you

Advanced Security In Hadoop Cluster

Big Data Tutorial – Get Started With Big Data And Hadoop

Big Data – XML Parsing With MapReduce

New-Age Search through Apache Solr

Reduce Side Joins With MapReduce

Filtering on HBase Using MapReduce Filtering Pattern

Hadoop-A Highly Available And Secure Enterprise Data Warehousing Solution

Administer Hadoop Cluster

Python for Big Data Analytics

Introduction to Big Data TDD and Pig Unit

Pig Tutorial – Know Everything About Apache Pig Script

Distributed Cache With MapReduce

Boost Your Data Career with Predictive Analytics! Learn How ?

Spark SQL | Apache Spark

What is Apache Storm all about?

Big Data Processing With Apache Spark

What is Big Data and Why Learn Hadoop!!!

What Is Hadoop – All You Need To Know About Hadoop

Introduction to Apache Solr-1

Logistic Regression In Data Science

Recommended blogs for you

Dataframes in Spark: All you need to know about Structured Data Processing

What Is Elasticsearch – Getting Started With No Constraints Search Engine

Hadoop Developer-Job Responsibilities & Skills

Spark SQL Tutorial – Understanding Spark SQL With Examples

Big Data Processing with Spark and Scala

Scala Functional Programming

HBase Architecture: HBase Data Model & HBase Read/Write Mechanism

How to Plan the Capacity of a Hadoop Cluster?

Big Data Engineer Salary – How Much Can You Expect As A Big Data Engineer?

Pig Programming: Apache Pig Script in Local Mode

Hadoop Interview Questions On HBase In 2024

PySpark MLlib Tutorial : Machine Learning with PySpark

Top Hadoop Interview Questions On Apache PIG For 2024

Explaining Kerberos

Hadoop Administration Interview Questions and Answers For 2024

Map Side Join Vs. Join

Big Data Applications in Healthcare

How to become an Apache Spark Developer?

Install Apache Hadoop Cluster on Amazon EC2 free tier Ubuntu server in 30 minutes

How To Create User In MongoDB?

Join the discussion Cancel reply

Trending Courses in Big Data

Azure Data Engineer Certification (DP-203) Co ...

PySpark Course Online Training

Big Data Hadoop Certification Training Course

Apache Spark and Scala Certification Training ...

Apache Kafka Certification Training Course

Splunk Certification Training: Power User and ...

Leveraging Big Data for Business Intelligence ...

ELK Stack Training & Certification

Apache Solr Certification Training

Apache Storm Certification Training

Browse Categories

Subscribe to our Newsletter, and get personalized recommendations.

Drilling Down On Apache Drill, The New-Age Query Engine (Part 2)