How to perform Near-real time analytics using Hadoop?

0 votes
We have PostgreSQL in our production server. We have to analyse it on a regular basis, for which we are planning to use Hadoop. Each query which we are executing should be executed on the updated data present in our PostgreSQL database. All this process can only handle a little delay.
Can anyone help me in understanding how to perform this near-real time analytics using Hadoop?
Apr 24, 2018 in Big Data Hadoop by Shubham
• 13,350 points
44 views

1 answer to this question.

0 votes

To perform Near-real time analytics, you have to use Storage Handler API of Hive. Storage Handler API basically keeps the metadata of the external table in the Hive metastore, whereas the tables reside in the external data source(in your case it is postgreSQL).

But the Hive metastore will have all the information of the data (i.e. metadata) i.e. coulnm & format details. So, whenever we are reading data from Hive table with storage handler then the data is fetched from the external table.

You can go to this link & check the Hive handler for PostgreSQL.

https://github.com/myui/HiveJdbcStorageHandler

To know more about handlers, check this link:

https://cwiki.apache.org/confluence/display/Hive/StorageHandlers

answered Apr 24, 2018 by kurt_cobain
• 9,280 points

Related Questions In Big Data Hadoop

0 votes
1 answer

How to get started with Hadoop and do some development using Eclipse IDE?

Alright, there are couple of things that ...READ MORE

answered Apr 4, 2018 in Big Data Hadoop by Ashish
• 2,630 points
143 views
+1 vote
2 answers

How to authenticate username & password while using Connector for Cloudera Hadoop in Tableau?

Hadoop server installed was kerberos enabled server. ...READ MORE

answered Aug 21, 2018 in Big Data Hadoop by Priyaj
• 56,960 points
231 views
0 votes
1 answer

How to create a project for the first time in Hadoop.?

If you want to learn Hadoop framework ...READ MORE

answered Jul 26, 2018 in Big Data Hadoop by Neha
• 6,280 points
104 views
0 votes
1 answer

How to access Hadoop Data using REST service?

The REST API gateway for the Apache ...READ MORE

answered Sep 5, 2018 in Big Data Hadoop by Frankie
• 9,810 points
660 views
0 votes
1 answer

Hadoop Mapreduce word count Program

Firstly you need to understand the concept ...READ MORE

answered Mar 16, 2018 in Data Analytics by nitinrawat895
• 10,760 points
3,515 views
+1 vote
11 answers

hadoop fs -put command?

put syntax: put <localSrc> <dest> copy syntax: copyFr ...READ MORE

answered Dec 7, 2018 in Big Data Hadoop by Aditya
17,842 views
0 votes
1 answer

Hadoop dfs -ls command?

In your case there is no difference ...READ MORE

answered Mar 16, 2018 in Big Data Hadoop by kurt_cobain
• 9,280 points
1,293 views
0 votes
1 answer
+1 vote
2 answers

Failed to restart Hadoop namenode using cloudera quickstart

You can use cloudera manager to manage ...READ MORE

answered Mar 19, 2018 in Big Data Hadoop by kurt_cobain
• 9,280 points
879 views
0 votes
1 answer

How to start working on Hadoop?

Ok, so basically, you are looking forward ...READ MORE

answered Mar 29, 2018 in Big Data Hadoop by kurt_cobain
• 9,280 points
80 views