How to perform Near-real time analytics using Hadoop

0 votes
We have PostgreSQL in our production server. We have to analyse it on a regular basis, for which we are planning to use Hadoop. Each query which we are executing should be executed on the updated data present in our PostgreSQL database. All this process can only handle a little delay.
Can anyone help me in understanding how to perform this near-real time analytics using Hadoop?
Apr 24, 2018 in Big Data Hadoop by Shubham
• 13,480 points
267 views

1 answer to this question.

0 votes

To perform Near-real time analytics, you have to use Storage Handler API of Hive. Storage Handler API basically keeps the metadata of the external table in the Hive metastore, whereas the tables reside in the external data source(in your case it is postgreSQL).

But the Hive metastore will have all the information of the data (i.e. metadata) i.e. coulnm & format details. So, whenever we are reading data from Hive table with storage handler then the data is fetched from the external table.

You can go to this link & check the Hive handler for PostgreSQL.

https://github.com/myui/HiveJdbcStorageHandler

To know more about handlers, check this link:

https://cwiki.apache.org/confluence/display/Hive/StorageHandlers

answered Apr 24, 2018 by kurt_cobain
• 9,390 points

Related Questions In Big Data Hadoop

0 votes
1 answer

How to get started with Hadoop and do some development using Eclipse IDE?

Alright, there are couple of things that ...READ MORE

answered Apr 4, 2018 in Big Data Hadoop by Ashish
• 2,650 points
1,201 views
+1 vote
2 answers

How to authenticate username & password while using Connector for Cloudera Hadoop in Tableau?

Hadoop server installed was kerberos enabled server. ...READ MORE

answered Aug 21, 2018 in Big Data Hadoop by Priyaj
• 58,140 points
698 views
0 votes
1 answer

How to create a project for the first time in Hadoop.?

If you want to learn Hadoop framework ...READ MORE

answered Jul 27, 2018 in Big Data Hadoop by Neha
• 6,300 points
498 views
0 votes
1 answer

How to access Hadoop Data using REST service?

The REST API gateway for the Apache ...READ MORE

answered Sep 5, 2018 in Big Data Hadoop by Frankie
• 9,810 points
2,860 views
+1 vote
1 answer

Hadoop Mapreduce word count Program

Firstly you need to understand the concept ...READ MORE

answered Mar 16, 2018 in Data Analytics by nitinrawat895
• 11,380 points
8,057 views
+2 votes
11 answers

hadoop fs -put command?

Hi, You can create one directory in HDFS ...READ MORE

answered Mar 16, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
67,328 views
–1 vote
1 answer

Hadoop dfs -ls command?

In your case there is no difference ...READ MORE

answered Mar 16, 2018 in Big Data Hadoop by kurt_cobain
• 9,390 points
3,066 views
0 votes
1 answer
+1 vote
2 answers

Failed to restart Hadoop namenode using cloudera quickstart

You can use Cloudera Manager to manage ...READ MORE

answered Mar 19, 2018 in Big Data Hadoop by kurt_cobain
• 9,390 points

edited Jun 9, 2020 by MD 2,473 views
0 votes
1 answer

How to start working on Hadoop?

Ok, so basically, you are looking forward ...READ MORE

answered Mar 30, 2018 in Big Data Hadoop by kurt_cobain
• 9,390 points
320 views