Trending questions in Big Data Hadoop

0 votes
1 answer

CSV integration with Hadoop

For integrating Hadoop with CSV, we can use ...READ MORE

Jul 16, 2019 in Big Data Hadoop by Krish
1,920 views
0 votes
1 answer

Hadoop security GroupMappingServiceProvider exception for Spark job via Dataproc API

If you don't want to turn off ...READ MORE

Jul 2, 2019 in Big Data Hadoop by ravikiran
• 4,620 points
2,519 views
0 votes
1 answer

Index LZO files

You can do it using the following ...READ MORE

Jul 24, 2019 in Big Data Hadoop by Rishi
1,552 views
0 votes
1 answer

Primary keys in Apache Spark

from pyspark.sql.functions import monotonically_increasing_id
df.withColumn("id", monotonically_increasing_id()).show()
Verify the second ...READ MORE

Jul 11, 2019 in Big Data Hadoop by ravikiran
• 4,620 points
2,101 views
0 votes
1 answer

What are the services running in Hadoop?

Below are the services running in Hadoop: HDFS, YARN, MapReduce, Oozie, ZooKeeper, Hive, Hue, HBase, Impala, Flume, Sqoop, Spark. Depending ...READ MORE

Jun 24, 2019 in Big Data Hadoop by deepak
2,879 views
0 votes
1 answer

What is the correct way to get a Hadoop FileSystem object that can be used for both reading from and writing to HDFS?

FileSystem needs only one configuration key to successfully ...READ MORE

Jun 7, 2019 in Big Data Hadoop by ravikiran
• 4,620 points
3,524 views
0 votes
1 answer

Merging files using Pig

In order to merge two or more ...READ MORE

Jul 4, 2019 in Big Data Hadoop by Tina
2,322 views
0 votes
1 answer

What are the modes of Hive metastore?

Hey, the metastore stores the schema and partition ...READ MORE

May 13, 2019 in Big Data Hadoop by Gitika
• 65,730 points
4,576 views
0 votes
1 answer

Syntax to specify that the data is enclosed in double quotes in Pig

So, we will execute the below command, new_A_2 ...READ MORE

Jul 10, 2019 in Big Data Hadoop by Esha
2,038 views
0 votes
1 answer

Does an HQL query go to all the nodes to process data?

InputSplits are created by logical division of ...READ MORE

Jul 26, 2019 in Big Data Hadoop by Tina
1,340 views
0 votes
1 answer

What is the use of ZooKeeper in HBase?

Hey, ZooKeeper is used to maintain the ...READ MORE

May 21, 2019 in Big Data Hadoop by Gitika
• 65,730 points
4,193 views
0 votes
1 answer

How to use the HBase shell in a bash script?

Hello, the HBase shell includes a non-interactive mode for writing scripts, ...READ MORE

May 29, 2019 in Big Data Hadoop by Gitika
• 65,730 points
3,829 views
0 votes
1 answer

Pig script not dumping the data correctly

The first column is denoted by $0, ...READ MORE

Jul 15, 2019 in Big Data Hadoop by George
1,798 views
0 votes
2 answers

If the HDFS threshold has been reached, what is the approach to resolve this issue?

This was a fundamental issue in HDFS ...READ MORE

Jun 24, 2019 in Big Data Hadoop by Gitika
• 65,730 points
2,723 views
0 votes
1 answer

Hive Query: Split value

You can use SUBSTR() in Hive ...READ MORE

Jul 25, 2019 in Big Data Hadoop by Paul
1,347 views
0 votes
1 answer

How to execute a Python script in the Hadoop file system (HDFS)?

If you are simply looking to distribute ...READ MORE

Sep 19, 2018 in Big Data Hadoop by digger
• 26,740 points
14,696 views
0 votes
1 answer

Can we use different input and output format classes?

Yes, InputFormatClass and OutputFormatClass are independent of ...READ MORE

Jul 22, 2019 in Big Data Hadoop by Jishan
1,460 views
0 votes
1 answer

Query regarding temporary file type conversion of Parquet to text

Suppose I have the below Parquet file ...READ MORE

Jul 9, 2019 in Big Data Hadoop by Janan
2,005 views
0 votes
1 answer

Frequently used classes implementing InputFormat

FileInputFormat: Base class for all file-based InputFormats. Other ...READ MORE

Jul 24, 2019 in Big Data Hadoop by Reshma
1,356 views
0 votes
1 answer

Hive: "SET LOCATION" command not changing the location of the database

The SET LOCATION command does not change ...READ MORE

May 20, 2019 in Big Data Hadoop by Yash
4,158 views
0 votes
1 answer

Difference between root of HDFS and local machine

The hdfs dfs -put command is used to ...READ MORE

Jun 19, 2019 in Big Data Hadoop by Mahi
2,856 views
0 votes
1 answer

How to solve an error caused by the output types of mapper and reducer not matching?

job.setOutputValueClass will set the types expected as ...READ MORE

Jul 9, 2019 in Big Data Hadoop by Rishab
1,986 views
0 votes
1 answer

Query regarding ArrayBuffer

It's because that is the syntax. This ...READ MORE

Jul 26, 2019 in Big Data Hadoop by Rishi
1,233 views
0 votes
1 answer

What are the main configuration parameters that a user needs to specify to run a MapReduce job?

Hi, the user of the MapReduce framework needs ...READ MORE

Jun 10, 2019 in Big Data Hadoop by Gitika
• 65,730 points
3,209 views
0 votes
1 answer

Delimiter on the data

You can use this:
import org.apache.spark.sql.functions.struct
val df = ...READ MORE

Jul 25, 2019 in Big Data Hadoop by Vinay
1,261 views
0 votes
1 answer

How to execute a combiner and partitioning program without adding a package name?

Yes, it is possible to do so ...READ MORE

Jul 16, 2019 in Big Data Hadoop by Raman
1,633 views
0 votes
1 answer

Produce compressed data from MapReduce

It is straightforward and you can achieve ...READ MORE

Jul 24, 2019 in Big Data Hadoop by John
1,273 views
0 votes
1 answer

Heartbeat time interval for DataNode

The heartbeat interval is 3 seconds by ...READ MORE

Jun 20, 2019 in Big Data Hadoop by Rajat
2,726 views
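
The 3-second heartbeat in the entry above also feeds into how long the NameNode waits before declaring a DataNode dead. A minimal sketch of that arithmetic follows, assuming the stock defaults for `dfs.heartbeat.interval` (3 seconds) and `dfs.namenode.heartbeat.recheck-interval` (300000 ms); the function name and constants are illustrative only, and a real cluster may override both values in hdfs-site.xml.

```python
# Assumed defaults from hdfs-default.xml; check your cluster's hdfs-site.xml.
HEARTBEAT_INTERVAL_S = 3        # dfs.heartbeat.interval, in seconds
RECHECK_INTERVAL_MS = 300_000   # dfs.namenode.heartbeat.recheck-interval, in ms


def dead_node_timeout_seconds(heartbeat_s: int = HEARTBEAT_INTERVAL_S,
                              recheck_ms: int = RECHECK_INTERVAL_MS) -> float:
    """Seconds without a heartbeat before a DataNode is treated as dead.

    Uses the commonly documented rule:
    2 * recheck interval + 10 * heartbeat interval.
    """
    return 2 * (recheck_ms / 1000) + 10 * heartbeat_s


if __name__ == "__main__":
    # With the assumed defaults: 2 * 300 s + 10 * 3 s = 630 s (10.5 minutes).
    print(dead_node_timeout_seconds())
```

With these defaults a DataNode is marked dead after roughly ten and a half minutes of silence, which is why lowering only the heartbeat interval changes the timeout very little.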
0 votes
1 answer

PDF to CSV file format conversion

You can convert the PDF files with ...READ MORE

Jul 9, 2019 in Big Data Hadoop by Esha
1,889 views
0 votes
1 answer

How to change the Hive table comment?

Yes, it is possible to change the ...READ MORE

May 20, 2019 in Big Data Hadoop by Tina
4,051 views
0 votes
1 answer

How can we ignore header line while loading data into Pig?

You can use the following code: A = ...READ MORE

Jul 22, 2019 in Big Data Hadoop by kiran
1,303 views
0 votes
1 answer

Getting an error while using Sqoop import

You are trying to execute the sqoop ...READ MORE

Jul 5, 2019 in Big Data Hadoop by Reshma
2,050 views
0 votes
1 answer

Hive Query to sort data

If you are trying to sort first ...READ MORE

Jul 14, 2019 in Big Data Hadoop by Tina
1,664 views
0 votes
1 answer

DataNode not running on Hadoop-2.7.3

Hey, I solved this problem by removing hadoop ...READ MORE

Jun 14, 2019 in Big Data Hadoop by Gitika
• 65,730 points
2,946 views
0 votes
1 answer

What is the difference between CREATE TABLE and CREATE EXTERNAL TABLE in Hive?

Hey, although we can create two types of ...READ MORE

Jun 26, 2019 in Big Data Hadoop by Gitika
• 65,730 points
2,367 views
0 votes
1 answer

How many partitions can be created for one table?

Well, there are two kinds of partitions: 1. ...READ MORE

Jul 22, 2019 in Big Data Hadoop by Kunal
1,205 views
0 votes
2 answers

How are Partitioning and Bucketing different from each other in Apache Hive?

Let us consider a student database table ...READ MORE

Apr 15, 2019 in Big Data Hadoop by nitinrawat895
• 11,380 points
5,389 views
0 votes
1 answer

Hive queries from MapReduce give a NullPointerException

I think you have upgraded CDH. This ...READ MORE

Jun 4, 2019 in Big Data Hadoop by Rihan
3,237 views
0 votes
1 answer

How to import a complete directory into HDFS?

First, you need to create a directory in Hadoop: $ ...READ MORE

Jul 15, 2019 in Big Data Hadoop by Hiran
1,462 views
0 votes
1 answer

How does the Hive job execution flow work?

Hey, the Hive query is received from the UI or ...READ MORE

May 13, 2019 in Big Data Hadoop by Gitika
• 65,730 points
4,186 views
0 votes
1 answer

Limit for NameNode quantity

Each file schema = 150 bytes. Block schema ...READ MORE

Jul 25, 2019 in Big Data Hadoop by ravikiran
• 4,620 points
1,019 views
0 votes
0 answers

Sqoop exclude tables

How to exclude tables in Sqoop if ...READ MORE

Jul 25, 2019 in Big Data Hadoop by Will
1,040 views
0 votes
2 answers

Error While Running Oozie Project

You need to add your current user ...READ MORE

Aug 6, 2019 in Big Data Hadoop by Dhimanshu
1,417 views
0 votes
1 answer

How to create children or sub-znodes in ZooKeeper?

Hey, creating children is similar to creating new ...READ MORE

May 28, 2019 in Big Data Hadoop by Gitika
• 65,730 points
3,494 views
0 votes
1 answer

Creating a Hive script and executing it in the edureka cloudlab

Please remove the -f option from the hive arguments and use the hql extension ...READ MORE

Jul 14, 2019 in Big Data Hadoop by Karan
1,454 views
0 votes
1 answer

How to increase reducers in Hive?

By default, only one reducer is assigned ...READ MORE

May 21, 2019 in Big Data Hadoop by Lohit
3,751 views
0 votes
2 answers

Using PHP to get all file names in a folder stored in HDFS

So I found a workaround for the ...READ MORE

Mar 13, 2019 in Big Data Hadoop by Bhavish
• 370 points

edited Mar 13, 2019 by Omkar
6,756 views
0 votes
1 answer

JDBC connection to Hive using TLS/SSL

Hi. The properties are right but in ...READ MORE

May 31, 2019 in Big Data Hadoop by Karan
3,261 views
0 votes
1 answer

Creating a test job in Sqoop for incremental load

Yes, it is possible to do so. ...READ MORE

Jul 5, 2019 in Big Data Hadoop by Umar
1,731 views