Most viewed questions in Big Data Hadoop

0 votes
1 answer

Error mapreduce.Job: Task Id : attempt_1475517800829_0009_m_000000_0, Status : FAILED Error: Java heap space

Hi@Hasid, To avoid this error, you have to ...READ MORE

Apr 17, 2020 in Big Data Hadoop by MD
• 95,460 points
3,329 views
0 votes
1 answer

How to use the appendToFile command in HDFS?

Hey, This Hadoop fs command appends single sources ...READ MORE

Jun 20, 2019 in Big Data Hadoop by Gitika
• 65,730 points
3,319 views
0 votes
1 answer

How to write a file in HDFS using Java Programming language?

Define the HADOOP_CONF_DIR environment variable to your Hadoop configuration ...READ MORE

Sep 28, 2018 in Big Data Hadoop by Frankie
• 9,830 points
3,314 views
0 votes
1 answer

org.datanucleus.store.rdbms.exceptions.MissingTableException: Required table missing : "DBS" in Catalog "" Schema "".

Hi@akhtar, Here you have to add the below ...READ MORE

Feb 5, 2020 in Big Data Hadoop by MD
• 95,460 points
3,304 views
0 votes
1 answer

Hadoop Hive Hbase: How to insert data into Hbase using Hive (JSON file)?

You can use the get_json_object function to parse the ...READ MORE

Nov 15, 2018 in Big Data Hadoop by Omkar
• 69,180 points
3,286 views
0 votes
1 answer

Can we use Apache Sqoop and Hive both together? What is the difference between the two?

Yes, you can. In fact many people ...READ MORE

Mar 26, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
3,281 views
0 votes
1 answer

Monitoring Hadoop with Prometheus JMX

Hi@Som, This error is not related to Prometheus ...READ MORE

Sep 14, 2020 in Big Data Hadoop by MD
• 95,460 points
3,278 views
0 votes
1 answer

How to find the running namenodes and secondary name nodes in hadoop?

Name nodes: hdfs getconf -namenodes Secondary name nodes: hdfs getconf ...READ MORE

Nov 26, 2018 in Big Data Hadoop by Omkar
• 69,180 points
3,277 views
0 votes
1 answer

How will you decide whether you need to use the Capacity Scheduler or the Fair Scheduler?

Fair Scheduling is the process in which ...READ MORE

Dec 20, 2018 in Big Data Hadoop by Frankie
• 9,830 points
3,272 views
0 votes
1 answer

Joining Multiple Spark Dataframes

You can run the below code to ...READ MORE

Mar 26, 2018 in Big Data Hadoop by Bharani
• 4,660 points
3,258 views
0 votes
1 answer

How can we ignore the header line while loading data into Pig?

Suppose you need to load this in ...READ MORE

Jul 10, 2019 in Big Data Hadoop by Kiran
3,256 views
0 votes
1 answer

Output types of mapper and reducer do not match

job.setOutputValueClass will set the types expected as ...READ MORE

Jul 22, 2019 in Big Data Hadoop by Reena
3,252 views
0 votes
1 answer

How to use load data local infile command?

This command should be executed in MySQL ...READ MORE

Feb 4, 2019 in Big Data Hadoop by Omkar
• 69,180 points
3,247 views
+1 vote
1 answer

How to count the number of rows in an alias in Pig?

COUNT is part of pig LOGS= LOAD 'log'; LOGS_GROUP= ...READ MORE

Oct 15, 2018 in Big Data Hadoop by Omkar
• 69,180 points
3,237 views
0 votes
1 answer

Current Date in Hive

Try this instead: select from_unixtime(unix_timestamp()); If you have an ...READ MORE

Jul 25, 2019 in Big Data Hadoop by Tina
3,235 views
0 votes
1 answer

Which among the following are the Features of Hadoop?

Apache Hadoop is a collection of open-source ...READ MORE

Dec 9, 2021 in Big Data Hadoop by Kavya
• 700 points
3,234 views
0 votes
1 answer

How to install RHadoop packages (RMR, RHDFS, RHBase)?

With devtools you can install directly ...READ MORE

Jul 5, 2019 in Big Data Hadoop by ravikiran
• 4,620 points
3,227 views
0 votes
1 answer

Hadoop: Datanode not starting correctly

You can do the following method, copy to ...READ MORE

Nov 5, 2018 in Big Data Hadoop by Omkar
• 69,180 points
3,223 views
0 votes
2 answers

Unable to start YARN daemons. Error: Could not find or load main class org.apache.hadoop.yarn.server.resourcemanager.ResourceManager

Hi@Akshitha, This seems like a path issue. You ...READ MORE

Nov 9, 2020 in Big Data Hadoop by MD
• 95,460 points
3,209 views
0 votes
1 answer

What is the difference between Hive internal tables and external tables?

Hive has a relational database on the ...READ MORE

Nov 19, 2018 in Big Data Hadoop by Neha
• 6,300 points
3,206 views
0 votes
1 answer

Relationship between Spark, Hadoop and Cassandra?

Spark is a distributed in memory processing ...READ MORE

Mar 26, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
3,206 views
0 votes
1 answer

Java MapReduce error saying No FileSystem for scheme:hdfs

The problem is with the dependencies. The ...READ MORE

May 9, 2018 in Big Data Hadoop by Shubham
• 13,490 points
3,204 views
0 votes
1 answer

How do I create a subquery in hive?

Hey, Hive subquery is a select expression enclosed ...READ MORE

May 17, 2019 in Big Data Hadoop by Gitika
• 65,730 points
3,197 views
0 votes
1 answer

How to suspend a Workflow, Coordinator or Bundle Job in Oozie?

Hey, The suspend option suspends a workflow job in RUNNING status. After ...READ MORE

Jun 24, 2019 in Big Data Hadoop by Gitika
• 65,730 points
3,196 views
0 votes
1 answer

Apache Spark gives "Failed to load native-hadoop with error"

Seems like hadoop path is missing in java.library.path. ...READ MORE

Nov 22, 2018 in Big Data Hadoop by Omkar
• 69,180 points
3,195 views
–1 vote
1 answer

Error while creating Hive Database table

In your error, it says that the ...READ MORE

Jan 8, 2019 in Big Data Hadoop by Omkar
• 69,180 points
3,171 views
0 votes
1 answer

Error: permission denied localhost: chown: cannot access /usr/local/hadoop/ No such file or directory

Hey, As the error suggested that you have ...READ MORE

Jun 17, 2019 in Big Data Hadoop by Gitika
• 65,730 points
3,165 views
0 votes
1 answer

Hadoop Spark: __spark_lib.zip does not exist

To find this file, your HADOOP_CONF_DIR env ...READ MORE

Nov 15, 2018 in Big Data Hadoop by Omkar
• 69,180 points
3,161 views
0 votes
1 answer

Is it possible to change the default location of a managed table?

Hi, Yes, you can do it by using ...READ MORE

Jun 18, 2019 in Big Data Hadoop by Gitika
• 65,730 points
3,155 views
0 votes
1 answer

How to install Scala on Centos?

Maybe you don't have the install point ...READ MORE

Feb 20, 2019 in Big Data Hadoop by Omkar
• 69,180 points
3,155 views
0 votes
1 answer

Hadoop: Adding multiple inputs with a different mapper using python

hadoop jar hadoop-multiple-streaming.jar \ ...READ MORE

Nov 19, 2018 in Big Data Hadoop by Omkar
• 69,180 points
3,152 views
0 votes
1 answer

How does the hadoop command work with javac?

Javac is used for compiling your java ...READ MORE

Aug 17, 2018 in Big Data Hadoop by Neha
• 6,300 points
3,151 views
0 votes
1 answer

Hive: Dropping a non-empty database

This behavior is because Hive by default ...READ MORE

May 20, 2019 in Big Data Hadoop by Rakshit
3,144 views
0 votes
1 answer

How do I print hadoop properties in command line?

You can dump Hadoop config by running: $ ...READ MORE

Aug 23, 2018 in Big Data Hadoop by Frankie
• 9,830 points
3,129 views
0 votes
1 answer

Hadoop input folder not found?

Hey, This is because the user directory not ...READ MORE

Jun 14, 2019 in Big Data Hadoop by Gitika
• 65,730 points
3,115 views
0 votes
1 answer

What is the difference between local file system commands touch and touchz?

Actually they both do the same except touchz is ...READ MORE

Aug 14, 2018 in Big Data Hadoop by Frankie
• 9,830 points
3,113 views
0 votes
1 answer

Small files and HDFS blocks

Multiple files are not stored in a ...READ MORE

Nov 16, 2018 in Big Data Hadoop by Frankie
• 9,830 points
3,112 views
0 votes
1 answer

Hadoop: Can not start namenode: Unable to load native-hadoop library for your platform

The entries in your .bashrc file look ...READ MORE

Dec 5, 2018 in Big Data Hadoop by Omkar
• 69,180 points
3,109 views
0 votes
1 answer

Moving files in Hadoop using the Java API?

I would recommend you to use FileSystem.rename(). ...READ MORE

Apr 15, 2018 in Big Data Hadoop by Shubham
• 13,490 points
3,105 views
0 votes
1 answer

How to get the resources utilized (memory and CPU) by each Hive query running under a single application ID?

Hi@sivachandran, You can monitor the metrics of your ...READ MORE

Sep 18, 2020 in Big Data Hadoop by MD
• 95,460 points
3,099 views
+1 vote
1 answer

How to get status of hdfs directory using python?

import commands hdir_list = commands.getoutput('hadoop fs -ls hdfs: ...READ MORE

Dec 7, 2018 in Big Data Hadoop by Omkar
• 69,180 points
3,093 views
0 votes
1 answer

How to check the name of the hadoop server?

Hi@akhtar, If you're running a pseudo-distributed setup, you ...READ MORE

Oct 13, 2020 in Big Data Hadoop by MD
• 95,460 points
3,087 views
0 votes
1 answer

HDFS on docker containers - is it possible?

Hi @judy, It is possible to simplify Hadoop deployments ...READ MORE

Feb 12, 2020 in Big Data Hadoop by Gitika
• 65,730 points
3,086 views
0 votes
1 answer

Explain to me the difference between HBase and HDFS.

Hadoop generally consists of three major components: HDFS It ...READ MORE

Apr 12, 2019 in Big Data Hadoop by ravikiran
• 4,620 points
3,082 views
0 votes
1 answer

org.apache.hadoop.mapreduce.lib.input.InvalidInputException: Input path does not exist: hdfs://localhost:54310/user/hduser/input.

Hi@akhtar, It seems you are providing the wrong ...READ MORE

Oct 13, 2020 in Big Data Hadoop by MD
• 95,460 points
3,075 views
0 votes
1 answer

How to create Children or Sub-znode in zookeeper?

Hey, Creating children is similar to creating new ...READ MORE

May 28, 2019 in Big Data Hadoop by Gitika
• 65,730 points
3,072 views
0 votes
1 answer

What is the difference between a zero reducer and identity reducer in Hadoop Mapreduce?

A Zero reducer as the name suggests ...READ MORE

Apr 10, 2018 in Big Data Hadoop by kurt_cobain
• 9,350 points
3,065 views
0 votes
1 answer

Error running hadoop mapreduce in Python using Hadoop Streaming

Hi, As you write mapper and reducer program ...READ MORE

Jan 21, 2020 in Big Data Hadoop by anonymous
3,065 views
0 votes
1 answer

Not able to start Job History Server in Hadoop 2.8.1

You have to start JobHistoryServer process specifically ...READ MORE

Mar 30, 2018 in Big Data Hadoop by Ashish
• 2,650 points
3,063 views