Most answered questions in Big Data Hadoop

0 votes
1 answer

Hadoop: What does RDD mean?

RDD is the acronym for Resilient Distribution ...READ MORE

Nov 21, 2018 in Big Data Hadoop by Deepak
939 views
0 votes
1 answer

What is the difference between Apache ZooKeeper and Apache Oozie?

The main difference between these two is ...READ MORE

Nov 21, 2018 in Big Data Hadoop by Anoop
4,059 views
0 votes
1 answer

Hadoop: How to get the column name along with the output in Hive?

You can get the column names by ...READ MORE

Nov 21, 2018 in Big Data Hadoop by Omkar
• 69,230 points
4,646 views
0 votes
1 answer

Hadoop: Java code to get Hive table properties.

You can try this: org.apache.hadoop.hive.metastore.api.Table table=..........; org.apache.hadoop.hive.metastore.api.StorageDescriptor sd=table.getSd(); String loc=sd.getLocation(); String ...READ MORE

Nov 21, 2018 in Big Data Hadoop by Omkar
• 69,230 points
716 views
0 votes
1 answer

Hadoop Java: org.apache.hadoop.conf.Configuration does not exist error

To solve this error, add hadoop-common to ...READ MORE

Nov 21, 2018 in Big Data Hadoop by Omkar
• 69,230 points
7,092 views
0 votes
1 answer

In Hadoop MapReduce, how can i set an Object as the Value for Map output?

Try this and see if it works: public ...READ MORE

Nov 21, 2018 in Big Data Hadoop by Omkar
• 69,230 points
767 views
0 votes
1 answer

Hadoop HDFS: Exception in createBlockOutputStream

It's a connection error. Have you turned ...READ MORE

Nov 21, 2018 in Big Data Hadoop by Omkar
• 69,230 points
4,690 views
0 votes
1 answer

Hadoop: "Caused by: java.lang.ClassNotFoundException" error

Have you placed the jar files in ...READ MORE

Nov 19, 2018 in Big Data Hadoop by Omkar
• 69,230 points
6,629 views
0 votes
1 answer

Hadoop hdfs: How to count number of lines?

You can count the number of lines ...READ MORE

Nov 19, 2018 in Big Data Hadoop by Omkar
• 69,230 points
958 views
0 votes
1 answer

Hadoop: Adding multiple inputs with a different mapper using python

hadoop jar hadoop-multiple-streaming.jar \ ...READ MORE

Nov 19, 2018 in Big Data Hadoop by Omkar
• 69,230 points
2,191 views
0 votes
1 answer

Hadoop HDFS: Java API to move files to hdfs

You can use the FileUtil api to do this. Example: Configuration ...READ MORE

Nov 19, 2018 in Big Data Hadoop by Omkar
• 69,230 points
3,800 views
0 votes
1 answer

Hadoop Apache: Could not fully initialize service [org.apache.oozie.service.ShareLibService]

As mentioned in the error, the problem ...READ MORE

Nov 19, 2018 in Big Data Hadoop by Omkar
• 69,230 points
911 views
0 votes
1 answer

What is the difference between Hive internal tables and external tables?

Hive has a relational database on the ...READ MORE

Nov 19, 2018 in Big Data Hadoop by Neha
• 6,300 points
2,660 views
0 votes
1 answer

How to use Sqoop in Java Program?

You can run sqoop from inside your ...READ MORE

Nov 19, 2018 in Big Data Hadoop by Neha
• 6,300 points
1,520 views
0 votes
1 answer

Hadoop MapReduce intermediate output

keep.task.files.pattern parameter can be used to keep the ...READ MORE

Nov 16, 2018 in Big Data Hadoop by Frankie
• 9,830 points
573 views
0 votes
1 answer

Small files and HDFS blocks

Multiple files are not stored in a ...READ MORE

Nov 16, 2018 in Big Data Hadoop by Frankie
• 9,830 points
2,560 views
0 votes
1 answer

Hadoop: Error: Could not find or load main class org.apache.hadoop.util.VersionInfo

This seems like a path issue. Add the ...READ MORE

Nov 16, 2018 in Big Data Hadoop by Omkar
• 69,230 points
11,975 views
0 votes
1 answer

Hadoop with Python: How to use python to stop Hadoop tasktracker?

Here you go os.system('sudo -u hadoop bin/hadoop-daemon.sh stop ...READ MORE

Nov 16, 2018 in Big Data Hadoop by Omkar
• 69,230 points
377 views
0 votes
1 answer

Hadoop Pig: java.lang.NoClassDefFoundError: org/apache/hadoop/hbase/filter/Filter

This seems like a problem with the ...READ MORE

Nov 16, 2018 in Big Data Hadoop by Omkar
• 69,230 points
1,871 views
0 votes
1 answer

Hadoop Pig: How to include external jar file in PIG?

You can do this: register /local/path/to/Jar_name.jar READ MORE

Nov 16, 2018 in Big Data Hadoop by Omkar
• 69,230 points
845 views
0 votes
1 answer

Hadoop Spark Derby: Caused by: ERROR XSDB6: Another instance of Derby may have already booted the database

As mentioned in the error, there's another ...READ MORE

Nov 16, 2018 in Big Data Hadoop by Omkar
• 69,230 points
4,369 views
0 votes
1 answer

Can someone tell me how apache sqoop works?

This blog would give you answers to ...READ MORE

Nov 16, 2018 in Big Data Hadoop by brat_1
• 7,200 points
422 views
0 votes
1 answer

how is apache sqoop different from flume? Can someone explain?

Apache Sqoop lets you import and export ...READ MORE

Nov 15, 2018 in Big Data Hadoop by brat_1
• 7,200 points
353 views
+2 votes
1 answer
0 votes
1 answer

Hadoop Hive Hbase: How to insert data into Hbase using Hive (JSON file)?

You can use the get_json_object function to parse the ...READ MORE

Nov 15, 2018 in Big Data Hadoop by Omkar
• 69,230 points
2,667 views
0 votes
1 answer

Hadoop HDFS: How to delete old files from HDFS?

You can use commands like this: hdfs dfs ...READ MORE

Nov 15, 2018 in Big Data Hadoop by Omkar
• 69,230 points
13,602 views
0 votes
1 answer

Hadoop Spark: __spark_lib.zip does not exist

To find this file, your HADOOP_CONF_DIR env ...READ MORE

Nov 15, 2018 in Big Data Hadoop by Omkar
• 69,230 points
2,449 views
0 votes
1 answer

Hadoop Hive: recursively descending into subdirectories without partitions or editing hive-site.xml

 I set the following properties before I ...READ MORE

Nov 15, 2018 in Big Data Hadoop by Omkar
• 69,230 points
5,220 views
0 votes
1 answer

Hadoop: How to copy directory from local system to hdfs using Java code?

Just use the FileSystem's copyFromLocalFile method. If the source Path ...READ MORE

Nov 14, 2018 in Big Data Hadoop by Omkar
• 69,230 points
5,338 views
0 votes
1 answer

Hadoop Hive: How to drop a table?

You cannot drop column directly from a ...READ MORE

Nov 14, 2018 in Big Data Hadoop by Omkar
• 69,230 points
1,808 views
0 votes
1 answer

Hadoop Hive: How to split a single row into multiple rows?

Try this SELECT ID1, Sub FROM tableName lateral view ...READ MORE

Nov 14, 2018 in Big Data Hadoop by Omkar
• 69,230 points
8,699 views
0 votes
1 answer

Hadoop Hive: How to convert multiple rows into comma separated values?

You can use the aggregator function collect_set to do ...READ MORE

Nov 14, 2018 in Big Data Hadoop by Omkar
• 69,230 points
5,002 views
0 votes
1 answer
0 votes
1 answer

Hadoop MacOS: formatting namenode: Could not find or load main class

You need to point the HADOOP_PREFIX to ...READ MORE

Nov 13, 2018 in Big Data Hadoop by Omkar
• 69,230 points
1,011 views
0 votes
1 answer

Hadoop Spark: What is version to find SparkSession in library Spark?

you need both core and SQL artifacts <repositories> ...READ MORE

Nov 13, 2018 in Big Data Hadoop by Omkar
• 69,230 points
1,797 views
0 votes
1 answer

Hadoop: method to send output to multiple directories

setup: MultipleOutputs.addNamedOutput(job, "Output", TextOutputFormat.class, Text.class, Text.class); setup of reducer: mout ...READ MORE

Nov 13, 2018 in Big Data Hadoop by Omkar
• 69,230 points
752 views
0 votes
1 answer

Hadoop Sqoop: How to change sqoop metastore?

In sqoop-site.xml I configured it with, the ...READ MORE

Nov 13, 2018 in Big Data Hadoop by Omkar
• 69,230 points
1,522 views
0 votes
1 answer

Hadoop: Format namenode gives error: Shutting down NameNode at java.net.UnknownHostException

UnknownHostException is thrown when hadoop tries to resolve ...READ MORE

Nov 12, 2018 in Big Data Hadoop by Omkar
• 69,230 points
5,363 views
0 votes
1 answer

Hadoop Hive: message:Version information not found in metastore. Unable to instantiate

These are the necessary tables required for metastore that are ...READ MORE

Nov 12, 2018 in Big Data Hadoop by Omkar
• 69,230 points
3,321 views
0 votes
1 answer

Hadoop Oozie: E0902: Exception occured: [User: oozie is not allowed to impersonate oozie]

Seems like your version of Hadoop does ...READ MORE

Nov 12, 2018 in Big Data Hadoop by Omkar
• 69,230 points
933 views
0 votes
1 answer

Hadoop Hive: How to insert data in Hive table?

First, copy data into HDFS. Then create ...READ MORE

Nov 12, 2018 in Big Data Hadoop by Omkar
• 69,230 points
9,508 views
+1 vote
1 answer

Hadoop: Using composite-key

You can use a Writable, something like ...READ MORE

Nov 12, 2018 in Big Data Hadoop by Omkar
• 69,230 points
1,901 views
0 votes
1 answer

Where can I find AMI for Hadoop on EC2?

To find that, please try with below ...READ MORE

Nov 12, 2018 in Big Data Hadoop by Frankie
• 9,830 points
873 views
0 votes
1 answer

How to control logging functionality in Hadoop?

 Logs are distributed across your cluster, but ...READ MORE

Nov 12, 2018 in Big Data Hadoop by Frankie
• 9,830 points
4,209 views
0 votes
1 answer

Hadoop: How to use Jps command with Sun JDK installed?

You can put JPS into your path ...READ MORE

Nov 9, 2018 in Big Data Hadoop by Omkar
• 69,230 points
1,694 views
0 votes
1 answer

Hadoop: Reading and Writing Sequencefile using Apis?

public class SequenceFilesTest { @Test ...READ MORE

Nov 9, 2018 in Big Data Hadoop by Omkar
• 69,230 points
369 views
0 votes
1 answer

What skills do I need to become Hadoop Admin?

You have heard it right. There is ...READ MORE

Nov 8, 2018 in Big Data Hadoop by anonymous
374 views
0 votes
1 answer

Hadoop: Not able to start Namenode

Stop all running server 1) stop-all.sh Edit the ...READ MORE

Nov 8, 2018 in Big Data Hadoop by Omkar
• 69,230 points
2,120 views
0 votes
1 answer

Hbase: Client not able to connect with remote Hbase server

You have to remove the localhost entry from hbase server's ...READ MORE

Nov 8, 2018 in Big Data Hadoop by Omkar
• 69,230 points
5,143 views
0 votes
1 answer

Hadoop Hive: How to skip the first line of csv while loading in hive table?

You can try this: CREATE TABLE temp ...READ MORE

Nov 8, 2018 in Big Data Hadoop by Omkar
• 69,230 points
8,490 views