Most answered questions in Big Data Hadoop

0 votes
1 answer

How can I calculate exact median with Apache Spark?

You need to sort RDD and take ...READ MORE

Oct 8, 2018 in Big Data Hadoop by Omkar
• 69,230 points
4,116 views
+3 votes
1 answer

Getting Connection Error while loading data into table using cloudera hive

Hey Nafeesa, Itseems that Hive is not able ...READ MORE

Oct 4, 2018 in Big Data Hadoop by Vardhan
• 13,190 points
732 views
0 votes
1 answer

How can I access S3/S3n from a local Hadoop 2.6 installation?

For some reason, the jar hadoop-aws-[version].jar which contains the ...READ MORE

Oct 3, 2018 in Big Data Hadoop by digger
• 26,740 points
5,121 views
0 votes
1 answer

What is Hadoop Speculative task execution?

One problem with the Hadoop system is ...READ MORE

Oct 3, 2018 in Big Data Hadoop by Frankie
• 9,830 points
894 views
0 votes
1 answer

What is the difference between Writable & WritableComparable in Hadoop?

Writable in an interface in Hadoop and types ...READ MORE

Oct 3, 2018 in Big Data Hadoop by Frankie
• 9,830 points
7,422 views
0 votes
1 answer

out of Memory Error in Hadoop

You can assign more memory by editing ...READ MORE

Oct 1, 2018 in Big Data Hadoop by slayer
• 29,350 points
1,471 views
0 votes
1 answer

Iterate twice on values (MapReduce)

We have to cache the values from ...READ MORE

Oct 1, 2018 in Big Data Hadoop by digger
• 26,740 points
1,911 views
0 votes
1 answer

Using Hadoop for Data Analytics.

Yes, your approach is correct - you ...READ MORE

Sep 28, 2018 in Big Data Hadoop by Frankie
• 9,830 points
530 views
0 votes
1 answer

How to write a file in HDFS using Java Programming language?

Define the HADOOP_CONF_DIR environment variable to your Hadoop configuration ...READ MORE

Sep 28, 2018 in Big Data Hadoop by Frankie
• 9,830 points
2,642 views
0 votes
1 answer

What should be the choice of database and what type of data format is suitable for Spark/hadoop?

Use Parquet. I'm not sure about CSV ...READ MORE

Sep 28, 2018 in Big Data Hadoop by Frankie
• 9,830 points
785 views
0 votes
1 answer

When I try to connect to Hadoop, I get a error as -"Connecting to Resource Manager failed".

Make sure you've started Yarn. Use this ...READ MORE

Sep 28, 2018 in Big Data Hadoop by Frankie
• 9,830 points
1,166 views
0 votes
1 answer

How will you choose various file formats for storing and processing data using Apache Hadoop ?

The decision to choose a particular file ...READ MORE

Sep 27, 2018 in Big Data Hadoop by zombie
• 3,790 points
1,575 views
0 votes
1 answer

Permission denied at hdfs

You can spolve this by adding below ...READ MORE

Sep 27, 2018 in Big Data Hadoop by digger
• 26,740 points
2,406 views
0 votes
1 answer

Is Hadoop a good open-source project to join?

Yes, Hadoop has a whole ecosystem of ...READ MORE

Sep 27, 2018 in Big Data Hadoop by Frankie
• 9,830 points
676 views
0 votes
1 answer

Where can I find older versions of Hadoop?

You can check here. From the archives. In particular, ...READ MORE

Sep 27, 2018 in Big Data Hadoop by Frankie
• 9,830 points
501 views
0 votes
1 answer

How to exit name node from safe mode?

In order to forcefully let the namenode ...READ MORE

Sep 26, 2018 in Big Data Hadoop by slayer
• 29,350 points
1,747 views
0 votes
1 answer

How to write a file in hdfs with Java?

You could pass the URI when getting ...READ MORE

Sep 26, 2018 in Big Data Hadoop by digger
• 26,740 points
3,913 views
0 votes
1 answer

Spark - load CSV file as DataFrame?

spark-csv is part of core Spark functionality ...READ MORE

Sep 25, 2018 in Big Data Hadoop by slayer
• 29,350 points
6,642 views
0 votes
1 answer

Does map/reduce merge output files after reduce phase?

Hey there, instead of doing the file ...READ MORE

Sep 25, 2018 in Big Data Hadoop by digger
• 26,740 points
704 views
0 votes
1 answer

How to configure Hosts file for Hadoop Eco-System?

For UBUNTU Hosts File and other configuration for Hadoop ...READ MORE

Sep 25, 2018 in Big Data Hadoop by Frankie
• 9,830 points
3,938 views
0 votes
1 answer

In Hadoop what does dfs replicate mean?

The total number of files in the ...READ MORE

Sep 25, 2018 in Big Data Hadoop by Frankie
• 9,830 points
3,743 views
0 votes
1 answer

How to pass large records to map/reduce tasks?

Hadoop is not designed for records about ...READ MORE

Sep 25, 2018 in Big Data Hadoop by Frankie
• 9,830 points
1,229 views
0 votes
1 answer

Integration of Hadoop with Mongo DB concept

MongoDB isn't built to work on top ...READ MORE

Sep 25, 2018 in Big Data Hadoop by Frankie
• 9,830 points
1,499 views
0 votes
1 answer

ISR out of Replica

If the replica stays out of the ...READ MORE

Sep 24, 2018 in Big Data Hadoop by kurt_cobain
• 9,390 points
433 views
0 votes
1 answer

Kafka vs Flume

Even though both are used for real-time ...READ MORE

Sep 24, 2018 in Big Data Hadoop by Data_Nerd
• 2,390 points
694 views
0 votes
1 answer

Hadoop: intervals and JOIN

Hey, a solution was given on Biostar: http://biostar.stackexchange.com/questions/8821. Hope ...READ MORE

Sep 24, 2018 in Big Data Hadoop by slayer
• 29,350 points
430 views
0 votes
1 answer

How to create smaller table from big table in HIVE?

You could probably best use Hive's built-in sampling ...READ MORE

Sep 24, 2018 in Big Data Hadoop by digger
• 26,740 points
1,487 views
0 votes
1 answer

What is Dumbo MapReduce in Hadoop?

Doc on Hadoop Streaming : http://hadoop.apache.org/docs/r1.2.1/streaming.html Hadoop streaming is ...READ MORE

Sep 23, 2018 in Big Data Hadoop by Frankie
• 9,830 points
841 views
0 votes
1 answer

What is the standard way to create files in your hdfs file-system?

Well, it's so easy. Just enter the below ...READ MORE

Sep 23, 2018 in Big Data Hadoop by Frankie
• 9,830 points
2,354 views
0 votes
1 answer

Why Hadoop is not implemented using Message Passing Interface (MPI)?

One of the big features of Hadoop/map-reduce ...READ MORE

Sep 21, 2018 in Big Data Hadoop by Frankie
• 9,830 points
601 views
0 votes
1 answer

How to add user in supergroup of hdfs in linux?

Yes , now i have whole idea ...READ MORE

Sep 21, 2018 in Big Data Hadoop by Frankie
• 9,830 points
10,658 views
0 votes
1 answer

Hadoop Java Error: java.lang.NoClassDefFoundError: WordCount (wrong name: org/myorg/WordCount)

Hey, try this code import java.io.IOException; import java.util.Iterator; import java.util.StringTokenizer; import ...READ MORE

Sep 19, 2018 in Big Data Hadoop by slayer
• 29,350 points
5,828 views
0 votes
1 answer

How to execute python script in hadoop file system (hdfs)?

If you are simply looking to distribute ...READ MORE

Sep 19, 2018 in Big Data Hadoop by digger
• 26,740 points
13,072 views
0 votes
1 answer

How does the HDFS architecture provide redundancy?

Hadoop provides redundancy by storing multiple replicas ...READ MORE

Sep 19, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
728 views
0 votes
1 answer

How to discover missing or corrupt HDFS data?

HDFS supports fsck command to check for ...READ MORE

Sep 19, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
1,613 views
0 votes
1 answer

How does Avro schema evolution work?

If you have one avro file and ...READ MORE

Sep 19, 2018 in Big Data Hadoop by Frankie
• 9,830 points
7,964 views
0 votes
1 answer

What is Modeling data in Hadoop and how to do it?

I suggest spending some time with Apache ...READ MORE

Sep 19, 2018 in Big Data Hadoop by Frankie
• 9,830 points
1,585 views
0 votes
1 answer

Where are Hadoop's task log files stored?

For each MapReduce job, Hadoop stores the ...READ MORE

Sep 18, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
714 views
0 votes
1 answer

How to access different directories in a Hadoop cluster?

You need to configure the client to ...READ MORE

Sep 18, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
615 views
0 votes
1 answer

What additional capability does Ganglia provide to monitor Hadoop?

Ganglia is an open-source, scalable and distributed ...READ MORE

Sep 18, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
769 views
0 votes
1 answer

How to achieve maximum memory to disk I/O ratio?

You need to tune io.sort.mb value until ...READ MORE

Sep 18, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
508 views
0 votes
1 answer

How to install Sqoop with Hadoop 2.2.0?

You can refer the below link to ...READ MORE

Sep 18, 2018 in Big Data Hadoop by Frankie
• 9,830 points
874 views
0 votes
1 answer

Hadoop fs mkdir and testing existence FileSystem.exists

FileSystem.get(conf) may return the local file system where ...READ MORE

Sep 18, 2018 in Big Data Hadoop by Frankie
• 9,830 points
3,071 views
0 votes
1 answer

How to choose between Cassandra, Membase, Hadoop, MongoDB and RDBMS?

Actually it comes in two ways: One ...READ MORE

Sep 18, 2018 in Big Data Hadoop by Frankie
• 9,830 points
650 views
0 votes
1 answer

submitJob method in JobClient of Hadoop throws null pointer

I had this problem as well. But when ...READ MORE

Sep 18, 2018 in Big Data Hadoop by Neha
• 6,300 points
592 views
0 votes
1 answer

What is Custom partitioner in Hadoop? How to write partition function ?

Don't think that in Hadoop the same ...READ MORE

Sep 18, 2018 in Big Data Hadoop by Frankie
• 9,830 points
1,385 views
0 votes
1 answer

What is the difference between Hadoop MapReduce and built-in MapReduce?

Differences are as follows: Hadoop's MR can be ...READ MORE

Sep 11, 2018 in Big Data Hadoop by Frankie
• 9,830 points
1,237 views
0 votes
1 answer

Hadoop Documentation for Eclipse.

Few considerations to be taken are here: If ...READ MORE

Sep 11, 2018 in Big Data Hadoop by Frankie
• 9,830 points
697 views
0 votes
1 answer

How should I provide a URL for hdfs file system?

If you are trying to access your ...READ MORE

Sep 10, 2018 in Big Data Hadoop by Frankie
• 9,830 points
3,317 views
0 votes
1 answer

Why am I getting error as hdfs dfs -mkdir, No such file or directory when I create a directory?

It is because the parent directories do ...READ MORE

Sep 10, 2018 in Big Data Hadoop by Frankie
• 9,830 points
14,694 views