Most answered questions in Big Data Hadoop

0 votes
1 answer

How can I calculate exact median with Apache Spark?

You need to sort RDD and take ...READ MORE

Oct 8, 2018 in Big Data Hadoop by Omkar
• 69,230 points
4,126 views
+3 votes
1 answer

Getting Connection Error while loading data into table using cloudera hive

Hey Nafeesa, Itseems that Hive is not able ...READ MORE

Oct 4, 2018 in Big Data Hadoop by Vardhan
• 13,190 points
743 views
0 votes
1 answer

How can I access S3/S3n from a local Hadoop 2.6 installation?

For some reason, the jar hadoop-aws-[version].jar which contains the ...READ MORE

Oct 3, 2018 in Big Data Hadoop by digger
• 26,740 points
5,149 views
0 votes
1 answer

What is Hadoop Speculative task execution?

One problem with the Hadoop system is ...READ MORE

Oct 3, 2018 in Big Data Hadoop by Frankie
• 9,830 points
900 views
0 votes
1 answer

What is the difference between Writable & WritableComparable in Hadoop?

Writable in an interface in Hadoop and types ...READ MORE

Oct 3, 2018 in Big Data Hadoop by Frankie
• 9,830 points
7,457 views
0 votes
1 answer

out of Memory Error in Hadoop

You can assign more memory by editing ...READ MORE

Oct 1, 2018 in Big Data Hadoop by slayer
• 29,350 points
1,478 views
0 votes
1 answer

Iterate twice on values (MapReduce)

We have to cache the values from ...READ MORE

Oct 1, 2018 in Big Data Hadoop by digger
• 26,740 points
1,917 views
0 votes
1 answer

Using Hadoop for Data Analytics.

Yes, your approach is correct - you ...READ MORE

Sep 28, 2018 in Big Data Hadoop by Frankie
• 9,830 points
539 views
0 votes
1 answer

How to write a file in HDFS using Java Programming language?

Define the HADOOP_CONF_DIR environment variable to your Hadoop configuration ...READ MORE

Sep 28, 2018 in Big Data Hadoop by Frankie
• 9,830 points
2,651 views
0 votes
1 answer

What should be the choice of database and what type of data format is suitable for Spark/hadoop?

Use Parquet. I'm not sure about CSV ...READ MORE

Sep 28, 2018 in Big Data Hadoop by Frankie
• 9,830 points
794 views
0 votes
1 answer

When I try to connect to Hadoop, I get a error as -"Connecting to Resource Manager failed".

Make sure you've started Yarn. Use this ...READ MORE

Sep 28, 2018 in Big Data Hadoop by Frankie
• 9,830 points
1,174 views
0 votes
1 answer

How will you choose various file formats for storing and processing data using Apache Hadoop ?

The decision to choose a particular file ...READ MORE

Sep 27, 2018 in Big Data Hadoop by zombie
• 3,790 points
1,582 views
0 votes
1 answer

Permission denied at hdfs

You can spolve this by adding below ...READ MORE

Sep 27, 2018 in Big Data Hadoop by digger
• 26,740 points
2,415 views
0 votes
1 answer

Is Hadoop a good open-source project to join?

Yes, Hadoop has a whole ecosystem of ...READ MORE

Sep 27, 2018 in Big Data Hadoop by Frankie
• 9,830 points
722 views
0 votes
1 answer

Where can I find older versions of Hadoop?

You can check here. From the archives. In particular, ...READ MORE

Sep 27, 2018 in Big Data Hadoop by Frankie
• 9,830 points
512 views
0 votes
1 answer

How to exit name node from safe mode?

In order to forcefully let the namenode ...READ MORE

Sep 26, 2018 in Big Data Hadoop by slayer
• 29,350 points
1,763 views
0 votes
1 answer

How to write a file in hdfs with Java?

You could pass the URI when getting ...READ MORE

Sep 26, 2018 in Big Data Hadoop by digger
• 26,740 points
3,927 views
0 votes
1 answer

Spark - load CSV file as DataFrame?

spark-csv is part of core Spark functionality ...READ MORE

Sep 25, 2018 in Big Data Hadoop by slayer
• 29,350 points
6,649 views
0 votes
1 answer

Does map/reduce merge output files after reduce phase?

Hey there, instead of doing the file ...READ MORE

Sep 25, 2018 in Big Data Hadoop by digger
• 26,740 points
708 views
0 votes
1 answer

How to configure Hosts file for Hadoop Eco-System?

For UBUNTU Hosts File and other configuration for Hadoop ...READ MORE

Sep 25, 2018 in Big Data Hadoop by Frankie
• 9,830 points
3,951 views
0 votes
1 answer

In Hadoop what does dfs replicate mean?

The total number of files in the ...READ MORE

Sep 25, 2018 in Big Data Hadoop by Frankie
• 9,830 points
3,765 views
0 votes
1 answer

How to pass large records to map/reduce tasks?

Hadoop is not designed for records about ...READ MORE

Sep 25, 2018 in Big Data Hadoop by Frankie
• 9,830 points
1,242 views
0 votes
1 answer

Integration of Hadoop with Mongo DB concept

MongoDB isn't built to work on top ...READ MORE

Sep 25, 2018 in Big Data Hadoop by Frankie
• 9,830 points
1,511 views
0 votes
1 answer

ISR out of Replica

If the replica stays out of the ...READ MORE

Sep 24, 2018 in Big Data Hadoop by kurt_cobain
• 9,390 points
441 views
0 votes
1 answer

Kafka vs Flume

Even though both are used for real-time ...READ MORE

Sep 24, 2018 in Big Data Hadoop by Data_Nerd
• 2,390 points
702 views
0 votes
1 answer

Hadoop: intervals and JOIN

Hey, a solution was given on Biostar: http://biostar.stackexchange.com/questions/8821. Hope ...READ MORE

Sep 24, 2018 in Big Data Hadoop by slayer
• 29,350 points
437 views
0 votes
1 answer

How to create smaller table from big table in HIVE?

You could probably best use Hive's built-in sampling ...READ MORE

Sep 24, 2018 in Big Data Hadoop by digger
• 26,740 points
1,491 views
0 votes
1 answer

What is Dumbo MapReduce in Hadoop?

Doc on Hadoop Streaming : http://hadoop.apache.org/docs/r1.2.1/streaming.html Hadoop streaming is ...READ MORE

Sep 23, 2018 in Big Data Hadoop by Frankie
• 9,830 points
850 views
0 votes
1 answer

What is the standard way to create files in your hdfs file-system?

Well, it's so easy. Just enter the below ...READ MORE

Sep 23, 2018 in Big Data Hadoop by Frankie
• 9,830 points
2,368 views
0 votes
1 answer

Why Hadoop is not implemented using Message Passing Interface (MPI)?

One of the big features of Hadoop/map-reduce ...READ MORE

Sep 21, 2018 in Big Data Hadoop by Frankie
• 9,830 points
605 views
0 votes
1 answer

How to add user in supergroup of hdfs in linux?

Yes , now i have whole idea ...READ MORE

Sep 21, 2018 in Big Data Hadoop by Frankie
• 9,830 points
10,678 views
0 votes
1 answer

Hadoop Java Error: java.lang.NoClassDefFoundError: WordCount (wrong name: org/myorg/WordCount)

Hey, try this code import java.io.IOException; import java.util.Iterator; import java.util.StringTokenizer; import ...READ MORE

Sep 19, 2018 in Big Data Hadoop by slayer
• 29,350 points
5,843 views
0 votes
1 answer

How to execute python script in hadoop file system (hdfs)?

If you are simply looking to distribute ...READ MORE

Sep 19, 2018 in Big Data Hadoop by digger
• 26,740 points
13,099 views
0 votes
1 answer

How does the HDFS architecture provide redundancy?

Hadoop provides redundancy by storing multiple replicas ...READ MORE

Sep 19, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
740 views
0 votes
1 answer

How to discover missing or corrupt HDFS data?

HDFS supports fsck command to check for ...READ MORE

Sep 19, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
1,622 views
0 votes
1 answer

How does Avro schema evolution work?

If you have one avro file and ...READ MORE

Sep 19, 2018 in Big Data Hadoop by Frankie
• 9,830 points
7,977 views
0 votes
1 answer

What is Modeling data in Hadoop and how to do it?

I suggest spending some time with Apache ...READ MORE

Sep 19, 2018 in Big Data Hadoop by Frankie
• 9,830 points
1,592 views
0 votes
1 answer

Where are Hadoop's task log files stored?

For each MapReduce job, Hadoop stores the ...READ MORE

Sep 18, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
734 views
0 votes
1 answer

How to access different directories in a Hadoop cluster?

You need to configure the client to ...READ MORE

Sep 18, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
620 views
0 votes
1 answer

What additional capability does Ganglia provide to monitor Hadoop?

Ganglia is an open-source, scalable and distributed ...READ MORE

Sep 18, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
775 views
0 votes
1 answer

How to achieve maximum memory to disk I/O ratio?

You need to tune io.sort.mb value until ...READ MORE

Sep 18, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
517 views
0 votes
1 answer

How to install Sqoop with Hadoop 2.2.0?

You can refer the below link to ...READ MORE

Sep 18, 2018 in Big Data Hadoop by Frankie
• 9,830 points
879 views
0 votes
1 answer

Hadoop fs mkdir and testing existence FileSystem.exists

FileSystem.get(conf) may return the local file system where ...READ MORE

Sep 18, 2018 in Big Data Hadoop by Frankie
• 9,830 points
3,082 views
0 votes
1 answer

How to choose between Cassandra, Membase, Hadoop, MongoDB and RDBMS?

Actually it comes in two ways: One ...READ MORE

Sep 18, 2018 in Big Data Hadoop by Frankie
• 9,830 points
661 views
0 votes
1 answer

submitJob method in JobClient of Hadoop throws null pointer

I had this problem as well. But when ...READ MORE

Sep 18, 2018 in Big Data Hadoop by Neha
• 6,300 points
598 views
0 votes
1 answer

What is Custom partitioner in Hadoop? How to write partition function ?

Don't think that in Hadoop the same ...READ MORE

Sep 18, 2018 in Big Data Hadoop by Frankie
• 9,830 points
1,392 views
0 votes
1 answer

What is the difference between Hadoop MapReduce and built-in MapReduce?

Differences are as follows: Hadoop's MR can be ...READ MORE

Sep 11, 2018 in Big Data Hadoop by Frankie
• 9,830 points
1,255 views
0 votes
1 answer

Hadoop Documentation for Eclipse.

Few considerations to be taken are here: If ...READ MORE

Sep 11, 2018 in Big Data Hadoop by Frankie
• 9,830 points
709 views
0 votes
1 answer

How should I provide a URL for hdfs file system?

If you are trying to access your ...READ MORE

Sep 10, 2018 in Big Data Hadoop by Frankie
• 9,830 points
3,327 views
0 votes
1 answer

Why am I getting error as hdfs dfs -mkdir, No such file or directory when I create a directory?

It is because the parent directories do ...READ MORE

Sep 10, 2018 in Big Data Hadoop by Frankie
• 9,830 points
14,755 views