Most voted questions in Big Data Hadoop

0 votes
1 answer

How will you choose various file formats for storing and processing data using Apache Hadoop ?

The decision to choose a particular file ...READ MORE

Sep 27, 2018 in Big Data Hadoop by zombie
• 3,790 points
1,560 views
0 votes
1 answer

Permission denied at hdfs

You can spolve this by adding below ...READ MORE

Sep 27, 2018 in Big Data Hadoop by digger
• 26,740 points
2,394 views
0 votes
1 answer

Is Hadoop a good open-source project to join?

Yes, Hadoop has a whole ecosystem of ...READ MORE

Sep 27, 2018 in Big Data Hadoop by Frankie
• 9,830 points
658 views
0 votes
1 answer

Where can I find older versions of Hadoop?

You can check here. From the archives. In particular, ...READ MORE

Sep 27, 2018 in Big Data Hadoop by Frankie
• 9,830 points
492 views
0 votes
1 answer

How to exit name node from safe mode?

In order to forcefully let the namenode ...READ MORE

Sep 26, 2018 in Big Data Hadoop by slayer
• 29,350 points
1,727 views
0 votes
1 answer

How to write a file in hdfs with Java?

You could pass the URI when getting ...READ MORE

Sep 26, 2018 in Big Data Hadoop by digger
• 26,740 points
3,895 views
0 votes
1 answer

Spark - load CSV file as DataFrame?

spark-csv is part of core Spark functionality ...READ MORE

Sep 25, 2018 in Big Data Hadoop by slayer
• 29,350 points
6,621 views
0 votes
1 answer

Does map/reduce merge output files after reduce phase?

Hey there, instead of doing the file ...READ MORE

Sep 25, 2018 in Big Data Hadoop by digger
• 26,740 points
697 views
0 votes
1 answer

How to configure Hosts file for Hadoop Eco-System?

For UBUNTU Hosts File and other configuration for Hadoop ...READ MORE

Sep 25, 2018 in Big Data Hadoop by Frankie
• 9,830 points
3,925 views
0 votes
1 answer

In Hadoop what does dfs replicate mean?

The total number of files in the ...READ MORE

Sep 25, 2018 in Big Data Hadoop by Frankie
• 9,830 points
3,717 views
0 votes
1 answer

How to pass large records to map/reduce tasks?

Hadoop is not designed for records about ...READ MORE

Sep 25, 2018 in Big Data Hadoop by Frankie
• 9,830 points
1,215 views
0 votes
1 answer

Integration of Hadoop with Mongo DB concept

MongoDB isn't built to work on top ...READ MORE

Sep 25, 2018 in Big Data Hadoop by Frankie
• 9,830 points
1,479 views
0 votes
1 answer

ISR out of Replica

If the replica stays out of the ...READ MORE

Sep 24, 2018 in Big Data Hadoop by kurt_cobain
• 9,390 points
424 views
0 votes
1 answer

Kafka vs Flume

Even though both are used for real-time ...READ MORE

Sep 24, 2018 in Big Data Hadoop by Data_Nerd
• 2,390 points
688 views
0 votes
1 answer

Hadoop: intervals and JOIN

Hey, a solution was given on Biostar: http://biostar.stackexchange.com/questions/8821. Hope ...READ MORE

Sep 24, 2018 in Big Data Hadoop by slayer
• 29,350 points
422 views
0 votes
1 answer

How to create smaller table from big table in HIVE?

You could probably best use Hive's built-in sampling ...READ MORE

Sep 24, 2018 in Big Data Hadoop by digger
• 26,740 points
1,478 views
0 votes
1 answer

What is Dumbo MapReduce in Hadoop?

Doc on Hadoop Streaming : http://hadoop.apache.org/docs/r1.2.1/streaming.html Hadoop streaming is ...READ MORE

Sep 23, 2018 in Big Data Hadoop by Frankie
• 9,830 points
834 views
0 votes
1 answer

What is the standard way to create files in your hdfs file-system?

Well, it's so easy. Just enter the below ...READ MORE

Sep 23, 2018 in Big Data Hadoop by Frankie
• 9,830 points
2,345 views
0 votes
1 answer

Why Hadoop is not implemented using Message Passing Interface (MPI)?

One of the big features of Hadoop/map-reduce ...READ MORE

Sep 21, 2018 in Big Data Hadoop by Frankie
• 9,830 points
591 views
0 votes
1 answer

How to add user in supergroup of hdfs in linux?

Yes , now i have whole idea ...READ MORE

Sep 21, 2018 in Big Data Hadoop by Frankie
• 9,830 points
10,619 views
0 votes
1 answer

Hadoop Java Error: java.lang.NoClassDefFoundError: WordCount (wrong name: org/myorg/WordCount)

Hey, try this code import java.io.IOException; import java.util.Iterator; import java.util.StringTokenizer; import ...READ MORE

Sep 19, 2018 in Big Data Hadoop by slayer
• 29,350 points
5,814 views
0 votes
1 answer

How to execute python script in hadoop file system (hdfs)?

If you are simply looking to distribute ...READ MORE

Sep 19, 2018 in Big Data Hadoop by digger
• 26,740 points
13,046 views
0 votes
1 answer

How does the HDFS architecture provide redundancy?

Hadoop provides redundancy by storing multiple replicas ...READ MORE

Sep 19, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
716 views
0 votes
1 answer

How to discover missing or corrupt HDFS data?

HDFS supports fsck command to check for ...READ MORE

Sep 19, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
1,600 views
0 votes
1 answer

How does Avro schema evolution work?

If you have one avro file and ...READ MORE

Sep 19, 2018 in Big Data Hadoop by Frankie
• 9,830 points
7,951 views
0 votes
1 answer

What is Modeling data in Hadoop and how to do it?

I suggest spending some time with Apache ...READ MORE

Sep 19, 2018 in Big Data Hadoop by Frankie
• 9,830 points
1,573 views
0 votes
1 answer

Where are Hadoop's task log files stored?

For each MapReduce job, Hadoop stores the ...READ MORE

Sep 18, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
693 views
0 votes
1 answer

How to access different directories in a Hadoop cluster?

You need to configure the client to ...READ MORE

Sep 18, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
605 views
0 votes
1 answer

What additional capability does Ganglia provide to monitor Hadoop?

Ganglia is an open-source, scalable and distributed ...READ MORE

Sep 18, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
759 views
0 votes
1 answer

How to achieve maximum memory to disk I/O ratio?

You need to tune io.sort.mb value until ...READ MORE

Sep 18, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
500 views
0 votes
1 answer

How to install Sqoop with Hadoop 2.2.0?

You can refer the below link to ...READ MORE

Sep 18, 2018 in Big Data Hadoop by Frankie
• 9,830 points
869 views
0 votes
1 answer

Hadoop fs mkdir and testing existence FileSystem.exists

FileSystem.get(conf) may return the local file system where ...READ MORE

Sep 18, 2018 in Big Data Hadoop by Frankie
• 9,830 points
3,056 views
0 votes
1 answer

How to choose between Cassandra, Membase, Hadoop, MongoDB and RDBMS?

Actually it comes in two ways: One ...READ MORE

Sep 18, 2018 in Big Data Hadoop by Frankie
• 9,830 points
644 views
0 votes
2 answers

Is there a .NET equivalent to Apache Hadoop?

Hadoop is a Java-based platform. So, to ...READ MORE

Jul 16, 2020 in Big Data Hadoop by Suhana
• 340 points
1,413 views
0 votes
1 answer

submitJob method in JobClient of Hadoop throws null pointer

I had this problem as well. But when ...READ MORE

Sep 18, 2018 in Big Data Hadoop by Neha
• 6,300 points
583 views
0 votes
1 answer

What is Custom partitioner in Hadoop? How to write partition function ?

Don't think that in Hadoop the same ...READ MORE

Sep 18, 2018 in Big Data Hadoop by Frankie
• 9,830 points
1,375 views
0 votes
1 answer

What is the difference between Hadoop MapReduce and built-in MapReduce?

Differences are as follows: Hadoop's MR can be ...READ MORE

Sep 11, 2018 in Big Data Hadoop by Frankie
• 9,830 points
1,216 views
0 votes
1 answer

Hadoop Documentation for Eclipse.

Few considerations to be taken are here: If ...READ MORE

Sep 11, 2018 in Big Data Hadoop by Frankie
• 9,830 points
688 views
0 votes
1 answer

How should I provide a URL for hdfs file system?

If you are trying to access your ...READ MORE

Sep 10, 2018 in Big Data Hadoop by Frankie
• 9,830 points
3,306 views
0 votes
1 answer

Why am I getting error as hdfs dfs -mkdir, No such file or directory when I create a directory?

It is because the parent directories do ...READ MORE

Sep 10, 2018 in Big Data Hadoop by Frankie
• 9,830 points
14,637 views
0 votes
1 answer

What are the basic concepts for video data structure and processing?

1.  Make use of hadoop storm framework. ( ...READ MORE

Sep 7, 2018 in Big Data Hadoop by Frankie
• 9,830 points
414 views
0 votes
1 answer

How to upgrade Apache Hadoop from 2.4.1 to 2.6.0?

If the downtime is not an issue, ...READ MORE

Sep 7, 2018 in Big Data Hadoop by Frankie
• 9,830 points
450 views
0 votes
1 answer

Understanding Hadoop Master Node, Slave Node and Data Node.

The NameNode and the JobTracker are single ...READ MORE

Sep 6, 2018 in Big Data Hadoop by Frankie
• 9,830 points
2,481 views
0 votes
1 answer

What is Network Topology in Hadoop?

Let's imagine your cluster as a tree ...READ MORE

Sep 6, 2018 in Big Data Hadoop by Frankie
• 9,830 points
3,519 views
0 votes
1 answer

Clarification of some Hadoop Concepts.

Please go through : http://bytepadding.com/big-data/map-reduce/understanding-map-reduce-the-missing-guide/ Lets try to understand ...READ MORE

Sep 5, 2018 in Big Data Hadoop by Frankie
• 9,830 points
338 views
0 votes
1 answer

How to format the output being written by MapReduce in Hadoop?

Here is a simple code demonstrate the ...READ MORE

Sep 5, 2018 in Big Data Hadoop by Frankie
• 9,830 points
2,281 views
0 votes
1 answer

How to access Hadoop Data using REST service?

The REST API gateway for the Apache ...READ MORE

Sep 5, 2018 in Big Data Hadoop by Frankie
• 9,830 points
4,103 views
0 votes
1 answer

How to write examples to test whether the installation is validated or not?

You have some example jars in your ...READ MORE

Sep 5, 2018 in Big Data Hadoop by Frankie
• 9,830 points
499 views
0 votes
1 answer

Will hadoop(sqoop) load oracle faster than SQL loader?

down vote Had a similar task of exporting ...READ MORE

Sep 4, 2018 in Big Data Hadoop by Frankie
• 9,830 points
1,571 views
0 votes
1 answer

Which Operating system is more preferable for data node?

In a real installation (1 active namenode, ...READ MORE

Sep 4, 2018 in Big Data Hadoop by Neha
• 6,300 points
516 views