Most answered questions in Big Data Hadoop

0 votes
1 answer

I get an error stating: Hadoop: «ERROR : JAVA_HOME is not set». How do I resolve this?

Make sure that you have removed the ...READ MORE

Aug 7, 2018 in Big Data Hadoop by Frankie
• 9,830 points
5,933 views
0 votes
1 answer

How should I connect to Hadoop in C#?

The Uri should be your Cluster Name, ...READ MORE

Aug 7, 2018 in Big Data Hadoop by Neha
• 6,300 points
2,065 views
0 votes
1 answer

Increasing HFile block size

If you increase the block size then ...READ MORE

Aug 6, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
947 views
0 votes
1 answer

Designing Schema in HBase

Just create a single table with two ...READ MORE

Aug 6, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
802 views
0 votes
1 answer

What are the security guidelines for Impala?

Security Guidelines for Impala: The following are the ...READ MORE

Aug 6, 2018 in Big Data Hadoop by kurt_cobain
• 9,350 points
870 views
0 votes
1 answer

Cloudera Managing and Compatibility

With more partners than any other Hadoop ...READ MORE

Aug 6, 2018 in Big Data Hadoop by kurt_cobain
• 9,350 points
532 views
0 votes
1 answer

What are the different ways of installing Hadoop on a local machine?

Hadoop runs on Unix and on Windows. ...READ MORE

Aug 4, 2018 in Big Data Hadoop by Neha
• 6,300 points
5,274 views
0 votes
1 answer

Which is the preferred language for Hadoop MapReduce programs?

MapReduce is a programming model to perform ...READ MORE

Aug 4, 2018 in Big Data Hadoop by Neha
• 6,300 points
3,761 views
0 votes
1 answer

Is there any way to make the Apache Commons Math library available to a MapReduce job at runtime?

So, what you can do is when ...READ MORE

Aug 3, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
1,114 views
0 votes
1 answer

Reading the file and populating the associative array

You can use the configure method to ...READ MORE

Aug 2, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
1,200 views
0 votes
1 answer

What happens in a MapReduce job when you set the number of reducers to one?

If you set number of reducers as ...READ MORE

Jul 31, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
2,076 views
0 votes
1 answer

HDFS for customer information file

No, because HDFS is optimized for write-once, ...READ MORE

Jul 31, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
620 views
0 votes
1 answer

How does a client read a file from HDFS?

Let me explain it briefly. So, ...READ MORE

Jul 31, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
2,545 views
0 votes
1 answer

Mappers in Hadoop

The number of mappers is determined by ...READ MORE

Jul 31, 2018 in Big Data Hadoop by kurt_cobain
• 9,350 points
533 views
0 votes
1 answer

Backup and Checkpoint Node

Checkpoint Node keeps track of the latest ...READ MORE

Jul 31, 2018 in Big Data Hadoop by kurt_cobain
• 9,350 points
3,572 views
0 votes
1 answer

Checkpoint Node in Hadoop

Checkpoint Node keeps track of the latest ...READ MORE

Jul 31, 2018 in Big Data Hadoop by kurt_cobain
• 9,350 points
1,326 views
0 votes
1 answer

Block Scanner HDFS

Block scanner runs periodically on every DataNode ...READ MORE

Jul 31, 2018 in Big Data Hadoop by kurt_cobain
• 9,350 points
2,467 views
0 votes
1 answer

Passing keys and values to the reducers during a standard sort and shuffle phase of MapReduce

Let me explain the whole scenario. Reducer ...READ MORE

Jul 27, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
2,437 views
0 votes
1 answer

Hortonworks vs Cloudera Architecture Difference

Features | Cloudera | MapR | HortonWorks
License | Commercial | Commercial | Open-Source
Community | Yes | Yes | Yes
SQL | Impala | Hive, Drill, Shark | Stinger
MapReduce | Yes | Yes | Yes
Unique Features | Cloudera Manager | Direct Access NFS | 100% ...READ MORE

Jul 27, 2018 in Big Data Hadoop by Frankie
• 9,830 points
842 views
0 votes
1 answer

How to create a project for the first time in Hadoop?

If you want to learn Hadoop framework ...READ MORE

Jul 27, 2018 in Big Data Hadoop by Neha
• 6,300 points
1,174 views
0 votes
1 answer

When do reduce tasks start in Hadoop?

The reduce phase has 3 steps: shuffle, ...READ MORE

Jul 27, 2018 in Big Data Hadoop by Frankie
• 9,830 points
1,045 views
0 votes
1 answer

How does compression work in Hadoop?

It basically depends on the file type ...READ MORE

Jul 27, 2018 in Big Data Hadoop by Frankie
• 9,830 points
2,015 views
+1 vote
1 answer

Reading Performance in Hadoop Cluster

For speeding up reading performance of your ...READ MORE

Jul 25, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
844 views
+2 votes
1 answer

How to calculate the maximum salary, along with the employee's name, using MapReduce

Please try the below code and it ...READ MORE

Jul 25, 2018 in Big Data Hadoop by Neha
• 6,300 points
5,662 views
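The answer's own code is not shown in this listing, so the following is only a minimal sketch of the idea behind the maximum-salary question, written as a plain Python simulation rather than a Hadoop job. The `name,salary` input format, the constant key `"max"`, and the function names `map_record`, `reduce_max`, and `run_job` are all assumptions made for illustration.

```python
from collections import defaultdict


def map_record(line):
    """Map step: parse a 'name,salary' line and emit (key, (name, salary))."""
    name, salary = line.split(",")
    # Assumption: one constant key routes every record to the same reduce group.
    return ("max", (name.strip(), float(salary)))


def reduce_max(values):
    """Reduce step: keep the (name, salary) pair with the highest salary."""
    return max(values, key=lambda pair: pair[1])


def run_job(lines):
    """Simulate the shuffle: group mapped pairs by key, then reduce each group."""
    groups = defaultdict(list)
    for line in lines:
        key, value = map_record(line)
        groups[key].append(value)
    return {key: reduce_max(values) for key, values in groups.items()}


# Hypothetical sample records, not taken from the original question.
records = ["alice,52000", "bob,61000", "carol,58500"]
result = run_job(records)
print(result["max"])
```

Because every record shares one key, a single reduce call sees all salaries; in a real cluster that concentrates the work on one reducer, which is acceptable for a global maximum but would not scale for per-group aggregates.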
0 votes
1 answer

GROUP and COGROUP in Pig

Both GROUP and COGROUP operators are identical ...READ MORE

Jul 25, 2018 in Big Data Hadoop by shams
• 3,670 points
4,148 views
0 votes
1 answer

Use of MapReduce in Pig

Apache Pig programs are written in a ...READ MORE

Jul 25, 2018 in Big Data Hadoop by shams
• 3,670 points
2,264 views
0 votes
1 answer

Difference between a failed task attempt and a killed task attempt

Let me explain the difference between ...READ MORE

Jul 24, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
3,217 views
0 votes
1 answer

What metadata is stored on a DataNode when a block is written to it?

Let me explain step by step. Each ...READ MORE

Jul 23, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
3,130 views
0 votes
1 answer

Importing a portion of Relational Database to HDFS

Yes, there is a tool called Sqoop ...READ MORE

Jul 20, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
1,896 views
0 votes
1 answer

How can Hadoop discover missing or corrupt HDFS data?

HDFS supports fsck command to check for ...READ MORE

Jul 19, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
1,453 views
0 votes
1 answer

How does the FIFO Scheduler arbitrate the cluster resources for a job and its tasks?

Let me clear your doubt. So what ...READ MORE

Jul 19, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
1,021 views
0 votes
1 answer

Synchronizing Sqoop imported data in HDFS

Data can be synchronised using incremental parameter ...READ MORE

Jul 19, 2018 in Big Data Hadoop by kurt_cobain
• 9,350 points
1,232 views
0 votes
1 answer

Sqoop Metastore?

Sqoop metastore is a shared metadata repository ...READ MORE

Jul 19, 2018 in Big Data Hadoop by kurt_cobain
• 9,350 points
1,324 views
0 votes
1 answer

How to pause and resume a Hive job?

Practically speaking, it's difficult/impossible to pause and resume ...READ MORE

Jul 17, 2018 in Big Data Hadoop by Neha
• 6,300 points
1,357 views
0 votes
1 answer

Channel in Flume

A transient store that receives the events ...READ MORE

Jul 17, 2018 in Big Data Hadoop by Ashish
• 2,650 points
592 views
+1 vote
1 answer

When I try to execute any Hadoop command, an error pops up saying unable to find or load main class M

The error might be due to the ...READ MORE

Jul 17, 2018 in Big Data Hadoop by Frankie
• 9,830 points
2,045 views
0 votes
1 answer

How does HDFS Federation help HDFS scale horizontally?

Let me try to explain the ...READ MORE

Jul 12, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
1,379 views
0 votes
1 answer

How to avoid a “split-brain” scenario with NameNodes?

Okay, so let me tell you that ...READ MORE

Jul 11, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
4,499 views
0 votes
1 answer

How does the NameNode handle DataNode failures?

Let me explain the whole scenario. NameNode ...READ MORE

Jul 11, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
6,304 views
0 votes
1 answer

Hive Precedence Order

SET command in Hive. The command line --hiveconf ...READ MORE

Jul 9, 2018 in Big Data Hadoop by kurt_cobain
• 9,350 points
960 views
0 votes
1 answer

Bucketing vs Partitioning in Hive

Partitioning divides a large amount of data into ...READ MORE

Jul 9, 2018 in Big Data Hadoop by Data_Nerd
• 2,390 points
27,081 views
0 votes
1 answer

Are there any Limitations of Pig?

Limitations of Apache Pig are: As the ...READ MORE

Jul 6, 2018 in Big Data Hadoop by kurt_cobain
• 9,350 points
1,739 views
0 votes
1 answer

MapFile in Pig

MapFile is a class which serves file-based ...READ MORE

Jul 6, 2018 in Big Data Hadoop by Data_Nerd
• 2,390 points
615 views
0 votes
1 answer

Checkpointing in Hadoop

Checkpointing is the process of combining the ...READ MORE

Jul 3, 2018 in Big Data Hadoop by Data_Nerd
• 2,390 points
1,310 views
0 votes
1 answer

Hadoop Nodes for Multiple Disks

Datanodes can store blocks in multiple directories ...READ MORE

Jul 3, 2018 in Big Data Hadoop by kurt_cobain
• 9,350 points
1,973 views
0 votes
1 answer

Different partitioners in Cassandra

Murmur3Partitioner is the default partitioner. It is ...READ MORE

Jun 27, 2018 in Big Data Hadoop by kurt_cobain
• 9,350 points
657 views
0 votes
1 answer

Gossip Protocol features

Gossip Protocol in Cassandra is a peer-to-peer ...READ MORE

Jun 26, 2018 in Big Data Hadoop by Data_Nerd
• 2,390 points
665 views
0 votes
1 answer

How to analyze block placement on datanodes and rebalancing data across Hadoop nodes?

HDFS provides a tool for administrators i.e. ...READ MORE

Jun 21, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
992 views
0 votes
1 answer

Is there any way to set up Hadoop nodes (DataNodes/NameNodes) to use multiple volumes/disks?

Datanodes can store blocks in multiple directories ...READ MORE

Jun 20, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
969 views
0 votes
1 answer

NameNode Status Check

There's one more way of checking the ...READ MORE

Jun 18, 2018 in Big Data Hadoop by shams
• 3,670 points
2,329 views