Latest questions in Big Data Hadoop

0 votes
1 answer

How to create a project for the first time in Hadoop?

If you want to learn the Hadoop framework ...READ MORE

Jul 27, 2018 in Big Data Hadoop by Neha
• 6,300 points
972 views
0 votes
1 answer

When do reduce tasks start in Hadoop?

The reduce phase has 3 steps: shuffle, ...READ MORE

Jul 27, 2018 in Big Data Hadoop by Frankie
• 9,830 points
840 views
0 votes
1 answer

How does compression work in Hadoop?

It basically depends on the file type ...READ MORE

Jul 27, 2018 in Big Data Hadoop by Frankie
• 9,830 points
1,824 views
+1 vote
1 answer

Reading Performance in Hadoop Cluster

To speed up the read performance of your ...READ MORE

Jul 25, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
552 views
+2 votes
1 answer

How to calculate the maximum salary of an employee, along with the name, using the MapReduce technique

Please try the below code and it ...READ MORE

Jul 25, 2018 in Big Data Hadoop by Neha
• 6,300 points
5,332 views
+1 vote
0 answers

How to set up a Hadoop cluster on Mac in IntelliJ IDEA

I have installed Hadoop using brew and ...READ MORE

Jul 25, 2018 in Big Data Hadoop by Neha
• 6,300 points
941 views
0 votes
1 answer

GROUP and COGROUP in Pig

Both GROUP and COGROUP operators are identical ...READ MORE

Jul 25, 2018 in Big Data Hadoop by shams
• 3,670 points
3,756 views
0 votes
1 answer

Use of MapReduce in Pig

Apache Pig programs are written in a ...READ MORE

Jul 25, 2018 in Big Data Hadoop by shams
• 3,670 points
1,997 views
0 votes
1 answer

Difference between a failed task attempt and a killed task attempt

Let me explain the difference between ...READ MORE

Jul 24, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
2,888 views
0 votes
1 answer

What metadata is stored on a DataNode when a block is written to it?

Let me explain step by step. Each ...READ MORE

Jul 23, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
2,791 views
+1 vote
3 answers

When I tried to run the hdfs namenode -format command, it failed to run. Can anyone help me with this?

Hi Suriyaprakash, I guess the path for hadoop ...READ MORE

Jul 23, 2018 in Big Data Hadoop by avi_man_utd
11,574 views
0 votes
1 answer

Importing a portion of Relational Database to HDFS

Yes, there is a tool called Sqoop ...READ MORE

Jul 20, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
1,602 views
0 votes
1 answer

How can Hadoop discover missing or corrupt HDFS data?

HDFS supports fsck command to check for ...READ MORE

Jul 19, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
1,299 views
0 votes
1 answer

How does the FIFO Scheduler arbitrate the cluster resources for a job and its tasks?

Let me clear your doubt. So what ...READ MORE

Jul 19, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
853 views
0 votes
1 answer

Synchronizing Sqoop-imported data in HDFS

Data can be synchronised using the incremental parameter ...READ MORE

Jul 19, 2018 in Big Data Hadoop by kurt_cobain
• 9,390 points
1,017 views
0 votes
1 answer

Sqoop Metastore?

Sqoop metastore is a shared metadata repository ...READ MORE

Jul 19, 2018 in Big Data Hadoop by kurt_cobain
• 9,390 points
1,158 views
0 votes
1 answer

How to pause and resume a Hive job?

Practically speaking, it's difficult/impossible to pause and resume ...READ MORE

Jul 17, 2018 in Big Data Hadoop by Neha
• 6,300 points
1,157 views
0 votes
1 answer

Channel in Flume

A transient store that receives the events ...READ MORE

Jul 17, 2018 in Big Data Hadoop by Ashish
• 2,650 points
404 views
+1 vote
1 answer

When I try to execute any Hadoop command, an error pops up saying it is unable to find or load main class M

The error might be due to the ...READ MORE

Jul 17, 2018 in Big Data Hadoop by Frankie
• 9,830 points
1,665 views
0 votes
1 answer

How does HDFS Federation help HDFS scale horizontally?

Let me try to explain the ...READ MORE

Jul 12, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
1,073 views
0 votes
2 answers

How do I integrate kdb+ and Hadoop?

kdb+ could be installed on every Hadoop ...READ MORE

Aug 6, 2018 in Big Data Hadoop by Abhi
• 3,720 points
1,446 views
0 votes
1 answer

How to avoid a “split-brain” scenario with NameNodes?

Okay, so let me tell you that ...READ MORE

Jul 11, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
3,980 views
0 votes
1 answer

How does the NameNode handle DataNode failures?

Let me explain the whole scenario. NameNode ...READ MORE

Jul 11, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
6,054 views
0 votes
1 answer

Hive Precedence Order

SET Command in HIVE The command line –hiveconf ...READ MORE

Jul 9, 2018 in Big Data Hadoop by kurt_cobain
• 9,390 points
788 views
0 votes
1 answer

Bucketing vs Partitioning in Hive

Partitioning divides a large amount of data into ...READ MORE

Jul 9, 2018 in Big Data Hadoop by Data_Nerd
• 2,390 points
26,573 views
0 votes
1 answer

Are there any Limitations of Pig?

Limitations of Apache Pig are: As the ...READ MORE

Jul 6, 2018 in Big Data Hadoop by kurt_cobain
• 9,390 points
1,518 views
0 votes
1 answer

MapFile in Pig

MapFile is a class which serves file-based ...READ MORE

Jul 6, 2018 in Big Data Hadoop by Data_Nerd
• 2,390 points
491 views
0 votes
1 answer

Checkpointing in Hadoop

Checkpointing is the process of combining the ...READ MORE

Jul 3, 2018 in Big Data Hadoop by Data_Nerd
• 2,390 points
1,107 views
0 votes
1 answer

Hadoop Nodes for Multiple Disks

Datanodes can store blocks in multiple directories ...READ MORE

Jul 3, 2018 in Big Data Hadoop by kurt_cobain
• 9,390 points
1,750 views
0 votes
1 answer

Different partitioners in Cassandra

Murmur3Partitioner is the default partitioner. It is ...READ MORE

Jun 27, 2018 in Big Data Hadoop by kurt_cobain
• 9,390 points
437 views
0 votes
1 answer

Gossip Protocol features

Gossip Protocol in Cassandra is a peer-to-peer ...READ MORE

Jun 26, 2018 in Big Data Hadoop by Data_Nerd
• 2,390 points
430 views
0 votes
1 answer

How to analyze block placement on datanodes and rebalancing data across Hadoop nodes?

HDFS provides a tool for administrators i.e. ...READ MORE

Jun 21, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
856 views
0 votes
2 answers

What is the difference between the Smart Data Access of SAP HANA and SAP HANA Vora?

Hadoop: Used to store Big Data in ...READ MORE

Jun 21, 2018 in Big Data Hadoop by kurt_cobain
• 9,390 points
1,131 views
0 votes
1 answer

Is there any way to set up Hadoop nodes (data nodes/namenodes) to use multiple volumes/disks?

Datanodes can store blocks in multiple directories ...READ MORE

Jun 20, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
794 views
0 votes
1 answer

NameNode Status Check

There's one more way of checking the ...READ MORE

Jun 18, 2018 in Big Data Hadoop by shams
• 3,670 points
2,185 views
0 votes
1 answer

What is Speculative Execution?

If a node is executing a task ...READ MORE

Jun 18, 2018 in Big Data Hadoop by shams
• 3,670 points
370 views
0 votes
1 answer

Different Hadoop Modes

Hadoop can run in 3 different modes. 1. ...READ MORE

Jun 13, 2018 in Big Data Hadoop by shams
• 3,670 points
12,665 views
0 votes
2 answers

Hadoop fs -stat command

hadoop fs -stat is a Hadoop command used ...READ MORE

Oct 25, 2018 in Big Data Hadoop by Omkar
• 69,230 points
10,260 views
0 votes
1 answer

HDFS usage for small files

Well, the HDFS works more reliably with ...READ MORE

Jun 12, 2018 in Big Data Hadoop by BigData_is_my_ex
399 views
0 votes
1 answer

NameNode & DataNode memory

The NameNode only stores metadata which is ...READ MORE

Jun 12, 2018 in Big Data Hadoop by Data_Nerd
• 2,390 points
1,096 views
0 votes
1 answer

NAS and HDFS Comparison

NAS stands for Network-attached Storage and is ...READ MORE

Jun 11, 2018 in Big Data Hadoop by Data_Nerd
• 2,390 points
1,462 views
0 votes
1 answer

Replication and Calculations

Since there are 3 nodes, when we ...READ MORE

Jun 11, 2018 in Big Data Hadoop by kurt_cobain
• 9,390 points
682 views
0 votes
1 answer

JobTracker Task Scheduling

The TaskTracker sends out heartbeat messages ...READ MORE

Jun 5, 2018 in Big Data Hadoop by kurt_cobain
• 9,390 points
755 views
+5 votes
2 answers

Hive to HDFS transfer

Hive does not store data. Hive is ...READ MORE

Sep 20, 2018 in Big Data Hadoop by SM
3,691 views
0 votes
1 answer

InputSplit vs HDFS Block

By definition Block – Block is the continuous ...READ MORE

Jun 1, 2018 in Big Data Hadoop by kurt_cobain
• 9,390 points
4,183 views
0 votes
1 answer

Sqoop vs DistCp

The DistCp utility can be used to transfer ...READ MORE

May 30, 2018 in Big Data Hadoop by Data_Nerd
• 2,390 points
1,121 views
0 votes
2 answers

What is Shuffle Phase?

The process where the intermediate results from ...READ MORE

Mar 27, 2019 in Big Data Hadoop by Krishnan
• 140 points
1,301 views
0 votes
1 answer

NameNode without any data

There does not exist any NameNode without ...READ MORE

May 29, 2018 in Big Data Hadoop by kurt_cobain
• 9,390 points
1,158 views
0 votes
1 answer

What is new in Hadoop 3?

Here are a few changes in Hadoop 3: 1. ...READ MORE

May 28, 2018 in Big Data Hadoop by kurt_cobain
• 9,390 points
386 views
0 votes
1 answer

Out of Memory Error in Hadoop

See if you can share the screenshot ...READ MORE

May 22, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
1,657 views