Most voted questions in Big Data Hadoop

0 votes
1 answer

Cloudera Managing and Compatibality

With more partners than any other Hadoop ...READ MORE

Aug 6, 2018 in Big Data Hadoop by kurt_cobain
• 9,390 points
365 views
0 votes
1 answer

What are the different ways of Installing Hadoop into our local machine?

Hadoop runs on Unix and on Windows. ...READ MORE

Aug 4, 2018 in Big Data Hadoop by Neha
• 6,300 points
4,622 views
0 votes
1 answer

Which is the most preferable language for Hadooop Map-Reduce programs?

MapReduce is a programming model to perform ...READ MORE

Aug 4, 2018 in Big Data Hadoop by Neha
• 6,300 points
3,244 views
0 votes
1 answer

Is there any way to make Apache Commons Math Library available to MapReducer job at runtime?

So, what you can do is when ...READ MORE

Aug 3, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
928 views
0 votes
1 answer

Reading the file and populating the associative array

You can use the configure method to ...READ MORE

Aug 2, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
935 views
0 votes
1 answer

What happens in a MapReduce job when you set the number of reducers to one?

If you set number of reducers as ...READ MORE

Jul 31, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
1,808 views
0 votes
1 answer

HDFS for customer information file

No, because HDFS is optimized for write-once, ...READ MORE

Jul 31, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
427 views
0 votes
1 answer

How a client reads a file from HDFS?

Let me explain you it briefly. So, ...READ MORE

Jul 31, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
2,330 views
0 votes
1 answer

Mappers in Hadoop

The number of mappers is determined by ...READ MORE

Jul 31, 2018 in Big Data Hadoop by kurt_cobain
• 9,390 points
300 views
0 votes
1 answer

Backup and Checkpoint Node

Checkpoint Node keeps track of the latest ...READ MORE

Jul 31, 2018 in Big Data Hadoop by kurt_cobain
• 9,390 points
2,379 views
0 votes
1 answer

Checkpoint Node in Hadoop

Checkpoint Node keeps track of the latest ...READ MORE

Jul 31, 2018 in Big Data Hadoop by kurt_cobain
• 9,390 points
1,162 views
0 votes
1 answer

Block Scanner HDFS

Block scanner runs periodically on every DataNode ...READ MORE

Jul 31, 2018 in Big Data Hadoop by kurt_cobain
• 9,390 points
2,164 views
0 votes
1 answer

Passing keys and values to the reducers during a standard sort and shuffle phase of MapReduce

Let me explain you the whole scenario Reducer ...READ MORE

Jul 27, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
2,082 views
0 votes
1 answer

Hortonworks vs Cloudera Architecture Difference

Features Cloudera MapR HortonWorks License Commercial Commercial Open-Source Community Yes Yes Yes SQL Impala Hive, Drill, Shark Stinger MapReduce Yes Yes Yes Unique Features Cloudera Manager Direct Access NFS 100% ...READ MORE

Jul 27, 2018 in Big Data Hadoop by Frankie
• 9,830 points
660 views
0 votes
1 answer

How to create a project for the first time in Hadoop.?

If you want to learn Hadoop framework ...READ MORE

Jul 27, 2018 in Big Data Hadoop by Neha
• 6,300 points
938 views
0 votes
1 answer

When do reduce tasks start in Hadoop?

The reduce phase has 3 steps: shuffle, ...READ MORE

Jul 27, 2018 in Big Data Hadoop by Frankie
• 9,830 points
792 views
0 votes
1 answer

How compression works in Hadoop?

It basically depends on the file type ...READ MORE

Jul 27, 2018 in Big Data Hadoop by Frankie
• 9,830 points
1,764 views
0 votes
1 answer

GROUP and COGROUP in PIG

Both GROUP and COGROUP operators are identical ...READ MORE

Jul 25, 2018 in Big Data Hadoop by shams
• 3,670 points
3,682 views
0 votes
1 answer

Use of MapReduce in PIG

Apache Pig programs are written in a ...READ MORE

Jul 25, 2018 in Big Data Hadoop by shams
• 3,670 points
1,942 views
0 votes
1 answer

Difference between a failed task attempt and a killed task attempt

Let me explain you the difference between ...READ MORE

Jul 24, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
2,836 views
0 votes
1 answer

What metadata is stored on a DataNode when a block is written to it?

Let me explain you step by step.  Each ...READ MORE

Jul 23, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
2,725 views
0 votes
1 answer

Importing a portion of Relational Database to HDFS

Yes, there is a tool called Sqoop ...READ MORE

Jul 20, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
1,554 views
0 votes
1 answer

How can Hadoop discover missing or corrupt HDFS data?

HDFS supports fsck command to check for ...READ MORE

Jul 19, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
1,261 views
0 votes
1 answer

How the FIFO Scheduler arbitrates the cluster resources for a job and its tasks?

Let me clear your doubt. So what ...READ MORE

Jul 19, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
831 views
0 votes
1 answer

Synchronizing sqoop imported data in HDFS

Data can be synchronised using incremental parameter ...READ MORE

Jul 19, 2018 in Big Data Hadoop by kurt_cobain
• 9,390 points
974 views
0 votes
1 answer

Sqoop Metastore ?

Sqoop metastore is a shared metadata repository ...READ MORE

Jul 19, 2018 in Big Data Hadoop by kurt_cobain
• 9,390 points
1,102 views
0 votes
1 answer

How to pause and resume hive job?

Practically speaking, it's difficult/impossible to pause and resume ...READ MORE

Jul 17, 2018 in Big Data Hadoop by Neha
• 6,300 points
1,128 views
0 votes
1 answer

Channel in Flume

 A transient store that receives the events ...READ MORE

Jul 17, 2018 in Big Data Hadoop by Ashish
• 2,650 points
382 views
0 votes
1 answer

How does HDFS Federation help HDFS Scale horizontally?

Let me try to explain you the ...READ MORE

Jul 12, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
1,028 views
0 votes
2 answers

How do I integrate Kdb+ and hadoop?

 kdb+ could be installed on every Hadoop ...READ MORE

Aug 6, 2018 in Big Data Hadoop by Abhi
• 3,720 points
1,398 views
0 votes
1 answer

How to avoid a “split-brain” scenario with NameNodes?

Okay, so let me tell you that ...READ MORE

Jul 11, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
3,869 views
0 votes
1 answer

How Namenode handles data node failures?

Let me explain you the whole scenario. NameNode ...READ MORE

Jul 11, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
5,982 views
0 votes
1 answer

Hive Precedence Order

SET Command in HIVE The command line –hiveconf ...READ MORE

Jul 9, 2018 in Big Data Hadoop by kurt_cobain
• 9,390 points
769 views
0 votes
1 answer

Bucketing vs Partitioning in HIve

Partition divides large amount of data into ...READ MORE

Jul 9, 2018 in Big Data Hadoop by Data_Nerd
• 2,390 points
26,487 views
0 votes
1 answer

Are there any Limitations of Pig?

Limitations of the Apache Pig are: As the ...READ MORE

Jul 6, 2018 in Big Data Hadoop by kurt_cobain
• 9,390 points
1,440 views
0 votes
1 answer

MapFile in Pig

MapFile is a class which serves file-based ...READ MORE

Jul 6, 2018 in Big Data Hadoop by Data_Nerd
• 2,390 points
454 views
0 votes
1 answer

Checkpointing in Hadoop

Checkpointing is the process of combining the ...READ MORE

Jul 3, 2018 in Big Data Hadoop by Data_Nerd
• 2,390 points
1,056 views
0 votes
1 answer

Hadoop Nodes for Multiple Disks

Datanodes can store blocks in multiple directories ...READ MORE

Jul 3, 2018 in Big Data Hadoop by kurt_cobain
• 9,390 points
1,716 views
0 votes
1 answer

Different partitoners in Cassandra

Murmur3Partitioner is the default partitioner. It is ...READ MORE

Jun 27, 2018 in Big Data Hadoop by kurt_cobain
• 9,390 points
411 views
0 votes
1 answer

Gossip Protocol features.

Gossip Protocol in Cassandra is a peer-to-peer ...READ MORE

Jun 26, 2018 in Big Data Hadoop by Data_Nerd
• 2,390 points
411 views
0 votes
1 answer

How to analyze block placement on datanodes and rebalancing data across Hadoop nodes?

HDFS provides a tool for administrators i.e. ...READ MORE

Jun 21, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
824 views
0 votes
2 answers

What is the difference between the Smart Data Access of SAP HANA and SAP HANA Vora?

Hadoop: Used to store Big Data in ...READ MORE

Jun 21, 2018 in Big Data Hadoop by kurt_cobain
• 9,390 points
1,099 views
0 votes
1 answer

Is there any way to setup Hadoop nodes (data nodes/namenodes) to use multiple volumes/disks?

Datanodes can store blocks in multiple directories ...READ MORE

Jun 20, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
763 views
0 votes
1 answer

NameNode Status Check

There's one more way of checking the ...READ MORE

Jun 18, 2018 in Big Data Hadoop by shams
• 3,670 points
2,157 views
0 votes
1 answer

What is Speculative Execution?

If a node is executing a task ...READ MORE

Jun 18, 2018 in Big Data Hadoop by shams
• 3,670 points
351 views
0 votes
1 answer

Different Hadoop Modes

Hadoop can run in 3 different modes. 1. ...READ MORE

Jun 13, 2018 in Big Data Hadoop by shams
• 3,670 points
12,492 views
0 votes
2 answers

Hadoop fs -stat command

hadoop fs -stat is as hadoop command used ...READ MORE

Oct 25, 2018 in Big Data Hadoop by Omkar
• 69,210 points
10,040 views
0 votes
1 answer

HDFS usage for small files

Well, the HDFS works more reliably with ...READ MORE

Jun 12, 2018 in Big Data Hadoop by BigData_is_my_ex
380 views
0 votes
1 answer

NameNode & DataNode memory

Name Node only store metadata which is ...READ MORE

Jun 12, 2018 in Big Data Hadoop by Data_Nerd
• 2,390 points
1,052 views
0 votes
1 answer

NAS and HDFS Comparison

NAS stands for Network-attached Storage and is ...READ MORE

Jun 11, 2018 in Big Data Hadoop by Data_Nerd
• 2,390 points
1,442 views