Most voted questions in Big Data Hadoop

0 votes
1 answer

Cloudera Managing and Compatibality

With more partners than any other Hadoop ...READ MORE

Aug 6, 2018 in Big Data Hadoop by kurt_cobain
• 9,390 points
375 views
0 votes
1 answer

What are the different ways of Installing Hadoop into our local machine?

Hadoop runs on Unix and on Windows. ...READ MORE

Aug 4, 2018 in Big Data Hadoop by Neha
• 6,300 points
4,651 views
0 votes
1 answer

Which is the most preferable language for Hadooop Map-Reduce programs?

MapReduce is a programming model to perform ...READ MORE

Aug 4, 2018 in Big Data Hadoop by Neha
• 6,300 points
3,288 views
0 votes
1 answer

Is there any way to make Apache Commons Math Library available to MapReducer job at runtime?

So, what you can do is when ...READ MORE

Aug 3, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
944 views
0 votes
1 answer

Reading the file and populating the associative array

You can use the configure method to ...READ MORE

Aug 2, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
945 views
0 votes
1 answer

What happens in a MapReduce job when you set the number of reducers to one?

If you set number of reducers as ...READ MORE

Jul 31, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
1,828 views
0 votes
1 answer

HDFS for customer information file

No, because HDFS is optimized for write-once, ...READ MORE

Jul 31, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
436 views
0 votes
1 answer

How a client reads a file from HDFS?

Let me explain you it briefly. So, ...READ MORE

Jul 31, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
2,347 views
0 votes
1 answer

Mappers in Hadoop

The number of mappers is determined by ...READ MORE

Jul 31, 2018 in Big Data Hadoop by kurt_cobain
• 9,390 points
319 views
0 votes
1 answer

Backup and Checkpoint Node

Checkpoint Node keeps track of the latest ...READ MORE

Jul 31, 2018 in Big Data Hadoop by kurt_cobain
• 9,390 points
2,450 views
0 votes
1 answer

Checkpoint Node in Hadoop

Checkpoint Node keeps track of the latest ...READ MORE

Jul 31, 2018 in Big Data Hadoop by kurt_cobain
• 9,390 points
1,170 views
0 votes
1 answer

Block Scanner HDFS

Block scanner runs periodically on every DataNode ...READ MORE

Jul 31, 2018 in Big Data Hadoop by kurt_cobain
• 9,390 points
2,189 views
0 votes
1 answer

Passing keys and values to the reducers during a standard sort and shuffle phase of MapReduce

Let me explain you the whole scenario Reducer ...READ MORE

Jul 27, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
2,127 views
0 votes
1 answer

Hortonworks vs Cloudera Architecture Difference

Features Cloudera MapR HortonWorks License Commercial Commercial Open-Source Community Yes Yes Yes SQL Impala Hive, Drill, Shark Stinger MapReduce Yes Yes Yes Unique Features Cloudera Manager Direct Access NFS 100% ...READ MORE

Jul 27, 2018 in Big Data Hadoop by Frankie
• 9,830 points
672 views
0 votes
1 answer

How to create a project for the first time in Hadoop.?

If you want to learn Hadoop framework ...READ MORE

Jul 27, 2018 in Big Data Hadoop by Neha
• 6,300 points
950 views
0 votes
1 answer

When do reduce tasks start in Hadoop?

The reduce phase has 3 steps: shuffle, ...READ MORE

Jul 27, 2018 in Big Data Hadoop by Frankie
• 9,830 points
812 views
0 votes
1 answer

How compression works in Hadoop?

It basically depends on the file type ...READ MORE

Jul 27, 2018 in Big Data Hadoop by Frankie
• 9,830 points
1,788 views
0 votes
1 answer

GROUP and COGROUP in PIG

Both GROUP and COGROUP operators are identical ...READ MORE

Jul 25, 2018 in Big Data Hadoop by shams
• 3,670 points
3,710 views
0 votes
1 answer

Use of MapReduce in PIG

Apache Pig programs are written in a ...READ MORE

Jul 25, 2018 in Big Data Hadoop by shams
• 3,670 points
1,965 views
0 votes
1 answer

Difference between a failed task attempt and a killed task attempt

Let me explain you the difference between ...READ MORE

Jul 24, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
2,857 views
0 votes
1 answer

What metadata is stored on a DataNode when a block is written to it?

Let me explain you step by step.  Each ...READ MORE

Jul 23, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
2,748 views
0 votes
1 answer

Importing a portion of Relational Database to HDFS

Yes, there is a tool called Sqoop ...READ MORE

Jul 20, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
1,575 views
0 votes
1 answer

How can Hadoop discover missing or corrupt HDFS data?

HDFS supports fsck command to check for ...READ MORE

Jul 19, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
1,274 views
0 votes
1 answer

How the FIFO Scheduler arbitrates the cluster resources for a job and its tasks?

Let me clear your doubt. So what ...READ MORE

Jul 19, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
837 views
0 votes
1 answer

Synchronizing sqoop imported data in HDFS

Data can be synchronised using incremental parameter ...READ MORE

Jul 19, 2018 in Big Data Hadoop by kurt_cobain
• 9,390 points
990 views
0 votes
1 answer

Sqoop Metastore ?

Sqoop metastore is a shared metadata repository ...READ MORE

Jul 19, 2018 in Big Data Hadoop by kurt_cobain
• 9,390 points
1,120 views
0 votes
1 answer

How to pause and resume hive job?

Practically speaking, it's difficult/impossible to pause and resume ...READ MORE

Jul 17, 2018 in Big Data Hadoop by Neha
• 6,300 points
1,135 views
0 votes
1 answer

Channel in Flume

 A transient store that receives the events ...READ MORE

Jul 17, 2018 in Big Data Hadoop by Ashish
• 2,650 points
388 views
0 votes
1 answer

How does HDFS Federation help HDFS Scale horizontally?

Let me try to explain you the ...READ MORE

Jul 12, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
1,049 views
0 votes
2 answers

How do I integrate Kdb+ and hadoop?

 kdb+ could be installed on every Hadoop ...READ MORE

Aug 6, 2018 in Big Data Hadoop by Abhi
• 3,720 points
1,415 views
0 votes
1 answer

How to avoid a “split-brain” scenario with NameNodes?

Okay, so let me tell you that ...READ MORE

Jul 11, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
3,912 views
0 votes
1 answer

How Namenode handles data node failures?

Let me explain you the whole scenario. NameNode ...READ MORE

Jul 11, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
6,009 views
0 votes
1 answer

Hive Precedence Order

SET Command in HIVE The command line –hiveconf ...READ MORE

Jul 9, 2018 in Big Data Hadoop by kurt_cobain
• 9,390 points
772 views
0 votes
1 answer

Bucketing vs Partitioning in HIve

Partition divides large amount of data into ...READ MORE

Jul 9, 2018 in Big Data Hadoop by Data_Nerd
• 2,390 points
26,532 views
0 votes
1 answer

Are there any Limitations of Pig?

Limitations of the Apache Pig are: As the ...READ MORE

Jul 6, 2018 in Big Data Hadoop by kurt_cobain
• 9,390 points
1,473 views
0 votes
1 answer

MapFile in Pig

MapFile is a class which serves file-based ...READ MORE

Jul 6, 2018 in Big Data Hadoop by Data_Nerd
• 2,390 points
467 views
0 votes
1 answer

Checkpointing in Hadoop

Checkpointing is the process of combining the ...READ MORE

Jul 3, 2018 in Big Data Hadoop by Data_Nerd
• 2,390 points
1,075 views
0 votes
1 answer

Hadoop Nodes for Multiple Disks

Datanodes can store blocks in multiple directories ...READ MORE

Jul 3, 2018 in Big Data Hadoop by kurt_cobain
• 9,390 points
1,732 views
0 votes
1 answer

Different partitoners in Cassandra

Murmur3Partitioner is the default partitioner. It is ...READ MORE

Jun 27, 2018 in Big Data Hadoop by kurt_cobain
• 9,390 points
424 views
0 votes
1 answer

Gossip Protocol features.

Gossip Protocol in Cassandra is a peer-to-peer ...READ MORE

Jun 26, 2018 in Big Data Hadoop by Data_Nerd
• 2,390 points
416 views
0 votes
1 answer

How to analyze block placement on datanodes and rebalancing data across Hadoop nodes?

HDFS provides a tool for administrators i.e. ...READ MORE

Jun 21, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
833 views
0 votes
2 answers

What is the difference between the Smart Data Access of SAP HANA and SAP HANA Vora?

Hadoop: Used to store Big Data in ...READ MORE

Jun 21, 2018 in Big Data Hadoop by kurt_cobain
• 9,390 points
1,107 views
0 votes
1 answer

Is there any way to setup Hadoop nodes (data nodes/namenodes) to use multiple volumes/disks?

Datanodes can store blocks in multiple directories ...READ MORE

Jun 20, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
775 views
0 votes
1 answer

NameNode Status Check

There's one more way of checking the ...READ MORE

Jun 18, 2018 in Big Data Hadoop by shams
• 3,670 points
2,170 views
0 votes
1 answer

What is Speculative Execution?

If a node is executing a task ...READ MORE

Jun 18, 2018 in Big Data Hadoop by shams
• 3,670 points
358 views
0 votes
1 answer

Different Hadoop Modes

Hadoop can run in 3 different modes. 1. ...READ MORE

Jun 13, 2018 in Big Data Hadoop by shams
• 3,670 points
12,561 views
0 votes
2 answers

Hadoop fs -stat command

hadoop fs -stat is as hadoop command used ...READ MORE

Oct 25, 2018 in Big Data Hadoop by Omkar
• 69,210 points
10,154 views
0 votes
1 answer

HDFS usage for small files

Well, the HDFS works more reliably with ...READ MORE

Jun 12, 2018 in Big Data Hadoop by BigData_is_my_ex
385 views
0 votes
1 answer

NameNode & DataNode memory

Name Node only store metadata which is ...READ MORE

Jun 12, 2018 in Big Data Hadoop by Data_Nerd
• 2,390 points
1,072 views
0 votes
1 answer

NAS and HDFS Comparison

NAS stands for Network-attached Storage and is ...READ MORE

Jun 11, 2018 in Big Data Hadoop by Data_Nerd
• 2,390 points
1,448 views