Most answered questions in Big Data Hadoop

0 votes
1 answer

How to run Map Reduce program using Ubuntu terminal?

 I used the following steps to execute it ...READ MORE

Aug 7, 2018 in Big Data Hadoop by Neha
• 6,300 points
3,695 views
0 votes
1 answer

I get an error stating- Hadoop: «ERROR : JAVA_HOME is not set». How to resolve this?

Make sure that you have removed the ...READ MORE

Aug 7, 2018 in Big Data Hadoop by Frankie
• 9,830 points
5,295 views
0 votes
1 answer

How should I connect Hadoop in C#?

The Uri should be your Cluster Name, ...READ MORE

Aug 7, 2018 in Big Data Hadoop by Neha
• 6,300 points
1,678 views
0 votes
1 answer

Increasing HFile block size

If you increase the block size then ...READ MORE

Aug 6, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
721 views
0 votes
1 answer

Designing Schema in HBase

Just create a single table with two ...READ MORE

Aug 6, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
632 views
0 votes
1 answer

What are the security guidelines for Impala?

Security Guidelines for Impala: The following are the ...READ MORE

Aug 6, 2018 in Big Data Hadoop by kurt_cobain
• 9,390 points
545 views
0 votes
1 answer

Cloudera Managing and Compatibality

With more partners than any other Hadoop ...READ MORE

Aug 6, 2018 in Big Data Hadoop by kurt_cobain
• 9,390 points
391 views
0 votes
1 answer

What are the different ways of Installing Hadoop into our local machine?

Hadoop runs on Unix and on Windows. ...READ MORE

Aug 4, 2018 in Big Data Hadoop by Neha
• 6,300 points
4,734 views
0 votes
1 answer

Which is the most preferable language for Hadooop Map-Reduce programs?

MapReduce is a programming model to perform ...READ MORE

Aug 4, 2018 in Big Data Hadoop by Neha
• 6,300 points
3,326 views
0 votes
1 answer

Is there any way to make Apache Commons Math Library available to MapReducer job at runtime?

So, what you can do is when ...READ MORE

Aug 3, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
967 views
0 votes
1 answer

Reading the file and populating the associative array

You can use the configure method to ...READ MORE

Aug 2, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
953 views
0 votes
1 answer

What happens in a MapReduce job when you set the number of reducers to one?

If you set number of reducers as ...READ MORE

Jul 31, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
1,856 views
0 votes
1 answer

HDFS for customer information file

No, because HDFS is optimized for write-once, ...READ MORE

Jul 31, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
452 views
0 votes
1 answer

How a client reads a file from HDFS?

Let me explain you it briefly. So, ...READ MORE

Jul 31, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
2,366 views
0 votes
1 answer

Mappers in Hadoop

The number of mappers is determined by ...READ MORE

Jul 31, 2018 in Big Data Hadoop by kurt_cobain
• 9,390 points
348 views
0 votes
1 answer

Backup and Checkpoint Node

Checkpoint Node keeps track of the latest ...READ MORE

Jul 31, 2018 in Big Data Hadoop by kurt_cobain
• 9,390 points
2,615 views
0 votes
1 answer

Checkpoint Node in Hadoop

Checkpoint Node keeps track of the latest ...READ MORE

Jul 31, 2018 in Big Data Hadoop by kurt_cobain
• 9,390 points
1,196 views
0 votes
1 answer

Block Scanner HDFS

Block scanner runs periodically on every DataNode ...READ MORE

Jul 31, 2018 in Big Data Hadoop by kurt_cobain
• 9,390 points
2,237 views
0 votes
1 answer

Passing keys and values to the reducers during a standard sort and shuffle phase of MapReduce

Let me explain you the whole scenario Reducer ...READ MORE

Jul 27, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
2,184 views
0 votes
1 answer

Hortonworks vs Cloudera Architecture Difference

Features Cloudera MapR HortonWorks License Commercial Commercial Open-Source Community Yes Yes Yes SQL Impala Hive, Drill, Shark Stinger MapReduce Yes Yes Yes Unique Features Cloudera Manager Direct Access NFS 100% ...READ MORE

Jul 27, 2018 in Big Data Hadoop by Frankie
• 9,830 points
689 views
0 votes
1 answer

How to create a project for the first time in Hadoop.?

If you want to learn Hadoop framework ...READ MORE

Jul 27, 2018 in Big Data Hadoop by Neha
• 6,300 points
972 views
0 votes
1 answer

When do reduce tasks start in Hadoop?

The reduce phase has 3 steps: shuffle, ...READ MORE

Jul 27, 2018 in Big Data Hadoop by Frankie
• 9,830 points
843 views
0 votes
1 answer

How compression works in Hadoop?

It basically depends on the file type ...READ MORE

Jul 27, 2018 in Big Data Hadoop by Frankie
• 9,830 points
1,825 views
+1 vote
1 answer

Reading Performance in Hadoop Cluster

For speeding up reading performance of your ...READ MORE

Jul 25, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
553 views
+2 votes
1 answer

How to calculate Maximum salary of the employee with the name using the Map Reduce Technique

Please try the below code and it ...READ MORE

Jul 25, 2018 in Big Data Hadoop by Neha
• 6,300 points
5,337 views
0 votes
1 answer

GROUP and COGROUP in PIG

Both GROUP and COGROUP operators are identical ...READ MORE

Jul 25, 2018 in Big Data Hadoop by shams
• 3,670 points
3,756 views
0 votes
1 answer

Use of MapReduce in PIG

Apache Pig programs are written in a ...READ MORE

Jul 25, 2018 in Big Data Hadoop by shams
• 3,670 points
1,997 views
0 votes
1 answer

Difference between a failed task attempt and a killed task attempt

Let me explain you the difference between ...READ MORE

Jul 24, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
2,889 views
0 votes
1 answer

What metadata is stored on a DataNode when a block is written to it?

Let me explain you step by step.  Each ...READ MORE

Jul 23, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
2,791 views
0 votes
1 answer

Importing a portion of Relational Database to HDFS

Yes, there is a tool called Sqoop ...READ MORE

Jul 20, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
1,602 views
0 votes
1 answer

How can Hadoop discover missing or corrupt HDFS data?

HDFS supports fsck command to check for ...READ MORE

Jul 19, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
1,300 views
0 votes
1 answer

How the FIFO Scheduler arbitrates the cluster resources for a job and its tasks?

Let me clear your doubt. So what ...READ MORE

Jul 19, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
853 views
0 votes
1 answer

Synchronizing sqoop imported data in HDFS

Data can be synchronised using incremental parameter ...READ MORE

Jul 19, 2018 in Big Data Hadoop by kurt_cobain
• 9,390 points
1,018 views
0 votes
1 answer

Sqoop Metastore ?

Sqoop metastore is a shared metadata repository ...READ MORE

Jul 19, 2018 in Big Data Hadoop by kurt_cobain
• 9,390 points
1,159 views
0 votes
1 answer

How to pause and resume hive job?

Practically speaking, it's difficult/impossible to pause and resume ...READ MORE

Jul 17, 2018 in Big Data Hadoop by Neha
• 6,300 points
1,159 views
0 votes
1 answer

Channel in Flume

 A transient store that receives the events ...READ MORE

Jul 17, 2018 in Big Data Hadoop by Ashish
• 2,650 points
404 views
+1 vote
1 answer

When I try to execute any Hadoop commands error pops up saying unable to find or load main class M

The error might be due to the ...READ MORE

Jul 17, 2018 in Big Data Hadoop by Frankie
• 9,830 points
1,666 views
0 votes
1 answer

How does HDFS Federation help HDFS Scale horizontally?

Let me try to explain you the ...READ MORE

Jul 12, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
1,074 views
0 votes
1 answer

How to avoid a “split-brain” scenario with NameNodes?

Okay, so let me tell you that ...READ MORE

Jul 11, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
3,981 views
0 votes
1 answer

How Namenode handles data node failures?

Let me explain you the whole scenario. NameNode ...READ MORE

Jul 11, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
6,057 views
0 votes
1 answer

Hive Precedence Order

SET Command in HIVE The command line –hiveconf ...READ MORE

Jul 9, 2018 in Big Data Hadoop by kurt_cobain
• 9,390 points
788 views
0 votes
1 answer

Bucketing vs Partitioning in HIve

Partition divides large amount of data into ...READ MORE

Jul 9, 2018 in Big Data Hadoop by Data_Nerd
• 2,390 points
26,574 views
0 votes
1 answer

Are there any Limitations of Pig?

Limitations of the Apache Pig are: As the ...READ MORE

Jul 6, 2018 in Big Data Hadoop by kurt_cobain
• 9,390 points
1,519 views
0 votes
1 answer

MapFile in Pig

MapFile is a class which serves file-based ...READ MORE

Jul 6, 2018 in Big Data Hadoop by Data_Nerd
• 2,390 points
491 views
0 votes
1 answer

Checkpointing in Hadoop

Checkpointing is the process of combining the ...READ MORE

Jul 3, 2018 in Big Data Hadoop by Data_Nerd
• 2,390 points
1,108 views
0 votes
1 answer

Hadoop Nodes for Multiple Disks

Datanodes can store blocks in multiple directories ...READ MORE

Jul 3, 2018 in Big Data Hadoop by kurt_cobain
• 9,390 points
1,751 views
0 votes
1 answer

Different partitoners in Cassandra

Murmur3Partitioner is the default partitioner. It is ...READ MORE

Jun 27, 2018 in Big Data Hadoop by kurt_cobain
• 9,390 points
437 views
0 votes
1 answer

Gossip Protocol features.

Gossip Protocol in Cassandra is a peer-to-peer ...READ MORE

Jun 26, 2018 in Big Data Hadoop by Data_Nerd
• 2,390 points
430 views
0 votes
1 answer

How to analyze block placement on datanodes and rebalancing data across Hadoop nodes?

HDFS provides a tool for administrators i.e. ...READ MORE

Jun 21, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
858 views
0 votes
1 answer

Is there any way to setup Hadoop nodes (data nodes/namenodes) to use multiple volumes/disks?

Datanodes can store blocks in multiple directories ...READ MORE

Jun 20, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
794 views