Trending questions in Big Data Hadoop

0 votes
1 answer

How to programmatically access a Hadoop cluster where Kerberos is enabled?

Okay, here's the code snippet to work in the ...READ MORE

Mar 27, 2018 in Big Data Hadoop by coldcode
• 2,080 points
6,993 views
0 votes
2 answers

Which of these will vanish: Flink vs Spark?

At first glance, Flink and Spark would ...READ MORE

Aug 13, 2018 in Big Data Hadoop by kurt_cobain
• 9,390 points
1,009 views
0 votes
1 answer

InputSplit vs HDFS Block

By definition Block – Block is the continuous ...READ MORE

Jun 1, 2018 in Big Data Hadoop by kurt_cobain
• 9,390 points
4,132 views
0 votes
1 answer

What are the options for Hadoop on Scala?

Definitely check out Scalding. Speaking as a user ...READ MORE

Aug 22, 2018 in Big Data Hadoop by Frankie
• 9,830 points
438 views
0 votes
1 answer

Storing collection of images in HDFS

Using Hadoop Sequence Files, you can store ...READ MORE

Aug 9, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
955 views
0 votes
1 answer

How will the Fair Scheduler handle more than one Job?

In your case when job B gets ...READ MORE

Aug 16, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
562 views
+1 vote
1 answer

Hadoop home path?

You can manually input the Hadoop home ...READ MORE

Mar 26, 2018 in Big Data Hadoop by Ashish
• 2,650 points
6,682 views
0 votes
1 answer

Determining HDFS space in a cluster?

There are two ways to determine available ...READ MORE

Aug 16, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
540 views
0 votes
1 answer

Can I have a list of property files used in the Hadoop framework?

Here is a complete list of configuration ...READ MORE

Aug 14, 2018 in Big Data Hadoop by Frankie
• 9,830 points
609 views
0 votes
1 answer

Checkpoint Node in Hadoop

Checkpoint Node keeps track of the latest ...READ MORE

Jul 31, 2018 in Big Data Hadoop by kurt_cobain
• 9,390 points
1,162 views
0 votes
1 answer

How to extract Cloudera Hadoop VM archive?

If you are on Windows, you need to ...READ MORE

Aug 9, 2018 in Big Data Hadoop by Frankie
• 9,830 points
770 views
0 votes
2 answers

How do I integrate kdb+ and Hadoop?

kdb+ could be installed on every Hadoop ...READ MORE

Aug 6, 2018 in Big Data Hadoop by Abhi
• 3,720 points
1,398 views
0 votes
1 answer

How do I include all the Hadoop dependencies using Maven?

This is a dependency mismatch error. I ...READ MORE

Apr 10, 2018 in Big Data Hadoop by Shubham
• 13,490 points
5,958 views
0 votes
1 answer

Importing a portion of Relational Database to HDFS

Yes, there is a tool called Sqoop ...READ MORE

Jul 20, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
1,554 views
0 votes
1 answer

What is the command to navigate in HDFS?

First of all, there is no command ...READ MORE

Apr 27, 2018 in Big Data Hadoop by Shubham
• 13,490 points
5,181 views
0 votes
1 answer

How does Hadoop distribute block writes into HDFS?

So, what happens is the slave node ...READ MORE

Aug 16, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
376 views
+1 vote
2 answers

Failed to restart Hadoop NameNode using Cloudera QuickStart

You can use Cloudera Manager to manage ...READ MORE

Mar 19, 2018 in Big Data Hadoop by kurt_cobain
• 9,390 points

edited Jun 9, 2020 by MD
3,801 views
0 votes
1 answer

Determining number of reduces?

The number of reduces is set by the ...READ MORE

Aug 16, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
338 views
0 votes
1 answer

Is there any way to make the Apache Commons Math library available to a MapReduce job at runtime?

So, what you can do is when ...READ MORE

Aug 3, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
928 views
0 votes
1 answer

Example or Use Case of Big Data

Let me answer your questions one by ...READ MORE

Aug 9, 2018 in Big Data Hadoop by Frankie
• 9,830 points
671 views
0 votes
1 answer

Reading the file and populating the associative array

You can use the configure method to ...READ MORE

Aug 2, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
935 views
+1 vote
1 answer

When I try to execute any Hadoop command, an error pops up saying unable to find or load main class M

The error might be due to the ...READ MORE

Jul 17, 2018 in Big Data Hadoop by Frankie
• 9,830 points
1,576 views
0 votes
1 answer

Operation to simulate a Hadoop Production Cluster

In this case, what you can do ...READ MORE

Aug 9, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
615 views
0 votes
1 answer

Increasing HFile block size

If you increase the block size then ...READ MORE

Aug 6, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
686 views
0 votes
2 answers

How is Hadoop/Spark used for building large analytics reports?

The best possible framework for this task ...READ MORE

Aug 7, 2018 in Big Data Hadoop by kurt_cobain
• 9,390 points
575 views
0 votes
1 answer

What is the difference between a Big Data Warehouse and a traditional Data Warehouse?

Hadoop is similar in architecture to MPP data ...READ MORE

Aug 10, 2018 in Big Data Hadoop by Frankie
• 9,830 points
497 views
0 votes
1 answer

How to print the content of a file present in HDFS to the console?

Yes, you can use the hdfs dfs command ...READ MORE

Apr 19, 2018 in Big Data Hadoop by Shubham
• 13,490 points
5,329 views
0 votes
1 answer

Designing Schema in HBase

Just create a single table with two ...READ MORE

Aug 6, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
605 views
0 votes
1 answer

When does a reduce class also serve as a combiner?

It will act as a combiner when ...READ MORE

Aug 9, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
503 views
0 votes
1 answer

How to assess and compare Hadoop for Business Intelligence?

Hadoop is a great tool to be ...READ MORE

Aug 8, 2018 in Big Data Hadoop by Frankie
• 9,830 points
414 views
0 votes
1 answer

What's the difference between Hadoop and NoSQL?

Let's start with the definitions: Hadoop is an ...READ MORE

Apr 2, 2018 in Big Data Hadoop by kurt_cobain
• 9,390 points
5,925 views
0 votes
1 answer

Where can I find hadoop-streaming.jar JAR file?

You will find the streaming jar here: ...READ MORE

Mar 21, 2018 in Big Data Hadoop by coldcode
• 2,080 points
6,448 views
0 votes
1 answer

How to create a project for the first time in Hadoop?

If you want to learn the Hadoop framework ...READ MORE

Jul 27, 2018 in Big Data Hadoop by Neha
• 6,300 points
938 views
0 votes
1 answer

How can Hadoop discover missing or corrupt HDFS data?

HDFS supports the fsck command to check for ...READ MORE

Jul 19, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
1,260 views
0 votes
1 answer

What are the security guidelines for Impala?

Security Guidelines for Impala: The following are the ...READ MORE

Aug 6, 2018 in Big Data Hadoop by kurt_cobain
• 9,390 points
482 views
0 votes
1 answer

What is hadoop.tmp.dir?

hadoop.tmp.dir is used as the base for temporary ...READ MORE

Mar 26, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
6,185 views
0 votes
1 answer

What does the NameNode store, and where does it store it?

NameNode records the metadata of all the ...READ MORE

Apr 15, 2018 in Big Data Hadoop by Shubham
• 13,490 points
5,282 views
+1 vote
0 answers

How to set up a Hadoop cluster on Mac in IntelliJ IDEA

I have installed Hadoop using brew and ...READ MORE

Jul 25, 2018 in Big Data Hadoop by Neha
• 6,300 points
918 views
0 votes
1 answer

Cloudera Managing and Compatibility

With more partners than any other Hadoop ...READ MORE

Aug 6, 2018 in Big Data Hadoop by kurt_cobain
• 9,390 points
365 views
0 votes
1 answer

When do reduce tasks start in Hadoop?

The reduce phase has 3 steps: shuffle, ...READ MORE

Jul 27, 2018 in Big Data Hadoop by Frankie
• 9,830 points
791 views
0 votes
1 answer

Sqoop Metastore?

Sqoop metastore is a shared metadata repository ...READ MORE

Jul 19, 2018 in Big Data Hadoop by kurt_cobain
• 9,390 points
1,102 views
0 votes
1 answer

How to pause and resume a Hive job?

Practically speaking, it's difficult/impossible to pause and resume ...READ MORE

Jul 17, 2018 in Big Data Hadoop by Neha
• 6,300 points
1,127 views
0 votes
1 answer

Hadoop Nodes for Multiple Disks

Datanodes can store blocks in multiple directories ...READ MORE

Jul 3, 2018 in Big Data Hadoop by kurt_cobain
• 9,390 points
1,716 views
0 votes
1 answer

Hortonworks vs Cloudera Architecture Difference

Features | Cloudera | MapR | HortonWorks
License | Commercial | Commercial | Open-Source
Community | Yes | Yes | Yes
SQL | Impala | Hive, Drill, Shark | Stinger
MapReduce | Yes | Yes | Yes
Unique Features | Cloudera Manager | Direct Access NFS | 100% ...READ MORE

Jul 27, 2018 in Big Data Hadoop by Frankie
• 9,830 points
660 views
0 votes
1 answer

Synchronizing Sqoop-imported data in HDFS

Data can be synchronised using the incremental parameter ...READ MORE

Jul 19, 2018 in Big Data Hadoop by kurt_cobain
• 9,390 points
974 views
0 votes
1 answer

HDFS for customer information file

No, because HDFS is optimized for write-once, ...READ MORE

Jul 31, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
427 views
0 votes
1 answer

Are there any Limitations of Pig?

Limitations of Apache Pig are: As the ...READ MORE

Jul 6, 2018 in Big Data Hadoop by kurt_cobain
• 9,390 points
1,438 views
0 votes
1 answer

When do the reduce tasks start in Hadoop?

Let me explain the whole scenario to you. ...READ MORE

May 22, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
3,345 views
0 votes
1 answer

How does the FIFO Scheduler arbitrate the cluster resources for a job and its tasks?

Let me clear your doubt. So what ...READ MORE

Jul 19, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
831 views
0 votes
1 answer

Mappers in Hadoop

The number of mappers is determined by ...READ MORE

Jul 31, 2018 in Big Data Hadoop by kurt_cobain
• 9,390 points
300 views