Trending questions in Big Data Hadoop

0 votes
1 answer

How to programmatically access a Hadoop cluster where Kerberos is enabled?

Okay, here's the code snippet to work in the ...

Mar 27, 2018 in Big Data Hadoop by coldcode
• 2,080 points
6,983 views
0 votes
2 answers

Which of these will vanish: Flink vs Spark?

At first glance, Flink and Spark would ...

Aug 13, 2018 in Big Data Hadoop by kurt_cobain
• 9,390 points
999 views
0 votes
1 answer

InputSplit vs HDFS Block

By definition Block – Block is the continuous ...

Jun 1, 2018 in Big Data Hadoop by kurt_cobain
• 9,390 points
4,124 views
0 votes
1 answer

What are the options for Hadoop on Scala?

Definitely check out Scalding. Speaking as a user ...

Aug 22, 2018 in Big Data Hadoop by Frankie
• 9,830 points
431 views
0 votes
1 answer

Storing collection of images in HDFS

Using Hadoop Sequence Files, you can store ...

Aug 9, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
947 views
0 votes
1 answer

How will the Fair Scheduler handle more than one Job?

In your case when job B gets ...

Aug 16, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
558 views
0 votes
1 answer

Determining HDFS space in a cluster?

There are two ways to determine available ...

Aug 16, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
533 views
+1 vote
1 answer

Hadoop home path?

You can manually input the Hadoop home ...

Mar 26, 2018 in Big Data Hadoop by Ashish
• 2,650 points
6,663 views
0 votes
1 answer

Can I have a list of property files used in Hadoop Framework?

Here is a complete list of configuration ...

Aug 14, 2018 in Big Data Hadoop by Frankie
• 9,830 points
602 views
0 votes
1 answer

Checkpoint Node in Hadoop

Checkpoint Node keeps track of the latest ...

Jul 31, 2018 in Big Data Hadoop by kurt_cobain
• 9,390 points
1,160 views
0 votes
1 answer

How to extract Cloudera Hadoop VM archive?

If you are on Windows, you need to ...

Aug 9, 2018 in Big Data Hadoop by Frankie
• 9,830 points
763 views
0 votes
2 answers

How do I integrate kdb+ and Hadoop?

kdb+ could be installed on every Hadoop ...

Aug 6, 2018 in Big Data Hadoop by Abhi
• 3,720 points
1,389 views
0 votes
1 answer

How do I include all the Hadoop dependencies using Maven?

This is a dependency mismatch error. I ...

Apr 10, 2018 in Big Data Hadoop by Shubham
• 13,490 points
5,931 views
0 votes
1 answer

How does Hadoop distribute block writes into HDFS?

So, what happens is the slave node ...

Aug 16, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
374 views
0 votes
1 answer

Importing a portion of Relational Database to HDFS

Yes, there is a tool called Sqoop ...

Jul 20, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
1,546 views
0 votes
1 answer

What is the command to navigate in HDFS?

First of all, there is no command ...

Apr 27, 2018 in Big Data Hadoop by Shubham
• 13,490 points
5,166 views
+1 vote
2 answers

Failed to restart Hadoop namenode using cloudera quickstart

You can use Cloudera Manager to manage ...

Mar 19, 2018 in Big Data Hadoop by kurt_cobain
• 9,390 points

edited Jun 9, 2020 by MD
3,791 views
0 votes
1 answer

Determining number of reduces?

The number of reduces is set by the ...

Aug 16, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
334 views
0 votes
1 answer

Example or Use Case of Big Data

Let me answer your questions one by ...

Aug 9, 2018 in Big Data Hadoop by Frankie
• 9,830 points
669 views
0 votes
1 answer

Reading the file and populating the associative array

You can use the configure method to ...

Aug 2, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
933 views
0 votes
1 answer

Is there any way to make the Apache Commons Math Library available to a MapReduce job at runtime?

So, what you can do is when ...

Aug 3, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
922 views
+1 vote
1 answer

When I try to execute any Hadoop command, an error pops up saying unable to find or load main class M

The error might be due to the ...

Jul 17, 2018 in Big Data Hadoop by Frankie
• 9,830 points
1,562 views
0 votes
1 answer

Operation to simulate a Hadoop Production Cluster

In this case, what you can do ...

Aug 9, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
613 views
0 votes
1 answer

Increasing HFile block size

If you increase the block size then ...

Aug 6, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
685 views
0 votes
2 answers

How is Hadoop/Spark used for building large analytics reports?

The best possible framework for this task ...

Aug 7, 2018 in Big Data Hadoop by kurt_cobain
• 9,390 points
569 views
0 votes
1 answer

What is the difference between a Big Data Warehouse and a traditional Data Warehouse?

Hadoop is similar in architecture to MPP data ...

Aug 10, 2018 in Big Data Hadoop by Frankie
• 9,830 points
493 views
0 votes
1 answer

Designing Schema in HBase

Just create a single table with two ...

Aug 6, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
598 views
0 votes
1 answer

When can a reduce class also serve as a combiner?

It will act as a combiner when ...

Aug 9, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
500 views
0 votes
1 answer

How to print the content of a file present in HDFS to the console?

Yes, you can use the hdfs dfs command ...

Apr 19, 2018 in Big Data Hadoop by Shubham
• 13,490 points
5,304 views
0 votes
1 answer

How to assess and compare Hadoop for Business Intelligence?

Hadoop is a great tool to be ...

Aug 8, 2018 in Big Data Hadoop by Frankie
• 9,830 points
407 views
0 votes
1 answer

How to create a project for the first time in Hadoop?

If you want to learn the Hadoop framework ...

Jul 27, 2018 in Big Data Hadoop by Neha
• 6,300 points
933 views
0 votes
1 answer

How can Hadoop discover missing or corrupt HDFS data?

HDFS supports the fsck command to check for ...

Jul 19, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
1,256 views
0 votes
1 answer

What are the security guidelines for Impala?

Security Guidelines for Impala: The following are the ...

Aug 6, 2018 in Big Data Hadoop by kurt_cobain
• 9,390 points
475 views
0 votes
1 answer

What's the difference between Hadoop and NoSQL?

Let's start with the definitions. Hadoop is an ...

Apr 2, 2018 in Big Data Hadoop by kurt_cobain
• 9,390 points
5,901 views
0 votes
1 answer

Where can I find hadoop-streaming.jar JAR file?

You will find the streaming jar here: ...

Mar 21, 2018 in Big Data Hadoop by coldcode
• 2,080 points
6,422 views
0 votes
1 answer

What is hadoop.tmp.dir?

hadoop.tmp.dir is used as the base for temporary ...

Mar 26, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
6,163 views
0 votes
1 answer

What does the NameNode store, and where does it store it?

NameNode records the metadata of all the ...

Apr 15, 2018 in Big Data Hadoop by Shubham
• 13,490 points
5,266 views
+1 vote
0 answers

How to set up a Hadoop cluster on Mac in IntelliJ IDEA

I have installed Hadoop using brew and ...

Jul 25, 2018 in Big Data Hadoop by Neha
• 6,300 points
914 views
0 votes
1 answer

Cloudera Managing and Compatibility

With more partners than any other Hadoop ...

Aug 6, 2018 in Big Data Hadoop by kurt_cobain
• 9,390 points
361 views
0 votes
1 answer

When do reduce tasks start in Hadoop?

The reduce phase has 3 steps: shuffle, ...

Jul 27, 2018 in Big Data Hadoop by Frankie
• 9,830 points
778 views
0 votes
1 answer

Sqoop Metastore?

Sqoop metastore is a shared metadata repository ...

Jul 19, 2018 in Big Data Hadoop by kurt_cobain
• 9,390 points
1,091 views
0 votes
1 answer

How to pause and resume a Hive job?

Practically speaking, it's difficult/impossible to pause and resume ...

Jul 17, 2018 in Big Data Hadoop by Neha
• 6,300 points
1,124 views
0 votes
1 answer

Hadoop Nodes for Multiple Disks

Datanodes can store blocks in multiple directories ...

Jul 3, 2018 in Big Data Hadoop by kurt_cobain
• 9,390 points
1,707 views
0 votes
1 answer

Hortonworks vs Cloudera Architecture Difference

Features | Cloudera | MapR | HortonWorks
License | Commercial | Commercial | Open-Source
Community | Yes | Yes | Yes
SQL | Impala | Hive, Drill, Shark | Stinger
MapReduce | Yes | Yes | Yes
Unique Features | Cloudera Manager | Direct Access NFS | 100% ...

Jul 27, 2018 in Big Data Hadoop by Frankie
• 9,830 points
658 views
0 votes
1 answer

Synchronizing sqoop imported data in HDFS

Data can be synchronised using the incremental parameter ...

Jul 19, 2018 in Big Data Hadoop by kurt_cobain
• 9,390 points
971 views
0 votes
1 answer

HDFS for customer information file

No, because HDFS is optimized for write-once, ...

Jul 31, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
427 views
0 votes
1 answer

Are there any Limitations of Pig?

Limitations of Apache Pig are: As the ...

Jul 6, 2018 in Big Data Hadoop by kurt_cobain
• 9,390 points
1,432 views
0 votes
1 answer

When do the reduce tasks start in Hadoop?

Let me explain the whole scenario to you. ...

May 22, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
3,338 views
0 votes
1 answer

How does the FIFO Scheduler arbitrate the cluster resources for a job and its tasks?

Let me clear your doubt. So what ...

Jul 19, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
827 views
0 votes
1 answer

Mappers in Hadoop

The number of mappers is determined by ...

Jul 31, 2018 in Big Data Hadoop by kurt_cobain
• 9,390 points
298 views