Trending questions in Big Data Hadoop

0 votes
1 answer

What is the difference between Big Data and Data Mining?

Big data and data mining are two ...READ MORE

Aug 23, 2018 in Big Data Hadoop by Frankie
• 9,830 points
597 views
0 votes
2 answers

Which of these will vanish: Flink vs Spark?

At first glance, Flink and Spark would ...READ MORE

Aug 13, 2018 in Big Data Hadoop by kurt_cobain
• 9,390 points
1,020 views
0 votes
1 answer

InputSplit vs HDFS Block

By definition Block – Block is the continuous ...READ MORE

Jun 1, 2018 in Big Data Hadoop by kurt_cobain
• 9,390 points
4,149 views
0 votes
1 answer

What are the options for hadoop on scala?

Definitely check out Scalding. Speaking as a user ...READ MORE

Aug 22, 2018 in Big Data Hadoop by Frankie
• 9,830 points
452 views
0 votes
1 answer

Storing collection of images in HDFS

Using Hadoop Sequence Files, you can store ...READ MORE

Aug 9, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
980 views
0 votes
1 answer

How will the Fair Scheduler handle more than one Job?

In your case when job B gets ...READ MORE

Aug 16, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
595 views
+1 vote
1 answer

Hadoop home path?

you can manually input the hadoop home ...READ MORE

Mar 26, 2018 in Big Data Hadoop by Ashish
• 2,650 points
6,716 views
0 votes
1 answer

Determining HDFS space in a cluster?

There are two ways to determine available ...READ MORE

Aug 16, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
560 views
0 votes
1 answer

Can I have a list of property files used in Hadoop Framework?

Here is a complete list of configuration ...READ MORE

Aug 14, 2018 in Big Data Hadoop by Frankie
• 9,830 points
622 views
0 votes
1 answer

How do I include all the Hadoop dependencies using Maven?

This is a dependency mismatch error. I ...READ MORE

Apr 10, 2018 in Big Data Hadoop by Shubham
• 13,490 points
6,014 views
0 votes
1 answer

Checkpoint Node in Hadoop

Checkpoint Node keeps track of the latest ...READ MORE

Jul 31, 2018 in Big Data Hadoop by kurt_cobain
• 9,390 points
1,173 views
0 votes
1 answer

How to extract Cloudera Hadoop VM archive?

If you are Windows, you need to ...READ MORE

Aug 9, 2018 in Big Data Hadoop by Frankie
• 9,830 points
783 views
0 votes
2 answers

How do I integrate Kdb+ and hadoop?

 kdb+ could be installed on every Hadoop ...READ MORE

Aug 6, 2018 in Big Data Hadoop by Abhi
• 3,720 points
1,418 views
0 votes
1 answer

What is the command to navigate in HDFS?

First of all there is no command ...READ MORE

Apr 27, 2018 in Big Data Hadoop by Shubham
• 13,490 points
5,220 views
0 votes
1 answer

Importing a portion of Relational Database to HDFS

Yes, there is a tool called Sqoop ...READ MORE

Jul 20, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
1,576 views
0 votes
1 answer

How Hadoop distributes block writes into HDFS?

So, what happens is the slave node ...READ MORE

Aug 16, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
384 views
+1 vote
2 answers

Failed to restart Hadoop namenode using cloudera quickstart

You can use Cloudera Manager to manage ...READ MORE

Mar 19, 2018 in Big Data Hadoop by kurt_cobain
• 9,390 points

edited Jun 9, 2020 by MD 3,823 views
+1 vote
1 answer

When I try to execute any Hadoop commands error pops up saying unable to find or load main class M

The error might be due to the ...READ MORE

Jul 17, 2018 in Big Data Hadoop by Frankie
• 9,830 points
1,624 views
0 votes
1 answer

Is there any way to make Apache Commons Math Library available to MapReducer job at runtime?

So, what you can do is when ...READ MORE

Aug 3, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
946 views
0 votes
1 answer

Determining number of reduces?

Number of reduces are set by the ...READ MORE

Aug 16, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
346 views
0 votes
1 answer

Example or Use Case of Big Data

Let me answer your questions one by ...READ MORE

Aug 9, 2018 in Big Data Hadoop by Frankie
• 9,830 points
689 views
0 votes
1 answer

Reading the file and populating the associative array

You can use the configure method to ...READ MORE

Aug 2, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
945 views
0 votes
1 answer

Operation to simulate a Hadoop Production Cluster

In this case, what you can do ...READ MORE

Aug 9, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
634 views
0 votes
1 answer

Increasing HFile block size

If you increase the block size then ...READ MORE

Aug 6, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
704 views
0 votes
1 answer

How to print the content of a file in console present in HDFS?

Yes, you can use hdfs dfs command ...READ MORE

Apr 19, 2018 in Big Data Hadoop by Shubham
• 13,490 points
5,387 views
0 votes
2 answers

How does Hadoop/Spark is used for building large analytics report?

The best possible framework for this task ...READ MORE

Aug 7, 2018 in Big Data Hadoop by kurt_cobain
• 9,390 points
590 views
0 votes
1 answer

What is the difference between a Big Data Warehouse and a traditional Data Warehouse

Hadoop is similar in architecture to MPP data ...READ MORE

Aug 10, 2018 in Big Data Hadoop by Frankie
• 9,830 points
502 views
0 votes
1 answer

Designing Schema in HBase

Just create a single table with two ...READ MORE

Aug 6, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
624 views
0 votes
1 answer

When a reduce class also serve as a combiner?

It will act as a combiner when ...READ MORE

Aug 9, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
509 views
0 votes
1 answer

What's the difference between Hadoop and NoSQL ?

Let's start with the definitions Hadoop is an ...READ MORE

Apr 2, 2018 in Big Data Hadoop by kurt_cobain
• 9,390 points
6,012 views
0 votes
1 answer

Where can I find hadoop-streaming.jar JAR file?

You will find the streaming jar here: ...READ MORE

Mar 21, 2018 in Big Data Hadoop by coldcode
• 2,080 points
6,516 views
0 votes
1 answer

What is hadoop.tmp.dir ?

hadoop.tmp.dir is used as the base for temporary ...READ MORE

Mar 26, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
6,254 views
0 votes
1 answer

How to assess and compare Hadoop for Business Intelligence?

Hadoop is a great tool to be ...READ MORE

Aug 8, 2018 in Big Data Hadoop by Frankie
• 9,830 points
423 views
0 votes
1 answer

What are the security guidelines for Impala?

Security Guidelines for Impala: The following are the ...READ MORE

Aug 6, 2018 in Big Data Hadoop by kurt_cobain
• 9,390 points
508 views
0 votes
1 answer

How to create a project for the first time in Hadoop.?

If you want to learn Hadoop framework ...READ MORE

Jul 27, 2018 in Big Data Hadoop by Neha
• 6,300 points
951 views
0 votes
1 answer

How can Hadoop discover missing or corrupt HDFS data?

HDFS supports fsck command to check for ...READ MORE

Jul 19, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
1,275 views
0 votes
1 answer

What does NameNode stores and where it stores?

NameNode records the metadata of all the ...READ MORE

Apr 15, 2018 in Big Data Hadoop by Shubham
• 13,490 points
5,332 views
+1 vote
0 answers

How to set up Hadoop cluster on Mac in intelliJ IDEA

I have Installed hadoop using brew and ...READ MORE

Jul 25, 2018 in Big Data Hadoop by Neha
• 6,300 points
926 views
0 votes
1 answer

Cloudera Managing and Compatibality

With more partners than any other Hadoop ...READ MORE

Aug 6, 2018 in Big Data Hadoop by kurt_cobain
• 9,390 points
379 views
0 votes
1 answer

When do reduce tasks start in Hadoop?

The reduce phase has 3 steps: shuffle, ...READ MORE

Jul 27, 2018 in Big Data Hadoop by Frankie
• 9,830 points
814 views
0 votes
1 answer

Sqoop Metastore ?

Sqoop metastore is a shared metadata repository ...READ MORE

Jul 19, 2018 in Big Data Hadoop by kurt_cobain
• 9,390 points
1,120 views
0 votes
1 answer

How to pause and resume hive job?

Practically speaking, it's difficult/impossible to pause and resume ...READ MORE

Jul 17, 2018 in Big Data Hadoop by Neha
• 6,300 points
1,136 views
0 votes
1 answer

Hadoop Nodes for Multiple Disks

Datanodes can store blocks in multiple directories ...READ MORE

Jul 3, 2018 in Big Data Hadoop by kurt_cobain
• 9,390 points
1,733 views
0 votes
1 answer

Hortonworks vs Cloudera Architecture Difference

Features Cloudera MapR HortonWorks License Commercial Commercial Open-Source Community Yes Yes Yes SQL Impala Hive, Drill, Shark Stinger MapReduce Yes Yes Yes Unique Features Cloudera Manager Direct Access NFS 100% ...READ MORE

Jul 27, 2018 in Big Data Hadoop by Frankie
• 9,830 points
674 views
0 votes
1 answer

Synchronizing sqoop imported data in HDFS

Data can be synchronised using incremental parameter ...READ MORE

Jul 19, 2018 in Big Data Hadoop by kurt_cobain
• 9,390 points
992 views
0 votes
1 answer

HDFS for customer information file

No, because HDFS is optimized for write-once, ...READ MORE

Jul 31, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
437 views
0 votes
1 answer

Are there any Limitations of Pig?

Limitations of the Apache Pig are: As the ...READ MORE

Jul 6, 2018 in Big Data Hadoop by kurt_cobain
• 9,390 points
1,476 views
0 votes
1 answer

When is the reduce tasks start in Hadoop?

Let me explain you the whole scenario. ...READ MORE

May 22, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
3,386 views
0 votes
1 answer

Mappers in Hadoop

The number of mappers is determined by ...READ MORE

Jul 31, 2018 in Big Data Hadoop by kurt_cobain
• 9,390 points
322 views
0 votes
1 answer

How the FIFO Scheduler arbitrates the cluster resources for a job and its tasks?

Let me clear your doubt. So what ...READ MORE

Jul 19, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
841 views