Trending questions in Big Data Hadoop

0 votes
1 answer

InputSplit vs HDFS Block

By definition Block – Block is the continuous ...READ MORE

Jun 1, 2018 in Big Data Hadoop by kurt_cobain
• 9,390 points
4,184 views
0 votes
1 answer

What is the difference between Big Data and Data Mining?

Big data and data mining are two ...READ MORE

Aug 23, 2018 in Big Data Hadoop by Frankie
• 9,830 points
615 views
0 votes
2 answers

Which of these will vanish: Flink vs Spark?

At first glance, Flink and Spark would ...READ MORE

Aug 13, 2018 in Big Data Hadoop by kurt_cobain
• 9,390 points
1,037 views
0 votes
1 answer

What are the options for hadoop on scala?

Definitely check out Scalding. Speaking as a user ...READ MORE

Aug 22, 2018 in Big Data Hadoop by Frankie
• 9,830 points
464 views
0 votes
1 answer

Storing collection of images in HDFS

Using Hadoop Sequence Files, you can store ...READ MORE

Aug 9, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
1,001 views
0 votes
1 answer

How will the Fair Scheduler handle more than one Job?

In your case when job B gets ...READ MORE

Aug 16, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
631 views
+1 vote
1 answer

Hadoop home path?

you can manually input the hadoop home ...READ MORE

Mar 26, 2018 in Big Data Hadoop by Ashish
• 2,650 points
6,757 views
0 votes
1 answer

Determining HDFS space in a cluster?

There are two ways to determine available ...READ MORE

Aug 16, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
595 views
0 votes
1 answer

Can I have a list of property files used in Hadoop Framework?

Here is a complete list of configuration ...READ MORE

Aug 14, 2018 in Big Data Hadoop by Frankie
• 9,830 points
641 views
0 votes
1 answer

How do I include all the Hadoop dependencies using Maven?

This is a dependency mismatch error. I ...READ MORE

Apr 10, 2018 in Big Data Hadoop by Shubham
• 13,490 points
6,068 views
0 votes
1 answer

Checkpoint Node in Hadoop

Checkpoint Node keeps track of the latest ...READ MORE

Jul 31, 2018 in Big Data Hadoop by kurt_cobain
• 9,390 points
1,196 views
0 votes
2 answers

How do I integrate Kdb+ and hadoop?

 kdb+ could be installed on every Hadoop ...READ MORE

Aug 6, 2018 in Big Data Hadoop by Abhi
• 3,720 points
1,447 views
0 votes
1 answer

How to extract Cloudera Hadoop VM archive?

If you are Windows, you need to ...READ MORE

Aug 9, 2018 in Big Data Hadoop by Frankie
• 9,830 points
804 views
0 votes
1 answer

What is the command to navigate in HDFS?

First of all there is no command ...READ MORE

Apr 27, 2018 in Big Data Hadoop by Shubham
• 13,490 points
5,267 views
0 votes
1 answer

Importing a portion of Relational Database to HDFS

Yes, there is a tool called Sqoop ...READ MORE

Jul 20, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
1,602 views
+1 vote
2 answers

Failed to restart Hadoop namenode using cloudera quickstart

You can use Cloudera Manager to manage ...READ MORE

Mar 19, 2018 in Big Data Hadoop by kurt_cobain
• 9,390 points

edited Jun 9, 2020 by MD 3,865 views
+1 vote
1 answer

When I try to execute any Hadoop commands error pops up saying unable to find or load main class M

The error might be due to the ...READ MORE

Jul 17, 2018 in Big Data Hadoop by Frankie
• 9,830 points
1,665 views
0 votes
1 answer

How Hadoop distributes block writes into HDFS?

So, what happens is the slave node ...READ MORE

Aug 16, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
396 views
0 votes
1 answer

Is there any way to make Apache Commons Math Library available to MapReducer job at runtime?

So, what you can do is when ...READ MORE

Aug 3, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
967 views
0 votes
1 answer

Example or Use Case of Big Data

Let me answer your questions one by ...READ MORE

Aug 9, 2018 in Big Data Hadoop by Frankie
• 9,830 points
705 views
0 votes
1 answer

Determining number of reduces?

Number of reduces are set by the ...READ MORE

Aug 16, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
355 views
0 votes
1 answer

Reading the file and populating the associative array

You can use the configure method to ...READ MORE

Aug 2, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
953 views
0 votes
1 answer

Operation to simulate a Hadoop Production Cluster

In this case, what you can do ...READ MORE

Aug 9, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
666 views
0 votes
1 answer

Increasing HFile block size

If you increase the block size then ...READ MORE

Aug 6, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
721 views
0 votes
1 answer

How to print the content of a file in console present in HDFS?

Yes, you can use hdfs dfs command ...READ MORE

Apr 19, 2018 in Big Data Hadoop by Shubham
• 13,490 points
5,439 views
0 votes
2 answers

How does Hadoop/Spark is used for building large analytics report?

The best possible framework for this task ...READ MORE

Aug 7, 2018 in Big Data Hadoop by kurt_cobain
• 9,390 points
615 views
0 votes
1 answer

What is the difference between a Big Data Warehouse and a traditional Data Warehouse

Hadoop is similar in architecture to MPP data ...READ MORE

Aug 10, 2018 in Big Data Hadoop by Frankie
• 9,830 points
519 views
0 votes
1 answer

What's the difference between Hadoop and NoSQL ?

Let's start with the definitions Hadoop is an ...READ MORE

Apr 2, 2018 in Big Data Hadoop by kurt_cobain
• 9,390 points
6,087 views
0 votes
1 answer

Designing Schema in HBase

Just create a single table with two ...READ MORE

Aug 6, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
632 views
0 votes
1 answer

Where can I find hadoop-streaming.jar JAR file?

You will find the streaming jar here: ...READ MORE

Mar 21, 2018 in Big Data Hadoop by coldcode
• 2,080 points
6,587 views
0 votes
1 answer

When a reduce class also serve as a combiner?

It will act as a combiner when ...READ MORE

Aug 9, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
526 views
0 votes
1 answer

What is hadoop.tmp.dir ?

hadoop.tmp.dir is used as the base for temporary ...READ MORE

Mar 26, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
6,326 views
0 votes
1 answer

What are the security guidelines for Impala?

Security Guidelines for Impala: The following are the ...READ MORE

Aug 6, 2018 in Big Data Hadoop by kurt_cobain
• 9,390 points
544 views
0 votes
1 answer

How to assess and compare Hadoop for Business Intelligence?

Hadoop is a great tool to be ...READ MORE

Aug 8, 2018 in Big Data Hadoop by Frankie
• 9,830 points
438 views
0 votes
1 answer

How can Hadoop discover missing or corrupt HDFS data?

HDFS supports fsck command to check for ...READ MORE

Jul 19, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
1,300 views
0 votes
1 answer

What does NameNode stores and where it stores?

NameNode records the metadata of all the ...READ MORE

Apr 15, 2018 in Big Data Hadoop by Shubham
• 13,490 points
5,390 views
0 votes
1 answer

How to create a project for the first time in Hadoop.?

If you want to learn Hadoop framework ...READ MORE

Jul 27, 2018 in Big Data Hadoop by Neha
• 6,300 points
972 views
+1 vote
0 answers

How to set up Hadoop cluster on Mac in intelliJ IDEA

I have Installed hadoop using brew and ...READ MORE

Jul 25, 2018 in Big Data Hadoop by Neha
• 6,300 points
941 views
0 votes
1 answer

Cloudera Managing and Compatibality

With more partners than any other Hadoop ...READ MORE

Aug 6, 2018 in Big Data Hadoop by kurt_cobain
• 9,390 points
391 views
0 votes
1 answer

When do reduce tasks start in Hadoop?

The reduce phase has 3 steps: shuffle, ...READ MORE

Jul 27, 2018 in Big Data Hadoop by Frankie
• 9,830 points
841 views
0 votes
1 answer

Sqoop Metastore ?

Sqoop metastore is a shared metadata repository ...READ MORE

Jul 19, 2018 in Big Data Hadoop by kurt_cobain
• 9,390 points
1,158 views
0 votes
1 answer

How to pause and resume hive job?

Practically speaking, it's difficult/impossible to pause and resume ...READ MORE

Jul 17, 2018 in Big Data Hadoop by Neha
• 6,300 points
1,157 views
0 votes
1 answer

Hadoop Nodes for Multiple Disks

Datanodes can store blocks in multiple directories ...READ MORE

Jul 3, 2018 in Big Data Hadoop by kurt_cobain
• 9,390 points
1,750 views
0 votes
1 answer

Synchronizing sqoop imported data in HDFS

Data can be synchronised using incremental parameter ...READ MORE

Jul 19, 2018 in Big Data Hadoop by kurt_cobain
• 9,390 points
1,017 views
0 votes
1 answer

Hortonworks vs Cloudera Architecture Difference

Features Cloudera MapR HortonWorks License Commercial Commercial Open-Source Community Yes Yes Yes SQL Impala Hive, Drill, Shark Stinger MapReduce Yes Yes Yes Unique Features Cloudera Manager Direct Access NFS 100% ...READ MORE

Jul 27, 2018 in Big Data Hadoop by Frankie
• 9,830 points
688 views
0 votes
1 answer

HDFS for customer information file

No, because HDFS is optimized for write-once, ...READ MORE

Jul 31, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
451 views
0 votes
1 answer

Are there any Limitations of Pig?

Limitations of the Apache Pig are: As the ...READ MORE

Jul 6, 2018 in Big Data Hadoop by kurt_cobain
• 9,390 points
1,519 views
0 votes
1 answer

When is the reduce tasks start in Hadoop?

Let me explain you the whole scenario. ...READ MORE

May 22, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
3,439 views
+1 vote
1 answer

Hadoop Installation Issue on Windows

Below is the main error you are ...READ MORE

Mar 26, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
5,802 views
0 votes
1 answer

Mappers in Hadoop

The number of mappers is determined by ...READ MORE

Jul 31, 2018 in Big Data Hadoop by kurt_cobain
• 9,390 points
347 views