Most viewed questions in Big Data Hadoop

0 votes
1 answer

How to implement Sqoop jobs?

Step 1: Give the below command to ...READ MORE

Dec 18, 2018 in Big Data Hadoop by Omkar
• 69,180 points
991 views
0 votes
1 answer

Can I run Hadoop with Docker for both DEV and PROD environments?

Hi, Yes, you can run Hadoop with Docker ...READ MORE

Jan 24, 2020 in Big Data Hadoop by MD
• 95,460 points
990 views
0 votes
1 answer

How to set history server api in Hue?

You have to set this in hue configuration ...READ MORE

May 30, 2019 in Big Data Hadoop by jana
987 views
0 votes
1 answer

Classes implementing InputFormat frequently

FileInputFormat: Base class for all file-based InputFormats. Other ...READ MORE

Jul 24, 2019 in Big Data Hadoop by Reshma
986 views
0 votes
1 answer

How to add filter for groups in Hue?

Open the Hue configuration file. Search for the [[ldap]] section. Search for the [[[groups]]] section. Set the ...READ MORE

May 28, 2019 in Big Data Hadoop by Karan
984 views
0 votes
1 answer

MapReduce shuffle and sort

The MapReduce task happens in the following ...READ MORE

Dec 27, 2018 in Big Data Hadoop by Omkar
• 69,180 points
984 views
0 votes
1 answer

Hadoop: intervals and JOIN

Hey, a solution was given on Biostar: http://biostar.stackexchange.com/questions/8821. Hope ...READ MORE

Sep 24, 2018 in Big Data Hadoop by slayer
• 29,370 points
984 views
0 votes
1 answer

How to install Hadoop on Ubuntu?

You can manually create a file hadoop-env.sh ...READ MORE

Jul 5, 2019 in Big Data Hadoop by ravikiran
• 4,620 points
982 views
0 votes
1 answer

Hadoop with Python: How to use python to stop Hadoop tasktracker?

Here you go: os.system('sudo -u hadoop bin/hadoop-daemon.sh stop ...READ MORE

Nov 16, 2018 in Big Data Hadoop by Omkar
• 69,180 points
981 views
0 votes
1 answer

Setting user limit for HBase?

Hey, Because HBase is a database, it opens ...READ MORE

May 29, 2019 in Big Data Hadoop by Gitika
• 65,730 points
978 views
0 votes
1 answer

How to embed HDFS cluster information on a website?

Hi @Bhavish. It is possible to get cluster ...READ MORE

May 27, 2019 in Big Data Hadoop by Omkar
• 69,180 points
978 views
0 votes
1 answer

Do worker hosts need access to the Cloudera public repositories for an install with Cloudera Manager?

You can perform an installation or upgrade ...READ MORE

Dec 7, 2018 in Big Data Hadoop by Frankie
• 9,830 points
978 views
0 votes
1 answer

Getting error in Custom Input Format

Use this property directly to delimit the ...READ MORE

Apr 19, 2018 in Big Data Hadoop by Shubham
• 13,490 points
976 views
0 votes
1 answer

Configure Oozie to use MariaDB

Add the following properties in the oozie-site.xml file, it ...READ MORE

Jun 4, 2019 in Big Data Hadoop by Esha
975 views
0 votes
1 answer

How to create a table in Cassandra?

Hi @akhtar, to create a table in Cassandra, you have ...READ MORE

Apr 1, 2020 in Big Data Hadoop by MD
• 95,460 points
973 views
0 votes
1 answer

How to enable LDAP authentication in Hue?

You can enable LDAP authentication with HiveServer2 by ...READ MORE

Jun 3, 2019 in Big Data Hadoop by Bhavan
973 views
0 votes
1 answer

Setting the resource manager port in Hue

Open the hue configuration file and search for the [[yarn_clusters]] sub-section. ...READ MORE

May 30, 2019 in Big Data Hadoop by Khan
973 views
0 votes
1 answer

What is DML in Hive?

Hi, DML is Data Manipulation Language. DML statements are ...READ MORE

May 15, 2019 in Big Data Hadoop by Gitika
• 65,730 points
973 views
0 votes
1 answer

Why is the Count keyword used in HBase?

Hey, An aggregate function that returns the number ...READ MORE

May 24, 2019 in Big Data Hadoop by Gitika
• 65,730 points
971 views
0 votes
1 answer

How to build Zookeeper repository for Hue?

Open the terminal and run the following ...READ MORE

May 30, 2019 in Big Data Hadoop by Liam
970 views
0 votes
1 answer

Default Sqoop validator and how to change it?

The default validator used is org.apache.sqoop.validation.RowCountValidator. If you ...READ MORE

May 15, 2019 in Big Data Hadoop by Sharan
970 views
0 votes
1 answer

How does Hadoop access files that are distributed across different boundaries?

Hadoop's MapReduce function does not work on ...READ MORE

May 7, 2019 in Big Data Hadoop by ravikiran
• 4,620 points
970 views
0 votes
1 answer

What is the use of setting the name of the job in mapreduce?

Job job = new Job(conf,"job_name") is just used ...READ MORE

Jul 10, 2019 in Big Data Hadoop by Umesh
969 views
–1 vote
1 answer

When to use Sqoop?

Sqoop is used to transfer any data ...READ MORE

Dec 31, 2018 in Big Data Hadoop by Omkar
• 69,180 points
969 views
0 votes
0 answers

Cloudera Certification

Hi, I want to write CCA-175 (CCA Spark ...READ MORE

Aug 23, 2019 in Big Data Hadoop by Nagajyothi
• 160 points
968 views
0 votes
1 answer

Explain to me how to transfer data between Azure tables and Hadoop on Azure

I shall redirect you to a link ...READ MORE

Jul 4, 2019 in Big Data Hadoop by ravikiran
• 4,620 points
968 views
0 votes
1 answer

How to set High Availability for MapReduce cluster in Hue?

To do this, you have to set ...READ MORE

May 30, 2019 in Big Data Hadoop by Tina
968 views
+1 vote
1 answer

DataNode decommissioning

You have set the replication factor to ...READ MORE

Nov 28, 2019 in Big Data Hadoop by Rahul
967 views
0 votes
1 answer

Spark: Problem creating RDD

Please use the command as shown below ...READ MORE

May 24, 2019 in Big Data Hadoop by Raj
966 views
0 votes
1 answer

What are the options for Hadoop on Scala?

Definitely check out Scalding. Speaking as a user ...READ MORE

Aug 22, 2018 in Big Data Hadoop by Frankie
• 9,830 points
966 views
0 votes
1 answer

When and when not to use PigStore?

The Pig storage is not used only ...READ MORE

Jul 10, 2019 in Big Data Hadoop by Rushil
965 views
0 votes
1 answer

What is Oozie Sqoop submit command?

Hey, This is the command you can use: oozie ...READ MORE

Jun 24, 2019 in Big Data Hadoop by Gitika
• 65,730 points

edited Jun 26, 2019 by Gitika
965 views
0 votes
1 answer

What does conf.setMapperClass do?

Hi, conf.setMapperClass sets the mapper class and all ...READ MORE

Jun 10, 2019 in Big Data Hadoop by Gitika
• 65,730 points
962 views
0 votes
1 answer

Problem while running MapReduce code.

You will have to create a Mapper ...READ MORE

May 23, 2019 in Big Data Hadoop by Rishi
961 views
0 votes
1 answer

Executing mapper and reducer individually

This is what happens: Map reduce framework will ...READ MORE

Dec 18, 2018 in Big Data Hadoop by Omkar
• 69,180 points
961 views
0 votes
1 answer

What are Oozie CLI connection retries used for?

Hi, Oozie CLI retries the connection to Oozie servers ...READ MORE

Jun 24, 2019 in Big Data Hadoop by Gitika
• 65,730 points
960 views
0 votes
1 answer

How to create a database in MySQL?

The command you can use is: create table ...READ MORE

Apr 15, 2019 in Big Data Hadoop by Gitika
• 65,730 points
960 views
0 votes
1 answer

Determining the number of reduces?

The number of reduces is set by the ...READ MORE

Aug 16, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
959 views
0 votes
1 answer

How to install Hadoop in Ubuntu?

Follow the steps as mentioned in the ...READ MORE

May 17, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
959 views
0 votes
1 answer

Installing WebHCat on Ubuntu

To install WebHCat on ubuntu using apt-get, ...READ MORE

Jun 3, 2019 in Big Data Hadoop by Richard
957 views
0 votes
1 answer

How to set alternate user for new connection in Hue?

You can use the hive.server2.proxy.user property to do ...READ MORE

Jun 3, 2019 in Big Data Hadoop by Lohith
956 views
0 votes
1 answer

How to set Sentry server details in Hue?

You have to set the server details ...READ MORE

May 30, 2019 in Big Data Hadoop by Karna
955 views
0 votes
1 answer

How does the number of groups affect the cost of the shuffling phase?

Shuffle happens with key-value pairs. So, when ...READ MORE

Jan 22, 2019 in Big Data Hadoop by Omkar
• 69,180 points
955 views
0 votes
1 answer

Gossip Protocol features.

Gossip Protocol in Cassandra is a peer-to-peer ...READ MORE

Jun 26, 2018 in Big Data Hadoop by Data_Nerd
• 2,390 points
955 views
0 votes
1 answer

While executing this I am getting this error. Can anyone please help me with the solution?

Hi @srija, it seems you have not provided complete ...READ MORE

Apr 6, 2020 in Big Data Hadoop by MD
• 95,460 points
954 views
0 votes
1 answer

What are the Apache Pig running modes?

Hey, you can run Apache Pig in ...READ MORE

May 7, 2019 in Big Data Hadoop by Gitika
• 65,730 points

edited May 8, 2019 by Gitika
954 views
0 votes
1 answer

Merging files using Pig

In order to merge two or more ...READ MORE

Jul 9, 2019 in Big Data Hadoop by Tina
953 views
0 votes
1 answer

How to load unencrypted tables to encrypted in Hive?

Refer to the below command to load ...READ MORE

May 30, 2019 in Big Data Hadoop by John
953 views
0 votes
1 answer

How does Hadoop distribute block writes into HDFS?

So, what happens is the slave node ...READ MORE

Aug 16, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
953 views