Most viewed questions in Big Data Hadoop

0 votes
1 answer

What is DML in Hive?

Hi, DML is Data Manipulation Language. DML statements are ...READ MORE

May 15, 2019 in Big Data Hadoop by Gitika
• 65,770 points
846 views
0 votes
1 answer

Is there a good online tutorial for Hadoop development on a Windows 7 machine?

I would suggest you go through this ...READ MORE

Mar 29, 2018 in Big Data Hadoop by kurt_cobain
• 9,350 points
845 views
0 votes
1 answer

Classes implementing InputFormat frequently

FileInputFormat : Base class for all file-based InputFormats Other ...READ MORE

Jul 24, 2019 in Big Data Hadoop by Reshma
841 views
0 votes
1 answer

How to install Hadoop on Ubuntu?

You can manually create a file hadoop-env.sh ...READ MORE

Jul 5, 2019 in Big Data Hadoop by ravikiran
• 4,620 points
840 views
–1 vote
1 answer

Issue with MySql Server connection

Try to restart the mysqld server and then login: sudo ...READ MORE

Jan 2, 2019 in Big Data Hadoop by Omkar
• 69,220 points
840 views
–1 vote
1 answer

When to use Sqoop?

Sqoop is used to transfer any data ...READ MORE

Dec 31, 2018 in Big Data Hadoop by Omkar
• 69,220 points
840 views
0 votes
1 answer

How to enable LDAP authenticaition in Hue?

You can enable LDAP authentication with HiveServer2 by ...READ MORE

Jun 3, 2019 in Big Data Hadoop by Bhavan
839 views
0 votes
1 answer

Which Operating system is more preferable for data node?

In a real installation (1 active namenode, ...READ MORE

Sep 4, 2018 in Big Data Hadoop by Neha
• 6,300 points
839 views
0 votes
1 answer

How to set High Availability for MapReduce cluster in Hue?

To do this, you have to set ...READ MORE

May 30, 2019 in Big Data Hadoop by Tina
838 views
+1 vote
1 answer

datanode decomissioning

You have set the replication factor to ...READ MORE

Nov 28, 2019 in Big Data Hadoop by Rahul
837 views
0 votes
1 answer

Configure Oozie to use MariaDB

Add the following properties in the oozie-site.xml file, it ...READ MORE

Jun 4, 2019 in Big Data Hadoop by Esha
837 views
0 votes
1 answer
0 votes
1 answer

How does Hadoop accesses the files which are distributed among different boundaries?

Hadoop's MapReduce function does not work on ...READ MORE

May 7, 2019 in Big Data Hadoop by ravikiran
• 4,620 points
831 views
0 votes
1 answer

What is the use of setting the name of the job in mapreduce?

Job job = new Job(conf,"job_name") is just used ...READ MORE

Jul 10, 2019 in Big Data Hadoop by Umesh
829 views
0 votes
1 answer

How does the number of groups affect the cost of the shuffling phase?

Shuffle happens with key-value pairs. So, when ...READ MORE

Jan 22, 2019 in Big Data Hadoop by Omkar
• 69,220 points
829 views
0 votes
1 answer

Explain to me how to transfer data between Azure tables and Hadoop on Azure

I shall redirect you to a link ...READ MORE

Jul 4, 2019 in Big Data Hadoop by ravikiran
• 4,620 points
828 views
0 votes
1 answer

Merging Files using PIG

In order to merge two or more ...READ MORE

Jul 9, 2019 in Big Data Hadoop by Tina
827 views
0 votes
1 answer

What Oozie CLI retries connection used for?

Hi, Oozie CLI retries the connection to Oozie servers ...READ MORE

Jun 24, 2019 in Big Data Hadoop by Gitika
• 65,770 points
827 views
0 votes
1 answer

Is data sent to some other location to be processed in pig?

With pig, we are not taking data ...READ MORE

Jul 10, 2019 in Big Data Hadoop by Adi
826 views
0 votes
1 answer

"Write successful" message from Data node to Name Node

What happens is namenode on the basis ...READ MORE

Jun 20, 2019 in Big Data Hadoop by Quill
826 views
0 votes
1 answer

How to create database in Mysql?

The command you can use is: create table ...READ MORE

Apr 15, 2019 in Big Data Hadoop by Gitika
• 65,770 points
826 views
0 votes
1 answer

MapFile in Pig

MapFile is a class which serves file-based ...READ MORE

Jul 6, 2018 in Big Data Hadoop by Data_Nerd
• 2,390 points
826 views
0 votes
0 answers

flume installation steps on ubuntu [closed]

Jun 27, 2019 in Big Data Hadoop by shraddha
• 120 points

closed Jun 27, 2019 by Omkar 825 views
0 votes
1 answer

How to build Zookeeper repository for Hue?

Open the terminal and run the following ...READ MORE

May 30, 2019 in Big Data Hadoop by Liam
825 views
0 votes
1 answer

What is the difference between a Big Data Warehouse and a traditional Data Warehouse

Hadoop is similar in architecture to MPP data ...READ MORE

Aug 10, 2018 in Big Data Hadoop by Frankie
• 9,830 points
825 views
0 votes
1 answer

What is Oozie Hive submit command?

Hi, You can use this example : oozie hive ...READ MORE

Jun 24, 2019 in Big Data Hadoop by Gitika
• 65,770 points
824 views
0 votes
1 answer

How to implement sqoop jobs?

Step 1: Give the below command to ...READ MORE

Dec 18, 2018 in Big Data Hadoop by Omkar
• 69,220 points
824 views
0 votes
1 answer

How to loacte data cell in Hbase?

Hey, Hbase uses the coordinates to locate a ...READ MORE

May 21, 2019 in Big Data Hadoop by Gitika
• 65,770 points
822 views
0 votes
0 answers

Hive data load error

I have table in hive with below ...READ MORE

Jun 19, 2019 in Big Data Hadoop by Rishi
821 views
0 votes
1 answer

How to set alternate user for new connection in Hue?

You can use the hive.server2.proxy.user property to do ...READ MORE

Jun 3, 2019 in Big Data Hadoop by Lohith
821 views
0 votes
1 answer

When and when not to use PigStore?

The Pig storage is not used only ...READ MORE

Jul 10, 2019 in Big Data Hadoop by Rushil
820 views
0 votes
1 answer

Setting the resource manager port in Hue

Open the hue configuration file and search for the [[yarn_clusters]] sub-section. ...READ MORE

May 30, 2019 in Big Data Hadoop by Khan
820 views
0 votes
1 answer

Setting user limit for Hbase?

Hey, Because HBase is a database, it opens ...READ MORE

May 29, 2019 in Big Data Hadoop by Gitika
• 65,770 points
819 views
0 votes
1 answer

Installing WebHCat on Ubuntu

To install WebHCat on ubuntu using apt-get, ...READ MORE

Jun 3, 2019 in Big Data Hadoop by Richard
818 views
0 votes
1 answer

Why tuple keywords is used in pig?

Hey, A tuple is a set of field, ...READ MORE

May 6, 2019 in Big Data Hadoop by Gitika
• 65,770 points
818 views
0 votes
1 answer

How to set Sentry server details in Hue?

You have to set the server details ...READ MORE

May 30, 2019 in Big Data Hadoop by Karna
817 views
0 votes
1 answer

Problem while running MapReduce code.

You will have to create a Mapper ...READ MORE

May 23, 2019 in Big Data Hadoop by Rishi
817 views
0 votes
1 answer

How to set rep to particular user in hdfs?

HDFS does not allocate capacity separately based ...READ MORE

Jul 5, 2019 in Big Data Hadoop by Leena
816 views
0 votes
1 answer

How to embed HDFS cluster information on a website?

Hi @Bhavish.  It is possible to get cluster ...READ MORE

May 27, 2019 in Big Data Hadoop by Omkar
• 69,220 points
816 views
0 votes
1 answer

Why INSERT command in Hive?

Hey, The insert command is used to load the data ...READ MORE

May 15, 2019 in Big Data Hadoop by Gitika
• 65,770 points
816 views
0 votes
1 answer

Do worker hosts need access to the Cloudera public repositories for an install with Cloudera Manager?

You can perform an installation or upgrade ...READ MORE

Dec 7, 2018 in Big Data Hadoop by Frankie
• 9,830 points
815 views
0 votes
1 answer

Explain what does the conf.setMapper Class do?

Hi, Conf.setMapperclass  sets the mapper class and all ...READ MORE

Jun 10, 2019 in Big Data Hadoop by Gitika
• 65,770 points
813 views
0 votes
1 answer

What is the benefit of using CDH over other Distributors?

CDH is basically a packaged deal, where ...READ MORE

Mar 29, 2018 in Big Data Hadoop by kurt_cobain
• 9,350 points
810 views
0 votes
1 answer

How to check the directory is present in shell script or not?

Hi, To check the directory is present in ...READ MORE

Jun 25, 2019 in Big Data Hadoop by Gitika
• 65,770 points
809 views
0 votes
1 answer

How to format a Hadoop Master Node?

Hi@akhtar, To start the Hadoop NameNode, you need ...READ MORE

Sep 21, 2020 in Big Data Hadoop by MD
• 95,460 points
808 views
0 votes
1 answer

What are the properties that have to be mentioned in . properties in Oozie?

Hey, These are the listed properties that need ...READ MORE

Jun 26, 2019 in Big Data Hadoop by Gitika
• 65,770 points
808 views
0 votes
1 answer

Spark: Problem creating RDD

Please use the command as shown below ...READ MORE

May 24, 2019 in Big Data Hadoop by Raj
808 views
0 votes
1 answer

Hadoop: intervals and JOIN

Hey, a solution was given on Biostar: http://biostar.stackexchange.com/questions/8821. Hope ...READ MORE

Sep 24, 2018 in Big Data Hadoop by slayer
• 29,370 points
808 views
0 votes
1 answer

How to load unencrypted tables to encrypted in Hive?

Refer to the below command to load ...READ MORE

May 30, 2019 in Big Data Hadoop by John
806 views