Most viewed questions in Big Data Hadoop

–1 vote
1 answer

How to start namenode and datanode individually?

You can use these commands. For namenode: ./hadoop-daemon.sh start ...READ MORE

Dec 21, 2018 in Big Data Hadoop by Omkar
• 69,180 points
12,255 views
+1 vote
1 answer

Why are a minimum of 3 JournalNodes required in Hadoop HA architecture?

Initially in Hadoop 1.x, the NameNode was ...READ MORE

Apr 20, 2018 in Big Data Hadoop by kurt_cobain
• 9,350 points
12,196 views
0 votes
2 answers

How can I list NameNode & DataNodes from any machine in the Hadoop cluster?

You can browse hadoop page from any ...READ MORE

Jan 23, 2020 in Big Data Hadoop by MD
• 95,460 points
12,177 views
0 votes
1 answer

Copy a file from local to HDFS from a Spark job in YARN mode

Please refer to the below code: import org.apache.hadoop.conf.Configuration import ...READ MORE

Jul 16, 2019 in Big Data Hadoop by Raj
12,144 views
+2 votes
1 answer

FileStreamSink: Error while looking for metadata directory. java.lang.IllegalArgumentException: java.net.UnknownHostException: hive

Hi @akhtar, you tried to read a file from your ...READ MORE

Feb 13, 2020 in Big Data Hadoop by MD
• 95,460 points
11,935 views
0 votes
1 answer

What is the difference between S3n, S3a, and S3?

Your doubt is quite an interesting one. Yes, ...READ MORE

Jul 30, 2019 in Big Data Hadoop by ravikiran
• 4,620 points
11,864 views
0 votes
1 answer

How to add a user to the HDFS supergroup in Linux?

Yes, now I have the whole idea ...READ MORE

Sep 21, 2018 in Big Data Hadoop by Frankie
• 9,830 points
11,841 views
0 votes
3 answers

Hadoop Spark: How to iterate hdfs directories?

Using PySpark  hadoop = sc._jvm.org.apache.hadoop fs = hadoop.fs.FileSystem conf = ...READ MORE

Dec 5, 2018 in Big Data Hadoop by Kiran
11,824 views
+1 vote
2 answers

Hadoop Hive: Not recognizing alias named in select part?

You can not use column aliases in ...READ MORE

Nov 26, 2019 in Big Data Hadoop by anonymous
11,722 views
0 votes
1 answer

How to import data in Sqoop as a Parquet file?

Sqoop allows you to import the file ...READ MORE

May 15, 2019 in Big Data Hadoop by Nanda
11,531 views
0 votes
1 answer

How can anyone recover deleted HDFS data?

Yes, there is a trash policy in ...READ MORE

May 16, 2018 in Big Data Hadoop by Shubham
• 13,490 points
11,516 views
0 votes
1 answer

How to uninstall all versions of Hadoop completely from the system?

Do a: sudo dpkg -l | grep hadoop to ...READ MORE

Oct 29, 2018 in Big Data Hadoop by Frankie
• 9,830 points
11,385 views
0 votes
3 answers

How to change the delimiter in Sqoop?

--fields-terminated-by <char> READ MORE

Jun 25, 2019 in Big Data Hadoop by anonymous
11,147 views
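
For the Sqoop delimiter question above, a minimal sketch of where the `--fields-terminated-by` flag sits in an import command. The JDBC URL, table name, and target directory are hypothetical placeholders, and the helper function `sqoop_import_command` is introduced here only for illustration; it builds and prints the command rather than running Sqoop.

```python
import shlex


def sqoop_import_command(connect, table, target_dir, delimiter):
    """Build the argument list for a `sqoop import` with a custom field delimiter."""
    return [
        "sqoop", "import",
        "--connect", connect,          # JDBC connection string
        "--table", table,              # source table to import
        "--target-dir", target_dir,    # HDFS directory for the output files
        "--fields-terminated-by", delimiter,  # the flag from the answer above
    ]


# Hypothetical values, assumed only for this example.
args = sqoop_import_command(
    "jdbc:mysql://dbhost/sales", "orders", "/user/demo/orders", "|"
)

# shlex.join quotes the pipe character so the printed line is safe to paste into a shell.
print(shlex.join(args))
```

Passing the delimiter as its own list element avoids shell-quoting mistakes if the list is later handed to `subprocess.run`.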
0 votes
1 answer

Best way of starting & stopping the Hadoop daemons with command line

First way is to use start-all.sh & ...READ MORE

Apr 15, 2018 in Big Data Hadoop by Shubham
• 13,490 points
11,139 views
0 votes
1 answer

Read Parquet file using hdfs dfs -cat command

When you try to read a parquet ...READ MORE

May 24, 2019 in Big Data Hadoop by Calvin
11,003 views
0 votes
1 answer

How to select a particular column in a table in Hive?

Hive is a high-level language to analyze ...READ MORE

Jul 31, 2019 in Big Data Hadoop by Killian
10,937 views
+1 vote
1 answer

ssh: connect to host localhost port 22: Connection refused in Hadoop.

Hi @akhtar, you need to uninstall SSH and reinstall ...READ MORE

Dec 18, 2020 in Big Data Hadoop by MD
• 95,460 points
10,873 views
0 votes
1 answer

Opening a HDFS file in browser

In HDFS, 8020 is the IPC port; you ...READ MORE

Oct 29, 2018 in Big Data Hadoop by Frankie
• 9,830 points
10,832 views
0 votes
1 answer

Error: Failed to locate WinUtils binary in the Hadoop Binary Path

I will redirect you to a link ...READ MORE

Jun 20, 2019 in Big Data Hadoop by ravikiran
• 4,620 points
10,802 views
0 votes
1 answer

How to select partition in Hive?

Follow these steps: A. Create Database ------------------ create database retail123; B. ...READ MORE

Feb 19, 2019 in Big Data Hadoop by Omkar
• 69,180 points
10,754 views
0 votes
1 answer

Moving data to hdfs using copyFromLocal command

Actually dfs.data.dir and dfs.name.dir have to point ...READ MORE

Mar 27, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
10,733 views
0 votes
1 answer

How to create a sequential znode in ZooKeeper?

Hey, To create a Sequential znode, add -s flag as shown ...READ MORE

May 28, 2019 in Big Data Hadoop by Gitika
• 65,730 points
10,653 views
0 votes
1 answer

Error: FUNCTION 'NUCLEUS_ASCII' already exists.

Hi @akhtar, this error occurs when hive-shell started before ...READ MORE

Feb 5, 2020 in Big Data Hadoop by MD
• 95,460 points
10,593 views
0 votes
1 answer

How to remove duplicate records from Hive table?

A record is duplicate if there are ...READ MORE

Jul 11, 2019 in Big Data Hadoop by Bhuvan
10,423 views
0 votes
1 answer

How to check if a particular partition exists in Hive?

Hey, Basically, with the following query, we can ...READ MORE

Jun 26, 2019 in Big Data Hadoop by Gitika
• 65,730 points
10,368 views
0 votes
1 answer

Is there a way to copy data from one Hadoop Distributed File System (HDFS) to another HDFS?

The distributed copy command, distcp, is a ...READ MORE

Mar 22, 2018 in Big Data Hadoop by Ashish
• 2,650 points
10,249 views
+1 vote
2 answers

How to find the database daily refresh time / sync up time

I use this code to get the ...READ MORE

Sep 17, 2019 in Big Data Hadoop by Raman
10,225 views
0 votes
1 answer

Hadoop Hive: How to insert data in Hive table?

First, copy data into HDFS. Then create ...READ MORE

Nov 12, 2018 in Big Data Hadoop by Omkar
• 69,180 points
10,193 views
0 votes
1 answer

What is the use of sequence file in Hadoop?

Sequence files are binary files containing serialized ...READ MORE

Apr 6, 2018 in Big Data Hadoop by Ashish
• 2,650 points
10,188 views
0 votes
1 answer

Hadoop Hive: How to split a single row into multiple rows?

Try this SELECT ID1, Sub FROM tableName lateral view ...READ MORE

Nov 14, 2018 in Big Data Hadoop by Omkar
• 69,180 points
10,165 views
0 votes
2 answers

How to change the location of a table in Hive?

Changing location requires 2 steps: 1.) Change location ...READ MORE

Feb 12, 2020 in Big Data Hadoop by Saksham Sehrawet
9,907 views
+1 vote
1 answer

Hadoop: "DEPRECATED: Use of this script to execute hdfs command is deprecated." error

The use of dfs in your command is "Deprecated". ...READ MORE

Nov 26, 2018 in Big Data Hadoop by Omkar
• 69,180 points
9,858 views
0 votes
1 answer

How to find Hadoop version in CDH?

The simplest way to check Hadoop version ...READ MORE

Apr 19, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
9,752 views
+1 vote
1 answer

Where to set hadoop.tmp.dir? core-site.xml or hdfs-site.xml?

hadoop.tmp.dir (A base for other temporary directories) is ...READ MORE

Mar 26, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
9,741 views
0 votes
1 answer

Hadoop absolute path - pwd ?

"hdfs dfs -pwd" does not exist because ...READ MORE

Mar 27, 2018 in Big Data Hadoop by kurt_cobain
• 9,350 points
9,611 views
0 votes
1 answer

How to remove a znode in ZooKeeper?

Hey, Removes a specified znode and recursively all ...READ MORE

May 28, 2019 in Big Data Hadoop by Gitika
• 65,730 points
9,593 views
0 votes
1 answer

How can I append data to an existing file in HDFS?

You have to do some configurations as ...READ MORE

Jul 25, 2019 in Big Data Hadoop by ravikiran
• 4,620 points
9,575 views
0 votes
1 answer

Hadoop Hive: How to skip the first line of csv while loading in hive table?

You can try this: CREATE TABLE temp ...READ MORE

Nov 8, 2018 in Big Data Hadoop by Omkar
• 69,180 points
9,531 views
0 votes
1 answer

I am getting this log4j:ERROR setFile(null,true) call failed. java.io.FileNotFoundException: /ozone.log (Read-only file system)

Hi @Raj, I think you need to provide permission ...READ MORE

Dec 4, 2020 in Big Data Hadoop by MD
• 95,460 points
9,474 views
0 votes
1 answer

Can't find start-all.sh in Hadoop installation

Actually, you are not mentioning the path for ...READ MORE

Apr 17, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
9,435 views
0 votes
1 answer

Hive: Unable to insert values into an array column using "insert".

Make a dummy table that has at least one row. INSERT ...READ MORE

Dec 1, 2018 in Big Data Hadoop by Omkar
• 69,180 points
9,415 views
0 votes
1 answer

Speculative Execution in Hadoop

In Hadoop, Speculative Execution is a process ...READ MORE

Aug 28, 2018 in Big Data Hadoop by kurt_cobain
• 9,350 points
9,412 views
0 votes
1 answer

How does data get split in Sqoop?

I will drop the answer in the ...READ MORE

Jul 16, 2019 in Big Data Hadoop by ravikiran
• 4,620 points
9,406 views
0 votes
1 answer

Getting error in Hadoop: Output file already exists

When you executed your code earlier, you ...READ MORE

Apr 19, 2018 in Big Data Hadoop by Shubham
• 13,490 points
9,390 views
0 votes
1 answer

How can we use the IN/EXISTS operators in Hive?

Hey, Yes, now Hive supports IN or EXISTS, ...READ MORE

Jun 19, 2019 in Big Data Hadoop by Gitika
• 65,730 points
9,320 views
0 votes
2 answers

How do I output the results of a HiveQL query to CSV?

Hi, The insert overwrite directory is running in ...READ MORE

Dec 18, 2020 in Big Data Hadoop by akhtar
• 38,260 points
9,277 views
0 votes
1 answer

Load data from HDFS to hive

You can't directly create a parquet table. ...READ MORE

Feb 8, 2019 in Big Data Hadoop by Omkar
• 69,180 points
9,158 views
+1 vote
0 answers

Job failed as tasks failed. failedMaps

Hello all, I am new to Hadoop, I ...READ MORE

Aug 1, 2019 in Big Data Hadoop by Hemanth
• 250 points

edited Aug 1, 2019 by Omkar
9,127 views