Most viewed questions in Big Data Hadoop

+2 votes
1 answer

FileStreamSink: Error while looking for metadata directory. java.lang.IllegalArgumentException: java.net.UnknownHostException: hive

Hi@akhtar, You tried to read file from your ...READ MORE

Feb 13, 2020 in Big Data Hadoop by MD
• 95,440 points
10,531 views
0 votes
1 answer

How to import data in sqoop as a Parquet file?

Sqoop allows you to import the file ...READ MORE

May 15, 2019 in Big Data Hadoop by Nanda
10,363 views
0 votes
1 answer

What is the difference between S3n, S3a, and S3?

Your doubt is quite an interesting one. Yes, ...READ MORE

Jul 30, 2019 in Big Data Hadoop by ravikiran
• 4,620 points
10,322 views
0 votes
1 answer

How can anyone recover deleted HDFS data?

Yes, there is a trash policy in ...READ MORE

May 16, 2018 in Big Data Hadoop by Shubham
• 13,490 points
10,299 views
+1 vote
2 answers

Hadoop Hive: Not recognizing alias named in select part?

You can not use column aliases in ...READ MORE

Nov 26, 2019 in Big Data Hadoop by anonymous
10,289 views
0 votes
1 answer

Format HDFS Namenode Error: Could not find or load main class ”-Djava.library.path=.home.hadoop.hadoop-3.2.1.lib.native”

Hi@fwood, According to your configuration, you didn't set ...READ MORE

Jun 12, 2020 in Big Data Hadoop by MD
• 95,440 points
10,164 views
0 votes
2 answers

Hadoop fs -stat command

hadoop fs -stat is as hadoop command used ...READ MORE

Oct 25, 2018 in Big Data Hadoop by Omkar
• 69,210 points
10,150 views
0 votes
1 answer

How to create sequential znode in zookeeper?

Hey, To create a Sequential znode, add -s flag as shown ...READ MORE

May 28, 2019 in Big Data Hadoop by Gitika
• 65,910 points
10,093 views
0 votes
1 answer

How to uninstall all versions of hadoop completely from the system?

Do a: sudo dpkg -l | grep hadoop to ...READ MORE

Oct 29, 2018 in Big Data Hadoop by Frankie
• 9,830 points
10,081 views
0 votes
1 answer

Error: Failed to locate WinUtils binary in the Hadoop Binary Path

I will redirect you to a link ...READ MORE

Jun 20, 2019 in Big Data Hadoop by ravikiran
• 4,620 points
10,009 views
0 votes
3 answers

How to change the delimiter in Sqoop?

--fields-terminated-by <char> READ MORE

Jun 25, 2019 in Big Data Hadoop by anonymous
9,991 views
0 votes
1 answer

Namespace in HDFS?

Basically when we say Namespace we mean ...READ MORE

Apr 8, 2019 in Big Data Hadoop by Gitika
• 65,910 points
9,890 views
–1 vote
1 answer

How to start namenode and datanode individually?

You can use these commands. For namenode: ./hadoop-daemon.sh start ...READ MORE

Dec 21, 2018 in Big Data Hadoop by Omkar
• 69,210 points
9,863 views
0 votes
1 answer

How to select partition in Hive?

Follow these steps: A. Create Database ------------------ create database retail123; B. ...READ MORE

Feb 19, 2019 in Big Data Hadoop by Omkar
• 69,210 points
9,849 views
+1 vote
1 answer

ssh: connect to host localhost port 22: Connection refused in Hadoop.

Hi@akhtar, You need to uninstall SSH and reinstall ...READ MORE

Dec 18, 2020 in Big Data Hadoop by MD
• 95,440 points
9,651 views
0 votes
1 answer

Moving data to hdfs using copyFromLocal command

Actually dfs.data.dir and dfs.name.dir have to point ...READ MORE

Mar 27, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
9,553 views
0 votes
1 answer

Best way of starting & stopping the Hadoop daemons with command line

First way is to use start-all.sh & ...READ MORE

Apr 15, 2018 in Big Data Hadoop by Shubham
• 13,490 points
9,546 views
0 votes
1 answer

How to select particular column In a table in hive?

Hive is a high-level language to analyze ...READ MORE

Jul 31, 2019 in Big Data Hadoop by Killian
9,542 views
0 votes
1 answer

Read Parquet file using hdfs dfs -cat command

When you try to read a parquet ...READ MORE

May 24, 2019 in Big Data Hadoop by Calvin
9,512 views
0 votes
1 answer

Is there a way to copy data from one one Hadoop distributed file system(HDFS) to another HDFS?

The distributed copy command, distcp, is a ...READ MORE

Mar 22, 2018 in Big Data Hadoop by Ashish
• 2,650 points
9,493 views
0 votes
1 answer

Hadoop Hive: How to insert data in Hive table?

First, copy data into HDFS. Then create ...READ MORE

Nov 12, 2018 in Big Data Hadoop by Omkar
• 69,210 points
9,464 views
0 votes
1 answer

How to remove duplicate records from Hive table?

A record is duplicate if there are ...READ MORE

Jul 11, 2019 in Big Data Hadoop by Bhuvan
9,297 views
0 votes
1 answer

Opening a HDFS file in browser

In HDFS 8020 is IPC port, you ...READ MORE

Oct 29, 2018 in Big Data Hadoop by Frankie
• 9,830 points
9,296 views
0 votes
1 answer

What is the use of sequence file in Hadoop?

Sequence files are binary files containing serialized ...READ MORE

Apr 6, 2018 in Big Data Hadoop by Ashish
• 2,650 points
9,204 views
0 votes
1 answer

How to check if a particular partition exists in Hive?

Hey, Basically, with the following query, we can ...READ MORE

Jun 26, 2019 in Big Data Hadoop by Gitika
• 65,910 points
9,158 views
+1 vote
2 answers

How to find the database daily refresh time / sync up time

I use this code to get the ...READ MORE

Sep 17, 2019 in Big Data Hadoop by Raman
8,794 views
0 votes
1 answer

Error: FUNCTION 'NUCLEUS_ASCII' already exists.

Hi@akhtar, This error occurs when hive-shell started before ...READ MORE

Feb 5, 2020 in Big Data Hadoop by MD
• 95,440 points
8,661 views
0 votes
1 answer

Hadoop Hive: How to split a single row into multiple rows?

Try this SELECT ID1, Sub FROM tableName lateral view ...READ MORE

Nov 14, 2018 in Big Data Hadoop by Omkar
• 69,210 points
8,600 views
0 votes
2 answers

How to know Hive and Hadoop versions from command prompt?

Hi, Hadoop and hive have their individual commands. ...READ MORE

Dec 18, 2020 in Big Data Hadoop by akhtar
• 38,230 points
8,551 views
0 votes
1 answer

How does data gets split in Sqoop?

I will drop the answer in the ...READ MORE

Jul 16, 2019 in Big Data Hadoop by ravikiran
• 4,620 points
8,547 views
+1 vote
1 answer

Where to set hadoop.tmp.dir? core-site.xml or hdfs-site.xml?

hadoop.tmp.dir (A base for other temporary directories) is ...READ MORE

Mar 26, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
8,503 views
0 votes
1 answer

Hadoop Hive: How to skip the first line of csv while loading in hive table?

You can try this: CREATE TABLE temp ...READ MORE

Nov 8, 2018 in Big Data Hadoop by Omkar
• 69,210 points
8,443 views
0 votes
1 answer

Load data from HDFS to hive

You can't directly create a parquet table. ...READ MORE

Feb 8, 2019 in Big Data Hadoop by Omkar
• 69,210 points
8,374 views
0 votes
1 answer

How to find Hadoop version in CDH?

The simplest way to check Hadoop version ...READ MORE

Apr 19, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
8,360 views
0 votes
1 answer

I am getting this log4j:ERROR setFile(null,true) call failed. java.io.FileNotFoundException: /ozone.log (Read-only file system)

Hi@Raj, I think you need to provide permission ...READ MORE

Dec 4, 2020 in Big Data Hadoop by MD
• 95,440 points
8,291 views
0 votes
1 answer

Hadoop absolute path - pwd ?

"hdfs dfs -pwd" does not exist because ...READ MORE

Mar 27, 2018 in Big Data Hadoop by kurt_cobain
• 9,390 points
8,273 views
0 votes
2 answers

How to change the location of a table in hive?

Changing location requires 2 steps: 1.) Change location ...READ MORE

Feb 12, 2020 in Big Data Hadoop by Saksham Sehrawet
8,230 views
0 votes
1 answer

How to save Spark dataframe as dynamic partitioned table in Hive?

Hey, you can try something like this: df.write.partitionBy('year', ...READ MORE

Nov 6, 2018 in Big Data Hadoop by Omkar
• 69,210 points
8,204 views
0 votes
1 answer

How to remove a Znode in Zookeeper?

Hey, Removes a specified znode and recursively all ...READ MORE

May 28, 2019 in Big Data Hadoop by Gitika
• 65,910 points
8,182 views
+1 vote
1 answer

Hadoop: "DEPRECATED: Use of this script to execute hdfs command is deprecated." error

The use of dfs in your command is "Deprecated". ...READ MORE

Nov 26, 2018 in Big Data Hadoop by Omkar
• 69,210 points
8,142 views
+1 vote
1 answer

How to write file in hdfs using python?

#!/usr/bin/python from subprocess import Popen, PIPE cat = Popen(["hadoop", ...READ MORE

Dec 6, 2018 in Big Data Hadoop by Omkar
• 69,210 points
8,081 views
0 votes
1 answer

How can I append data to an existing file in HDFS?

You have to do some configurations as ...READ MORE

Jul 25, 2019 in Big Data Hadoop by ravikiran
• 4,620 points
8,058 views
0 votes
1 answer

How can we use IN/EXIST operator in Hive?

Hey, Yes, now Hive supports IN or EXIST, ...READ MORE

Jun 19, 2019 in Big Data Hadoop by Gitika
• 65,910 points
8,055 views
0 votes
1 answer

Getting error in Hadoop: Output file already exist

When you executed your code earlier, you ...READ MORE

Apr 19, 2018 in Big Data Hadoop by Shubham
• 13,490 points
8,050 views
0 votes
1 answer

Speculative Execution in Hadoop

In Hadoop, Speculative Execution is a process ...READ MORE

Aug 28, 2018 in Big Data Hadoop by kurt_cobain
• 9,390 points
8,016 views
0 votes
1 answer

How does Avro schema evolution work?

If you have one avro file and ...READ MORE

Sep 19, 2018 in Big Data Hadoop by Frankie
• 9,830 points
7,948 views
0 votes
1 answer

How to see the version of Zookeeper I am running?

Hey, First, you need to have a running ...READ MORE

May 31, 2019 in Big Data Hadoop by Gitika
• 65,910 points
7,917 views
0 votes
1 answer

Error while starting the daemon process in windows 10

Hi@Arun, By default, the parameter "dfs.datanode.failed.volumes.tolerated" is set ...READ MORE

Apr 20, 2020 in Big Data Hadoop by MD
• 95,440 points
7,905 views
0 votes
1 answer

Hive:Unable to insert values to an array column using "insert".

make a dummy table which has at least one row. INSERT ...READ MORE

Dec 1, 2018 in Big Data Hadoop by Omkar
• 69,210 points
7,850 views