Most viewed questions in Big Data Hadoop

+2 votes
1 answer

FileStreamSink: Error while looking for metadata directory. java.lang.IllegalArgumentException: java.net.UnknownHostException: hive

Hi@akhtar, You tried to read file from your ...READ MORE

Feb 13, 2020 in Big Data Hadoop by MD
• 95,440 points
10,620 views
0 votes
1 answer

How to import data in sqoop as a Parquet file?

Sqoop allows you to import the file ...READ MORE

May 15, 2019 in Big Data Hadoop by Nanda
10,436 views
0 votes
1 answer

What is the difference between S3n, S3a, and S3?

Your doubt is quite an interesting one. Yes, ...READ MORE

Jul 30, 2019 in Big Data Hadoop by ravikiran
• 4,620 points
10,406 views
+1 vote
2 answers

Hadoop Hive: Not recognizing alias named in select part?

You cannot use column aliases in ...READ MORE

Nov 26, 2019 in Big Data Hadoop by anonymous
10,390 views
0 votes
1 answer

How can anyone recover deleted HDFS data?

Yes, there is a trash policy in ...READ MORE

May 16, 2018 in Big Data Hadoop by Shubham
• 13,490 points
10,386 views
0 votes
1 answer

Format HDFS Namenode Error: Could not find or load main class "-Djava.library.path=.home.hadoop.hadoop-3.2.1.lib.native"

Hi@fwood, According to your configuration, you didn't set ...READ MORE

Jun 12, 2020 in Big Data Hadoop by MD
• 95,440 points
10,289 views
0 votes
2 answers

Hadoop fs -stat command

hadoop fs -stat is a Hadoop command used ...READ MORE

Oct 25, 2018 in Big Data Hadoop by Omkar
• 69,230 points
10,251 views
0 votes
1 answer

How to uninstall all versions of hadoop completely from the system?

Do a: sudo dpkg -l | grep hadoop to ...READ MORE

Oct 29, 2018 in Big Data Hadoop by Frankie
• 9,830 points
10,176 views
0 votes
1 answer

How to create sequential znode in zookeeper?

Hey, To create a Sequential znode, add -s flag as shown ...READ MORE

May 28, 2019 in Big Data Hadoop by Gitika
• 65,910 points
10,114 views
0 votes
3 answers

How to change the delimiter in Sqoop?

--fields-terminated-by <char> READ MORE

Jun 25, 2019 in Big Data Hadoop by anonymous
10,076 views
0 votes
1 answer

Error: Failed to locate WinUtils binary in the Hadoop Binary Path

I will redirect you to a link ...READ MORE

Jun 20, 2019 in Big Data Hadoop by ravikiran
• 4,620 points
10,067 views
–1 vote
1 answer

How to start namenode and datanode individually?

You can use these commands. For namenode: ./hadoop-daemon.sh start ...READ MORE

Dec 21, 2018 in Big Data Hadoop by Omkar
• 69,230 points
10,029 views
0 votes
1 answer

Namespace in HDFS?

Basically when we say Namespace we mean ...READ MORE

Apr 8, 2019 in Big Data Hadoop by Gitika
• 65,910 points
9,995 views
0 votes
1 answer

How to select partition in Hive?

Follow these steps: A. Create Database ------------------ create database retail123; B. ...READ MORE

Feb 19, 2019 in Big Data Hadoop by Omkar
• 69,230 points
9,902 views
+1 vote
1 answer

ssh: connect to host localhost port 22: Connection refused in Hadoop.

Hi@akhtar, You need to uninstall SSH and reinstall ...READ MORE

Dec 18, 2020 in Big Data Hadoop by MD
• 95,440 points
9,731 views
0 votes
1 answer

How to select a particular column in a table in Hive?

Hive is a high-level language to analyze ...READ MORE

Jul 31, 2019 in Big Data Hadoop by Killian
9,645 views
0 votes
1 answer

Moving data to hdfs using copyFromLocal command

Actually dfs.data.dir and dfs.name.dir have to point ...READ MORE

Mar 27, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
9,632 views
0 votes
1 answer

Best way of starting & stopping the Hadoop daemons with command line

First way is to use start-all.sh & ...READ MORE

Apr 15, 2018 in Big Data Hadoop by Shubham
• 13,490 points
9,603 views
0 votes
1 answer

Read Parquet file using hdfs dfs -cat command

When you try to read a parquet ...READ MORE

May 24, 2019 in Big Data Hadoop by Calvin
9,597 views
0 votes
1 answer

Is there a way to copy data from one Hadoop distributed file system (HDFS) to another HDFS?

The distributed copy command, distcp, is a ...READ MORE

Mar 22, 2018 in Big Data Hadoop by Ashish
• 2,650 points
9,517 views
0 votes
1 answer

Hadoop Hive: How to insert data in Hive table?

First, copy data into HDFS. Then create ...READ MORE

Nov 12, 2018 in Big Data Hadoop by Omkar
• 69,230 points
9,507 views
0 votes
1 answer

Opening an HDFS file in browser

In HDFS, 8020 is the IPC port, you ...READ MORE

Oct 29, 2018 in Big Data Hadoop by Frankie
• 9,830 points
9,388 views
0 votes
1 answer

How to remove duplicate records from Hive table?

A record is duplicate if there are ...READ MORE

Jul 11, 2019 in Big Data Hadoop by Bhuvan
9,347 views
0 votes
1 answer

How to check if a particular partition exists in Hive?

Hey, Basically, with the following query, we can ...READ MORE

Jun 26, 2019 in Big Data Hadoop by Gitika
• 65,910 points
9,262 views
0 votes
1 answer

What is the use of sequence file in Hadoop?

Sequence files are binary files containing serialized ...READ MORE

Apr 6, 2018 in Big Data Hadoop by Ashish
• 2,650 points
9,250 views
+1 vote
2 answers

How to find the database daily refresh time / sync up time

I use this code to get the ...READ MORE

Sep 17, 2019 in Big Data Hadoop by Raman
8,898 views
0 votes
2 answers

How to know Hive and Hadoop versions from command prompt?

Hi, Hadoop and hive have their individual commands. ...READ MORE

Dec 18, 2020 in Big Data Hadoop by akhtar
• 38,230 points
8,829 views
0 votes
1 answer

Error: FUNCTION 'NUCLEUS_ASCII' already exists.

Hi@akhtar, This error occurs when hive-shell started before ...READ MORE

Feb 5, 2020 in Big Data Hadoop by MD
• 95,440 points
8,770 views
0 votes
1 answer

Hadoop Hive: How to split a single row into multiple rows?

Try this SELECT ID1, Sub FROM tableName lateral view ...READ MORE

Nov 14, 2018 in Big Data Hadoop by Omkar
• 69,230 points
8,698 views
0 votes
1 answer

How does data get split in Sqoop?

I will drop the answer in the ...READ MORE

Jul 16, 2019 in Big Data Hadoop by ravikiran
• 4,620 points
8,593 views
+1 vote
1 answer

Where to set hadoop.tmp.dir? core-site.xml or hdfs-site.xml?

hadoop.tmp.dir (A base for other temporary directories) is ...READ MORE

Mar 26, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
8,568 views
0 votes
1 answer

Hadoop Hive: How to skip the first line of csv while loading in hive table?

You can try this: CREATE TABLE temp ...READ MORE

Nov 8, 2018 in Big Data Hadoop by Omkar
• 69,230 points
8,488 views
0 votes
1 answer

How to find Hadoop version in CDH?

The simplest way to check Hadoop version ...READ MORE

Apr 19, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
8,440 views
0 votes
1 answer

Load data from HDFS to hive

You can't directly create a parquet table. ...READ MORE

Feb 8, 2019 in Big Data Hadoop by Omkar
• 69,230 points
8,435 views
0 votes
1 answer

I am getting this log4j:ERROR setFile(null,true) call failed. java.io.FileNotFoundException: /ozone.log (Read-only file system)

Hi@Raj, I think you need to provide permission ...READ MORE

Dec 4, 2020 in Big Data Hadoop by MD
• 95,440 points
8,373 views
0 votes
1 answer

Hadoop absolute path - pwd ?

"hdfs dfs -pwd" does not exist because ...READ MORE

Mar 27, 2018 in Big Data Hadoop by kurt_cobain
• 9,390 points
8,342 views
0 votes
2 answers

How to change the location of a table in hive?

Changing location requires 2 steps: 1.) Change location ...READ MORE

Feb 12, 2020 in Big Data Hadoop by Saksham Sehrawet
8,329 views
+1 vote
1 answer

Hadoop: "DEPRECATED: Use of this script to execute hdfs command is deprecated." error

The use of dfs in your command is "Deprecated". ...READ MORE

Nov 26, 2018 in Big Data Hadoop by Omkar
• 69,230 points
8,251 views
0 votes
1 answer

How to remove a Znode in Zookeeper?

Hey, Removes a specified znode and recursively all ...READ MORE

May 28, 2019 in Big Data Hadoop by Gitika
• 65,910 points
8,244 views
0 votes
1 answer

How to save Spark dataframe as dynamic partitioned table in Hive?

Hey, you can try something like this: df.write.partitionBy('year', ...READ MORE

Nov 6, 2018 in Big Data Hadoop by Omkar
• 69,230 points
8,231 views
0 votes
1 answer

How can I append data to an existing file in HDFS?

You have to do some configurations as ...READ MORE

Jul 25, 2019 in Big Data Hadoop by ravikiran
• 4,620 points
8,127 views
0 votes
1 answer

How can we use the IN/EXISTS operators in Hive?

Hey, Yes, now Hive supports IN or EXISTS, ...READ MORE

Jun 19, 2019 in Big Data Hadoop by Gitika
• 65,910 points
8,127 views
0 votes
1 answer

Getting error in Hadoop: Output file already exists

When you executed your code earlier, you ...READ MORE

Apr 19, 2018 in Big Data Hadoop by Shubham
• 13,490 points
8,115 views
+1 vote
1 answer

How to write file in hdfs using python?

#!/usr/bin/python from subprocess import Popen, PIPE cat = Popen(["hadoop", ...READ MORE

Dec 6, 2018 in Big Data Hadoop by Omkar
• 69,230 points
8,105 views
0 votes
1 answer

Speculative Execution in Hadoop

In Hadoop, Speculative Execution is a process ...READ MORE

Aug 28, 2018 in Big Data Hadoop by kurt_cobain
• 9,390 points
8,077 views
0 votes
1 answer

Error while starting the daemon process in windows 10

Hi@Arun, By default, the parameter "dfs.datanode.failed.volumes.tolerated" is set ...READ MORE

Apr 20, 2020 in Big Data Hadoop by MD
• 95,440 points
7,983 views
0 votes
1 answer

How does Avro schema evolution work?

If you have one avro file and ...READ MORE

Sep 19, 2018 in Big Data Hadoop by Frankie
• 9,830 points
7,977 views
0 votes
1 answer

How to see the version of Zookeeper I am running?

Hey, First, you need to have a running ...READ MORE

May 31, 2019 in Big Data Hadoop by Gitika
• 65,910 points
7,973 views
0 votes
1 answer

Hive: Unable to insert values into an array column using "insert".

Make a dummy table which has at least one row. INSERT ...READ MORE

Dec 1, 2018 in Big Data Hadoop by Omkar
• 69,230 points
7,933 views