Most viewed questions in Big Data Hadoop

+2 votes
1 answer

FileStreamSink: Error while looking for metadata directory. java.lang.IllegalArgumentException: java.net.UnknownHostException: hive

Hi@akhtar, You tried to read file from your ...READ MORE

Feb 13, 2020 in Big Data Hadoop by MD
• 95,440 points
10,555 views
0 votes
1 answer

How to import data in sqoop as a Parquet file?

Sqoop allows you to import the file ...READ MORE

May 15, 2019 in Big Data Hadoop by Nanda
10,370 views
0 votes
1 answer

What is the difference between S3n, S3a, and S3?

Your doubt is quite an interesting one. Yes, ...READ MORE

Jul 30, 2019 in Big Data Hadoop by ravikiran
• 4,620 points
10,345 views
0 votes
1 answer

How can anyone recover deleted HDFS data?

Yes, there is a trash policy in ...READ MORE

May 16, 2018 in Big Data Hadoop by Shubham
• 13,490 points
10,322 views
+1 vote
2 answers

Hadoop Hive: Not recognizing alias named in select part?

You can not use column aliases in ...READ MORE

Nov 26, 2019 in Big Data Hadoop by anonymous
10,314 views
0 votes
1 answer

Format HDFS Namenode Error: Could not find or load main class ”-Djava.library.path=.home.hadoop.hadoop-3.2.1.lib.native”

Hi@fwood, According to your configuration, you didn't set ...READ MORE

Jun 12, 2020 in Big Data Hadoop by MD
• 95,440 points
10,190 views
0 votes
2 answers

Hadoop fs -stat command

hadoop fs -stat is as hadoop command used ...READ MORE

Oct 25, 2018 in Big Data Hadoop by Omkar
• 69,210 points
10,167 views
0 votes
1 answer

How to uninstall all versions of hadoop completely from the system?

Do a: sudo dpkg -l | grep hadoop to ...READ MORE

Oct 29, 2018 in Big Data Hadoop by Frankie
• 9,830 points
10,101 views
0 votes
1 answer

How to create sequential znode in zookeeper?

Hey, To create a Sequential znode, add -s flag as shown ...READ MORE

May 28, 2019 in Big Data Hadoop by Gitika
• 65,910 points
10,099 views
0 votes
1 answer

Error: Failed to locate WinUtils binary in the Hadoop Binary Path

I will redirect you to a link ...READ MORE

Jun 20, 2019 in Big Data Hadoop by ravikiran
• 4,620 points
10,023 views
0 votes
3 answers

How to change the delimiter in Sqoop?

--fields-terminated-by <char> READ MORE

Jun 25, 2019 in Big Data Hadoop by anonymous
10,010 views
0 votes
1 answer

Namespace in HDFS?

Basically when we say Namespace we mean ...READ MORE

Apr 8, 2019 in Big Data Hadoop by Gitika
• 65,910 points
9,910 views
–1 vote
1 answer

How to start namenode and datanode individually?

You can use these commands. For namenode: ./hadoop-daemon.sh start ...READ MORE

Dec 21, 2018 in Big Data Hadoop by Omkar
• 69,210 points
9,901 views
0 votes
1 answer

How to select partition in Hive?

Follow these steps: A. Create Database ------------------ create database retail123; B. ...READ MORE

Feb 19, 2019 in Big Data Hadoop by Omkar
• 69,210 points
9,858 views
+1 vote
1 answer

ssh: connect to host localhost port 22: Connection refused in Hadoop.

Hi@akhtar, You need to uninstall SSH and reinstall ...READ MORE

Dec 18, 2020 in Big Data Hadoop by MD
• 95,440 points
9,669 views
0 votes
1 answer

Moving data to hdfs using copyFromLocal command

Actually dfs.data.dir and dfs.name.dir have to point ...READ MORE

Mar 27, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
9,569 views
0 votes
1 answer

How to select particular column In a table in hive?

Hive is a high-level language to analyze ...READ MORE

Jul 31, 2019 in Big Data Hadoop by Killian
9,559 views
0 votes
1 answer

Best way of starting & stopping the Hadoop daemons with command line

First way is to use start-all.sh & ...READ MORE

Apr 15, 2018 in Big Data Hadoop by Shubham
• 13,490 points
9,559 views
0 votes
1 answer

Read Parquet file using hdfs dfs -cat command

When you try to read a parquet ...READ MORE

May 24, 2019 in Big Data Hadoop by Calvin
9,530 views
0 votes
1 answer

Is there a way to copy data from one one Hadoop distributed file system(HDFS) to another HDFS?

The distributed copy command, distcp, is a ...READ MORE

Mar 22, 2018 in Big Data Hadoop by Ashish
• 2,650 points
9,497 views
0 votes
1 answer

Hadoop Hive: How to insert data in Hive table?

First, copy data into HDFS. Then create ...READ MORE

Nov 12, 2018 in Big Data Hadoop by Omkar
• 69,210 points
9,468 views
0 votes
1 answer

Opening a HDFS file in browser

In HDFS 8020 is IPC port, you ...READ MORE

Oct 29, 2018 in Big Data Hadoop by Frankie
• 9,830 points
9,319 views
0 votes
1 answer

How to remove duplicate records from Hive table?

A record is duplicate if there are ...READ MORE

Jul 11, 2019 in Big Data Hadoop by Bhuvan
9,308 views
0 votes
1 answer

What is the use of sequence file in Hadoop?

Sequence files are binary files containing serialized ...READ MORE

Apr 6, 2018 in Big Data Hadoop by Ashish
• 2,650 points
9,211 views
0 votes
1 answer

How to check if a particular partition exists in Hive?

Hey, Basically, with the following query, we can ...READ MORE

Jun 26, 2019 in Big Data Hadoop by Gitika
• 65,910 points
9,187 views
+1 vote
2 answers

How to find the database daily refresh time / sync up time

I use this code to get the ...READ MORE

Sep 17, 2019 in Big Data Hadoop by Raman
8,813 views
0 votes
1 answer

Error: FUNCTION 'NUCLEUS_ASCII' already exists.

Hi@akhtar, This error occurs when hive-shell started before ...READ MORE

Feb 5, 2020 in Big Data Hadoop by MD
• 95,440 points
8,681 views
0 votes
2 answers

How to know Hive and Hadoop versions from command prompt?

Hi, Hadoop and hive have their individual commands. ...READ MORE

Dec 18, 2020 in Big Data Hadoop by akhtar
• 38,230 points
8,632 views
0 votes
1 answer

Hadoop Hive: How to split a single row into multiple rows?

Try this SELECT ID1, Sub FROM tableName lateral view ...READ MORE

Nov 14, 2018 in Big Data Hadoop by Omkar
• 69,210 points
8,615 views
0 votes
1 answer

How does data gets split in Sqoop?

I will drop the answer in the ...READ MORE

Jul 16, 2019 in Big Data Hadoop by ravikiran
• 4,620 points
8,557 views
+1 vote
1 answer

Where to set hadoop.tmp.dir? core-site.xml or hdfs-site.xml?

hadoop.tmp.dir (A base for other temporary directories) is ...READ MORE

Mar 26, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
8,516 views
0 votes
1 answer

Hadoop Hive: How to skip the first line of csv while loading in hive table?

You can try this: CREATE TABLE temp ...READ MORE

Nov 8, 2018 in Big Data Hadoop by Omkar
• 69,210 points
8,455 views
0 votes
1 answer

Load data from HDFS to hive

You can't directly create a parquet table. ...READ MORE

Feb 8, 2019 in Big Data Hadoop by Omkar
• 69,210 points
8,392 views
0 votes
1 answer

How to find Hadoop version in CDH?

The simplest way to check Hadoop version ...READ MORE

Apr 19, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
8,376 views
0 votes
1 answer

I am getting this log4j:ERROR setFile(null,true) call failed. java.io.FileNotFoundException: /ozone.log (Read-only file system)

Hi@Raj, I think you need to provide permission ...READ MORE

Dec 4, 2020 in Big Data Hadoop by MD
• 95,440 points
8,307 views
0 votes
1 answer

Hadoop absolute path - pwd ?

"hdfs dfs -pwd" does not exist because ...READ MORE

Mar 27, 2018 in Big Data Hadoop by kurt_cobain
• 9,390 points
8,290 views
0 votes
2 answers

How to change the location of a table in hive?

Changing location requires 2 steps: 1.) Change location ...READ MORE

Feb 12, 2020 in Big Data Hadoop by Saksham Sehrawet
8,252 views
0 votes
1 answer

How to save Spark dataframe as dynamic partitioned table in Hive?

Hey, you can try something like this: df.write.partitionBy('year', ...READ MORE

Nov 6, 2018 in Big Data Hadoop by Omkar
• 69,210 points
8,208 views
0 votes
1 answer

How to remove a Znode in Zookeeper?

Hey, Removes a specified znode and recursively all ...READ MORE

May 28, 2019 in Big Data Hadoop by Gitika
• 65,910 points
8,201 views
+1 vote
1 answer

Hadoop: "DEPRECATED: Use of this script to execute hdfs command is deprecated." error

The use of dfs in your command is "Deprecated". ...READ MORE

Nov 26, 2018 in Big Data Hadoop by Omkar
• 69,210 points
8,171 views
+1 vote
1 answer

How to write file in hdfs using python?

#!/usr/bin/python from subprocess import Popen, PIPE cat = Popen(["hadoop", ...READ MORE

Dec 6, 2018 in Big Data Hadoop by Omkar
• 69,210 points
8,085 views
0 votes
1 answer

How can I append data to an existing file in HDFS?

You have to do some configurations as ...READ MORE

Jul 25, 2019 in Big Data Hadoop by ravikiran
• 4,620 points
8,076 views
0 votes
1 answer

How can we use IN/EXIST operator in Hive?

Hey, Yes, now Hive supports IN or EXIST, ...READ MORE

Jun 19, 2019 in Big Data Hadoop by Gitika
• 65,910 points
8,071 views
0 votes
1 answer

Getting error in Hadoop: Output file already exist

When you executed your code earlier, you ...READ MORE

Apr 19, 2018 in Big Data Hadoop by Shubham
• 13,490 points
8,064 views
0 votes
1 answer

Speculative Execution in Hadoop

In Hadoop, Speculative Execution is a process ...READ MORE

Aug 28, 2018 in Big Data Hadoop by kurt_cobain
• 9,390 points
8,026 views
0 votes
1 answer

How does Avro schema evolution work?

If you have one avro file and ...READ MORE

Sep 19, 2018 in Big Data Hadoop by Frankie
• 9,830 points
7,951 views
0 votes
1 answer

Error while starting the daemon process in windows 10

Hi@Arun, By default, the parameter "dfs.datanode.failed.volumes.tolerated" is set ...READ MORE

Apr 20, 2020 in Big Data Hadoop by MD
• 95,440 points
7,930 views
0 votes
1 answer

How to see the version of Zookeeper I am running?

Hey, First, you need to have a running ...READ MORE

May 31, 2019 in Big Data Hadoop by Gitika
• 65,910 points
7,926 views
0 votes
1 answer

Hive:Unable to insert values to an array column using "insert".

make a dummy table which has at least one row. INSERT ...READ MORE

Dec 1, 2018 in Big Data Hadoop by Omkar
• 69,210 points
7,881 views