Trending questions in Big Data Hadoop

0 votes
1 answer

How to remove duplicate records from Hive table?

A record is a duplicate if there are ...READ MORE

Jul 11, 2019 in Big Data Hadoop by Bhuvan
9,308 views
0 votes
1 answer

Difference between Cluster and nodes?

In a Hadoop distributed system, a node is a single ...READ MORE

Apr 9, 2019 in Big Data Hadoop by Gitika
• 65,910 points
13,180 views
+1 vote
1 answer

Unhandled internal error. Java heap space

Hi, as your file size is too large, ...READ MORE

Jan 21, 2020 in Big Data Hadoop by anonymous
759 views
0 votes
1 answer

Error: Failed to locate WinUtils binary in the Hadoop Binary Path

I will redirect you to a link ...READ MORE

Jun 20, 2019 in Big Data Hadoop by ravikiran
• 4,620 points
10,023 views
+1 vote
1 answer

What is Splunk?

Splunk is a software platform to search, ...READ MORE

Jan 24, 2020 in Big Data Hadoop by Vardhan
452 views
+1 vote
2 answers

Unable to create external table on HIVE

Hi @Ozer, This error occurred because there is ...READ MORE

Nov 7, 2019 in Big Data Hadoop by Richa
4,186 views
0 votes
1 answer

How does data get split in Sqoop?

I will drop the answer in the ...READ MORE

Jul 16, 2019 in Big Data Hadoop by ravikiran
• 4,620 points
8,557 views
0 votes
1 answer

How to install Hive on Windows Operating System?

First, download Hive from the following link: http://www.apache.org/dyn/closer.cgi/hive/ Next, ...READ MORE

Feb 27, 2019 in Big Data Hadoop by Kirti
14,538 views
0 votes
1 answer

How can I append data to an existing file in HDFS?

You have to do some configurations as ...READ MORE

Jul 25, 2019 in Big Data Hadoop by ravikiran
• 4,620 points
8,076 views
0 votes
1 answer

How to check if a particular partition exists in Hive?

Hey, Basically, with the following query, we can ...READ MORE

Jun 26, 2019 in Big Data Hadoop by Gitika
• 65,910 points
9,187 views
0 votes
3 answers

Sqoop: java.lang.RuntimeException: Could not load db driver class: com.mysql.jdbc.Driver

I added the MySQL driver for Sqoop and it ...READ MORE

Mar 1, 2020 in Big Data Hadoop by Manh Quang
• 180 points
6,833 views
0 votes
1 answer

Sqoop could not load db driver class

Hello, have you tried to put both ...READ MORE

Dec 13, 2019 in Big Data Hadoop by Alexandru
• 510 points
2,725 views
+1 vote
2 answers

Hadoop Hive: Not recognizing alias named in select part?

You cannot use column aliases in ...READ MORE

Nov 26, 2019 in Big Data Hadoop by anonymous
10,314 views
0 votes
1 answer

How to create a sequential znode in ZooKeeper?

Hey, to create a sequential znode, add the -s flag as shown ...READ MORE

May 28, 2019 in Big Data Hadoop by Gitika
• 65,910 points
10,099 views
0 votes
1 answer

java.lang.IllegalArgumentException: Unable to instantiate SparkSession with Hive support because Hive classes are not found.

Try using the below build.sbt, code.scala and command ...READ MORE

Jul 30, 2019 in Big Data Hadoop by Kishan
7,320 views
+1 vote
1 answer

How to append data to a parquet file?

Try using Spark API to append the ...READ MORE

Jan 11, 2019 in Big Data Hadoop by Omkar
• 69,210 points
15,811 views
0 votes
1 answer

Sqoop: ERROR tool.BaseSqoopTool: Error parsing arguments

There are two mistakes in the command ...READ MORE

Aug 2, 2019 in Big Data Hadoop by Kunal
7,060 views
0 votes
2 answers

Hive update and delete limitations

Vinyl-like crackle sounds. And what is my ...READ MORE

May 3, 2020 in Big Data Hadoop by Carter
• 140 points
5,615 views
0 votes
1 answer

How to import data with Sqoop as a Parquet file?

Sqoop allows you to import the file ...READ MORE

May 15, 2019 in Big Data Hadoop by Nanda
10,371 views
+1 vote
0 answers

Job failed as tasks failed. failedMaps

Hello all, I am new to Hadoop, I ...READ MORE

Aug 1, 2019 in Big Data Hadoop by Hemanth
• 250 points

edited Aug 1, 2019 by Omkar
6,990 views
0 votes
1 answer

Hive: Assign subquery to variable

Refer to this example: Step1: Check table test1 ...READ MORE

Jul 30, 2019 in Big Data Hadoop by Tina
7,037 views
0 votes
1 answer

How to convert a string to timestamp with milliseconds in Hive?

concatenation of substrings using the following code: select ...READ MORE

Oct 31, 2018 in Big Data Hadoop by Neha
• 6,300 points
18,709 views
+1 vote
0 answers

Hadoop MapReduce: Null Pointer Exception

While running the below program, I got ...READ MORE

Dec 6, 2019 in Big Data Hadoop by Tarun
• 160 points
1,344 views
0 votes
1 answer

Converting CSV files to Parquet

--Create Hive external Table for existing data CREATE ...READ MORE

Jul 30, 2019 in Big Data Hadoop by Yogi
6,875 views
+1 vote
1 answer

Flume Twitter data file not generated in HDFS

Seems like you've missed the source and ...READ MORE

Dec 2, 2019 in Big Data Hadoop by Esha
2,853 views
0 votes
2 answers

Function of “hadoop namenode -format” command

Namenode is the node in the Hadoop ...READ MORE

Feb 4, 2019 in Big Data Hadoop by Kanika
20,585 views
0 votes
1 answer

Read Parquet file using hdfs dfs -cat command

When you try to read a parquet ...READ MORE

May 24, 2019 in Big Data Hadoop by Calvin
9,530 views
0 votes
1 answer

What is "java.sql.SQLException: org.apache.thrift.transport.TTransportException" in Hive?

This particular exception is related to Hive logs. ...READ MORE

Jul 30, 2019 in Big Data Hadoop by ravikiran
• 4,620 points
6,579 views
+2 votes
5 answers

How to transpose/pivot data in Hive?

Below is another way to pivot: SELECT ...READ MORE

Oct 12, 2018 in Big Data Hadoop by Rahul
19,164 views
0 votes
1 answer

How can we use the IN/EXISTS operators in Hive?

Hey, yes, Hive now supports IN and EXISTS, ...READ MORE

Jun 19, 2019 in Big Data Hadoop by Gitika
• 65,910 points
8,072 views
0 votes
1 answer

How to replace characters in a string in Hive?

You can do it using regexp_replace. This is ...READ MORE

Nov 30, 2018 in Big Data Hadoop by Omkar
• 69,210 points
16,769 views
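
The answer above points to Hive's regexp_replace, which takes a string, a pattern, and a replacement. As a rough illustration of that behaviour outside Hive, here is a minimal Python sketch; the helper name simply mirrors the Hive function and is only an analog built on Python's re module. Hive evaluates patterns as Java regular expressions, so details such as back-reference syntax may differ from what Python accepts.

```python
import re


def regexp_replace(text, pattern, replacement):
    """Replace every substring of text that matches pattern with replacement.

    Python stand-in for the idea behind Hive's regexp_replace; not Hive itself.
    """
    return re.sub(pattern, replacement, text)


# Remove every "oo" or "ar" from the input string.
stripped = regexp_replace("foobar", "oo|ar", "")

# Swap one separator character for another.
slashed = regexp_replace("2018-11-30", "-", "/")

print(stripped)
print(slashed)
```

In Hive itself the equivalent call would sit inside a SELECT; the column and table involved depend on your own schema.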
+1 vote
1 answer

Namenode v/s Secondary namenode

Name Node is a primary node in ...READ MORE

Sep 9, 2019 in Big Data Hadoop by ravikiran
• 4,620 points
4,236 views
+1 vote
1 answer

"org.apache.hadoop.yarn.exceptions.YarnRuntimeException: Permissions incorrectly set for dir" problem

This error is caused when Hadoop tries ...READ MORE

Sep 9, 2019 in Big Data Hadoop by Kumar
4,203 views
0 votes
1 answer

Unzip files in hdfs

To unzip a gzipped (or bzipped) file, ...READ MORE

Jul 31, 2019 in Big Data Hadoop by Joe
5,751 views
+1 vote
1 answer

bash: sbin/hadoop: No such file or directory

The hadoop command is not present in ...READ MORE

Sep 18, 2019 in Big Data Hadoop by Omkar
• 69,210 points
3,561 views
+1 vote
1 answer

Datanode decommissioning

You have set the replication factor to ...READ MORE

Nov 28, 2019 in Big Data Hadoop by Rahul
538 views
0 votes
3 answers

Spark Scala: How to list all folders in directory

val spark = SparkSession.builder().appName("Demo").getOrCreate() val path = new ...READ MORE

Dec 5, 2018 in Big Data Hadoop by Mark
16,591 views
0 votes
1 answer

Why is Hive called a data warehouse?

Hive is a data warehouse infrastructure tool ...READ MORE

Jul 26, 2019 in Big Data Hadoop by Joshua
5,733 views
+2 votes
4 answers

Datanode process not running in Hadoop

Run the following commands: stop-all.sh start-dfs.sh start-yarn.sh mr-jobhistory-daemon.sh ...READ MORE

Oct 25, 2018 in Big Data Hadoop by Anand
17,371 views
0 votes
1 answer

How to remove a Znode in Zookeeper?

Hey, Removes a specified znode and recursively all ...READ MORE

May 28, 2019 in Big Data Hadoop by Gitika
• 65,910 points
8,201 views
+1 vote
0 answers

Oozie filter all coordinator jobs based on nominalTime

Hi Edureka team. I would like to ...READ MORE

Nov 12, 2019 in Big Data Hadoop by Thayanban
• 130 points
967 views
0 votes
1 answer

How to run a MapReduce program in the terminal?

You can reference the below steps: Step 1: ...READ MORE

Jan 31, 2019 in Big Data Hadoop by Srishti
13,147 views
0 votes
1 answer

How to see the version of Zookeeper I am running?

Hey, First, you need to have a running ...READ MORE

May 31, 2019 in Big Data Hadoop by Gitika
• 65,910 points
7,927 views
0 votes
3 answers

How to change the delimiter in Sqoop?

--fields-terminated-by <char> READ MORE

Jun 25, 2019 in Big Data Hadoop by anonymous
10,010 views
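
To show where the --fields-terminated-by option from the answer above fits, here is a small Python sketch that assembles a Sqoop import command line. The helper name, JDBC URL, table name, and target directory are all hypothetical placeholders; only the Sqoop options themselves (--connect, --table, --target-dir, --fields-terminated-by) are taken as given. The script only builds and prints the command, it does not run Sqoop.

```python
import shlex


def build_sqoop_import(connect, table, target_dir, delimiter):
    """Return the argument list for a Sqoop import with a custom field delimiter."""
    return [
        "sqoop", "import",
        "--connect", connect,
        "--table", table,
        "--target-dir", target_dir,
        "--fields-terminated-by", delimiter,
    ]


# Hypothetical connection string, table, and HDFS output directory.
argv = build_sqoop_import(
    "jdbc:mysql://localhost/exampledb",
    "employees",
    "/user/example/employees",
    "|",
)

# shlex.join quotes the pipe so a shell does not treat it as a pipeline.
command = shlex.join(argv)
print(command)
```

Passing the arguments as a list (for example to subprocess.run) avoids shell quoting problems with delimiters such as | or tab characters.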
0 votes
1 answer

Namespace in HDFS?

Basically when we say Namespace we mean ...READ MORE

Apr 8, 2019 in Big Data Hadoop by Gitika
• 65,910 points
9,910 views
0 votes
1 answer

Sqoop: ERROR tool.BaseSqoopTool: Unrecognized argument

The command you are executing is slightly ...READ MORE

Aug 1, 2019 in Big Data Hadoop by Sam
4,913 views
0 votes
2 answers

How can I list NameNode & DataNodes from any machine in the Hadoop cluster?

You can browse the Hadoop page from any ...READ MORE

Jan 23, 2020 in Big Data Hadoop by MD
• 95,440 points
11,158 views
0 votes
1 answer

Error: Container is running beyond Memory Limits

I had a similar problem while I was working ...READ MORE

Jun 20, 2019 in Big Data Hadoop by ravikiran
• 4,620 points
6,508 views
0 votes
1 answer

Sqoop - moving data from DB2 to HDFS (Hive partition table)

sqoop import --driver com.ibm.db2.jcc.DB2Driver --connect jdbc:db2://db2.my.com:50000/databaseName --username ...READ MORE

Aug 9, 2019 in Big Data Hadoop by Payal
4,285 views