Trending questions in Big Data Hadoop

0 votes
1 answer

How to remove duplicate records from Hive table?

A record is duplicate if there are ...READ MORE

Jul 11, 2019 in Big Data Hadoop by Bhuvan
9,344 views
0 votes
1 answer

Difference between Cluster and nodes?

In a Hadoop distributed system, a node is a single ...READ MORE

Apr 9, 2019 in Big Data Hadoop by Gitika
• 65,910 points
13,229 views
+1 vote
1 answer

Unhandled internal error. Java heap space

Hi, as your file size is too large, ...READ MORE

Jan 21, 2020 in Big Data Hadoop by anonymous
772 views
0 votes
1 answer

Error: Failed to locate WinUtils binary in the Hadoop Binary Path

I will redirect you to a link ...READ MORE

Jun 20, 2019 in Big Data Hadoop by ravikiran
• 4,620 points
10,062 views
+1 vote
1 answer

What is Splunk?

Splunk is a software platform to search, ...READ MORE

Jan 24, 2020 in Big Data Hadoop by Vardhan
464 views
+1 vote
2 answers

Unable to create external table on HIVE

Hi @Ozer, This error occurred because there is ...READ MORE

Nov 7, 2019 in Big Data Hadoop by Richa
4,238 views
0 votes
1 answer

How does data get split in Sqoop?

I will drop the answer in the ...READ MORE

Jul 16, 2019 in Big Data Hadoop by ravikiran
• 4,620 points
8,586 views
0 votes
1 answer

How to install Hive on Windows Operating System?

First, download Hive from the following link: http://www.apache.org/dyn/closer.cgi/hive/ Next, ...READ MORE

Feb 27, 2019 in Big Data Hadoop by Kirti
14,612 views
0 votes
1 answer

How can I append data to an existing file in HDFS?

You have to do some configurations as ...READ MORE

Jul 25, 2019 in Big Data Hadoop by ravikiran
• 4,620 points
8,119 views
0 votes
1 answer

How to check if a particular partition exists in Hive?

Hey, Basically, with the following query, we can ...READ MORE

Jun 26, 2019 in Big Data Hadoop by Gitika
• 65,910 points
9,256 views
0 votes
3 answers

Sqoop: java.lang.RuntimeException: Could not load db driver class: com.mysql.jdbc.Driver

I added the MySQL driver for Sqoop and it ...READ MORE

Mar 1, 2020 in Big Data Hadoop by Manh Quang
• 180 points
6,865 views
0 votes
1 answer

Sqoop could not load db driver class

Hello, have you tried to put both ...READ MORE

Dec 13, 2019 in Big Data Hadoop by Alexandru
• 510 points
2,758 views
+1 vote
2 answers

Hadoop Hive: Not recognizing alias named in select part?

You cannot use column aliases in ...READ MORE

Nov 26, 2019 in Big Data Hadoop by anonymous
10,380 views
0 votes
1 answer

How to create sequential znode in zookeeper?

Hey, To create a Sequential znode, add -s flag as shown ...READ MORE

May 28, 2019 in Big Data Hadoop by Gitika
• 65,910 points
10,112 views
0 votes
1 answer

java.lang.IllegalArgumentException: Unable to instantiate SparkSession with Hive support because Hive classes are not found.

Try using the below build.sbt, code.scala and command ...READ MORE

Jul 30, 2019 in Big Data Hadoop by Kishan
7,355 views
+1 vote
1 answer

How to append data to a parquet file?

Try using Spark API to append the ...READ MORE

Jan 11, 2019 in Big Data Hadoop by Omkar
• 69,230 points
15,874 views
0 votes
1 answer

Sqoop: ERROR tool.BaseSqoopTool: Error parsing arguments

There are two mistakes in the command ...READ MORE

Aug 2, 2019 in Big Data Hadoop by Kunal
7,111 views
0 votes
2 answers

Hive update and delete limitations

Vinyl-like crackle sounds. And what is my ...READ MORE

May 3, 2020 in Big Data Hadoop by Carter
• 140 points
5,667 views
0 votes
1 answer

How to import data in sqoop as a Parquet file?

Sqoop allows you to import the file ...READ MORE

May 15, 2019 in Big Data Hadoop by Nanda
10,429 views
+1 vote
0 answers

Job failed as tasks failed. failedMaps

Hello all, I am new to Hadoop, I ...READ MORE

Aug 1, 2019 in Big Data Hadoop by Hemanth
• 250 points

edited Aug 1, 2019 by Omkar
7,044 views
0 votes
1 answer

Hive: Assign subquery to variable

Refer to this example: Step1: Check table test1 ...READ MORE

Jul 30, 2019 in Big Data Hadoop by Tina
7,085 views
0 votes
1 answer

How to convert a string to timestamp with milliseconds in Hive?

concatenation of substrings using the following code: select ...READ MORE

Oct 31, 2018 in Big Data Hadoop by Neha
• 6,300 points
18,799 views
+1 vote
0 answers

Hadoop MapReduce: Null Pointer Exception

While running the below program, I got ...READ MORE

Dec 6, 2019 in Big Data Hadoop by Tarun
• 160 points
1,366 views
+1 vote
1 answer

Flume Twitter data file not generated in HDFS

Seems like you've missed the source and ...READ MORE

Dec 2, 2019 in Big Data Hadoop by Esha
2,917 views
0 votes
1 answer

Converting CSV files to Parquet

--Create Hive external Table for existing data CREATE ...READ MORE

Jul 30, 2019 in Big Data Hadoop by Yogi
6,908 views
0 votes
2 answers

Function of “hadoop namenode -format” command

Namenode is the node in the Hadoop ...READ MORE

Feb 4, 2019 in Big Data Hadoop by Kanika
20,646 views
0 votes
1 answer

Read Parquet file using hdfs dfs -cat command

When you try to read a parquet ...READ MORE

May 24, 2019 in Big Data Hadoop by Calvin
9,591 views
0 votes
1 answer

What is "java.sql.SQLException: org.apache.thrift.transport.TTransportException" in Hive?

This particular exception is related to Hive logs. ...READ MORE

Jul 30, 2019 in Big Data Hadoop by ravikiran
• 4,620 points
6,602 views
+2 votes
5 answers

How to transpose/pivot data in hive?

Below is also a way for Pivot SELECT ...READ MORE

Oct 12, 2018 in Big Data Hadoop by Rahul
19,209 views
0 votes
1 answer

How to replace characters in string in hive?

You can do it using regexp_replace. This is ...READ MORE

Nov 30, 2018 in Big Data Hadoop by Omkar
• 69,230 points
16,831 views
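
To illustrate the regexp_replace approach mentioned in the answer above, here is a minimal sketch in Python rather than HiveQL. It assumes Hive's `regexp_replace(initial_string, pattern, replacement)` argument order and uses Python's `re.sub` as a stand-in; the sample values are hypothetical. Hive uses Java regular expressions, so details such as back-reference syntax (`$1` in Java, `\1` in Python) may differ.

```python
import re


def regexp_replace(initial: str, pattern: str, replacement: str) -> str:
    """Rough Python stand-in for Hive's regexp_replace (assumed argument order)."""
    # Replace every match of `pattern` in `initial` with `replacement`.
    return re.sub(pattern, replacement, initial)


# Hypothetical sample value: swap the dashes in a date string for slashes.
cleaned = regexp_replace("2019-07-30", "-", "/")
print(cleaned)
```

In Hive itself the equivalent call would sit inside a SELECT, with the column name in place of the literal string.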
0 votes
1 answer

How can we use the IN/EXISTS operators in Hive?

Hey, yes, Hive now supports IN or EXISTS, ...READ MORE

Jun 19, 2019 in Big Data Hadoop by Gitika
• 65,910 points
8,117 views
+1 vote
1 answer

Namenode v/s Secondary namenode

Name Node is a primary node in ...READ MORE

Sep 9, 2019 in Big Data Hadoop by ravikiran
• 4,620 points
4,285 views
+1 vote
1 answer

"org.apache.hadoop.yarn.exceptions.YarnRuntimeException: Permissions incorrectly set for dir" problem

This error is caused when hadoop tries ...READ MORE

Sep 9, 2019 in Big Data Hadoop by Kumar
4,238 views
0 votes
1 answer

Unzip files in hdfs

To unzip a gzipped (or bzipped) file, ...READ MORE

Jul 31, 2019 in Big Data Hadoop by Joe
5,815 views
+1 vote
1 answer

bash: sbin/hadoop: No such file or directory

The hadoop command is not present in ...READ MORE

Sep 18, 2019 in Big Data Hadoop by Omkar
• 69,230 points
3,617 views
+1 vote
1 answer

Datanode decommissioning

You have set the replication factor to ...READ MORE

Nov 28, 2019 in Big Data Hadoop by Rahul
547 views
0 votes
3 answers

Spark Scala: How to list all folders in directory

val spark = SparkSession.builder().appName("Demo").getOrCreate() val path = new ...READ MORE

Dec 5, 2018 in Big Data Hadoop by Mark
16,664 views
+2 votes
4 answers

Datanode process not running in Hadoop

Run the following commands: stop-all.sh start-dfs.sh start-yarn.sh mr-jobhistory-daemon.sh ...READ MORE

Oct 25, 2018 in Big Data Hadoop by Anand
17,504 views
0 votes
1 answer

Why is Hive called a data warehouse?

Hive is a data warehouse infrastructure tool ...READ MORE

Jul 26, 2019 in Big Data Hadoop by Joshua
5,783 views
0 votes
1 answer

How to remove a Znode in Zookeeper?

Hey, Removes a specified znode and recursively all ...READ MORE

May 28, 2019 in Big Data Hadoop by Gitika
• 65,910 points
8,234 views
+1 vote
0 answers

Oozie filter all coordinator jobs based on nominalTime

Hi Edureka team. I would like to ...READ MORE

Nov 12, 2019 in Big Data Hadoop by Thayanban
• 130 points
981 views
0 votes
1 answer

How to run a MapReduce program in the terminal?

You can reference the below steps: Step 1: ...READ MORE

Jan 31, 2019 in Big Data Hadoop by Srishti
13,221 views
0 votes
1 answer

How to see the version of Zookeeper I am running?

Hey, First, you need to have a running ...READ MORE

May 31, 2019 in Big Data Hadoop by Gitika
• 65,910 points
7,965 views
0 votes
3 answers

How to change the delimiter in Sqoop?

--fields-terminated-by <char> READ MORE

Jun 25, 2019 in Big Data Hadoop by anonymous
10,067 views
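
As a rough illustration of where the `--fields-terminated-by` option from the answer above fits, the following Python sketch assembles a Sqoop import command line. The JDBC URL, table name, target directory and the helper function are all hypothetical; only the Sqoop flags themselves are taken as given. The sketch builds and prints the command and does not run Sqoop.

```python
import shlex


def sqoop_import_command(jdbc_url, table, target_dir, delimiter):
    """Build the argument list for a Sqoop import with a custom field delimiter.

    Hypothetical helper for illustration; it only assembles the arguments.
    """
    return [
        "sqoop", "import",
        "--connect", jdbc_url,
        "--table", table,
        "--target-dir", target_dir,
        # The option from the answer: the character placed between fields.
        "--fields-terminated-by", delimiter,
    ]


# Hypothetical connection details; replace with real values before use.
cmd = sqoop_import_command(
    "jdbc:mysql://localhost/testdb", "employees", "/user/hadoop/employees", "|"
)
# shlex.join quotes shell-special characters such as the pipe delimiter.
print(shlex.join(cmd))
```

Quoting matters here because a character like `|` would otherwise be interpreted by the shell instead of being passed to Sqoop.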
0 votes
1 answer

Namespace in HDFS?

Basically when we say Namespace we mean ...READ MORE

Apr 8, 2019 in Big Data Hadoop by Gitika
• 65,910 points
9,983 views
0 votes
1 answer

Sqoop: ERROR tool.BaseSqoopTool: Unrecognized argument

The command you are executing is slightly ...READ MORE

Aug 1, 2019 in Big Data Hadoop by Sam
4,955 views
0 votes
2 answers

How can I list NameNode & DataNodes from any machine in the Hadoop cluster?

You can browse hadoop page from any ...READ MORE

Jan 23, 2020 in Big Data Hadoop by MD
• 95,440 points
11,192 views
0 votes
1 answer

Error: Container is running beyond Memory Limits

I had a similar problem while I was working ...READ MORE

Jun 20, 2019 in Big Data Hadoop by ravikiran
• 4,620 points
6,538 views
0 votes
1 answer

Sqoop - moving data from DB2 to HDFS (Hive partitioned table)

sqoop import --driver com.ibm.db2.jcc.DB2Driver --connect jdbc:db2://db2.my.com:50000/databaseName --username ...READ MORE

Aug 9, 2019 in Big Data Hadoop by Payal
4,310 views