Recent questions tagged developer

0 votes
1 answer
0 votes
1 answer
0 votes
1 answer

Sqoop Metastore ?

Jul 19, 2018 in Big Data Hadoop by shams
• 3,670 points
1,459 views
0 votes
1 answer
0 votes
1 answer
0 votes
1 answer

How Namenode handles data node failures?

Jul 11, 2018 in Big Data Hadoop by Shubham
• 13,490 points
6,672 views
0 votes
1 answer

Kafka topic not being deleted

Jul 9, 2018 in Apache Kafka by Shubham
• 13,490 points
3,267 views
+1 vote
8 answers

How to print the contents of RDD in Apache Spark?

Jul 6, 2018 in Apache Spark by Shubham
• 13,490 points
62,463 views
0 votes
2 answers

How to use RDD filter with other function?

Jul 5, 2018 in Apache Spark by Shubham
• 13,490 points
10,121 views
0 votes
1 answer

How to add third party java jars for use in PySpark?

Jul 4, 2018 in Apache Spark by Shubham
• 13,490 points
8,971 views
0 votes
1 answer
0 votes
1 answer
+1 vote
1 answer

map vs mapValues in Spark

Jun 29, 2018 in Apache Spark by Shubham
• 13,490 points
16,450 views
+1 vote
3 answers

Which cluster type should I choose for Spark?

Jun 27, 2018 in Apache Spark by Shubham
• 13,490 points
1,899 views
0 votes
1 answer

Which is better in term of speed, Shark or Spark?

Jun 26, 2018 in Apache Spark by Shubham
• 13,490 points
1,085 views
0 votes
1 answer

Spark Driver roles

Jun 21, 2018 in Apache Spark by shams
• 3,670 points
1,230 views
0 votes
1 answer

Spark standalone client mode

Jun 20, 2018 in Apache Spark by shams
• 3,670 points
1,294 views
0 votes
1 answer

Ways to create RDD in Apache Spark

Jun 19, 2018 in Apache Spark by Shubham
• 13,490 points
4,276 views
0 votes
3 answers

Lineage Graph in Spark

Jun 19, 2018 in Apache Spark by Data_Nerd
• 2,390 points
12,318 views
0 votes
1 answer
0 votes
1 answer

How RDD persist the data in Spark?

Jun 18, 2018 in Apache Spark by kurt_cobain
• 9,350 points
1,588 views
0 votes
1 answer

What do we mean by an RDD in Spark?

Jun 18, 2018 in Apache Spark by kurt_cobain
• 9,350 points
4,245 views
0 votes
1 answer

Different Hadoop Modes

Jun 13, 2018 in Big Data Hadoop by Data_Nerd
• 2,390 points
13,328 views
0 votes
1 answer
0 votes
1 answer
0 votes
1 answer

InputSplit vs HDFS Block

Jun 1, 2018 in Big Data Hadoop by shams
• 3,670 points
4,617 views
0 votes
1 answer

How does partitioning work in Spark?

May 31, 2018 in Apache Spark by coldcode
• 2,090 points
1,412 views
0 votes
1 answer

Is there any way to uncache RDD?

May 30, 2018 in Apache Spark by kurt_cobain
• 9,350 points
1,879 views
0 votes
1 answer

Sqoop vs distCP

May 30, 2018 in Big Data Hadoop by shams
• 3,670 points
1,484 views
0 votes
1 answer

NameNode without any data

May 29, 2018 in Big Data Hadoop by Data_Nerd
• 2,390 points
1,470 views
0 votes
1 answer
0 votes
1 answer

How to find max value in pair RDD?

May 26, 2018 in Apache Spark by kurt_cobain
• 9,350 points
8,218 views
0 votes
1 answer
0 votes
1 answer

out of Memory Error in Hadoop

May 22, 2018 in Big Data Hadoop by coldcode
• 2,090 points
1,969 views
0 votes
1 answer
0 votes
1 answer

Is a HDFS block sequential ?

May 21, 2018 in Big Data Hadoop by Shubham
• 13,490 points
1,840 views
0 votes
1 answer
0 votes
1 answer

Sqoop vs Oracle Hadoop Connectors

May 18, 2018 in Big Data Hadoop by kurt_cobain
• 9,350 points
1,115 views
0 votes
1 answer
0 votes
1 answer

How to install Hadoop in Ubuntu?

May 17, 2018 in Big Data Hadoop by kurt_cobain
• 9,350 points
829 views