Recent questions tagged developer

0 votes
1 answer

Sqoop Metastore ?

Jul 19, 2018 in Big Data Hadoop by shams
• 3,670 points
1,120 views
0 votes
1 answer
0 votes
1 answer
0 votes
1 answer

How Namenode handles data node failures?

Jul 11, 2018 in Big Data Hadoop by Shubham
• 13,490 points
6,013 views
0 votes
1 answer

Kafka topic not being deleted

Jul 9, 2018 in Apache Kafka by Shubham
• 13,490 points
2,794 views
+1 vote
8 answers

How to print the contents of RDD in Apache Spark?

Jul 6, 2018 in Apache Spark by Shubham
• 13,490 points
60,776 views
0 votes
2 answers

How to use RDD filter with other function?

Jul 5, 2018 in Apache Spark by Shubham
• 13,490 points
9,238 views
0 votes
1 answer

How to add third party java jars for use in PySpark?

Jul 4, 2018 in Apache Spark by Shubham
• 13,490 points
8,358 views
0 votes
1 answer
0 votes
1 answer
+1 vote
1 answer

map vs mapValues in Spark

Jun 29, 2018 in Apache Spark by Shubham
• 13,490 points
15,395 views
+1 vote
3 answers

Which cluster type should I choose for Spark?

Jun 27, 2018 in Apache Spark by Shubham
• 13,490 points
1,238 views
0 votes
1 answer

Which is better in term of speed, Shark or Spark?

Jun 26, 2018 in Apache Spark by Shubham
• 13,490 points
754 views
0 votes
1 answer

Spark Driver roles

Jun 21, 2018 in Apache Spark by shams
• 3,670 points
791 views
0 votes
1 answer

Spark standalone client mode

Jun 20, 2018 in Apache Spark by shams
• 3,670 points
604 views
0 votes
1 answer

Ways to create RDD in Apache Spark

Jun 19, 2018 in Apache Spark by Shubham
• 13,490 points
3,859 views
0 votes
3 answers

Lineage Graph in Spark

Jun 19, 2018 in Apache Spark by Data_Nerd
• 2,390 points
11,083 views
0 votes
1 answer
0 votes
1 answer

How RDD persist the data in Spark?

Jun 18, 2018 in Apache Spark by kurt_cobain
• 9,390 points
1,196 views
0 votes
1 answer

What do we mean by an RDD in Spark?

Jun 18, 2018 in Apache Spark by kurt_cobain
• 9,390 points
3,809 views
0 votes
1 answer

Different Hadoop Modes

Jun 13, 2018 in Big Data Hadoop by Data_Nerd
• 2,390 points
12,584 views
0 votes
1 answer
0 votes
1 answer
0 votes
1 answer

InputSplit vs HDFS Block

Jun 1, 2018 in Big Data Hadoop by shams
• 3,670 points
4,150 views
0 votes
1 answer

How does partitioning work in Spark?

May 31, 2018 in Apache Spark by coldcode
• 2,080 points
965 views
0 votes
1 answer

Is there any way to uncache RDD?

May 30, 2018 in Apache Spark by kurt_cobain
• 9,390 points
1,494 views
0 votes
1 answer

Sqoop vs distCP

May 30, 2018 in Big Data Hadoop by shams
• 3,670 points
1,108 views
0 votes
1 answer

NameNode without any data

May 29, 2018 in Big Data Hadoop by Data_Nerd
• 2,390 points
1,142 views
0 votes
1 answer
0 votes
1 answer

How to find max value in pair RDD?

May 26, 2018 in Apache Spark by kurt_cobain
• 9,390 points
7,673 views
0 votes
1 answer
0 votes
1 answer

out of Memory Error in Hadoop

May 22, 2018 in Big Data Hadoop by coldcode
• 2,080 points
1,635 views
0 votes
1 answer
0 votes
1 answer

Is a HDFS block sequential ?

May 21, 2018 in Big Data Hadoop by Shubham
• 13,490 points
1,417 views
0 votes
1 answer
0 votes
1 answer
0 votes
1 answer
0 votes
1 answer

How to install Hadoop in Ubuntu?

May 17, 2018 in Big Data Hadoop by kurt_cobain
• 9,390 points
467 views
0 votes
1 answer
0 votes
1 answer

Visualization Tool in Cloudera CDH

May 16, 2018 in Big Data Hadoop by kurt_cobain
• 9,390 points
1,061 views
0 votes
10 answers