Recent questions tagged spark

0 votes
1 answer

Primary keys in Apache Spark

Sep 11, 2019 in Apache Spark by nitinrawat895
• 10,840 points
108 views
+1 vote
1 answer

Primary keys in Apache Spark

Aug 9, 2019 in Apache Spark by nitinrawat895
• 10,840 points
323 views
0 votes
1 answer
0 votes
1 answer
0 votes
1 answer

Primary keys in apache Spark

Jul 11, 2019 in Big Data Hadoop by nitinrawat895
• 10,840 points
170 views
0 votes
1 answer
0 votes
1 answer

Spark SQL in databricks

Feb 23, 2019 in Apache Spark by Sunny
330 views
+1 vote
1 answer
0 votes
1 answer

ReduceByKey Avereage

Jan 21, 2019 in Big Data Hadoop by slayer
• 29,260 points
41 views
0 votes
1 answer

What is Executor Memory in a Spark application?

Jan 4, 2019 in Apache Spark by Neha
• 6,280 points
1,082 views
–1 vote
1 answer

How to start pyspark?

Jan 3, 2019 in Big Data Hadoop by digger
• 26,660 points
55 views
0 votes
1 answer

Apache Spark vs MapReduce

Dec 19, 2018 in Big Data Hadoop by slayer
• 29,260 points
91 views
0 votes
1 answer
0 votes
1 answer

How to get ID of a map task in Spark?

Nov 20, 2018 in Apache Spark by Neha
• 6,280 points
635 views
0 votes
1 answer

How to open/stream .zip files through Spark?

Nov 20, 2018 in Apache Spark by Neha
• 6,280 points
545 views
0 votes
1 answer
0 votes
1 answer
0 votes
1 answer

Is 'sparkline' a method?

Nov 9, 2018 in Apache Spark by Neha
• 6,280 points
108 views
0 votes
1 answer

Filter, Option or FlatMap in spark

Nov 9, 2018 in Apache Spark by Neha
• 6,280 points
679 views
0 votes
3 answers

Hadoop Spark: How to iterate hdfs directories?

Oct 29, 2018 in Big Data Hadoop by digger
• 26,660 points
3,856 views
0 votes
1 answer

Does Hadoop and Spark support iPv6 now?

Oct 15, 2018 in Big Data Hadoop by Neha
• 6,280 points
293 views
0 votes
1 answer

Internal work of Spark

Oct 11, 2018 in Apache Spark by Meci Matt
• 9,460 points
165 views
0 votes
1 answer

Spark - repartition() vs coalesce()

Oct 11, 2018 in Apache Spark by Meci Matt
• 9,460 points
4,397 views
0 votes
1 answer

Spark - load CSV file as DataFrame?

Sep 25, 2018 in Big Data Hadoop by digger
• 26,660 points
3,761 views
0 votes
1 answer
0 votes
1 answer

What happens to RDD when one of the nodes goes down?

Sep 3, 2018 in Apache Spark by Shubham
• 13,370 points
275 views
0 votes
1 answer

Does Spark provide the storage layer too?

Sep 3, 2018 in Apache Spark by Shubham
• 13,370 points
78 views
0 votes
1 answer

Functions of Spark SQL?

Sep 3, 2018 in Apache Spark by Meci Matt
• 9,460 points
133 views
0 votes
1 answer

Languages supported by Apache Spark?

Sep 3, 2018 in Apache Spark by Meci Matt
• 9,460 points
208 views
0 votes
1 answer

How to connect Amazon RedShift in Apache Spark?

Aug 22, 2018 in AWS by datageek
• 2,460 points
2,796 views
+2 votes
3 answers
0 votes
2 answers

Which cluster type should I choose for Spark?

Aug 21, 2018 in Apache Spark by Shubham
• 13,370 points
166 views
0 votes
1 answer
0 votes
2 answers
0 votes
2 answers
0 votes
1 answer

What makes Spark faster than MapReduce?

Jul 26, 2018 in Apache Spark by Neha
• 6,280 points
235 views
0 votes
1 answer

PySpark Config ?

Jul 26, 2018 in Apache Spark by shams
• 3,580 points
59 views
+1 vote
1 answer
0 votes
1 answer
0 votes
7 answers

How to print the contents of RDD in Apache Spark?

Jul 6, 2018 in Apache Spark by Shubham
• 13,370 points
16,267 views
0 votes
2 answers

How to use RDD filter with other function?

Jul 5, 2018 in Apache Spark by Shubham
• 13,370 points
999 views
0 votes
1 answer

How to add third party java jars for use in PySpark?

Jul 4, 2018 in Apache Spark by Shubham
• 13,370 points
1,847 views