Recent questions tagged spark

0 votes
1 answer

Primary keys in Apache Spark

Jul 11 in Big Data Hadoop by nitinrawat895
• 10,110 points
20 views
0 votes
1 answer

Spark SQL in Databricks

Feb 23 in Apache Spark by Sunny
128 views
+1 vote
1 answer
0 votes
1 answer

ReduceByKey Average

Jan 21 in Big Data Hadoop by slayer
• 29,050 points
18 views
0 votes
1 answer

How to start PySpark?

Jan 3 in Big Data Hadoop by digger
• 27,620 points
28 views
0 votes
1 answer

Apache Spark vs MapReduce

Dec 19, 2018 in Big Data Hadoop by slayer
• 29,050 points
62 views
0 votes
1 answer

How to get ID of a map task in Spark?

Nov 20, 2018 in Apache Spark by Neha
• 6,280 points
202 views
0 votes
1 answer

How to open/stream .zip files through Spark?

Nov 20, 2018 in Apache Spark by Neha
• 6,280 points
226 views
0 votes
1 answer

Is 'sparkline' a method?

Nov 9, 2018 in Apache Spark by Neha
• 6,280 points
34 views
0 votes
1 answer

Filter, Option or FlatMap in Spark

Nov 9, 2018 in Apache Spark by Neha
• 6,280 points
345 views
0 votes
3 answers

Hadoop Spark: How to iterate over HDFS directories?

Oct 29, 2018 in Big Data Hadoop by digger
• 27,620 points
553 views
0 votes
1 answer

Do Hadoop and Spark support IPv6 now?

Oct 15, 2018 in Big Data Hadoop by Neha
• 6,280 points
131 views
0 votes
1 answer

Internal workings of Spark

Oct 11, 2018 in Apache Spark by Meci Matt
• 9,400 points
88 views
0 votes
1 answer

Spark - repartition() vs coalesce()

Oct 11, 2018 in Apache Spark by Meci Matt
• 9,400 points
2,260 views
0 votes
1 answer

Spark - load CSV file as DataFrame?

Sep 25, 2018 in Big Data Hadoop by digger
• 27,620 points
2,239 views
0 votes
1 answer

What happens to RDD when one of the nodes goes down?

Sep 3, 2018 in Apache Spark by Shubham
• 13,190 points
168 views
0 votes
1 answer

Does Spark provide the storage layer too?

Sep 3, 2018 in Apache Spark by Shubham
• 13,190 points
47 views
0 votes
1 answer

Functions of Spark SQL?

Sep 3, 2018 in Apache Spark by Meci Matt
• 9,400 points
76 views
0 votes
1 answer

Languages supported by Apache Spark?

Sep 3, 2018 in Apache Spark by Meci Matt
• 9,400 points
44 views
0 votes
1 answer

How to connect to Amazon Redshift in Apache Spark?

Aug 22, 2018 in AWS by datageek
• 2,440 points
1,319 views
+2 votes
3 answers
0 votes
2 answers

Which cluster type should I choose for Spark?

Aug 21, 2018 in Apache Spark by Shubham
• 13,190 points
104 views
0 votes
1 answer
0 votes
2 answers
0 votes
2 answers
0 votes
1 answer

What makes Spark faster than MapReduce?

Jul 26, 2018 in Apache Spark by Neha
• 6,280 points
94 views
0 votes
1 answer

PySpark Config?

Jul 26, 2018 in Apache Spark by shams
• 3,580 points
33 views
+1 vote
1 answer
0 votes
1 answer
0 votes
7 answers

How to print the contents of RDD in Apache Spark?

Jul 6, 2018 in Apache Spark by Shubham
• 13,190 points
7,040 views
0 votes
2 answers

How to use RDD filter with another function?

Jul 5, 2018 in Apache Spark by Shubham
• 13,190 points
245 views
0 votes
1 answer

How to add third-party Java JARs for use in PySpark?

Jul 4, 2018 in Apache Spark by Shubham
• 13,190 points
986 views
0 votes
1 answer