Query regarding a spark split logic

Question

I have a csv file containing 10 lines. I need to have a spark code to split even and odd numbers of lines into 2 text files results. Could you help me out?

Omkar · Answer 1 · Feb 9, 2019

First, import the data in Spark and add IDs to it. Run these commands in scala console:

val df = spark.read.csv("file.csv")
val df1 = df.withColumn("id",monotonicallyIncreasingId)

Then use this logic to split:

val df2 = df1.filter($"id"%2!==0)
val df2 = df1.filter($"id"%2!==0)

answered Feb 9, 2019 by Omkar
• 69,180 points

Query regarding a spark split logic

Your comment on this question:

1 answer to this question.

Your answer

Your comment on this answer:

Related Questions In Apache Spark

Error : split value is not a member of org.apache.spark.sql.Row

Query regarding Appending " to a string in Scala

Error : split value is not a member of org.apache.spark.sql.Row

Unable to run select query with selected columns on a temp view registered in spark application

How do I get number of columns in each line from a delimited file??

Hadoop Mapreduce word count Program

hadoop.mapred vs hadoop.mapreduce?

hadoop fs -put command?

Not able to use sc in spark shell

Spark and Scale Auxiliary constructor doubt

Subscribe to our Newsletter, and get personalized recommendations.

TRENDING CERTIFICATION COURSES

TRENDING MASTERS COURSES

COMPANY

WORK WITH US

DOWNLOAD APP

CATEGORIES

CATEGORIES