Can I read a CSV represented as a string into Apache Spark?

I have a CSV file represented as a string. Is there any way to convert this string directly to a dataframe?

May 3, 2018 in Apache Spark by Data_Nerd
You can use the following command to split the string into lines and parse each one. You will still need to do a bit of data cleansing and verification yourself.

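// Assumes CSVParser.parseLine returns an Option[List[String]] (or similar), so flatMap drops lines that fail to parse.
val mydata: Array[List[String]] = myString.split('\n').flatMap(CSVParser.parseLine(_))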

After that, you can convert it to an RDD:

import org.apache.spark.rdd.RDD
val myRDD: RDD[List[String]] = sparkContext.parallelize(mydata)
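Since the question asks for a DataFrame rather than an RDD, here is a minimal sketch of a more direct route. It assumes Spark 2.2 or later, where DataFrameReader.csv accepts a Dataset[String]. It also assumes the first line of the string is a header. The object name, the fromString helper, and the sample data are hypothetical and only there for illustration.

```scala
import org.apache.spark.sql.{DataFrame, Dataset, SparkSession}

object CsvStringToDataFrame {

  // Hypothetical helper: turns a CSV string into a DataFrame.
  // Assumes the first line is a header and no field contains an embedded newline.
  def fromString(spark: SparkSession, csv: String): DataFrame = {
    import spark.implicits._ // provides .toDS() on a local Seq[String]

    // One Dataset element per CSV line.
    val lines: Dataset[String] = csv.split('\n').toSeq.toDS()

    // Let Spark's CSV reader do the parsing instead of a hand-rolled parser.
    spark.read
      .option("header", "true")      // treat the first line as column names
      .option("inferSchema", "true") // guess column types from the data
      .csv(lines)
  }

  def main(args: Array[String]): Unit = {
    // Local session for illustration only; a real job usually already has one.
    val spark = SparkSession.builder()
      .appName("csv-string-example")
      .master("local[*]")
      .getOrCreate()

    // Made-up sample input standing in for myString.
    val myString = "id,name\n1,alice\n2,bob"

    val df = fromString(spark, myString)
    df.printSchema()
    df.show()

    spark.stop()
  }
}
```

Splitting on '\n' will break quoted fields that contain line breaks, so this sketch only fits strings where every record sits on a single line.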

answered May 3, 2018 by kurt_cobain
