Date formats : how to cast string to date?

0 votes

The following date formats not working for me

>>> df = spark.createDataFrame([("1997/02/28",)], ["t"])
>>> df.select(F.date_format(df.t,"yyyy/MM/dd")).show()
+--------------------------+
|date_format(t, yyyy/MM/dd)|
+--------------------------+
| null|
+--------------------------+

>>> df = spark.createDataFrame([["28-02-1997"],["01-02-2000"],["01-02-2022"]], ["t"])
>>> df.select(df.t.cast("date").alias("n_d")).orderBy("n_d").show()
+----+
| n_d|
+----+
|null|
|null|
|null|
+----+
Jul 29 in Apache Spark by Rahul
53 views

1 answer to this question.

0 votes

Try this, it should work:

> from pyspark.sql.functions import unix_timestamp

> df = spark.createDataFrame([("11/25/1991",), ("11/24/1991",), ("11/30/1991",)], ['date_str'])

> df2 = df.select('date_str', from_unixtime(unix_timestamp('date_str', 'MM/dd/yyy')).alias('date'))

DataFrame[date_str: string, date: timestamp]

> df2.show()


+----------+--------------------+

| date_str| date|

+----------+--------------------+

|11/25/1991|1991-11-25 00:00:...|

|11/24/1991|1991-11-24 00:00:...|

|11/30/1991|1991-11-30 00:00:...|

+----------+--------------------+
answered Jul 29 by Niall

Related Questions In Apache Spark

0 votes
0 answers

How to create RDD as string file?

Can anyone suggest how to create RDD ...READ MORE

Jul 4 in Apache Spark by anand
26 views
0 votes
1 answer

How to format a string in Scala?

Hey, To format a string, use the .format ...READ MORE

answered Jul 30 in Apache Spark by Gitika
• 25,340 points
25 views
0 votes
1 answer

How to access variables in s string interpolation in Scala?

Hey, You can use below code to access variables ...READ MORE

answered Jul 31 in Apache Spark by Gitika
• 25,340 points
33 views
0 votes
1 answer

How to stop messages from being displayed on spark console?

In your log4j.properties file you need to ...READ MORE

answered Apr 24, 2018 in Apache Spark by kurt_cobain
• 9,260 points
1,210 views
0 votes
1 answer
0 votes
1 answer

Hadoop Mapreduce word count Program

Firstly you need to understand the concept ...READ MORE

answered Mar 16, 2018 in Data Analytics by nitinrawat895
• 10,690 points
3,063 views
0 votes
1 answer

hadoop.mapred vs hadoop.mapreduce?

org.apache.hadoop.mapred is the Old API  org.apache.hadoop.mapreduce is the ...READ MORE

answered Mar 16, 2018 in Data Analytics by nitinrawat895
• 10,690 points
341 views
0 votes
10 answers

hadoop fs -put command?

put syntax: put <localSrc> <dest> copy syntax: copyFr ...READ MORE

answered Dec 7, 2018 in Big Data Hadoop by Aditya
15,058 views
0 votes
1 answer

How to print string text in scala?

Hi, You can see this example to see ...READ MORE

answered Jul 5 in Apache Spark by Gitika
• 25,340 points
38 views
0 votes
4 answers

How to change the spark Session configuration in Pyspark?

You can dynamically load properties. First create ...READ MORE

answered Dec 10, 2018 in Apache Spark by Vini
15,007 views