Pyspark dataframe with random values

How to create a column in pyspark dataframe with random values within a range?

Aug 1, 2019 in Apache Spark by Esha
• 9,388 views

1 answer to this question.

Hey @Esha, you can use this code. Let me know if it doesn't work:

from pyspark.sql.functions import rand,when df1 = df.withColumn('isVal', when(rand() > 0.5, 1).otherwise(0))

Hope this helps!

Join PySpark Online training today to know more about Pyspark.

Thanks.

answered Aug 1, 2019 by Zed

Related Questions In Apache Spark

0 votes

3 answers

Filtering a row in Spark DataFrame based on matching values from a list

Use the function as following: var notFollowingList=List(9.8,7,6,3,1) df.filter(col("uid").isin(notFollowingList:_*)) You can ...READ MORE

answered Jun 6, 2018 in Apache Spark by Shubham
• 13,490 points • 93,521 views

+1 vote

1 answer

getting null values in spark dataframe while reading data from hbase

Can you share the screenshots for the ...READ MORE

answered Jul 31, 2018 in Apache Spark by kurt_cobain
• 9,350 points • 2,681 views

+2 votes

0 answers

How can I import zip files and process the excel files ( inside the zip files ) by using pyspark connecting with pymongo ?

How can I import zip files and ...READ MORE

Aug 6, 2019 in Apache Spark by Ahmed
• 1,026 views

+1 vote

1 answer

How to convert a json file structure with values in single quotes to quoteless ?

You can do this by turning off ...READ MORE

answered Oct 4, 2019 in Apache Spark by Jisha
• 4,552 views

–1 vote

1 answer

Pyspark rdd How to get partition number in output ?

The glom function is what you are looking for: glom(self): ...READ MORE

answered Jan 8, 2019 in Python by Omkar
• 69,180 points • 2,949 views

+1 vote

2 answers

How do I get number of columns in each line from a delimited file??

Instead of spliting on '\n'. You should ...READ MORE

answered Aug 7, 2019 in Apache Spark by ashish
• 6,174 views

0 votes

1 answer

How to get SQL configuration in Spark using Python?

You can get the configuration details through ...READ MORE

answered Mar 18, 2019 in Apache Spark by John
• 1,625 views

0 votes

2 answers

how can i randomly select items from a list?

You can also use the random library's ...READ MORE

answered Apr 9, 2020 in Python by Patrick
• 5,833 views

+1 vote

8 answers

How to replace null values in Spark DataFrame?

Hi, In Spark, fill() function of DataFrameNaFunctions class is used to replace ...READ MORE

answered Dec 15, 2020 in Apache Spark by MD
• 95,460 points • 76,938 views

+2 votes

14 answers

How to create new column with function in Spark Dataframe?

val coder: (Int => String) = v ...READ MORE

answered Apr 5, 2019 in Apache Spark by anonymous

edited Apr 5, 2019 by Omkar • 90,846 views

Subscribe to our Newsletter, and get personalized recommendations.

REGISTER FOR FREE WEBINAR

Thank you for registering Join Edureka Meetup community for 100+ Free Webinars each month JOIN MEETUP GROUP