How to find the number of null contain in dataframe

May 3, 2019 in Apache Spark by anonymous
• 120 points
edited May 3, 2019 by Omkar • 6,172 views

1 answer to this question.

Hey there!

You can use the select method of the dataframe to filter out the values.

df.select([count(when(isnull(c), c)).alias(c) for c in df.columns]).show()

This will display a table with column names and the number of Null values in each column.

If you want to check Null values for a column, then you can use the below code:

df.where(df.col("<Enter column name here>").isNull).count()

answered May 3, 2019 by Omkar
• 69,180 points

I am getting an error with this command and it says "illegal start of simple expresssion". Please help.

df.select([count(when(isnull(c), c)).alias(c) for c in df.columns]).show()

commented Feb 11, 2020 by Manish
• 120 points
edited Feb 11, 2020 by Gitika

How to find the number of elements present in the array in a Spark DataFame column?

You can select the column and apply ...READ MORE

answered Jun 6, 2018 in Apache Spark by Shubham
• 13,490 points • 23,672 views

0 votes

1 answer

How to get the number of elements in partition?

rdd.mapPartitions(iter => Array(iter.size).iterator, true) This command will ...READ MORE

answered May 8, 2018 in Apache Spark by kurt_cobain
• 9,350 points • 3,110 views

+1 vote

8 answers

How to replace null values in Spark DataFrame?

Hi, In Spark, fill() function of DataFrameNaFunctions class is used to replace ...READ MORE

answered Dec 15, 2020 in Apache Spark by MD
• 95,460 points • 80,000 views

+1 vote

8 answers

How to print the contents of RDD in Apache Spark?

Save it to a text file: line.saveAsTextFile("alicia.txt") Print contains ...READ MORE

answered Dec 10, 2018 in Apache Spark by Akshay
• 65,691 views

+1 vote

2 answers

How do I get number of columns in each line from a delimited file??

Instead of spliting on '\n'. You should ...READ MORE

answered Aug 7, 2019 in Apache Spark by ashish
• 7,138 views

+1 vote

1 answer

Hadoop Mapreduce word count Program

Firstly you need to understand the concept ...READ MORE

answered Mar 16, 2018 in Data Analytics by nitinrawat895
• 11,380 points • 13,955 views

0 votes

1 answer

hadoop.mapred vs hadoop.mapreduce?

org.apache.hadoop.mapred is the Old API org.apache.hadoop.mapreduce is the ...READ MORE

answered Mar 16, 2018 in Data Analytics by nitinrawat895
• 11,380 points • 4,779 views

+2 votes

11 answers

hadoop fs -put command?

Hi, You can create one directory in HDFS ...READ MORE

answered Mar 16, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points • 118,566 views

0 votes

1 answer

How to increase the amount of data to be transferred to shuffle service at the same time?

The amount of data to be transferred ...READ MORE

answered Mar 1, 2019 in Apache Spark by Omkar
• 69,180 points • 1,770 views

–1 vote

1 answer

Not able to use sc in spark shell

Seems like master and worker are not ...READ MORE

answered Jan 3, 2019 in Apache Spark by Omkar
• 69,180 points • 2,583 views

All categories
Generative AI (1,454)
Power BI (1,316)
DevOps & Agile (4,138)
Data Science (100)
ChatGPT (30)
Cyber Security & Ethical Hacking (1,057)
Data Analytics (1,266)
Cloud Computing (4,053)
Machine Learning (337)
PMP (1,069)
Python (3,488)
SalesForce (201)
Selenium (1,624)
Software Testing (58)
Tableau (608)
Web Development (3,972)
UI UX Design (24)
Java (1,358)
Azure (157)
Database (858)
Big Data Hadoop (1,907)
Blockchain (1,673)
Digital Marketing (121)
C# (141)
C++ (272)
IoT (Internet of Things) (390)
Kotlin (8)
Linux Administration (389)
MicroStrategy (7)
Mobile Development (395)
Others (2,386)
RPA (653)
Talend (73)
TypeSript (124)
Apache Kafka (84)
Apache Spark (596)
Career Counselling (1,091)
Events & Trending Topics (28)
Ask us Anything! (71)

Subscribe to our Newsletter, and get personalized recommendations.

Thank you for registering Join Edureka Meetup community for 100+ Free Webinars each month JOIN MEETUP GROUP

How to find the number of null contain in dataframe

Your comment on this question:

1 answer to this question.

Your answer

Your comment on this answer:

Related Questions In Apache Spark

How to find the number of elements present in the array in a Spark DataFame column?

How to get the number of elements in partition?

How to replace null values in Spark DataFrame?

How to print the contents of RDD in Apache Spark?

How do I get number of columns in each line from a delimited file??

Hadoop Mapreduce word count Program

hadoop.mapred vs hadoop.mapreduce?

hadoop fs -put command?

How to increase the amount of data to be transferred to shuffle service at the same time?

Not able to use sc in spark shell

Subscribe to our Newsletter, and get personalized recommendations.

TRENDING CERTIFICATION COURSES

TRENDING MASTERS COURSES

COMPANY

WORK WITH US

DOWNLOAD APP

CATEGORIES

CATEGORIES

TRENDING BLOG ARTICLES

TRENDING BLOG ARTICLES