How to treat missing values during analysis?

0 votes
I am trying to perform an analysis in R, and my dataset has a lot of missing values.

I want to know how to treat these missing values.

Can someone please help?
Jul 12, 2018 in Data Analytics by CodingByHeart77
• 3,680 points
22 views

1 answer to this question.

0 votes

First identify which variables have missing values, then measure the extent of the missing values in each of them. If the missing values follow a pattern, concentrate on that pattern, as it could lead to interesting and meaningful business insights.
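As a rough sketch of that first step, the missing values per variable can be counted with base R. The data frame and its column names here are made up for illustration.

```r
# Hypothetical example data; the column names are assumptions for illustration.
df <- data.frame(
  age    = c(25, NA, 31, 40, NA),
  income = c(50000, 62000, NA, 58000, 61000),
  city   = c("Pune", NA, "Delhi", "Pune", "Mumbai")
)

# Number of missing values in each variable
na_count <- colSums(is.na(df))

# Share of missing values in each variable (between 0 and 1)
na_share <- colMeans(is.na(df))

# Names of the variables that have at least one missing value
vars_with_na <- names(na_count)[na_count > 0]

print(na_count)
print(na_share)
print(vars_with_na)
```

Comparing the rows where one variable is missing against the rest of the data is one way to start looking for a pattern.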

If no pattern is identified, the missing values can be substituted with the mean or median of the variable (imputation), or they can simply be ignored.
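A minimal sketch of mean and median imputation on a numeric vector follows. The vector and the helper names `impute_mean` and `impute_median` are hypothetical, not part of any package.

```r
# Hypothetical numeric vector with missing values
x <- c(2, 4, NA, 6, 20, NA)

# Replace every NA with the mean of the observed values
impute_mean <- function(v) {
  v[is.na(v)] <- mean(v, na.rm = TRUE)
  v
}

# Replace every NA with the median of the observed values
impute_median <- function(v) {
  v[is.na(v)] <- median(v, na.rm = TRUE)
  v
}

x_mean   <- impute_mean(x)    # NAs become 8
x_median <- impute_median(x)  # NAs become 5

# "Ignoring" the missing values: skip them at computation time instead
mean_ignoring_na <- mean(x, na.rm = TRUE)
```

The median is less affected by the outlying value 20, which is why the two imputed vectors differ.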

Another option is to assign a default value, which can be the mean, minimum or maximum value of the variable. Which one makes sense depends on the data, so it is important to get into the data first.

If it is a categorical variable, the missing values are assigned a default value. If it is a numeric variable whose values follow a normal distribution, use the mean value.
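For the categorical case, one possible sketch is shown below. The label "Unknown" is an assumed default chosen for illustration; the right default depends on the data.

```r
# Hypothetical categorical variable with missing values
city <- factor(c("Pune", NA, "Delhi", "Pune", NA))

# A factor only accepts values that are among its levels,
# so add the default label as a level first.
levels(city) <- c(levels(city), "Unknown")

# Assign the default value to the missing entries
city[is.na(city)] <- "Unknown"

print(table(city))
```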

If around 80% of the values for a variable are missing, drop the variable instead of treating the missing values.
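A small sketch of dropping such variables follows, assuming a cutoff of 0.8 and a made-up data frame.

```r
# Hypothetical example data; "notes" is missing in 5 of 6 rows.
df <- data.frame(
  id    = 1:6,
  score = c(10, NA, 12, 11, NA, 9),
  notes = c(NA, NA, NA, NA, "late", NA)
)

# Assumed cutoff: drop variables with 80% or more missing values
threshold <- 0.8

# Share of missing values in each variable
na_share <- colMeans(is.na(df))

# Keep only the variables below the cutoff
df_kept <- df[, na_share < threshold, drop = FALSE]

print(names(df_kept))
```

Here `drop = FALSE` keeps the result a data frame even if only one column survives.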

In R, the practical options include mean imputation, median imputation, replacing missing values with dummy values, filling them in based on correlations and similarities with other records, removing the record entirely, and leaving the record as it is.
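Two of these options, removing the record and leaving it as it is, can be sketched in base R as follows. The data frame is hypothetical.

```r
# Hypothetical example data
df <- data.frame(
  age    = c(25, NA, 31, 40),
  income = c(50000, 62000, NA, 58000)
)

# Remove the record entirely: keep only rows with no missing values
df_complete <- df[complete.cases(df), ]

# na.omit() selects the same rows
df_omit <- na.omit(df)

# Leave the record as it is: keep the NAs in the data
# and skip them only when computing a summary
mean_age <- mean(df$age, na.rm = TRUE)

print(df_complete)
print(mean_age)
```

Removing records discards the observed values in those rows as well, so it is worth checking how many rows remain afterwards.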

answered Jul 12, 2018 by darklord
• 6,140 points
