Treat outliers in Dataset

0 votes
Hi! I want to know how to treat the outliers dataset in R.
Jul 12, 2018 in Data Analytics by CodingByHeart77
• 3,690 points
53 views

1 answer to this question.

0 votes

Outlier values can be identified by using univariate or any other graphical analysis method. 

If the number of outlier values is few then they can be assessed individually, but for large number of outliers the values can be substituted with either the 99th or the 1st percentile values.

Note: All extreme values are not outlier values. 

The most common ways to treat outlier values

  1. To change the value and bring in within a range
  2. To just remove the value.
answered Jul 12, 2018 by darklord
• 6,170 points

Related Questions In Data Analytics

0 votes
3 answers

Number of missing values in dataset

Try this, lapply(airquality, function(x) { sum(is.na(x)) }) READ MORE

answered Aug 6 in Data Analytics by anonymous
210 views
0 votes
1 answer

How can you find total number of null values in a dataset column wise?

You can write a custom sapply function ...READ MORE

answered Oct 12, 2018 in Data Analytics by Anmol
• 3,620 points
53 views
0 votes
2 answers

When scoring a logistic regression model , is having the predicted variable in test dataset mandatory ?

Answer to your follow up question: We can ...READ MORE

answered Oct 17, 2018 in Data Analytics by Anmol
• 1,610 points
71 views
0 votes
1 answer

How do I get different distributions of our dataset in R?

There are multiple ways of getting this. ...READ MORE

answered Nov 26, 2018 in Data Analytics by Maverick
• 10,040 points
65 views
0 votes
1 answer

How to plot side-by-side Plots with ggplot2 in R?

By Using gridExtra library we can easily ...READ MORE

answered Apr 16, 2018 in Data Analytics by DeepCoder786
• 1,720 points
1,692 views
0 votes
11 answers

Changing the legend title in ggplot

Hi, you can also try guides() to ...READ MORE

answered Jul 30 in Data Analytics by Cherukuri
• 31,840 points
6,367 views
0 votes
1 answer

How to order bars in a bar graph using ggplot2?

The key to ordering is to set ...READ MORE

answered Jun 1, 2018 in Data Analytics by DataKing99
• 8,130 points
186 views
0 votes
1 answer

How to add regression line equation and R2 on graph?

Below is one solution: # GET EQUATION AND ...READ MORE

answered Jun 1, 2018 in Data Analytics by DataKing99
• 8,130 points
813 views
0 votes
1 answer

How to cluster a very large dataset in R?

You can initially use kmeans, to calculate ...READ MORE

answered Jun 19, 2018 in Data Analytics by darklord
• 6,170 points
154 views
0 votes
1 answer

How to change y axis max in time series using R?

The axis limits are being set using ...READ MORE

answered Apr 3, 2018 in Data Analytics by darklord
• 6,170 points
162 views