Treat outliers in Dataset

0 votes
Hi! I want to know how to treat the outliers dataset in R.
Jul 12, 2018 in Data Analytics by CodingByHeart77
• 3,680 points
28 views

1 answer to this question.

Your answer

Your name to display (optional):
Privacy: Your email address will only be used for sending these notifications.
0 votes

Outlier values can be identified by using univariate or any other graphical analysis method. 

If the number of outlier values is few then they can be assessed individually, but for large number of outliers the values can be substituted with either the 99th or the 1st percentile values.

Note: All extreme values are not outlier values. 

The most common ways to treat outlier values

  1. To change the value and bring in within a range
  2. To just remove the value.
answered Jul 12, 2018 by darklord
• 6,140 points

Related Questions In Data Analytics

0 votes
2 answers

Number of missing values in dataset

Missing value treatment is a 2 step ...READ MORE

answered Aug 10, 2018 in Data Analytics by Atul
• 180 points
44 views
0 votes
1 answer

How can you find total number of null values in a dataset column wise?

You can write a custom sapply function ...READ MORE

answered Oct 12, 2018 in Data Analytics by ANMOL
• 3,620 points
18 views
0 votes
2 answers

When scoring a logistic regression model , is having the predicted variable in test dataset mandatory ?

Answer to your follow up question: We can ...READ MORE

answered Oct 17, 2018 in Data Analytics by Anmol
• 1,620 points
32 views
0 votes
1 answer

How do I get different distributions of our dataset in R?

There are multiple ways of getting this. ...READ MORE

answered Nov 26, 2018 in Data Analytics by Maverick
• 10,000 points
41 views
0 votes
1 answer

How to plot side-by-side Plots with ggplot2 in R?

By Using gridExtra library we can easily ...READ MORE

answered Apr 16, 2018 in Data Analytics by DeepCoder786
• 1,700 points
945 views
0 votes
10 answers

Changing the legend title in ggplot

Example : p <- ggplot(mtcars, aes(mpg, wt, colour ...READ MORE

answered Dec 10, 2018 in Data Analytics by Rajni
3,546 views
0 votes
1 answer

How to order bars in a bar graph using ggplot2?

The key to ordering is to set ...READ MORE

answered Jun 1, 2018 in Data Analytics by DataKing99
• 8,100 points
74 views
0 votes
1 answer

How to add regression line equation and R2 on graph?

Below is one solution: # GET EQUATION AND ...READ MORE

answered Jun 1, 2018 in Data Analytics by DataKing99
• 8,100 points
118 views
0 votes
1 answer

How to cluster a very large dataset in R?

You can initially use kmeans, to calculate ...READ MORE

answered Jun 19, 2018 in Data Analytics by darklord
• 6,140 points
47 views
0 votes
1 answer

How to change y axis max in time series using R?

The axis limits are being set using ...READ MORE

answered Apr 3, 2018 in Data Analytics by darklord
• 6,140 points
42 views

© 2018 Brain4ce Education Solutions Pvt. Ltd. All rights Reserved.
"PMP®","PMI®", "PMI-ACP®" and "PMBOK®" are registered marks of the Project Management Institute, Inc. MongoDB®, Mongo and the leaf logo are the registered trademarks of MongoDB, Inc.