Handling Imbalanced dataset

0 votes
What would be your strategy to handle a situation indicating an imbalanced dataset?
Oct 17, 2018 in Data Analytics by shams
• 3,580 points
14 views

1 answer to this question.

Your answer

Your name to display (optional):
Privacy: Your email address will only be used for sending these notifications.
0 votes
This usually occurs when a vast set of data keep in a single class. Sampling the dataset again is one of the possible solutions and the other one being the migration of data to parallel classes. The dataset should not be damaged.

Hope this helps
answered Oct 17, 2018 by kurt_cobain
• 9,260 points

Related Questions In Data Analytics

0 votes
1 answer

How to cluster a very large dataset in R?

You can initially use kmeans, to calculate ...READ MORE

answered Jun 19, 2018 in Data Analytics by darklord
• 6,140 points
44 views
0 votes
1 answer

Treat outliers in Dataset

Outlier values can be identified by using ...READ MORE

answered Jul 12, 2018 in Data Analytics by darklord
• 6,140 points
28 views
0 votes
2 answers

Number of missing values in dataset

Missing value treatment is a 2 step ...READ MORE

answered Aug 10, 2018 in Data Analytics by Atul
• 180 points
43 views
0 votes
1 answer

How do you know which whether to apply supervised learning or unsupervised learning on a dataset

Supervised Learning is applied when we have ...READ MORE

answered Aug 21, 2018 in Data Analytics by ANMOL
• 3,620 points
23 views
0 votes
1 answer

How can you find total number of null values in a dataset column wise?

You can write a custom sapply function ...READ MORE

answered Oct 12, 2018 in Data Analytics by ANMOL
• 3,620 points
17 views
0 votes
1 answer

What are the options for deploying models in production with R?

Well, I could say that the answer ...READ MORE

answered Apr 12, 2018 in Data Analytics by DataKing99
• 8,100 points
125 views
0 votes
1 answer

How to refresh shiny dataset?

When you refresh the page in the ...READ MORE

answered May 30, 2018 in Data Analytics by darklord
• 6,140 points
178 views
0 votes
1 answer

Use different distance formula other than euclidean distance in k means

K-means is based on variance minimization. The sum-of-variance formula ...READ MORE

answered Jun 21, 2018 in Data Analytics by darklord
• 6,140 points
249 views
0 votes
1 answer

Overfitting vs Underfitting

In statistics and machine learning, one of ...READ MORE

answered Jul 11, 2018 in Data Analytics by CodingByHeart77
• 3,680 points
28 views

© 2018 Brain4ce Education Solutions Pvt. Ltd. All rights Reserved.
"PMP®","PMI®", "PMI-ACP®" and "PMBOK®" are registered marks of the Project Management Institute, Inc. MongoDB®, Mongo and the leaf logo are the registered trademarks of MongoDB, Inc.