Most voted questions in Data Analytics

0 votes
1 answer

Check Distribution of Categorical Variable in R

Consider the iris dataset use the table() function ...READ MORE

Jul 30, 2018 in Data Analytics by CodingByHeart77
• 3,740 points
8,050 views
0 votes
1 answer

Data Imputation Packages

These are some packages in R which ...READ MORE

Jul 28, 2018 in Data Analytics by Sahiti
• 6,370 points
404 views
0 votes
1 answer

Load CSV File in R

Loading a .csv file in R is ...READ MORE

Jul 28, 2018 in Data Analytics by Sahiti
• 6,370 points
848 views
0 votes
1 answer

How do I become a data scientist step by step?

I am assuming that you are a ...READ MORE

Jul 26, 2018 in Data Analytics by Abhi
• 3,720 points
556 views
0 votes
2 answers

Why should anyone learn Python instead of R for machine learning?

Machine learning is the latest technology everyone ...READ MORE

Apr 13, 2019 in Data Analytics by SA
• 1,090 points
735 views
0 votes
1 answer

Mean of one column with respect to another

Consider the inbuilt iris dataset in R. Use ...READ MORE

Jul 25, 2018 in Data Analytics by CodingByHeart77
• 3,740 points
1,658 views
0 votes
1 answer

Steps to evaluate Linear Regression in R

 These are sequential steps which need to ...READ MORE

Jul 25, 2018 in Data Analytics by CodingByHeart77
• 3,740 points
963 views
0 votes
1 answer

Would time taken to train n-random forest be equal to n x (decision trees)

No, the time to train the random ...READ MORE

Aug 7, 2018 in Data Analytics by Abhi
• 3,720 points
354 views
0 votes
1 answer

On a given dataset would time taken to train n - random forest be equal to time taken to train n X (Decision tree)

No, the time to train the random ...READ MORE

Jul 24, 2018 in Data Analytics by Abhi
• 3,720 points
1,049 views
0 votes
1 answer

Components in grammar of graphics

Broadly speaking these are different components in grammar of ...READ MORE

Jul 24, 2018 in Data Analytics by Sahiti
• 6,370 points
557 views
0 votes
2 answers

How does data cleaning play a vital role in data analysis

Data is the core you do your ...READ MORE

Jul 24, 2018 in Data Analytics by Abhi
• 3,720 points
4,978 views
0 votes
1 answer

Turn values into scientific notation in R

Consider the below vector: a<-c(0.1324,0.0001234,234.21341324,09.324324) You can convert it ...READ MORE

Jul 19, 2018 in Data Analytics by DataKing99
• 8,240 points
11,382 views
0 votes
1 answer

Join multiple strings in R

Joining strings in R is quite an ...READ MORE

Jul 19, 2018 in Data Analytics by DataKing99
• 8,240 points
1,326 views
0 votes
2 answers

Recommender Systems

hello, i have one project which is based ...READ MORE

Mar 30, 2019 in Data Analytics by JAY SHAH
1,288 views
0 votes
1 answer

Logistic Regression Example

Logistic Regression often referred to as the ...READ MORE

Jul 19, 2018 in Data Analytics by CodingByHeart77
• 3,740 points
1,229 views
0 votes
2 answers

PCA model in R

Principal component analysis (PCA) is routinely employed on ...READ MORE

Jul 19, 2018 in Data Analytics by zombie
• 3,790 points
995 views
0 votes
1 answer

Multiple plots onto a single page in R

Plotting multiple plots onto a single page ...READ MORE

Jul 17, 2018 in Data Analytics by Sahiti
• 6,370 points
438 views
0 votes
1 answer

Cross Validation

Cross-validation is a model validation technique for ...READ MORE

Jul 17, 2018 in Data Analytics by CodingByHeart77
• 3,740 points
452 views
0 votes
1 answer

Difference between Eigenvectors and Eigenvalues

Eigenvectors are used for understanding linear transformations. In data ...READ MORE

Jul 17, 2018 in Data Analytics by CodingByHeart77
• 3,740 points
9,276 views
0 votes
1 answer

Univariate/ Bivariate/ Multivariate

Univariate: These analyses are descriptive statistical analysis techniques which ...READ MORE

Jul 13, 2018 in Data Analytics by CodingByHeart77
• 3,740 points
2,352 views
0 votes
1 answer

Statistical Power of Sensitivity

Sensitivity is commonly used to validate the ...READ MORE

Jul 13, 2018 in Data Analytics by CodingByHeart77
• 3,740 points
788 views
0 votes
1 answer

Treat outliers in Dataset

Outlier values can be identified by using ...READ MORE

Jul 12, 2018 in Data Analytics by Sahiti
• 6,370 points
590 views
0 votes
1 answer

How to treat missing values during analysis?

The extent of the missing values is ...READ MORE

Jul 12, 2018 in Data Analytics by Sahiti
• 6,370 points
800 views
0 votes
1 answer

Overfitting vs Underfitting

In statistics and machine learning, one of ...READ MORE

Jul 11, 2018 in Data Analytics by CodingByHeart77
• 3,740 points
952 views
0 votes
1 answer

Selection Bias

Selection bias is the bias introduced by the ...READ MORE

Jul 11, 2018 in Data Analytics by CodingByHeart77
• 3,740 points
747 views
0 votes
1 answer

White noise model in R

The white noise (WN) model is a ...READ MORE

Jul 10, 2018 in Data Analytics by Sahiti
• 6,370 points
808 views
0 votes
1 answer

Random Walk model in R

A random walk is a simple example ...READ MORE

Jul 10, 2018 in Data Analytics by Sahiti
• 6,370 points
770 views
0 votes
2 answers

Apply function in R

apply Description: “Returns a vector or array or ...READ MORE

Jul 10, 2018 in Data Analytics by zombie
• 3,790 points
768 views
0 votes
2 answers

Clustering in R

Clustering is a type of unsupervised learning ...READ MORE

Jul 9, 2018 in Data Analytics by zombie
• 3,790 points
662 views
0 votes
1 answer

Rattle packages in R

Rattle is a popular GUI for data mining ...READ MORE

Jul 6, 2018 in Data Analytics by CodingByHeart77
• 3,740 points
612 views
0 votes
1 answer

How to build a Scatter-plot using plotly?

With the help of “plotly” we can ...READ MORE

Jul 6, 2018 in Data Analytics by CodingByHeart77
• 3,740 points
1,438 views
0 votes
1 answer

How to create a new R6 Class in R?

You have to first create an object ...READ MORE

Jul 5, 2018 in Data Analytics by DataKing99
• 8,240 points
1,063 views
0 votes
1 answer

Confusion matrix in R

A confusion matrix can be used to ...READ MORE

Jul 5, 2018 in Data Analytics by DataKing99
• 8,240 points
1,001 views
0 votes
1 answer

How to create a box-plot using “plotly” in R?

You can use this command to create ...READ MORE

Jul 4, 2018 in Data Analytics by CodingByHeart77
• 3,740 points
7,220 views
0 votes
1 answer

How to write a custom function which will replace all the missing values in a vector with the mean of values in R?

Consider this vector: a<-c(1,2,3,NA,4,5,NA,NA) Write the function to impute ...READ MORE

Jul 4, 2018 in Data Analytics by CodingByHeart77
• 3,740 points
4,252 views
0 votes
1 answer

List packages are used for data mining in R?

You can refer to the following packages ...READ MORE

Jul 3, 2018 in Data Analytics by DataKing99
• 8,240 points
1,285 views
0 votes
1 answer

What is a Random Walk model and how can you simulate it using R?

A random walk is a simple example ...READ MORE

Jul 3, 2018 in Data Analytics by DataKing99
• 8,240 points
3,045 views
0 votes
1 answer

How to sample random rows in dataframe?

Create data frame and then implement as ...READ MORE

Jul 3, 2018 in Data Analytics by Sahiti
• 6,370 points
625 views
0 votes
1 answer

How to sample n random rows per group in a dataframe?

You can assign a random ID to ...READ MORE

Jul 3, 2018 in Data Analytics by Sahiti
• 6,370 points
4,732 views
0 votes
1 answer

Create vector matrix of movie ratings using R project?

Why do'nt you try the dcast function, in the reshape2 package. d ...READ MORE

Jun 30, 2018 in Data Analytics by anonymous
896 views
0 votes
1 answer

How to calculate group mean and assign it to new data in R

You can use something like this: df$grp.mean.values <- ...READ MORE

Jun 27, 2018 in Data Analytics by Sahiti
• 6,370 points
1,558 views
0 votes
1 answer

How to find out cluster center mean of DBSCAN in R?

Just index back into the original data ...READ MORE

Jun 27, 2018 in Data Analytics by Sahiti
• 6,370 points
1,200 views
0 votes
1 answer

TwitteR package download of package ‘rjson’ fails in R

If you don't want to upgrade your ...READ MORE

Jun 27, 2018 in Data Analytics by CodingByHeart77
• 3,740 points
1,191 views
0 votes
1 answer

SMOTE-function not working in R

If you convert 'y' to a factor, ...READ MORE

Jun 27, 2018 in Data Analytics by CodingByHeart77
• 3,740 points
2,704 views
0 votes
1 answer

hclust size limit

Classic hierarchical clustering approaches are O(n^3) in runtime and O(n^2) in ...READ MORE

Jun 26, 2018 in Data Analytics by DataKing99
• 8,240 points
665 views
0 votes
1 answer

How to cluster center mean of DBSCAN in R?

Just index back into the original data ...READ MORE

Jun 26, 2018 in Data Analytics by DataKing99
• 8,240 points
572 views
0 votes
1 answer

How to convert a sentence to word table in R?

Try the following code: sentence <- c("case sweden", ...READ MORE

Jun 21, 2018 in Data Analytics by Sahiti
• 6,370 points
1,262 views
0 votes
1 answer

Use different distance formula other than euclidean distance in k means

K-means is based on variance minimization. The sum-of-variance formula ...READ MORE

Jun 21, 2018 in Data Analytics by Sahiti
• 6,370 points
1,413 views
0 votes
1 answer

Is there any easy way to fill in missing data?

You can try the following code: First, you ...READ MORE

Jun 20, 2018 in Data Analytics by DataKing99
• 8,240 points
896 views
0 votes
1 answer

Extract lhs items from rules in R

Try the following code: data("Adult") rules <- apriori(Adult, ...READ MORE

Jun 20, 2018 in Data Analytics by DataKing99
• 8,240 points
1,155 views