Most voted questions in Data Analytics

0 votes
1 answer

Mean of one column with respect to another

Consider the built-in iris dataset in R. Use ...READ MORE

Jul 25, 2018 in Data Analytics by CodingByHeart77
• 3,750 points
2,361 views
0 votes
1 answer

Steps to evaluate Linear Regression in R

These are sequential steps which need to ...READ MORE

Jul 25, 2018 in Data Analytics by CodingByHeart77
• 3,750 points
1,564 views
0 votes
1 answer

Would the time taken to train n-random forest be equal to n x (decision trees)?

No, the time to train the random ...READ MORE

Aug 7, 2018 in Data Analytics by Abhi
• 3,720 points
1,339 views
0 votes
1 answer

On a given dataset, would the time taken to train n-random forest be equal to the time taken to train n x (decision tree)?

No, the time to train the random ...READ MORE

Jul 24, 2018 in Data Analytics by Abhi
• 3,720 points
1,588 views
0 votes
1 answer

Components in grammar of graphics

Broadly speaking, these are the different components in the grammar of ...READ MORE

Jul 24, 2018 in Data Analytics by Sahiti
• 6,370 points
1,115 views
0 votes
2 answers

How does data cleaning play a vital role in data analysis?

Data is the core you do your ...READ MORE

Jul 24, 2018 in Data Analytics by Abhi
• 3,720 points
5,913 views
0 votes
1 answer

Turn values into scientific notation in R

Consider the vector below: a <- c(0.1324, 0.0001234, 234.21341324, 09.324324). You can convert it ...READ MORE

Jul 19, 2018 in Data Analytics by DataKing99
• 8,250 points
12,098 views
0 votes
1 answer

Join multiple strings in R

Joining strings in R is quite an ...READ MORE

Jul 19, 2018 in Data Analytics by DataKing99
• 8,250 points
1,934 views
0 votes
2 answers

Recommender Systems

Hello, I have one project which is based ...READ MORE

Mar 30, 2019 in Data Analytics by JAY SHAH
1,945 views
0 votes
1 answer

Logistic Regression Example

Logistic regression, often referred to as the ...READ MORE

Jul 19, 2018 in Data Analytics by CodingByHeart77
• 3,750 points
1,956 views
0 votes
2 answers

PCA model in R

Principal component analysis (PCA) is routinely employed on ...READ MORE

Jul 19, 2018 in Data Analytics by zombie
• 3,790 points
1,641 views
0 votes
1 answer

Multiple plots onto a single page in R

Plotting multiple plots onto a single page ...READ MORE

Jul 17, 2018 in Data Analytics by Sahiti
• 6,370 points
886 views
0 votes
1 answer

Cross Validation

Cross-validation is a model validation technique for ...READ MORE

Jul 17, 2018 in Data Analytics by CodingByHeart77
• 3,750 points
1,133 views
0 votes
1 answer

Difference between Eigenvectors and Eigenvalues

Eigenvectors are used for understanding linear transformations. In data ...READ MORE

Jul 17, 2018 in Data Analytics by CodingByHeart77
• 3,750 points
10,488 views
0 votes
1 answer

Univariate / Bivariate / Multivariate

Univariate: These analyses are descriptive statistical analysis techniques which ...READ MORE

Jul 13, 2018 in Data Analytics by CodingByHeart77
• 3,750 points
2,993 views
0 votes
1 answer

Statistical Power of Sensitivity

Sensitivity is commonly used to validate the ...READ MORE

Jul 13, 2018 in Data Analytics by CodingByHeart77
• 3,750 points
1,533 views
0 votes
1 answer

Treat outliers in Dataset

Outlier values can be identified by using ...READ MORE

Jul 12, 2018 in Data Analytics by Sahiti
• 6,370 points
1,088 views
0 votes
1 answer

How to treat missing values during analysis?

The extent of the missing values is ...READ MORE

Jul 12, 2018 in Data Analytics by Sahiti
• 6,370 points
1,270 views
0 votes
1 answer

Overfitting vs Underfitting

In statistics and machine learning, one of ...READ MORE

Jul 11, 2018 in Data Analytics by CodingByHeart77
• 3,750 points
1,551 views
0 votes
1 answer

Selection Bias

Selection bias is the bias introduced by the ...READ MORE

Jul 11, 2018 in Data Analytics by CodingByHeart77
• 3,750 points
1,465 views
0 votes
1 answer

White noise model in R

The white noise (WN) model is a ...READ MORE

Jul 10, 2018 in Data Analytics by Sahiti
• 6,370 points
1,271 views
0 votes
1 answer

Random Walk model in R

A random walk is a simple example ...READ MORE

Jul 10, 2018 in Data Analytics by Sahiti
• 6,370 points
1,256 views
0 votes
2 answers

Apply function in R

apply Description: “Returns a vector or array or ...READ MORE

Jul 10, 2018 in Data Analytics by zombie
• 3,790 points
1,479 views
0 votes
2 answers

Clustering in R

Clustering is a type of unsupervised learning ...READ MORE

Jul 9, 2018 in Data Analytics by zombie
• 3,790 points
1,471 views
0 votes
1 answer

Rattle packages in R

Rattle is a popular GUI for data mining ...READ MORE

Jul 6, 2018 in Data Analytics by CodingByHeart77
• 3,750 points
1,106 views
0 votes
1 answer

How to build a Scatter-plot using plotly?

With the help of “plotly” we can ...READ MORE

Jul 6, 2018 in Data Analytics by CodingByHeart77
• 3,750 points
2,245 views
0 votes
1 answer

How to create a new R6 Class in R?

You have to first create an object ...READ MORE

Jul 5, 2018 in Data Analytics by DataKing99
• 8,250 points
1,619 views
0 votes
1 answer

Confusion matrix in R

A confusion matrix can be used to ...READ MORE

Jul 5, 2018 in Data Analytics by DataKing99
• 8,250 points
1,646 views
0 votes
1 answer

How to create a box-plot using “plotly” in R?

You can use this command to create ...READ MORE

Jul 4, 2018 in Data Analytics by CodingByHeart77
• 3,750 points
8,557 views
0 votes
1 answer

How to write a custom function in R that replaces all the missing values in a vector with the mean of the values?

Consider this vector: a <- c(1, 2, 3, NA, 4, 5, NA, NA). Write the function to impute ...READ MORE

Jul 4, 2018 in Data Analytics by CodingByHeart77
• 3,750 points
5,010 views
0 votes
1 answer

Which packages are used for data mining in R?

You can refer to the following packages ...READ MORE

Jul 3, 2018 in Data Analytics by DataKing99
• 8,250 points
1,981 views
0 votes
1 answer

What is a Random Walk model and how can you simulate it using R?

A random walk is a simple example ...READ MORE

Jul 3, 2018 in Data Analytics by DataKing99
• 8,250 points
3,577 views
0 votes
1 answer

How to sample random rows in a data frame?

Create a data frame and then implement as ...READ MORE

Jul 3, 2018 in Data Analytics by Sahiti
• 6,370 points
1,299 views
0 votes
1 answer

How to sample n random rows per group in a data frame?

You can assign a random ID to ...READ MORE

Jul 3, 2018 in Data Analytics by Sahiti
• 6,370 points
5,627 views
0 votes
1 answer

Create vector matrix of movie ratings using R project?

Why don't you try the dcast function in the reshape2 package? d ...READ MORE

Jun 30, 2018 in Data Analytics by anonymous
1,536 views
0 votes
1 answer

How to calculate group mean and assign it to new data in R

You can use something like this: df$grp.mean.values <- ...READ MORE

Jun 27, 2018 in Data Analytics by Sahiti
• 6,370 points
2,280 views
0 votes
1 answer

How to find out cluster center mean of DBSCAN in R?

Just index back into the original data ...READ MORE

Jun 27, 2018 in Data Analytics by Sahiti
• 6,370 points
1,859 views
0 votes
1 answer

twitteR package: download of package ‘rjson’ fails in R

If you don't want to upgrade your ...READ MORE

Jun 27, 2018 in Data Analytics by CodingByHeart77
• 3,750 points
1,637 views
0 votes
1 answer

SMOTE function not working in R

If you convert 'y' to a factor, ...READ MORE

Jun 27, 2018 in Data Analytics by CodingByHeart77
• 3,750 points
3,481 views
0 votes
1 answer

hclust size limit

Classic hierarchical clustering approaches are O(n^3) in runtime and O(n^2) in ...READ MORE

Jun 26, 2018 in Data Analytics by DataKing99
• 8,250 points
1,470 views
0 votes
1 answer

How to find the cluster center mean of DBSCAN in R?

Just index back into the original data ...READ MORE

Jun 26, 2018 in Data Analytics by DataKing99
• 8,250 points
1,166 views
0 votes
1 answer

How to convert a sentence to word table in R?

Try the following code: sentence <- c("case sweden", ...READ MORE

Jun 21, 2018 in Data Analytics by Sahiti
• 6,370 points
1,852 views
0 votes
1 answer

Use a distance formula other than Euclidean distance in k-means

K-means is based on variance minimization. The sum-of-variance formula ...READ MORE

Jun 21, 2018 in Data Analytics by Sahiti
• 6,370 points
2,009 views
0 votes
1 answer

Is there any easy way to fill in missing data?

You can try the following code: First, you ...READ MORE

Jun 20, 2018 in Data Analytics by DataKing99
• 8,250 points
1,395 views
0 votes
1 answer

Extract lhs items from rules in R

Try the following code: data("Adult") rules <- apriori(Adult, ...READ MORE

Jun 20, 2018 in Data Analytics by DataKing99
• 8,250 points
1,670 views
0 votes
1 answer

How to write rules generated by Apriori?

I found one solution: use as() ...READ MORE

Jun 19, 2018 in Data Analytics by Sahiti
• 6,370 points
950 views
0 votes
1 answer

How to cluster a very large dataset in R?

You can initially use kmeans to calculate ...READ MORE

Jun 19, 2018 in Data Analytics by Sahiti
• 6,370 points
3,364 views
0 votes
1 answer

Extract a subset of a data frame based on a condition involving a field

Here are the two main approaches. I ...READ MORE

Jun 19, 2018 in Data Analytics by CodingByHeart77
• 3,750 points
16,631 views
0 votes
1 answer

Random subset of data with dplyr

Maybe this is what you want: # sample ...READ MORE

Jun 19, 2018 in Data Analytics by CodingByHeart77
• 3,750 points
1,146 views
0 votes
1 answer

Do-while loop in R

You can use a repeat{} loop and check ...READ MORE

Jun 14, 2018 in Data Analytics by DataKing99
• 8,250 points
973 views