Most voted questions in Data Analytics

0 votes
1 answer

Check Distribution of Categorical Variable in R

Consider the iris dataset. Use the table() function ...READ MORE

Jul 30, 2018 in Data Analytics by CodingByHeart77
• 3,740 points
8,080 views
0 votes
1 answer

Data Imputation Packages

These are some packages in R which ...READ MORE

Jul 28, 2018 in Data Analytics by Sahiti
• 6,370 points
408 views
0 votes
1 answer

Load CSV File in R

Loading a .csv file in R is ...READ MORE

Jul 28, 2018 in Data Analytics by Sahiti
• 6,370 points
852 views
0 votes
1 answer

How do I become a data scientist step by step?

I am assuming that you are a ...READ MORE

Jul 26, 2018 in Data Analytics by Abhi
• 3,720 points
562 views
0 votes
2 answers

Why should anyone learn Python instead of R for machine learning?

Machine learning is the latest technology everyone ...READ MORE

Apr 13, 2019 in Data Analytics by SA
• 1,090 points
741 views
0 votes
1 answer

Mean of one column with respect to another

Consider the inbuilt iris dataset in R. Use ...READ MORE

Jul 25, 2018 in Data Analytics by CodingByHeart77
• 3,740 points
1,665 views
0 votes
1 answer

Steps to evaluate Linear Regression in R

These are sequential steps which need to ...READ MORE

Jul 25, 2018 in Data Analytics by CodingByHeart77
• 3,740 points
971 views
0 votes
1 answer

Would the time taken to train an n-tree random forest be equal to n x (decision tree)?

No, the time to train the random ...READ MORE

Aug 7, 2018 in Data Analytics by Abhi
• 3,720 points
366 views
0 votes
1 answer

On a given dataset, would the time taken to train an n-tree random forest be equal to the time taken to train n x (decision tree)?

No, the time to train the random ...READ MORE

Jul 24, 2018 in Data Analytics by Abhi
• 3,720 points
1,058 views
0 votes
1 answer

Components in grammar of graphics

Broadly speaking, these are the different components in the grammar of ...READ MORE

Jul 24, 2018 in Data Analytics by Sahiti
• 6,370 points
564 views
0 votes
2 answers

How does data cleaning play a vital role in data analysis?

Data is the core you do your ...READ MORE

Jul 24, 2018 in Data Analytics by Abhi
• 3,720 points
4,992 views
0 votes
1 answer

Turn values into scientific notation in R

Consider the below vector: a<-c(0.1324,0.0001234,234.21341324,09.324324) You can convert it ...READ MORE

Jul 19, 2018 in Data Analytics by DataKing99
• 8,240 points
11,395 views
0 votes
1 answer

Join multiple strings in R

Joining strings in R is quite an ...READ MORE

Jul 19, 2018 in Data Analytics by DataKing99
• 8,240 points
1,330 views
0 votes
2 answers

Recommender Systems

Hello, I have one project which is based ...READ MORE

Mar 30, 2019 in Data Analytics by JAY SHAH
1,296 views
0 votes
1 answer

Logistic Regression Example

Logistic Regression, often referred to as the ...READ MORE

Jul 19, 2018 in Data Analytics by CodingByHeart77
• 3,740 points
1,239 views
0 votes
2 answers

PCA model in R

Principal component analysis (PCA) is routinely employed on ...READ MORE

Jul 19, 2018 in Data Analytics by zombie
• 3,790 points
1,009 views
0 votes
1 answer

Multiple plots onto a single page in R

Plotting multiple plots onto a single page ...READ MORE

Jul 17, 2018 in Data Analytics by Sahiti
• 6,370 points
441 views
0 votes
1 answer

Cross Validation

Cross-validation is a model validation technique for ...READ MORE

Jul 17, 2018 in Data Analytics by CodingByHeart77
• 3,740 points
455 views
0 votes
1 answer

Difference between Eigenvectors and Eigenvalues

Eigenvectors are used for understanding linear transformations. In data ...READ MORE

Jul 17, 2018 in Data Analytics by CodingByHeart77
• 3,740 points
9,304 views
0 votes
1 answer

Univariate / Bivariate / Multivariate

Univariate: These analyses are descriptive statistical analysis techniques which ...READ MORE

Jul 13, 2018 in Data Analytics by CodingByHeart77
• 3,740 points
2,375 views
0 votes
1 answer

Statistical Power of Sensitivity

Sensitivity is commonly used to validate the ...READ MORE

Jul 13, 2018 in Data Analytics by CodingByHeart77
• 3,740 points
796 views
0 votes
1 answer

Treat outliers in Dataset

Outlier values can be identified by using ...READ MORE

Jul 12, 2018 in Data Analytics by Sahiti
• 6,370 points
597 views
0 votes
1 answer

How to treat missing values during analysis?

The extent of the missing values is ...READ MORE

Jul 12, 2018 in Data Analytics by Sahiti
• 6,370 points
805 views
0 votes
1 answer

Overfitting vs Underfitting

In statistics and machine learning, one of ...READ MORE

Jul 11, 2018 in Data Analytics by CodingByHeart77
• 3,740 points
960 views
0 votes
1 answer

Selection Bias

Selection bias is the bias introduced by the ...READ MORE

Jul 11, 2018 in Data Analytics by CodingByHeart77
• 3,740 points
754 views
0 votes
1 answer

White noise model in R

The white noise (WN) model is a ...READ MORE

Jul 10, 2018 in Data Analytics by Sahiti
• 6,370 points
816 views
0 votes
1 answer

Random Walk model in R

A random walk is a simple example ...READ MORE

Jul 10, 2018 in Data Analytics by Sahiti
• 6,370 points
777 views
0 votes
2 answers

Apply function in R

apply Description: “Returns a vector or array or ...READ MORE

Jul 10, 2018 in Data Analytics by zombie
• 3,790 points
775 views
0 votes
2 answers

Clustering in R

Clustering is a type of unsupervised learning ...READ MORE

Jul 9, 2018 in Data Analytics by zombie
• 3,790 points
670 views
0 votes
1 answer

Rattle packages in R

Rattle is a popular GUI for data mining ...READ MORE

Jul 6, 2018 in Data Analytics by CodingByHeart77
• 3,740 points
617 views
0 votes
1 answer

How to build a Scatter-plot using plotly?

With the help of “plotly” we can ...READ MORE

Jul 6, 2018 in Data Analytics by CodingByHeart77
• 3,740 points
1,447 views
0 votes
1 answer

How to create a new R6 Class in R?

You have to first create an object ...READ MORE

Jul 5, 2018 in Data Analytics by DataKing99
• 8,240 points
1,073 views
0 votes
1 answer

Confusion matrix in R

A confusion matrix can be used to ...READ MORE

Jul 5, 2018 in Data Analytics by DataKing99
• 8,240 points
1,011 views
0 votes
1 answer

How to create a box-plot using “plotly” in R?

You can use this command to create ...READ MORE

Jul 4, 2018 in Data Analytics by CodingByHeart77
• 3,740 points
7,240 views
0 votes
1 answer

How to write a custom function which will replace all the missing values in a vector with the mean of values in R?

Consider this vector: a<-c(1,2,3,NA,4,5,NA,NA) Write the function to impute ...READ MORE

Jul 4, 2018 in Data Analytics by CodingByHeart77
• 3,740 points
4,261 views
0 votes
1 answer

Which packages are used for data mining in R?

You can refer to the following packages ...READ MORE

Jul 3, 2018 in Data Analytics by DataKing99
• 8,240 points
1,292 views
0 votes
1 answer

What is a Random Walk model and how can you simulate it using R?

A random walk is a simple example ...READ MORE

Jul 3, 2018 in Data Analytics by DataKing99
• 8,240 points
3,054 views
0 votes
1 answer

How to sample random rows in dataframe?

Create a data frame and then implement as ...READ MORE

Jul 3, 2018 in Data Analytics by Sahiti
• 6,370 points
631 views
0 votes
1 answer

How to sample n random rows per group in a dataframe?

You can assign a random ID to ...READ MORE

Jul 3, 2018 in Data Analytics by Sahiti
• 6,370 points
4,749 views
0 votes
1 answer

Create vector matrix of movie ratings using R project?

Why don't you try the dcast function in the reshape2 package? d ...READ MORE

Jun 30, 2018 in Data Analytics by anonymous
901 views
0 votes
1 answer

How to calculate group mean and assign it to new data in R

You can use something like this: df$grp.mean.values <- ...READ MORE

Jun 27, 2018 in Data Analytics by Sahiti
• 6,370 points
1,571 views
0 votes
1 answer

How to find out cluster center mean of DBSCAN in R?

Just index back into the original data ...READ MORE

Jun 27, 2018 in Data Analytics by Sahiti
• 6,370 points
1,204 views
0 votes
1 answer

TwitteR package download of package ‘rjson’ fails in R

If you don't want to upgrade your ...READ MORE

Jun 27, 2018 in Data Analytics by CodingByHeart77
• 3,740 points
1,194 views
0 votes
1 answer

SMOTE-function not working in R

If you convert 'y' to a factor, ...READ MORE

Jun 27, 2018 in Data Analytics by CodingByHeart77
• 3,740 points
2,708 views
0 votes
1 answer

hclust size limit

Classic hierarchical clustering approaches are O(n^3) in runtime and O(n^2) in ...READ MORE

Jun 26, 2018 in Data Analytics by DataKing99
• 8,240 points
681 views
0 votes
1 answer

How to find the cluster center mean of DBSCAN in R?

Just index back into the original data ...READ MORE

Jun 26, 2018 in Data Analytics by DataKing99
• 8,240 points
578 views
0 votes
1 answer

How to convert a sentence to word table in R?

Try the following code: sentence <- c("case sweden", ...READ MORE

Jun 21, 2018 in Data Analytics by Sahiti
• 6,370 points
1,269 views
0 votes
1 answer

Use different distance formula other than euclidean distance in k means

K-means is based on variance minimization. The sum-of-variance formula ...READ MORE

Jun 21, 2018 in Data Analytics by Sahiti
• 6,370 points
1,421 views
0 votes
1 answer

Is there any easy way to fill in missing data?

You can try the following code: First, you ...READ MORE

Jun 20, 2018 in Data Analytics by DataKing99
• 8,240 points
903 views
0 votes
1 answer

Extract lhs items from rules in R

Try the following code: data("Adult") rules <- apriori(Adult, ...READ MORE

Jun 20, 2018 in Data Analytics by DataKing99
• 8,240 points
1,166 views