Latest questions in Data Analytics

0 votes
1 answer

How to apply loops to multiply across various columns in a dataframe?

I am assuming that your columns are ...READ MORE

Apr 6, 2018 in Data Analytics by kappa3010
• 2,090 points
6,411 views
0 votes
1 answer

How to use dplyr functions such as filter() inside nested data frames with map()

You can use map() call as follows:  map(full, ...READ MORE

Apr 6, 2018 in Data Analytics by Sahiti
• 6,370 points
4,299 views
0 votes
1 answer

Join list of data.frames using map() call

You can use Reduce set.seed(24) r1 <- map(c(5, 10, 15), ...READ MORE

Apr 6, 2018 in Data Analytics by Sahiti
• 6,370 points
802 views
0 votes
1 answer

How to join two tables (tibbles) by *list* columns in R

You can use the hash from digest ...READ MORE

Apr 6, 2018 in Data Analytics by kappa3010
• 2,090 points
1,426 views
0 votes
1 answer

How to convert a text mining TermDocumentMatrix into Excel or CSV in R?

By assuming that all the values are ...READ MORE

Apr 5, 2018 in Data Analytics by DeepCoder786
• 1,720 points
1,646 views
0 votes
1 answer

Sum of multiple columns using the dplyr package

Basically here we are making an equation ...READ MORE

Apr 5, 2018 in Data Analytics by DeepCoder786
• 1,720 points
2,025 views
0 votes
1 answer

How to achieve pivot-like data using the tidyverse library in R?

You need not spread twice, if you ...READ MORE

Apr 4, 2018 in Data Analytics by kappa3010
• 2,090 points
884 views
0 votes
1 answer

How can I use parallel so that it preserves the list of data frames

You can use pmap as follows: nc <- ...READ MORE

Apr 4, 2018 in Data Analytics by kappa3010
• 2,090 points
794 views
0 votes
1 answer

Removing unimportant variables before building Random Forest

Build the randomForest model on top of ...READ MORE

Apr 4, 2018 in Data Analytics by Bharani
• 4,660 points
1,935 views
0 votes
1 answer

AUC for 3-Class prediction in R

Use type='probs' inside predict() function, so that ...READ MORE

Apr 4, 2018 in Data Analytics by Bharani
• 4,660 points
1,177 views
0 votes
2 answers

How to split strings into values in a data frame?

You can do this using dplyr and ...READ MORE

Dec 5, 2018 in Data Analytics by Kalgi
• 52,360 points
806 views
0 votes
1 answer

Calculating accuracy of prediction of rpart model

Your first task would be to build ...READ MORE

Apr 4, 2018 in Data Analytics by Bharani
• 4,660 points
7,895 views
0 votes
1 answer

Selecting a sublist with purrr

Yes, purrr does provide a package called ...READ MORE

Apr 4, 2018 in Data Analytics by Bharani
• 4,660 points
1,060 views
+1 vote
1 answer

How to convert a list of vectors with various length into a Data.Frame?

We can easily use this command as.data.frame(lapply(d1, "length< ...READ MORE

Apr 4, 2018 in Data Analytics by DeepCoder786
• 1,720 points
1,297 views
0 votes
3 answers

R vs MATLAB, which is better with respect to machine learning?

Hello, Both are good programming languages you ...READ MORE

Apr 12, 2019 in Data Analytics by SA
• 1,090 points
1,136 views
0 votes
2 answers

Integration of R with Java

There is a package called rJava that ...READ MORE

Dec 5, 2018 in Data Analytics by Kalgi
• 52,360 points
811 views
0 votes
1 answer

How to change multiple characters in a column to a date

Firstly we have to set dataf variable ...READ MORE

Apr 3, 2018 in Data Analytics by DeepCoder786
• 1,720 points
524 views
0 votes
1 answer

How to use a function to repeat a set of procedures on specific set of columns in a data frame?

You can parse the strings to symbols. ...READ MORE

Apr 3, 2018 in Data Analytics by kappa3010
• 2,090 points
1,316 views
0 votes
5 answers

How to remove NA values with dplyr::filter()

Try this: df %>% filter(!is.na(col1)) READ MORE

Mar 26, 2019 in Data Analytics by anonymous
321,772 views
0 votes
1 answer

How to replace NA with 0 using starts_with()

Well I could suggest various options such ...READ MORE

Apr 3, 2018 in Data Analytics by Sahiti
• 6,370 points
1,340 views
0 votes
1 answer

With the help of tidyverse: how to rename a column to a variable name

With the help of dplyr: rename function ...READ MORE

Apr 3, 2018 in Data Analytics by DeepCoder786
• 1,720 points

edited Apr 3, 2018 by DeepCoder786
763 views
0 votes
1 answer

How to change y axis max in time series using R?

The axis limits are being set using ...READ MORE

Apr 3, 2018 in Data Analytics by Sahiti
• 6,370 points
3,554 views
0 votes
1 answer

Converting R data-frame to h2o object

All you have to do is run ...READ MORE

Apr 3, 2018 in Data Analytics by Bharani
• 4,660 points
2,385 views
0 votes
1 answer

Building Random Forest on a data-set containing missing (NA) values

You have two options, either impute the ...READ MORE

Apr 3, 2018 in Data Analytics by Bharani
• 4,660 points
1,058 views
0 votes
1 answer

Imputation in R

You can work with the "Hmisc" package, ...READ MORE

Mar 27, 2018 in Data Analytics by Bharani
• 4,660 points
390 views
+1 vote
2 answers

Finding number of missing values and removing those missing values from a data-frame

To find number of missing values for ...READ MORE

Aug 14, 2019 in Data Analytics by anonymous
879 views
+1 vote
2 answers

Custom Function to replace missing values in a vector with the mean of values

Try this. lapply(a,function(x){ifelse(is.na(x),mean(a,na.rm = TRUE ...READ MORE

Aug 14, 2019 in Data Analytics by anonymous
1,662 views
0 votes
1 answer

Plotting multiple graphs on the same page in R

If you want to plot 4 graphs ...READ MORE

Mar 27, 2018 in Data Analytics by Bharani
• 4,660 points
1,208 views
0 votes
1 answer

Left Join and Right Join using "dplyr"

The below is the code to perform ...READ MORE

Mar 27, 2018 in Data Analytics by Bharani
• 4,660 points
867 views
0 votes
1 answer

Using "dplyr" to summarise multiple columns

You can use the "summarise_all()" function for ...READ MORE

Mar 27, 2018 in Data Analytics by Bharani
• 4,660 points
1,485 views
+1 vote
3 answers

Filtering R data-frame with multiple conditions

You can use the 'filter' function from ...READ MORE

Mar 26, 2018 in Data Analytics by Bharani
• 4,660 points
86,538 views
0 votes
1 answer

Finding frequency of observations in R

You can use the "dplyr" package to ...READ MORE

Mar 26, 2018 in Data Analytics by Bharani
• 4,660 points
5,570 views
+1 vote
3 answers

Integration of Google Colaboratory with GitHub

You can use SSH protocol to connect ...READ MORE

Aug 7, 2018 in Data Analytics by Kalgi
• 52,360 points
2,802 views
0 votes
2 answers

Transforming a key/value string into distinct rows in R

We would start off by loading the ...READ MORE

Mar 26, 2018 in Data Analytics by Bharani
• 4,660 points
854 views
+2 votes
3 answers

Problem with installation of Wordcloud in anaconda

Using Anaconda Python 3.6 version For Windows ...READ MORE

Aug 7, 2018 in Data Analytics by Priyaj
• 58,090 points
18,097 views
0 votes
2 answers

Installing MXNet for R in Windows System

You can install it for python in ...READ MORE

Dec 4, 2018 in Data Analytics by Kalgi
• 52,360 points
1,935 views
0 votes
2 answers

"Train" and "Test" sets in Data Science

Normally to perform supervised learning you need ...READ MORE

Aug 3, 2018 in Data Analytics by Abhi
• 3,720 points
995 views
0 votes
1 answer

Hadoop Streaming job vs regular jobs?

In certain cases, Hadoop Streaming is beneficial ...READ MORE

Mar 22, 2018 in Data Analytics by kurt_cobain
• 9,390 points
616 views
0 votes
1 answer

hadoop.mapred vs hadoop.mapreduce?

org.apache.hadoop.mapred is the old API; org.apache.hadoop.mapreduce is the ...READ MORE

Mar 16, 2018 in Data Analytics by nitinrawat895
• 11,380 points
2,222 views
+1 vote
1 answer

Hadoop MapReduce word count program

Firstly you need to understand the concept ...READ MORE

Mar 16, 2018 in Data Analytics by nitinrawat895
• 11,380 points
10,635 views
+3 votes
1 answer

Hortonworks Setup for Virtualbox

Dear Learner, Please go to the below link ...READ MORE

Dec 18, 2017 in Data Analytics by Sudhir
• 1,610 points
434 views
0 votes
2 answers

Why do we need Beeline?

Remote we are connecting to hive with ...READ MORE

Jul 24, 2020 in Data Analytics by K. Raja Yasodhar
15,843 views
0 votes
1 answer

Accessing Remote Server

Hey Sudhanshu, Hope you're doing great. It was a pleasure ...READ MORE

Dec 18, 2017 in Data Analytics by Sudhir
• 1,610 points
621 views
0 votes
1 answer

Related to cluster topic

Hi Learner, Hope you are doing well, Please find ...READ MORE

Dec 18, 2017 in Data Analytics by Sudhir
• 1,610 points
361 views
0 votes
1 answer

Splunk Cluster installation document was not uploaded in the course.

Dear Learner, Greetings! We would like to inform ...READ MORE

Dec 18, 2017 in Data Analytics by Sudhir
• 1,610 points
457 views
0 votes
1 answer

Error, screenshot shared.

Hope you are doing great. Please share with ...READ MORE

Dec 18, 2017 in Data Analytics by Sudhir
• 1,610 points
382 views
+1 vote
1 answer

Join using two mappers - invalid inputfile exception

Dear Learner, Hope you are doing well. Can you ...READ MORE

Dec 18, 2017 in Data Analytics by Sudhir
• 1,610 points
444 views
0 votes
1 answer

Compiler error in Selenium

Dear Learner, Hope you are doing great. Your code ...READ MORE

Dec 18, 2017 in Data Analytics by Sudhir
• 1,610 points
471 views
0 votes
1 answer

Issue while running Oozie job

Dear Learner, Hope you are doing well. To overcome ...READ MORE

Dec 18, 2017 in Data Analytics by Sudhir
• 1,610 points
1,117 views
0 votes
1 answer

Webinar details to join the session

Session it got resolved READ MORE

Dec 18, 2017 in Data Analytics by Sudhir
• 1,610 points
1,621 views