.SD in data.table in R

Question

What does .SD stand for? How is it helpful, and when should it be used?

According to one source, .SD is a data.table containing the subset of x's data for each group, excluding the group column(s). It can be used when grouping by i, when grouping by by, keyed by, and ad hoc by.

Does that mean that the subset data.tables are held in memory for the next operation?

nirvana · Answer

.SD stands for "Subset of Data.table". The leading dot has no special meaning; it only makes a clash with a user-defined column name less likely.

Consider the following data.table:

library(data.table)
DT = data.table(a=rep(c("x","y","z"),each=2), b=c(1,3), p=1:6)
setkey(DT, b)
DT
#    a b p
# 1: x 1 1
# 2: y 1 3
# 3: z 1 5
# 4: x 3 2
# 5: y 3 4
# 6: z 3 6

Try the code below to see what .SD does:

DT[ , .SD[ , paste(a, p, sep="", collapse="_")], by=b]
#    b       V1
# 1: 1 x1_y3_z5
# 2: 3 x2_y4_z6

The by=b statement splits the original data.table into two sub-data.tables:

DT[ , print(.SD), by=b]
# 1st sub-data.table, called '.SD' while it's being operated on:
#    a p
# 1: x 1
# 2: y 3
# 3: z 5
# 2nd sub-data.table, called '.SD' while it's being operated on:
#    a p
# 1: x 2
# 2: y 4
# 3: z 6
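# Final output. When this answer was written, print() returned nothing, so the result was:
# Empty data.table (0 rows) of 1 col: b
# (if your version's print method returns the table invisibly, the rows are returned instead)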

data.table operates on these subsets in turn. While it is working on one of them, it lets you refer to the current subset through the symbol .SD, so you can access and operate on its columns easily. The expression in j is carried out on every sub-data.table defined by the grouping columns, and the per-group results are then "pasted" back together and returned as a single data.table.
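
Two patterns where .SD tends to be useful are sketched below: applying a function to every column of each group, and picking rows within each group. This is only an illustration on the same example table; the result names sums and top are made up for this sketch, and .SDcols is used here to restrict .SD to the numeric column p.

```r
library(data.table)

# Same example table as in the answer (columns a, b, p), keyed by b
DT <- data.table(a = rep(c("x", "y", "z"), each = 2), b = c(1, 3), p = 1:6)
setkey(DT, b)

# Apply a function to each column of .SD, once per group of b.
# .SDcols limits .SD to column p, so sum() is never applied to the character column a.
sums <- DT[, lapply(.SD, sum), by = b, .SDcols = "p"]
print(sums)   # p is 9 for b = 1 and 12 for b = 3

# Treat .SD as an ordinary data.table inside each group:
# keep the row with the largest p for every value of b.
top <- DT[, .SD[which.max(p)], by = b]
print(top)    # rows (b = 1, a = "z", p = 5) and (b = 3, a = "z", p = 6)
```

In both calls the grouping column b is not part of .SD; data.table adds it back to the result alongside whatever j returns for each group.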
