What is raw data?

0 votes
What is raw data?
Nov 14, 2018 in Data Analytics by Ali
• 10,430 points
15 views

1 answer to this question.

0 votes

Raw data is the data that hasn’t been processed. It’s also called source data or atomic data. It is basically unstructured or unformatted repository data. It can be in form of files, images or database records. How does raw data look like? A table with rows and columns? Maybe, but this isn’t the case all the time. Lets again understand this better with the Netflix example. As mentioned earlier, there are hundreds of files associated with each episode. These files contain the records of the views that episodes have gotten from different regions and networks and over different time intervals. These files might also contain corrupt, inaccurate, irrelevant or even redundant data which aren’t required. Extracting the right amount of data from each file and representing it in form of tables with the exact intention of using it, is something we call data cleaning.

answered Nov 14, 2018 by Maverick
• 10,040 points

Related Questions In Data Analytics

0 votes
2 answers

What is data science?

Data Science is the practice of: Asking questions (formulating hypothesis), ...READ MORE

answered Aug 2, 2018 in Data Analytics by Anmol
• 3,620 points
42 views
0 votes
1 answer
0 votes
2 answers

What is nominal data and how to deal with it?

Nominal Data: Nominal values represent discrete units and ...READ MORE

answered Sep 3, 2018 in Data Analytics by shams
• 3,580 points
78 views
0 votes
2 answers

What is difference between Distributed search head and Search head cluster?

 A distributed environment describes the separation of ...READ MORE

answered Dec 3, 2018 in Data Analytics by Ali
• 10,430 points
182 views
0 votes
4 answers

How to remove NA values with dplyr::filter()

Can we create a alist as below ...READ MORE

answered Aug 5 in Data Analytics by anonymous
8,897 views
0 votes
1 answer
0 votes
1 answer

How can I use parallel so that it preserves the list of data frames

You can use pmap as follows: nc <- ...READ MORE

answered Apr 4, 2018 in Data Analytics by kappa3010
• 2,020 points
39 views
0 votes
1 answer
0 votes
2 answers

What are the skills required for data science?

Data Science is a platform for analyzing ...READ MORE

answered Apr 4 in Data Analytics by MrBoot
• 1,210 points
42 views
0 votes
1 answer

What is active binding in R programming

Active bindings in R are much like ...READ MORE

answered Oct 30, 2018 in Data Analytics by Maverick
• 10,040 points
40 views