What is raw data

0 votes
What is raw data?
Nov 14, 2018 in Data Analytics by Ali
• 11,360 points
880 views

1 answer to this question.

0 votes

Raw data is the data that hasn’t been processed. It’s also called source data or atomic data. It is basically unstructured or unformatted repository data. It can be in form of files, images or database records. How does raw data look like? A table with rows and columns? Maybe, but this isn’t the case all the time. Lets again understand this better with the Netflix example. As mentioned earlier, there are hundreds of files associated with each episode. These files contain the records of the views that episodes have gotten from different regions and networks and over different time intervals. These files might also contain corrupt, inaccurate, irrelevant or even redundant data which aren’t required. Extracting the right amount of data from each file and representing it in form of tables with the exact intention of using it, is something we call data cleaning.

answered Nov 14, 2018 by Maverick
• 10,840 points

Related Questions In Data Analytics

0 votes
1 answer

What is data science?

Data Science is the practice of: Asking questions (formulating hypothesis), ...READ MORE

answered Aug 3, 2018 in Data Analytics by Abhi
• 3,720 points
896 views
0 votes
1 answer
0 votes
2 answers

What is nominal data and how to deal with it?

Nominal Data: Nominal values represent discrete units and ...READ MORE

answered Sep 4, 2018 in Data Analytics by shams
• 3,670 points
1,734 views
0 votes
2 answers

What is difference between Distributed search head and Search head cluster?

 A distributed environment describes the separation of ...READ MORE

answered Dec 4, 2018 in Data Analytics by Ali
• 11,360 points
2,519 views
+1 vote
5 answers

How to remove NA values with dplyr::filter()

Try this: df %>% filter(!is.na(col1)) READ MORE

answered Mar 26, 2019 in Data Analytics by anonymous
332,258 views
0 votes
1 answer
0 votes
1 answer

How can I use parallel so that it preserves the list of data frames

You can use pmap as follows: nc <- ...READ MORE

answered Apr 4, 2018 in Data Analytics by kappa3010
• 2,090 points
1,106 views
0 votes
1 answer
0 votes
2 answers

What are the skills required for data science?

Data Science is a platform for analyzing ...READ MORE

answered Apr 4, 2019 in Data Analytics by MrBoot
• 1,230 points
1,420 views
0 votes
1 answer

What is active binding in R programming

Active bindings in R are much like ...READ MORE

answered Oct 30, 2018 in Data Analytics by Maverick
• 10,840 points
1,503 views
webinar REGISTER FOR FREE WEBINAR X
REGISTER NOW
webinar_success Thank you for registering Join Edureka Meetup community for 100+ Free Webinars each month JOIN MEETUP GROUP