What is raw data?

Nov 14, 2018 in Data Analytics by Ali
• 11,360 points • 1,185 views

1 answer to this question.

Raw data is data that has not been processed. It is also called source data or atomic data. It is typically unstructured or unformatted data as it sits in a repository, and it can take the form of files, images or database records.

What does raw data look like? A table with rows and columns? Sometimes, but not always. Take the Netflix example: there are hundreds of files associated with each episode. These files contain records of the views that episodes have received from different regions and networks over different time intervals. They might also contain corrupt, inaccurate, irrelevant or redundant data that isn't required. Extracting the relevant data from each file and representing it in tables suited to its intended use is what we call data cleaning.
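To make the cleaning step concrete, here is a minimal sketch in Python. Everything in it is an assumption made for illustration: the log layout, the column names (`episode_id`, `region`, `network`, `timestamp`, `views`, `debug_note`) and the function `clean_views` are hypothetical and do not describe any real Netflix format. It only shows one plausible way to drop corrupt, inaccurate, irrelevant and redundant records and end up with a small table of views per region and network.

```python
import csv
import io
from collections import defaultdict

# Hypothetical raw view log for one episode. The columns and values are
# invented for illustration only.
RAW_LOG = """episode_id,region,network,timestamp,views,debug_note
E01,US,wifi,2018-11-01T10:00,120,ok
E01,US,wifi,2018-11-01T10:00,120,ok
E01,IN,mobile,2018-11-01T10:00,abc,ok
E01,IN,mobile,2018-11-01T11:00,75,retry
E01,,wifi,2018-11-01T11:00,40,ok
E01,US,mobile,2018-11-01T12:00,-5,ok
E01,US,mobile,2018-11-01T13:00,30,ok
"""


def clean_views(raw_text):
    """Return total views per (region, network) from a raw CSV log."""
    seen = set()               # keys of valid records already counted
    totals = defaultdict(int)  # (region, network) -> total views

    for row in csv.DictReader(io.StringIO(raw_text)):
        # Incomplete record: region or network is missing.
        if not row["region"] or not row["network"]:
            continue

        # Corrupt record: the view count is not a number.
        try:
            views = int(row["views"])
        except ValueError:
            continue

        # Inaccurate record: a view count cannot be negative.
        if views < 0:
            continue

        # Redundant record: same episode, region, network and timestamp.
        key = (row["episode_id"], row["region"], row["network"], row["timestamp"])
        if key in seen:
            continue
        seen.add(key)

        # The debug_note column is irrelevant here, so it is never read.
        totals[(row["region"], row["network"])] += views

    return dict(totals)


if __name__ == "__main__":
    for (region, network), views in sorted(clean_views(RAW_LOG).items()):
        print(region, network, views)
```

Of the seven raw rows, only three survive: one is a duplicate, one has a non-numeric count, one has no region, and one has a negative count. What counts as corrupt or irrelevant depends on what you intend to do with the data, so the rules above are examples rather than a fixed recipe.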

answered Nov 14, 2018 by Maverick
• 10,840 points

