What is raw data?

0 votes
What is raw data?
Nov 14, 2018 in Data Analytics by Ali

1 answer to this question.

0 votes

Raw data is data that hasn’t been processed. It’s also called source data or atomic data. It is typically unstructured or unformatted data as it sits in a repository, and it can take the form of files, images or database records.

What does raw data look like? A table with rows and columns? Maybe, but that isn’t always the case. Let’s understand this better with the Netflix example. There are hundreds of files associated with each episode. These files contain records of the views that episodes have received from different regions and networks over different time intervals. They might also contain corrupt, inaccurate, irrelevant or redundant data that isn’t required.

Extracting the relevant data from each file and representing it as tables built for the intended use is what we call data cleaning.
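To make the cleaning step concrete, here is a minimal Python sketch. The column names, the sample rows and the clean_views function are all made up for illustration; they are not the real format of any Netflix file. It only shows one way corrupt, inaccurate, redundant and irrelevant records could be filtered out of a raw file before the result is put into a table.

```python
import csv
import io

# Hypothetical raw export: one row per view record for an episode.
# The columns and values are invented for this example.
RAW_VIEWS = """episode_id,region,network,views,debug_note
E01,US,wifi,120,ok
E01,US,wifi,120,ok
E01,IN,mobile,-5,ok
E02,UK,wifi,abc,retry
E02,UK,wifi,80,retry
,DE,wifi,40,ok
"""


def clean_views(raw_text):
    """Return a list of (episode_id, region, views) tuples from raw CSV text."""
    seen = set()
    cleaned = []
    for row in csv.DictReader(io.StringIO(raw_text)):
        episode = (row["episode_id"] or "").strip()
        region = (row["region"] or "").strip()

        # Corrupt: a missing identifier makes the record unusable.
        if not episode or not region:
            continue

        # Corrupt: the view count must be a whole number.
        try:
            views = int(row["views"])
        except (TypeError, ValueError):
            continue

        # Inaccurate: a view count cannot be negative.
        if views < 0:
            continue

        # Redundant: skip records that repeat one already kept.
        key = (episode, region, row["network"], views)
        if key in seen:
            continue
        seen.add(key)

        # Irrelevant: network and debug_note are dropped from the output.
        cleaned.append((episode, region, views))
    return cleaned


if __name__ == "__main__":
    for record in clean_views(RAW_VIEWS):
        print(record)
```

Which rules count as "corrupt" or "irrelevant" depends on what the table will be used for, so the checks above are assumptions you would adapt to your own data.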

answered Nov 14, 2018 by Maverick
• 10,000 points


© 2018 Brain4ce Education Solutions Pvt. Ltd. All rights Reserved.