What is the difference between a Big Data Warehouse and a traditional Data Warehouse

Question

Usually, data warehouses in the context of big data are managed and implemented on the basis of Hadoop-based system, like Apache Hive (right?).&#160;On the other hand, my question regards the methodological process.&#160;How do big data affect the design process of a data warehouse?&#160;Is the process similar or new tasks must be considered?

Frankie · Answer

Hadoop&#160;is similar in architecture to MPP data warehouses, but with some significant differences. Instead of rigidly defined by a parallel architecture, processors are loosely coupled across a Hadoop cluster and each can work on different data sources.The data manipulation engine, data catalog, and storage engine can work independently of each other with Hadoop serving as a collection point. Also critical is that Hadoop can easily accommodate both structured and unstructured data.&#160;This makes it an ideal environment for iterative inquiry. Instead of having to define analytics outputs according to narrow constructs defined by the schema, business users can experiment to find what queries matter to them most. Relevant data can then be extracted and loaded into a data warehouse for fast queries.The Hadoop ecosystem starts from the same aim of wanting to collect together as much interesting data as possible from different systems, but approaches it in a radically better way. With this approach, you dump all data of interest into a big data store (usually HDFS &#8211; Hadoop Distributed File System). This is often in cloud storage &#8211; cloud storage is good for the task, because it&#8217;s cheap and flexible, and because it puts the data close to cheap cloud computing power.&#160;You can still then do ETL and create a data warehouse using tools like Hive if you want, but more importantly you also still have all of the raw data available so you can also define new questions and do complex analyses over all of the raw historical data if you wish. The Hadoop toolset allows great flexibility and power of analysis, since it does big computation by splitting a task over large numbers of cheap commodity machines, letting you perform much more powerful, speculative, and rapid analyses than is possible in a traditional warehouse.

What is the difference between a Big Data Warehouse and a traditional Data Warehouse

Your comment on this question:

1 answer to this question.

Your answer

Your comment on this answer:

Related Questions In Big Data Hadoop

What is the difference between a zero reducer and identity reducer in Hadoop Mapreduce?

What is the difference between the Smart Data Access of SAP HANA and SAP HANA Vora?

What is the difference between partitioning and bucketing a table in Hive ?

What is the difference between Mongodb and Hadoop?

What is the difference between local file system commands touch and touchz?

How do I print hadoop properties in command line?

What is Network Topology in Hadoop?

Hadoop Mapreduce word count Program

What is the difference between a Big Data Warehouse and a traditional Data Warehouse?

What is the difference between Big Data and Data Mining?

Subscribe to our Newsletter, and get personalized recommendations.

TRENDING CERTIFICATION COURSES

TRENDING MASTERS COURSES

COMPANY

WORK WITH US

DOWNLOAD APP

CATEGORIES

CATEGORIES

TRENDING BLOG ARTICLES

TRENDING BLOG ARTICLES