Table of Contents

INFO-H-419: Data Warehouses 2012

Lecturer

Volume

Study Programme

Old Exams

Old exams of my Advanced Databases course at the TU/e; please note that only the questions about data warehousing are relevant.

Solution to the January 2013 exam

Course Summary

Relational and object-oriented databases are mainly suited for operational settings in which there are many small transactions querying and writing to the database. Consistency of the database (in the presence of potentially conflicting transactions) is of utmost importance. Much different is the situation in analytical processing where historical data is analyzed and aggregated in many different ways. Such queries differ significantly from the typical transactional queries in the relational model:

For these reasons, data to be analyzed is typically collected into a data warehouse with Online Analytical Processing support. Online here refers to the fact that the answers to the queries should not take too long to be computed. Collecting the data is often referred to as Extract-Transform-Load (ELT). The data in the data warehouse needs to be organized in a way to enable the analytical queries to be executed efficiently. For the relational model star and snowflake schemes are popular designs. Next to OLAP on top of a relational database (ROLAP), also native OLAP solutions based on multidimensional structures (MOLAP) exist. In order to further improve query answering efficiency, some query results can already be materialized in the database, and new indexing techniques have been developped.

In the course, the main concepts of multidimensional databases will be covered and illustrated using the SQL Server tools. Complimentary to the course, IBM offers a “proof of technology” session to illustrate a business perspective and alternative tools.

Books and other lecture material

Extra books

The following books have been used to construct the course material, but are not required reading for the course:

Software

The 2012-2013 academic year student password is 'Analytics'.

Detailed Schedule

This schedule is as detailed as possible at this moment; it may be subject to change. Changes will clearly be communicated to the students through this page, and during the lectures.

Important note: as from week 6 on, the schedule for the Data Warehousing course has changed. Find the correct dates and locations in the schedule below.

Week 1 till 5:

The lectures take place on Thursday afternoon: * Thu. 2-4: Theory in S.C4.219 * Thu. 4-6: Exercises in S.C4.219

Week 6 and following:

Lectures are on Wednesday:

Schedule: