Differences

This shows you the differences between two versions of the page.

Link to this comparison view

Both sides previous revision Previous revision
Next revision
Previous revision
Next revision Both sides next revision
teaching:infoh419 [2018/09/01 10:37]
ezimanyi [Software]
teaching:infoh419 [2018/09/17 18:39]
ezimanyi [Group Project]
Line 6: Line 6:
   * [[http://​cs.ulb.ac.be/​members/​esteban/​|Esteban Zimányi]]   * [[http://​cs.ulb.ac.be/​members/​esteban/​|Esteban Zimányi]]
   * <​ezimanyi@ulb.ac.be>​   * <​ezimanyi@ulb.ac.be>​
-  * Room SU A 4.115 
- 
 ===== Volume ===== ===== Volume =====
  
Line 18: Line 16:
   * Master in Computer Sciences [INFO]   * Master in Computer Sciences [INFO]
   * Erasmus Mundus Master in Big Data Management and Analytics (BDMA)   * Erasmus Mundus Master in Big Data Management and Analytics (BDMA)
 +
 +===== Schedule =====
 +
 +The course is given during the first semester ​
 +  * Lectures on Tuesdays from 2 pm to 4 pm at the room S.UA4.218
 +  * Exercises on Fridays from 4 pm to 6 pm at the room S.UB4.130
  
 ===== Grading ===== ===== Grading =====
Line 59: Line 63:
   * {{teaching:​infoh419:​dw01-introduction.pdf|Introduction}}   * {{teaching:​infoh419:​dw01-introduction.pdf|Introduction}}
   * {{teaching:​infoh419:​dw02-cubes.pdf|Cubes}}   * {{teaching:​infoh419:​dw02-cubes.pdf|Cubes}}
 +    * {{teaching:​infoh419:​database_explosion_report.pdf|Database explosion report}}
 +    * {{teaching:​infoh419:​database_explosion.pdf|Database explosion}}
   * {{teaching:​infoh419:​dw03-dfm.pdf|Dimension Fact Model}}   * {{teaching:​infoh419:​dw03-dfm.pdf|Dimension Fact Model}}
   * {{teaching:​infoh419:​dw04-logicalmodel.pdf|Logical Model}}   * {{teaching:​infoh419:​dw04-logicalmodel.pdf|Logical Model}}
Line 73: Line 79:
 ===== Software ===== ===== Software =====
  
-All software used in the course is available in the computer labs. Students who wish a personal copy of the software on their own computers, can get free copies of the software. Succinct instructions to acquire the software have been included below; in case additional help is required you can contact the sysadmin of our group: Arthur Lesuisse <​alesuiss@ulb.ac.be>​+All software used in the course is available in the computer labs. Students who wish a personal copy of the software on their own computers, can get free copies of the software. Succinct instructions to acquire the software have been included below; in case additional help is required you can contact the sysadmin of the department: Arthur Lesuisse <​alesuiss@ulb.ac.be>​
  
   * MS SQL Server Tools: can be downloaded for free from http://​www.academicshop.be/​msdnaa/​ Register on this page with your ULB email address, and '​order'​ the free msdnaa. After verification you receive login credentials to download quite a few software packages for free. Select the SQL Server 2014 Enterprise edition.   * MS SQL Server Tools: can be downloaded for free from http://​www.academicshop.be/​msdnaa/​ Register on this page with your ULB email address, and '​order'​ the free msdnaa. After verification you receive login credentials to download quite a few software packages for free. Select the SQL Server 2014 Enterprise edition.
Line 83: Line 89:
   * [[teaching:​infoh419:​TP|Exercices Web page]]   * [[teaching:​infoh419:​TP|Exercices Web page]]
  
-===== Group assignment ​===== +===== Group Project ​=====
- +
-The assignment is carried out in groups of 3 to 4 people. Before you can submit assignment part I, you will have to register in a group. The link to register a group is included below. Please to select your group before or on 25/10/2018.+
  
-The assignment consist ​of 2 parts:+[[http://​www.tpc.org|TPC]] is a non-profit corporation that defines transaction processing and database benchmarks and disseminates objective, verifiable TPC performance data to the industry. Regarding data warehouses, two TPC benchmarks are relevant: 
 +  * [[http://​www.tpc.org/​tpcds/​|TPC-DS]],​ the Decision Support Benchmark, which models the decision support functions ​of a retail product supplier.  
 +  * [[http://​www.tpc.org/​tpcdi/​|TPC-DI]],​ the Data Integration Support Benchmark, which models a typical ETL process that loads a data warehouse.
  
-  ​* Part I: Create a conceptual model and translate to a logical schema  ​(deadline ​15/11/2018) +The project of the course consist of 2 parts: 
-  * Part II: (deadline 20/​12/​2018) +  ​* Part I: Implement the TPC-DS benchmark ​(deadline ​1/11/2018) 
-    * Creating ETL scripts for updating ​the database in SSIS, +  * Part II: Implement the TPC-DI benchmark ​(deadline 20/​12/​2018) 
-    * Predicting how the size of the data warehouse ​will grow over time, +You will have free choice to use the tools on which the two benchmarks ​will be built. For examplethe TPC-DS benchmark can be implemented ​on SQL Server Analysis Services, Pentaho Analysis Services (aka Mondrian), etc. Similarly, ​the TPC-DI could be implemented on SQL Server Integration Services, Pentaho Data Integration,​ Talend Data Studio, or even SQL scripts.
-    *  Deploy a data cube on top of the data warehouse and create a report.+
  
-Assignment ​part I will be available ​on 25/10. For the next parts, assignment II will become available right after the submission deadline of assignment part I. The submission deadlines for parts I and II are strict.+The project is carried out in groups of 3 to 4 people. Before you can submit assignment ​part I, you will have to register in a group. For registering a group send an email to the lecturer. Please to select your group before or on 1/10/2018. The submission deadlines for parts I and II are strict.
  
-The assignment ​evaluation will count for 30% of your total grade. This may seem undervalued,​ however, putting effort in the assignment ​will definitely help you in achieving a better understanding of the course material which will result in a better score in the paper exam which amounts for 70% of the grade.+The project ​evaluation will count for 30% of your total grade. This may seem undervalued,​ however, putting effort in the project ​will definitely help you in achieving a better understanding of the course material which will result in a better score in the paper exam which amounts for 70% of the grade.
  
 ===== Examinations from Previous Years ===== ===== Examinations from Previous Years =====
 
teaching/infoh419.txt · Last modified: 2023/11/20 16:18 by ezimanyi