This shows you the differences between two versions of the page.
teaching:infoh419 [2018/09/01 10:37] ezimanyi [Software] |
teaching:infoh419 [2018/09/17 18:49] ezimanyi [Group Project] |
||
---|---|---|---|
Line 6: | Line 6: | ||
* [[http://cs.ulb.ac.be/members/esteban/|Esteban Zimányi]] | * [[http://cs.ulb.ac.be/members/esteban/|Esteban Zimányi]] | ||
* <ezimanyi@ulb.ac.be> | * <ezimanyi@ulb.ac.be> | ||
- | * Room SU A 4.115 | ||
- | |||
===== Volume ===== | ===== Volume ===== | ||
Line 18: | Line 16: | ||
* Master in Computer Sciences [INFO] | * Master in Computer Sciences [INFO] | ||
* Erasmus Mundus Master in Big Data Management and Analytics (BDMA) | * Erasmus Mundus Master in Big Data Management and Analytics (BDMA) | ||
+ | |||
+ | ===== Schedule ===== | ||
+ | |||
+ | The course is given during the first semester | ||
+ | * Lectures on Tuesdays from 2 pm to 4 pm in room S.UA4.218 | ||
+ | * Exercises on Fridays from 4 pm to 6 pm in room S.UB4.130 | ||
===== Grading ===== | ===== Grading ===== | ||
Line 59: | Line 63: | ||
* {{teaching:infoh419:dw01-introduction.pdf|Introduction}} | * {{teaching:infoh419:dw01-introduction.pdf|Introduction}} | ||
* {{teaching:infoh419:dw02-cubes.pdf|Cubes}} | * {{teaching:infoh419:dw02-cubes.pdf|Cubes}} | ||
+ | * {{teaching:infoh419:database_explosion_report.pdf|Database explosion report}} | ||
+ | * {{teaching:infoh419:database_explosion.pdf|Database explosion}} | ||
* {{teaching:infoh419:dw03-dfm.pdf|Dimension Fact Model}} | * {{teaching:infoh419:dw03-dfm.pdf|Dimension Fact Model}} | ||
* {{teaching:infoh419:dw04-logicalmodel.pdf|Logical Model}} | * {{teaching:infoh419:dw04-logicalmodel.pdf|Logical Model}} | ||
Line 73: | Line 79: | ||
===== Software ===== | ===== Software ===== | ||
- | All software used in the course is available in the computer labs. Students who want a personal copy of the software on their own computers can obtain it for free. Brief instructions for obtaining the software are included below; if you need additional help, contact the sysadmin of our group: Arthur Lesuisse <alesuiss@ulb.ac.be> | + | All software used in the course is available in the computer labs. Students who want a personal copy of the software on their own computers can obtain it for free. Brief instructions for obtaining the software are included below; if you need additional help, contact the sysadmin of the department: Arthur Lesuisse <alesuiss@ulb.ac.be> |
* MS SQL Server Tools: can be downloaded for free from http://www.academicshop.be/msdnaa/. Register on this page with your ULB email address and 'order' the free msdnaa. After verification, you will receive login credentials to download a number of software packages for free. Select the SQL Server 2014 Enterprise edition. | * MS SQL Server Tools: can be downloaded for free from http://www.academicshop.be/msdnaa/. Register on this page with your ULB email address and 'order' the free msdnaa. After verification, you will receive login credentials to download a number of software packages for free. Select the SQL Server 2014 Enterprise edition. | ||
Line 83: | Line 89: | ||
* [[teaching:infoh419:TP|Exercises Web page]] | * [[teaching:infoh419:TP|Exercises Web page]] | ||
- | ===== Group assignment ===== | + | ===== Group Project ===== |
- | The assignment is carried out in groups of 3 to 4 people. Before you can submit assignment part I, you will have to register in a group. The link to register a group is included below. Please select your group on or before 25/10/2018. | + | [[http://www.tpc.org|TPC]] is a non-profit corporation that defines transaction processing and database benchmarks and disseminates objective, verifiable TPC performance data to the industry. Regarding data warehouses, two TPC benchmarks are relevant: |
+ | * [[http://www.tpc.org/tpcds/|TPC-DS]], the Decision Support Benchmark, which models the decision support functions of a retail product supplier. | ||
+ | * [[http://www.tpc.org/tpcdi/|TPC-DI]], the Data Integration Benchmark, which models a typical ETL process that loads a data warehouse. | ||
- | The assignment consists of 2 parts: | + | The course project consists of 2 parts: |
+ | * Part I: Implement the TPC-DS benchmark (deadline 1/11/2018) | ||
+ | * Part II: Implement the TPC-DI benchmark (deadline 20/12/2018) | ||
+ | You are free to choose the tools with which you implement the two benchmarks. For example, the TPC-DS benchmark can be implemented on SQL Server Analysis Services, Pentaho Analysis Services (aka Mondrian), etc. Similarly, the TPC-DI benchmark could be implemented on SQL Server Integration Services, Pentaho Data Integration, Talend Data Studio, or even with SQL scripts. | ||
- | * Part I: Create a conceptual model and translate to a logical schema (deadline 15/11/2018) | + | The project is carried out in groups of 3 to 4 people. Before you can submit part I of the project, you must register in a group, which will be the same for both parts. To register a group, send an email to the lecturer. Please select your group on or before 1/10/2018. The submission deadlines for parts I and II are strict. |
- | * Part II: (deadline 20/12/2018) | + | |
- | * Creating ETL scripts for updating the database in SSIS, | + | |
- | * Predicting how the size of the data warehouse will grow over time, | + | |
- | * Deploy a data cube on top of the data warehouse and create a report. | + | |
- | Assignment part I will be available on 25/10. For the next parts, assignment II will become available right after the submission deadline of assignment part I. The submission deadlines for parts I and II are strict. | + | The deliverables expected for each part of the project are the following: |
+ | * A report in PDF format explaining the essential aspects of your implementation, and | ||
+ | * A zip file containing the code of your implementation, with all the instructions the lecturer needs to replicate it. | ||
- | The assignment evaluation counts for 30% of your total grade. This may seem undervalued; however, putting effort into the assignment will help you understand the course material better, which should result in a better score in the paper exam, which accounts for 70% of the grade. | + | The project evaluation counts for 30% of your total grade. This may seem undervalued; however, putting effort into the project will help you understand the course material better, which should result in a better score in the paper exam, which accounts for 70% of the grade. |
===== Examinations from Previous Years ===== | ===== Examinations from Previous Years ===== |