Differences

This shows you the differences between two versions of the page.

Link to this comparison view

Both sides previous revision Previous revision
Next revision
Previous revision
teaching:infoh415 [2019/11/28 18:03]
ezimanyi [Topics for the current academic year]
teaching:infoh415 [2023/12/04 18:14] (current)
ezimanyi [Topics for the current academic year]
Line 2: Line 2:
  
  
 +===== Last important announcement ====
 +Dear all,
 +
 +The first practical session will be next week on Thursday (see schedule for room and hour). Please have your computer ready with the tools needed. If you do not have a laptop I advise you to pair up with somebody with one for the practical session.
 +
 +The first three sessions will be on spatial databases (see [[teaching:​infoh415:​TP|Exercices Web page]])!
 +
 +See you next Thursday,
 +
 +Boris.
 ===== Lecturer ===== ===== Lecturer =====
  
Line 10: Line 20:
 ===== Teaching Assistant ===== ===== Teaching Assistant =====
  
-  * [[Gilles.Dejaegere@ulb.ac.be|Gilles Dejaegere]]+  * [[boris.coquelet@ulb.be|Boris Coquelet]]
  
  
Line 32: Line 42:
  
 The course is given during the first semester ​ The course is given during the first semester ​
-  * Lectures on Mondays from 4 pm to 6 pm at the room S.AY2.108 **except 14/10/2019 which is a practical session in S.UB4.130** +  * Lectures on Mondays from 4 pm to 6 pm in the UB4.132 (Solbosch campus) 
-  * Exercises on Thursdays from 2 pm to 4 pm at the room S.UB4.130 <​del>​**except 24/10/2019 which is a lecture in S.UD2.119** +  * Exercises on Thursdays from 2 pm to 4 pm
-</​del>​+
  
 /*  /* 
Line 41: Line 50:
   * [[http://​www.google.com/​calendar/​embed?​src=dug2eihu8tqtnkjhmtuupj0je0%40group.calendar.google.com&​ctz=Europe/​Brussels|Online schedule]]   * [[http://​www.google.com/​calendar/​embed?​src=dug2eihu8tqtnkjhmtuupj0je0%40group.calendar.google.com&​ctz=Europe/​Brussels|Online schedule]]
 */ */
 +
 +
 +
 ===== Objectives ===== ===== Objectives =====
  
 Today, databases are moving away from typical management applications,​ and address new application areas. For this, databases must consider (1) recent developments in computer technology, as the object paradigm and distribution,​ and (2) management of new data types such as spatial or temporal data. This course introduces the concepts and techniques of some innovative database applications. Today, databases are moving away from typical management applications,​ and address new application areas. For this, databases must consider (1) recent developments in computer technology, as the object paradigm and distribution,​ and (2) management of new data types such as spatial or temporal data. This course introduces the concepts and techniques of some innovative database applications.
 +
 +
 +
 ===== Content ===== ===== Content =====
  
-==== Active ​Databases ====+==== Spatial ​Databases ====
  
-Taxonomy ​of conceptsApplications ​of active databases: integrity maintenance,​ derived ​data, replicationDesign of active databases: termination,​ confluence, determinism,​ modularisation.+Spatial data and applications. Space ontology. Conceptual modeling ​of spatial aspectsManipulation ​of spatial ​data with standard SQL. 
 + 
 +==== Mobility Databases ==== 
 + 
 +...
  
 ==== Temporal Databases ==== ==== Temporal Databases ====
Line 54: Line 73:
 Temporal data and applications. Time ontology. Conceptual modeling of temporal aspects. Manipulation of temporal data with standard SQL. Temporal data and applications. Time ontology. Conceptual modeling of temporal aspects. Manipulation of temporal data with standard SQL.
  
-==== Graph Databases ====+==== Active ​Databases ====
  
-...+Taxonomy of conceptsApplications of active databases: integrity maintenance,​ derived data, replicationDesign of active databases: termination,​ confluence, determinism,​ modularisation.
  
-==== Spatial Databases ==== 
- 
-Spatial data and applications. Space ontology. Conceptual modeling of spatial aspects. Manipulation of spatial data with standard SQL. 
  
  
Line 92: Line 108:
   * IBM, A Matter of Time: Temporal Data Management in DB2 for z/OS. ({{teaching:​infoh415:​a_matter_of_time.pdf|version pdf}})   * IBM, A Matter of Time: Temporal Data Management in DB2 for z/OS. ({{teaching:​infoh415:​a_matter_of_time.pdf|version pdf}})
 ===== Links ===== ===== Links =====
-  * Temporal ​databases  +  * Spatial ​databases 
-    * [[http://timecenter.cs.aau.dk/|TimeCenter]], an international research centre for temporal databases. +    * [[https://postgis.net/​workshops/​postgis-intro/|Introduction to PostGIS]] 
-    * [[http://www.timeconsult.com/Software/Software.html|TimeDB]], a temporal relational database+    * [[https://learn.crunchydata.com/postgis|Crunchy Data Interactive PostGIS Learning Portal]] 
 +  * Mobility databases 
 +    * [[https://mobilitydb.com/|MobilityDB]]  
   * Object databases   * Object databases
     * [[http://​www.odbms.org/​|ODBMS.ORG]],​ portal of ressources about object databases.     * [[http://​www.odbms.org/​|ODBMS.ORG]],​ portal of ressources about object databases.
-    * [[http://​www.db4o.com/​|db4o]],​ an open source object database. 
     * [[http://​www.objectstore.com/​datasheet/​index.ssp|ObjectStore]],​ an object database     * [[http://​www.objectstore.com/​datasheet/​index.ssp|ObjectStore]],​ an object database
     * [[http://​www.objectivity.com|Objectivity]],​ an object database     * [[http://​www.objectivity.com|Objectivity]],​ an object database
-    * [[http://​www.versant.com/​|Versant]],​ an object database 
-    * [[http://​www.jade.co.nz/​jade/​|Jade]],​ an object database 
-    * [[http://​sourceforge.net/​projects/​ozone/​|Ozone]],​ an object database 
   * Post-relationnal databases   * Post-relationnal databases
-    * [[http://​www.fresher.com/​|Matisse]] 
     * [[http://​www.intersystems.com/​cache/​index.html|Caché]]     * [[http://​www.intersystems.com/​cache/​index.html|Caché]]
-  * Spatial databases 
-    * [[https://​postgis.net/​workshops/​postgis-intro/​|Introduction to PostGIS]]  ​ 
  
 ===== Course Slides ===== ===== Course Slides =====
  
-  * {{teaching:​infoh415:​activenotes.pdf|Active databases}} 
-  * {{teaching:​infoh415:​temporalnotes.pdf|Temporal databases}} 
-/*   * {{teaching:​infoh415:​objectnotes.pdf|Object databases}} ​  */ 
-  * {{:​teaching:​infoh415:​graph_databases_notes.zip|Graph Notes}} 
   * {{teaching:​infoh415:​spatialnotes.pdf|Spatial databases}}   * {{teaching:​infoh415:​spatialnotes.pdf|Spatial databases}}
 +  * Mobility databases
 +  * {{teaching:​infoh415:​temporalnotes.pdf|Temporal databases}}
 +  * {{teaching:​infoh415:​activenotes.pdf|Active databases}}
 +/*  * {{:​teaching:​infoh415:​graphdb-ulb-2021.zip|Graph Notes (2021 version)}} */
 +/*   * {{teaching:​infoh415:​objectnotes.pdf|Object databases}} ​  
 +  * {{:​teaching:​infoh415:​graph_databases_notes.zip|Graph Notes}}*/
  
  
Line 121: Line 134:
  
   * [[teaching:​infoh415:​TP|Exercices Web page]]   * [[teaching:​infoh415:​TP|Exercices Web page]]
 +
 ===== Project ===== ===== Project =====
  
Line 129: Line 143:
 */ */
  
-Students, in groups of two, will realize a project in a topic relevant to advanced databases. Examples of topics are given in the next section of this document.+Students, in groups of four students, will realize a project in a topic relevant to advanced databases. Examples of topics are given in the next section of this document. Please notice that the template for these topics is "<​Technology>​ with <​Tool1>​ and <​Tool2>"​.
  
-Each group will study a database technology and illustrate it with an application developed ​​in ​database management ​system ​to be chosen (e.g., Oracle, PostgreSQL, DB2, SQL Server, ​mySQL, etc..). +Each group will study a database technology ​(e.g., document stores, time series databases, etc.) and illustrate it with an application developed ​​in ​two database management ​systems ​to be chosen (e.g., SQL Server, ​PostgreSQL, MongoDB, etc.). The topic should be addressed in a technical way, to explain ​the foundations of the underlying ​technology. The application must use the chosen ​technology. Examples of technologies and tools can be found for example in the following ​ [[https://​db-engines.com/​en/​ranking|web site]].
-The topic should be addressed in a technical way, to explain the underlying ​technologies. The application must use the specific ​technology ​manipulated.+
  
-The choice of topic and the application must be made ​​in agreement with the lecturer. The topic should not be included in the programme ​of the Master in Computer Science and Engineering. The project will be presented to the lecturer and the fellow students at the end of the semester. This presentation will be supported by a slideshow. A written report containing the contents of the presentation is also required. The presentation and written ​report will explain the possibilities offered ​by the database management ​system ​chosen and give general description ​of the application ​implemented.+It is important to understand that the objective of the project is NOT about developing an application with a GUI. The objective is to benchmark the proposed tool in relation to the database requirements of your application. Therefore, it is necessary to determine the set of queries and updates that your application requires and do a benchmark with, e.g., 1K, 10K, 100K, and 1M "​objects"​ (rows, documents, nodes, etc. depending on the technology used) to determine if the tool shows a linear or exponential behavior. Please notice that you SHOULD NOT generate data for the benchmark since you can find in Internet (1) a huge number of available datasets (2) alternatively,​ there are many available data generators. 
 + 
 +As usual when performing benchmarks, the queries and updates are executed n times (e.g., 6 times where the first execution is not considered because it is different from the others since the cache structures must be filled) and the average of the execution times is computed. A comparison with traditional relational technology (e.g., using PostgreSQL) must be provided to show that the chosen tool is THE technology of choice for your application,​ better than all other alternatives,​ and that it will perform correctly when the system is deployed at full scale. Please notice that there are MANY standard benchmarks for various database technologies so in that case you should prefer using a standard benchmark that reinventing the wheel and create your own benchmark. 
 + 
 +The choice of topic and the application must be made ​​in agreement with the lecturer. The topic should not be included in the program ​of the Master in Computer Science and Engineering. The project will be presented to the lecturer and the fellow students at the end of the semester. This presentation will be supported by a slideshow. A written report containing the contents of the presentation is also required. The presentation and the report will (1) explain the foundations of the technology chosen, (2) explain how these foundations are implemented ​by the database management ​systems ​chosen and (3) illustrate all these concepts with the application implemented. 
 + 
 +The duration of the presentation is 45 minutes. It will structured in three parts of SIMILAR length 
 +   * An introduction to technology 
 +   * An introduction to the two tools, each presented by subgroup of two persons 
 +   * A common assessment ​of the advantages and disadvantages of both tools tested in a common example ​application.
  
 The evaluation of the project focuses on the following criteria: The evaluation of the project focuses on the following criteria:
Line 143: Line 165:
 The project will count for 25% of the final grade. The project will count for 25% of the final grade.
  
-The project must be submitted by **Monday, December ​162019**.+The project must be submitted by **Monday, December ​112023**. Please send the report and the presentation in PDF format to the lecturer
  
-===== Examples of topics from the previous academic year ===== +  ​* Cloud databases and Microsoft Azure, AWS, ...
- +
-You can take a look at the [[https://​db-engines.com/​en/​|DB-Engines]] web site to get an idea of the currently available technologies and tools. Examples of previous topics are given next: +
- +
-  * Analytical databases and Endeca +
-  ​* Cloud databases and Microsoft Azure+
   * Column stores and Cassandra, Hbase, ...   * Column stores and Cassandra, Hbase, ...
   * Data warehouses and Apache Hive   * Data warehouses and Apache Hive
-  ​* Deductive Databases and XSB +  * Distributed databases and SQL Server, ​Oracle, Citus, ...
-  ​* Distributed databases and SQL Server, ​DynamoDB, ...+
   * Document stores and Cloudant, Couchbase, CouchDB, MongoDB, RavenDB, RethinkDB, ...   * Document stores and Cloudant, Couchbase, CouchDB, MongoDB, RavenDB, RethinkDB, ...
   * Embedded databases and BerkeleyDB   * Embedded databases and BerkeleyDB
   * In-memory databases and Kdb+, MemSQL, Oracle TimesTen, Memcached, ....   * In-memory databases and Kdb+, MemSQL, Oracle TimesTen, Memcached, ....
-  * Key-value stores and Redis, Voldermort, ... +  * Key-value stores and BerkeleyDB, DynamoDB, ​Redis, Voldermort, ... 
-  * Multimedia databases and Oracle +  * Multi-model databases and MarkLogic, CosmosDB 
-  * Multi-model databases and MarkLogic +  * NewSQL databases and VoltDB, CockrachDB, ... 
-  * NewSQL databases and VoltDB +  * Object-oriented databases and ObjectBox, Perst
-  * Object-oriented databases and db4o+
   * Real-time databases and Firebase   * Real-time databases and Firebase
   * Search engines and Solr, ElasticSearch,​ Sphinx ...   * Search engines and Solr, ElasticSearch,​ Sphinx ...
-  * Spatial databases and Rasdaman +  * Spatial ​raster ​databases and Rasdaman 
-  * Stream databases and Apache Kafka+  * Stream databases and Apache Kafka, Event Stores
   * Time series databases and Influx DB, Kdb+, ...   * Time series databases and Influx DB, Kdb+, ...
   * XML databases and BaseX   * XML databases and BaseX
- 
  
 ===== Topics for the current academic year ===== ===== Topics for the current academic year =====
  
-  ​* Analytics data warehouse and Google BigQuery: Hridaya Sagar Subedi, Alp Albay +  * Cloud databases ​with Microsoft Azure and AWSMaria Camila SalazarValerio Rocca, Ludovica Caiola, Simon Coessens 
-  ​* Cloud databases ​and Microsoft Azure: ​Rodaina MohamedKarim Maatouk +  * Column stores ​with Cassandra ​and HBaseNoah LaravineClément Alloin, François Diximier 
-  * Column stores ​and Cassandra: ​Andrea ArmaniAnant Gupta +  * Data warehousing with Google BigQuery ​and SnowflakeYutao Chen, Qianyun Zhuang, Min ZhangZiyong Zhang 
-  * Data warehouses ​and Apache HiveEmir NurmatbekovMahmudul Hasan +  * Distributed databases with YugaByteDB ​and CockroachDBSony ShresthaAayush Paudel, MD Kamrul Islam, Shofiyyah Nadhiroh 
-  * Document stores ​and MongoDBLedia IsajFabrício Ferreira +  * Distributed databases with Apache Cassandra ​and CitusCatalina CorreaVassili Papadakis, Paeg Hing Leong, Mohamed Bouchkhachakh 
-  * Document stores ​and CouchbaseSamia AzzouziBrahim Amssafi +  * Document stores ​with CouchDB and MongoDBAryan Gupta, Dilbar IsakovaHareem Raza, Muhammad Qasim Khan 
-  * Document stores ​and CouchDB: Abdelilah El Majjaoui +  * Document stores with RavenDB ​and RethinkDBLeila BouroufNargis Sghiouar, Joel Niarisi 
-  * Embedded Databases ​and BerkeleyDBAli ArousMaria Letizia Losso +  * Embedded databases with Berkeley DB et SQLiteAlexandre DuboisAdel Nehili, Edgardo Cuellar Sanchez, Shri-Krishna Mungur 
-  * In-memory databases ​and Oracle TimesTenNathan WolperKamdem Tagne Thomas Borel +  * Graph databases ​with Neo4J and OrientDBGabriela KaczmarekBerat Furkan Koçak, Jakub Kwiatkowski,​ Arijit Samal 
-  * Key-value stores and RedisIra nazarchukJulio Candela +  * Graph databases ​with TigerGraph ​and MemgraphMelissa TsombengThomas Borremans 
-  * Multimedia ​databases and OracleMuthi Dorel AdrianFan Chen +  * Key-value stores with Redis and Amazon DynamoDBDionisius MayrHerma Elezi, Rana İşlek, Thomas Suau 
-  * Multimodel ​databases and MarkLogicGian Marco PaldinoPiotr Rochala +  * NewSQL ​Databases with CockroachDB ​and NuoDBOnur BacaksızEmmanuel Leguede, Narmina Mahmudova, Mohamed Louai Bouzaher 
-  * Multimodel databases ​and Microsoft Azure Cosmos DBDimitrios TsesmelisRicardo Holthausen Bermejo +  * Object databases ​with Object Box and ZopeNadine GuettatSuman Khan, Naoufal Belgada, Latoundji Salim 
-  * NewSQL ​databases ​and VoltDBTamara BojanicIva Mihajlovska +  * Real time database avec Firebase ​et RedisMathieu Van den BremtNabil AbdellaouiElliot Silberwasser,​ Arkadiusz Snarski 
-  * Object-oriented ​databases and VersantYi Chiau LiYu Hsuan Chen +  * Search ​engines with Elasticsearch ​and SolrBenjamin GoldQuentin Demonceau, Nils Van Es Ostos, David García Morillo 
-  * Real-time databases and Firebase: ​Jesus HueteValdemar Hernández +  * Stream databases ​with Kafka and EventStoreDBNoubissi Kamgang Allan, Talhaoui Youssef, Gauthier Roger FranceStevens Quentin 
-  * Search Engines and Elastic Search: Haroon RashidDjordjije Krivokapic +  * Stream ​databases with Apache Flink and Apache Storm: ​Jintao MaLinhan WangIyoha Peace Osamuyi, Hieu Nguyen 
-  * Search ​Engines ​and ELK stack with KafkaHaftamu Hailu TeferaIshaan Rachit Dwivedi +  * Time series databases ​with InfluxDB and KdbGian Tejada GargateGabriel Lozano Pinzón, José Carlos Lozano DibildoxEnxhi Nushi
-  * Streaming Databases ​with Apache ​Kafka: ​Nithish SankaranarayananGayane Vardanyan +
-  * Stream ​Processing ​and Apache Storm: ​Ariston Harianto LimHung Nguyen +
-  * Stream processing and SQL stream: Sheida Shafiee SarvestaniNicolas Feron +
-  * Time series databases ​and InfluxDB: Yalei Li, Haonan Jin +
-  * Time series databases ​and TimescaleDBUchechukwu Fortune NjokuAkash Malhotra +
-  * XML databases and BaseX: Ayman MountasserMohammed Amine Belfarsi+
  
-/*  * {{:teaching:infoh415:student_projects:2019:azure.pdf|Cloud ​databases and Microsoft Azure}}Sara DiazBuse Ozer */+ 
 +/* 
 +  ​* ​Microsoft Azure and Google Cloud SQL: Marques Correia Tiago, Kellian Germain, Sébastien Arte, Nehili Adel 
 +  * Document databases with ArangoDB and MarkLogicMir Wise Khan, Rishika Gupta, Ahmad, Chidiebere Ogbuchi 
 +  * Document databases with MongoDB and CouchBaseAbd Abu Sbei, Hoschek Maren, Gupta Prashant, TBD 
 +  * Document databases with CouchDB and RavenDBAissa Abdoul-Aziz,​ Helin Demirel, Imane Moussaoui, Salma Mekarnia 
 +  * Embedded databases and BerkeleyDB and CouchBase LiteTalhaoui Yassin, Arfani Abdessamad, Faek Ilias, Adegnon Kokou 
 +  * Key-value databases with etcd and Hazelcast: Liliia Aliakberova,​ Arina Gepalova, Jose Antonio Lorencio Abril, Mariana Mayorga Llano 
 +  * Key-value databases with OrientDB and Memcached: Mustapha Ayadi, Valentin De Baene, Soumaya Izmar, Yi Zhu 
 +  * Oriented Object Databases with ObjectBox and Perst: Belgada Naoufal, El Hamri Ayoub, Akroune Sami, Sif Eddine Boughris 
 +  * RDF databases with Virtuoso and Apache Jena: Nikola Ivanović, Bogdana Živković, Tianheng Zhou, You Xu 
 +  * Search engines with ElasticSearch and OpenSearch: Muhammad Rizwan Khalid, Sayyor Yusupov, Ali Abusaleh, Ali Belyazid 
 +  * Search engines databases with Solr and Manticore Search: Rachel Aouad Albshara, Loïc Cordeiro Fonseca, Quentin Magron, Dang Phi LPham 
 +  * Stream Databases with PipelineDB and HStreamDB: Idil Dikbas, Ehsan Gifani, TBD, TBD 
 +  * Time Series ​databases ​with InfluxDB ​and KDBLuis Alfredo León, Jezuela Gega, Satria Wicaksono, Isabella Forero 
 +  * Timeseries databases with TimescaleDB and QuestDB: Koumudi Ganepola, Adina Bondoc, Zyad Al-Azazi, Alaa Almutawa 
 +  * Wide column stores with Cassandra and HBase: Anthony Zhou, Arnaud Cools, Damien DecleireThomas Dudziak 
 +*/
 ===== Examinations from Previous Years ===== ===== Examinations from Previous Years =====
  
 
teaching/infoh415.1574960625.txt.gz · Last modified: 2019/11/28 18:03 by ezimanyi