Differences

This shows you the differences between two versions of the page.

Link to this comparison view

Both sides previous revision Previous revision
Next revision
Previous revision
Next revision Both sides next revision
teaching:projh402 [2013/09/23 11:47]
svsummer
teaching:projh402 [2020/10/01 11:56]
mahmsakr [Map-matching as a Service]
Line 5: Line 5:
 The course PROJ-H-402 is managed by Dr. Mauro Birattari. Please refer to the course description page http://​iridia.ulb.ac.be/​proj-h-402/​index.php/​Main_Page for the rules concerning the project. ​ What follows is a list of project proposals supervised by academic members of CoDE. The course PROJ-H-402 is managed by Dr. Mauro Birattari. Please refer to the course description page http://​iridia.ulb.ac.be/​proj-h-402/​index.php/​Main_Page for the rules concerning the project. ​ What follows is a list of project proposals supervised by academic members of CoDE.
  
-===== Project proposals ​=====+===== Projects in Mobility Databases ​=====
  
-==== Development ​of a Personal Scientific Digital Library Management System ====+Mobility databases (MOD) are database systems that can store and manage moving object geospatial trajectory data. A moving object is an object that changes its location over time (e.g., a car driving on the road network). Using a variety of sensors, the location tracks of moving objects can be recorded in digital formats. A MOD, then, helps storing and querying such data. A couple of prototype systems have been proposed by research groups. Yet, a mainstream system is by far still missing. By mainstream we mean that the development builds on widely accepted tools, that are actively being maintained and developed. A mainstream system would exploit the functionality of these tools, and would maximize the reuse of their ecosystems. As result, it becomes more closer to end users, and easily adopted in the industry.
  
-In this project, the student ​is asked to construct ​software system to help manage large collections ​of scientific papers in digital form. Specifically,​ the system must be able to: +Towards filling ​this gapour group is building ​the [[https://​github.com/​MobilityDB/​MobilityDB|MobilityDB]] system. It builds on [[https://​postgis.net/​|PostGIS]],​ which is a spatial database extension ​of [[https://www.postgresql.org/​|PostgreSQL]]MobilityDB extends ​the type system of PostgreSQL and PostGIS with ADTs for representing moving object dataIt definesfor instance, the tgeompoint type for representing a time dependant geometry point. MobilityDB types are well integrated into the platformto achieve maximal reusabilityhence a mainstream ​development. ​For instance, the tgeompoint type builds on the PostGIS geometry(pointtypeSimilarly MobilityDB builds on existing operationsindexingand optimization framework.
-  - Scan a given filesystem location for given filetypes (PDFs, EPUB, ...) containing scientific articles. +
-  - Extract ​the metadata from each identified fileHerethe metadata includes the title of the article, its authors, the publishing venue, ​the publisherthe year of publicationthe article'​s abstract ... The development ​of an intelligent way to retreive this metadata is requriedThis could be donefor example by a combination of parsing ​the file, contacting ​the internet repositories of known publishers ​(AMC, Springer, Elsevieretc to retrieve the data. +
-  - Offer search capabilitiesin order to allow a user to find all indexed articles matching certain criteria (title, author, ...) +
-  - Offer archiving capabilities+
  
-Use of semantic web technologies (RDF, SPARQL, ​...) to store and search the metadata is encouraged.+MobilityDB supports SQL as query interface. Currently it is quite rich in terms of types and functionsIt is incubated as community project in [[https://​www.osgeo.org/​projects/​mobilitydb/​|OSGeo]],​ which certifies high technical quality
  
-**Contact** : Stijn Vansummeren (stijn.vansummeren@ulb.ac.be)+The following project ideas contribute to different parts of MobilityDBThey all constitute innovative development,​ mixing both research and developmentThey hence will help developing the student skills in:
  
-**Status**: available+  ​Understanding the theory and the implementation of moving object databases. 
 +  ​Understanding the architecture of extensible databases, in this case PostgreSQL. 
 +  ​Writing open source software.
  
-==== Curriculum Revision Assistant ==== 
  
-In this project, ​the student ​is asked to construct ​software system ​that can assist ​in the revision ​of teaching curricula ​(also known as teaching programs). The system should have the following functionalities:​ +===== Visualization Moving Objects on the Web ===== 
-  - It should be able to load existing curricula from the ULB central administration. This could be done, for example, by parsing ​the webpages available at banner (the Civil Engineering ​in CS program ​is available at http://banssbfr.ulb.ac.be/​PROD_frFR/bzscrse.p_disp_prog_detail?​term_in=201314&​prog_in=MA-IRIF&​lang=FRENCHfor example). + 
-  ​- It should allow to make different versions of the teaching programsmuch in the same way as version control systems like GIT and subversion offer the possibility ​to make different "​development branches"​ of a program'​s source code+<​TBD>​ 
-  - It should allow to analyze ​the modifications proposed ​in the teaching programs, and summarize the impact ​that these changes could have on other programs(For exampleif a course is removed from the computer science curriculum, it should ​also be removed from all curricula that included ​the course.)+ 
 + 
 +===== Implementing TSBS on MobilityDB ===== 
 + 
 +The Time Series Benchmark Suite ([[https://​github.com/​timescale/​tsbs|TSBS]]) ​is a collection of Go programs that are used to generate datasets and then benchmark read and write performance of various time series databases. This bechmark has been developed by [[https://​www.timescale.com/​|TimescaleDB]],​ which is time series extension of PostgreSQL.  
 + 
 +A significant addition of TimescaleDB to PosgreSQL is the addition of the [[https://​blog.timescale.com/​blog/​simplified-time-series-analytics-using-the-time_bucket-function/​|time_bucket]] function. This function allows to partition the time line in user-defined interval units that are used for aggregating data. 
 + 
 +The project consists ​in implementing a multidimensional generalization of the time_bucket function that allows the user to partition the spatial and/or temporal domain ​of a table in units (or tilesthat can be used for aggregating dataThen, the project consists of performing a benchmark comparison of TimescaleDB and MobilityDB. 
 + 
 + 
 + 
 +===== Distributed Moving Object Database on Amazon AWS ===== 
 +A distributed database is an architecture in which multiple database instances on different machines are integrate in order to form a single database server. Both the data and the queries are then distributed over these database instances. This architecture is effective in deploying big databases on a cloud platform. 
 + 
 +MobilityDB is engineered as an extension of PostgreSQL. AWS supports PostgreSQL databases in Amazon RDS for PostgreSQL and in Amazon Aurora. The goal of this project is to integrate MobilityDB with these products. The key outcomes are a comprehensive assessment of which MOD API can/​cannot ​be distributedand an assessment of the performance gain. These outcomes should serve as a base for a thesis project to achieve effective integration. 
 + 
 + 
 +===== Distributed Moving Object Database on MS Azure ===== 
 +A distributed database is an architecture in which multiple database instances on different machines are integrate in order to form a single database server. Both the data and the queries are then distributed over these database instances. This architecture is effective ​in deploying big databases on a cloud platform. 
 + 
 +MobilityDB ​is engineered as an extension of PostgreSQL. MS Azure supports distributed PostgreSQL databases using [[https://www.citusdata.com/​|Citus]]We have made successful tests for integrating MobilityDB and Citus on a local cluster. The goal of this project is to repeat this work on MS Azureintegrate MobilityDB with these products. The key outcomes are a comprehensive assessment of which MOD API can/cannot be distributed,​ and an assessment of the performance gainThese outcomes should serve as a base for a thesis project to achieve effective integration. 
 + 
 +===== Map-matching as a Service ​===== 
 +GPS location tracks typically contain errorsas the GPS points will normally be some meters away from the true positionIf we know that the movement happened on a street network, e.g., a bus or a car, then we can correct this back by putting the points on the street. Luckily there are Algorithms for this, called Map-Matching. There are also a handful of open source systems that do map matching. ​It remains however difficult ​to end users to use them, because they involve non-trivial installation and configuration effort. Preparing ​the base mapwhich will be used in the matching is also an issue to users.  
 + 
 +The goal of this project is to build an architecture for a Map-Matching service. The challanges are that the GPS data arrives ​in different formats, and that Map-Matching is a time consuming AlgorithmThis architecture should thus allow different input formatsand should be able to automatically scale according to the request rateAnother key outcome of this project is to compare the existing Map-Matching implementations,​ and to discuss their suitability in real world problems. 
 + 
 +Links: 
 +[[https://​github.com/​bmwcarit/​barefoot|Barefoot]] 
 +[[https://​valhalla.readthedocs.io/​en/​latest/​api/​map-matching/​api-reference/​|Valhalla Map Matching API]]  
 +[[https://​github.com/​graphhopper/​map-matching|GraphHopper]] 
 +[[https://​github.com/​cyang-kth/​fmm|Fast Map Matching]] 
 + 
 + 
 +===== Geospatial Trajectory Data Cleaning ===== 
 + 
 + 
 +===== Geospatial Trajectory Similarity Measure ===== 
 + 
 + 
 +===== Spatiotemporal k-Nearest Neighbour (kNNQueries =====
  
-**Contact** : Stijn Vansummeren (stijn.vansummeren@ulb.ac.be) 
  
-**Status**: available 
  
 
teaching/projh402.txt · Last modified: 2022/09/06 10:39 by ezimanyi