This shows you the differences between two versions of the page.
teaching:mfe:is [2014/03/25 13:10] svsummer [Design and Implementation of a Curriculum Revision Tool] |
teaching:mfe:is [2014/04/21 16:27] svsummer |
||
---|---|---|---|
Line 1: | Line 1: | ||
- | ====== MFE 2014-2015 : Web and Information Systems ====== | + | ====== MFE 2015-2016 : Web and Information Systems ====== |
===== Introduction ===== | ===== Introduction ===== | ||
Line 143: | Line 143: | ||
Contact : Stijn Vansummeren <stijn.vansummeren@ulb.ac.be>, Frédéric Robert <frrobert@ulb.ac.be> | Contact : Stijn Vansummeren <stijn.vansummeren@ulb.ac.be>, Frédéric Robert <frrobert@ulb.ac.be> | ||
===== Structural compression of relational and semantic web databases ===== | ===== Structural compression of relational and semantic web databases ===== | ||
+ | |||
+ | Stijn Vansummeren (WIT) | ||
Recent research in database management systems at ULB has shown how to | Recent research in database management systems at ULB has shown how to | ||
Line 159: | Line 161: | ||
* Contact : [[stijn.vansummeren@ulb.ac.be|Stijn Vansummeren]] | * Contact : [[stijn.vansummeren@ulb.ac.be|Stijn Vansummeren]] | ||
+ | |||
+ | ===== A contribution to Apache DRILL ===== | ||
+ | |||
+ | Google's research lab has produced a remarkable number of software | ||
+ | systems for Big Data analytics: | ||
+ | * [[|Map/Reduce]] for offline, batch-oriented data analysis over arbitrary datasets | ||
+ | * [[http://googleresearch.blogspot.be/2009/06/large-scale-graph-computing-at-google.html|Pregel]] for offline analysis over graph-structured datasets | ||
+ | * [[http://research.google.com/pubs/pub36632.html|Dremel]] for on-line analysis over structured datasets | ||
+ | |||
+ | For Map/Reduce and Pregel, the Apache Software Foundation has | ||
+ | previously built open source implementations ([[http://hadoop.apache.org/|Hadoop]], | ||
+ | [[https://giraph.apache.org/|Giraph]]). For Dremel, a project is | ||
+ | currently underway to provide an open source implementation (known as | ||
+ | [[http://incubator.apache.org/drill/index.html|Apache Drill]]). | ||
+ | |||
+ | The goal of this thesis is to (1) study the current architecture of Apache | ||
+ | Drill, (2) compare it with the state of the art in query processing | ||
+ | for structured datasets, and (3) contribute to the development of the | ||
+ | Drill implementation. | ||
+ | |||
+ | Students interested in this MFE are strongly advised to follow the | ||
+ | course [[http://cs.ulb.ac.be/public/teaching/infoh417|INFOH417 | ||
+ | Database Systems Architecture]] for background on query processing | ||
+ | in traditional database management systems. | ||
+ | |||
+ | * Contact : [[stijn.vansummeren@ulb.ac.be|Stijn Vansummeren]] | ||
===== Aspects of Text Analytics and Information Extraction ===== | ===== Aspects of Text Analytics and Information Extraction ===== | ||