INFOH419 Data Warehousing - Week 11
Small Summary
Data mining
high-level overview of clustering, classification, and pattern mining
common errors: confusing correlation with causation, multiple hypothesis testing
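To make the multiple hypothesis testing pitfall concrete, here is a minimal sketch. It assumes independent tests and uses the Bonferroni correction, which is one standard remedy and not necessarily the one used in the slides. The function names and the default alpha = 0.05 are illustrative choices.

```python
def familywise_error_rate(m, alpha=0.05):
    """Probability of at least one false positive across m independent
    tests, each run at significance level alpha."""
    return 1.0 - (1.0 - alpha) ** m


def bonferroni_reject(p_values, alpha=0.05):
    """For each p-value, report whether it is still significant after a
    Bonferroni correction (each test is compared against alpha / m)."""
    m = len(p_values)
    if m == 0:
        return []
    threshold = alpha / m
    return [p <= threshold for p in p_values]


if __name__ == "__main__":
    # With 20 tests at alpha = 0.05, a spurious "discovery" is more likely than not.
    print(round(familywise_error_rate(20), 3))
    # Only the smallest p-value survives the corrected threshold of 0.05 / 3.
    print(bonferroni_reject([0.001, 0.02, 0.04]))
```

Bonferroni is conservative: it controls the chance of any false positive at the cost of missing some real effects, which is why less strict procedures are often preferred when many hypotheses are tested.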
Data stream processing
epsilon-delta approximations
maintaining a uniform sample
counting the number of distinct elements
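The two stream problems listed above can be sketched as follows. This is an illustration under stated assumptions, not necessarily the algorithms presented in the slides: the uniform sample uses reservoir sampling (Algorithm R), and the distinct count uses a k-minimum-values estimator with SHA-256 standing in for an ideal hash function. The names reservoir_sample, estimate_distinct, and the parameter k are hypothetical.

```python
import hashlib
import heapq
import random


def reservoir_sample(stream, k, rng=None):
    """Keep a uniform random sample of k items from a stream of unknown
    length, using O(k) memory and a single pass."""
    rng = rng or random.Random()
    reservoir = []
    for i, item in enumerate(stream):
        if i < k:
            reservoir.append(item)
        else:
            # Item number i + 1 replaces a random slot with probability k / (i + 1).
            j = rng.randint(0, i)  # both bounds inclusive
            if j < k:
                reservoir[j] = item
    return reservoir


def _hash01(item):
    """Map an item to a pseudo-uniform value in [0, 1) (assumed ideal hash)."""
    digest = hashlib.sha256(repr(item).encode("utf-8")).digest()
    return int.from_bytes(digest[:8], "big") / 2**64


def estimate_distinct(stream, k=256):
    """Estimate the number of distinct items from the k smallest hash values."""
    heap = []        # negated values, so heap[0] is the largest value kept
    kept = set()     # hash values currently stored, to skip duplicates
    for item in stream:
        h = _hash01(item)
        if h in kept:
            continue
        if len(heap) < k:
            heapq.heappush(heap, -h)
            kept.add(h)
        elif h < -heap[0]:
            evicted = -heapq.heappushpop(heap, -h)
            kept.discard(evicted)
            kept.add(h)
    if len(heap) < k:
        return float(len(heap))  # fewer than k distinct hashes: count is exact
    return (k - 1) / (-heap[0])


if __name__ == "__main__":
    print(reservoir_sample(range(1_000_000), 5))
    print(estimate_distinct(i % 10_000 for i in range(100_000)))
```

Both routines use memory that depends only on k, not on the stream length. For the distinct count, a larger k gives a tighter estimate with higher probability, which is the kind of trade-off an epsilon-delta guarantee formalizes.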
Supporting material
Slides
Exercises
No exercises this week.