CS 520: Data Integration, Warehousing, and Provenance - 2020 Spring

Organization

There will be several practical labs and theoretical assignments during the course. These assignments are not graded and their main purpose is to practice the content covered in course, gain some practical experience with data integration tasks, and prepare for the exams. Homework assignments and solutions will be posted on this page.

Homework 1: Background: Datalog, Constraints

The assignment is available here. You can find the dataset to be used in the assignment here. You can find the solutions here.

Homework 2: Data cleaning, preparation, entity resolution

The assignment is available here. You can find the dataset to be used in the assignment here. You can find the solutions here.

Homework 3: Schema matching and mapping, Virtual Data Integration, Data Exchange

The assignment is available here. You can find the solutions here.

Homework 4: Data Warehousing and Big Data Analytics

The assignment is available here. You can find the solutions here.

Homework 5: Data Provenance

The assignment is available here. You can find the solutions here.