IIT Database Group

2020

  1. Your notebook is not crumby enough, REPLace it
    Michael Brachmann, William Spoth, Oliver Kennedy, Boris Glavic, Heiko Mueller, Sonia Castelo, Carlos Bautista and Juliana Freire
    Proceedings of the 10th Conference on Innovative Data Systems (2020).

2019

  1. Provenance For Transactional Updates
    Bahareh Arab
    Illinois Institute of Technology.
  2. Heuristic and Cost-based Optimization for Diverse Provenance Tasks
    Xing Niu, Raghav Kapoor, Boris Glavic, Dieter Gawlick, Zhen Hua Liu, Vasudha Krishnaswamy and Venkatesh Radhakrishnan
    IEEE Transactions on Knowledge and Data Engineering. 31, 7 (2019) , 1267–1280.
  3. Going Beyond Provenance: Explaining Query Answers with Pattern-based Counterbalances
    Zhengjie Miao, Qitian Zeng, Boris Glavic and Sudeepa Roy
    Proceedings of the 44th International Conference on Management of Data (2019), pp. 485–502.
  4. Snapshot Semantics for Temporal Multiset Relations
    Anton Dignös, Boris Glavic, Xing Niu, Michael H. Böhlen and Johann Gamper
    Proceedings of the VLDB Endowment. 12, 6 (2019) , 639–652.
  5. A High-Performance Distributed Relational Database System for Scalable OLAP Processing
    Jason Arnold, Boris Glavic and Ioan Raicu
    Proceedings of the 33rd IEEE International Parallel and Distributed Processing Symposium (2019), pp. 738–748.
  6. Uncertainty Annotated Databases - A Lightweight Approach for Approximating Certain Answers
    Su Feng, Aaron Huber, Boris Glavic and Oliver Kennedy
    Proceedings of the 44th International Conference on Management of Data (2019), pp. 1313–1330.
  7. Analyzing Uncertain Tabular Data
    Oliver Kennedy and Boris Glavic
    Information Quality in Information Fusion and Decision Making
    Éloi Bossé and G. Rogova, eds. Springer. 291–320.
  8. Data Debugging and Exploration with Vizier
    Mike Brachmann, Carlos Bautista, Sonia Castelo, Su Feng, Juliana Freire, Boris Glavic, Oliver Kennedy, Heiko Müller, Rémi Rampin, William Spoth and Ying Yang
    Proceedings of the 44th International Conference on Management of Data (Demonstration Track) (2019), pp. 1877–1880.
  9. CAPE: Explaining Outliers by Counterbalancing
    Zhengjie Miao, Qitian Zeng, Chenjie Li, Boris Glavic, Oliver Kennedy and Sudeepa Roy
    Proceedings of the VLDB Endowment (Demonstration Track). (2019).
  10. Query-based Why-not Explanations for Nested Data
    Ralf Diestelkämper, Boris Glavic, Melanie Herschel and Seokki Lee
    Proceedings of the 11th USENIX Workshop on the Theory and Practice of Provenance (2019).
  11. PUG: a framework and practical implementation for why and why-not provenance
    Seokki Lee, Bertram Ludäscher and Boris Glavic
    The VLDB Journal. 28, 1 (Aug. 2019) , 47–71.

2018

  1. Using Reenactment to Retroactively Capture Provenance for Transactions
    Bahareh Arab, Dieter Gawlick, Vasudha Krishnaswamy, Venkatesh Radhakrishnan and Boris Glavic
    IEEE Transactions on Knowledge and Data Engineering. 30, 3 (2018) , 599–612.
  2. GProM - A Swiss Army Knife for Your Provenance Needs
    Bahareh Arab, Su Feng, Boris Glavic, Seokki Lee, Xing Niu and Qitian Zeng
    IEEE Data Engineering Bulletin. 41, 1 (2018) , 51–62.
  3. Guest editorial: large-scale data curation and metadata management
    Mohamed Eltabakh and Boris Glavic editors
    Springer.
  4. Snapshot Semantics for Temporal Multiset Relations (extended version)
    Anton Dignös, Boris Glavic, Xing Niu, Michael H. Böhlen and Johann Gamper
    Technical Report #IIT/CS-DB-2018-03
    Illinois Institute of Technology.
  5. Improving Data-Shuffle Performance In Data-Parallel Distributed Systems
    Shweelan Samson
    Illinois Institute of Technology.
  6. Provenance Summaries for Answers and Non-Answers
    Seokki Lee, Bertram Ludäscher and Boris Glavic
    Proceedings of the VLDB Endowment (Demonstration Track). 11, 12 (2018) , 1954–1957.
  7. Let’s Make It Dirty with BART!
    Donatello Santoro, Patricia C. Arocena, Boris Glavic, Giansalvatore Mecca, Renée J. Miller and Paolo Papotti
    Proceedings of the 26th Italian Symposium on Advanced Database Systems (2018).

2017

  1. Carving database storage to detect and trace security breaches
    James Wagner, Alexander Rasin, Boris Glavic, Karen Heart, Jacob Furst, Lucas Bressan and Jonathan Grier
    Digital Investigation. 22, (2017) , S127–S136.
  2. DeepSea: Adaptive Workload-Aware Partitioning of Materialized Views in Scalable Data Analytics
    Jiang Du, Boris Glavic, Wei Tan and Renée J. Miller
    Proceedings of the 20th International Conference on Extending Database Technology (2017), pp. 198–209.
  3. Adaptive Schema Databases
    William Spoth, Bahareh Arab, Eric S. Chan, Dieter Gawlick, Adel Ghoneimy, Boris Glavic, Beda Hammerschmidt, Oliver Kennedy, Seokki Lee, Zhen Hua Liu, Xing Niu and Ying Yang
    Proceedings of the 8th Biennial Conference on Innovative Data Systems (2017).
  4. A SQL-Middleware Unifying Why and Why-Not Provenance for First-Order Queries
    Seokki Lee, Sven Köhler, Bertram Ludäscher and Boris Glavic
    Proceedings of the 33rd IEEE International Conference on Data Engineering (2017), pp. 485–496.
  5. Answering Historical What-if Queries with Provenance, Reenactment, and Symbolic Execution
    Bahareh Arab and Boris Glavic
    Proceedings of the 9th USENIX Workshop on the Theory and Practice of Provenance (2017).
  6. Integrating Approximate Summarization with Provenance Capture
    Seokki Lee, Xing Niu, Bertram Ludäscher and Boris Glavic
    Proceedings of the 9th USENIX Workshop on the Theory and Practice of Provenance (2017).
  7. Debugging Transactions and Tracking their Provenance with Reenactment
    Xing Niu, Boris Glavic, Seokki Lee, Bahareh Arab, Dieter Gawlick, Zhen Hua Liu, Vasudha Krishnaswamy, Su Feng and Xun Zou
    Proceedings of the VLDB Endowment (Demonstration Track). 10, 12 (2017) , 1857–1860.
  8. Provenance-aware Query Optimization
    Xing Niu, Raghav Kapoor, Boris Glavic, Dieter Gawlick, Zhen Hua Liu, Vasudha Krishnaswamy and Venkatesh Radhakrishnan
    Proceedings of the 33rd IEEE International Conference on Data Engineering (2017), pp. 473–484.

2016

  1. Implementing Unified Why- and Why-Not Provenance Through Games
    Seokki Lee, Sven Köhler, Bertram Ludäscher and Boris Glavic
    Proceedings of the 8th USENIX Workshop on the Theory and Practice of Provenance (Poster) (2016).
  2. Mimir: Bringing CTables into Practice
    Arindam Nandi, Ying Yang, Oliver Kennedy, Boris Glavic, Ronny Fehling, Zhen Hua Liu and Dieter Gawlick
    Technical Report #arXiv:1601.00073
    CoRR.
  3. Provenance-aware Versioned Dataworkspaces
    Xing Niu, Bahareh Arab, Dieter Gawlick, Zhen Hua Liu, Vasudha Krishnaswamy, Oliver Kennedy and Boris Glavic
    Proceedings of the 8th USENIX Workshop on the Theory and Practice of Provenance (2016).
  4. The Exception that Improves the Rule
    Juliana Freire, Boris Glavic, Oliver Kennedy and Heiko Müller
    SIGMOD Workshop on Human-In-the-Loop Data Analytics (2016).
  5. Benchmarking Data Curation Systems
    Patricia C. Arocena, Boris Glavic, Giansalvatore Mecca, Renée J. Miller, Paolo Papotti and Donatello Santoro
    IEEE Data Engineering Bulletin. 39, 2 (2016) , 47–62.
  6. Provenance and Annotation of Data and Processes - 6th International Provenance and Annotation Workshop, IPAW 2016, McLean, VA, USA, June 7-8, 2016, Proceedings
    Marta Mattoso and Boris Glavic editors
    Springer.
  7. BART in Action: Error Generation and Empirical Evaluations of Data-Cleaning Systems
    Donatello Santoro, Patricia C. Arocena, Boris Glavic, Giansalvatore Mecca, Renée J. Miller and Paolo Papotti
    Proceedings of the 42nd International Conference on Management of Data (SIGMOD) (Demonstration Track) (2016), pp. 2161–2164.
  8. Reenactment for Read-Committed Snapshot Isolation
    Bahareh Arab, Dieter Gawlick, Vasudha Krishnaswamy, Venkatesh Radhakrishnan and Boris Glavic
    Proceedings of the 25th ACM International Conference on Information and Knowledge Management (2016), pp. 841–850.
  9. Reenactment for Read-Committed Snapshot Isolation (long version)
    Bahareh Arab, Dieter Gawlick, Vasudha Krishnaswamy, Venkatesh Radhakrishnan and Boris Glavic
    Illinois Institute of Technology.
  10. Optimizing Provenance Capture and Queries - Algebraic Transformations and Cost-based Optimization
    Xing Niu and Boris Glavic
    Technical Report #IIT/CS-DB-2016-02
    Illinois Institute of Technology.
  11. Efficiently Computing Provenance Graphs for Queries with Negation
    Seokki Lee, Sven Köhler, Bertram Ludäscher and Boris Glavic
    Technical Report #IIT/CS-DB-2016-03
    Illinois Institute of Technology.
  12. Formal Foundations of Reenactment and Transaction Provenance
    Bahareh Arab, Dieter Gawlick, Vasudha Krishnaswamy, Venkatesh Radhakrishnan and Boris Glavic
    Technical Report #IIT/CS-DB-2016-01
    Illinois Institute of Technology.

2015

  1. Computing Candidate Keys Of Relational Operators For Optimizing Rewrite-Based Provenance Computation
    Andrea Cornudella
    Illinois Institute of Technology.
  2. Automatic Generation and Ranking of Explanations for Mapping Errors
    Seokki Lee, Zhen Wang, Boris Glavic and Renée J. Miller
    Technical Report #IIT/CS-DB-2015-01
    Illinois Institute of Technology.
  3. The iBench Integration Metadata Generator
    Patricia C. Arocena, Boris Glavic, Radu Ciucanu and Renée J. Miller
    University of Toronto.
  4. Towards Constraint-based Explanations for Answers and Non-Answers
    Boris Glavic, Sven Köhler, Sean Riddle and Bertram Ludäscher
    Proceedings of the 7th USENIX Workshop on the Theory and Practice of Provenance (2015).
  5. Interoperability for Provenance-aware Databases using PROV and JSON
    Xing Niu, Raghav Kapoor, Dieter Gawlick, Zhen Hua Liu, Vasudha Krishnaswamy, Venkatesh Radhakrishnan and Boris Glavic
    Proceedings of the 7th USENIX Workshop on the Theory and Practice of Provenance (2015).
  6. Sharing and Reproducing Database Applications
    Quan Pham, Richard Whaling, Boris Glavic and Tanu Malik
    Proceedings of the VLDB Endowment (Demonstration Track). 8, 12 (2015) , 1988–1999.
  7. Heuristic and Cost-based Optimization for Provenance Computation
    Xing Niu, Raghav Kapoor and Boris Glavic
    Proceedings of the 7th USENIX Workshop on the Theory and Practice of Provenance (Poster) (2015).
  8. Making Database Applications Shareable
    Boris Glavic, Tanu Malik and Quan Pham
    Proceedings of the 7th USENIX Workshop on the Theory and Practice of Provenance (Poster) (2015).
  9. Error Generation for Evaluating Data Cleaning Algorithms
    Patricia C. Arocena, Boris Glavic, Giansalvatore Mecca, Renée J. Miller, Paolo Papotti and Donatello Santoro
    Technical Report #TR-01-2015
    Università della Basilicata.
  10. Messing Up with Bart: Error Generation for Evaluating Data-Cleaning Algorithms
    Patricia C. Arocena, Boris Glavic, Giansalvatore Mecca, Renée J. Miller, Paolo Papotti and Donatello Santoro
    Proceedings of the VLDB Endowment. 9, 2 (2015) , 36–47.
  11. Gain Control over your Integration Evaluations
    Patricia C. Arocena, Radu Ciucanu, Boris Glavic and Renée J. Miller
    Proceedings of the VLDB Endowment (Demonstration Track). 8, 12 (2015) , 1960–1971.
  12. The iBench Integration Metadata Generator
    Patricia C. Arocena, Boris Glavic, Radu Ciucanu and Renée J. Miller
    Proceedings of the VLDB Endowment. 9, 3 (2015) , 108–119.
  13. HRDBMS: A NewSQL Database for Analytics
    Jason Arnold, Boris Glavic and Ioan Raicu
    Proceedings of the IEEE International Conference on Cluster Computing (Poster) (2015).
  14. An Efficient Implementation of Game Provenance in DBMS
    Seokki Lee, Yuchen Tang, Sven Köhler, Bertram Ludäscher and Boris Glavic
    Technical Report #IIT/CS-DB-2015-02
    Illinois Institute of Technology.
  15. LDV: Light-weight Database Virtualization
    Quan Pham, Tanu Malik, Boris Glavic and Ian Foster
    Proceedings of the 31st IEEE International Conference on Data Engineering (2015), pp. 1179–1190.

2014

  1. A Generic Provenance Middleware for Database Queries, Updates, and Transactions
    Bahareh Arab, Dieter Gawlick, Venkatesh Radhakrishnan, Hao Guo and Boris Glavic
    Proceedings of the 6th USENIX Workshop on the Theory and Practice of Provenance (2014).
  2. Efficient Stream Provenance via Operator Instrumentation
    Boris Glavic, Kyumars Sheykh Esmaili, Peter M. Fischer and Nesime Tatbul
    Transactions on Internet Technology. 13, 1 (2014) , 7:1–7:26.
  3. Efficient Scoring and Ranking of Explanation for Data Exchange Errors in Vagabond
    Zhen Wang
    Illinois Institute of Technology.
  4. Reenacting Transactions to Compute their Provenance
    Bahareh Arab, Dieter Gawlick, Vasudha Krishnaswamy, Venkatesh Radhakrishnan and Boris Glavic
    Technical Report #IIT/CS-DB-2014-02
    Illinois Institute of Technology.
  5. A Primer on Database Provenance
    Boris Glavic
    Technical Report #IIT/CS-DB-2014-01
    Illinois Institute of Technology.
  6. LDV: Light-weight Database Virtualization
    Quan Pham, Tanu Malik, Boris Glavic and Ian Foster
    Technical Report #IIT/CS-DB-2014-03
    Illinois Institute of Technology.

2013

  1. Using SQL for Efficient Generation and Querying of Provenance Information
    Boris Glavic, Renée J. Miller and Gustavo Alonso
    In search of elegance in the theory and practice of computation: a Festschrift in honour of Peter Buneman. (2013) , 291–320.
  2. iBench First Cut
    Patricia C. Arocena, Mariana D’Angelo, Boris Glavic and Renée J. Miller
    University of Toronto.
  3. Ariadne: Managing Fine-Grained Provenance on Data Streams
    Boris Glavic, Kyumars Sheykh Esmaili, Peter M. Fischer and Nesime Tatbul
    Proceedings of the 7th ACM International Conference on Distributed Event-Based Systems (2013), pp. 291–320.
  4. Provenance Management for Frequent Itemsets
    Javed Siddique, Boris Glavic and Renée J. Miller
    University of Toronto.
  5. Value Invention for Data Exchange
    Patricia C. Arocena, Boris Glavic and Renée J. Miller
    Proceedings of the 39th International Conference on Management of Data (2013), pp. 157–168.
  6. Provenance for Data Mining
    Boris Glavic, Javed Siddique, Periklis Andritsos and Renée J. Miller
    Proceedings of the 5th USENIX Workshop on the Theory and Practice of Provenance (2013).

2012

  1. Big Data Provenance: Challenges and Implications for Benchmarking
    Boris Glavic
    2nd Workshop on Big Data Benchmarking (2012), pp. 72–80.
  2. Ariadne: Managing Fine-Grained Provenance on Data Streams
    Boris Glavic, Kyumars Sheykh Esmaili, Peter M. Fischer and Nesime Tatbul
    Technical Report #771
    ETH Zürich.

2011

  1. The Case for Fine-Grained Stream Provenance
    Boris Glavic, Kyumars Sheykh Esmaili, Peter M. Fischer and Nesime Tatbul
    Proceedings of the 1st Workshop on Data Streams and Event Processing collocated with BTW (2011), pp. 58–61.
  2. Smile: Enabling Easy and Fast Development of Domain-Specific Scheduling Protocols
    Christian Tilgner, Boris Glavic, Michael H. Böhlen and Carl-Christian Kanne
    Proceedings of the 28th British National Conference on Databases (2011), pp. 128–131.
  3. Reexamining Some Holy Grails of Data Provenance
    Boris Glavic and Renée J. Miller
    Proceedings of the 3rd USENIX Workshop on the Theory and Practice of Provenance (2011).
  4. Declarative Serializable Snapshot Isolation
    Christian Tilgner, Boris Glavic, Michael H. Böhlen and Carl-Christian Kanne
    Proceedings of the 15th International Conference on Advances in Database and Information Systems (2011), pp. 170–184.
  5. Debugging Data Exchange with Vagabond
    Boris Glavic, Jiang Du, Renée J. Miller, Gustavo Alonso and Laura M. Haas
    Proceedings of the VLDB Endowment (Demonstration Track). 4, 12 (2011) , 1383–1386.

2010

  1. TRAMP: Understanding the Behavior of Schema Mappings through Provenance
    Boris Glavic, Gustavo Alonso, Renée J. Miller and Laura M. Haas
    Proceedings of the VLDB Endowment. 3, 1 (2010) , 1314–1325.
  2. Perm: Efficient Provenance Support for Relational Databases
    Boris Glavic
    University of Zurich.
  3. Formal Foundation of Contribution Semantics and Provenance Computation through Query Rewrite in TRAMP
    Boris Glavic
    University of Zurich.
  4. Data lineage/provenance in XQuery
    Donald Kossmann, Peter M. Fischer, Kyumars Sheykh Esmaili, Boris Glavic and Beat Steiger
    ETH Zurich.
  5. Correctness Proof of the Declarative SS2PL Protocol Implementation
    Christian Tilgner, Boris Glavic, Michael H. Böhlen and Carl-Christian Kanne
    University of Zurich.

2009

  1. The Perm Provenance Management System in Action
    Boris Glavic and Gustavo Alonso
    Proceedings of the 35th ACM SIGMOD International Conference on Management of Data (Demonstration Track) (2009), pp. 1055–1058.
  2. Provenance for Nested Subqueries
    Boris Glavic and Gustavo Alonso
    Proceedings of the 12th International Conference on Extending Database Technology (2009), pp. 982–993.
  3. Perm: Processing Provenance and Data on the same Data Model through Query Rewriting
    Boris Glavic and Gustavo Alonso
    Proceedings of the 25th IEEE International Conference on Data Engineering (2009), pp. 174–185.

2008

  1. Clustering Multidimensional Sequences in Spatial and Temporal Databases
    Ira Assent, Ralph Krieger, Boris Glavic and Thomas Seidl
    International Journal on Knowledge and Information Systems. 16, 1 (2008) , 29–51.

2007

  1. Data Provenance: A Categorization of Existing Approaches
    Boris Glavic and Klaus R. Dittrich
    Proceedings of the 12th GI Conference on Datenbanksysteme in Business, Technologie und Web (2007), pp. 227–241.

2006

  1. Spatial Multidimensional Sequence Clustering
    Ira Assent, Ralph Krieger, Boris Glavic and Thomas Seidl
    Proceedings of the 1st International Workshop on Spatial and Spatio-temporal Data Mining collocated with ICDM (2006), pp. 343–348.
  2. sesam: Ensuring Privacy for an Interdisciplinary Longitudinal Study
    Boris Glavic and Klaus R. Dittrich
    Proceedings of the 1st Workshop Elektronische Datentreuhänderschaft - Anwendungen, Verfahren, Grundlagen collocated with GI Jahrestagung (2006), pp. 736–743.

2005

  1. Subspace Sequence Clustering - Datamining zur Entscheidungsunterstützung in der Hydrologie
    Boris Glavic
    Proceedings of the 11th GI Conference on Database Systems for Business, Technology, and Web (Student Track) (2005), pp. 15–17.