IIT Database Group

Header bar

Boris Glavic, Associate Professor

I am an Associate Professor in the Department of Computer Science at the Illinois Institute of Technology. I am a database guy and am leading the IIT DBGroup. My research interest spans several areas of database systems including data provenance, data integration, query execution and optimization, and data curation. I strive to build real systems that are based on solid theoretical foundations. For a list of my publications see below or have a look at DBLP and google scholar. Here is a current CV.

.

Short Bio

I received a Diploma (Master) in Computer Science from the RWTH Aachen in Germany, and a PhD in Computer Science from the University of Zurich in Switzerland being advised by Michael Böhlen and Gustavo Alonso. Afterwards, I spend to two years as a PostDoc in the Department of Computer Science at the University of Toronto working at the Database Research Group under Renée J. Miller.

Contact Information

Department of Computer Science
Stuart Building, Room SB 206B
Illinois Institute of Technology
10 W 31st Street
Chicago, IL 60616

Email: [bglavic] at [iit] o [edu]
Phone: 312 567 5205
Office Hours: Mondays, 11:00-12:00pm

Professional Service

Editorial Service:
  • TKDE: Associate Editor (since 2018)
  • DAPD: Guest Editor (2016, Special issue on Large-Scale Data Curation and Metadata Management)
Programm Committee Chair:
  • IPAW: 2016
Programm Committee Memberships:
  • AMW: 2012
  • Active: 2017
  • BigStream: 2018
  • Bx: 2018
  • CIKM: 2020, 2018, 2017
  • DASFAA: 2019
  • DATA: 2017, 2015, 2014
  • DSSA: 2016
  • EDBT: 2019, 2018, 2015
  • ICDE: 2020, 2019, 2018, 2017, 2016 (demos), 2015
  • SIGMOD: 2020, 2019, 2019 (demos), 2017, 2016, 2015 (demos), 2015, 2014 (demos), 2011
  • SSDBM: 2019, 2013
  • TaPP: 2019, 2018, 2015, 2014, 2013
  • VLDB: 2019, 2019 (demos), 2018 (demos), 2017
  • WBDB.us: 2013
  • WBDBCanada: 2015
  • WBDBChina: 2013
  • WebDB: 2015
Journal Reviewer:
  • ARTIF INTELL REV: 2015
  • CGA: 2019
  • Computer: 2014
  • IJSWIS: 2017
  • ISF: 2017
  • Information Systems: 2019, 2018, 2015
  • JBI: 2015
  • JDIQ: 2014
  • SCICO: 2017
  • TKDE: 2018, 2016, 2013, 2011
  • TODS: 2018, 2012, 2011
  • TOIT: 2016, 2013
  • TSE: 2016
  • VLDBJ: 2017, 2016, 2015, 2014, 2012
External Reviewer:
  • BIS: 2007
  • BTW: 2009
  • ICDE: 2008, 2010
  • ICDT: 2019
  • LICS: 2015
  • POPL: 2019
  • SIGMOD: 2007
  • TOOLS: 2007
  • VLDB: 2006, 2007
Proposal Review:
  • ERC: 2018
  • ISF: 2016
  • Marsden: 2014
  • NSERC: 2017
  • NSF: 2013, 2019

Publications

  1. Data Debugging and Exploration with Vizier
    Mike Brachmann, Carlos Bautista, Sonia Castelo, Su Feng, Juliana Freire, Boris Glavic, Oliver Kennedy, Heiko Müller, Rémi Rampin, William Spoth and Ying Yang
    Proceedings of the 44th International Conference on Management of Data (Demonstration Track) (2019).
    details
  2. Analyzing Uncertain Tabular Data
    Oliver Kennedy and Boris Glavic
    Information Quality in Information Fusion and Decision Making
    Éloi Bossé and G. Rogova, eds. Springer. 291–320.
    details
  3. Uncertainty Annotated Databases - A Lightweight Approach for Approximating Certain Answers
    Su Feng, Aaron Huber, Boris Glavic and Oliver Kennedy
    Proceedings of the 44th International Conference on Management of Data (2019).
    details
  4. A High-Performance Distributed Relational Database System for Scalable OLAP Processing
    Jason Arnold, Boris Glavic and Ioan Raicu
    Proceedings of the 33rd IEEE International Parallel and Distributed Processing Symposium (2019).
    details
  5. Snapshot Semantics for Temporal Multiset Relations
    Anton Dignös, Boris Glavic, Xing Niu, Michael H. Böhlen and Johann Gamper
    Proceedings of the VLDB Endowment. 12, 6 (2019) , 639–652.
    details
  6. Going Beyond Provenance: Explaining Query Answers with Pattern-based Counterbalances
    Zhengjie Miao, Qitian Zeng, Boris Glavic and Sudeepa Roy
    Proceedings of the 44th International Conference on Management of Data (2019).
    details
  7. CAPE: Explaining Outliers by Counterbalancing
    Zhengjie Miao, Qitian Zeng, Chenjie Li, Boris Glavic, Oliver Kennedy and and Sudeepa Roy
    Proceedings of the VLDB Endowment (Demonstration Track). (2019).
    details
  8. Let’s Make It Dirty with BART!
    Donatello Santoro, Patricia C. Arocena, Boris Glavic, Giansalvatore Mecca, Renée J. Miller and Paolo Papotti
    Proceedings of the 26th Italian Symposium on Advanced Database Systems (2018).
    details
  9. Provenance Summaries for Answers and Non-Answers
    Seokki Lee, Bertram Ludäscher and Boris Glavic
    Proceedings of the VLDB Endowment (Demonstration Track). 11, 12 (2018) , 1954–1957.
    details
  10. Snapshot Semantics for Temporal Multiset Relations (extended version)
    Anton Dignös, Boris Glavic, Xing Niu, Michael H. Böhlen and Johann Gamper
    Technical Report #IIT/CS-DB-2018-03
    Illinois Institute of Technology.
    details
  11. Heuristic and Cost-based Optimization for Diverse Provenance Tasks
    Xing Niu, Raghav Kapoor, Boris Glavic, Dieter Gawlick, Zhen Hua Liu, Vasudha Krishnaswamy and Venkatesh Radhakrishnan
    IEEE Transactions on Knowledge and Data Engineering. (2018).
    details
  12. GProM - A Swiss Army Knife for Your Provenance Needs
    Bahareh Arab, Su Feng, Boris Glavic, Seokki Lee, Xing Niu and Qitian Zeng
    IEEE Data Engineering Bulletin. 41, 1 (2018) , 51–62.
    details
  13. Using Reenactment to Retroactively Capture Provenance for Transactions
    Bahareh Arab, Dieter Gawlick, Vasudha Krishnaswamy, Venkatesh Radhakrishnan and Boris Glavic
    IEEE Transactions on Knowledge and Data Engineering. 30, 3 (2018) , 599–612.
    details
  14. PUG: a framework and practical implementation for why and why-not provenance
    Seokki Lee, Bertram Ludäscher and Boris Glavic
    The VLDB Journal. 28, 1 (Aug. 2018) , 47—71.
    details
  15. A SQL-Middleware Unifying Why and Why-Not Provenance for First-Order Queries
    Seokki Lee, Sven Köhler, Bertram Ludäscher and Boris Glavic
    Proceedings of the 33rd IEEE International Conference on Data Engineering (2017), pp. 485–496.
    details
  16. Integrating Approximate Summarization with Provenance Capture
    Seokki Lee, Xing Niu, Bertram Ludäscher and Boris Glavic
    Proceedings of the 8th USENIX Workshop on the Theory and Practice of Provenance (2017).
    details
  17. Answering Historical What-if Queries with Provenance, Reenactment, and Symbolic Execution
    Bahareh Arab and Boris Glavic
    Proceedings of the 8th USENIX Workshop on the Theory and Practice of Provenance (2017).
    details
  18. Adaptive Schema Databases
    William Spoth, Bahareh Arab, Eric S. Chan, Dieter Gawlick, Adel Ghoneimy, Boris Glavic, Beda Hammerschmidt, Oliver Kennedy, Seokki Lee, Zhen Hua Liu, Xing Niu and Ying Yang
    Proceedings of the 8th Biennial Conference on Innovative Data Systems (2017).
    details
  19. Provenance-aware Query Optimization
    Xing Niu, Raghav Kapoor, Boris Glavic, Dieter Gawlick, Zhen Hua Liu, Vasudha Krishnaswamy and Venkatesh Radhakrishnan
    Proceedings of the 33rd IEEE International Conference on Data Engineering (2017), pp. 473–484.
    details
  20. DeepSea: Adaptive Workload-Aware Partitioning of Materialized Views in Scalable Data Analytics
    Jiang Du, Boris Glavic, Wei Tan and Renée J. Miller
    Proceedings of the 20th International Conference on Extending Database Technology (2017), pp. 198–209.
    details
  21. Carving database storage to detect and trace security breaches
    James Wagner, Alexander Rasin, Boris Glavic, Karen Heart, Jacob Furst, Lucas Bressan and Jonathan Grier
    Digital Investigation. 22, (2017) , S127–S136.
    details
  22. Debugging Transactions and Tracking their Provenance with Reenactment
    Xing Niu, Boris Glavic, Seokki Lee, Bahareh Arab, Dieter Gawlick, Zhen Hua Liu, Vasudha Krishnaswamy, Su Feng and Xun Zou
    Proceedings of the VLDB Endowment (Demonstration Track). 10, 12 (2017) , 1857–1860.
    details
  23. The Exception that Improves the Rule
    Juliana Freire, Boris Glavic, Oliver Kennedy and Heiko Müller
    SIGMOD Workshop on Human-In-the-Loop Data Analytics (2016).
    details
  24. Provenance-aware Versioned Dataworkspaces
    Xing Niu, Bahareh Arab, Dieter Gawlick, Zhen Hua Liu, Vasudha Krishnaswamy, Oliver Kennedy and Boris Glavic
    Proceedings of the 8th USENIX Workshop on the Theory and Practice of Provenance (2016).
    details
  25. Mimir: Bringing CTables into Practice
    Arindam Nandi, Ying Yang, Oliver Kennedy, Boris Glavic, Ronny Fehling, Zhen Hua Liu and Dieter Gawlick
    Technical Report #arXiv:1601.00073
    CoRR.
    details
  26. Formal Foundations of Reenactment and Transaction Provenance
    Bahareh Arab, Dieter Gawlick, Vasudha Krishnaswamy, Venkatesh Radhakrishnan and Boris Glavic
    Technical Report #IIT/CS-DB-2016-01
    Illinois Institute of Technology.
    details
  27. Implementing Unified Why- and Why-Not Provenance Through Games
    Seokki Lee, Sven Köhler, Bertram Ludäscher and Boris Glavic
    Proceedings of the 8th USENIX Workshop on the Theory and Practice of Provenance (Poster) (2016).
    details
  28. Efficiently Computing Provenance Graphs for Queries with Negation
    Seokki Lee, Sven Köhler, Bertram Ludäscher and Boris Glavic
    Technical Report #IIT/CS-DB-2016-03
    Illinois Institute of Technology.
    details
  29. Optimizing Provenance Capture and Queries - Algebraic Transformations and Cost-based Optimization
    Xing Niu and Boris Glavic
    Technical Report #IIT/CS-DB-2016-02
    Illinois Institute of Technology.
    details
  30. Reenactment for Read-Committed Snapshot Isolation (long version)
    Bahareh Arab, Dieter Gawlick, Vasudha Krishnaswamy, Venkatesh Radhakrishnan and Boris Glavic
    Illinois Institute of Technology.
    details
  31. Reenactment for Read-Committed Snapshot Isolation
    Bahareh Arab, Dieter Gawlick, Vasudha Krishnaswamy, Venkatesh Radhakrishnan and Boris Glavic
    Proceedings of the 25th ACM International Conference on Information and Knowledge Management (2016), pp. 841–850.
    details
  32. BART in Action: Error Generation and Empirical Evaluations of Data-Cleaning Systems
    Donatello Santoro, Patricia C. Arocena, Boris Glavic, Giansalvatore Mecca, Renée J. Miller and Paolo Papotti
    Proceedings of the 42nd International Conference on Management of Data (SIGMOD) (Demonstration Track) (2016), pp. 2161–2164.
    details
  33. Benchmarking Data Curation Systems
    Patricia C. Arocena, Boris Glavic, Giansalvatore Mecca, Renée J. Miller, Paolo Papotti and Donatello Santoro
    IEEE Data Engineering Bulletin. 39, 2 (2016) , 47–62.
    details
  34. Making Database Applications Shareable
    Boris Glavic, Tanu Malik and Quan Pham
    Proceedings of the 7th USENIX Workshop on the Theory and Practice of Provenance (Poster) (2015).
    details
  35. LDV: Light-weight Database Virtualization
    Quan Pham, Tanu Malik, Boris Glavic and Ian Foster
    Proceedings of the 31st IEEE International Conference on Data Engineering (2015), pp. 1179–1190.
    details
  36. Automatic Generation and Ranking of Explanations for Mapping Errors
    Seokki Lee, Zhen Wang, Boris Glavic and Renée J. Miller
    Technical Report #IIT/CS-DB-2015-01
    Illinois Institute of Technology.
    details
  37. The iBench Integration Metadata Generator
    Patricia C. Arocena, Boris Glavic, Radu Ciucanu and Renée J. Miller
    University of Toronto.
    details
  38. Towards Constraint-based Explanations for Answers and Non-Answers
    Boris Glavic, Sven Köhler, Sean Riddle and Bertram Ludäscher
    Proceedings of the 7th USENIX Workshop on the Theory and Practice of Provenance (2015).
    details
  39. Interoperability for Provenance-aware Databases using PROV and JSON
    Xing Niu, Raghav Kapoor, Dieter Gawlick, Zhen Hua Liu, Vasudha Krishnaswamy, Venkatesh Radhakrishnan and Boris Glavic
    Proceedings of the 7th USENIX Workshop on the Theory and Practice of Provenance (2015).
    details
  40. Sharing and Reproducing Database Applications
    Quan Pham, Richard Whaling, Boris Glavic and Tanu Malik
    Proceedings of the VLDB Endowment (Demonstration Track). 8, 12 (2015) , 1988–1999.
    details
  41. Heuristic and Cost-based Optimization for Provenance Computation
    Xing Niu, Raghav Kapoor and Boris Glavic
    Proceedings of the 7th USENIX Workshop on the Theory and Practice of Provenance (Poster) (2015).
    details
  42. Error Generation for Evaluating Data Cleaning Algorithms
    Patricia C. Arocena, Boris Glavic, Giansalvatore Mecca, Renée J. Miller, Paolo Papotti and Donatello Santoro
    Technical Report #TR-01-2015
    Università della Basilicata.
    details
  43. Messing Up with Bart: Error Generation for Evaluating Data-Cleaning Algorithms
    Patricia C. Arocena, Boris Glavic, Giansalvatore Mecca, Renée J. Miller, Paolo Papotti and Donatello Santoro
    Proceedings of the VLDB Endowment. 9, 2 (2015) , 36–47.
    details
  44. Gain Control over your Integration Evaluations
    Patricia C. Arocena, Radu Ciucanu, Boris Glavic and Renée J. Miller
    Proceedings of the VLDB Endowment (Demonstration Track). 8, 12 (2015) , 1960–1971.
    details
  45. The iBench Integration Metadata Generator
    Patricia C. Arocena, Boris Glavic, Radu Ciucanu and Renée J. Miller
    Proceedings of the VLDB Endowment. 9, 3 (2015) , 108–119.
    details
  46. HRDBMS: A NewSQL Database for Analytics
    Jason Arnold, Boris Glavic and Ioan Raicu
    Proceedings of the IEEE International Conference on Cluster Computing (Poster) (2015).
    details
  47. An Efficient Implementation of Game Provenance in DBMS
    Seokki Lee, Yuchen Tang, Sven Köhler, Bertram Ludäscher and Boris Glavic
    Technical Report #IIT/CS-DB-2015-02
    Illinois Institute of Technology.
    details
  48. LDV: Light-weight Database Virtualization
    Quan Pham, Tanu Malik, Boris Glavic and Ian Foster
    Technical Report #IIT/CS-DB-2014-03
    Illinois Institute of Technology.
    details
  49. A Primer on Database Provenance
    Boris Glavic
    Technical Report #IIT/CS-DB-2014-01
    Illinois Institute of Technology.
    details
  50. Reenacting Transactions to Compute their Provenance
    Bahareh Arab, Dieter Gawlick, Vasudha Krishnaswamy, Venkatesh Radhakrishnan and Boris Glavic
    Technical Report #IIT/CS-DB-2014-02
    Illinois Institute of Technology.
    details
  51. Efficient Stream Provenance via Operator Instrumentation
    Boris Glavic, Kyumars Sheykh Esmaili, Peter M. Fischer and Nesime Tatbul
    Transactions on Internet Technology. 13, 1 (2014) , 7:1–7:26.
    details
  52. A Generic Provenance Middleware for Database Queries, Updates, and Transactions
    Bahareh Arab, Dieter Gawlick, Venkatesh Radhakrishnan, Hao Guo and Boris Glavic
    Proceedings of the 6th USENIX Workshop on the Theory and Practice of Provenance (2014).
    details
  53. Provenance for Data Mining
    Boris Glavic, Javed Siddique, Periklis Andritsos and Renée J. Miller
    Proceedings of the 5th USENIX Workshop on the Theory and Practice of Provenance (2013).
    details
  54. Using SQL for Efficient Generation and Querying of Provenance Information
    Boris Glavic, Renée J. Miller and Gustavo Alonso
    In search of elegance in the theory and practice of computation: a Festschrift in honour of Peter Buneman. (2013) , 291–320.
    details
  55. iBench First Cut
    Patricia C. Arocena, Mariana D’Angelo, Boris Glavic and Renée J. Miller
    University of Toronto.
    details
  56. Ariadne: Managing Fine-Grained Provenance on Data Streams
    Boris Glavic, Kyumars Sheykh Esmaili, Peter M. Fischer and Nesime Tatbul
    Proceedings of the 7th ACM International Conference on Distributed Event-Based Systems (2013), pp. 291–320.
    details
  57. Provenance Management for Frequent Itemsets
    Javed Siddique, Boris Glavic and Renée J. Miller
    University of Toronto.
    details
  58. Value Invention for Data Exchange
    Patricia C. Arocena, Boris Glavic and Renée J. Miller
    Proceedings of the 39th International Conference on Management of Data (2013), pp. 157–168.
    details
  59. Big Data Provenance: Challenges and Implications for Benchmarking
    Boris Glavic
    2nd Workshop on Big Data Benchmarking (2012), pp. 72–80.
    details
  60. Ariadne: Managing Fine-Grained Provenance on Data Streams
    Boris Glavic, Kyumars Sheykh Esmaili, Peter M. Fischer and Nesime Tatbul
    Technical Report #771
    ETH Zürich.
    details
  61. Reexamining Some Holy Grails of Data Provenance
    Boris Glavic and Renée J. Miller
    Proceedings of the 3rd USENIX Workshop on the Theory and Practice of Provenance (2011).
    details
  62. Smile: Enabling Easy and Fast Development of Domain-Specific Scheduling Protocols
    Christian Tilgner, Boris Glavic, Michael H. Böhlen and Carl-Christian Kanne
    Proceedings of the 28th British National Conference on Databases (2011), pp. 128–131.
    details
  63. The Case for Fine-Grained Stream Provenance
    Boris Glavic, Kyumars Sheykh Esmaili, Peter M. Fischer and Nesime Tatbul
    Proceedings of the 1st Workshop on Data Streams and Event Processing collocated with BTW (2011), pp. 58–61.
    details
  64. Debugging Data Exchange with Vagabond
    Boris Glavic, Jiang Du, Renée J. Miller, Gustavo Alonso and Laura M. Haas
    Proceedings of the VLDB Endowment (Demonstration Track). 4, 12 (2011) , 1383–1386.
    details
  65. Declarative Serializable Snapshot Isolation
    Christian Tilgner, Boris Glavic, Michael H. Böhlen and Carl-Christian Kanne
    Proceedings of the 15th International Conference on Advances in Database and Information Systems (2011), pp. 170–184.
    details
  66. Data lineage/provenance in XQuery
    Donald Kossmann, Peter M. Fischer, Kyumars Sheykh Esmaili, Boris Glavic and Beat Steiger
    ETH Zurich.
    details
  67. Perm: Efficient Provenance Support for Relational Databases
    Boris Glavic
    University of Zurich.
    details
  68. TRAMP: Understanding the Behavior of Schema Mappings through Provenance
    Boris Glavic, Gustavo Alonso, Renée J. Miller and Laura M. Haas
    Proceedings of the Very Large Data Bases Endowment. 3, 1 (2010) , 1314–1325.
    details
  69. Correctness Proof of the Declarative SS2PL Protocol Implementation
    Christian Tilgner, Boris Glavic, Michael H. Böhlen and Carl-Christian Kanne
    University of Zurich.
    details
  70. Formal Foundation of Contribution Semantics and Provenance Computation through Query Rewrite in TRAMP
    Boris Glavic
    University of Zurich.
    details
  71. The Perm Provenance Management System in Action
    Boris Glavic and Gustavo Alonso
    Proceedings of the 35th ACM SIGMOD International Conference on Management of Data (Demonstration Track) (2009), pp. 1055–1058.
    details
  72. Provenance for Nested Subqueries
    Boris Glavic and Gustavo Alonso
    Proceedings of the 12th International Conference on Extending Database Technology (2009), pp. 982–993.
    details
  73. Perm: Processing Provenance and Data on the same Data Model through Query Rewriting
    Boris Glavic and Gustavo Alonso
    Proceedings of the 25th IEEE International Conference on Data Engineering (2009), pp. 174–185.
    details
  74. Clustering Multidimensional Sequences in Spatial and Temporal Databases
    Ira Assent, Ralph Krieger, Boris Glavic and Thomas Seidl
    International Journal on Knowledge and Information Systems. 16, 1 (2008) , 29–51.
    details
  75. Data Provenance: A Categorization of Existing Approaches
    Boris Glavic and Klaus R. Dittrich
    Proceedings of the 12th GI Conference on Datenbanksysteme in Buisness, Technologie und Web (2007), pp. 227–241.
    details
  76. sesam: Ensuring Privacy for an Interdisciplinary Longitudinal Study
    Boris Glavic and Klaus R. Dittrich
    Proceedings of the 1st Workshop Elektronische Datentreuhänderschaft - Anwendungen, Verfahren, Grundlagen collocated with GI Jahrestagung (2006), pp. 736–743.
    details
  77. Spatial Multidimensional Sequence Clustering
    Ira Assent, Ralph Krieger, Boris Glavic and Thomas Seidl
    Proceedings of the 1st International Workshop on Spatial and Spatio-temporal Data Mining collocated with ICDM (2006), pp. 343–348.
    details
  78. Subspace Sequence Clustering - Datamining zur Entscheidungsunterstützung in der Hydrologie
    Boris Glavic
    Proceedings of the 11th GI Conference on Database Systems for Business, Technology, and Web (Student Track) (2005), pp. 15–17.
    details