Publications
Publications by category in reverse chronological order. Generated by jekyll-scholar.
2026
- Fuzzy Integration of Data Lake Tables. In Proceedings 29th International Conference on Extending Database Technology, EDBT 2026, Tampere, Finland, March 24-27, 2026, 2026
2025
- Model Lakes. In Proceedings 28th International Conference on Extending Database Technology, EDBT 2025, Barcelona, Spain, March 25-28, 2025, 2025
- [Vision] Towards oblivious property graph databases. In Proceedings of the 8th Joint Workshop on Graph Data Management Experiences & Systems (GRADES) and Network Data Analytics (NDA), Berlin, Germany, June 22-27, 2025, 2025
- Diverse Unionable Tuple Search: Novelty-Driven Discovery in Data Lakes [Technical Report]. CoRR, May 2025
2024
- Similarity Measures For Incomplete Database Instances. In Proceedings 27th International Conference on Extending Database Technology, EDBT 2024, Paestum, Italy, March 25 - March 28, May 2024
- Gen-T: Table Reclamation in Data Lakes. In 40th IEEE International Conference on Data Engineering, ICDE 2024, Utrecht, The Netherlands, May 13-16, 2024, May 2024
- Comparing Incomplete Database Instances. In Proceedings of the 32nd Symposium of Advanced Database Systems, Villasimius, Italy, June 23rd to 26th, 2024, May 2024
- ALT-GEN: Benchmarking Table Union Search using Large Language Models. In Proceedings of Workshops at the 50th International Conference on Very Large Data Bases, VLDB 2024, Guangzhou, China, August 26-30, 2024, May 2024
- TP-TR Benchmarks from Gen-T: Table Reclamation in Data Lakes (Version 1). Mar 2024. Accessed on YYYY-MM-DD.
2023
- Explaining Dataset Changes for Semantic Data Versioning with Explain-Da-V. Proc. VLDB Endow., Mar 2023
- Semantics-aware Dataset Discovery from Data Lakes with Contextualized Column-based Representation Learning. Proc. VLDB Endow., Mar 2023
- DomainNet: Homograph Detection and Understanding in Data Lake Disambiguation. ACM Trans. Database Syst., Mar 2023
- Table Discovery in Data Lakes: State-of-the-art and Future Directions. In Companion of the 2023 International Conference on Management of Data, SIGMOD/PODS 2023, Seattle, WA, USA, June 18-23, 2023, Mar 2023
- DIALITE: Discover, Align and Integrate Open Data Tables. In Companion of the 2023 International Conference on Management of Data, SIGMOD/PODS 2023, Seattle, WA, USA, June 18-23, 2023, Mar 2023
- Explaining Dataset Changes for Semantic Data Versioning with Explain-Da-V (Technical Report). CoRR, Mar 2023
2022
- Semantics-aware Dataset Discovery from Data Lakes with Contextualized Column-based Representation Learning. CoRR, Mar 2022
2021
- DomainNet: Homograph Detection for Data Lake Disambiguation. In Proceedings of the 24th International Conference on Extending Database Technology, EDBT 2021, Nicosia, Cyprus, March 23 - 26, 2021, Mar 2021
- Towards Knowledge Exchange: State-of-the-Art and Open Problems. In SOFSEM 2021: Theory and Practice of Computer Science - 47th International Conference on Current Trends in Theory and Practice of Computer Science, SOFSEM 2021, Bolzano-Bozen, Italy, January 25-29, 2021, Proceedings, Mar 2021
- DomainNet: Homograph Detection for Data Lake Disambiguation. CoRR, Mar 2021
2020
- Knowledge Translation. Proc. VLDB Endow., Mar 2020
- Pytheas: Pattern-based Table Discovery in CSV Files. Proc. VLDB Endow., Mar 2020
- Knowledge Translation: Extended Technical Report. CoRR, Mar 2020
2019
- A Collective, Probabilistic Approach to Schema Mapping Using Diverse Noisy Evidence. IEEE Trans. Knowl. Data Eng., Mar 2019
- Towards a Benchmark for Knowledge Base Exchange. In Proceedings of the 1st International Workshop on Challenges and Experiences from Data Integration to Knowledge Graphs co-located with the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining (KDD 2019), Anchorage, Alaska, August 5, 2019, Mar 2019
- JOSIE: Overlap Set Similarity Search for Finding Joinable Tables in Data Lakes. In Proceedings of the 2019 International Conference on Management of Data, SIGMOD Conference 2019, Amsterdam, The Netherlands, June 30 - July 5, 2019, Mar 2019
2018
- Making Open Data Transparent: Data Discovery on Open Data. IEEE Data Eng. Bull., Mar 2018
- Let’s Make It Dirty with BART! In Proceedings of the 26th Italian Symposium on Advanced Database Systems, Castellaneta Marina (Taranto), Italy, June 24-27, 2018, Mar 2018
- Optimizing Organizations for Navigating Data Lakes. CoRR, Mar 2018
2017
- Second annual workshop on data driven knowledge mobilization. In Proceedings of the 27th Annual International Conference on Computer Science and Software Engineering, CASCON 2017, Markham, Ontario, Canada, November 6-8, 2017, Mar 2017
- DeepSea: Progressive Workload-Aware Partitioning of Materialized Views in Scalable Data Analytics. In Proceedings of the 20th International Conference on Extending Database Technology, EDBT 2017, Venice, Italy, March 21-24, 2017, Mar 2017
- A Collective, Probabilistic Approach to Schema Mapping. In 33rd IEEE International Conference on Data Engineering, ICDE 2017, San Diego, CA, USA, April 19-22, 2017, Mar 2017
- VIQS: Visual Interactive Exploration of Query Semantics. In Proceedings of the 2017 ACM Workshop on Exploratory Search and Interactive Data Analytics, ESIDA@IUI 2017, Limassol, Cyprus, March 13, 2017, Mar 2017
- The Future of Data Integration. In Proceedings of the 23rd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Halifax, NS, Canada, August 13 - 17, 2017, Mar 2017
- A Collective, Probabilistic Approach to Schema Mapping: Appendix. CoRR, Mar 2017
2016
- Benchmarking Data Curation Systems. IEEE Data Eng. Bull., Mar 2016
- Data-driven knowledge mobilization. In Proceedings of the 26th Annual International Conference on Computer Science and Software Engineering, CASCON 2016, Toronto, Ontario, Canada, October 31 - November 2, 2016, Mar 2016
- BART in Action: Error Generation and Empirical Evaluations of Data-Cleaning Systems. In Proceedings of the 2016 International Conference on Management of Data, SIGMOD Conference 2016, San Francisco, CA, USA, June 26 - July 01, 2016, Mar 2016
- LSH Ensemble: Internet Scale Domain Search. CoRR, Mar 2016
2015
- Messing Up with BART: Error Generation for Evaluating Data-Cleaning Algorithms. Proc. VLDB Endow., Mar 2015
- LabBook: Metadata-driven social collaborative data analysis. In 2015 IEEE International Conference on Big Data (IEEE BigData 2015), Santa Clara, CA, USA, October 29 - November 1, 2015, Mar 2015
- Automatic Curation of Clinical Trials Data in LinkedCT. In The Semantic Web - ISWC 2015 - 14th International Semantic Web Conference, Bethlehem, PA, USA, October 11-15, 2015, Proceedings, Part II, Mar 2015
- LinkedCT Live: Platform for Online Curation of Clinical Trials Data. In Proceedings of the ISWC 2015 Posters & Demonstrations Track co-located with the 14th International Semantic Web Conference (ISWC-2015), Bethlehem, PA, USA, October 11, 2015, Mar 2015
- VizCurator: A Visual Tool for Curating Open Data. In Proceedings of the 24th International Conference on World Wide Web Companion, WWW 2015, Florence, Italy, May 18-22, 2015 - Companion Volume, Mar 2015
2014
- Big Data Curation. In 20th International Conference on Management of Data, COMAD 2014, Hyderabad, India, December 17-19, 2014, Mar 2014
- Continuous data cleaning. In IEEE 30th International Conference on Data Engineering, Chicago, ICDE 2014, IL, USA, March 31 - April 4, 2014, Mar 2014
- VoidWiz: Resolving incompleteness using network effects. In IEEE 30th International Conference on Data Engineering, Chicago, ICDE 2014, IL, USA, March 31 - April 4, 2014, Mar 2014
2013
- Using SQL for Efficient Generation and Querying of Provenance Information. In In Search of Elegance in the Theory and Practice of Computation - Essays Dedicated to Peter Buneman, Mar 2013
- Value invention in data exchange. In Proceedings of the ACM SIGMOD International Conference on Management of Data, SIGMOD 2013, New York, NY, USA, June 22-27, 2013, Mar 2013
- Provenance for Data Mining. In 5th Workshop on the Theory and Practice of Provenance, TaPP’13, Lombard, IL, USA, April 2-3, 2013, Mar 2013
2012
- The Vivification Problem in Real-Time Business Intelligence: A Vision. In Enabling Real-Time Business Intelligence - 6th International Workshop, BIRTE 2012, Held at the 38th International Conference on Very Large Databases, VLDB 2012, Istanbul, Turkey, August 27, 2012, Revised Selected Papers, Mar 2012
- AutoDict: Automated Dictionary Discovery. In IEEE 28th International Conference on Data Engineering (ICDE 2012), Washington, DC, USA (Arlington, Virginia), 1-5 April, 2012, Mar 2012
- Automated dictionary discovery for the online marketplace. In iConference 2012, Toronto, Ontario, Canada, February 7-10, 2012, Mar 2012
2011
- Debugging Data Exchange with Vagabond. Proc. VLDB Endow., Mar 2011
- NSERC business intelligence network: selected topics. In Center for Advanced Studies on Collaborative Research, CASCON ’11, Toronto, ON, Canada, November 7-10, 2011, Mar 2011
- A unified model for data and constraint repair. In Proceedings of the 27th International Conference on Data Engineering, ICDE 2011, April 11-16, 2011, Hannover, Germany, Mar 2011
- Active repair of data quality rules. In Proceedings of the 16th International Conference on Information Quality, ICIQ 2011, Adelaide, Australia, November 18-20, 2011, Mar 2011
- Reexamining Some Holy Grails of Data Provenance. In 3rd Workshop on the Theory and Practice of Provenance, TaPP’11, Heraklion, Crete, Greece, June 20-21, 2011, Mar 2011
- Linking Semistructured Data on the Web. In Proceedings of the 14th International Workshop on the Web and Databases 2011, WebDB 2011, Athens, Greece, June 12, 2011, Mar 2011
- Proceedings of the ACM SIGMOD International Conference on Management of Data, SIGMOD 2011, Athens, Greece, June 12-16, 2011. Mar 2011
2010
- SECRET: A Model for Analysis of the Execution Semantics of Stream Processing Systems. Proc. VLDB Endow., Mar 2010
- TRAMP: Understanding the Behavior of Schema Mappings through Provenance. Proc. VLDB Endow., Mar 2010
- Information Integration: a Vision for Integration Independence and Linking Open Data. In Proceedings of the 4th Alberto Mendelzon International Workshop on Foundations of Data Management, Buenos Aires, Argentina, May 17-20, 2010, Mar 2010
- Online annotation of text streams with structured entities. In Proceedings of the 19th ACM Conference on Information and Knowledge Management, CIKM 2010, Toronto, Ontario, Canada, October 26-30, 2010, Mar 2010
- Stream schema: providing and exploiting static metadata for data stream processing. In EDBT 2010, 13th International Conference on Extending Database Technology, Lausanne, Switzerland, March 22-26, 2010, Proceedings, Mar 2010
- BibBase triplified. In Proceedings the 6th International Conference on Semantic Systems, I-SEMANTICS 2010, Graz, Austria, September 1-3, 2010, Mar 2010
- A first step towards integration independence. In Workshops Proceedings of the 26th International Conference on Data Engineering, ICDE 2010, March 1-6, 2010, Long Beach, California, USA, Mar 2010
- Composing local-as-view mappings: closure and applications. In Database Theory - ICDT 2010, 13th International Conference, Lausanne, Switzerland, March 23-25, 2010, Proceedings, Mar 2010
- Publishing Bibliographic Data on the Semantic Web using BibBase. In Proceedings of the ISWC 2010 Posters & Demonstrations Track: Collected Abstracts, Shanghai, China, November 9, 2010, Mar 2010
- Enabling Real-Time Business Intelligence - Third International Workshop, BIRTE 2009, Held at the 35th International Conference on Very Large Databases, VLDB 2009, Lyon, France, August 24, 2009, Revised Selected Papers. Mar 2010
2009
- Framework for Evaluating Clustering Algorithms in Duplicate Detection. Proc. VLDB Endow., Mar 2009
- Clio: Schema Mapping Creation and Data Exchange. In Conceptual Modeling: Foundations and Applications - Essays in Honor of John Mylopoulos, Mar 2009
- A framework for semantic link discovery over relational data. In Proceedings of the 18th ACM Conference on Information and Knowledge Management, CIKM 2009, Hong Kong, China, November 2-6, 2009, Mar 2009
- (Not) yet another matcher. In Proceedings of the 18th ACM Conference on Information and Knowledge Management, CIKM 2009, Hong Kong, China, November 2-6, 2009, Mar 2009
- YAM: a schema matcher factory. In Proceedings of the 18th ACM Conference on Information and Knowledge Management, CIKM 2009, Hong Kong, China, November 2-6, 2009, Mar 2009
- Schema AND Data: A Holistic Approach to Mapping, Resolution and Fusion in Information Integration. In Conceptual Modeling - ER 2009, 28th International Conference on Conceptual Modeling, Gramado, Brazil, November 9-12, 2009. Proceedings, Mar 2009
- LinkedCT: A Linked Data Space for Clinical Trials. CoRR, Mar 2009
2008
- Muse: Mapping Understanding and deSign by Example. In Proceedings of the 24th International Conference on Data Engineering, ICDE 2008, April 7-12, 2008, Cancún, Mexico, Mar 2008
- Muse: a system for understanding and designing mappings. In Proceedings of the ACM SIGMOD International Conference on Management of Data, SIGMOD 2008, Vancouver, BC, Canada, June 10-12, 2008, Mar 2008
2007
- Retrospective on Clio: Schema Mapping and Data Exchange in Practice. In Proceedings of the 2007 International Workshop on Description Logics (DL2007), Brixen-Bressanone, near Bozen-Bolzano, Italy, 8-10 June, 2007, Mar 2007
- A Semantic Approach to Discovering Schema Mapping Expressions. In Proceedings of the 23rd International Conference on Data Engineering, ICDE 2007, The Marmara Hotel, Istanbul, Turkey, April 15-20, 2007, Mar 2007
- Creating Nested Mappings with Clio. In Proceedings of the 23rd International Conference on Data Engineering, ICDE 2007, The Marmara Hotel, Istanbul, Turkey, April 15-20, 2007, Mar 2007
- Management of Inconsistent and Uncertain Data. In Proceedings of the Fifth International Workshop on Quality in Databases, QDB 2007, at the VLDB 2007 conference, Vienna, Austria, September 23, 2007, Mar 2007
- Accuracy of Approximate String Joins Using Grams. In Proceedings of the Fifth International Workshop on Quality in Databases, QDB 2007, at the VLDB 2007 conference, Vienna, Austria, September 23, 2007, Mar 2007
- Leveraging data and structure in ontology integration. In Proceedings of the ACM SIGMOD International Conference on Management of Data, Beijing, China, June 12-14, 2007, Mar 2007
- Geographically-Sensitive Link Analysis. In 2007 IEEE / WIC / ACM International Conference on Web Intelligence, WI 2007, 2-5 November 2007, Silicon Valley, CA, USA, Main Conference Proceedings, Mar 2007
2006
- Authorization-Transparent Access Control for XML Under the Non-Truman Model. In Advances in Database Technology - EDBT 2006, 10th International Conference on Extending Database Technology, Munich, Germany, March 26-31, 2006, Proceedings, Mar 2006
- Clean Answers over Dirty Databases: A Probabilistic Approach. In Proceedings of the 22nd International Conference on Data Engineering, ICDE 2006, 3-8 April 2006, Atlanta, GA, USA, Mar 2006
- Nested Mappings: Schema Mapping Reloaded. In Proceedings of the 32nd International Conference on Very Large Data Bases, Seoul, Korea, September 12-15, 2006, Mar 2006
2005
- Representing and Querying Data Transformations. In Proceedings of the 21st International Conference on Data Engineering, ICDE 2005, 5-8 April 2005, Tokyo, Japan, Mar 2005
- First-Order Query Rewriting for Inconsistent Databases. In Database Theory - ICDT 2005, 10th International Conference, Edinburgh, UK, January 5-7, 2005, Proceedings, Mar 2005
- Peer data exchange. In Proceedings of the Twenty-fourth ACM SIGACT-SIGMOD-SIGART Symposium on Principles of Database Systems, June 13-15, 2005, Baltimore, Maryland, USA, Mar 2005
- ConQuer: Efficient Management of Inconsistent Databases. In Proceedings of the ACM SIGMOD International Conference on Management of Data, Baltimore, Maryland, USA, June 14-16, 2005, Mar 2005
- Data Sharing in the Hyperion Peer Database System. In Proceedings of the 31st International Conference on Very Large Data Bases, Trondheim, Norway, August 30 - September 2, 2005, Mar 2005
- ConQuer: A System for Efficient Querying Over Inconsistent Databases. In Proceedings of the 31st International Conference on Very Large Data Bases, Trondheim, Norway, August 30 - September 2, 2005, Mar 2005
2004
- LIMBO: Scalable Clustering of Categorical Data. In Advances in Database Technology - EDBT 2004, 9th International Conference on Extending Database Technology, Heraklion, Crete, Greece, March 14-18, 2004, Proceedings, Mar 2004
- ToMAS: A System for Adapting Mappings while Schemas Evolve. In Proceedings of the 20th International Conference on Data Engineering, ICDE 2004, 30 March - 2 April 2004, Boston, MA, USA, Mar 2004
- Information-Theoretic Tools for Mining Database Structure from Large Data Sets. In Proceedings of the ACM SIGMOD International Conference on Management of Data, Paris, France, June 13-18, 2004, Mar 2004
- (e)Proceedings of the Thirtieth International Conference on Very Large Data Bases, VLDB 2004, Toronto, Canada, August 31 - September 3 2004. Mar 2004
2003
- Letter from the Special Issue Editor. IEEE Data Eng. Bull., Mar 2003
- Schema Discovery. IEEE Data Eng. Bull., Mar 2003
- Managing Data Mappings in the Hyperion Project. In Proceedings of the 19th International Conference on Data Engineering, March 5-8, 2003, Bangalore, India, Mar 2003
- Data Exchange: Semantics and Query Answering. In Database Theory - ICDT 2003, 9th International Conference, Siena, Italy, January 8-10, 2003, Proceedings, Mar 2003
- Towards Inconsistency Management in Data Integration Systems. In Proceedings of IJCAI-03 Workshop on Information Integration on the Web (IIWeb-03), August 9-10, 2003, Acapulco, Mexico, Mar 2003
- Using Categorical Clustering in Schema Discovery. In Proceedings of IJCAI-03 Workshop on Information Integration on the Web (IIWeb-03), August 9-10, 2003, Acapulco, Mexico, Mar 2003
- Mapping Data in Peer-to-Peer Systems: Semantics and Algorithmic Issues. In Proceedings of the 2003 ACM SIGMOD International Conference on Management of Data, San Diego, California, USA, June 9-12, 2003, Mar 2003
- Mapping Adaptation under Evolving Schemas. In Proceedings of 29th International Conference on Very Large Data Bases, VLDB 2003, Berlin, Germany, September 9-12, 2003, Mar 2003
2002
- Letter from the Special Issue Editor. IEEE Data Eng. Bull., Mar 2002
- Schema Management. IEEE Data Eng. Bull., Mar 2002
- Similarity Search Over Time-Series Data Using Wavelets. In Proceedings of the 18th International Conference on Data Engineering, San Jose, CA, USA, February 26 - March 1, 2002, Mar 2002
- Mapping XML and Relational Schemas with Clio. In Proceedings of the 18th International Conference on Data Engineering, San Jose, CA, USA, February 26 - March 1, 2002, Mar 2002
- Translating Web Data. In Proceedings of 28th International Conference on Very Large Data Bases, VLDB 2002, Hong Kong, August 20-23, 2002, Mar 2002
2001
- Mining for Empty Rectangles in Large Data Sets. In Database Theory - ICDT 2001, 8th International Conference, London, UK, January 4-6, 2001, Proceedings, Mar 2001
- Reverse Engineering Meets Data Analysis. In 9th International Workshop on Program Comprehension (IWPC 2001), 12-13 May 2001, Toronto, Canada, Mar 2001
- Data-Driven Understanding and Refinement of Schema Mappings. In Proceedings of the 2001 ACM SIGMOD international conference on Management of data, Santa Barbara, CA, USA, May 21-24, 2001, Mar 2001
- Clio: A Semi-Automatic Tool For Schema Mapping. In Proceedings of the 2001 ACM SIGMOD international conference on Management of data, Santa Barbara, CA, USA, May 21-24, 2001, Mar 2001
2000
- Approximate Query Answering in High-Dimensional Data Cubes. In 2000 ACM SIGMOD Workshop on Research Issues in Data Mining and Knowledge Discovery, Dallas, Texas, USA, May 14, 2000, Mar 2000
- Schema Mapping as Query Discovery. In VLDB 2000, Proceedings of 26th International Conference on Very Large Data Bases, September 10-14, 2000, Cairo, Egypt, Mar 2000
1999
- Transforming Heterogeneous Data with Database Middleware: Beyond Integration. IEEE Data Eng. Bull., Mar 1999
1998
- Using Schematically Heterogeneous Structures. In SIGMOD 1998, Proceedings ACM SIGMOD International Conference on Management of Data, June 2-4, 1998, Seattle, Washington, USA, Mar 1998
1997
- Association Rules over Interval Data. In SIGMOD 1997, Proceedings ACM SIGMOD International Conference on Management of Data, May 13-15, 1997, Tucson, Arizona, USA, Mar 1997
1996
- Using Metadata to Address Problems of Semantic Interoperability in Large Object Systems. In Proceedings of the 1st IEEE Metadata Conference 1996, MD 1996, Silver Spring, MD, USA, April 16-18, 1996, Mar 1996
1994
- Schema Equivalence in Heterogeneous Systems: Bridging Theory and Practice (Extended Abstract). In Advances in Database Technology - EDBT’94. 4th International Conference on Extending Database Technology, Cambridge, United Kingdom, March 28-31, 1994, Proceedings, Mar 1994
1993
- Desktop Experiment Management. IEEE Data Eng. Bull., Mar 1993
- Understanding Schemas. In RIDE-IMS ’93, Third International Workshop on Research Issues in Data Engineering: Interoperability in Multidatabase Systems, Vienna, Austria, April 19-20, 1993, Mar 1993
- The Use of Information Capacity in Schema Integration and Translation. In 19th International Conference on Very Large Data Bases, August 24-27, 1993, Dublin, Ireland, Proceedings, Mar 1993