Publications
Publications by year in reverse chronological order. Generated by jekyll-scholar.
2024
- Similarity Measures For Incomplete Database Instances. In Proceedings 27th International Conference on Extending Database Technology, EDBT 2024, Paestum, Italy, March 25 - March 28, 2024
- Gen-T: Table Reclamation in Data Lakes. In 40th IEEE International Conference on Data Engineering, ICDE 2024, Utrecht, The Netherlands, May 13-16, 2024
- Comparing Incomplete Database Instances. In Proceedings of the 32nd Symposium of Advanced Database Systems, Villasimius, Italy, June 23rd to 26th, 2024
- A Large Scale Test Corpus for Semantic Table Search. In Proceedings of the 47th International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR 2024, Washington DC, USA, July 14-18, 2024
2023
- Explaining Dataset Changes for Semantic Data Versioning with Explain-Da-V. Proc. VLDB Endow., 2023
- Semantics-aware Dataset Discovery from Data Lakes with Contextualized Column-based Representation Learning. Proc. VLDB Endow., 2023
- DomainNet: Homograph Detection and Understanding in Data Lake Disambiguation. ACM Trans. Database Syst., 2023
- Table Discovery in Data Lakes: State-of-the-art and Future Directions. In Companion of the 2023 International Conference on Management of Data, SIGMOD/PODS 2023, Seattle, WA, USA, June 18-23, 2023
- DIALITE: Discover, Align and Integrate Open Data Tables. In Companion of the 2023 International Conference on Management of Data, SIGMOD/PODS 2023, Seattle, WA, USA, June 18-23, 2023
- Explaining Dataset Changes for Semantic Data Versioning with Explain-Da-V (Technical Report). CoRR, 2023
2022
- Semantics-aware Dataset Discovery from Data Lakes with Contextualized Column-based Representation Learning. CoRR, 2022
2021
- DomainNet: Homograph Detection for Data Lake Disambiguation. In Proceedings of the 24th International Conference on Extending Database Technology, EDBT 2021, Nicosia, Cyprus, March 23 - 26, 2021
- Towards Knowledge Exchange: State-of-the-Art and Open Problems. In SOFSEM 2021: Theory and Practice of Computer Science - 47th International Conference on Current Trends in Theory and Practice of Computer Science, SOFSEM 2021, Bolzano-Bozen, Italy, January 25-29, 2021, Proceedings, 2021
- DomainNet: Homograph Detection for Data Lake Disambiguation. CoRR, 2021
2020
- Knowledge Translation. Proc. VLDB Endow., 2020
- Pytheas: Pattern-based Table Discovery in CSV Files. Proc. VLDB Endow., 2020
- Organizing Data Lakes for Navigation. In Proceedings of the 2020 International Conference on Management of Data, SIGMOD Conference 2020, online conference [Portland, OR, USA], June 14-19, 2020
- Knowledge Translation: Extended Technical Report. CoRR, 2020
2019
- A Collective, Probabilistic Approach to Schema Mapping Using Diverse Noisy Evidence. IEEE Trans. Knowl. Data Eng., 2019
- Towards a Benchmark for Knowledge Base Exchange. In Proceedings of the 1st International Workshop on Challenges and Experiences from Data Integration to Knowledge Graphs co-located with the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining (KDD 2019), Anchorage, Alaska, August 5, 2019
- JOSIE: Overlap Set Similarity Search for Finding Joinable Tables in Data Lakes. In Proceedings of the 2019 International Conference on Management of Data, SIGMOD Conference 2019, Amsterdam, The Netherlands, June 30 - July 5, 2019
2018
- Making Open Data Transparent: Data Discovery on Open Data. IEEE Data Eng. Bull., 2018
- Let’s Make It Dirty with BART! In Proceedings of the 26th Italian Symposium on Advanced Database Systems, Castellaneta Marina (Taranto), Italy, June 24-27, 2018
- Optimizing Organizations for Navigating Data Lakes. CoRR, 2018
2017
- Second annual workshop on data driven knowledge mobilization. In Proceedings of the 27th Annual International Conference on Computer Science and Software Engineering, CASCON 2017, Markham, Ontario, Canada, November 6-8, 2017
- DeepSea: Progressive Workload-Aware Partitioning of Materialized Views in Scalable Data Analytics. In Proceedings of the 20th International Conference on Extending Database Technology, EDBT 2017, Venice, Italy, March 21-24, 2017
- A Collective, Probabilistic Approach to Schema Mapping. In 33rd IEEE International Conference on Data Engineering, ICDE 2017, San Diego, CA, USA, April 19-22, 2017
- VIQS: Visual Interactive Exploration of Query Semantics. In Proceedings of the 2017 ACM Workshop on Exploratory Search and Interactive Data Analytics, ESIDA@IUI 2017, Limassol, Cyprus, March 13, 2017
- The Future of Data Integration. In Proceedings of the 23rd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Halifax, NS, Canada, August 13 - 17, 2017
- A Collective, Probabilistic Approach to Schema Mapping: Appendix. CoRR, 2017
2016
- Benchmarking Data Curation Systems. IEEE Data Eng. Bull., 2016
- Data-driven knowledge mobilization. In Proceedings of the 26th Annual International Conference on Computer Science and Software Engineering, CASCON 2016, Toronto, Ontario, Canada, October 31 - November 2, 2016
- BART in Action: Error Generation and Empirical Evaluations of Data-Cleaning Systems. In Proceedings of the 2016 International Conference on Management of Data, SIGMOD Conference 2016, San Francisco, CA, USA, June 26 - July 01, 2016
- LSH Ensemble: Internet Scale Domain Search. CoRR, 2016
2015
- Messing Up with BART: Error Generation for Evaluating Data-Cleaning Algorithms. Proc. VLDB Endow., 2015
- LabBook: Metadata-driven social collaborative data analysis. In 2015 IEEE International Conference on Big Data (IEEE BigData 2015), Santa Clara, CA, USA, October 29 - November 1, 2015
- Automatic Curation of Clinical Trials Data in LinkedCT. In The Semantic Web - ISWC 2015 - 14th International Semantic Web Conference, Bethlehem, PA, USA, October 11-15, 2015, Proceedings, Part II, 2015
- LinkedCT Live: Platform for Online Curation of Clinical Trials Data. In Proceedings of the ISWC 2015 Posters & Demonstrations Track co-located with the 14th International Semantic Web Conference (ISWC-2015), Bethlehem, PA, USA, October 11, 2015
- VizCurator: A Visual Tool for Curating Open Data. In Proceedings of the 24th International Conference on World Wide Web Companion, WWW 2015, Florence, Italy, May 18-22, 2015 - Companion Volume, 2015
2014
- Big Data Curation. In 20th International Conference on Management of Data, COMAD 2014, Hyderabad, India, December 17-19, 2014
- Continuous data cleaning. In IEEE 30th International Conference on Data Engineering, Chicago, ICDE 2014, IL, USA, March 31 - April 4, 2014
- VoidWiz: Resolving incompleteness using network effects. In IEEE 30th International Conference on Data Engineering, Chicago, ICDE 2014, IL, USA, March 31 - April 4, 2014
2013
- Using SQL for Efficient Generation and Querying of Provenance Information. In In Search of Elegance in the Theory and Practice of Computation - Essays Dedicated to Peter Buneman, 2013
- Value invention in data exchange. In Proceedings of the ACM SIGMOD International Conference on Management of Data, SIGMOD 2013, New York, NY, USA, June 22-27, 2013
- Provenance for Data Mining. In 5th Workshop on the Theory and Practice of Provenance, TaPP’13, Lombard, IL, USA, April 2-3, 2013
2012
- The Vivification Problem in Real-Time Business Intelligence: A Vision. In Enabling Real-Time Business Intelligence - 6th International Workshop, BIRTE 2012, Held at the 38th International Conference on Very Large Databases, VLDB 2012, Istanbul, Turkey, August 27, 2012, Revised Selected Papers, 2012
- AutoDict: Automated Dictionary Discovery. In IEEE 28th International Conference on Data Engineering (ICDE 2012), Washington, DC, USA (Arlington, Virginia), 1-5 April, 2012
- Automated dictionary discovery for the online marketplace. In iConference 2012, Toronto, Ontario, Canada, February 7-10, 2012
2011
- Debugging Data Exchange with Vagabond. Proc. VLDB Endow., 2011
- NSERC business intelligence network: selected topics. In Center for Advanced Studies on Collaborative Research, CASCON ’11, Toronto, ON, Canada, November 7-10, 2011
- A unified model for data and constraint repair. In Proceedings of the 27th International Conference on Data Engineering, ICDE 2011, April 11-16, 2011, Hannover, Germany, 2011
- Active repair of data quality rules. In Proceedings of the 16th International Conference on Information Quality, ICIQ 2011, Adelaide, Australia, November 18-20, 2011
- Reexamining Some Holy Grails of Data Provenance. In 3rd Workshop on the Theory and Practice of Provenance, TaPP’11, Heraklion, Crete, Greece, June 20-21, 2011
- Linking Semistructured Data on the Web. In Proceedings of the 14th International Workshop on the Web and Databases 2011, WebDB 2011, Athens, Greece, June 12, 2011
- Proceedings of the ACM SIGMOD International Conference on Management of Data, SIGMOD 2011, Athens, Greece, June 12-16, 2011
2010
- SECRET: A Model for Analysis of the Execution Semantics of Stream Processing Systems. Proc. VLDB Endow., 2010
- Information Integration: a Vision for Integration Independence and Linking Open Data. In Proceedings of the 4th Alberto Mendelzon International Workshop on Foundations of Data Management, Buenos Aires, Argentina, May 17-20, 2010
- Online annotation of text streams with structured entities. In Proceedings of the 19th ACM Conference on Information and Knowledge Management, CIKM 2010, Toronto, Ontario, Canada, October 26-30, 2010
- Stream schema: providing and exploiting static metadata for data stream processing. In EDBT 2010, 13th International Conference on Extending Database Technology, Lausanne, Switzerland, March 22-26, 2010, Proceedings, 2010
- BibBase triplified. In Proceedings the 6th International Conference on Semantic Systems, I-SEMANTICS 2010, Graz, Austria, September 1-3, 2010
- A first step towards integration independence. In Workshops Proceedings of the 26th International Conference on Data Engineering, ICDE 2010, March 1-6, 2010, Long Beach, California, USA, 2010
- Composing local-as-view mappings: closure and applications. In Database Theory - ICDT 2010, 13th International Conference, Lausanne, Switzerland, March 23-25, 2010, Proceedings, 2010
- Publishing Bibliographic Data on the Semantic Web using BibBase. In Proceedings of the ISWC 2010 Posters & Demonstrations Track: Collected Abstracts, Shanghai, China, November 9, 2010
- Enabling Real-Time Business Intelligence - Third International Workshop, BIRTE 2009, Held at the 35th International Conference on Very Large Databases, VLDB 2009, Lyon, France, August 24, 2009, Revised Selected Papers, 2010
2009
- Clio: Schema Mapping Creation and Data Exchange. In Conceptual Modeling: Foundations and Applications - Essays in Honor of John Mylopoulos, 2009
- A framework for semantic link discovery over relational data. In Proceedings of the 18th ACM Conference on Information and Knowledge Management, CIKM 2009, Hong Kong, China, November 2-6, 2009
- (Not) yet another matcher. In Proceedings of the 18th ACM Conference on Information and Knowledge Management, CIKM 2009, Hong Kong, China, November 2-6, 2009
- YAM: a schema matcher factory. In Proceedings of the 18th ACM Conference on Information and Knowledge Management, CIKM 2009, Hong Kong, China, November 2-6, 2009
- Schema AND Data: A Holistic Approach to Mapping, Resolution and Fusion in Information Integration. In Conceptual Modeling - ER 2009, 28th International Conference on Conceptual Modeling, Gramado, Brazil, November 9-12, 2009. Proceedings, 2009
- LinkedCT: A Linked Data Space for Clinical Trials. CoRR, 2009
2008
- Muse: Mapping Understanding and deSign by Example. In Proceedings of the 24th International Conference on Data Engineering, ICDE 2008, April 7-12, 2008, Cancún, Mexico, 2008
- Muse: a system for understanding and designing mappings. In Proceedings of the ACM SIGMOD International Conference on Management of Data, SIGMOD 2008, Vancouver, BC, Canada, June 10-12, 2008
2007
- Retrospective on Clio: Schema Mapping and Data Exchange in Practice. In Proceedings of the 2007 International Workshop on Description Logics (DL2007), Brixen-Bressanone, near Bozen-Bolzano, Italy, 8-10 June, 2007
- A Semantic Approach to Discovering Schema Mapping Expressions. In Proceedings of the 23rd International Conference on Data Engineering, ICDE 2007, The Marmara Hotel, Istanbul, Turkey, April 15-20, 2007
- Creating Nested Mappings with Clio. In Proceedings of the 23rd International Conference on Data Engineering, ICDE 2007, The Marmara Hotel, Istanbul, Turkey, April 15-20, 2007
- Management of Inconsistent and Uncertain Data. In Proceedings of the Fifth International Workshop on Quality in Databases, QDB 2007, at the VLDB 2007 conference, Vienna, Austria, September 23, 2007
- Accuracy of Approximate String Joins Using Grams. In Proceedings of the Fifth International Workshop on Quality in Databases, QDB 2007, at the VLDB 2007 conference, Vienna, Austria, September 23, 2007
- Leveraging data and structure in ontology integration. In Proceedings of the ACM SIGMOD International Conference on Management of Data, Beijing, China, June 12-14, 2007
- Geographically-Sensitive Link Analysis. In 2007 IEEE / WIC / ACM International Conference on Web Intelligence, WI 2007, 2-5 November 2007, Silicon Valley, CA, USA, Main Conference Proceedings, 2007
2006
- Authorization-Transparent Access Control for XML Under the Non-Truman Model. In Advances in Database Technology - EDBT 2006, 10th International Conference on Extending Database Technology, Munich, Germany, March 26-31, 2006, Proceedings, 2006
- Clean Answers over Dirty Databases: A Probabilistic Approach. In Proceedings of the 22nd International Conference on Data Engineering, ICDE 2006, 3-8 April 2006, Atlanta, GA, USA, 2006
- Nested Mappings: Schema Mapping Reloaded. In Proceedings of the 32nd International Conference on Very Large Data Bases, Seoul, Korea, September 12-15, 2006
2005
- Representing and Querying Data Transformations. In Proceedings of the 21st International Conference on Data Engineering, ICDE 2005, 5-8 April 2005, Tokyo, Japan, 2005
- First-Order Query Rewriting for Inconsistent Databases. In Database Theory - ICDT 2005, 10th International Conference, Edinburgh, UK, January 5-7, 2005, Proceedings, 2005
- Peer data exchange. In Proceedings of the Twenty-fourth ACM SIGACT-SIGMOD-SIGART Symposium on Principles of Database Systems, June 13-15, 2005, Baltimore, Maryland, USA, 2005
- ConQuer: Efficient Management of Inconsistent Databases. In Proceedings of the ACM SIGMOD International Conference on Management of Data, Baltimore, Maryland, USA, June 14-16, 2005
- Data Sharing in the Hyperion Peer Database System. In Proceedings of the 31st International Conference on Very Large Data Bases, Trondheim, Norway, August 30 - September 2, 2005
- ConQuer: A System for Efficient Querying Over Inconsistent Databases. In Proceedings of the 31st International Conference on Very Large Data Bases, Trondheim, Norway, August 30 - September 2, 2005
2004
- LIMBO: Scalable Clustering of Categorical Data. In Advances in Database Technology - EDBT 2004, 9th International Conference on Extending Database Technology, Heraklion, Crete, Greece, March 14-18, 2004, Proceedings, 2004
- ToMAS: A System for Adapting Mappings while Schemas Evolve. In Proceedings of the 20th International Conference on Data Engineering, ICDE 2004, 30 March - 2 April 2004, Boston, MA, USA, 2004
- Information-Theoretic Tools for Mining Database Structure from Large Data Sets. In Proceedings of the ACM SIGMOD International Conference on Management of Data, Paris, France, June 13-18, 2004
- (e)Proceedings of the Thirtieth International Conference on Very Large Data Bases, VLDB 2004, Toronto, Canada, August 31 - September 3, 2004
2003
- Letter from the Special Issue Editor. IEEE Data Eng. Bull., 2003
- Schema Discovery. IEEE Data Eng. Bull., 2003
- Managing Data Mappings in the Hyperion Project. In Proceedings of the 19th International Conference on Data Engineering, March 5-8, 2003, Bangalore, India, 2003
- Data Exchange: Semantics and Query Answering. In Database Theory - ICDT 2003, 9th International Conference, Siena, Italy, January 8-10, 2003, Proceedings, 2003
- Towards Inconsistency Management in Data Integration Systems. In Proceedings of IJCAI-03 Workshop on Information Integration on the Web (IIWeb-03), August 9-10, 2003, Acapulco, Mexico, 2003
- Using Categorical Clustering in Schema Discovery. In Proceedings of IJCAI-03 Workshop on Information Integration on the Web (IIWeb-03), August 9-10, 2003, Acapulco, Mexico, 2003
- Mapping Data in Peer-to-Peer Systems: Semantics and Algorithmic Issues. In Proceedings of the 2003 ACM SIGMOD International Conference on Management of Data, San Diego, California, USA, June 9-12, 2003
- Mapping Adaptation under Evolving Schemas. In Proceedings of 29th International Conference on Very Large Data Bases, VLDB 2003, Berlin, Germany, September 9-12, 2003
2002
- Letter from the Special Issue Editor. IEEE Data Eng. Bull., 2002
- Schema Management. IEEE Data Eng. Bull., 2002
- Similarity Search Over Time-Series Data Using Wavelets. In Proceedings of the 18th International Conference on Data Engineering, San Jose, CA, USA, February 26 - March 1, 2002
- Mapping XML and Relational Schemas with Clio. In Proceedings of the 18th International Conference on Data Engineering, San Jose, CA, USA, February 26 - March 1, 2002
- Translating Web Data. In Proceedings of 28th International Conference on Very Large Data Bases, VLDB 2002, Hong Kong, August 20-23, 2002
2001
- Mining for Empty Rectangles in Large Data Sets. In Database Theory - ICDT 2001, 8th International Conference, London, UK, January 4-6, 2001, Proceedings, 2001
- Reverse Engineering Meets Data Analysis. In 9th International Workshop on Program Comprehension (IWPC 2001), 12-13 May 2001, Toronto, Canada, 2001
- Data-Driven Understanding and Refinement of Schema Mappings. In Proceedings of the 2001 ACM SIGMOD international conference on Management of data, Santa Barbara, CA, USA, May 21-24, 2001
- Clio: A Semi-Automatic Tool For Schema Mapping. In Proceedings of the 2001 ACM SIGMOD international conference on Management of data, Santa Barbara, CA, USA, May 21-24, 2001
2000
- Approximate Query Answering in High-Dimensional Data Cubes. In 2000 ACM SIGMOD Workshop on Research Issues in Data Mining and Knowledge Discovery, Dallas, Texas, USA, May 14, 2000
- Schema Mapping as Query Discovery. In VLDB 2000, Proceedings of 26th International Conference on Very Large Data Bases, September 10-14, 2000, Cairo, Egypt, 2000
1999
- Transforming Heterogeneous Data with Database Middleware: Beyond Integration. IEEE Data Eng. Bull., 1999
1998
- Using Schematically Heterogeneous Structures. In SIGMOD 1998, Proceedings ACM SIGMOD International Conference on Management of Data, June 2-4, 1998, Seattle, Washington, USA, 1998
1997
- Association Rules over Interval Data. In SIGMOD 1997, Proceedings ACM SIGMOD International Conference on Management of Data, May 13-15, 1997, Tucson, Arizona, USA, 1997
1996
- Using Metadata to Address Problems of Semantic Interoperability in Large Object Systems. In Proceedings of the 1st IEEE Metadata Conference 1996, MD 1996, Silver Spring, MD, USA, April 16-18, 1996
1994
- Schema Equivalence in Heterogeneous Systems: Bridging Theory and Practice (Extended Abstract). In Advances in Database Technology - EDBT’94. 4th International Conference on Extending Database Technology, Cambridge, United Kingdom, March 28-31, 1994, Proceedings, 1994
1993
- Desktop Experiment Management. IEEE Data Eng. Bull., 1993
- Understanding Schemas. In RIDE-IMS ’93, Third International Workshop on Research Issues in Data Engineering: Interoperability in Multidatabase Systems, Vienna, Austria, April 19-20, 1993
- The Use of Information Capacity in Schema Integration and Translation. In 19th International Conference on Very Large Data Bases, August 24-27, 1993, Dublin, Ireland, Proceedings, 1993