2015


QOCO: A Query Oriented Data Cleaning System with Oracles
(with M. Bergman, T. Milo, S. Novgorodov)
Proceedings of the VLDB Endowment (PVLDB), 2015. (Demonstration Track) (To appear)

A Time Machine for Information: Looking Back to Look Forward
(with L. Dong)
Proceedings of the VLDB Endowment (PVLDB), 2015. (Tutorial Track) (To appear)

High-Level Explanations using Ontologies
(with B. ten Cate, C. Civili, E. Sherkhonov)
ACM Symposium on Principles of Database Systems (PODS), 2015.

QOCO: Query-Oriented Data Cleaning with Oracles
(with M. Bergman, T. Milo, S. Novgorodov)
ACM SIGMOD International Conference on Management of Data (SIGMOD), 2015.

Linking Temporal Records for Profiling Entities
(with F. Li, M. L. Lee, W. Hsu)
ACM SIGMOD International Conference on Management of Data (SIGMOD), 2015.

Approximation Algorithms for Schema-Mapping Discovery from Data Examples
(with B. ten Cate, Ph. G. Kolaitis, K. Qian)
Alberto Mendelzon Workshop (AMW), 2015.

A Declarative Framework for Linking Entities
(with D. Burdick, R. Fagin, Ph. G. Kolaitis, L. Popa)
International Conference on Database Theory (ICDT), 2015.
Best paper award

2014


Preference-aware Integration of Temporal Data
(with B. Alexe, M. Roth)
Proceedings of the VLDB Endowment (PVLDB), 2014.

A Hybrid Machine-Crowdsourcing System for Matching Web Tables
(with J. Fan, M. Lu, B.C. Ooi)
International Conference on Data Engineering (ICDE), 2014.

Federation in Cloud Data Management: Challenges and Opportunities
(with G. Chen, H.V. Jagadish, D. Jiang, D. Maier, B. C. Ooi, K.-L Tan)
IEEE Transactions on Knowledge and Data Engineering (TKDE), Vol. 26, No. 7, 2014.

Query Answering over Incomplete and Uncertain RDF
(with E. Pema, A. Kementsietsidis)
International Workshop on Web and Databases (WebDB), 2014.

2013


Efficient Querying of Inconsistent Databases with Binary Integer Programming
(with E. Pema and Ph. G. Kolaitis)
Proceedings of the VLDB Endowment (PVLDB), 2013.

Schema mappings and data examples
(with B. ten Cate, Ph. G. Kolaitis)
International Conference on Extending Database Technology (EDBT), 2013. (Tutorial Track)

Data Integration and Data Exchange: It's really about time
(with M. Roth)
Conference on Innovative Data Systems Research (CIDR), 2013.

2012


Asking the Right Questions in Crowd Data Sourcing
(with R. Boim, O. Greenshpan, T. Milo, S. Novgorodov, N. Polyzotis)
International Conference on Data Engineering (ICDE), 2008. (Demonstration Track)

Splash: A Platform for Analysis and Simulation of Health
(with P. J. Haas, R. L. Mak, C. A. Kieliszewski, P. G. Selinger, P. P. Magio, S. Glissmann, M. Cefkin)
ACM International Health Informatics Symposium, 2012.

MapMerge: Correlating Independent Schema Mappings
(with B. Alexe, M. A. Hernandez, L. Popa)
VLDB Journal, Vol. 21, No. 2, 2012.
Invited paper from the 2010 VLDB conference.

2011


Schema Mapping Evolution through Composition and Inversion
(with R. Fagin, Ph. G. Kolaitis, L. Popa)
Book chapter in Schema Mapping Evolution Through Composition and Inversion

Characterizing Schema Mappings via Data Examples
(with B. Alexe, B. ten Cate, Ph. G. Kolaitis)
ACM Transactions on Database Systems (TODS) Vol. 36, No. 4, 2011.
Invited paper from the 2010 PODS conference

Data is Dead... without What-If Models
(with P. J. Haas, P. P. Maglio, P. G. Selinger)
Proceedings of the VLDB Endowment (PVLDB) (Challenges and Vision Track), 2011.
Best paper award (3rd place)

EIRENE: Interactive Design and Refinement of Schema Mappings via Data Examples
(with B. Alexe, B. ten Cate, Ph. G. Kolaitis)
Proceedings of the VLDB Endowment (PVLDB) (Demonstration Track), 2011.

Designing and Refining Schema Mappings with Data Examples
(with B. Alexe, B. ten Cate, Ph. G. Kolaitis)
ACM SIGMOD International Conference on Management of Data (SIGMOD), 2011.

Reverse Data Exchange: Coping with Nulls
(with R. Fagin, Ph. G. Kolaitis, L. Popa)
ACM Transactions on Database Systems (TODS) Vol. 36, No. 2, 2011.
Invited paper from the 2009 PODS conference

2010


SPLASH: A Progress Report on Building a Platform for a 360 Degree View of Health
(with M. Cefkin, S. M. Glissman, P. J. Haas, L. Jalali, P. P. Maglio, P. G. Selinger)
Proceedings of the 5th INFORMS Workshop on Data Mining and Health Informatics (DM-HI), 2010.

MapMerge: Correlating Independent Schema Mappings
(with B. Alexe, M. A. Hernández, L. Popa)
Proceedings of the VLDB Endowment (PVLDB), 2010.

Database Constraints and Homomorphism Dualities
(with B. ten Cate, Ph. G. Kolaitis)
International Conference on Principles and Practice of Constraint Programming (CP), 2010.

Characterizing Schema Mappings via Data Examples
(with B. Alexe, Ph. G. Kolaitis)
ACM Symposium on Principles of Database Systems (PODS), 2010.

2009


Provenance in Databases: Why, How, and Where
(with J. Cheney, L. Chiticariu)
Foundations and Trends in Databases, 2009.

Laconic Schema Mappings: Computing the Core with SQL Queries
(with B. ten Cate, L. Chiticariu, Ph. G. Kolaitis)
Proceedings of the VLDB Endowment (PVLDB), 2009.

Artemis: A System for Analyzing Missing Answers
(with M. Herschel, M. A. Hernández)
Proceedings of the VLDB Endowment (PVLDB) (Demonstration Track), 2009.

Reverse Data Exchange:Coping with Nulls
(with R. Fagin, Ph. G. Kolaitis, L. Popa)
ACM Symposium on Principles of Database Systems (PODS), 2009.

Fusing Data Management Services with File Systems
(with S. A. Brandt, C. Maltzahn, N. Polyzotis)
Petascale Data Storage Workshop, 2009.

2008


Comparing and Evaluating Mapping Systems with STMark
(with B. Alexe, Y. Velegrakis)
Proceedings of the VLDB Endowment (PVLDB) (Demonstration Track), 2008.

STMark: Towards a Benchmark for Mapping Systems
(with B. Alexe, Y. Velegrakis)
Proceedings of the VLDB Endowment (PVLDB), 2008.

Data Exchange with Data-Metadata Translations
(with M. A. Hernández, P. Papotti)
Proceedings of the VLDB Endowment (PVLDB), 2008.

Quasi-inverses of Schema Mappings
(with R. Fagin, Ph. G. Kolaitis, L. Popa)
ACM Transactions on Database Systems (TODS) Vol. 33, No. 2, 2008.
Invited paper from the PODS 2007 conference

Curated Databases
(with P. Buneman, J. Cheney, S. Vansumerren)
ACM Symposium on Principles of Database Systems (PODS), 2008.

Muse: A System for Understanding and Designing Mappings
(with B. Alexe, L. Chiticariu, R. J. Miller, D. Pepper)
ACM SIGMOD International Conference on Management of Data (SIGMOD) (Demonstration Track), 2008

Muse: Mapping Understanding and deSign by Example
(with B. Alexe, L. Chiticariu, R. J. Miller)
International Conference on Data Engineering (ICDE), 2008.

2007


Provenance in Databases: Past, Current, and Future
W. Tan
IEEE Data Engineering Bulletin.

Provenance in Databases
(with P. Buneman)
ACM SIGMOD International Conference on Management of Data (SIGMOD) (Tutorial Track).
Tutorial slides in [ppt] (requires TexPoint for Math symbols to display properly) or [pdf].

Quasi-inverses of Schema Mappings
(with R. Fagin, Ph. G. Kolaitis, L. Popa)
ACM Symposium on Principles of Database Systems (PODS), 2007.

2006


Peer Data Exchange
(with A. Fuxman, Ph. G. Kolaitis, R. J. Miller)
ACM Transactions on Database Systems (TODS) Vol. 31, No. 4, pages 1454 - 1498, 2006.
Invited paper from the PODS 2005 conference

SPIDER: A Schema MapPIng DEbuggeR
(with B. Alexe, L. Chiticariu)
Very Large Data Bases (VLDB) (Demonstration Track), 2006.

Debugging Schema Mappings with Routes
(with L. Chiticariu)
Very Large Data Bases (VLDB), 2006.

The Complexity of Data Exchange
(with J. Panttaja, Ph. G. Kolaitis)
ACM Symposium on Principles of Database Systems (PODS), 2006

2005


An Annotation Management System for Relational Databases
(with D. Bhagwat, L. Chiticariu, G. Vijayvargiya)
VLDB Journal, Vol. 14, No. 4, 2005
Invited paper from the VLDB 2004 conference

Composing Schema Mappings: Second-Order Dependencies to the Rescue
(with R. Fagin, Ph. G. Kolaitis, L. Popa)
ACM Transactions on Database Systems (TODS), Vol. 30, No. 4, pages 994-1055, 2005
Invited paper from the PODS 2004 conference

DBNotes: A Post-It System for Relational Databases [fullversion]
(with L. Chiticariu, G. Vijayvargiya)
ACM SIGMOD International Conference on Management of Data (SIGMOD) (Demonstration Track), 2005

Peer Data Exchange
(with A. Fuxman, Ph. G. Kolaitis, R. J. Miller)
ACM Symposium on Principles of Database Systems (PODS), 2005

2004


Research Problems in Data Provenance
W. Tan
IEEE Data Engineering Bulletin, vol. 27, no. 4, pages 45-52, 2004

An Annotation Management System for Relational Databases
(with D. Bhagwat, L. Chiticariu, G. Vijayvargiya)
Very Large Data Bases (VLDB), 2004

Composing Schema Mappings: Second-Order Dependencies to the Rescue
(with R. Fagin, Ph. G. Kolaitis, L. Popa)
ACM Symposium on Principles of Database Systems (PODS) , 2004.
IBM 2004 Pat Goldberg Best Paper Award
 
Archiving Scientific Data
(with P. Buneman, S. Khanna, K. Tajima)
ACM Transactions on Database Systems (TODS), vol. 29, No. 1, pages 2-42, 2004
Invited paper from the SIGMOD 2002 conference

2003


Containment of Relational Queries with Annotation Propagation [ fullversion]
W. Tan
International Workshop on Data Base and Programming Languages (DBPL), 2003

Reasoning about Keys for XML
(with P. Buneman, S. Davidson, W. Fan, C. Hara)
Information Systems Journal, vol. 28, no. 8, pages 1037-1063, December, 2003

2002


SilkRoute: A Framework for Publishing Relational Data in XML
(with M. Fernández, Y. Kadiyska, A. Morishima, D. Suciu)
ACM Transactions on Database Systems (TODS), vol. 27, no. 4, pages 438-493, 2002

On Propagation of Deletions and Annotations through Views
(with P. Buneman, S. Khanna)
ACM Symposium on Principles of Database Systems (PODS), 2002.

Archiving Scientific Data
(with P. Buneman, S. Khanna, K. Tajima)
ACM SIGMOD International Conference on Management of Data (SIGMOD), 2002

Keys for XML
(with P. Buneman, S. Davidson, W. Fan, C. Hara)
Computer Networks, vol. 39, no. 5, pages 473-487, 2002
Best Paper Award

2001


Reasoning about Keys for XML (with P. Buneman, S. Davidson, W. Fan, C. Hara)
International Workshop on Database Programming Languages (DBPL) , 2001.

On Computing Functions with Uncertainty (with S. Khanna)
ACM Symposium on Principles of Database Systems (PODS), 2001.

Publishing Relational Data in XML: the SilkRoute Approach
(with M. Fernández, A. Morishima, D. Suciu)
IEEE Data Engineering Bulletin, Vol. 24, no. 2, pages 12-19, 2001.

Keys for XML
(with P. Buneman, S. Davidson, W. Fan, C. Hara)
International Conference on World Wide Web (WWW10), 2001.
Commended Paper

Why and Where: A Charaterization of Data Provenance
(with P. Buneman, S. Khanna)
International Conference on Database Theory (ICDT), 2001.

2000


Towards a query language for annotation graphs
(with P. Buneman and S. Bird)
International Conference on Language Resources and Evaluation, 2000.

Data Provenance: Some Basic Issues
(with P. Buneman, S. Khanna)
Foundations of Software Technology and Theoretical Computer Science (FSTTCS), 2000.

SilkRoute: Trading between Relations and XML
(with M. Fernández, D. Suciu)
International World Wide Web Conference (WWW9) , 2000.

1998 and earlier


A Deterministic Model for Semistructured Data
(with P. Buneman, A. Deutsch)
Workshop on Query Processing for Semistructured Data and Non-Standard Data Formats, 1998.

Beyond XML Query Languages
(with P. Buneman, A Deutsch, W Fan, H. Liefke, A. Sahuguet)
Query Language Workshop (QL98), 1998.

QUICK: Graphical User Interface to Multiple Databases.
(with K. Wang, L. Wong)
Database and Expert Systems Applications (DEXA) Workshop, 1996.