Publications

 

  • Papers by Year
     

2015

 

    • Xin Luna Dong, Evgeniy Gabrilovich, Kevin Murphy, Van Dang, Wilko Horn, Camillo Lugaresi, Shaohua Sun, and Wei Zhang. Knowledge-based trust: estimating the trustworthiness of web sources. In VLDB, 2015. [PDF][Presentation]
    • Wenfei Fan, Zhe Fan, Chao Tian, and Xin Luna Dong. Keys for Graphs. In VLDB 2015. [PDF][Presentation]
    • Xin Luna Dong and Wang-Chiew Tan: A Time Machine for Information: Looking Back to Look Forward. Tutorial in VLDB, 2015. [PDF][Slides]
    • Xiaolan Wang, Mary Feng, Yue Wang, Xin Luna Dong, Alexandra Meliou. Error diagnosis and data profiling with Data X-Ray. Demo in VLDB, 2015. [PDF]
    • Tim Althoff, Xin Luna Dong, Kevin Murphy, Safa Alai, Van Dang, and Wei Zhang. TimeMachine: timeline generation for knowledge-base entities. In SIGKDD 2015. [PDF][Presentation]
    • Xin Luna Dong and Divesh Srivastava. Knowledge curation and knowledge fusion: challenges, models, and applications. Tutorial in Sigmod'15. [PDF][Presentation]
    • Xiaolan Wang, Xin Luna Dong, Alexandra Meliou. Data X-Ray: A diagnostic tool for data errors. In Sigmod, 2015. [PDF][Presentation]
    • Pei Li, Xin Luna Dong, Songtao Guo, Andrea Maurino, and Divesh Srivastava. Robust group linkage. In WWW 2015. [PDF][Presentation][Report]
    • Xian Li, Xin Luna Dong, Kenneth Lyons, Weiyi Meng, and Divesh Srivastava. Scaling up Copy Detection. In ICDE, 2015. [PDF][Report]
    • Theodoros Rehatsinas, Xin Luna Dong, Lise Getoor, Divesh Srivastava. Finding Quality in Quantity: The Challenge of Discovering Valuable Sources for Integration In CIDR, 2015. [PDF][Presentation]

 

 

2014

 

    • Xin Luna Dong, Evgeniy Gabrilovich, Geremy Heitz, Wilko Horn, Ni Lao, Kevin Murphy, Thomas Strohmann, Shaohua Sun, and Wei Zhang. Knowledge Vault: A Web-scale approach to probabilistic knowledge fusion. In SIGKDD, 2014. [PDF]
    • Xin Luna Dong, Evgeniy Gabrilovich, Geremy Heitz, Wilko Horn, Kevin Murphy, Shaohua Sun, and Wei Zhang. From data fusion to knowledge fusion. In VLDB, 2014. [PDF][Presentation]
    • Anja Gruenheid, Xin Luna Dong, and Divesh Srivastava. Incremental record linkage. In VLDB 2014. [PDF][Report][Presentation]
    • Mariam Salloum, Xin Luna Dong, Divesh Srivastava, Vassilis J. Tsotras. Online ordering of overlapping data sources. In VLDB, 2014. [PDF][Presentation]
    • Ravali Pochampally, Anish Das Sarma, Xin Luna Dong, Alexandra Meliou, and Divesh Srivastava. Fusing data with correlations. In Sigmod, 2014. [PDF][Presentation][Poster]
    • Theodoros Rehatsinas, Xin Luna Dong, Divesh Srivastava. Characterizing and selecting fresh data sources. In Sigmod, 2014. [PDF][Presentation]

 

 

2013

 

    • Xin Luna Dong, Barna Saha, and Divesh Srivastava. Less is More: Selecting Sources Wisely for Integration. In VLDB, 2013. [PDF][Full paper] [Slides (short)] [Slides (long)]
    • Xian Li, Xin Luna Dong, Kenneth Lyons, Weiyi Meng, and Divesh Srivastava. Truth Finding on the Deep Web: Is the Problem Solved? In VLDB, 2013. [PDF][Full paper] [Presentation]
    • Xin Luna Dong and Divesh Srivastava. Big data integration. Tutorial in VLDB'13. [PDF] [Presentation]
    • Xin Luna Dong, Laure Berti-Equille, Divesh Srivastava. Data fusion: Resolving conflicts from mutiple sources. In WAIM, 2013. [PDF]
    • Xin Luna Dong and Divesh Srivastava. Compact explanation of data fusion decisions. In WWW, 2013. [PDF][Report][Presentation]
    • Xin Luna Dong and Divesh Srivastava. Big data integration. Tutorial in ICDE'13. [PDF] [Presentation]

 

 

2012

 

    • Pei Li, Xin Luna Dong, Andrea MaurinoDivesh Srivastava: Linking temporal records. Frontiers of Computer Science 6(3): 293-312, 2012. [PDF]
    • Pei Li, Haidong Wang, Christina Tziviskou, Xin Luna Dong, Xiaoguang Liu, Andrea Maurino and Divesh Srivastava: CHRONOS: Facilitating History Discovery by Linking Temporal Records. Demo in VLDB, 2012. [PDF][Poster][Demo]
    • Xin Luna Dong and Divesh Srivastava. Large-Scale Copying Detection. Tutorial in ICDE, 2011. [PDF][Presentation]
    • Xin Luna Dong, Divesh Srivastava: Detecting Clones, Copying and Reuse on the Web. Tutorial in DASFAA, 2012. [PDF][Presentation]

 

 

2011

 

    • Anish Das Sarma, Xin Luna Dong, Alon Halevy. Data integration with dependent sources. In EDBT, 2011. [PDF][Presentation]
    • Xin Luna Dong and Divesh Srivastava. Large-Scale Copying Detection. Tutorial in Sigmod, 2011. [PDF][Presentation]
    • Su Chen, Xin Luna Dong, Laks V.S. Lakshmanan and Divesh Srivastava: We Challenge You to Certify Your Update. In Sigmod 2011. [PDF][Presentation][Poster]
    • Pei Li, Xin Luna Dong, Andrea Maurino, and Divesh Srivastava: Linking Temporal Records. In VLDB 2011. [PDF][Presentation]
    • Xuan Liu, Xin Luna Dong, Beng Chin Ooi, and Divesh Srivastava: Online data fusion. In VLDB, 2011. [PDF][Presentation]
       

2010

 

    • Xin Luna Dong, Laure Berti-Equille, Yifan Hu, and Divesh Srivastava. Solomon: Seeking the truth via copying detection. Demo in VLDB, 2010. [PDF][Poster][Demo]
    • Xin Luna Dong, Laure Berti-Equille, Yifan Hu, and Divesh Srivastava. Global detection of complex copying relationships between sources. In VLDB, 2010. [PDF][Full paper][Presentation]
    • Songtao Guo, Xin Luna Dong, Divesh Srivastava, and Remi Zajac. Record Linkage with Uniqueness Constraints and Erroneous Values. In VLDB, 2010. [PDF][Full paper][Presentation]
       

2009

 

    • Xin Dong, Alon Y. Halevy and Cong Yu: Data Integration with Uncertainties. VLDBJ 18(2):469-500, 2009. [PDF] (The original publication is available at here.)
    • Laure Berti-Equille, Anish Das Sarma, Xin Luna Dong, Amelie Marian, and Divesh Srivastava. Sailing the information ocean with awareness of currents: discovery and application of source dependence. In CIDR, 2009. [PDF][Presentation]
    • Daisy Zhe Wang, Xin Luna Dong, Anish Das Sarma, Michael Franklin, Alon Halevy. Functional Dependency Generation and Applications in Pay-As-You-Go Data Integration Systems. In WebDB, 2009. [PDF]
    • Xin Luna Dong, Laure Berti-Equille, and Divesh Srivastava. Integrating conflicting data: the role of source dependence. In VLDB, 2009. [PDF][Presentation][Full paper]
    • Xin Luna Dong, Laure Berti-Equille, and Divesh Srivastava. Truth discovery and copying detection in a dynamic world. In VLDB, 2009. [PDF][Presentation][Full paper]
    • Xin Luna Dong and Felix Naumann. Data fusion--Resolving data conflicts for integration. In VLDB, 2009. [PDF][Presentation]

 

2008

 

    • Anish Das Sarma, Xin Dong, and Alon Y. Halevy: Bootstrapping Pay-as-you-go Data Integration Systems. In SIGMOD, 2008. [PDF]

 

2007

 

    • Mike Cammarano, Xin Luna Dong, Bryan Chan, Jeff Klingner, Justin Talbot, Alon Y. Halevy, Pat Hanrahan: Visualization of Heterogeneous Data. IEEE Trans. Vis. Comput. Graph. 13(6): 1200-1207, 2007. [PDF]
    • Xin Dong, Alon Y. Halevy and Cong Yu: Data Integration with Uncertainties. In VLDB, 2007. [PDF][Presentation][DBClip][Full paper]
    • Xin Dong and Alon Y. Halevy: Indexing Dataspaces. In Sigmod, 2007. [PDF][Presentation]
    • Jayant Madhavan, Shawn Jeffery, Shirley Cohen, Xin Dong, Alon Y. Halevy, David Ko and Cong Yu: Web-Scale Data Integration: You can afford to Pay as You Go. In CIDR, 2007. [PDF]

 

2006

 

    • Jayant Madhavan, Shirley Cohen, Xin Dong, Alon Y. Halevy, Shawn Jeffery, David Ko and Cong Yu: Structured data meets the Web: A few observations. IEEE Data Eng. Bull. 29(4): 19-26, 2006. [PDF]
    • Jing Liu, Xin Dong and Alon Y. Halevy: Answering Structured Queries on Unstructured Data. In WebDB 2006. [PDF][Presentation]

 

2005

 

    • Xin Dong: A Platform for Personal Information Management and Integration. In VLDB 2005 PhD Workshop. [PDF]
    • Xin Dong, Alon Y. Halevy and Jayant Madhavan: Reference Reconciliation in Complex Information Spaces. In SIGMOD 2005. [PDF][Presentation]
    • Yuhan Cai, Xin Dong, Alon Y. Halevy, Jing Liu and Jayant Madhavan: Personal Information Management with SEMEX. SIGMOD DEMO 2005. (BEST DEMO, one of three top demos) [PDF][Presentation]
    • Xin Dong and Alon Y. Halevy: Malleable Schemas: A Preliminary Report. In WebDB 2005. [PDF][Presentation]
    • Xin Dong and Alon Y. Halevy: A Platform for Personal Information Management and Integration. In CIDR 2005. [PDF][Presentation]

 

2004

 

    • Xin Dong, Alon Y. Halevy and Jayant Madhavan: Mining structures for semantics. SIGKDD Explorations, 6(2):53-60, 2004. [PDF]
    • Xin Dong, Alon Y. Halevy, Jayant Madhavan, Ema Nemes and Jun Zhang: Similarity Search for Web Services. In VLDB 2004. [PDF][Presentation]
    • Xin Dong, Alon Y. Halevy and Igor Tatarinov: Containment of Nested XML Queries. In VLDB 2004. [PDF][Presentation][Full paper]
    • Xin Dong, Alon Y. Halevy, Ema Nemes, Stephan B. Sigundsson and Pedro Domingos: SEMEX: Toward On-the-fly Personal Information Integration. In IIWEB 2004. [PDF][Presentation]

 

2003

 

    • Igor Tatarinov, Zachary G. Ives, Jayant Madhavan, Alon Y. Halevy, Dan Suciu, Nilesh N. Dalvi, Xin Dong, Yana Kadiyska, Gerome Miklau and Peter Mork: The Piazza peer data management project. In SIGMOD Record 32(3): 47-52 (2003) [PDF]
    • Philip Bohannon, Xin Dong, Sumit Ganguly, Henry F. Korth, Chengkai Li, P. P. S. Narayan and Pradeep Shenoy: ROLEX: Relational On-Line Exchange with XML. In SIGMOD Conference 2003: 673. [PDF]

 

  • Book Chapters

 

    • Anish Das Sarma, Xin Dong, and Alon Halevy: Uncertainty in Data Integration and Dataspace Support Platforms.
      Book chapter in \93Advances in Schema Matching and Mapping\94, Springer, 2011.
    • Anish Das Sarma, Xin Dong, and Alon Halevy: Data Modeling in Dataspace Support Platforms. Book chapter in \93Conceptual Modeling: Foundations and Applications.\94 Springer, 2009.
    • Xin Dong and Divesh Srivastava: XML Indexing. In \93Encyclopedia of Database Systems.\94 Springer, to appear.
    • Anish Das Sarma, Xin Dong, and Alon Halevy: Uncertainty in Data Integration. In \93Managing and Mining Uncertain Data.\94 Springer, 2008.
    • Tiziana Catarci, Xin Dong, Alon Halevy and Antonella Poggio: Structure everything. In "Personal Information Management." University of Washington Press, Oct 2007.
    • Xin Dong: Chpt 1. The origin of XML. Chap 2. XML grammar. In "Step by Step to XML." Tsinghua Publication House, March 2001. (In Chinese)

 

  • Ph.D. Dissertation

 

    • Xin Dong: Providing best-effort services in Dataspace systems. Ph.D. Dissertation. Univ. of Washington, 2007. [PDF]