XML Database Reading List


Introduction

  1. XML: Current Developments and Future Challenges for the Database Community, S. Ceri, P. Fraternali & S. Paraboschi, EDBT 2000

XML Standards

  1. Extensible Markup Language (XML) 1.0 (Second Edition), W3C Recommendation 6 October 2000, Tim Bray, Jean Paoli, C. M. Sperberg-McQueen, Eve Maler

  2. XSL Transformations (XSLT), Version 1.0, W3C Recommendation 16 November 1999, James Clark

  3. Document Object Model (DOM) Level 3 Core Specification, Version 1.0, W3C Working Draft 22 October 2002

  4. Official SAX (Simple API for XML) Specification (at the official SAX website)

XML Schema

  1. XML Schema Part 0: Primer, W3C Recommendation, 2 May 2001

  2. Comparative Analysis of Six XML Schema Languages, D. Lee, W. Chu, ACM SIGMOD Record, Vol. 29, No. 3, Sept. 2000

  3. XQuery 1.0 and XPath 2.0 Data Model, last release 15 November 2002

XML Query Languages

  1. XQuery 1.0: An XML Query Language, last release 15 November 2002

  2. XML Path Language (XPath) 2.0, last release 15 November 2002

XML-Enabled Relational Databases 

  1. Relational Databases for Querying XML Documents: Limitations and Opportunities. Jayavel Shanmugasundaram, Gang He, Kristin Tufte, Chun Zhang, David DeWitt, and Jeffrey F. Naughton. VLDB 1999.

  2. From XML schema to relations: a cost-based approach to XML storage, Bohannon, P.; Freire, J.; Roy, P.; Simeon, J., pp. 64-75, ICDE 2002

  3. Storing and Querying Ordered XML Using a Relational Database System, Igor Tatarinov, Stratis Viglas, Kevin Beyer, Jayavel Shanmugasundaram, Eugene Shekita, Chun Zhang, SIGMOD 2002

  4. RRXS: Redundancy reducing XML storage in relations, Yi Chen, Susan Davidson (University of Pennsylvania, USA), Carmem Hara (Universidade Federal do Parana, Brazil), Yifeng Zheng (University of Pennsylvania, USA), VLDB 2003

  5. A Comprehensive XQuery to SQL Translation using Dynamic Interval Encoding, David DeHaan, David Toman, Mariano P. Consens, M. Tamer Özsu, SIGMOD 2003

Native XML Database

  1. Timber: A native XML database, H. V. Jagadish, Shurug Al-Khalifa, Laks Lakshmanan, Andrew Nierman, Stylianos Paparizos, Jignesh Patel, Divesh Srivastava, and Yuqing Wu. Technical report, University of Michigan, April 2002.

  2. TAX: A Tree Algebra for XML, H. V. Jagadish, Laks V. S. Lakshmanan, Divesh Srivastava, Keith Thompson, 8th International Workshop on Database Programming Languages, DBPL 2001, pp. 149-164

  3. A Succinct Physical Storage Scheme for Efficient Evaluation of Path Queries in XML, Ning Zhang, Varun Kacholia and M. Tamer Özsu, ICDE 2004

Structural Joins and Node Indexes

  1. On Supporting Containment Queries in Relational Database Management Systems, Chun Zhang (Univ. of Wisconsin), Jeffrey Naughton (Univ. of Wisconsin), David DeWitt (Univ. of Wisconsin), Qiong Luo (Univ. of Wisconsin), and Guy Lohman (IBM Almaden), SIGMOD 2001

  2. Indexing and Querying XML Data for Regular Path Expressions, Quanzhong Li, Bongki Moon, pp. 361-370, VLDB 2001

  3. Structural joins: a primitive for efficient XML query pattern matching, Al-Khalifa, S.; Jagadish, H.V.; Koudas, N.; Patel, J.M.; Srivastava, D.; Yuqing Wu, pp. 141-152, ICDE 2002

  4. Holistic Twig Joins: Optimal XML Pattern Matching, Nicolas Bruno, Divesh Srivastava, Nick Koudas, SIGMOD 2002

  5. Efficient Structural Joins on Indexed XML Documents, Shu-Yao Chien (NCR/Teradata Division, U.S.A.), Zografoula Vagena, Donghui Zhang, Vassilis J. Tsotras (University of California - Riverside, U.S.A.), Carlo Zaniolo (UCLA, U.S.A.), VLDB 2002

  6. XR-Tree: Indexing XML Data for Efficient Structural Joins, Haifeng Jiang, Hongjun Lu, Wei Wang, Beng Chin Ooi, ICDE 2003

  7. Approximate XML Joins, Sudipto Guha, H.V. Jagadish, Nick Koudas, Divesh Srivastava, Ting Yu, SIGMOD 2002

  8. Structural Join Order Selection for XML Query Optimization, Y. Wu, J. Patel, H.V. Jagadish, ICDE 2003

  9. Holistic Twig Joins on Indexed XML Documents, Haifeng Jiang, Wei Wang, Hongjun Lu (The Hong Kong Univ. of Science and Technology, China), Jeffrey Xu Yu (The Chinese Univ. of Hong Kong, China), VLDB 2003

Indexes for Path Expressions

  1. DataGuides: Enabling Query Formulation and Optimization in Semistructured Databases, Roy Goldman, Jennifer Widom, pp. 436-445, VLDB 1997

  2. Index Structures for Path Expressions, Tova Milo and Dan Suciu, ICDT, pp. 277-295, 1999

  3. A Fast Index for Semistructured Data, Brian Cooper, Neal Sample, Michael J. Franklin, Gisli R. Hjaltason, Moshe Shadmon, pp. 341-350, VLDB 2001

  4. Exploiting local similarity for indexing paths in graph-structured data, Kaushik, R.; Shenoy, P.; Bohannon, P.; Gudes, E., pp. 129-140, ICDE 2002

  5. Covering Indexes for Branching Path Queries, Raghav Kaushik, Philip Bohannon, Jeffrey Naughton, Henry Korth, SIGMOD 2002

  6. APEX: An Adaptive Path Index for XML data, Chin-Wan Chung, Jun-Ki Min, Kyuseok Shim, SIGMOD 2002

  7. Efficient Algorithms for Processing XPath Queries, Georg Gottlob, Christoph Koch, Reinhard Pichler (Technische Universität Wien, Austria), VLDB 2002

  8. Updates for Structure Indexes, Raghav Kaushik (University of Wisconsin - Madison, U.S.A.), Philip Bohannon (Lucent Technologies - Bell Labs, U.S.A.), Jeffrey F. Naughton (University of Wisconsin - Madison, U.S.A.), Pradeep Shenoy (University of Washington - Seattle, U.S.A.), VLDB 2002

  9. Covering Indexes for XML Queries: Bisimulation - Simulation = Negation, Prakash Ramanan (Wichita State University, USA), VLDB 2003

  10. D(K)-Index: An Adaptive Structural Summary for Graph-Structured Data, Qun Chen, Andrew Lim, Kian Win Ong, SIGMOD 2003

XML Query Optimization

  1. Minimization of Tree Pattern Queries, Sihem Amer-Yahia, SungRan Cho, Laks V. S. Lakshmanan, Divesh Srivastava. SIGMOD Conference 2001

  2. Efficient Algorithms for Minimizing Tree Pattern Queries, Prakash Ramanan, SIGMOD 2002

  3. On the minimization of XPath queries, Sergio Flesca, Filippo Furfaro (Università della Calabria, Italy), Elio Masciari (ICAR - CNR, Italy), VLDB 2003

General XML Query Evaluation

  1. From Tree Patterns to Generalized Tree Patterns: On Efficient Evaluation of XQuery, Zhimin Chen (Univ. of British Columbia, Canada), H.V. Jagadish (Univ. of Michigan, USA), Laks V. S. Lakshmanan (Univ. of British Columbia, Canada), Stelios Paparizos (Univ. of Michigan, USA), VLDB 2003

  2. Mixed Mode XML Query Processing, Alan Halverson, Josef Burger, Leonidas Galanis, Ameet Kini, Rajasekar Krishnamurthy, Ajith Nagaraja Rao, Feng Tian, Stratis D. Viglas, Yuan Wang, Jeffrey F. Naughton, David J. DeWitt (University of Wisconsin-Madison, USA), VLDB 2003

  3. ViST: A Dynamic Index Method for Querying XML Data by Tree Structures, Haixun Wang, Sanghyun Park, Wei Fan, Philip Yu, SIGMOD 2003

  4. Projecting XML Documents, Amélie Marian (Columbia University, USA), Jérôme Siméon (Bell Laboratories, USA), VLDB 2003

  5. Efficient Processing of Expressive Node-Selecting Queries on XML Data in Secondary Storage: A Tree Automata-based Approach, Christoph Koch (University of Edinburgh, UK), VLDB 2003

XML Data Statistics

  1. Estimating the Selectivity of XML Path Expressions for Internet Scale Applications, Ashraf Aboulnaga, Alaa R. Alameldeen, Jeffrey F. Naughton, pp. 591-600, VLDB 2001

  2. XPathLearner: An On-line Self-Tuning Markov Histogram for XML Path Selectivity Estimation, Lipyeow Lim (Duke University, U.S.A.), Min Wang, Sriram Padmanabhan (IBM T.J. Watson Research Center, U.S.A.), Jeffrey Scott Vitter, Ronald Parr (Duke University, U.S.A.), VLDB 2002

  3. Structure and Value Synopses for XML Data Graphs, Neoklis Polyzotis (University of Wisconsin - Madison, U.S.A.), Minos Garofalakis (Lucent Technologies - Bell Labs, U.S.A.), VLDB 2002

  4. Statistical Synopses for Graph-Structured XML Databases, Neoklis Polyzotis, Minos Garofalakis, SIGMOD 2002

  5. Estimating Answer Sizes for XML Queries, Yuqing Wu, Jignesh M. Patel, H. V. Jagadish, EDBT 2002, pp. 590-608

Continuous Query and XML Stream Processing

  1. NiagaraCQ: A Scalable Continuous Query System for Internet Databases, Jianjun Chen, David J. DeWitt, Feng Tian, Yuan Wang, SIGMOD 2000

  2. On Efficient Matching of Streaming XML Documents and Queries, Laks V. S. Lakshmanan, Sailaja Parthasarathy, EDBT 2002, pp. 142-160

  3. Navigation- vs. Index-Based XML Multi-Query Processing, N. Bruno, L. Gravano, N. Koudas, and D. Srivastava, Proc. of the 19th IEEE International Conference on Data Engineering (ICDE 2003)

  4. Query Processing for High-Volume XML Message Brokering, Yanlei Diao, Michael Franklin (University of California, Berkeley, USA), VLDB 2003

  5. Stream Processing of XPath Queries with Predicates, Ashish Kumar Gupta, Dan Suciu, SIGMOD 2003

  6. XPath Queries on Streaming Data, Feng Peng, Sudarshan S. Chawathe, SIGMOD 2003

  7. A Transducer-Based XML Query Processor, Bertram Ludaescher, Pratik Mukhopadhyay, Yannis Papakonstantinou (University of California - San Diego, U.S.A.), VLDB 2002

  8. RE-Tree: An Efficient Index Structure for Regular Expressions, Chee-Yong Chan, Minos Garofalakis, Rajeev Rastogi (Lucent Technologies - Bell Labs, U.S.A.), VLDB 2002

  9. Efficient Filtering of XML Documents for Selective Dissemination of Information, Altinel, M. and Franklin, M.J., VLDB 2000

XML Benchmarking

  1. XMark: A Benchmark for XML Data Management, A. R. Schmidt, F. Waas, M. L. Kersten, M. J. Carey, I. Manolescu, R. Busse, VLDB 2002

XML Data Update

  1. Updating XML, Igor Tatarinov, Zachary G. Ives, Alon Y. Halevy, Daniel S. Weld, SIGMOD 2001

  2. Change-Centric Management of Versions in an XML Warehouse, Amélie Marian, Serge Abiteboul, Gregory Cobena, Laurent Mignet, pp. 581-590, VLDB 2001

  3. Detecting changes in XML documents, Cobena, G.; Abiteboul, S.; Marian, A., pp. 41-52, ICDE 2002

  4. X-Diff: An Efficient Change Detection Algorithm for XML Documents, Y. Wang, D. DeWitt, J-Y Cai, ICDE 2003

XML Document Retrieval

  1. Phrase Matching in XML, Sihem Amer-Yahia, Mary Fernández, Divesh Srivastava (AT&T Labs-Research, USA), Yu Xu (Univ. of California, San Diego, USA), VLDB 2003

  2. Querying Structured Text in an XML Database, Shurug Al-Khalifa, Cong Yu, H. V. Jagadish, SIGMOD 2003

  3. XRANK: Ranked Keyword Search over XML Documents, Lin Guo, Feng Shao, Chavdar Botev, Jayavel Shanmugasundaram, SIGMOD 2003

Mining XML Data

  1. Efficient Mining of XML Query Patterns for Caching, Liang Huai Yang, Mong Li Lee, Wynne Hsu, VLDB 2003

XML Publishing

  1. Querying XML Views of Relational Data, Jayavel Shanmugasundaram, Jerry Kiernan, Eugene J. Shekita, Catalina Fan, John Funderburk, pp. 261-270, VLDB 2001

  2. Views in a Large Scale XML Repository, Sophie Cluet, Pierangelo Veltri, Dan Vodislav, pp. 271-280, VLDB 2001

  3. On Relational Support for XML Publishing: Beyond Sorting and Tagging, Surajit Chaudhuri, Raghav Kaushik, Jeffrey F. Naughton, SIGMOD 2003

  4. Composing XSL Transformations with XML Publishing Views, Chengkai Li, Philip Bohannon, Henry F. Korth, P.P.S. Narayan, SIGMOD 2003

  5. MARS: A System for Publishing XML from Mixed and Redundant Storage, Alin Deutsch (University of California, San Diego, USA), Val Tannen (University of Pennsylvania, USA), VLDB 2003

XML and Semantic Web

  1. The Semantic Web: The Roles of XML and RDF, Stefan Decker, Sergey Melnik, Frank Van Harmelen, Dieter Fensel, Michel Klein, Jeen Broekstra, Michael Erdmann, Ian Horrocks, IEEE Internet Computing, Vol. 4, No. 5, pp. 63-74, 2000.

  2. The Semantic Web, Tim Berners-Lee, James Hendler and Ora Lassila, Scientific American, May 2001.

  3. Dynamic XML Documents with Distribution and Replication, Serge Abiteboul, Angela Bonifati, Grégory Cobéna, Ioana Manolescu, Tova Milo, SIGMOD 2003