REALIZATION OF A SYSTEM OF EFFICIENT QUERYING OF HIERARCHICAL DATA TRANSFORMED INTO A QUASI-RELATIONAL MODEL
DOI:
https://doi.org/10.7494/csci.2016.17.3.353Keywords:
XML, SQL, hierarchical data, relational modelAbstract
Extensible Markup Language was mainly designed to easily represent documents; however, it has evolved and is now widely used for the representation of arbitrary data structures. There are many Application Programming Interfaces (APIs) to aid software developers with processing XML data. There are also many languages for querying and transforming XML, such as XPath or XQuery, which are widely used in this field. However, because of the great flexibility of XML documents, there are no unified data storing and processing standards, tools, or systems.On the other hand, a relational model is still the most-commonly and widely used standard for storing and querying data. Many Database Management Systems consist of components for loading and transforming hierarchical data. DB2 pureXML or Oracle SQLX are some of the most-recognized examples. Unfortunately, all of them require knowledge of additional tools, standards, and languages dedicated to accessing hierarchical data (for example, XPath or XQuery). Transforming XML documents into a (quasi)relational model and then querying (transformed) documents with SQL or SQL–like queries would significantly simplify the development of data-oriented systems and applications.
In this paper, an implementation of the SQLxD query system is proposed. The XML documents are converted into a quasi-relational model (preserving their hierarchical structure), and the SQL–like language based on SQL-92 allows for efficient data querying.
Downloads
References
Amer-Yahia S., Du F., Freire J.: A Comprehensive Solution to the XML-torelational Mapping Problem. In: Proceedings of the 6th Annual ACM International Workshop on Web Information and Data Management, WIDM ’04, pp. 31–38. ACM, New York, NY, USA, 2004. ISBN 1-58113-978-0. URL http://dx.doi.org/10.1145/1031453.1031461.
Barbosa D., Freire J., Mendelzon A.O.: Designing Information-preserving Mapping Schemes for XML. In: Proceedings of the 31st International Conference on Very Large Data Bases, VLDB ’05, pp. 109–120. VLDB Endowment, 2005. ISBN 1-59593-154-6. URL http://dl.acm.org/citation.cfm?id=1083592.1083608.
IronPython. http://ironpython.net. Accessed: 2014-05-23.
Khan L., Rao Y.: A Performance Evaluation of Storing XML Data in Relational Database Management Systems. In: Proceedings of the 3rd International Workshop on Web Information and Data Management, WIDM ’01, pp. 31–38. ACM, New York, NY, USA, 2001. ISBN 1-58113-444-4. URL http://dx.doi.org/10.1145/502932.502939.
Marcjan R., Siwik L.: The Concept of Transformation of XML Documents into Quasi-Relational Model. In: S. Kozielski, D. Mrozek, P. Kasprowski, B. Malysiak-Mrozek, D. Kostrzewa, eds., Beyond Databases, Architectures, and Structures, Communications in Computer and Information Science, vol. 424, pp. 569–580. Springer International Publishing, 2014. ISBN 978-3-319-06931-9. URL http://dx.doi.org/10.1007/978-3-319-06932-6_55.
Marcjan R., Wyrostek J.: Processing XML documents on the basis of quasirelational model and SQLxD language. In: Studia Informatica, ISSN 0208-7286, vol. Vol. 32(2A), pp. 111—120, 2011.
LIKE (Transact-SQL). http://msdn.microsoft.com/en-US/en-EN/library/ms179859(v=sql.105).aspx. Accessed: 2014-05-23.
PLY (Python Lex-Yacc). http://www.dabeaz.com/ply/. Accessed: 2014-05-23.
DB2 pureXML. http://www-01.ibm.com/software/data/db2/linux-unix-windows/xml/index.html/. Accessed: 2014-05-23.
SQL-92. http://www.contrib.andrew.cmu.edu/~shadow/sql/sql1992.txt. Accessed: 2014-05-23.
BNF Grammar for ISO/IEC 9075:1992 - Database Language SQL (SQL-92). http://savage.net.au/SQL/sql-92.bnf.html. Accessed: 2014-05-23.
SQLX. http://docs.oracle.com/cd/B10501_01/appdev.920/a96616/arxml34.htm. Accessed: 2014-05-23.
XPath Reference. http://www.w3schools.com/XPath. Accessed: 2014-05-23.
XQuery Reference. http://www.w3schools.com/xQuery/default.asp/. Accessed: 2014-05-23.