Commenced in January 2007
Frequency: Monthly
Edition: International
Paper Count: 30184
A Materialized Approach to the Integration of XML Documents: the OSIX System

Authors: H. Ahmad, S. Kermanshahani, A. Simonet, M. Simonet


The data exchanged on the Web are of different nature from those treated by the classical database management systems; these data are called semi-structured data since they do not have a regular and static structure like data found in a relational database; their schema is dynamic and may contain missing data or types. Therefore, the needs for developing further techniques and algorithms to exploit and integrate such data, and extract relevant information for the user have been raised. In this paper we present the system OSIX (Osiris based System for Integration of XML Sources). This system has a Data Warehouse model designed for the integration of semi-structured data and more precisely for the integration of XML documents. The architecture of OSIX relies on the Osiris system, a DL-based model designed for the representation and management of databases and knowledge bases. Osiris is a viewbased data model whose indexing system supports semantic query optimization. We show that the problem of query processing on a XML source is optimized by the indexing approach proposed by Osiris.

Keywords: Data integration, semi-structured data, views, XML.

Digital Object Identifier (DOI):

Procedia APA BibTeX Chicago EndNote Harvard JSON MLA RIS XML ISO 690 PDF Downloads 1246


[1] S. Abiteboul, S. Cluet , G. Ferran and M-C. Rousset: "The Xyleme Project". Gemo Repot 248, INRIA, 2001.
[2] H.Ahmad, S. Kermanshahani, A. Simonet and M. Simonet: "A View- Based Approach to the Integration of Structured and Semi-structured Data", IEEE International Baltic Conference on Databases and Information Systems-Communication of Baltic DBIS , 2006.
[3] H.Ahmad, S. Kermanshahani, A. Simonet and M. Simonet: "Data Warehouse based Approach to the Integration of Semi-structured Data", WCMT The 1st International Workshop on Web-based Contents Management Technologies, Suzhou, China 2009
[4] X. Baril: "Un modèle de vues pour l-intégration de sources de données XML: VIMIX ". PHD thesis, Languedoc University of Science and Techniques, 2003.
[5] C. Bornhovd: "MIX - A Representation Model for the Integration of Web- Based Data". Technical report, Dep.CS, Darmstadt University of Technology, Germany, 1998.
[6] M. Cannataro, S. Cluet, G. Tradigo, P. Veltri and D. Vodislav:" Using views to query XML. In Encyclopedia of Database Technologies and Applications", pp.729-735 , 2005.
[7] H. Garcia-Molina: "The TSIMMIS approach to mediation: Data Models and Languages". Journal of Intelligent Information Systems. 8(2) pp 117-132, 1997.
[8] A. Halevy: "Answering queries using views: A survey". The VLBD Journal, 10(4), 270-294. 2001.
[9] S. Kermanshahani: "Semi-Materialized Framework: a Hybrid Approach to Data Integration", CSTST Student Workshop, Paris, October 2008.
[10] I. Manolescu, D. Florescu and D. Kossman: "Answering XML Queries Over Heterogeneous Data Sources". In proceedings of the 27 th International Conference on VLDB, 2001.
[11] M. Roger, A. Simonet, M. Simonet, "Bringing Together Description Logics and Databases in an Object-Oriented Model", DEXA 2002, Database and Expert System Applications, Toulouse, Sept. 2002.
[12] M. H. Scholl, C. Laasch, M. Tresch, "Updatable Views in Object- Oriented Databases", Proc. 2nd DOOD conf., pp 187-198, Dec. 1991.
[13] I. Sebi : "Interrogation de Documents XML à Travers des Vues". PhD thesis, EDITE, CEDRIS Laboratory, 2007.
[14] A. Simonet, M. Simonet, "Classement d-instance et Evaluation des Requ├¬tes en Osiris", in BDA-96 : Bases de Données Avancées, Cassis, France, pp 273-288, Aug. 1996.
[15] D. Stanat, D. McAllister: "Discrete Mathematics in Computer Science", Prentice Hall, 1977.
[16] G. Wiederhold. "Mediators in the architecture of future information systems". IEEE Computer Magazine, 25(3), 38-49, 1992.
[17] M.-C. Wu, A. P. Buchmann. "Research issues in data warehousing". In Datebanksysteme in Buro, Technik and Wissenschaft, pp. 61-82, 1997.