{"title":"Dimensional Modeling of HIV Data Using Open Source","authors":"Charles D. Otine, Samuel B. Kucel, Lena Trojer","volume":39,"journal":"International Journal of Computer and Systems Engineering","pagesStart":95,"pagesEnd":101,"ISSN":"1307-6892","URL":"https:\/\/publications.waset.org\/pdf\/8994","abstract":"Selecting the data modeling technique for an\r\ninformation system is determined by the objective of the resultant\r\ndata model. Dimensional modeling is the preferred modeling\r\ntechnique for data destined for data warehouses and data mining,\r\npresenting data models that ease analysis and queries which are in\r\ncontrast with entity relationship modeling. The establishment of data\r\nwarehouses as components of information system landscapes in\r\nmany organizations has subsequently led to the development of\r\ndimensional modeling. This has been significantly more developed\r\nand reported for the commercial database management systems as\r\ncompared to the open sources thereby making it less affordable for\r\nthose in resource constrained settings. This paper presents\r\ndimensional modeling of HIV patient information using open source\r\nmodeling tools. It aims to take advantage of the fact that the most\r\naffected regions by the HIV virus are also heavily resource\r\nconstrained (sub-Saharan Africa) whereas having large quantities of\r\nHIV data. Two HIV data source systems were studied to identify\r\nappropriate dimensions and facts these were then modeled using two\r\nopen source dimensional modeling tools. Use of open source would\r\nreduce the software costs for dimensional modeling and in turn make\r\ndata warehousing and data mining more feasible even for those in\r\nresource constrained settings but with data available.","references":"[1] Chen, P. (1976). The Entity Relationship model-Towards a unified view\r\nof data, ACM Transactions on Database Systems, 1, 1, 9-36.\r\n[2] Chilton, M.A. (2006). Data Modeling Education: The changing\r\ntechnology, Journal of Information Systems Educaion, 17,1, 17-20.\r\n[3] Coar, K. (2006). The Open source Definition , Retrieved on 18th Nov\r\n2008 from opensource.org: http:\/\/www.opensource.org\/docs\/osd\r\n[4] Dash, A.K and Agarwal, R. (2001). Dimensional modeling for Data\r\nwarehouse, ACM SIGSOFT software engineering notes, 26, 1, 83-84.\r\n[5] Golfarelli, M., Maio, D. and Rizzi, S. (1998). Conceptual Design of Data\r\nwarehouses from E-R schemes, Proceedings of the Hawaii International\r\nConference On System Sciences, January 6-9, Hawaii\r\n[6] Gui, Y., Tang, S., Tong, Y. and Yang,D. (2006). Tripple Driven Data\r\nModeling Methodology in Data warehousing: A case study, ACM\r\nworkshop on Data warehousing and OLAP, 59-66\r\n[7] Ilczuk, G. and Wakulicz-Deja, A. (2007). Selection of Important\r\nattributes for Medical Diagnosis Systems. Transactions on Rough Sets ,\r\n7,1, 70-84.\r\n[8] Jones, M. E. and Song, I.Y. (2008). Dimensional modeling:\r\nIdentification, classification and evaluation of patterns. Decision\r\nSupport Systems , 59-76.\r\n[9] Kleijen, J. P. (1995). Verification and validation of simulation models.\r\nEuropean Journal of Operations Research , 82,1, 145-162.\r\n[10] Kortinik, M. A. and Moody, D. L. (2003). From ER Models to\r\nDimensional Models: Bridging the Gap between OLTP and OLAP\r\nDesign. Business Intelligence Journal , 8,3, 1-17.\r\n[11] Laender H. F., Freitas, G.M., and Campos, M.L. (2002). MD2- Getting\r\nUsers Involved in the Development of Data Warehouse Applications.\r\n4th International Conference Workshop Design and Management of\r\nData warehouses. May 27, Toronto, University of British Columbia, 3-\r\n12.\r\n[12] Lambert, B. (1995). Break Old Habits To Define Data Warehousing\r\nRequirements. Data Management Review .\r\n[13] Malinowski, E. and Zimanyi, E. (2007). A conceptual model for\r\ntemporal data warehouses and its transformation to the the ER and\r\nobject-relational model. Data and Knowledge Engineering ,64, 101-133.\r\n[14] Martyn, T. (2004). Reconsidering Multi-Dimensional Schemas. ACMs\r\nSpecial Interest Group On Management of Data , 33,1, 83-88.\r\n[15] Nguyen, T. M., Tjoa, A. M., and Trujillo, J. (2005). Data Warehousing\r\nand Knowledge Discovery: A Chronological View of Research\r\nChallenges. Springer , 530-535.\r\n[16] Pearson, W. (2008, 1 24). Dimensional Model components: Dimensions\r\npart 1. Retrieved 11 19, 2008, from Database Journal:\r\nhttp:\/\/www.databasejournal.com\/features\/mssql\/article.php\/3723311\/Di\r\nmensional-Model-Components--Dimensions-Part-I.htm\r\n[17] Phipps, C. and Davis, K.C. (2003). Automating Data warehouse\r\nconceptual Schema Design and Evaluation. Proceedings of the 4th\r\ninternational conference on Design and Management of Data\r\nwarehouses. May 27, Toronto Canada, 23-32\r\n[18] Pokorny, J. (2003). Modeling stars using XML.\r\n[19] Riadh, B. M., Omar, B., & Sabine, R. (2004). A new OLAP Aggregation\r\nBased on the AHC Technique. DOLAP (pp. 65-71). Washington,DC:\r\nACM.\r\n[20] UNAIDS. (2008). 2008 Report on the Global AIDS epidemic. Geneva:\r\nWHO Library Cataloguing-in-Publication Data.","publisher":"World Academy of Science, Engineering and Technology","index":"Open Science Index 39, 2010"}