Selection of Relevant Servers in Distributed Information Retrieval System
Authors: Benhamouda Sara, Guezouli Larbi
Abstract:
Nowadays, the dissemination of information touches the distributed world, where selecting the relevant servers to a user request is an important problem in distributed information retrieval. During the last decade, several research studies on this issue have been launched to find optimal solutions and many approaches of collection selection have been proposed. In this paper, we propose a new collection selection approach that takes into consideration the number of documents in a collection that contains terms of the query and the weights of those terms in these documents. We tested our method and our studies show that this technique can compete with other state-of-the-art algorithms that we choose to test the performance of our approach.
Keywords: Distributed information retrieval, relevance, server selection, collection selection.
Digital Object Identifier (DOI): doi.org/10.5281/zenodo.1123879
Procedia APA BibTeX Chicago EndNote Harvard JSON MLA RIS XML ISO 690 PDF Downloads 1376References:
[1] Allison L. Powell, and James C. French, "Comparing the Performance of Collection Selection Algorithms", In ACM Transactions on Information Systems (TOIS), Vol.21, No.4, 2003, pp. 412–456.
[2] Daryl D’Souza, Justin Zobel, and James A, "Is CORI Effective for Collection Selection an Exploration of Parameters, Queries, and Data", In Proceedings of the Australian Document Computing Symposium, Melbourne, Australia, December 2004, pp.41-46.
[3] Faïza Abbaci,"Méthodes de sélection de collections dans un environnement de recherche d'informations distribuée", Thesis, Neuchâtel University, 2003.
[4] Luis Gravano, Héctor Garcia-Molina and Anthony Tomasic, "GlOSS: Text-Source Discovery over the Internet", In Journal ACM Transactions on Database Systems (TODS), vol. 24, no. 2, Jun 1999, pp. 229-264.
[5] Nicholas Eric Craswell, "Methods for Distributed Information Retrieval", Thesis, Australian National University, 2000.
[6] Paul Thomas, and David Hawking, "Server selection methods in personal metasearch: a comparative empirical study", In Information Retrieval Jounal, Vol. 12, Issue 5, 2009, pp. 581-604.
[7] Sander Bockting, “Collection Selection for Distributed Web Search Using Highly Discriminative Keys, Query-driven Indexing and ColRank”, Thesis, University of Twente Enshed-The Netherlands, 2009.
[8] Umberto Straccia, and Raphaiel Troncy, "Towards Distributed Information Retrieval in the Semantic Web: Query Reformulation Using the oMAP Framework", The 3rd European conference on the Semantic Web: Research and Applications Lecture Notes in Computer Science, 2006, pp. 378-392.
[9] Fabio Crestani, Ilya Markov, "Distributed Information Retrieval and Applications", In the 35th European Conference on IR Research (ECIR 2013), Moscow, Russia, 2013, pp. 865-868.