Commenced in January 2007
Frequency: Monthly
Edition: International
Paper Count: 1

categorisation Related Publications

1 Interactive, Topic-Oriented Search Support by a Centroid-Based Text Categorisation

Authors: Herwig Unger, Mario Kubek

Abstract:

Centroid terms are single words that semantically and topically characterise text documents and so may serve as their very compact representation in automatic text processing. In the present paper, centroids are used to measure the relevance of text documents with respect to a given search query. Thus, a new graphbased paradigm for searching texts in large corpora is proposed and evaluated against keyword-based methods. The first, promising experimental results demonstrate the usefulness of the centroid-based search procedure. It is shown that especially the routing of search queries in interactive and decentralised search systems can be greatly improved by applying this approach. A detailed discussion on further fields of its application completes this contribution.

Keywords: query, centroid, keyword, search algorithm, categorisation, cooccurrence

Procedia APA BibTeX Chicago EndNote Harvard JSON MLA RIS XML ISO 690 PDF Downloads 217