Categorizing Search Result Records Using Word Sense Disambiguation
Commenced in January 2007
Frequency: Monthly
Edition: International
Paper Count: 32797
Categorizing Search Result Records Using Word Sense Disambiguation

Authors: R. Babisaraswathi, N. Shanthi, S. S. Kiruthika

Abstract:

Web search engines are designed to retrieve and extract the information in the web databases and to return dynamic web pages. The Semantic Web is an extension of the current web in which it includes semantic content in web pages. The main goal of semantic web is to promote the quality of the current web by changing its contents into machine understandable form. Therefore, the milestone of semantic web is to have semantic level information in the web. Nowadays, people use different keyword- based search engines to find the relevant information they need from the web. But many of the words are polysemous. When these words are used to query a search engine, it displays the Search Result Records (SRRs) with different meanings. The SRRs with similar meanings are grouped together based on Word Sense Disambiguation (WSD). In addition to that semantic annotation is also performed to improve the efficiency of search result records. Semantic Annotation is the process of adding the semantic metadata to web resources. Thus the grouped SRRs are annotated and generate a summary which describes the information in SRRs. But the automatic semantic annotation is a significant challenge in the semantic web. Here ontology and knowledge based representation are used to annotate the web pages.

Keywords: Ontology, Semantic Web, WordNet, Word Sense Disambiguation.

Digital Object Identifier (DOI): doi.org/10.5281/zenodo.1096393

Procedia APA BibTeX Chicago EndNote Harvard JSON MLA RIS XML ISO 690 PDF Downloads 1712

References:


[1] Benjamin Donz., Dietmar Bruckner (2012), "External Semantic Annotation of Web-Databases”, IEEE Digital Library, pp 841-845.
[2] Fernando Gomez (2006), "Automatic semantic annotation of texts”, University of Central Florida, Orlando, FL 32816.
[3] Nadzeya Kiyavitskaya., Nicola Zeni., James R.Cordy., Luisa Mich and John Mylopoulos (2006), "Semi-Automatic Semantic Annotation for Web Documents”.
[4] Raquel Trillo., Laura Po.,Sergio Ilarri., Sonia Bergamaschi and Eduardo Mena (2011), "Using semantic techniques to access web data ”, Elsevier Information Systems, Vol.No.36, pp 117-133.
[5] Yiyao Lu., Hai He., Hongkun Zhao., Weiyi Meng and Clement Yu (2013), "Annotating search results from web databases ”, IEEE Transaction on Knowledge and Data Engineering, Vol.No.25, pp 514- 527.
[6] Raquel Trillo., Jorge Gracia., Mauricio Espinoza and Eduardo Mena (2007), ―Discovering the semantics of user keywords”, Journal on Universal Computer Science, Vol.13, No.12, pp 1908–1935.
[7] https://answers.yahoo.com/question/index?qid=20091014064541AAd1C Dt
[8] http://www.w3.org/RDF/Metalog/docs/sw-easy
[9] http://www.netcluesoft.com/downloading-webpages-and-html.html
[10] http://nlp.stanford.edu/IR-book/html/htmledition/results-snippets-1.html
[11] http://javarevisited.blogspot.in/2014/09/how-to-parse-html-file-in-javajsoup- example.html