Determining Senses for Word Sense Disambiguation in Turkish
Commenced in January 2007
Frequency: Monthly
Edition: International
Paper Count: 32797
Determining Senses for Word Sense Disambiguation in Turkish

Authors: Zeynep Orhan, Zeynep Altan

Abstract:

Word sense disambiguation is an important intermediate stage for many natural language processing applications. The senses of an ambiguous word are the classification of usages for that specific word. This paper deals with the methodologies of determining the senses for a given word if they can not be obtained from an already available resource like WordNet. We offer a method that helps us to determine the sense boundaries gradually. In this method, first we decide on some features that are thought to be effective on the senses and divide the instances first into two, then according to the results of evaluations we continue dividing instances gradually. In a second method we use the pseudo words. We devise artificial words depending on some criteria and evaluate classification algorithms on these previously classified words.

Keywords: Word sense disambiguation, sense determination, pseudo words, sense granularity.

Digital Object Identifier (DOI): doi.org/10.5281/zenodo.1082297

Procedia APA BibTeX Chicago EndNote Harvard JSON MLA RIS XML ISO 690 PDF Downloads 1362

References:


[1] Barwise, J., and PERRY., J, 1983. Situations and Attitudes. Cambridge, MA: MIT Press.
[2] Bloomfield, L., 1933. Language. New York: Henry Holt.
[3] Gaustad., T., 2001. Statistical Corpus-Based Word Sense Disambiguation: Pseudo words vs. Real Ambiguous Words., Proc. 39th Annual Meeting of ACL (ACL/EACL 2001) - Student Research Workshop.
[4] IDE, N., and Veronis, J., 1998, Introduction to the special issue on word sense disambiguation: The State of the Art, Computational Linguistics, 24(1), 1-40.
[5] Manning, C., Sch├╝tze, H., 1999. Foundations of Statistical Natural Language Processing, MIT Press. Cambridge, MA.
[6] METU Turkish Treebank, http://www.ii.metu.edu.tr/~corpus/indextr.html
[7] MILLER, G., March 22, 2001, Ambiguous Words, iMP Magazine: http://www.cisp.org/imp/march_2001/miller/03_01miller.htm
[8] Nakov, P., and Hearst, M., 2003, Category-based Pseudo words, in the Companion Volume of the Proceedings of HLT-NAACL'03, pp. 67-69, Edmonton, Canada, May 2003.
[9] Oflazer, K. Say, B., Hakkani-Tur, D.Z., Tur, G., 2003 Building a Turkish Treebank, Invited Chapter in Building and Exploiting Syntactically-Annotated Corpora, Anne Abeille Editor, Kluwer Academic Publishers.
[10] Orhan, Z., Altan, Z., 2003, "Disambiguation of Turkish Word Senses By Supervised Statistical Methods", International XII. Turkish Symposium on Artificial Intelligence and Neural Networks (TAINN 2003).
[11] Senseval, Evaluation exercises for Word Sense Disambiguation Organized by ACL-SIGLEX, http://www.senseval.org
[12] TDK Turkish Dictionary, http://tdk.org.tr/tdksozluk
[13] Tu─ƒlaci, P., 1995, Okyanus Ansiklopedik T├╝rk├ºe Sözl├╝k, ABC Kitabevi Yay─▒n ve da─ƒ─▒t─▒m A┼×, ─░stanbul.
[14] Weaver, W., 1949. Translation. Mimeographed, 12 pp., Reprinted in Locke, William N. and Booth, A. Donald (1955) (Eds.), Machine translation of languages. John Wiley & Sons, New York, 15-23.
[15] WEKA: Data Mining Software in Java, http://www.cs.waikato.ac.nz/~ml/weka/index.html
[16] Wittgenstein L 1953, Philosophical Investigations translated by G E M Anscombe, Oxford: Basil Blackwell.