{"title":"Effective Features for Disambiguation of Turkish Verbs","authors":"Zeynep Orhan, Zeynep Altan","volume":7,"journal":"International Journal of Computer and Information Engineering","pagesStart":2264,"pagesEnd":2269,"ISSN":"1307-6892","URL":"https:\/\/publications.waset.org\/pdf\/13001","abstract":"
This paper summarizes the results of some experiments for finding the effective features for disambiguation of Turkish verbs. Word sense disambiguation is a current area of investigation in which verbs have the dominant role. Generally verbs have more senses than the other types of words in the average and detecting these features for verbs may lead to some improvements for other word types. In this paper we have considered only the syntactical features that can be obtained from the corpus and tested by using some famous machine learning algorithms.<\/p>\r\n","references":"[1] Saussure, Ferdinand de. 1974 (1916). Course in General Linguistics. Tr.\r\nWade Baskin. Glasgow: Fontana & Collins. (Orig.: Cours de\r\nlinguistique g\u00e9n\u00e9rale.Lousanne et Paris: Payot.)\r\n[2] Canfield J.V. (Editor), 1997, Philosophy of Meaning, Knowledge and\r\nValue in the 20th Century: Routledge History of Philosophy Volume 10.\r\nBritish Library Cataloguing in Publication data.\r\n[3] Ng, H.T., and Lee, H.B., 1996. Integrating multiple knowledge sources\r\nto disambiguate word sense: An exemplar-based approach. In\r\nProceedings of the 34th Annual Meeting of the Association for\r\nComputational Linguistics (ACL-96), Santa Cruz.\r\n[4] SENSEVAL: An Exercise in Evaluating Word Sense Disambiguation\r\nPrograms A. Kilgarriff. In Proc. LREC, Granada, May 1998. Pp 581--\r\n588.\r\n[5] Schutze, H., and Pedersen, J. 1995. Information Retrieval Based on\r\nWord Senses. In Proceedings of the Fourth Annual Symposium on\r\nDocument Analysis and Information Retrieval, 161-175. Las Vegas,\r\nNev.: University of Nevada at Las Vegas.\r\n[6] Ido Dagan , Alon Itai, Word sense disambiguation using a second\r\nlanguage monolingual corpus, Computational Linguistics, v.20 n.4,\r\np.563-596, December 1994.\r\n[7] R. Bruce and J. Wiebe. 1999. Decomposable modeling in natural\r\nlanguage processing. Computational Linguistics, 25(2):195-207.\r\n[8] Pedersen, T., 2001. A decision tree of bigrams is an accurate predictor of\r\nword sense. In Proceedings of the North American Chapter of the\r\nAssociation for Computational Linguistics, NAACL 2001, pages 79-86,\r\nPittsburg.\r\n[9] Fellbaum, C., Palmer, M., Dang, H.T., Delfs, L., and Wolf., S., 2001.\r\nManual and automatic semantic annotation with WordNet. In WordNet\r\nand Other lexical resources: NAACL 2001 workshop, pages 3-10,\r\nPittsburgh.\r\n[10] Ng, H. T., Zelle, J., Winter, 1997, Corpus-based approaches to semantic\r\ninterpretation in natural language processing - Natural Language\r\nProcessing, AI Magazine.\r\n[11] Kelly, E. and Stone, P. (1975) Computer Recognition of English Word\r\nSenses, North Holland, Amsterdam.\r\n[12] Yarowsky, D. 1993. One Sense per Collocation. In Proceedings of the\r\nARPA Human-Language Technology Workshop, 266-271. Washington,\r\nD.C.: Advanced Research Projects Agency.\r\n[13] Yarowsky, D. 1994. Decision Lists for Lexical Ambiguity Resolution:\r\nApplication to Accent Restoration in Spanish and French. In\r\nProceedings of the Thirty-Second Annual Meeting of the Association for\r\nComputational Linguistics, 88-95. Somerset, N.J.: Association for\r\nComputational Linguistics.\r\n[14] Bruce, R. and J. Wiebe. 1994. Word-sense disambiguation using\r\ndecomposable models. In Proceedings of the 32nd Annual Meeting of\r\nthe Association for Computational Linguistics, pages 139-- 146.\r\n[15] Rada Mihalcea, August 2002, Instance Based Learning with Automatic\r\nFeature Selection Applied to Word Sense Disambiguation, in\r\nProceedings of the 19th International Conference on Computational\r\nLinguistics (COLING 2002), Taiwan.\r\n[16] Pedersen, Ted and Rebecca Bruce. 1997. A new supervised learning\r\nalgorithm for word sense disambiguation. In Proceedings of the 14th\r\nNational Conference on Artificial Intelligence (AAAI-97), Providence,\r\nRI.\r\n[17] R. Mooney. 1996. Comparative experiments on disambiguating word\r\nsenses: An illustration of the role of bias in machine learning. In\r\nProceedings of the 1996 Conference on Empirical Methods in Natural\r\nLanguage Processing (EMNLP-1996), pages 82-91, Philadelphia.\r\n[18] Leacock, C., Towell, G. and Voorhees, E. M., 1993 \"Corpus-based\r\nstatistical sense resolution.\" In Proceedings of the ARPA Human\r\nLanguages Technology Workshop.\r\n[19] Gale, W., K. Church, and D. Yarowsky. ``Work on Statistical Methods\r\nfor Word Sense Disambiguation.'' In Proceedings, AAAI Fall\r\nSymposium on Probabilistic Approaches to Natural Language.\r\nCambridge, MA, pp. 54-60, 1992.\r\n[20] Yarowsky, D. `` Word-Sense Disambiguation Using Statistical Models\r\nof Roget's Categories Trained on Large Corpora.'' In Proceedings,\r\nCOLING-92. Nantes, pp. 454-460, 1992.\r\n[21] R. Mooney. 1996. Comparative experiments on disambiguating word\r\nsenses: An illustration of the role of bias in machine learning. In\r\nProceedings of the 1996 Conference on Empirical Methods in Natural\r\nLanguage Processing (EMNLP-1996), pages 82-91, Philadelphia.\r\n[22] Pedersen, T., 2001. A decision tree of bigrams is an accurate predictor of\r\nword sense. In Proceedings of the North American Chapter of the\r\nAssociation for Computational Linguistics, NAACL 2001, pages 79-86,\r\nPittsburg.\r\n[23] Yarowsky, D. `` Hierarchical Decision Lists for Word Sense\r\nDisambiguation.'' Computers and the Humanities, 34(2):179-186, 2000.\r\n[24] H. T. Ng. 1997. Exemplar-Base Word Sense Disambiguation: Some\r\nRecent Improvements. In Procs. of the 2nd Conference on Empirical\r\nMethods in Natural Language Processing, EMNLP.\r\n[25] C. Cardie. 1993. A case-based approach to knowledge acquisition for\r\ndomain-specific sentence analysis. In Proceedings of the Eleventh\r\nNational Conference on Artificial Intelligence, pages 798-803,\r\nWashington, DC.\r\n[26] Veenstra, A. van den Bosch, J., S. Buchholz, W. Daelemans, and J.\r\nZavrel. 2000 Memory-based word sense disambiguation Computers and\r\nthe Humanities, 34:171-177.\r\n[27] Daeleman, W.,Machine Learning of Language: A Model and a Problem,\r\nESSLLI'2002 Workshop on Machine Learning Approaches in\r\nComputational Linguistics, August 5 - 9, 2002, Trento, Italy.\r\n[28] G. Escudero, L. Mrquez, and G. Rigau. 2000, Naive Bayes and\r\nExemplar-Based Approaches to Word Sense Disambiguation Revisited.\r\nIn Proceedings of the 14th European Conference on Artificial\r\nIntelligence, ECAL.\r\n[29] Lee, Yoong Keok, & Ng, Hwee Tou. An Empirical Evaluation of\r\nKnowledge Sources and Learning Algorithms for Word Sense\r\nDisambiguation. Proceedings of the 2002 Conference on Empirical\r\nMethods in Natural Language Processing (EMNLP-2002). pp. 41-48,\r\n2002.\r\n[30] D.W. Aha and R.L. Bankert. 1994. Feature selection for case-based\r\nclassification of cloud types: An empirical comparison. In Proceedings\r\nof the AAAI-94 Workshop on Case-Based Reasoning, pages 106-112,\r\nSeattle, WA.\r\n[31] A.W. Moore and M.S. Lee. 1994. Efficient algorithms for minimizing\r\ncross validation error. In International Conference on Machine Learning,\r\npages 190-198, New Brunswick.\r\n[32] C. Cardie. 1996. Automating feature set selection for case-based\r\nlearning of linguistic knowledge. In Proceedings of the Conference on\r\nEmpirical Methods in Natural Language Processing EMNLP, pages 113-\r\n126, Somerset, New Jersey.\r\n[33] P. Domingos. 1997. Context-sensitive feature selection for lazy learners.\r\nArtificial Intelligence Review, (11):227-253.\r\n[34] Mihalcea, R., Instance Based Learning with Automatic Feature Selection\r\nApplied to Word Sense Disambiguation, in Proceedings of the 19th\r\nInternational Conference on Computational Linguistics (COLING\r\n2002), Taiwan, August 2002.\r\n[35] Y\u2500\u2592lmaz, O., September, 1994, Design and implementation of a verb\r\nlexicon and sense disambiguator for Turkish, MS. Thesis, Bilkent\r\nUniversity, Ankara, Turkey.\r\n[36] Orhan Z., Altan Z., 2003, \"Disambiguation of Turkish Word Senses\r\nBy Supervised Statistical Methods\", International XII. Turkish\r\nSymposium on Artificial Intelligence and Neural Networks (TAINN\r\n2003).\r\n[37] Nart B. Atalay, Kemal Oflazer, Bilge Say, The Annotation Process in\r\nthe Turkish treebank, in Proceedings of the EACL Workshop on\r\nLinguistically Interpreted Corpora-LINC, April 13-14, 2003, Budapest.\r\n[38] Fellbaum C., 1998, WordNet: An Electronic Lexical Database. The MIT\r\npress.\r\n[39] Ciaramita M., Johnson M. , 2004, \"Multi-Component Word Sense\r\nDisambiguation\" Proceedings of Senseval-3: The Third International\r\nWorkshop on the Evaluation of Systems for the Semantic Analysis of\r\nText, pp. 97-100.\r\n[40] Stamou, S., Oflazer, K., Pala, K., Christodoulakis, D., Cristea, D., Tufis,\r\nD., Koeva, S., Totkov, G., Dutoit, D., Grigoriadou, M., BalkaNet: A\r\nmultilingual Semantic Network for Balkan Languages, in Proceedings of\r\nthe First International WordNet Conference, Mysore India, January\r\n2002.\r\n[41] O. Bilgin, \u251c\u00e7et\u2500\u2592no\u2500\u0192lu, \u251c\u00fb., Oflazer, K., Building a Wordnet for Turkish,\r\nRomanian Journal of Information Science and Technology, Volume 7,\r\nNumbers 1-2, 2004, 163-172.\r\n[42] WEKA system, http:\/\/www.cs.waikato.ac.nz\/ml\/weka.","publisher":"World Academy of Science, Engineering and Technology","index":"Open Science Index 7, 2007"}