Commenced in January 2007
Frequency: Monthly
Edition: International
Paper Count: 32299
AGHAZ : An Expert System Based approach for the Translation of English to Urdu

Authors: Uzair Muhammad, Kashif Bilal, Atif Khan, M. Nasir Khan


Machine Translation (MT 3) of English text to its Urdu equivalent is a difficult challenge. Lot of attempts has been made, but a few limited solutions are provided till now. We present a direct approach, using an expert system to translate English text into its equivalent Urdu, using The Unicode Standard, Version 4.0 (ISBN 0-321-18578-1) Range: 0600–06FF. The expert system works with a knowledge base that contains grammatical patterns of English and Urdu, as well as a tense and gender-aware dictionary of Urdu words (with their English equivalents).

Keywords: Machine Translation, Multiword Expressions, Urdulanguage processing, POS12 Tagging for Urdu, Expert Systems.

Digital Object Identifier (DOI):

Procedia APA BibTeX Chicago EndNote Harvard JSON MLA RIS XML ISO 690 PDF Downloads 2222


[1] I.A. Sag, T. Baldwin, F. Bond, A. Copestake, D. Flickinger 2001, "Multi-word Expressions: A Pain in the Neck for NLP", LinGO Working Paper No. 2001-03. Stanford University, CA.
[2] Scott S. L. Piao, Paul Rayson, Dawn Archer, Andrew Wilson, Tony McEnery "Extracting Multiword Expression Using a Semantic Tagger", Lancaster University.
[3] Ann Copestake, Fabre Lambeau, Aline Villavicecio, Francis Bond, Timothy Baldwin, Ivan A.Sag, Dan Flickinger, "Multiword Expressions: linguistic precision and reusability", University of Cambridge Computer Laboratory, William Gates Building, JJ Thomson Avenue, Cambridge, CB3 0FD, UK. NTT Communication Science Labortries, Hikari Dai, Seiko-cho,Soraku-gun, Kioto 619-0237, JAPAN.
[4] Z. Pervez, S. Khan, F. Mustafa, M. Mahmood, U. Hasan, "Pharasal Consolidation Algorithm for Part Of Speech Tags In Machine Translation From English To Urdu" National University of Science and Technology, Rawalpindi Pakistan.
[5] Sarmad Hussein. "Letter-to-Sound Conversion for Urdu Text-to-Speech System". Center for Research in Urdu Language Processing, National University of Computer and Emerging Sciences, Lahore, Pakistan.
[6] T. Rahman (2002). "Language Ideology and Power: Language Learning Among the Muslims of Pakistan and North India", Oxford University Press, Karachi, Pakistan.
[7] Ethnologue, 13th Edition.
[8] T. Mitamura, E. Nyberg, E. Torrejon, D. Svoboda, A.Brunner and K. Baker, "Pronominal Anaphora Resolution in the Kantoo Multilingual Machine Translation System", Proceedings of the 9th International Conference on Theoretical and Methodological Issues in Machine Translation. Keihanna, Japan, Mar 2002.
[9] AltaVista Babelfish. URL:
[10] Google Language Tool. URL:
[11] Z. Pervez, S. Khan, F. Mustafa, M. Mahmood, U. Hasan, "Pharasal Consolidation Algorithm for Part Of Speech Tags In Machine Translation from English to Urdu", NUST Institute of Information Technology, National University of Sciences and Technology.