Search results for: Arabic natural language processing
12315 Language Literacy Attrition: An Empirical Investigation
Authors: Ahmad Al-Issa
Abstract:
Our world is now operating under the auspices of globalization with its attendant language of 'global English.' In many parts of the world, the need for English is often accepted without much thought given to native languages. Indeed, this is the current situation in the United Arab Emirates (UAE), with English encroaching into all areas of society, and especially forcefully into the education sector, where English as a medium of instruction (EMI) is on the rise. At the same time, Arabic literacy (i.e., the ability to read and write in Arabic) is declining among the UAE youth. Using a mixed-methods design, a study was conducted to gain insights into the use of Arabic by Emirati university students. The study examines how often Emiratis, males and females, use their native language (Arabic) in their daily lives, how they view their reading and writing skills in Arabic vis-à-vis their English literacy skills, and the extent to which they can demonstrate their literacy skills in Arabic. Clear evidence emerged showing that while Arabic as a dialect continues to be spoken on a daily basis, Arabic literacy is unquestionably losing ground. This was found to be motivated by educational, political, societal, and personal forces. These findings and their implications for language policy and existing bilingualism programs will be discussed. Suggestions for further research will also be made.
Keywords: Arabic, globalization, global English, literacy attrition, United Arab Emirates
Procedia PDF Downloads 292
12314 University Arabic/Foreign Language Teacher's Competences, Professionalism and the Challenges and Opportunities
Authors: Abeer Heider
Abstract:
The article considers the definitions of teachers’ competences and professionalism from the different perspectives of Arab and foreign scientists. Special attention is paid to the definition and classification of the stages and components of the professionalism of university Arabic/foreign language teachers. The results of the survey are offered and recommendations are given. In this paper, only some of the problems of defining the professional competence and professionalism of the university Arabic/foreign language teacher have been mentioned. The topic needs much more analysis and discussion, because the quality of training today’s competitive and mobile students with a good knowledge of foreign languages depends directly on the teachers’ professional level.
Keywords: teacher’s professional competences, Arabic/foreign language teacher’s professionalism, teacher evaluation, teacher quality
Procedia PDF Downloads 456
12313 Comparison Constructions in the Language of the Qur'an
Authors: Safiah Ahmed Yahya Madkhali
Abstract:
The aim of the present paper is to provide a characterization of the expression of comparison in the language of the Qur’an, the language of the Divine Book of the Muslim nation. It focuses on quantitative as well as qualitative comparisons. Whereas existing works on comparison constructions in Arabic focus on one or more types of the construction to the exclusion of others and investigate their behaviour in Standard Arabic, this paper aims to include the varied instances of scalar comparison constructions and describe their aspects in the language of the Qur’an. To the best of my knowledge, comparative constructions in the language of the Qur’an have not been tackled before, and hence the characterization provided in the paper is the contribution of the present work. The paper highlights several rhetorical features of the construction, as present in different verses of the Qur’an, which set them apart from the ordinary use of the construction in the different varieties of the Arabic language.
Keywords: comparison constructions, inequality, comparative, superlative, equality
Procedia PDF Downloads 76
12312 Promotion of the Arabic language in India: MES Mampad College - A Torchbearer
Authors: Junaid C, Sabique MK
Abstract:
Introduction: MES Mampad College is an autonomous college established in 1964, affiliated with the University of Calicut and run by the Muslim Educational Society Kerala. The department of Arabic of the college plays a pivotal role in promoting Arabic language learning, teaching, research, and other allied academic activities. Statement of Problem: The department of Arabic of the college introduced the culture of international seminars to the academic committee. The department connected the academic community with foreign scholars and introduced industry-academia collaboration programs which are beneficial to job seekers. These practices and innovations should be documented. Objectives: Create awareness of innovative practices implemented for the promotion of the Arabic language. Infuse confidence in learners of the Arabic language. Showcase the distinctive academic programs initiated by the department. Methodology: Data will be collected from archives, souvenirs, and reports. Surveys and interviews with authorities and beneficiaries will be conducted for the data analysis. Major results: MES Mampad College introduced to its stakeholders different unique academic practices related to the Arabic language and literature. When the unprecedented pandemic situation held back the entire academic community, the department came forward with numerous academic initiatives utilizing the virtual space. Both arenas will be documented. Conclusion: This study will help to raise awareness of the promotion of Arabic language studies and the related practices initiated by the department of Arabic, MES Mampad College. These practices and innovations can be modeled and replicated.
Keywords: teaching Arabic language, MES mampad college, Arabic webinars, pandemic impacts in literature
Procedia PDF Downloads 86
12311 Resume Ranking Using Custom Word2vec and Rule-Based Natural Language Processing Techniques
Authors: Subodh Chandra Shakya, Rajendra Sapkota, Aakash Tamang, Shushant Pudasaini, Sujan Adhikari, Sajjan Adhikari
Abstract:
Many efforts have been made to measure the semantic similarity between the text corpora in documents, and techniques have evolved to measure the similarity of two documents. One such state-of-the-art technique in the field of Natural Language Processing (NLP) is word-to-vector models, which convert words into word embeddings and measure the similarity between the vectors. We found this to be quite useful for the task of resume ranking. This research paper therefore implements the word2vec model along with other Natural Language Processing techniques in order to rank resumes against a particular job description so as to automate the hiring process. The paper proposes the system and presents the findings made while building it.
Keywords: chunking, document similarity, information extraction, natural language processing, word2vec, word embedding
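The embedding-and-similarity idea described in this abstract can be sketched as follows. This is only a minimal illustration, not the authors' system: the three-dimensional vectors in `EMBEDDINGS` are made-up values standing in for a trained word2vec model, and the function names are hypothetical.

```python
import math

# Toy 3-dimensional embeddings; hypothetical values for illustration only.
# A real system would load vectors from a trained word2vec model instead.
EMBEDDINGS = {
    "python": [0.9, 0.1, 0.0],
    "java": [0.8, 0.2, 0.1],
    "developer": [0.7, 0.3, 0.0],
    "chef": [0.0, 0.1, 0.9],
    "cooking": [0.1, 0.0, 0.8],
}


def document_vector(text):
    """Average the vectors of the known words in `text`."""
    vectors = [EMBEDDINGS[w] for w in text.lower().split() if w in EMBEDDINGS]
    if not vectors:
        return [0.0, 0.0, 0.0]
    return [sum(column) / len(vectors) for column in zip(*vectors)]


def cosine(u, v):
    """Cosine similarity of two vectors; 0.0 if either has zero length."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def rank_resumes(job_description, resumes):
    """Return (name, score) pairs sorted from most to least similar."""
    job_vec = document_vector(job_description)
    scored = [(name, cosine(job_vec, document_vector(text)))
              for name, text in resumes.items()]
    return sorted(scored, key=lambda pair: pair[1], reverse=True)


if __name__ == "__main__":
    resumes = {"candidate_a": "java developer", "candidate_b": "chef cooking"}
    for name, score in rank_resumes("python developer", resumes):
        print(name, round(score, 3))
```

Averaging word vectors is the simplest way to obtain a document vector; the abstract does not say how the authors combine vectors or which rule-based steps they add.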
Procedia PDF Downloads 158
12310 Prospects in Teaching Arabic Grammatical Structures to Non-Arab Learners
Authors: Yahya Toyin Muritala, Nonglaksana Kama, Ahmad Yani
Abstract:
The aim of the paper is to investigate various linguistic techniques for enhancing and facilitating the acquisition of practical knowledge of Arabic grammatical structuring among non-Arab learners of the standard classical Arabic language in non-Arabic speaking academic settings, against the background of the current growth of internationalism and cultural integration in some higher institutions. As the nature of the project requires standard investigations into the unique principal features of Arabic structures and their implications, the findings of the research work suggest some principles to follow in solving the problems faced by learners while acquiring grammatical aspects of the Arabic language. The work also concentrates on the structural features of the language in terms of inflection/parsing, structural arrangement order, functional particles, morphological formation and conformity, etc. The grammatical aspect of Arabic, which has gone through major stages from its early evolution in the classical period through the era of stagnation to development and the modern stage of revitalization, is therefore a main subject matter of the paper, as it is globally connected with communication and with the religion of Islam practiced by millions of Arabs and non-Arabs nowadays. The conclusion of the work shows new findings, obtained through descriptive and analytical methods, on teaching language for the purpose of effective global communication, with a focus on methods of second language acquisition by application.
Keywords: language structure, Arabic grammar, classical Arabic, intercultural communication, non-Arabic speaking environment and prospects
Procedia PDF Downloads 400
12309 The Effect of the Vernacular on Code-Switching Hebrew into Palestinian Arabic
Authors: Ward Makhoul
Abstract:
Code-switching (CS) is known as a ubiquitous phenomenon in multilingual societies and countries. The vernacular Palestinian Arabic (PA) variety spoken in Israel is among the languages affected; it is used informally, for day-to-day conversations only. Such conversations appear to contain code-switched instances from Hebrew, the formal and dominant language of the country, even in settings where CS seems to be unnecessary. This study examines the CS practices in PA and investigates the reason behind these CS instances in controlled settings and the correlation between bilingual dominance and CS. Thirteen participants were interviewed in production tasks to elicit and analyze natural speech containing CS instances, and they also took a Bilingual Language Profile (BLP) test; the interviews and the BLP showed a correlation between language dominance and CS. The acceptability judgment task observed the limits and boundaries of different code-switched linguistic structures.
Keywords: code-switching, Hebrew, Palestinian-Arabic, vernacular
Procedia PDF Downloads 118
12308 Saudi Twitter Corpus for Sentiment Analysis
Authors: Adel Assiri, Ahmed Emam, Hmood Al-Dossari
Abstract:
Sentiment analysis (SA) has received growing attention in Arabic language research. However, few studies have directly applied SA to Arabic, owing to the lack of a publicly available dataset for this language. This paper partially bridges this gap by focusing on one of the Arabic dialects, the Saudi dialect. It presents an annotated dataset of 4700 for Saudi dialect sentiment analysis (K = 0.807). Our next work is to extend this corpus and create a large-scale lexicon for the Saudi dialect from the corpus.
Keywords: Arabic, sentiment analysis, Twitter, annotation
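The K value quoted in this abstract is presumably an inter-annotator agreement statistic; the abstract does not say which one. As an assumption for illustration, the sketch below computes Cohen's kappa for two annotators, which corrects the observed agreement for the agreement expected by chance. The function name and the sample labels are hypothetical.

```python
from collections import Counter


def cohens_kappa(labels_a, labels_b):
    """Cohen's kappa for two annotators labelling the same items."""
    if len(labels_a) != len(labels_b) or not labels_a:
        raise ValueError("both annotators must label the same non-empty item list")
    n = len(labels_a)
    # Fraction of items on which the two annotators agree.
    observed = sum(a == b for a, b in zip(labels_a, labels_b)) / n
    # Agreement expected by chance from each annotator's label frequencies.
    counts_a, counts_b = Counter(labels_a), Counter(labels_b)
    expected = sum(counts_a[label] * counts_b[label] for label in counts_a) / (n * n)
    if expected == 1.0:
        return 1.0  # both annotators used a single identical label throughout
    return (observed - expected) / (1.0 - expected)


if __name__ == "__main__":
    annotator_1 = ["pos", "pos", "neg", "neg"]
    annotator_2 = ["pos", "neg", "neg", "neg"]
    print(cohens_kappa(annotator_1, annotator_2))
```

With more than two annotators, a statistic such as Fleiss' kappa would be the usual choice instead.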
Procedia PDF Downloads 630
12307 Oral Grammatical Errors of Arabic as Second Language (ASL) Learners: An Applied Linguistic Approach
Authors: Sadeq Al Yaari, Fayza Al Hammadi, Ayman Al Yaari, Adham Al Yaari, Montaha Al Yaari, Aayah Al Yaari, Sajedah Al Yaari, Salah Al Yami
Abstract:
Background: When Arabic grammatical issues are considered in light of applied linguistic investigations of Arabic as Second Language (ASL) learners, a fundamental issue arises concerning the production of speech in Arabic: oral grammatical errors committed by ASL learners. Aims: Using manual rating as well as computational analytic methodology to test a corpus of recorded speech by ASL learners, this study aims to find the areas of difficulty in learning Arabic grammar. More specifically, it examines how and why ASL learners make grammatical errors in their oral speech. Methods: Tape recordings of four (4) ASL learners who ranged in age from 23 to 30 were naturally collected. All participants had completed an intensive two-year Arabic program, and a 20-minute speech sample was recorded for each participant. Once the corpus was collected, the next procedure was to rate it against standard Arabic grammar. The rating includes four processes: Description, analysis and assessment. Conclusions: The outcomes of this paper can be summarized in the fact that ASL learners face many grammatical difficulties when studying Arabic: word order, tenses and aspects, function words, subject-verb agreement, verb form, active-passive voice, global and local errors, and process-based errors including addition, omission, substitution or a combination of any of them.
Keywords: grammar, error, oral, Arabic, second language, learner, applied linguistics.
Procedia PDF Downloads 47
12306 Neural Correlates of Arabic Digits Naming
Authors: Fernando Ojedo, Alejandro Alvarez, Pedro Macizo
Abstract:
In the present study, we explored electrophysiological correlates of Arabic digits naming to determine semantic processing of numbers. Participants named Arabic digits grouped by category or intermixed with exemplars of other semantic categories while the N400 event-related potential was examined. Around 350-450 ms after the presentation of Arabic digits, brain waves were more positive in anterior regions and more negative in posterior regions when stimuli were grouped by category relative to the mixed condition. Contrary to what was found in other studies, electrophysiological results suggested that the production of numerals involved semantic mediation.
Keywords: Arabic digit naming, event-related potentials, semantic processing, number production
Procedia PDF Downloads 582
12305 The Effects of the Inference Process in Reading Texts in Arabic
Authors: May George
Abstract:
Inference plays an important role in the learning process, and it can lead to rapid acquisition of a second language. When learning a non-native language, i.e., a critical language like Arabic, students depend on the teacher’s support most of the time to learn new concepts. They focus on memorizing new vocabulary and place stress on learning all the grammatical rules. Hence, the students become mechanical and cannot produce the language easily. As a result, they rely heavily on the teacher and are unable to predict the meaning of words in context: they cannot link their prior knowledge or even identify the meaning of words without the teacher's support. This study explores how the teacher guides students’ learning during the inference process and which learning processes can direct students’ inference.
Keywords: inference, reading, Arabic, language acquisition
Procedia PDF Downloads 531
12304 Written Grammatical Errors of Arabic as Second Language (ASL) Learners: An Evaluative Study
Authors: Sadeq Al Yaari, Fayza Al Hammadi, Ayman Al Yaari, Adham Al Yaari, Montaha Al Yaari, Aayah Al Yaari, Sajedah Al Yaari, Salah Al Yami
Abstract:
Background: In recent years, the number of non-native speakers of the Arabic language has exponentially increased. Aims: This analytical study aims to investigate written grammatical errors committed by Arabic as a Second Language (ASL) learners. More specifically, it explores the reasons behind these errors and their effects on the daily communication of ASL learners. Methods: Ten (10) senior ASL learners at the Arabic Language Institute (ALI), College of Arts, King Saud University (KSU), Riyadh, Kingdom of Saudi Arabia (KSA) were randomly selected for this study. The participants were asked to write paragraphs about themselves, and their written work was then linguistically analyzed and evaluated by the researchers and some Arabic language experts before it was statistically analyzed. Conclusions: The results show that the written grammatical errors of ASL learners are characterized by the misuse of many grammatical items. Mainly, these items are proper nouns (PN), common nouns (CN), main verbs (MV), adjectives (adj.), time adverbs (T. Adv.), manner adverbs (M. Adv.), objective pronouns (OP), and central determiners (C Det.) including demonstratives (Dem.) and articles (Artic.), pronouns (Pron.) and prepositions (Prep.).
Keywords: written, grammatical errors, Arabic, second language, non-native learners, analysis.
Procedia PDF Downloads 44
12303 Arabic Light Stemmer for Better Search Accuracy
Authors: Sahar Khedr, Dina Sayed, Ayman Hanafy
Abstract:
Arabic is one of the most ancient and critical languages in the world. It has more than 250 million native speakers and is an official language in more than twenty countries. In the past decade, we have witnessed a rapid evolution in smart devices, social networks and the technology sector, which has led to the need for tools and libraries that properly handle the Arabic language in different domains. Stemming is one of the most crucial linguistic fundamentals. It is used in many applications, especially in the information extraction and text mining fields. The motivation behind this work is to enhance the Arabic light stemmer to serve the data mining industry and make it available to the open source community. The presented implementation enhances the Arabic light stemmer by utilizing and improving an algorithm that provides an extension for a new set of rules and patterns, accompanied by an adjusted procedure. The study reports a significant enhancement in search accuracy, with an average 10% improvement in comparison with previous works.
Keywords: Arabic data mining, Arabic Information extraction, Arabic Light stemmer, Arabic stemmer
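Light stemming, as opposed to root extraction, strips a small set of frequent prefixes and suffixes without attempting morphological analysis. The sketch below shows the general idea only. The affix lists are an illustrative assumption loosely modeled on commonly used light-stemming rules; they are not the enhanced rule set this abstract describes, which the abstract does not spell out. The names `PREFIXES`, `SUFFIXES`, and `light_stem` are hypothetical.

```python
# Illustrative affix lists (an assumption, not the authors' rules).
# Longer prefixes come first so that the longest match wins.
PREFIXES = ["وال", "بال", "كال", "فال", "ال", "لل"]
SUFFIXES = ["ها", "ان", "ات", "ون", "ين", "يه", "ية", "ه", "ة", "ي"]


def light_stem(word, min_len=2):
    """Strip at most one prefix and one suffix, keeping at least `min_len` letters."""
    for prefix in PREFIXES:
        if word.startswith(prefix) and len(word) - len(prefix) >= min_len:
            word = word[len(prefix):]
            break
    for suffix in SUFFIXES:
        if word.endswith(suffix) and len(word) - len(suffix) >= min_len:
            word = word[:-len(suffix)]
            break
    return word


if __name__ == "__main__":
    for token in ["المكتبة", "والكتاب", "كتاب"]:
        print(token, "->", light_stem(token))
```

For search, the same function would be applied to both indexed text and queries, so that surface variants of a word map to the same index term.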
Procedia PDF Downloads 309
12302 Gender Bias in Natural Language Processing: Machines Reflect Misogyny in Society
Authors: Irene Yi
Abstract:
Machine learning, natural language processing, and neural network models of language are becoming more and more prevalent in the fields of technology and linguistics today. Training data for machines are, at best, large corpora of human literature and, at worst, a reflection of the ugliness in society. Machines have been trained on millions of human books, only to find that in the course of human history, derogatory and sexist adjectives are used significantly more frequently when describing females in history and literature than when describing males. This is extremely problematic, both as training data and as the outcome of natural language processing. As machines start to handle more responsibilities, it is crucial to ensure that they do not take with them historical sexist and misogynistic notions. This paper gathers data and algorithms from neural network models of language dealing with syntax, semantics, sociolinguistics, and text classification. Results are significant in showing the existing intentional and unintentional misogynistic notions used to train machines, as well as in developing better technologies that take into account the semantics and syntax of text to be more mindful and reflect gender equality. Further, this paper deals with the idea of non-binary gender pronouns and how machines can process these pronouns correctly, given their semantic and syntactic context. This paper also delves into the implications of gendered grammar and its effect, cross-linguistically, on natural language processing. Languages such as French or Spanish not only have rigid gendered grammar rules, but also historically patriarchal societies. The progression of society comes hand in hand with not only its language, but how machines process those natural languages.
These ideas are all extremely vital to the development of natural language models in technology, and they must be taken into account immediately.
Keywords: gendered grammar, misogynistic language, natural language processing, neural networks
Procedia PDF Downloads 120
12301 Current Trends in the Arabic Linguistics Development: Between National Tradition and Global Tendencies
Authors: Olga Bernikova, Oleg Redkin
Abstract:
Globalization is a process of worldwide economic, political and cultural integration. Obviously, this phenomenon has both positive and negative aspects. This article analyzes the impact of the modern process of globalization on the national traditions of language teaching and research. In this context, the problem of the ratio of local to global can be viewed from several sides. Firstly, since English is the language of over 80 percent of scientific and technical research worldwide, what should be the language of science in a certain region? Secondly, language 'globality' is not always associated with English, because intercultural communications may have their regional peculiarities. For example, in the Arab world, Modern Standard Arabic can also be regarded as a 'global' phenomenon, since the mother-tongue languages of the population are local Arabic dialects. In addition, the correlation 'local' versus 'global' is manifested not only in the linguistic sphere but also in the methodology used in language acquisition and research. Thus, the major principles of the Arabic philological tradition, which goes back to the 7th century, are still widespread in the modern Arab world. At the same time, the terminology and methods of language research that are peculiar to this tradition are quite far from the issues of general linguistics that underlie the description of all the languages of the world. The present research relies on a comparative analysis of sources in Arabic linguistics, including original works in Arabic dating back to the 12th-13th centuries. As a case study, the interaction of local and global is also considered using the example of Arabic teaching and research in Russia.
Speaking about the correlation between local and global, it is possible to forecast the development of two parallel tendencies: the spread of the phenomena of globalization on one hand, and local implementation of a language policy aimed at preserving native languages, including Arabic, on the other.
Keywords: Arabic, global, language, local, tradition
Procedia PDF Downloads 260
12300 Anti-Language in Jordanian Spoken Arabic: A Sociolinguistic Perspective
Authors: Ahmad Mohammad Al-Harahsheh
Abstract:
Anti-language reflects anti-society; it is a restricted spoken code used among a group of interlocutors who form an anti-society. This study aims to shed light on the sociolinguistic characteristics of the anti-language used by prisoners in Jordan. The participants were 15 male Jordanian prisoners who had recently been released. The data were written, transliterated, and analyzed on the basis of sociolinguistics and discourse analysis. This study draws on the sociolinguistic theory of language codes as its theoretical framework. The study concludes that anti-language is a male language and is used for secrecy, reflecting the prisoners' tendency to protect themselves from the police; it is a verbal competition, contest and display. In addition, it is employed to express obnoxious ideas and acts by using more pleasant or blurred words and expressions. Also, the anti-language used by prisoners has six linguistic characteristics in JSA (Jordanian Spoken Arabic): relexicalization, neologism, rhyme formation, semantic change, derivation, and metaphorical expressions.
Keywords: anti-language, Jordanian Spoken Arabic, sociolinguistics, prisoners
Procedia PDF Downloads 366
12299 Language Use in Computer-Mediated Communication and Users’ Social Identity
Authors: Miramar Damanhouri
Abstract:
This study examines the relationship between language use in computer-mediated communication and the social identity of the user. The data were collected by surveying 298 Saudi bilingual speakers who are familiar with Arabizi, a blend of Latin characters and Arabic numerals to transliterate Arabic sounds, and then analyzed quantitatively by running tests for statistical confidence in order to determine differences in perceptions between young adults (ages 15-25 years) and middle-aged adults (ages 26-50 years). According to the findings of this study, English is the dominant language among most of the young adults surveyed, and when they do use Arabic, they use Arabizi because of its flexibility, compatibility with modern technology, and its acceptance among people of their age and sociocultural backgrounds. On the other hand, most middle-aged adults surveyed here tend to use Arabic, as they believe that they should show their loyalty to their origin. The results of the study demonstrate a mutual relationship between language use in computer-mediated communication and the user’s social identity, as language is used both to reflect and construct that identity.
Keywords: Arabizi, computer mediated communication, digital communication, language use
Procedia PDF Downloads 134
12298 Towards a Large Scale Deep Semantically Analyzed Corpus for Arabic: Annotation and Evaluation
Authors: S. Alansary, M. Nagi
Abstract:
This paper presents an approach to the semantic annotation of an Arabic corpus using the Universal Networking Language (UNL) framework. UNL is intended to be a promising strategy for providing a large collection of semantically annotated texts with formal, deep semantics rather than shallow semantics. The result would constitute a semantic resource (semantic graphs) that is editable and that integrates various phenomena, including predicate-argument structure, scope, tense, thematic roles and rhetorical relations, into a single semantic formalism for knowledge representation. The paper also presents the Interactive Analysis tool for automatic semantic annotation (IAN). In addition, the disambiguation and transformation rules, which are the cornerstone of the proposed methodology, are presented. Semantic annotation using UNL has been applied to a corpus of 20,000 Arabic sentences representing the most frequent structures in the Arabic Wikipedia. The representation at different linguistic levels is illustrated, starting from the morphological level and passing through the syntactic level until the semantic representation is reached. The output has been evaluated using the F-measure and is reported to be 90% accurate. This demonstrates how powerful the formal environment is, as it enables intelligent text processing and search.
Keywords: semantic analysis, semantic annotation, Arabic, universal networking language
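The F-measure mentioned in this abstract combines precision and recall into a single score. The sketch below shows the standard definition over sets of predicted and gold annotations. How the authors compare UNL graphs against a gold standard is not stated in the abstract, so representing annotations as sets of hashable items is an assumption made here for illustration.

```python
def f_measure(predicted, gold, beta=1.0):
    """F-beta score of predicted annotations against gold annotations."""
    predicted, gold = set(predicted), set(gold)
    true_positives = len(predicted & gold)
    if true_positives == 0:
        return 0.0
    precision = true_positives / len(predicted)  # share of predictions that are correct
    recall = true_positives / len(gold)          # share of gold items that were found
    beta_sq = beta * beta
    return (1 + beta_sq) * precision * recall / (beta_sq * precision + recall)


if __name__ == "__main__":
    # Hypothetical relation triples standing in for semantic-graph edges.
    gold = {("eat", "agent", "boy"), ("eat", "object", "apple")}
    predicted = {("eat", "agent", "boy"), ("eat", "place", "apple")}
    print(f_measure(predicted, gold))
```

With `beta=1.0` this is the harmonic mean of precision and recall, usually called F1.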
Procedia PDF Downloads 582
12297 An Enhanced Support Vector Machine Based Approach for Sentiment Classification of Arabic Tweets of Different Dialects
Authors: Gehad S. Kaseb, Mona F. Ahmed
Abstract:
Arabic Sentiment Analysis (SA) is one of the most common research fields, with many open areas. Few studies apply SA to Arabic dialects. This paper proposes different pre-processing steps and a modified methodology to improve accuracy using standard Support Vector Machine (SVM) classification. The paper works on two datasets, the Arabic Sentiment Tweets Dataset (ASTD) and the Extended Arabic Tweets Sentiment Dataset (Extended-AATSD), which are publicly available for academic use. The results show that the classification accuracy approaches 86%.
Keywords: Arabic, classification, sentiment analysis, tweets
Procedia PDF Downloads 149
12296 An Analytic Comparison between Arabic and English Prosodies: Poetical Feet and Meters
Authors: Jamil Jafari, Sharafat Karimi
Abstract:
The Arabic language has a complicated system of prosody invented by the great grammarian Khalil Ibn Ahmad Farahidi. He extracted 15 meters from his innovative five circles, which were used in Arabic poetry of the 7th and 8th centuries. Later, his student Akhfash added another meter to his tutor's meters, so overall there are now 16 different meters in Arabic poetry. These meters are formed by various combinations of 8 different feet, and each foot is composed of rudimentary units called Sabab and Wated, which are combinations of movement (/) and silent (ʘ) letters. English, on the other hand, has another system of metrical prosody. In this language, feet consist of stressed and unstressed syllables and are of six types: iamb, trochee, dactyl, anapest, spondee, and pyrrhic. Using the descriptive-analytic method, this research aims to compare the Arabic and English systems of metrical prosody to investigate their similarities and differences. The results show that both are quantitative and both rely on syllables in a foot. But unlike Arabic, English uses another rhyme system, and the number of feet in a line differs from Arabic; also, its feet are composed of stressed and unstressed syllables, while those of Arabic are combinations of movement and silent letters.
Keywords: Arabic prosody, English prosody, foot, meter, poetry
Procedia PDF Downloads 146
12295 English Grammatical Errors of Arabic Sentence Translations Done by Machine Translations
Authors: Muhammad Fathurridho
Abstract:
Grammar, as the set of rules every language uses to be understood, is always related to syntax and morphology. Arabic grammar differs from the grammars of other languages; it has more rules and difficulties. This paper aims to investigate and describe the English grammatical errors made by machine translation systems when translating Arabic sentences, including declarative, exclamatory, imperative, and interrogative sentences, specifically in the year 2018, when such systems can be supported by artificial intelligence. Arabic sample sentences, divided into verbal and nominal sentences and drawn from several published Arabic texts, are examined as the source language samples. The translations produced by several popular online machine translation systems, including Google Translate, Microsoft Bing, Babylon, Facebook, Hellotalk, Worldlingo, Yandex Translate, and Tradukka Translate, are the material objects of this research. The descriptive method used in this research shows the grammatical errors in the English target language and classifies them. The paper concludes that the grammatical errors in machine translation output are varied and can generally be classified into morphological, syntactic, and semantic errors across all types of Arabic words (noun, verb, and particle), and that this classification can serve as an evaluation for machine translation providers seeking to correct these errors and make their output more understandable.
Keywords: Arabic, Arabic-English translation, machine translation, grammatical errors
Procedia PDF Downloads 155
12294 Application of Natural Language Processing in Education
Authors: Khaled M. Alhawiti
Abstract:
Reading proficiency is a major component of language competency. However, finding topical texts at an appropriate level for foreign and second language learners is a challenge for teachers. We address this problem using natural language processing technology to assess reading level and simplify text. In the context of foreign and second-language learning, existing measures of reading level are not well suited to this task. Related work has shown the benefit of using statistical language processing techniques; we build on these ideas and include other potential features to measure readability. In the first part of this study, we combine features from statistical language models, traditional reading level measures and other language processing tools to produce a better method of detecting reading level. We discuss the performance of human annotators and evaluate results for our detectors with respect to human ratings. A key contribution is that our detectors are trainable; with training and test data from the same domain, our detectors outperform more general reading level tools (Flesch-Kincaid and Lexile). Trainability will allow performance to be tuned to address the needs of particular groups or students.
Keywords: natural language processing, trainability, syntactic simplification tools, education
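Flesch-Kincaid, one of the general-purpose baselines named in this abstract, estimates a U.S. school grade level from average sentence length and average syllables per word. The sketch below applies the published formula, grade = 0.39 × (words / sentences) + 11.8 × (syllables / words) − 15.59. The syllable counter is a rough vowel-group heuristic assumed here for illustration; production tools use pronunciation dictionaries or more careful rules.

```python
import re


def count_syllables(word):
    """Approximate syllables as the number of vowel groups (rough heuristic)."""
    return max(1, len(re.findall(r"[aeiouy]+", word.lower())))


def flesch_kincaid_grade(text):
    """Flesch-Kincaid grade level of an English text."""
    sentences = [s for s in re.split(r"[.!?]+", text) if s.strip()]
    words = re.findall(r"[A-Za-z']+", text)
    if not sentences or not words:
        raise ValueError("text must contain at least one sentence and one word")
    syllables = sum(count_syllables(word) for word in words)
    return (0.39 * len(words) / len(sentences)
            + 11.8 * syllables / len(words)
            - 15.59)


if __name__ == "__main__":
    print(round(flesch_kincaid_grade("The cat sat. The dog ran away quickly."), 2))
```

Because the formula uses only two surface features, it cannot be adapted to a particular learner population, which is the limitation that the trainable detectors described in the abstract are meant to address.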
Procedia PDF Downloads 490
12293 How Western Donors Allocate Official Development Assistance: New Evidence From a Natural Language Processing Approach
Authors: Daniel Benson, Yundan Gong, Hannah Kirk
Abstract:
Advancement in natural language processing techniques has led to increased data processing speeds and reduced the need for the cumbersome manual data processing that is often required when processing data from multilateral organizations for specific purposes. As such, using named entity recognition (NER) modeling and the Organisation for Economic Co-operation and Development (OECD) Creditor Reporting System database, we present the first geotagged dataset of OECD donor Official Development Assistance (ODA) projects on a global, subnational basis. The resulting dataset contains 52,086 ODA projects geocoded to subnational locations across 115 countries, worth a combined $87.9bn. This represents the first global OECD donor ODA project database with geocoded projects. We use this new data to revisit old questions of how ‘well’ donors allocate ODA to the developing world. This understanding is imperative for policymakers seeking to improve ODA effectiveness.
Keywords: international aid, geocoding, subnational data, natural language processing, machine learning
Procedia PDF Downloads 79
12292 Context Detection in Spreadsheets Based on Automatically Inferred Table Schema
Authors: Alexander Wachtel, Michael T. Franzen, Walter F. Tichy
Abstract:
Programming requires years of training. With natural language and end user development methods, programming could become available to everyone. Such methods enable end users to program their own devices and extend the functionality of an existing system without any knowledge of programming languages. In this paper, we describe an Interactive Spreadsheet Processing Module (ISPM), a natural language interface to spreadsheets that allows users to address ranges within the spreadsheet based on inferred table schema. Using the ISPM, end users are able to search for values in the schema of the table and to address the data in spreadsheets implicitly. Furthermore, it enables them to select and sort the spreadsheet data by using natural language. ISPM uses a machine learning technique to automatically infer areas within a spreadsheet, including different kinds of headers and data ranges. Since ranges can be identified from natural language queries, end users can query the data using natural language. During the evaluation, 12 undergraduate students were asked to perform operations (sum, sort, group, and select) using the system and also using Excel without the ISPM interface, and the time taken for task completion was compared across the two systems. Only for the selection task did users take less time in Excel (since they directly selected the cells using the mouse) than in ISPM. The work applies natural language to end user software engineering in order to overcome the present bottleneck of professional developers.
Keywords: natural language processing, natural language interfaces, human computer interaction, end user development, dialog systems, data recognition, spreadsheet
Procedia PDF Downloads 311
12291 The Attitude of Egyptian Nubian University Students towards Arabic and Nubian Languages
Authors: Sanaa Abouras
Abstract:
This research investigates the attitude of Egyptian Nubian university students towards Arabic and the two Nubian languages, Nobiin and Kenuzi-Dongola. Egyptian Nubians call these languages Fadijja/Fadicca and Kenzi, respectively. Nubians live in the Nubia region, which spans Egypt's southern border and the northern part of Sudan. Nubia is divided into two parts - one under the Egyptian regime, and the other under the Sudanese regime. The study had forty participants - half male and half female. Twenty of these participants live in the Nubian region and are enrolled at the South Valley University in Aswan, Egypt. This group was compared with an additional twenty Egyptian Nubian university students who live outside the Nubian region and attend various Egyptian universities located in Alexandria and Cairo. The hypothesis of this study is that Egyptian Nubian university students tend to have positive attitudes toward both Arabic and the Nubian languages. The research is qualitative and partially quantitative. Observations, questionnaires, and interviews were used to collect data in order to explore the following: (1) the language students prefer to speak at home and in public, and whether language preferences are gender-related, (2) the factors that influence the Egyptian Nubian university students' attitudes towards Arabic and the Nubian languages, and (3) the future of these ethnic Nubian languages. The results for the main question revealed that students who live inside and outside the Nubian region tend to have positive attitudes towards both Arabic and the Nubian languages.
Keywords: language attitude, minority, Arabic language, Nubian Language
Procedia PDF Downloads 272
12290 The Syllable Structure and Syllable Processes in Suhwa Arabic: An Autosegmental Analysis
Authors: Muhammad Yaqub Olatunde
Abstract:
Arabic linguistic science is redirecting its focus towards the analysis and description of social, regional, and temporal varieties in order to show how they vary in pronunciation, vocabulary, and grammar. This is not to say that the traditional Arabic linguists did not mention scores of dialectal variations, but such works focused on the geographical boundaries of the Arabic-speaking countries. There is a need for a comprehensive survey of the various Arabic dialects within and outside the boundaries of Arabic-speaking countries, showing both the similarities and differences of linguistic and extralinguistic elements. This study therefore examines the syllable structure and processes in nouns and verbs in the Shuwa Arabic dialect spoken in North East Nigeria (mainly in Borno State). The work seeks to establish the facts about this phenomenon using autosegmental analysis. These facts are compared, where necessary and using possible alternative analyses, with what operates in other related dialects within and outside Arabic-speaking countries. The interaction between epenthesis and gemination in the language also raises an interesting issue. The paper concludes that an account of syllable structure and processes in the language needs to recognize the existence of a complex onset and a complex rhyme, producing a consonant cluster in the former and a closed syllable in the latter. This emerges as a result of resyllabification, which is motivated by these processes.
Keywords: Arabic, dialect, linguistics, processes, resyllabification
Procedia PDF Downloads 423
12289 Enhanced Arabic Semantic Information Retrieval System Based on Arabic Text Classification
Authors: A. Elsehemy, M. Abdeen, T. Nazmy
Abstract:
Since the appearance of the Semantic Web, many semantic search techniques and models have been proposed to exploit the information in ontologies to enhance traditional keyword-based search. Many advances have been made for languages such as English, German, French, and Spanish. However, other languages such as Arabic are not yet fully supported. In this paper, we present a framework for ontology-based information retrieval for the Arabic language. Our system consists of four main modules: a query parser, an indexer, a search module, and a ranking module. Our approach includes building a semantic index by linking ontology concepts to documents, with an annotation weight for each link that is used in ranking the results. We also augmented the framework with an automatic document categorizer, which enhances the overall document ranking. We have built three Arabic domain ontologies (Sports, Economics, and Politics) as examples for the Arabic language. We built a knowledge base that consists of 79 classes and more than 1456 instances. The system is evaluated using the precision and recall metrics. We performed many retrieval operations on a sample of 40,316 documents with a size of 320 MB of pure text. The results show that semantic search enhanced with text classification performs better than the system without classification.
Keywords: Arabic text classification, ontology based retrieval, Arabic semantic web, information retrieval, Arabic ontology
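The ranking and evaluation steps described in this abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the index contents, document IDs, weights, and function names (`rank`, `precision_recall`) are hypothetical, and summing annotation weights per document is only one assumed way of using them for ranking.

```python
from collections import defaultdict

# Hypothetical semantic index: ontology concept -> {document id: annotation weight}.
semantic_index = {
    "football": {"doc1": 0.9, "doc2": 0.4},
    "league": {"doc1": 0.5, "doc3": 0.7},
}

def rank(query_concepts, index):
    """Sum annotation weights per document over the query concepts; best score first."""
    scores = defaultdict(float)
    for concept in query_concepts:
        for doc_id, weight in index.get(concept, {}).items():
            scores[doc_id] += weight
    # Sort by descending score, breaking ties by document id for a stable order.
    return sorted(scores, key=lambda d: (-scores[d], d))

def precision_recall(retrieved, relevant):
    """Precision = hits / retrieved, recall = hits / relevant (0.0 when undefined)."""
    retrieved, relevant = set(retrieved), set(relevant)
    hits = len(retrieved & relevant)
    precision = hits / len(retrieved) if retrieved else 0.0
    recall = hits / len(relevant) if relevant else 0.0
    return precision, recall

ranking = rank(["football", "league"], semantic_index)
top_two = ranking[:2]
metrics = precision_recall(top_two, ["doc1", "doc2"])
```

Here `ranking` orders documents by their summed weights, and `metrics` compares the top two results with an assumed set of relevant documents.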
Procedia PDF Downloads 526
12288 The Contribution of Translation to Arabic and Islamic Civilization during the Golden Age (661–1258)
Authors: Smail Hadj Mahammed
Abstract:
Translation is not merely a process of conveying meaning from one language into another to overcome language barriers and ensure good understanding; it is also a work of civilization and progress. Without the translation of Greek, Indian, and Persian works, Arabic and Islamic civilization would not have taken off, and without the translation of Arabic works into Latin, and then into European languages, the scientific and technological revolution of the modern world would not have taken place. In this context, the present paper seeks to investigate how the translation movement contributed to Arabic and Islamic civilization during the Golden Age. The paper consists of three major parts. The first part provides a brief historical overview of the translation movement during the Golden Age, which spanned two important eras: the Umayyad and the Abbasid. The second part shows the main reasons why translation was a prominent cultural activity during the Golden Age and why it attracted great interest from the Arabs. The last part highlights the constructive contribution of translation to Arabic and Islamic civilization during the period (661–1258). The results demonstrate that the Arabic translation movement was unprecedented in the transmission of knowledge in the whole history of humankind and that translation during the Golden Age significantly enriched Arabic and Islamic civilization, which absorbed major scientific works of the ancient Greek, Indian, and Persian civilizations.
Keywords: Arabic and Islamic civilization, contribution, golden age, translation
Procedia PDF Downloads 112
12287 Integrating Technology into Foreign Language Teaching: A Closer Look at Arabic Language Instruction at the Australian National University
Authors: Kinda Alsamara
Abstract:
Foreign language education is a complex endeavor that often presents educators with a range of challenges and difficulties. This study sheds light on the specific challenges encountered in the context of teaching Arabic as a foreign language at the Australian National University (ANU). Drawing from real-world experiences and insights, we explore the multifaceted nature of these challenges and discuss strategies that educators have employed to address them. The challenges in teaching the Arabic language encompass various dimensions, including linguistic intricacies, cultural nuances, and diverse learner backgrounds. The complex Arabic script, grammatical structures, and pronunciation patterns pose unique obstacles for learners. Moreover, the cultural context embedded within the language demands a nuanced understanding of cultural norms and practices. The diverse backgrounds of learners further contribute to the challenge of tailoring instruction to meet individual needs and proficiency levels. This study also underscores the importance of technology in tackling these challenges. Technological tools and platforms offer innovative solutions to enhance language acquisition and engagement. Online resources, interactive applications, and multimedia content can provide learners with immersive experiences, aiding in overcoming barriers posed by traditional teaching methods. Furthermore, this study addresses the role of instructors in mitigating challenges. Educators often find themselves adapting teaching approaches to accommodate different learning styles, abilities, and motivations. Establishing a supportive learning environment and fostering a sense of community can contribute significantly to overcoming challenges related to learner diversity. In conclusion, this study provides a comprehensive overview of the challenges faced in teaching Arabic as a foreign language at ANU. By recognizing these challenges and embracing technological and pedagogical advancements, educators can create more effective and engaging learning experiences for students pursuing Arabic language proficiency.
Keywords: Arabic, Arabic online, blended learning, teaching and learning, Arabic language, educational aids, technology
Procedia PDF Downloads 63
12286 The Case of Plagiarism and Its Presence in Classical Arabic Poetry
Authors: Yusuf Seller
Abstract:
Classical Arabic poetry was narrated by the followers of poets, who memorized and constantly repeated all the couplets of their masters. Although the students established their own styles, it was natural for them to reflect the style and expressions of their masters. This reflection was discussed in classical Arabic literary criticism and rhetoric (al-‘ilm al-balagha) as “al-Sariqah al-shiriyyah”, plagiarism in poetry. This study tests the claim that the reflection of the master's style and expressions in the student's poetry cannot be considered plagiarism. In addition, it investigates the methodological emergence of the phenomenon of plagiarism in classical Arabic poetry. Investigating the methodological origins of plagiarism helps us see the relationship of plagiarism to literary property and the extent of that property's authenticity. The focus is therefore directed towards uncovering the underlying ethical principles governing literary works and academic research in classical Arabic poetry.
Keywords: Arabic literary criticism, classical Arabic poetry, plagiarism, al-Sariqah al-shiriyyah
Procedia PDF Downloads 45