Search results for: Arabic on line character recognition
Commenced in January 2007
Frequency: Monthly
Edition: International
Paper Count: 5184

Search results for: Arabic on line character recognition

5124 A Supervised Approach for Word Sense Disambiguation Based on Arabic Diacritics

Authors: Alaa Alrakaf, Sk. Md. Mizanur Rahman

Abstract:

Since the last two decades’ Arabic natural language processing (ANLP) has become increasingly much more important. One of the key issues related to ANLP is ambiguity. In Arabic language different pronunciation of one word may have a different meaning. Furthermore, ambiguity also has an impact on the effectiveness and efficiency of Machine Translation (MT). The issue of ambiguity has limited the usefulness and accuracy of the translation from Arabic to English. The lack of Arabic resources makes ambiguity problem more complicated. Additionally, the orthographic level of representation cannot specify the exact meaning of the word. This paper looked at the diacritics of Arabic language and used them to disambiguate a word. The proposed approach of word sense disambiguation used Diacritizer application to Diacritize Arabic text then found the most accurate sense of an ambiguous word using Naïve Bayes Classifier. Our Experimental study proves that using Arabic Diacritics with Naïve Bayes Classifier enhances the accuracy of choosing the appropriate sense by 23% and also decreases the ambiguity in machine translation.

Keywords: Arabic natural language processing, machine learning, machine translation, Naive bayes classifier, word sense disambiguation

Procedia PDF Downloads 327
5123 Data Gathering and Analysis for Arabic Historical Documents

Authors: Ali Dulla

Abstract:

This paper introduces a new dataset (and the methodology used to generate it) based on a wide range of historical Arabic documents containing clean data simple and homogeneous-page layouts. The experiments are implemented on printed and handwritten documents obtained respectively from some important libraries such as Qatar Digital Library, the British Library and the Library of Congress. We have gathered and commented on 150 archival document images from different locations and time periods. It is based on different documents from the 17th-19th century. The dataset comprises differing page layouts and degradations that challenge text line segmentation methods. Ground truth is produced using the Aletheia tool by PRImA and stored in an XML representation, in the PAGE (Page Analysis and Ground truth Elements) format. The dataset presented will be easily available to researchers world-wide for research into the obstacles facing various historical Arabic documents such as geometric correction of historical Arabic documents.

Keywords: dataset production, ground truth production, historical documents, arbitrary warping, geometric correction

Procedia PDF Downloads 143
5122 Recognition of Tifinagh Characters with Missing Parts Using Neural Network

Authors: El Mahdi Barrah, Said Safi, Abdessamad Malaoui

Abstract:

In this paper, we present an algorithm for reconstruction from incomplete 2D scans for tifinagh characters. This algorithm is based on using correlation between the lost block and its neighbors. This system proposed contains three main parts: pre-processing, features extraction and recognition. In the first step, we construct a database of tifinagh characters. In the second step, we will apply “shape analysis algorithm”. In classification part, we will use Neural Network. The simulation results demonstrate that the proposed method give good results.

Keywords: Tifinagh character recognition, neural networks, local cost computation, ANN

Procedia PDF Downloads 307
5121 Gesture in the Arabic and Malay Languages a Comparative Study

Authors: Siti Sara binti Hj Ahmad, Adil Elshiekh Abdalla

Abstract:

The Arabic and Malay languages belong to different language’s families; while the Arabic language descends from the Semitic language, Malay belongs to the Austronesian (Malayo-Polynesian) family. Hence, the grammatical systems of the two languages differ from each other. Arabic, being a language found in the heart of the dessert, and Malay is the language found in the heart of thick equatorial forests, is another source of vital cultural differences. Consequently, it is expected that this situation will create differences in the ways of how speakers of the two languages perceive the world around them, convey and understand their messages. On the other hand, as the majority of the speakers of Malay language are Muslims, Arabic language found its way in this region; currently, Arabic is widely taught in school, some terms of it found their way in the Malay language. Accordingly, the Arabic language and culture have widely penetrated into the Malay language. This study is proposed with the aim to find out the differences and similarities between the two languages, in the term of the nonverbal communication. The result of this study will be of high significance, as it will help in enhancing the mutual understanding between the speakers of these languages. The comparative analysis approach will be utilized in this study.

Keywords: gesture, Arabic language, Malay language, comparative analysis

Procedia PDF Downloads 534
5120 Finetuned Transformers for Translating Multi Dialect Texts to MSA

Authors: Tahar Alimi, Rahma Boujelbane, Wiem Derouich, Lamia Hadrich Belguith

Abstract:

Machine translation task of low-resourced languages such as Arabic is a challenging task. Despite the appearance of sophisticated models based on the latest deep learning techniques, namely the transfer learning and transformers, all models prove incapable of carrying out an acceptable translation which includes Arabic dialects because they not official status. In this paper, a machine translation model designed to translate Arabic multidialectal content into Modern Standard Arabic (MSA), leveraging both new and existing parallel resources. The latter achieved the best results for both Levantine and Maghrebi dialects with BLEU score of 64.99.

Keywords: Arabic translation, dialect translation, fine-tune, msa translation, transformer, translation

Procedia PDF Downloads 9
5119 A Method for Compression of Short Unicode Strings

Authors: Masoud Abedi, Abbas Malekpour, Peter Luksch, Mohammad Reza Mojtabaei

Abstract:

The use of short texts in communication has been greatly increasing in recent years. Applying different languages in short texts has led to compulsory use of Unicode strings. These strings need twice the space of common strings, hence, applying algorithms of compression for the purpose of accelerating transmission and reducing cost is worthwhile. Nevertheless, other compression methods like gzip, bzip2 or PAQ due to high overhead data size are not appropriate. The Huffman algorithm is one of the rare algorithms effective in reducing the size of short Unicode strings. In this paper, an algorithm is proposed for compression of very short Unicode strings. At first, every new character to be sent to a destination is inserted in the proposed mapping table. At the beginning, every character is new. In case the character is repeated for the same destination, it is not considered as a new character. Next, the new characters together with the mapping value of repeated characters are arranged through a specific technique and specially formatted to be transmitted. The results obtained from an assessment made on a set of short Persian and Arabic strings indicate that this proposed algorithm outperforms the Huffman algorithm in size reduction.

Keywords: Algorithms, Data Compression, Decoding, Encoding, Huffman Codes, Text Communication

Procedia PDF Downloads 320
5118 Contemporary Arabic Novel Probing the Self and the Other: A Contrapuntal Study of Identity, Sexuality, and Fundamentalism

Authors: Jihan Mahmoud

Abstract:

This paper examines the role played by Arabic novelists in revolutionary change in the Arab world, discussing themes of identity, sexuality and fundamentalism as portrayed in a selection of modern and contemporary Arabic novels that are either written in English or translated from Arabic into English. It particularly focuses on the post-Naguib Mahfouz era. Taking my cue from the current political changes in the Arab world, starting with 9/11/ terrorist attacks in the USA and the UK, the ‘Arab Spring’ revolutions, the rise of political Islam and the emergence of Isis, the Islamic state in Iraq and the Levant, the study analyses the differences in the ways contemporary Arab novelists from different Arabic countries represent the interaction between identity, sexual politics and fundamentalist ideas in the Arab world, with a specific focus on the overlap between literature, religion and international politics in the region. It argues that the post-Mahfouz era marked a new phase in the development of the political Arabic novel not only as a force of resistance against political-religious oppression, but as a call for revolution as well. Thus, the Arabic novel reshapes values and prompts future action.

Keywords: Arabic novel, Islam, politics, sexuality

Procedia PDF Downloads 493
5117 Language Literacy Attrition: An Empirical Investigation

Authors: Ahmad Al-Issa

Abstract:

Our world is now operating under the auspices of globalization with its attendant language of ‘global English.' In many parts of the world, the need for English is often accepted without much thought given to native languages. Indeed, this is the current situation in the United Arab Emirates (UAE), with English encroaching into all areas of society, and especially forcefully into the education sector, where English as a medium of instruction (EMI) is on the rise. At the same time, Arabic literacy (i.e., the ability to read and write in Arabic) is declining among the UAE youth. Using a mixed-methods design, a study was conducted to gain insights into the use of Arabic by Emirati University students. The study examines how often Emiratis, males and females, use their native language (Arabic) in their daily lives, how they view their reading and writing skills in Arabic vis-à-vis their English literacy skills, and the extent to which they can demonstrate their literacy skills in Arabic. Clear evidence emerged showing that while Arabic as a dialect continues to be spoken on a daily basis, Arabic literacy is unquestionably losing ground. This was found to be motivated by educational, political, societal, and personal forces. These findings and their implications to language policy and existing bilingualism programs will be discussed. Suggestions for further research will also be made.

Keywords: Arabic, globalization, global English, literacy attrition, United Arab Emirates

Procedia PDF Downloads 263
5116 Using a Character’s Inner Monologue for Song Analysis

Authors: Robert Roznowski

Abstract:

The thought process of the character is never more evident than when singing alone onstage. The composer scores the emotional state and the lyricist voices the inner conflict as the character shares with an audience her or his deepest feelings. It is at these moments that a character may be thought of as voicing her or his inner monologue. Using examples from several musical theatre songs, this presentation will look at a codified approach to analyze a song from a more psychological perspective. Using the clues from the score, traditional character analysis and a psychological-based scoring method an actor may explore more fully inhabit and express the sung and unsung thoughts of the character. The approach yields a richer and more complex approach to acting the song.

Keywords: acting, analysis, musical theatre, psychology

Procedia PDF Downloads 452
5115 Grammatical Parallelism in the Qurʼān

Authors: Yehudit Dror

Abstract:

Parallelism¬, or as it is called in Arabic, al-muqābala, occupies a central position in the rhetorical discipline of ʻilm al-bayān. Parallelism is used as a figure of textual ornamentation or embellishment and can be divided into several types that are based on the semantics of parallelism and its formative structure. Parallelism in Arabic has received a considerable amount of attention from the Arab rhetorician, which enables understanding the essence of parallelism in Arabic – its types, structure and meaning. However, there are some lacunae in their descriptions concerning the function and thematic restrictions of parallelism in the Qur’ān. In my presentation, which focuses on grammatical parallelism where the two stichos of the parallelism are the same with respect to syntax and morphology, I will show that parallelism has some important roles in the textual arrangement; it may, for example, conclude a thematic section, indicate a turning point in the text or to clarify what has been said previously. In addition, it will be shown that parallelism is not used randomly in the Qurʼān but rather is restricted to repeated themes which carry the most important messages of the Qurʼān, such as God's Might or behavioral patterns of the believers and the non-believers; or it can be used as a stylistic device.

Keywords: grammatical parallelism, half-line, symmetry, Koran

Procedia PDF Downloads 303
5114 Poem and Novel Translations from Arabic to Turkish Done between the Years of 1980-2015

Authors: Gürkan Dağbaşı

Abstract:

Translation is a vitally important activity like as the expression the thought and emotions of humanbeing, providing reciprocal cultural transfer, shaping future by establishing a connection with the past, and like as being exist in an other language. Translation is also an important instrument providing cross-cultural coalescence between nations. Although the first translations from Arabic to Turkish was restricted to only religious texts, over time, the importance of translation was found out via translations of works about literature. Later on, some literature genres like novel and poems were also translated from Arabic to Turkish. Works of many men of Arabic literature were translated to Turkish, including Nejib Mahfuz, owner of Nobel Prize, Tawfiq al-Hakim, Adonis, Gibran Khalil Gibran and etc. In this study, novels and poems translated from Arabic to Turkish between 1980-2015 years are examined.

Keywords: poem, novel, Arabic, translation

Procedia PDF Downloads 339
5113 The Relationship between Brand Recall and Brand Attitude in Advergame

Authors: Azaze-Azizi Abdul Adis, Hyung Jun Kim, Mohamad Rizwan Abdul Majid, Zaiton Osman, Izyanti Awang Razli

Abstract:

The increase of online advertising, specifically advergame has become a popular method of strengthening consumer brand recognition by inserting attractive characters and enhancing entertainment value. There have been several remarkable studies on spokes-characters in advertising effectiveness. However, few studies have examined the link between character presence and consumers' brand recall and attitude in advergame. Moreover, how the entertainment value of an advergame influences brand recall and brand attitude and the mediating role of brand recall in influencing character presence and entertainment on brand attitude are still lacking in the advergaming literature. An online survey was conducted with 366 Malaysian gamers. Using structural equation modeling, the results showed that character presence had no influence but entertainment value had a positive influence on brand recall and brand attitude. This study confirmed the role of brand recall as a mediator of the effect of between entertainment and brand attitude in advergame.

Keywords: character presence, entertainment, brand recall, brand attitude, advergame

Procedia PDF Downloads 510
5112 Arabic Language in Modern Era: Some Challenges

Authors: Tajudeen Yusuf

Abstract:

Arabic language and its instruction occupy a prominent status in the contemporary world, especially in academic and research institutions. Arabic, like other international languages, consolidates understanding among people of different nations and societies. It is a promising medium of sharing thoughts and feelings. As a means of communication and interaction, the language has gained its outstanding status since ancient times, especially because of the relationship it maintains with Islam and its heritage. Adding to its importance is the rapid growth and advancement of Science and Technology in the contemporary Era which has eventually made communication between human societies all over the world inevitable. Despite, the Arabic language still experiences many challenges especially in some area such as irrelevant textbooks and other teaching materials, old versions of teaching methods and inadequate teachers who professionally trained. Eventually, these have resulted in difficulties in the teaching and learning of the language. Therefore, urgent and necessary measures to enhance the teaching and learning of Arabic language within and outside Arab countries are therefore needed to be taken.

Keywords: Arabic, language, challenges, modern era

Procedia PDF Downloads 567
5111 A Recognition Method of Ancient Yi Script Based on Deep Learning

Authors: Shanxiong Chen, Xu Han, Xiaolong Wang, Hui Ma

Abstract:

Yi is an ethnic group mainly living in mainland China, with its own spoken and written language systems, after development of thousands of years. Ancient Yi is one of the six ancient languages in the world, which keeps a record of the history of the Yi people and offers documents valuable for research into human civilization. Recognition of the characters in ancient Yi helps to transform the documents into an electronic form, making their storage and spreading convenient. Due to historical and regional limitations, research on recognition of ancient characters is still inadequate. Thus, deep learning technology was applied to the recognition of such characters. Five models were developed on the basis of the four-layer convolutional neural network (CNN). Alpha-Beta divergence was taken as a penalty term to re-encode output neurons of the five models. Two fully connected layers fulfilled the compression of the features. Finally, at the softmax layer, the orthographic features of ancient Yi characters were re-evaluated, their probability distributions were obtained, and characters with features of the highest probability were recognized. Tests conducted show that the method has achieved higher precision compared with the traditional CNN model for handwriting recognition of the ancient Yi.

Keywords: recognition, CNN, Yi character, divergence

Procedia PDF Downloads 136
5110 Study of Hydrocarbons Metering Issues in Algerian Fields under the New Law Context

Authors: A. Hadjadj, S. Maamir

Abstract:

Since the advent of the law 86/14 concerning the
exploitation of the national territory by foreign companies in
partnership with the Algerian oil and gas company, the problem of
hydrocarbons metering in the sharing production come out.
More generally, good management counting hydrocarbons can
provide data on the production wells, the field and the reservoir for
medium and long term planning, particularly in the context of the
management and field development.
In this work, we are interested in the transactional metering which
is a very delicate and crucial period in the current context of the new
hydrocarbon’s law characterized by assets system between the
various activities of Sonatrach and its foreign partners.
After a state of the art on hydrocarbons metering devices in
Algeria and elsewhere, we will decline the advantages and
disadvantages of each system, and then we describe the problem to
try to reach an optimal solution.

Keywords: transactional metering, flowmeter orifice, heat flow, Sonatrach

Procedia PDF Downloads 334
5109 Importance of Positive Education: A Focus on the Importance of Character Strength Building

Authors: Hajra Hussain

Abstract:

Positive education, the inclusion of social, emotional and intellectual skills across a curriculum, is fundamental to the optimal functioning of young people in any society because it combines the best teaching practices with the principles of positive psychology. While learning institutions foster academic skills, little attention is being paid to the identification and development of character strengths and their integration into teaching. There is an increasing recognition of the important role education plays in equipping today’s youth with 21st century social skills. For youth to succeed in this highly competitive environment, there is a need for positive education that is focused on character strengths such as the growth of social, emotional and intellectual skills that promote the flourishing of well-rounded individuals. Character strength programs and awareness are a necessity if the human capital within a region is to be competitive, productive and happy. The Counselling & Wellbeing Centre at Amity University Dubai has consistently implemented Character Strength awareness workshops and has found that such workshops have increased student life satisfaction due to individual awareness of signature strengths. A positive education/positive psychology framework with its key focus on the development of character strengths can be fundamental to individual's confidence and self-awareness; thus allowing both optimum flourishing and functioning.

Keywords: positive psychology, positive education, strengths, youth, happiness

Procedia PDF Downloads 249
5108 Neural Correlates of Arabic Digits Naming

Authors: Fernando Ojedo, Alejandro Alvarez, Pedro Macizo

Abstract:

In the present study, we explored electrophysiological correlates of Arabic digits naming to determine semantic processing of numbers. Participants named Arabic digits grouped by category or intermixed with exemplars of other semantic categories while the N400 event-related potential was examined. Around 350-450 ms after the presentation of Arabic digits, brain waves were more positive in anterior regions and more negative in posterior regions when stimuli were grouped by category relative to the mixed condition. Contrary to what was found in other studies, electrophysiological results suggested that the production of numerals involved semantic mediation.

Keywords: Arabic digit naming, event-related potentials, semantic processing, number production

Procedia PDF Downloads 548
5107 Saudi Twitter Corpus for Sentiment Analysis

Authors: Adel Assiri, Ahmed Emam, Hmood Al-Dossari

Abstract:

Sentiment analysis (SA) has received growing attention in Arabic language research. However, few studies have yet to directly apply SA to Arabic due to lack of a publicly available dataset for this language. This paper partially bridges this gap due to its focus on one of the Arabic dialects which is the Saudi dialect. This paper presents annotated data set of 4700 for Saudi dialect sentiment analysis with (K= 0.807). Our next work is to extend this corpus and creation a large-scale lexicon for Saudi dialect from the corpus.

Keywords: Arabic, sentiment analysis, Twitter, annotation

Procedia PDF Downloads 592
5106 Towards Logical Inference for the Arabic Question-Answering

Authors: Wided Bakari, Patrice Bellot, Omar Trigui, Mahmoud Neji

Abstract:

This article constitutes an opening to think of the modeling and analysis of Arabic texts in the context of a question-answer system. It is a question of exceeding the traditional approaches focused on morphosyntactic approaches. Furthermore, we present a new approach that analyze a text in order to extract correct answers then transform it to logical predicates. In addition, we would like to represent different levels of information within a text to answer a question and choose an answer among several proposed. To do so, we transform both the question and the text into logical forms. Then, we try to recognize all entailment between them. The results of recognizing the entailment are a set of text sentences that can implicate the user’s question. Our work is now concentrated on an implementation step in order to develop a system of question-answering in Arabic using techniques to recognize textual implications. In this context, the extraction of text features (keywords, named entities, and relationships that link them) is actually considered the first step in our process of text modeling. The second one is the use of techniques of textual implication that relies on the notion of inference and logic representation to extract candidate answers. The last step is the extraction and selection of the desired answer.

Keywords: NLP, Arabic language, question-answering, recognition text entailment, logic forms

Procedia PDF Downloads 310
5105 Using Mining Methods of WEKA to Predict Quran Verb Tense and Aspect in Translations from Arabic to English: Experimental Results and Analysis

Authors: Jawharah Alasmari

Abstract:

In verb inflection, tense marks past/present/future action, and aspect marks progressive/continues perfect/completed actions. This usage and meaning of tense and aspect differ in Arabic and English. In this research, we applied data mining methods to test the predictive function of candidate features by using our dataset of Arabic verbs in-context, and their 7 translations. Weka machine learning classifiers is used in this experiment in order to examine the key features that can be used to provide guidance to enable a translator’s appropriate English translation of the Arabic verb tense and aspect.

Keywords: Arabic verb, English translations, mining methods, Weka software

Procedia PDF Downloads 246
5104 Arabic as a Foreign Language in the Curriculum of Higher Education in Nigeria: Problems, Solutions, and Prospects

Authors: Kazeem Oluwatoyin Ajape

Abstract:

The study is concerned with the problem of how to improve the teaching of Arabic as a foreign language in Nigerian Higher Education System. The paper traces the historical background of Arabic education in Nigeria and also outlines the problems facing the language in Nigerian Institutions. It lays down some of the essential foundation work necessary for bringing about systematic and constructive improvements in the Teaching of Arabic as a Foreign Language (TAFL) by giving answers to the following research questions: what is the appropriate medium of instruction in teaching a foreign or second language? What is the position of English language in the teaching and learning of Arabic/Islamic education? What is the relevance of the present curriculum of Arabic /Islamic education in Nigerian institutions to the contemporary society? A survey of the literature indicates that a revolution is currently taking place in FL teaching and that a new approach known as the Communicative Approach (CA), has begun to emerge and influence the teaching of FLs in general, over the last decade or so. Since the CA is currently being adapted to the teaching of most major FLs and since this revolution has not yet had much impact on TAPL, the study explores the possibility of the application of the CA to the teaching of Arabic as a living language and also makes recommendations towards the development of the language in Nigerian Institutions of Higher Learning.

Keywords: Arabic Language, foreign language, Nigerian institutions, curriculum, communicative approach

Procedia PDF Downloads 579
5103 Visual Speech Perception of Arabic Emphatics

Authors: Maha Saliba Foster

Abstract:

Speech perception has been recognized as a bi-sensory process involving the auditory and visual channels. Compared to the auditory modality, the contribution of the visual signal to speech perception is not very well understood. Studying how the visual modality affects speech recognition can have pedagogical implications in second language learning, as well as clinical application in speech therapy. The current investigation explores the potential effect of speech visual cues on the perception of Arabic emphatics (AEs). The corpus consists of 36 minimal pairs each containing two contrasting consonants, an AE versus a non-emphatic (NE). Movies of four Lebanese speakers were edited to allow perceivers to have partial view of facial regions: lips only, lips-cheeks, lips-chin, lips-cheeks-chin, lips-cheeks-chin-neck. In the absence of any auditory information and relying solely on visual speech, perceivers were above chance at correctly identifying AEs or NEs across vowel contexts; moreover, the models were able to predict the probability of perceivers’ accuracy in identifying some of the COIs produced by certain speakers; additionally, results showed an overlap between the measurements selected by the computer and those selected by human perceivers. The lack of significant face effect on the perception of AEs seems to point to the lips, present in all of the videos, as the most important and often sufficient facial feature for emphasis recognition. Future investigations will aim at refining the analyses of visual cues used by perceivers by using Principal Component Analysis and including time evolution of facial feature measurements.

Keywords: Arabic emphatics, machine learning, speech perception, visual speech perception

Procedia PDF Downloads 272
5102 The Syllable Structure and Syllable Processes in Suhwa Arabic: An Autosegmental Analysis

Authors: Muhammad Yaqub Olatunde

Abstract:

Arabic linguistic science is redirecting its focus towards the analysis and description of social, regional, and temporal varieties of social, regional, and temporal varieties in order to show how they vary in pronunciation, vocabulary, and grammar. This is not to say that the traditional Arabic linguists did not mention scores of dialectical variations but such works focused on the geographical boundaries of the Arabic speaking countries. There is need for a comprehensive survey of various Arabic dialects within the boundary of Arabic speaking countries and outside showing both the similarities and differences of linguistic and extra linguistic elements. This study therefore examines the syllable structure and process in noun and verb in the shuwa Arabic dialect speaking in North East Nigeria [mainly in Borno state]. The work seeks to establish the facts about this phenomenon, using auto- segmental analysis. These facts are compared, where necessary; using possible alternative analysis, with what operate in other related dialects within and outside Arabic speaking countries. The interaction between epenthesis and germination in the language also generate an interesting issue. The paper then conclude that syllable structure and process in the language need to recognize the existence of complex onset and a complex rhyme producing a consonant cluster in the former and a closed syllable in the letter. This emerges as result of resyllabification, which is motivated by these processes.

Keywords: Arabic, dialect, linguistics, processes, resyllabification

Procedia PDF Downloads 395
5101 The Artificial Intelligence Technologies Used in PhotoMath Application

Authors: Tala Toonsi, Marah Alagha, Lina Alnowaiser, Hala Rajab

Abstract:

This report is about the Photomath app, which is an AI application that uses image recognition technology, specifically optical character recognition (OCR) algorithms. The (OCR) algorithm translates the images into a mathematical equation, and the app automatically provides a step-by-step solution. The application supports decimals, basic arithmetic, fractions, linear equations, and multiple functions such as logarithms. Testing was conducted to examine the usage of this app, and results were collected by surveying ten participants. Later, the results were analyzed. This paper seeks to answer the question: To what level the artificial intelligence features are accurate and the speed of process in this app. It is hoped this study will inform about the efficiency of AI in Photomath to the users.

Keywords: photomath, image recognition, app, OCR, artificial intelligence, mathematical equations.

Procedia PDF Downloads 136
5100 Enhanced Arabic Semantic Information Retrieval System Based on Arabic Text Classification

Authors: A. Elsehemy, M. Abdeen , T. Nazmy

Abstract:

Since the appearance of the Semantic web, many semantic search techniques and models were proposed to exploit the information in ontology to enhance the traditional keyword-based search. Many advances were made in languages such as English, German, French and Spanish. However, other languages such as Arabic are not fully supported yet. In this paper we present a framework for ontology based information retrieval for Arabic language. Our system consists of four main modules, namely query parser, indexer, search and a ranking module. Our approach includes building a semantic index by linking ontology concepts to documents, including an annotation weight for each link, to be used in ranking the results. We also augmented the framework with an automatic document categorizer, which enhances the overall document ranking. We have built three Arabic domain ontologies: Sports, Economic and Politics as example for the Arabic language. We built a knowledge base that consists of 79 classes and more than 1456 instances. The system is evaluated using the precision and recall metrics. We have done many retrieval operations on a sample of 40,316 documents with a size 320 MB of pure text. The results show that the semantic search enhanced with text classification gives better performance results than the system without classification.

Keywords: Arabic text classification, ontology based retrieval, Arabic semantic web, information retrieval, Arabic ontology

Procedia PDF Downloads 498
5099 Off-Line Text-Independent Arabic Writer Identification Using Optimum Codebooks

Authors: Ahmed Abdullah Ahmed

Abstract:

The task of recognizing the writer of a handwritten text has been an attractive research problem in the document analysis and recognition community with applications in handwriting forensics, paleography, document examination and handwriting recognition. This research presents an automatic method for writer recognition from digitized images of unconstrained writings. Although a great effort has been made by previous studies to come out with various methods, their performances, especially in terms of accuracy, are fallen short, and room for improvements is still wide open. The proposed technique employs optimal codebook based writer characterization where each writing sample is represented by a set of features computed from two codebooks, beginning and ending. Unlike most of the classical codebook based approaches which segment the writing into graphemes, this study is based on fragmenting a particular area of writing which are beginning and ending strokes. The proposed method starting with contour detection to extract significant information from the handwriting and the curve fragmentation is then employed to categorize the handwriting into Beginning and Ending zones into small fragments. The similar fragments of beginning strokes are grouped together to create Beginning cluster, and similarly, the ending strokes are grouped to create the ending cluster. These two clusters lead to the development of two codebooks (beginning and ending) by choosing the center of every similar fragments group. Writings under study are then represented by computing the probability of occurrence of codebook patterns. The probability distribution is used to characterize each writer. Two writings are then compared by computing distances between their respective probability distribution. The evaluations carried out on ICFHR standard dataset of 206 writers using Beginning and Ending codebooks separately. Finally, the Ending codebook achieved the highest identification rate of 98.23%, which is the best result so far on ICFHR dataset.

Keywords: off-line text-independent writer identification, feature extraction, codebook, fragments

Procedia PDF Downloads 484
5098 Performance Evaluation of an Ontology-Based Arabic Sentiment Analysis

Authors: Salima Behdenna, Fatiha Barigou, Ghalem Belalem

Abstract:

Due to the quick increase in the volume of Arabic opinions posted on various social media, Arabic sentiment analysis has become one of the most important areas of research. Compared to English, there is very little works on Arabic sentiment analysis, in particular aspect-based sentiment analysis (ABSA). In ABSA, aspect extraction is the most important task. In this paper, we propose a semantic aspect-based sentiment analysis approach for standard Arabic reviews to extract explicit aspect terms and identify the polarity of the extracted aspects. The proposed approach was evaluated using HAAD datasets. Experiments showed that the proposed approach achieved a good level of performance compared with baseline results. The F-measure was improved by 19% for the aspect term extraction tasks and 55% aspect term polarity task.

Keywords: sentiment analysis, opinion mining, Arabic, aspect level, opinion, polarity

Procedia PDF Downloads 135
5097 Fight against Money Laundering with Optical Character Recognition

Authors: Saikiran Subbagari, Avinash Malladhi

Abstract:

Anti Money Laundering (AML) regulations are designed to prevent money laundering and terrorist financing activities worldwide. Financial institutions around the world are legally obligated to identify, assess and mitigate the risks associated with money laundering and report any suspicious transactions to governing authorities. With increasing volumes of data to analyze, financial institutions seek to automate their AML processes. In the rise of financial crimes, optical character recognition (OCR), in combination with machine learning (ML) algorithms, serves as a crucial tool for automating AML processes by extracting the data from documents and identifying suspicious transactions. In this paper, we examine the utilization of OCR for AML and delve into various OCR techniques employed in AML processes. These techniques encompass template-based, feature-based, neural network-based, natural language processing (NLP), hidden markov models (HMMs), conditional random fields (CRFs), binarizations, pattern matching and stroke width transform (SWT). We evaluate each technique, discussing their strengths and constraints. Also, we emphasize on how OCR can improve the accuracy of customer identity verification by comparing the extracted text with the office of foreign assets control (OFAC) watchlist. We will also discuss how OCR helps to overcome language barriers in AML compliance. We also address the implementation challenges that OCR-based AML systems may face and offer recommendations for financial institutions based on the data from previous research studies, which illustrate the effectiveness of OCR-based AML.

Keywords: anti-money laundering, compliance, financial crimes, fraud detection, machine learning, optical character recognition

Procedia PDF Downloads 108
5096 Students' Perceptions of Social Media as a Means to Improve Their Language Skills

Authors: Bahia Braktia, Ana Marcela Montenegro Sanchez

Abstract:

Social media, such as Facebook, Twitter, and YouTube, has been used for teaching and learning for quite some time. These platforms have been proven to be a good tool to improve various language skills, students’ performance of the English language, motivation as well as trigger the authentic language interaction. However, little is known about the potential effects of social media usage on the learning performance of Arabic language learners. The present study explores the potential role that the social media technologies play in learning Arabic as a foreign language at a university in Southeast of United States. In order to investigate this issue, an online survey was administered to examine the perceptions and attitudes of American students learning Arabic. The research questions were: How does social media, specifically Facebook and Twitter, impact the students' Arabic language skills, and what is their attitude toward it? The preliminary findings of the study showed that students had a positive attitude toward the use of social media to enhance their Arabic language skills, and that they used a range of social media features to expose themselves to the Arabic language and communicate in Arabic with native Arabic speaking friends. More detailed findings will be shared in the light data analysis with the audience during the presentation.

Keywords: foreign language learning, social media, students’ perceptions, survey

Procedia PDF Downloads 171
5095 Promotion of the Arabic language in India: MES Mampad College - A Torchbearer

Authors: Junaid C, Sabique MK

Abstract:

Introduction: MES Mamapd College is an autonomous college established in 1964 affiliated with the University of Calicut run by the Muslim Educational Society Kerala. The department of Arabic of the college is having a pivotal role in promoting Arabic language learning, teaching, research, and other allied academic activities. State of Problem: Department of Arabic of the college introduced before the academic committee the culture of international seminars. The department connected the academic community with foreign scholars and introduced industry-academia collaboration programs which are beneficial to the job seekers. These practices and innovations should be documented. Objectives: Create awareness of innovative practices implemented for the promotion of the Arabic language. Infuse confidence in learners in learning of Arabic language. Showcase the distinctive academic programs initiated by the department Methodology: Data will be collected from archives, souvenirs, and reports. Survey methods and interviews with authorities and beneficiaries will be collected for the data analysis. Major results: MES Mampad College introduced before its stakeholders different unique academic practices related to the Arabic language and literature. When the unprecedented pandemic situation pulled back all of the academic community, the department come forward with numerous academic initiatives utilizing the virtual space. Both arenas will be documented. Conclusion: This study will help to make awareness on the promotion of the Arabic language studies and related practices initiated by the department of Arabic MES Mampad College. These practices and innovations can be modeled and replicated.

Keywords: teaching Arabic language, MES mampad college, Arabic webinars, pandemic impacts in literature

Procedia PDF Downloads 62