Search results for: multilingual corpora
253 Anxiety Caused by the Single Mode of Instruction in Multilingual Classrooms: The Case of African Language Learners
Authors: Stanle Madonsela
Abstract:
For learning to take place effectively, learners have to use language. Language becomes a critical tool by which to communicate, to express feelings, desires and thoughts, and most of all to learn. However, each individual’s capacity to use language is unique. In multilingual countries, classrooms usually comprise learners from different language backgrounds, and therefore the language used for teaching and learning requires rethinking. Interaction in the classroom, if done in a language that is understood by the learners, could maximise the outcomes of learning. This paper explores the extent to which the use of a single code becomes a source of anxiety to learners in multilingual classrooms in South African schools. It contends that a multilingual approach in the learning process should be explored in order to promote learner autonomy in the learning process.Keywords: anxiety, classroom, foreign language teaching, multilingual
Procedia PDF Downloads 538252 Adjectives in Academic Discourse: A Comparative Study of Research Articles
Authors: Beata Grymska
Abstract:
The research studies on academic discourse focus in general on lexical bundles, epistemic modality markers, or interactions between writers and readers. Following the research into the written forms of the academic community, this study concentrates on adjectives in research articles. The study investigates the distribution of adjectives in research articles in two academic disciplines: linguistics and medicine. It is corpus-based in design and consists of 100 linguistic and 100 medical research articles all written in English. The aim of the study is to compare the distribution of adjectives between the two corpora and four main parts of articles: IMRD (Introduction, Methods, Results, and Discussion). The second aim is to see if the two corpora share common core adjectives, e.g., different, important, specific, and if there are discipline-specific adjectives. The further part of the paper elaborates on adjectives use in the corpora together with examples. The results indicate that the two corpora do not differ in the distribution of adjectives to a great extent. The occurrences of the most frequently used adjectives depend on the academic discipline of the research articles. The concluding part reflects upon the role of adjectives in academic discourse and also presents how corpora can be helpful in composing academic texts.Keywords: academic discourse, academic texts, adjectives, corpus analysis, research articles
Procedia PDF Downloads 191251 Applied Linguistics: Language, Corpora, and Technology
Authors: M. Imran
Abstract:
This research explores the intersections of applied linguistics, corpus linguistics, translation, and technology, aiming to present innovative cross-disciplinary tools and frameworks. It highlights significant contributions to language, corpora, and technology within applied linguistics, which deepen our understanding of these domains and provide practical resources for scholars, educators, and translators. By showcasing these advancements, the study seeks to enhance collaboration and application in language-related fields. The significance of applied linguistics is emphasized by some of the research that has been emphasized, which presents pedagogical perspectives that could enhance instruction and the learning results of student’s at all academic levels as well as translation trainees. Researchers provided useful data from language studies with classroom applications from an instructional standpoint.Keywords: linguistics, language, corpora, technology
Procedia PDF Downloads 20250 The Challenge of Teaching French as a Foreign Language in a Multilingual Community
Authors: Carol C. Opara, Olukemi E. Adetuyi-Olu-Francis
Abstract:
The teaching of French language, like every other language, has its numerous challenges. A multilingual community, however, is a linguistic environment housing diverse languages, each with its peculiarity, both pros, and cones. A foreign language will have to strive hard for survival in an environment where various indigenous languages, as well as an established official language, exist. This study examined the challenges and prospects of the teaching of French as a foreign language in a multilingual community. A 22-item questionnaire was used to elicit information from 40 Nigerian Secondary school teachers of French. One of the findings of this study showed that the teachers of the French language are not motivated. Also, the linguistic environment is not favourable for the teaching and learning of French language in Nigeria. One of the recommendations was that training and re-training of teachers of French should be of utmost importance to the Nigerian Federal Ministry of Education.Keywords: challenges, french as foreign language, multilingual community, teaching
Procedia PDF Downloads 223249 Beyond Replicating Linguistic Elements: Novel Concept Combinations in Multilingual Children
Authors: Xiao-lei Wang
Abstract:
The Novel Concept Combination (NCC) refers to the unique ability of multilingual children to creatively merge and integrate different linguistic and cultural elements to form innovative and original concepts. Children raised with more than one language often exhibit this skill in their daily communication, such as creating innovative metaphors that enrich their communication, showcasing their creativity in conveying the essence of their messages. This paper explores NCC abilities in multilingual children by focusing on two male trilingual siblings exposed to Chinese, French, and English from birth. The siblings were observed for 19 years in their daily context. Seventy-six hours of video-recorded data were used for this study (38 hours for each participant). A coding scheme developed by Wang et al. was employed to code the recorded data. The results suggest that these multilingual siblings proportionally increased their NCC skills over the years, emerging at age 3 and peaking at age 15. The characteristic of their NCC lies in their capacity to not merely replicate linguistic elements of different languages but to recreate, reshape, and reconstruct novel ideas in communication, enriching their interactions. The paper also addresses the educational implications for educators and parents, emphasizing the importance of valuing these novel ideas in everyday environments to encourage NCC development. This, in turn, contributes to cognitive and social development.Keywords: multilingual children, novel concept combination, multilingual creativity, linguistic richness
Procedia PDF Downloads 68248 Multiple Identity Construction among Multilingual Minorities: A Quantitative Sociolinguistic Case Study
Authors: Stefanie Siebenhütter
Abstract:
This paper aims to reveal criterions involved in the process of identity-forming among multilingual minority language speakers in Northeastern Thailand and in the capital Bangkok. Using sociolinguistic interviews and questionnaires, it is asked which factors are important for speakers and how they define their identity by their interactions socially as well as linguistically. One key question to answer is how sociolinguistic factors may force or diminish the process of forming social identity of multilingual minority speakers. However, the motivation for specific language use is rarely overt to the speaker’s themselves as well as to others. Therefore, identifying the intentions included in the process of identity construction is to approach by scrutinizing speaker’s behavior and attitudes. Combining methods used in sociolinguistics and social psychology allows uncovering the tools for identity construction that ethnic Kui uses to range themselves within a multilingual setting. By giving an overview of minority speaker’s language use in context of the specific border near multilingual situation and asking how speakers construe identity within this spatial context, the results exhibit some of the subtle and mostly unconscious criterions involved in the ongoing process of identity construction.Keywords: social identity, identity construction, minority language, multilingualism, social networks, social boundaries
Procedia PDF Downloads 268247 Using Corpora in Semantic Studies of English Adjectives
Authors: Oxana Lukoshus
Abstract:
The methods of corpus linguistics, a well-established field of research, are being increasingly applied in cognitive linguistics. Corpora data are especially useful for different quantitative studies of grammatical and other aspects of language. The main objective of this paper is to demonstrate how present-day corpora can be applied in semantic studies in general and in semantic studies of adjectives in particular. Polysemantic adjectives have been the subject of numerous studies. But most of them have been carried out on dictionaries. Undoubtedly, dictionaries are viewed as one of the basic data sources, but only at the initial steps of a research. The author usually starts with the analysis of the lexicographic data after which s/he comes up with a hypothesis. In the research conducted three polysemantic synonyms true, loyal, faithful have been analyzed in terms of differences and similarities in their semantic structure. A corpus-based approach in the study of the above-mentioned adjectives involves the following. After the analysis of the dictionary data there was the reference to the following corpora to study the distributional patterns of the words under study – the British National Corpus (BNC) and the Corpus of Contemporary American English (COCA). These corpora are continually updated and contain thousands of examples of the words under research which make them a useful and convenient data source. For the purpose of this study there were no special needs regarding genre, mode or time of the texts included in the corpora. Out of the range of possibilities offered by corpus-analysis software (e.g. word lists, statistics of word frequencies, etc.), the most useful tool for the semantic analysis was the extracting a list of co-occurrence for the given search words. Searching by lemmas, e.g. true, true to, and grouping the results by lemmas have proved to be the most efficient corpora feature for the adjectives under the study. Following the search process, the corpora provided a list of co-occurrences, which were then to be analyzed and classified. Not every co-occurrence was relevant for the analysis. For example, the phrases like An enormous sense of responsibility to protect the minds and hearts of the faithful from incursions by the state was perceived to be the basic duty of the church leaders or ‘True,’ said Phoebe, ‘but I'd probably get to be a Union Official immediately were left out as in the first example the faithful is a substantivized adjective and in the second example true is used alone with no other parts of speech. The subsequent analysis of the corpora data gave the grounds for the distribution groups of the adjectives under the study which were then investigated with the help of a semantic experiment. To sum it up, the corpora-based approach has proved to be a powerful, reliable and convenient tool to get the data for the further semantic study.Keywords: corpora, corpus-based approach, polysemantic adjectives, semantic studies
Procedia PDF Downloads 315246 Undergraduate Students’ Learning Experience and Practices in Multilingual Higher Education Institutions: The Case of the University of Luxembourg
Authors: Argyro Maria Skourmalla
Abstract:
The present paper draws on the example of the University of Luxembourg as a multilingual and international setting. The University of Luxembourg, which is located between France, Germany, and Belgium, has adopted a new multilingualism policy in 2020, establishing English, French, German, and Luxembourgish as the official languages of the Institution. With around 7.000 students, more than half of which are international students, the University is a meeting point for languages and cultures. This paper includes data from an online survey that with undergraduate students from different disciplines at the University of Luxembourg. Students shared their personal experience and opinions regarding language use in this higher education context, as well as practices they use in learning in this multilingual context. Findings show the role of technology in assisting students in different aspects of learning this multilingual context. At the same time, more needs to be done to avoid an exclusively monolingual paradigm in higher education. Findings also show that some languages remain ‘unseen’ in this context. Overall, even though linguistic diversity in this University is seen as an asset, a lot needs to be done towards the recognition of staff and students’ linguistic repertoires for inclusion and education equity.Keywords: higher education, learning, linguistic diversity, multilingual practices
Procedia PDF Downloads 66245 The Construction of Multilingual Online Gaming Community
Authors: Dina Alnefaie
Abstract:
This poster presents a study of a Discord private server with thirteen multilingual gamers, aiming to explore the elements that construct a multilingual online gaming community. The study focuses on the communication practices of four Saudi female and male gamers, using various data collection methods, including online observations through recorded videos and screenshots, interviews, and informal conversations for one year. The primary findings show that translanguaging was a prominent feature of their verbal and textual communication practices. Besides, these practices that mostly accompany cultural ones were used to facilitate communication and express their identities in an intercultural context.Keywords: online community construction, perceptions, multilingualism, digital identity
Procedia PDF Downloads 85244 Leveraging Large Language Models to Build a Cutting-Edge French Word Sense Disambiguation Corpus
Authors: Mouheb Mehdoui, Amel Fraisse, Mounir Zrigui
Abstract:
With the increasing amount of data circulating over the Web, there is a growing need to develop and deploy tools aimed at unraveling semantic nuances within text or sentences. The challenges in extracting precise meanings arise from the complexity of natural language, while words usually have multiple interpretations depending on the context. The challenge of precisely interpreting words within a given context is what the task of Word Sense Disambiguation meets. It is a very old domain within the area of Natural Language Processing aimed at determining a word’s meaning that it is going to carry in a particular context, hence increasing the correctness of applications processing the language. Numerous linguistic resources are accessible online, including WordNet, thesauri, and dictionaries, enabling exploration of diverse contextual meanings. However, several limitations persist. These include the scarcity of resources for certain languages, a limited number of examples within corpora, and the challenge of accurately detecting the topic or context covered by text, which significantly impacts word sense disambiguation. This paper will discuss the different approaches to WSD and review corpora available for this task. We will contrast these approaches, highlighting the limitations, which will allow us to build a corpus in French, targeted for WSD.Keywords: semantic enrichment, disambiguation, context fusion, natural language processing, multilingual applications
Procedia PDF Downloads 15243 Teachers Handbook: A Key to Imparting Teaching in Multilingual Classrooms at Kalinga Institute of Social Sciences (KISS)
Authors: Sushree Sangita Mohanty
Abstract:
The pedagogic system, which is used to work with indigenous groups, who have equally different socio-economic, socio-cultural & multi-lingual conditions with differing cognitive capabilities, makes the education situation complex. As a result, educating the indigenous people became just the dissemination of facts and information, but advancement in knowledge and possibilities somewhere hides. This gap arises complexities due to the language barrier and the teachers from a conventional background of teaching practices are unable to understand or connect with the students in the schools. This paper presents the research work of the Mother Tongue Based Multilingual Education (MTB-MLE) project that has developed a creative pedagogic endeavor for the students of Kalinga Institute of Social Sciences (KISS) for facilitating Multilingual Education (MLE) teaching. KISS is a home for 25,000 indigenous children. The students enrolled here are from 62 different indigenous communities who speak around 24 different languages with geographical articulation. The book contents include concept, understanding languages, similitudes among languages, the need of mother tongue in teaching and learning, skill development (Listening-Speaking-Reading-Writing), teachers activities for teaching in multilingual schools, the process of teaching, training format of multilingual teaching and procedures for basic data collection regarding multilingual schools and classroom handle.Keywords: indigenous, multi-lingual, pedagogic, teachers, teaching practices
Procedia PDF Downloads 291242 Sentiment Analysis: Comparative Analysis of Multilingual Sentiment and Opinion Classification Techniques
Authors: Sannikumar Patel, Brian Nolan, Markus Hofmann, Philip Owende, Kunjan Patel
Abstract:
Sentiment analysis and opinion mining have become emerging topics of research in recent years but most of the work is focused on data in the English language. A comprehensive research and analysis are essential which considers multiple languages, machine translation techniques, and different classifiers. This paper presents, a comparative analysis of different approaches for multilingual sentiment analysis. These approaches are divided into two parts: one using classification of text without language translation and second using the translation of testing data to a target language, such as English, before classification. The presented research and results are useful for understanding whether machine translation should be used for multilingual sentiment analysis or building language specific sentiment classification systems is a better approach. The effects of language translation techniques, features, and accuracy of various classifiers for multilingual sentiment analysis is also discussed in this study.Keywords: cross-language analysis, machine learning, machine translation, sentiment analysis
Procedia PDF Downloads 715241 Particular Features of the First Romanian Multilingual Dictionaries
Authors: Mihaela Mocanu
Abstract:
The Romanian multilingual dictionaries – also named polyglot, plurilingual or polylingual dictionaries, have known a slow yet constant development starting with the end of the 17th century, when the first such work is attested, to the present time, when we witness a considerable increase of the number of polyglot dictionaries, especially the terminological ones. This paper aims at analyzing the context in which the first Romanian multilingual dictionaries were issued, as well as and the organization and structure particularities of the first lexicographic works of this type. The irretrievable loss of some of these works as well as the partial conservation of others renders the attempt to retrace the beginnings of Romanian lexicography extremely difficult. The research methodology is part of a descriptive and analytical approach based on two types of sources, subject to contrastive analysis: the notes made by the initiators of lexicographic projects and the testimonies of their contemporaries, respectively, along with the specialized studies regarding the history of the old Romanian lexicography. The analysis of the contents has indicated that these dictionaries lacked a scientific apparatus in the true sense of the phrase, failed to obey unitary organizational criteria, being limited, most of the times, to mere inventories of words, where the Romanian term was assigned its correspondent in other languages. Motivated by practical reasons, the first multilingual dictionaries were aimed at the clerics their purpose being to ensure the translators’ fidelity towards the original religious texts, regarded as sacred.Keywords: Romanian lexicography, multilingual dictionary, terminology, language
Procedia PDF Downloads 292240 Fostering Inclusive Learning: The Role of Intercultural Communication in Multilingual Primary Education
Authors: Ozge Yalciner
Abstract:
Intercultural communication is crucial in the education of multilingual learners in primary grades, significantly influencing their academic and social development. This study explores how intercultural communication intersects with multilingual education, highlighting the importance of culturally responsive teaching practices. It addresses the challenges and opportunities presented by diverse linguistic backgrounds and proposes strategies for creating inclusive and supportive learning environments. The research emphasizes the need for teacher training programs that equip educators with the skills to recognize and address cultural differences, thereby enhancing student engagement and participation. This study was completed in an elementary school in a city in the Midwest, USA. The data was collected through observations and interviews with students and teachers. It discusses the integration of multicultural perspectives in curricula and the promotion of language diversity as an asset. Peer interactions and collaborative learning are highlighted as crucial for developing intercultural competence among young learners. The findings suggest that meaningful intercultural communication fosters a sense of belonging and mutual respect, leading to improved educational outcomes for multilingual students. Prioritizing intercultural communication in primary education is essential for supporting the linguistic and cultural identities of multilingual learners. By adopting inclusive pedagogical approaches and fostering an environment of cultural appreciation, educators can better support their students' academic success and personal growth.Keywords: diversity, intercultural communication, multilingual learners, primary grades
Procedia PDF Downloads 40239 Communicating Meaning through Translanguaging: The Case of Multilingual Interactions of Algerians on Facebook
Authors: F. Abdelhamid
Abstract:
Algeria is a multilingual speech community where individuals constantly mix between codes in spoken discourse. Code is used as a cover term to refer to the existing languages and language varieties which include, among others, the mother tongue of the majority Algerian Arabic, the official language Modern Standard Arabic and the foreign languages French and English. The present study explores whether Algerians mix between these codes in online communication as well. Facebook is the selected platform from which data is collected because it is the preferred social media site for most Algerians and it is the most used one. Adopting the notion of translanguaging, this study attempts explaining how users of Facebook use multilingual messages to communicate meaning. Accordingly, multilingual interactions are not approached from a pejorative perspective but rather as a creative linguistic behavior that multilingual utilize to achieve intended meanings. The study is intended as a contribution to the research on multilingualism online because although an extensive literature has investigated multilingualism in spoken discourse, limited research investigated it in the online one. Its aim is two-fold. First, it aims at ensuring that the selected platform for analysis, namely Facebook, could be a source for multilingual data to enable the qualitative analysis. This is done by measuring frequency rates of multilingual instances. Second, when enough multilingual instances are encountered, it aims at describing and interpreting some selected ones. 120 posts and 16335 comments were collected from two Facebook pages. Analysis revealed that third of the collected data are multilingual messages. Users of Facebook mixed between the four mentioned codes in writing their messages. The most frequent cases are mixing between Algerian Arabic and French and between Algerian Arabic and Modern Standard Arabic. A focused qualitative analysis followed where some examples are interpreted and explained. It seems that Algerians mix between codes when communicating online despite the fact that it is a conscious type of communication. This suggests that such behavior is not a random and corrupted way of communicating but rather an intentional and natural one.Keywords: Algerian speech community, computer mediated communication, languages in contact, multilingualism, translanguaging
Procedia PDF Downloads 132238 Discourse Markers in Chinese University Students and Native English Speakers: A Corpus-Based Study
Authors: Dan Xie
Abstract:
The use of discourse markers (DMs) can play a crucial role in representing discourse interaction and pragmatic competence. Learners’ use of DMs and differences between native speakers (NSs) and non-native speakers (NNSs) in the use of various DMs have been the focus of considerable research attention. However, some commonly used DMs, such as you know, have not received as much attention in comparative studies, especially in the Chinese context. This study analyses data in two corpora (COLSEC and Spoken BNC 2014 (14-25)) to investigate how Chinese learners differ from NNSs in their use of the DM you know and its functions in speech. The results show that there is a significant difference between the two corpora in terms of the frequency of use of you know. In terms of the functions of you know, the study shows that six functions can all be present in both corpora, although there are significant differences between the five functional dimensions, especially in introducing a claim linked to the prior discourse and highlighting particular points in the discourse. It is hoped to show empirically how Chinese learners and NSs use DMs differently.Keywords: you know, discourse marker, native speaker, Chinese learner
Procedia PDF Downloads 81237 Reed: An Approach Towards Quickly Bootstrapping Multilingual Acoustic Models
Authors: Bipasha Sen, Aditya Agarwal
Abstract:
Multilingual automatic speech recognition (ASR) system is a single entity capable of transcribing multiple languages sharing a common phone space. Performance of such a system is highly dependent on the compatibility of the languages. State of the art speech recognition systems are built using sequential architectures based on recurrent neural networks (RNN) limiting the computational parallelization in training. This poses a significant challenge in terms of time taken to bootstrap and validate the compatibility of multiple languages for building a robust multilingual system. Complex architectural choices based on self-attention networks are made to improve the parallelization thereby reducing the training time. In this work, we propose Reed, a simple system based on 1D convolutions which uses very short context to improve the training time. To improve the performance of our system, we use raw time-domain speech signals directly as input. This enables the convolutional layers to learn feature representations rather than relying on handcrafted features such as MFCC. We report improvement on training and inference times by atleast a factor of 4x and 7.4x respectively with comparable WERs against standard RNN based baseline systems on SpeechOcean's multilingual low resource dataset.Keywords: convolutional neural networks, language compatibility, low resource languages, multilingual automatic speech recognition
Procedia PDF Downloads 124236 Corpus-Based Neural Machine Translation: Empirical Study Multilingual Corpus for Machine Translation of Opaque Idioms - Cloud AutoML Platform
Authors: Khadija Refouh
Abstract:
Culture bound-expressions have been a bottleneck for Natural Language Processing (NLP) and comprehension, especially in the case of machine translation (MT). In the last decade, the field of machine translation has greatly advanced. Neural machine translation NMT has recently achieved considerable development in the quality of translation that outperformed previous traditional translation systems in many language pairs. Neural machine translation NMT is an Artificial Intelligence AI and deep neural networks applied to language processing. Despite this development, there remain some serious challenges that face neural machine translation NMT when translating culture bounded-expressions, especially for low resources language pairs such as Arabic-English and Arabic-French, which is not the case with well-established language pairs such as English-French. Machine translation of opaque idioms from English into French are likely to be more accurate than translating them from English into Arabic. For example, Google Translate Application translated the sentence “What a bad weather! It runs cats and dogs.” to “يا له من طقس سيء! تمطر القطط والكلاب” into the target language Arabic which is an inaccurate literal translation. The translation of the same sentence into the target language French was “Quel mauvais temps! Il pleut des cordes.” where Google Translate Application used the accurate French corresponding idioms. This paper aims to perform NMT experiments towards better translation of opaque idioms using high quality clean multilingual corpus. This Corpus will be collected analytically from human generated idiom translation. AutoML translation, a Google Neural Machine Translation Platform, is used as a custom translation model to improve the translation of opaque idioms. The automatic evaluation of the custom model will be compared to the Google NMT using Bilingual Evaluation Understudy Score BLEU. BLEU is an algorithm for evaluating the quality of text which has been machine-translated from one natural language to another. Human evaluation is integrated to test the reliability of the Blue Score. The researcher will examine syntactical, lexical, and semantic features using Halliday's functional theory.Keywords: multilingual corpora, natural language processing (NLP), neural machine translation (NMT), opaque idioms
Procedia PDF Downloads 151235 Managing Linguistic Diversity in Teaching and in Learning in Higher Education Institutions: The Case of the University of Luxembourg
Authors: Argyro-Maria Skourmalla
Abstract:
Today’s reality is characterized by diversity in different levels and aspects of everyday life. Focusing on the aspect of language and communication in Higher Education (HE), the present paper draws on the example of the University of Luxembourg as a multilingual and international setting. The University of Luxembourg, which is located between France, Germany, and Belgium, adopted its new multilingualism policy in 2020, establishing English, French, German, and Luxembourgish as the official languages of the Institution. In addition, with around 10.000 students and staff coming from various countries around the world, linguistic diversity in this university is seen as both a resource and a challenge that calls for an inclusive and multilingual approach. The present paper includes data derived from semi-structured interviews with lecturing staff from different disciplines and an online survey with undergraduate students at the University of Luxembourg. Participants shared their experiences and point of view regarding linguistic diversity in this context. Findings show that linguistic diversity in this university is seen as an asset but comes with challenges, and even though there is progress in the use of multilingual practices, a lot needs to be done towards the recognition of staff and students’ linguistic repertoires for inclusion and education equity.Keywords: linguistic diversity, higher education, Luxembourg, multilingual practices, teaching, learning
Procedia PDF Downloads 77234 Query in Grammatical Forms and Corpus Error Analysis
Authors: Katerina Florou
Abstract:
Two decades after coined the term "learner corpora" as collections of texts created by foreign or second language learners across various language contexts, and some years following suggestion to incorporate "focusing on form" within a Task-Based Learning framework, this study aims to explore how learner corpora, whether annotated with errors or not, can facilitate a focus on form in an educational setting. Argues that analyzing linguistic form serves the purpose of enabling students to delve into language and gain an understanding of different facets of the foreign language. This same objective is applicable when analyzing learner corpora marked with errors or in their raw state, but in this scenario, the emphasis lies on identifying incorrect forms. Teachers should aim to address errors or gaps in the students' second language knowledge while they engage in a task. Building on this recommendation, we compared the written output of two student groups: the first group (G1) employed the focusing on form phase by studying a specific aspect of the Italian language, namely the past participle, through examples from native speakers and grammar rules; the second group (G2) focused on form by scrutinizing their own errors and comparing them with analogous examples from a native speaker corpus. In order to test our hypothesis, we created four learner corpora. The initial two were generated during the task phase, with one representing each group of students, while the remaining two were produced as a follow-up activity at the end of the lesson. The results of the first comparison indicated that students' exposure to their own errors can enhance their grasp of a grammatical element. The study is in its second stage and more results are to be announced.Keywords: Corpus interlanguage analysis, task based learning, Italian language as F1, learner corpora
Procedia PDF Downloads 54233 Investigating the Effectiveness of Multilingual NLP Models for Sentiment Analysis
Authors: Othmane Touri, Sanaa El Filali, El Habib Benlahmar
Abstract:
Natural Language Processing (NLP) has gained significant attention lately. It has proved its ability to analyze and extract insights from unstructured text data in various languages. It is found that one of the most popular NLP applications is sentiment analysis which aims to identify the sentiment expressed in a piece of text, such as positive, negative, or neutral, in multiple languages. While there are several multilingual NLP models available for sentiment analysis, there is a need to investigate their effectiveness in different contexts and applications. In this study, we aim to investigate the effectiveness of different multilingual NLP models for sentiment analysis on a dataset of online product reviews in multiple languages. The performance of several NLP models, including Google Cloud Natural Language API, Microsoft Azure Cognitive Services, Amazon Comprehend, Stanford CoreNLP, spaCy, and Hugging Face Transformers are being compared. The models based on several metrics, including accuracy, precision, recall, and F1 score, are being evaluated and compared to their performance across different categories of product reviews. In order to run the study, preprocessing of the dataset has been performed by cleaning and tokenizing the text data in multiple languages. Then training and testing each model has been applied using a cross-validation approach where randomly dividing the dataset into training and testing sets and repeating the process multiple times has been used. A grid search approach to optimize the hyperparameters of each model and select the best-performing model for each category of product reviews and language has been applied. The findings of this study provide insights into the effectiveness of different multilingual NLP models for Multilingual Sentiment Analysis and their suitability for different languages and applications. The strengths and limitations of each model were identified, and recommendations for selecting the most performant model based on the specific requirements of a project were provided. This study contributes to the advancement of research methods in multilingual NLP and provides a practical guide for researchers and practitioners in the field.Keywords: NLP, multilingual, sentiment analysis, texts
Procedia PDF Downloads 107232 Exploring Mtb-Mle Practices in Selected Schools in Benguet, Philippines
Authors: Jocelyn L. Alimondo, Juna O. Sabelo
Abstract:
This study explored the MTB-MLE implementation practices of teachers in one monolingual elementary school and one multilingual elementary school in Benguet, Philippines. It used phenomenological approach employing participant-observation, focus group discussion and individual interview. Data were gathered using a video camera, an audio recorder, and an FGD guide and were treated through triangulation and coding. From the data collected, varied ways in implementing the MTB-MLE program were noted. These are: Teaching using a hybrid first language, teaching using a foreign LOI, using translation and multilingual instruction, and using L2/L3 to unlock L1. However, these practices come with challenges such as the a conflict between the mandated LOI and what pupils need, lack of proficiency of teachers in the mandated LOI, facing unreceptive parents, stagnation of knowledge resulting from over-familiarity of input, and zero learning resulting from an incomprehensible language input. From the practices and challenges experienced by the teachers, a model of MTB-MLE approach, the 3L-in-one approach, to teaching was created to illustrate the practice which teachers claimed to be the best way to address the challenges besetting them while at the same time satisfying the academic needs of their pupils. From the findings, this paper concludes that despite the challenges besetting the teachers, they still displayed creativity in coming up with relevant teaching practices, the unreceptiveness of some teachers and parents sprung from the fact that they do not understand the real concept of MTB-MLE, greater challenges are being faced by teachers in multilingual school due to the diverse linguistic background of their clients, and the most effective approach in implementing MTB-MLE is the multilingual approach, allowing the use of the pupils’ mother tongue, L2 (Filipino), L3 (English), and other languages familiar to the students.Keywords: MTB-MLE Philippines, MTB-MLE model, first language, multilingual instruction
Procedia PDF Downloads 428231 Understanding the Multilingualism of the Mauritian Multilingual Primary School Learner and Translanguaging: A Linguistic Ethnographic Study
Authors: Yesha Devi Mahadeo-Doorgakant
Abstract:
The Mauritian landscape is well-known for its multilingualism with the daily interaction of the number of languages that are used in the island; namely Kreol Morisien, the European languages (English and French) and the Oriental/Asian languages (Hindi, Arabic/Urdu, Tamil, Telegu, Marathi, Mandarin, etc.). However, within Mauritius’ multilingual educational system, English is the official medium of instruction while French is taught as compulsory subject till upper secondary and oriental languages are offered as optional languages at primary level. Usually, Mauritians choose one oriental language based on their ethnic/religious identity, when they start their primary schooling as an additional language to learn. In January 2012, Kreol Morisien, which is the considered the language of daily interaction of the majority of Mauritians, was introduced as an optional subject at primary level, taught at the same time as the oriental languages. The introduction of Kreol Morisien has spurred linguistic debates about the issue of multilingualism within the curriculum. Taking this into account, researchers have started pondering on the multilingual educational system of the country and questioning whether the current language curriculum caters for the complex everyday linguistic reality of the multilingual Mauritian learner, given most learners are embedded within an environment where the different languages interact with each other daily. This paper, therefore, proposes translanguaging as being a more befitting theoretical lens through which the multilingualism and the linguistic repertoire of Mauritian learners’ can best be understood.Keywords: multilingualism, translanguaging, multilingual learner, linguistic ethnography
Procedia PDF Downloads 171230 Multilingual Practices in the UK: Kabyles’ Situational Language Choice in a Linguistically Diverse Setting.
Authors: Souhila Belabbas
Abstract:
This paper focuses on the Kabyles’ multilingual practices in the UK, within the Kabyle/Amazigh Cultural Organisation in London, on online platforms and at home. The Kabyles have roots in northern Algeria and associate their language, Kabyle, with a pre-Arabized history of northern Africa. Drawing on ethnographic research with this community, this study brings together their post-migration language preservation activisms as well as their dynamic multilingual practices and situational language choice into a dialogue. This shows the enduring significance of the heritage language for social, cultural and historical identity. It also demonstrates that the current survival of the “mother tongue” hinges on multilingual and multi-sited language activisms, which bear the hallmarks of both new creativities and diminishing fluencies in multilingual spaces. These multilingual repertoires also included a range of ideological stances, expressed as cultural, moral, and political attitudes to the “mother tongue” and to other, potentially more dominant, languages in their lives, involving both inclusive and exclusive instances. The Kabyles in the UK practice everyday forms of multilingualism in the dynamic terms whilst making strong identity claims to an endangered heritage language. Crucially, their language contact experiences were not a post-migration novelty but part of their pre-migration lifeworlds. The participants involved in this study shared a commitment to Kabyle identity activism. They expressed this differently, varyingly foregrounding cultural, social or political issues. These differences were related to their North-African cultural background, live, gender, religious and/or political affiliation, as well as to their different migratory trajectories. Among these ethno-conscious individuals, the use of Kabyle was often particularly vibrant in informal domains of casual conversations and mixed in with French, English and often Arabic. During community events and festivals, though, many made special efforts to converse in Kabyle as if to make a point about their commitment to a shared identity.Keywords: ethnography, language ideology, language choice, heritage language, migration trajectories, multilingual repertoires
Procedia PDF Downloads 81229 Multilingual Practices in the UK: Kabyles’ Situational Language Choice in a Linguistically Diverse Setting
Authors: Souhila Belabbas
Abstract:
This paper focuses on the Kabyles’ multilingual practices in the UK, within the Kabyle/Amazigh Cultural Organisation in London, on online platforms and at home. The Kabyles have roots in northern Algeria and associate their language, Kabyle, with a pre-Arabized history of northern Africa. Drawing on ethnographic research with this community, this study brings together their post-migration language preservation activisms as well as their dynamic multilingual practices and situational language choice into a dialogue. This shows the enduring significance of the heritage language for social, cultural and historical identity. It also demonstrates that the current survival of the “mother tongue” hinges on multilingual and multi-sited language activisms, which bear the hallmarks of both new creativities and diminishing fluencies in multilingual spaces. These multilingual repertoires also included a range of ideological stances, expressed as cultural, moral, and political attitudes to the “mother tongue” and to other, potentially more dominant, languages in their lives, involving both inclusive and exclusive instances. The Kabyles in the UK practice everyday forms of multilingualism in the dynamic terms whilst making strong identity claims to an endangered heritage language. Crucially, their language contact experiences were not a post-migration novelty but part of their pre-migration lifeworlds. The participants involved in this study shared a commitment to Kabyle identity activism. They expressed this differently, varyingly foregrounding cultural, social or political issues. These differences were related to their North-African cultural background, live, gender, religious and/or political affiliation, as well as to their different migratory trajectories. Among these ethno-conscious individuals, the use of Kabyle was often particularly vibrant in informal domains of casual conversations and mixed in with French, English and often Arabic. During community events and festivals, though, many made special efforts to converse in Kabyle as if to make a point about their commitment to a shared identity.Keywords: ethnography, language ideology, language choice, heritage language, migration trajectories, multilingual repertoires
Procedia PDF Downloads 79228 Frequency of the English Phrasal Verbs Used by Iranian Learners as a Reference to the Style of Writing Adopted by the Learners
Authors: Hamzeh Mazaherylaghab, Mehrangiz Vahabian, Seyyedeh Zahra Asghari
Abstract:
The present study initially focused on the frequency of phrasal verbs used by Iranian learners of English. The results then needed to be compared to the findings from native speaker corpora. After the extraction of phrasal verbs from learner and native-speaker corpora the findings were analysed. The results showed that Iranian learners avoided using phrasal verbs in many cases. Some of the findings proved to be significant. It was also found that the learners used the single-word counterparts of the avoided phrasal verbs to compensate for their lack of knowledge in many cases. Semantic complexity and Lack of L1 counterpart may have been the main reasons for avoidance, but despite the avoidance phenomenon, the learners displayed a tendency to use many other phrasal verbs which may have been due to the increase in the number of multi-word verbs in Persian. The overall scores confirmed the fact that the language produced by the learners illustrates signs of more formal style in comparison with the native speakers of English by using less phrasal verbs and more formal single word verbs instead.Keywords: corpus, corpora, LOCNESS, phrasal verbs, single-word verb
Procedia PDF Downloads 203227 A Collaborative Platform for Multilingual Ontology Development
Authors: Ahmed Tawfik, Fausto Giunchiglia, Vincenzo Maltese
Abstract:
Ontologies provide a common understanding of a specific domain of interest that can be communicated between people and used as background knowledge for automated reasoning in a wide range of applications. In this paper we address the design of multilingual ontologies following well-defined knowledge engineering methodologies with the support of novel collaborative development approaches. In particular, we present a collaborative platform which allows ontologies to be developed incrementally in multiple languages. This is made possible via an appropriate mapping between language independent concepts and one lexicalization per language (or a lexical gap in case such lexicalization does not exist). The collaborative platform has been designed to support the development of the Universal Knowledge Core, a multilingual ontology currently in English, Italian, Chinese, Mongolian, Hindi, and Bangladeshi. Its design follows a workflow-based development methodology that models resources as a set of collaborative objects and assigns customizable workflows to build and maintain each collaborative object in a community driven manner, with extensive support of modern web 2.0 social and collaborative features.Keywords: knowledge diversity, knowledge representation, ontology, development
Procedia PDF Downloads 393226 Multilingualism and the Question of National Language in Nigeria
Authors: Salome Labeh
Abstract:
Diverse Languages that exist in Nigeria, gave rise to the need to choose among these languages, which one or ones to be used as the National Language(s) in Nigeria. The Multilingual Nature of Nigeria has been examined, in relation to the provisional result of 1991 census conducted in Nigeria and the status of language policy in the country, which eventually led to the discovery of the fact that Hausa, Igbo, Yoruba languages have the highest speaker in terms of population, and are already made co-official languages in Nigeria, alongside with English language. Then, these languages should be considered as the National Languages, if eventually a language policy emerges in Nigeria.Keywords: multilingual, languages, culture, Nigeria
Procedia PDF Downloads 372225 Speech and LanguageTherapists’ Advices for Multilingual Children with Developmental Language Disorders
Authors: Rudinë Fetahaj, Flaka Isufi, Kristina Hansson
Abstract:
While evidence shows that in most European countries’ multilingualism is rising, unfortunately, the focus of Speech and Language Therapy (SLT) is still monolingualism. Furthermore, there is sparse information on how the needs of multilingual children with language disorders such as Developmental Language Disorder (DLD) are being met and which factors affect the intervention approach of SLTs when treating DLD. This study aims to examine the relationship and correlation between the number of languages SLTs speak, years of experience, and length of education with the advice they give to parents of multilingual children with DLD regarding which language to be spoken. This is a cross-sectional study where a survey was completed online by 2608 SLTs across Europe and data has been used from a 2017 COST-action project. IBM-SPSS-28 was used where descriptive analysis, correlation and Kruskal-Wallis test were performed.SLTs mainly advise the parents of multilingual children with DLD to speak their native language at home. Besides years of experience, language status and the level of education showed to have no association with the type of advice SLTs give. Results showed a non-significant moderate positive correlation between SLTs years of experience and their advice regarding the native language, whereas language status and length of education showed no correlation with the advice SLTs give to parents.Keywords: quantitative study, developmental language disorders, multilingualism, speech and language therapy, children, European context
Procedia PDF Downloads 81224 The Contribution of Corpora to the Investigation of Cross-Linguistic Equivalence in Phraseology: A Contrastive Analysis of Russian and Italian Idioms
Authors: Federica Floridi
Abstract:
The long tradition of contrastive idiom research has essentially been focusing on three domains: the comparison of structural types of idioms (e.g. verbal idioms, idioms with noun-phrase structure, etc.), the description of idioms belonging to the same thematic groups (Sachgruppen), the identification of different types of cross-linguistic equivalents (i.e. full equivalents, partial equivalents, phraseological parallels, non-equivalents). The diastratic, diachronic and diatopic aspects of the compared idioms, as well as their syntactic, pragmatic and semantic properties, have been rather ignored. Corpora (both monolingual and parallel) give the opportunity to investigate the actual use of correlating idioms in authentic texts of L1 and L2. Adopting the corpus-based approach, it is possible to draw attention to the frequency of occurrence of idioms, their syntactic embedding, their potential syntactic transformations (e.g., nominalization, passivization, relativization, etc.), their combinatorial possibilities, the variations of their lexical structure, their connotations in terms of stylistic markedness or register. This paper aims to present the results of a contrastive analysis of Russian and Italian idioms referring to the concepts of ‘beginning’ and ‘end’, that has been carried out by using the Russian National Corpus and the ‘La Repubblica’ corpus. Beyond the digital corpora, bilingual dictionaries, like Skvorcova - Majzel’, Dobrovol’skaja, Kovalev, Čerdanceva, as well as monolingual resources, have been consulted. The study has shown that many of the idioms that have been traditionally indicated as cross-linguistic equivalents on bilingual dictionaries cannot be considered correspondents. The findings demonstrate that even those idioms, that are formally identical in Russian and Italian and are presumably derived from the same source (e.g., conceptual metaphor, Bible, classical mythology, World literature), exhibit differences regarding usage. The ultimate purpose of this article is to highlight that it is necessary to review and improve the existing bilingual dictionaries considering the empirical data collected in corpora. The materials gathered in this research can contribute to this sense.Keywords: corpora, cross-linguistic equivalence, idioms, Italian, Russian
Procedia PDF Downloads 149