Search results for: pre-trained language models
Commenced in January 2007
Frequency: Monthly
Edition: International
Paper Count: 10180

Search results for: pre-trained language models

10060 Student's Reluctance in Oral Participation

Authors: Soumia Hebbri

Abstract:

English language has become a major medium for communication across borders. Nowadays, it is seen as a communicative medium not only for business but also for academic purposes. Some scientists describe English language as a way to enjoy an admired position in many countries. It is neither a national nor an official language in North Africa; it is considered as the most widely taught foreign language at the educational system. In order to achieve mastery of a foreign language, learners must develop the four principal language skills: Reading, writing, listening and speaking. However, being able to interact orally with others, using effectively the target language, is nowadays very important. People who cannot speak a foreign language cannot be considered effective language users, even if they can read and understand it. The teachers’ role in promoting foreign language acquisition is very important, as they are responsible for providing students appropriate contexts to foster communicative situations that allow students to express themselves and interact in the target language. So, we should understand the student’s reasons of their reluctance in oral participation when dealing with oral communicative tasks, in order to get insights about the possible motivating factors that may improve their involvement and participation in the classroom.

Keywords: EL, EFL, ET, TEFL, communication

Procedia PDF Downloads 503
10059 De-Learning Language at Preschool: A Case of Nepal

Authors: Meenakshi Dahal

Abstract:

Generally, children start verbal communication by the age of eighteen months. Though they have difficulties in constructing complete sentences, they try to make their thought s understandable to the audience. By the age of 36 months, when they enroll in preschool, their Language and communication skills are enhanced. Children need plenty of classroom experiences that will help them to develop their oral language skills. Oral language is the primary means through which each individual child is enabled to structure, evaluate, describe and to express his/her experiences. In the context of multi lingual and multi-cultural country like Nepal, the languages used in preschool and the communities vary. In such a case, the language of instruction in the preschool is different from the language used by the children to communicate at home. Using qualitative research method the socio-cultural aspect of the language learning has been analyzed. This has been done by analyzing and exploring preschool activities as well as the language of instruction and communication in the preschools in rural Nepal. It is found that the language of instruction is different from the language of communications primarily used by the children. Teachers seldom use local language resulting in difficulties for the children to understand. Instead of recognizing their linguistic, social and cultural capitals teachers conform to using the Nepali language which the children are not familiar with. Children have to adapt to new language structures and patterns of usage resulting them to be slow in oral language and communication in the preschool. The paper concludes that teachers have to recognize the linguistic capitals of the children and schools need to be responsible to facilitate this process for all children, whatever their language background.

Keywords: children, language, preschool, socio-culture

Procedia PDF Downloads 392
10058 The Advancements of Transformer Models in Part-of-Speech Tagging System for Low-Resource Tigrinya Language

Authors: Shamm Kidane, Ibrahim Abdella, Fitsum Gaim, Simon Mulugeta, Sirak Asmerom, Natnael Ambasager, Yoel Ghebrihiwot

Abstract:

The call for natural language processing (NLP) systems for low-resource languages has become more apparent than ever in the past few years, with the arduous challenges still present in preparing such systems. This paper presents an improved dataset version of the Nagaoka Tigrinya Corpus for Parts-of-Speech (POS) classification system in the Tigrinya language. The size of the initial Nagaoka dataset was incremented, totaling the new tagged corpus to 118K tokens, which comprised the 12 basic POS annotations used previously. The additional content was also annotated manually in a stringent manner, followed similar rules to the former dataset and was formatted in CONLL format. The system made use of the novel approach in NLP tasks and use of the monolingually pre-trained TiELECTRA, TiBERT and TiRoBERTa transformer models. The highest achieved score is an impressive weighted F1-score of 94.2%, which surpassed the previous systems by a significant measure. The system will prove useful in the progress of NLP-related tasks for Tigrinya and similarly related low-resource languages with room for cross-referencing higher-resource languages.

Keywords: Tigrinya POS corpus, TiBERT, TiRoBERTa, conditional random fields

Procedia PDF Downloads 103
10057 Some Considerations on UML Class Diagram Formalisation Approaches

Authors: Abdullah A. H. Alzahrani, Majd Zohri Yafi, Fawaz K. Alarfaj

Abstract:

Unified Modelling Language (UML) is a software modelling language that is widely used and accepted. One significant drawback, of which, is that the language lacks formality. This makes carrying out any type of rigorous analysis difficult process. Many researchers attempt to introduce their approaches to formalize UML diagrams. However, it is always hard to decide what language and/or approach to use. Therefore, in this paper, we highlight some of the advantages and disadvantages of number of those approaches. We also try to compare different counterpart approaches. In addition, we draw some guidelines to help in choosing the suitable approach. Special concern is given to the formalization of the static aspects of UML shown is class diagrams.

Keywords: UML formalization, object constraints language, description logic, z language

Procedia PDF Downloads 434
10056 Recurrent Patterns of Netspeak among Selected Nigerians on WhatsApp Platform: A Quest for Standardisation

Authors: Lily Chimuanya, Esther Ajiboye, Emmanuel Uba

Abstract:

One of the consequences of online communication is the birth of new orthography genres characterised by novel conventions of abbreviation and acronyms usually referred to as Netspeak. Netspeak, also known as internet slang, is a style of writing mainly used in online communication to limit the length of text characters and to save time. The aim of this study is to evaluate how second language users of the English language have internalised this new convention of writing; identify the recurrent patterns of Netspeak; and assess the consistency of the use of the identified patterns in relation to their meanings. The study is corpus-based, and data drawn from WhatsApp chart pages of selected groups of Nigerian English speakers show a large occurrence of inconsistencies in the patterns of Netspeak and their meanings. The study argues that rather than emphasise the negative impact of Netspeak on the communicative competence of second language users, studies should focus on suggesting models as yardsticks for standardising the usage of Netspeak and indeed all other emerging language conventions resulting from online communication. This stance stems from the inevitable global language transformation that is eminent with the coming of age of information technology.

Keywords: abbreviation, acronyms, Netspeak, online communication, standardisation

Procedia PDF Downloads 391
10055 Reconfigurable Device for 3D Visualization of Three Dimensional Surfaces

Authors: Robson da C. Santos, Carlos Henrique de A. S. P. Coutinho, Lucas Moreira Dias, Gerson Gomes Cunha

Abstract:

The article refers to the development of an augmented reality 3D display, through the control of servo motors and projection of image with aid of video projector on the model. Augmented Reality is a branch that explores multiple approaches to increase real-world view by viewing additional information along with the real scene. The article presents the broad use of electrical, electronic, mechanical and industrial automation for geospatial visualizations, applications in mathematical models with the visualization of functions and 3D surface graphics and volumetric rendering that are currently seen in 2D layers. Application as a 3D display for representation and visualization of Digital Terrain Model (DTM) and Digital Surface Models (DSM), where it can be applied in the identification of canyons in the marine area of the Campos Basin, Rio de Janeiro, Brazil. The same can execute visualization of regions subject to landslides, as in Serra do Mar - Agra dos Reis and Serranas cities both in the State of Rio de Janeiro. From the foregoing, loss of human life and leakage of oil from pipelines buried in these regions may be anticipated in advance. The physical design consists of a table consisting of a 9 x 16 matrix of servo motors, totalizing 144 servos, a mesh is used on the servo motors for visualization of the models projected by a retro projector. Each model for by an image pre-processing, is sent to a server to be converted and viewed from a software developed in C # Programming Language.

Keywords: visualization, 3D models, servo motors, C# programming language

Procedia PDF Downloads 342
10054 A Comparative Study of Language Learning Strategy Use of Iranian Kurdish Bilingual and Persian Monolingual in EFL Context

Authors: Reza Khani, Ziba Hosseini

Abstract:

This study was an attempt to investigate the difference between learners of Iranian Kurdish–Persian bilingual language and Persian monolinguals, regarding language strategy use (LLS). The participants of the study were 120 monolingual Persian and 120 bilingual Kurdish studying English as a foreign language (EFL). Data were collected using strategy inventory for language learning SILL. The results show bilingual reported higher use of language learning strategies in all categories of SILL except memory strategies.

Keywords: language learning, memory, monolingual, comparative study

Procedia PDF Downloads 403
10053 Teaching English Language through Religious English Literature

Authors: Smriti Mary Gupta

Abstract:

This article intends to show how literature may be used in language classes to develop student’s knowledge of English. First, we examine the evolution of literature in the language classroom, then we give account of some reasons that justify its use in language classes, of the role of reading in language development, and of the way poetry is treated in the ESL classroom. This paper aims to emphasize the use of literature as a popular tool to teach language skills (i.e. reading, writing, listening and speaking), language areas (i.e. vocabulary, grammar and pronunciation) as well as moral teachings, which is the necessity in present time. Reason for using religious literary texts in foreign language classroom and main criteria for selecting suitable religious literary texts in foreign language classes are stressed so as to make the reader familiar with the underlying reasons and criteria for language teachers, using and selecting religious literary texts. Moreover, religious literature and teaching of language skills, benefits the different genres of religious literature (i.e. poetry, fiction and drama), and also gaining knowledge of a particular religion through language teaching but some problems had been observed by language teachers within the area of English through religious literature (i.e. lack of preparation in the area of literature teaching in TESL/TEFL programs, absence of clarity in objectives defining the role of literature in ESL/EFL), language teachers not having the background, training and appropriate knowledge in religious literature, lack of pedagogically-designed teaching material that can be used by language teachers in a classroom.

Keywords: religious literature, teaching literature, teaching of language skills, foreign language teaching, literary competence

Procedia PDF Downloads 525
10052 Dialect as a Means of Identification among Hausa Speakers

Authors: Hassan Sabo

Abstract:

Language is a system of conventionally spoken, manual and written symbols by human beings that members of a certain social group and participants in its culture express themselves. Communication, expression of identity and imaginative expression are among the functions of language. Dialect is a form of language, or a regional variety of language that is spoken in a particular geographical setting by a particular group of people. Hausa is one of the major languages in Africa, in terms of large number of people for whom it is the first language. Hausa is one of the western Chadic groups of languages. It constitutes one of the five or six branches of Afro-Asiatic family. The predominant Hausa speakers are in Nigeria and they live in different geographical locations which resulted to variety of dialects within the Hausa language apart of the standard Hausa language, the Hausa language has a variety of dialect that distinguish from one another by such features as phonology, grammar and vocabulary. This study intends to examine such features that serve as means of identification among Hausa speakers who are set off from others, geographically or socially.

Keywords: dialect, features, geographical location, Hausa language

Procedia PDF Downloads 194
10051 [Keynote Talk]: Computer-Assisted Language Learning (CALL) for Teaching English to Speakers of Other Languages (TESOL/ESOL) as a Foreign Language (TEFL/EFL), Second Language (TESL/ESL), or Additional Language (TEAL/EAL)

Authors: Andrew Laghos

Abstract:

Computer-assisted language learning (CALL) is defined as the use of computers to help learn languages. In this study we look at several different types of CALL tools and applications and how they can assist Adults and Young Learners in learning the English language as a foreign, second or additional language. It is important to identify the roles of the teacher and the learners, and what the learners’ motivations are for learning the language. Audio, video, interactive multimedia games, online translation services, conferencing, chat rooms, discussion forums, social networks, social media, email communication, songs and music video clips are just some of the many ways computers are currently being used to enhance language learning. CALL may be used for classroom teaching as well as for online and mobile learning. Advantages and disadvantages of CALL are discussed and the study ends with future predictions of CALL.

Keywords: computer-assisted language learning (CALL), teaching English as a foreign language (TEFL/EFL), adult learners, young learners

Procedia PDF Downloads 434
10050 Efficient Chiller Plant Control Using Modern Reinforcement Learning

Authors: Jingwei Du

Abstract:

The need of optimizing air conditioning systems for existing buildings calls for control methods designed with energy-efficiency as a primary goal. The majority of current control methods boil down to two categories: empirical and model-based. To be effective, the former heavily relies on engineering expertise and the latter requires extensive historical data. Reinforcement Learning (RL), on the other hand, is a model-free approach that explores the environment to obtain an optimal control strategy often referred to as “policy”. This research adopts Proximal Policy Optimization (PPO) to improve chiller plant control, and enable the RL agent to collaborate with experienced engineers. It exploits the fact that while the industry lacks historical data, abundant operational data is available and allows the agent to learn and evolve safely under human supervision. Thanks to the development of language models, renewed interest in RL has led to modern, online, policy-based RL algorithms such as the PPO. This research took inspiration from “alignment”, a process that utilizes human feedback to finetune the pretrained model in case of unsafe content. The methodology can be summarized into three steps. First, an initial policy model is generated based on minimal prior knowledge. Next, the prepared PPO agent is deployed so feedback from both critic model and human experts can be collected for future finetuning. Finally, the agent learns and adapts itself to the specific chiller plant, updates the policy model and is ready for the next iteration. Besides the proposed approach, this study also used traditional RL methods to optimize the same simulated chiller plants for comparison, and it turns out that the proposed method is safe and effective at the same time and needs less to no historical data to start up.

Keywords: chiller plant, control methods, energy efficiency, proximal policy optimization, reinforcement learning

Procedia PDF Downloads 28
10049 English Language Teaching and Learning Analysis in Iran

Authors: F. Zarrabi, J. R. Brown

Abstract:

Although English is not a second language in Iran, it has become an inseparable part of many Iranian people’s lives and is becoming more and more widespread. This high demand has caused a significant increase in the number of private English language institutes in Iran. Although English is a compulsory course in schools and universities, the majority of Iranian people are unable to communicate easily in English. This paper reviews the current state of teaching and learning English as an international language in Iran. Attitudes and motivations about learning English are reviewed. Five different aspects of using English within the country are analysed, including: English in public domain, English in Media, English in organizations/businesses, English in education, and English in private language institutes. Despite the time and money spent on English language courses in private language institutes, the majority of learners seem to forget what has been learned within months of completing their course. That is, when they are students with the support of the teacher and formal classes, they appear to make progress and use English more or less fluently. When this support is removed, their language skills either stagnant or regress. The findings of this study suggest that a dependant approach to learning is potentially one of the main reasons for English language learning problems and this is encouraged by English course books and approaches to teaching.

Keywords: English in Iran, English language learning, English language teaching, evaluation

Procedia PDF Downloads 418
10048 Decolonial Theorization of Epistemic Agency in Language Policy Management: Case of Plurinational Ecuador

Authors: Magdalena Madany-Saá

Abstract:

This paper compares the language management of two language policies in plurinational Ecuador: (1) mandatory English language teaching that uses Western standards of quality, and (2) indigenous educación intercultural bilingüe, which promotes ancestral knowledge and the indigenous languages of Ecuador. The data are from a comparative institutional ethnography conducted between 2018 and 2022 in English and Kichwa teacher preparation programs in an Ecuadorian teachers’ college. Specifically, the paper explores frameworks of knowledge promoted by different educational actors in both teacher education programs and the ways in which the Ecuadorian transformation towards a knowledge-based economy is intertwined with the country’s linguistic policies. Focusing on the specific role of language advocates and their discursive role in knowledge production, the paper elaborates on the notion of agency in Language Policy and Planning (LPP), referred to as epistemic agency. Specifically, the epistemic agency is conceptualized through the analysis of English language epistemic advocates who participate in empowering English language policies and endorse knowledge production in that language. By proposing an epistemic agency, this paper argues that in the context of knowledge-based societies, advocates are key in transferring the policies from the political to the epistemic realm – where decisions about what counts as legitimate knowledge are made. The study uses the decolonial option as its analytical framework for critiquing the hegemonic perpetuation of modernity and its knowledge-based models in Latin America derived from the colonial matrix of power. Through this theoretical approach, it is argued that if indigenous stakeholders are only viewed as political actors and not as knowledge producers, the hegemony of Global English will reinforce a knowledge-based society constructed upon Global North modernity. In the absence of strong epistemic advocates for indigenous language policies, powerful Global English advocates occupy such vacancies at the language management level, thus dominating the ecology of knowledge in a plurinational and plurilingual Ecuador.

Keywords: educación intercultural bilingüe, English language teaching, epistemic agency, language advocates, plurinationality

Procedia PDF Downloads 36
10047 Implementation of Computer-Based Technologies into Foreign Language Teaching Process

Authors: Golovchun Aleftina, Dabyltayeva Raikhan

Abstract:

Nowadays, in the world of widely developing cross-cultural interactions and rapidly changing demands of the global labor market, foreign language teaching and learning has taken a special role not only in school education but also in everyday life. Cognitive Lingua-Cultural Methodology of Foreign Language Teaching originated in Kazakhstan brings a communicative approach to the forefront in foreign language teaching that gives raise a variety of techniques to make the language learning a real communication. One of these techniques is Computer Assisted Language Learning. In our article, we aim to: demonstrate what learning benefits students are likely to get by teachers having implemented computer-based technologies into foreign language teaching process; prove that technology-based classroom serves as the best tool for interactive and efficient language learning; give examples of classroom sufficient organization with computer-based activities.

Keywords: computer assisted language learning, learning benefits, foreign language teaching process, implementation, communicative approach

Procedia PDF Downloads 473
10046 Armenian in the Jordanian Linguistic Landscape: Marginalisation and Revitalisation

Authors: Omar Alomoush

Abstract:

This paper examines the Armenian language in the linguistic landscape of Jordanian cities. The results indicate that Armenian is chiefly marginalised in the LL. By quantitative and qualitative methods, the current study attempts to identify the main reasons behind this marginalisation. In the light of the fact that Armenian is completely absent from the commercial streets of major Jordanian cities, all monolingual and multilingual signs in Armenian Neighbourhood in Amman city are photographed to identify them according to function and language. To provide plausible explanations for the marginalisation of the Armenian language in the LL, the current study builds upon issues of language maintenance and underlying language policy. According to the UNESCO Endangerment Framework, it can be assumed that Armenian is a vulnerable language, even though the Armenian Church exerted great efforts to revitalise Armenian in all social settings, including the LL. It was found that language policies enacted by the state of Jordan, language shift, language hostility, voluntary migration and economic pressures are among the reasons behind this marginalisation.

Keywords: linguistic landscape, multilingualism, Armenian, marginalisation and revitalisation

Procedia PDF Downloads 261
10045 Reciprocal Interferences in Bilingual English-Igbo Speaking Society: The Implications in Language Pedagogy

Authors: Ugwu Elias Ikechukwu

Abstract:

Discussions on bilingualism have always dwelt on how the mother tongue interferes with the target language. This interference is considered a serious problem in second language learning. Usually, the interference has been phonological. But the objective of this research is to explore how the target language interferes with the mother tongue. In the case of the Igbo language, it interferes with English mostly at the phonological level while English interferes with Igbo at the realm of vocabulary. The result is a new language \"Engligbo\" which is a hybrid of English and Igbo. The Igbo language spoken by about 25 million people is one of the three most prominent languages in Nigeria. This paper discusses the phenomenal Engligbo, and other implications for Igbo learners of English. The method of analysis is descriptive. A number of recommendations were made that would help teachers handle problems arising from such mutual interferences.

Keywords: reciprocal interferences, bilingualism, implications, language pedagogy

Procedia PDF Downloads 398
10044 Spoken Subcorpus of the Kazakh Language: History, Content, Methodology

Authors: Kuralay Bimoldaevna Kuderinova, Beisenkhan Samal

Abstract:

The history of creating a linguistic corpus in Kazakh linguistics begins only in 2016. Though within this short period of time, the linguistic corpus has become a national corpus and its several subcorpora, namely historical, cultural, spoken, dialectological, writers’ subcorpus, proverbs subcorpus and poetic texts subcorpus, have appeared and are working effectively. Among them, the spoken corpus has its own characteristics. The Kazakh language is one of the languages belonging to the Kypchak-Nogai group of Turkic peoples. The Kazakh language is a language that, as a part of the former Soviet Union, was directly influenced by the Russian language and underwent major changes in its spoken and written forms. After the Republic of Kazakhstan gained independence, the Kazakh language received the status of the state language in 1991. However, today, the prestige of the Russian language is still higher than that of the Kazakh language. Therefore, the direct influence of the Russian language on the structure, style, and vocabulary of the Kazakh language continues. In particular, it can be said that the national practice of the spoken language is disappearing, as the spoken form of Kazakh is not used in official gatherings and events of state importance. In this regard, it is very important to collect and preserve examples of spoken language. Recording exemplary spoken texts, converting them into written form, and providing their audio along with orphoepic explanations will serve as a valuable tool for teaching and learning the Kazakh language. Therefore, the report will cover interesting aspects and scientific foundations related to the creation, content, and methodology of the oral subcorpus of the Kazakh language.

Keywords: spoken corpus, Kazakh language, orthoepic norm, LLM

Procedia PDF Downloads 8
10043 The Application of Computer and Technology in Language Teaching and Learning

Authors: Pouya Vakili

Abstract:

Since computers were first introduced into educational facilities, foreign language educators have been faced with the problem of integrating high-tech multimedia techniques into a traditional text-based curriculum. As studies of language teaching have pointed out, ‘Language teaching tends in practice to be eclectic…. There are not only exceptionally many paths and educational means for arriving at a given educational goal, but there are also many types of educational materials which can be used to achieve that goal’. For language educators who are trying to incorporate technology into their curricula, the choices seem endless. Yet the quantity, as well as the limitations, of available computer programs does not guarantee that these programs can be successfully integrated into a curriculum.

Keywords: curriculum, language teaching, learning, multimedia, technology

Procedia PDF Downloads 574
10042 Culture Sensitization: Understanding German Culture by Learning German

Authors: Lakshmi Shenoy

Abstract:

In today’s era of Globalization, arises the need that students and professionals relocate temporarily or permanently to another country in order to pursue their respective academic and career goals. This involves not only learning the local language of the country but also integrating oneself into the native culture. This paper explains the method of understanding a nation’s culture through the study of its language. The method uses language not as a series of rules that connect words together but as a social practice in which one can actively participate. It emphasizes on how culture provides an environment in which languages can flourish and how culture dictates the interpretation of the language especially in case of German. This paper introduces language and culture as inseparable entities, as two sides of the same coin.

Keywords: language and culture, sociolinguistics, Ronald Wardhaugh, German

Procedia PDF Downloads 305
10041 English and the Question of National Language in Nigeria

Authors: Foyewa R. A.

Abstract:

This paper examined the role of English language and the quest for a national language in Nigeria. Various hindrances to the choice of a national language in Nigeria were observed. These include: The dominant role of English language, political instability and multilingual nature of the country. The writer suggested that ’’the three big’’ that is, Hausa, Igbo and Yoruba should be selected as the national languages. It was also suggested that a credit pass in a student’s mother tongue and one of “the three big” (Hausa, Igbo and Yoruba) should constitute the prerequisite for admission into Nigerian higher institutions.

Keywords: English, roles of English, national language, Nigerian languages, Hausa, Igbo, Yoruba

Procedia PDF Downloads 789
10040 The Pen Is Mightier than the Sword: Kurdish Language Policy in Turkey

Authors: Irene Yi

Abstract:

This paper analyzes the development of Kurdish language endangerment in Turkey and Kurdish language education over time. It examines the historical context of the Turkish state, as well as reasons for the Turkish language hegemony. From a linguistic standpoint, the Kurdish language is in danger of extinction despite a large number of speakers, lest Kurdish language education is more widely promoted. The paper argues that Kurdish is no longer in a stable diglossic state; if the current trends continue, the language will lose its vitality. This paper recognizes the importance of education in preserving the language while discussing the changing political and institutional regard for Kurdish education. Lastly, the paper outlines solutions to the issue by looking at a variety of proposals, from creating a Kurdistan to merely changing the linguistic landscape in Turkey. After analysis of possible solutions in terms of realistic ability and effectiveness, the paper concludes that changing linguistic landscape and increasing Kurdish language education are the most ideal first steps in a long fight for Kurdish linguistic equality.

Keywords: endangered, Kurdish, oppression, policy

Procedia PDF Downloads 151
10039 Time and Cost Prediction Models for Language Classification Over a Large Corpus on Spark

Authors: Jairson Barbosa Rodrigues, Paulo Romero Martins Maciel, Germano Crispim Vasconcelos

Abstract:

This paper presents an investigation of the performance impacts regarding the variation of five factors (input data size, node number, cores, memory, and disks) when applying a distributed implementation of Naïve Bayes for text classification of a large Corpus on the Spark big data processing framework. Problem: The algorithm's performance depends on multiple factors, and knowing before-hand the effects of each factor becomes especially critical as hardware is priced by time slice in cloud environments. Objectives: To explain the functional relationship between factors and performance and to develop linear predictor models for time and cost. Methods: the solid statistical principles of Design of Experiments (DoE), particularly the randomized two-level fractional factorial design with replications. This research involved 48 real clusters with different hardware arrangements. The metrics were analyzed using linear models for screening, ranking, and measurement of each factor's impact. Results: Our findings include prediction models and show some non-intuitive results about the small influence of cores and the neutrality of memory and disks on total execution time, and the non-significant impact of data input scale on costs, although notably impacts the execution time.

Keywords: big data, design of experiments, distributed machine learning, natural language processing, spark

Procedia PDF Downloads 120
10038 Developing Kazakh Language Fluency Test in Nazarbayev University

Authors: Saule Mussabekova, Samal Abzhanova

Abstract:

The Kazakh Language Fluency Test, based on the IELTS exam, was implemented in 2012 at Nazarbayev University in Astana, Kazakhstan. We would like to share our experience in developing this exam and some exam results with other language instructors. In this paper, we will cover all these peculiarities and their related issues. The Kazakh Language Fluency Test is a young exam. During its development, we faced many difficulties. One of the goals of the university and the country is to encourage fluency in the Kazakh language for all citizens of the Republic. Nazarbayev University has introduced a Kazakh language program to assist in achieving this goal. This policy is one-step in ensuring that NU students have a thorough understanding of the Kazakh language through a fluency test based on the International English Language Testing System (IELTS). The Kazakh Language Fluency Test exam aims to determine student’s knowledge of Kazakh language. The fact is that there are three types of students at Nazarbayev University: Kazakh-speaking heritage learners, Russian-speaking and English-speaking students. Unfortunately, we have Kazakh students who do not speak Kazakh. All students who finished school with Russian language instruction are given Kazakh Language Fluency Test in order to determine their Kazakh level. After the test exam, all students can choose appropriate Kazakh course: Basic Kazakh, Intermediate Kazakh and Upper-Intermediate Kazakh. The Kazakh Language Fluency Test consists of four parts: Listening, Reading, Writing and Speaking. They are taken on the same day in the abovementioned order.

Keywords: diagnostic test, kazakh language, placement test, test result

Procedia PDF Downloads 406
10037 Foreign Language Curriculum of Mongolian Higher Educational Institutions, Problems and Solutions: In the Example of the Curriculum at National University of Mongolia

Authors: Sainbilegt Dashdorj, Delgerekhtsetseg Tsedev, Odontuya Mishigdorj, Bat-Uchral Ganzorigt

Abstract:

To develop a content-based recommendation of foreign language teaching for foreign language majoring and non-majoring classes at domestic universities by comparing the current situation, the environmental conditions, the curriculum, the plan, the content and so on of Mongolian foreign language teaching with the ones at the universities in the education development leading countries was set as the main goal and thus, it is considered to become an important step not only for solving an urgent foreign language teaching issue at Mongolian higher educational institutions but also for enhancing the foreign language knowledge of the national human resource in the globalizing world.

Keywords: CEFR, content standart, language curriculum, multilingualism

Procedia PDF Downloads 577
10036 Text Similarity in Vector Space Models: A Comparative Study

Authors: Omid Shahmirzadi, Adam Lugowski, Kenneth Younge

Abstract:

Automatic measurement of semantic text similarity is an important task in natural language processing. In this paper, we evaluate the performance of different vector space models to perform this task. We address the real-world problem of modeling patent-to-patent similarity and compare TFIDF (and related extensions), topic models (e.g., latent semantic indexing), and neural models (e.g., paragraph vectors). Contrary to expectations, the added computational cost of text embedding methods is justified only when: 1) the target text is condensed; and 2) the similarity comparison is trivial. Otherwise, TFIDF performs surprisingly well in other cases: in particular for longer and more technical texts or for making finer-grained distinctions between nearest neighbors. Unexpectedly, extensions to the TFIDF method, such as adding noun phrases or calculating term weights incrementally, were not helpful in our context.

Keywords: big data, patent, text embedding, text similarity, vector space model

Procedia PDF Downloads 175
10035 The Use of AI to Measure Gross National Happiness

Authors: Riona Dighe

Abstract:

This research attempts to identify an alternative approach to the measurement of Gross National Happiness (GNH). It uses artificial intelligence (AI), incorporating natural language processing (NLP) and sentiment analysis to measure GNH. We use ‘off the shelf’ NLP models responsible for the sentiment analysis of a sentence as a building block for this research. We constructed an algorithm using NLP models to derive a sentiment analysis score against sentences. This was then tested against a sample of 20 respondents to derive a sentiment analysis score. The scores generated resembled human responses. By utilising the MLP classifier, decision tree, linear model, and K-nearest neighbors, we were able to obtain a test accuracy of 89.97%, 54.63%, 52.13%, and 47.9%, respectively. This gave us the confidence to use the NLP models against sentences in websites to measure the GNH of a country.

Keywords: artificial intelligence, NLP, sentiment analysis, gross national happiness

Procedia PDF Downloads 118
10034 English as a Foreign Language for Deaf Students in the K-12 Schools in Turkey: A Policy Analysis

Authors: Cigdem Fidan

Abstract:

Deaf students in Turkey generally do not have access to foreign language classes. However, the knowledge of foreign languages, especially English, is important for them to access knowledge and other opportunities in the globalizing world. In addition, learning any language including foreign languages is a basic linguistic human right. This study applies critical discourse analysis to examine language ideologies, perceptions of deafness and current language and education policies used for deaf education in Turkey. The findings show that representation of deafness as a disability in policy documents, ignorance the role of sign languages in education and lack of policies that support foreign language education for the deaf may result in inaccessibility of foreign language education for deaf students in Turkey. The paper concludes with recommendations for policymakers, practitioners, and advocates for the deaf.

Keywords: deaf learners, English as a foreign language, language policy, linguistic human rights

Procedia PDF Downloads 381
10033 Sociolinguistics and Language Change

Authors: Banazzouz Halima

Abstract:

Throughout the ages, language has been viewed not only as a simple code of communicating information but rather as the most powerful and versatile medium of maintaining relationships with other people. While,by the end of the 18th century, such matters of scientific investigation concerning the study of human language began to occur under the scope of “Linguistics” generally defined as the scientific study of language. Linguistics, thus, provides a growing body of scientific knowledge about language which can guide the activity of the language teacher and student as well. Moreover,as times passed, the linguistic development engaged language in a broadly practiced academic discipline having relationship with other sciences such as: psychology, sociology, anthropology etc. Therefore, “Sociolinguistics” was given birth during the 1960’s. In fact, the given abstract is mainly linguistic, inserted under the scope of “Sociolinguistics” and by far it highlights on the process of linguistic variation and language change to show that all languages change through time and linguistic systems may vary from one speech community to another providing there is a sense of vitality where people of different parts of the globe may mutually and intelligibly communicate and comprehend each other.

Keywords: language change-sociolinguistics, social context-speech community, vitality of language, linguistic variation, urban dialectology, urban dialectology

Procedia PDF Downloads 628
10032 Multilingualism and the Question of National Language in Nigeria

Authors: Salome Labeh

Abstract:

Diverse Languages that exist in Nigeria, gave rise to the need to choose among these languages, which one or ones to be used as the National Language(s) in Nigeria. The Multilingual Nature of Nigeria has been examined, in relation to the provisional result of 1991 census conducted in Nigeria and the status of language policy in the country, which eventually led to the discovery of the fact that Hausa, Igbo, Yoruba languages have the highest speaker in terms of population, and are already made co-official languages in Nigeria, alongside with English language. Then, these languages should be considered as the National Languages, if eventually a language policy emerges in Nigeria.

Keywords: multilingual, languages, culture, Nigeria

Procedia PDF Downloads 371
10031 A Review on Water Models of Surface Water Environment

Authors: Shahbaz G. Hassan

Abstract:

Water quality models are very important to predict the changes in surface water quality for environmental management. The aim of this paper is to give an overview of the water qualities, and to provide directions for selecting models in specific situation. Water quality models include one kind of model based on a mechanistic approach, while other models simulate water quality without considering a mechanism. Mechanistic models can be widely applied and have capabilities for long-time simulation, with highly complexity. Therefore, more spaces are provided to explain the principle and application experience of mechanistic models. Mechanism models have certain assumptions on rivers, lakes and estuaries, which limits the application range of the model, this paper introduces the principles and applications of water quality model based on the above three scenarios. On the other hand, mechanistic models are more easily to compute, and with no limit to the geographical conditions, but they cannot be used with confidence to simulate long term changes. This paper divides the empirical models into two broad categories according to the difference of mathematical algorithm, models based on artificial intelligence and models based on statistical methods.

Keywords: empirical models, mathematical, statistical, water quality

Procedia PDF Downloads 264