Commenced in January 2007

Frequency: Monthly

Edition: International

Paper Count: 289

Search results for: Keywords: Uzbek

289 Benchmarking Bert-Based Low-Resource Language: Case Uzbek NLP Models

Authors: Jamshid Qodirov, Sirojiddin Komolov, Ravilov Mirahmad, Olimjon Mirzayev

Abstract:

Nowadays, natural language processing tools play a crucial role in our daily lives, including various techniques with text processing. There are very advanced models in modern languages, such as English, Russian etc. But, in some languages, such as Uzbek, the NLP models have been developed recently. Thus, there are only a few NLP models in Uzbek language. Moreover, there is no such work that could show which Uzbek NLP model behaves in different situations and when to use them. This work tries to close this gap and compares the Uzbek NLP models existing as of the time this article was written. The authors try to compare the NLP models in two different scenarios: sentiment analysis and sentence similarity, which are the implementations of the two most common problems in the industry: classification and similarity. Another outcome from this work is two datasets for classification and sentence similarity in Uzbek language that we generated ourselves and can be useful in both industry and academia as well.

Keywords: NLP, benchmak, bert, vectorization

Procedia PDF Downloads 35

288 A Study of The Contrasts and Cultural Commonalities of the Hazara and Uzbek Peoples of Afghanistan

Authors: Sadullah Rahmani

Abstract:

Legends, stories, beliefs and traditions in every nation represent the collective dreams, secrets and aspirations of a nation and on the other hand, the foundation of their collective memory; What generally forms the foundation of the culture of any nation has undergone changes and transformations due to the passage of time and changes in political, religious and social conditions. Afghanistan is one of the richest countries in terms of cultural diversity. This country is home to people of different languages, ethnicities and religions. The purpose of this article is to analyze the contrasts and cultural commonalities between two ethnic groups in Afghanistan, namely the Hazara and Uzbek peoples. This research was done with qualitative method and structured interview tool. The method of data analysis is content analysis. In order to explain the intercultural sensitivities of the two groups, Milton Bennett's intercultural sensitivities measures have been used. Based on the theory of intercultural sensitivities, the development of communication is an important factor in reducing intercultural sensitivities. In this research, 8 people from the Hazara and Uzbek tribes were interviewed. Various factors such as customs and manners, music, language, art, lifestyle, etc. have been examined in the article. These factors can contribute to cultural differences and commonalities between the Hazara and Uzbek peoples. The results of this research show that according to Bennett's theory, there are less cultural sensitivities between the Hazara and Uzbek peoples of Afghanistan, especially in matters of marriage, language, economic poverty, being discriminated against, and work relationships; But cultural sensitivities are more in many other cases such as education, religion and the formation of cultural communities.

Keywords: Keywords: Uzbek, language, culture, religion, Hazara.

Procedia PDF Downloads 20

287 Teaching Food Discourse in Cross-Cultural Communication Lectures at University

Authors: Sanjar Davronov

Abstract:

Linguistic research of food discourse helps to analyze gastronomic picture of the world which plays important role in cross-cultural communications. 20 hours lecture can’t provide broad knowledge about national picture of the world of native speakers whose language being studied by future translator students. This abstract analyses how to research food discourse in “Cross-cultural (or lingvo-cultural) communication” lectures for ESL students. During compare Uzbek and American national meals, we found some specific features of food names in both countries. For example: If names of food includes advertising character in USA restaurant menus like: New York strip Sirloin crowned with Fresh – squeezed orange and lemon with a hint of garlic; Uzbek meals names are too simple, short and force general afford in underlining action – preparation process like: “Dimlama” (dimla(verb-to stew)+ma(suffix of past perfect like- stew- stewed). “Qovurdoq” (qovur (verb- to fry)+ doq (suffix of adverb like “fried one”) but these are the most delicious and difficult in preparing national meals however it is heritage of national cuisine. There are also similarity between US and Uzbek food names which has geographical color - South African Lobster tail; Qashqadaryo tandiri (lamb prepared in “tandir” typical national oven with pine leafs in Qashkadarya region). Food for European people contains physical context more than spiritual but in Asian literature especially Uzbek food has some pragmatic stuff: salt and bread (associates with hospitality and humanity), don’t be faithlessness 40 for owners of house where you where a guest. We share some teaching techniques for food discourse analyzing lectures.

Keywords: cross-cultural communications, food discourse, ESL lectures, linguistic research

Procedia PDF Downloads 601

286 A Cross-Gender Statistical Analysis of Tuvinian Intonation Features in Comparison With Uzbek and Azerbaijani

Authors: Daria Beziakina, Elena Bulgakova

Abstract:

The paper deals with cross-gender and cross-linguistic comparison of pitch characteristics for Tuvinian with two other Turkic languages - Uzbek and Azerbaijani, based on the results of statistical analysis of pitch parameter values and intonation patterns used by male and female speakers. The main goal of our work is to obtain the ranges of pitch parameter values typical for Tuvinian speakers for the purpose of automatic language identification. We also propose a cross-gender analysis of declarative intonation in the poorly studied Tuvinian language. The ranges of pitch parameter values were obtained by means of specially developed software that deals with the distribution of pitch values and allows us to obtain statistical language-specific pitch intervals.

Keywords: speech analysis, statistical analysis, speaker recognition, identification of person

Procedia PDF Downloads 331

285 The Importance of Country-of-Origin Information and Perceived Product Quality in Uzbekistan

Authors: Begzod Nishanov, Farhod Karimov

Abstract:

Globalization and the internet have completely changed the way in which businesses operate as well as has equipped customers with endless potential. Today, consumers’ product choice is not only affected by branding, price and quality of the product, but also by the country-of-origin information. Precisely, ‘Made In’ label is considered as one of the driving factors which directly impact on consumers’ preferences. Generally, it is obvious that products manufactured in less developed countries are considered to be of lower quality and riskier compared to the products made in developed countries. In this regard, it is worth to note that this phenomenon is mainly applicable to western developed countries. However, there is a lack of empirical research on underlying the influence of country-of-origin phenomenon in emerging economies such as Uzbekistan. Today, Uzbek market is being dominated by growing number of foreign made products. Uzbek manufacturers are facing intense competition not only from local producers but also from the availability of foreign goods suppliers. Consequently, consumers are given wider choice of products than ever before. In this regard, it is important to define the importance of country-of-origin information in order to understand Uzbek consumers’ preference. The methodology of the research is formulated based on the methodology of previous papers. A total 527 online questionnaires were completed. Data analysis was conducted using factor analysis and analysis of variance test (ANOVA). Findings of the research support the view that Uzbek consumers attach great importance to the country-of-origin information of products. Precisely, it can be stated that Uzbek people perceive product quality by its ‘Made in...’ label, especially when buying high involvement goods such as car or refrigerator. Another findings of the paper show that products manufactured in developed countries including Germany, Japan and USA are found to be of high quality, while products manufactured in less developed countries are considered to be of lower quality. Marketers can use this information for segmentation purposes. For example, products manufactured in less developed countries can be targeted for low-to-middle income families while goods manufactured in developed countries can be targeted for higher income families. In conclusion, it can be stated that perceived product quality of products that are made in Uzbekistan has slightly increased since 18 years. It implies that nowadays products under ‘Made in Uzbekistan’ label is continually becoming available to many consumers in foreign markets, especially among Commonwealth of Independent States (CIS) countries. Therefore, conducting further research to explore the phenomenon of country-of-origin information and perceived product quality in emerging markets is of paramount importance.

Keywords: country-of-origin, consumer behavior, product evaluation, perceived quality

Procedia PDF Downloads 233

284 Bibliometrics of 'Community Garden' and Associated Keywords

Authors: Guilherme Reis Ranieri, Guilherme Leite Gaudereto, Michele Toledo, Luis Fernando Amato-Lourenco, Thais Mauad

Abstract:

Given the importance to urban sustainability and the growing relevance of the term ‘community garden’, this paper aims to conduct a bibliometric analysis of the term. Using SCOPUS as database, we analyzed 105 articles that contained the keywords ‘community garden’, and conducted a cluster analysis with the associated keywords. As results, we found 205 articles and 404 different keywords. Among the keywords, 334 are not repeated anytime, 44 are repeated 2 times and 9 appear 3 times. The most frequent keywords are: community food systems (74), urban activism (14), Communities of practice (6), food production (6) and public rethoric (5). Within the areas, which contains more articles are: social sciences (74), environmental science (29) and agricultural and biological sciences (24).The three main countries that concentrated the papers are United States (54), Canada (15) and Australia (12). The main journal with these keywords is Local Environment (10). The first publication was in 1999, and by 2010 concentrated 30,5% of the publications. The other 69,5% occurred 2010 to 2015, indicating an increase in frequency. We can conclude that the papers, based on the distribution of the keywords, are still scattered in various research topics and presents high variability between subjects.

Keywords: bibliometrics, community garden, metrics, urban agriculture

Procedia PDF Downloads 341

283 Issue Reorganization Using the Measure of Relevance

Authors: William Wong Xiu Shun, Yoonjin Hyun, Mingyu Kim, Seongi Choi, Namgyu Kim

Abstract:

Recently, the demand of extracting the R&D keywords from the issues and using them in retrieving R&D information is increasing rapidly. But it is hard to identify the related issues or to distinguish them. Although the similarity between the issues cannot be identified, but with the R&D lexicon, the issues that always shared the same R&D keywords can be determined. In details, the R&D keywords that associated with particular issue is implied the key technology elements that needed to solve the problem of the particular issue. Furthermore, the related issues that sharing the same R&D keywords can be showed in a more systematic way through the issue clustering constructed from the perspective of R&D. Thus, sharing of the R&D result and reusable of the R&D technology can be facilitated. Indirectly, the redundancy of investment on the same R&D can be reduce as the R&D information can be shared between those corresponding issues and reusability of the related R&D can be improved. Therefore, a methodology of constructing an issue clustering from the perspective of common R&D keywords is proposed to satisfy the demands mentioned.

Keywords: clustering, social network analysis, text mining, topic analysis

Procedia PDF Downloads 561

282 Exploring Research Trends and Topics in Intervention on Metabolic Syndrome Using Network Analysis

Authors: Lee Soo-Kyoung, Kim Young-Su

Abstract:

This study established a network related to metabolic syndrome intervention by conducting a social network analysis of titles, keywords, and abstracts, and it identified emerging topics of research. It visualized an interconnection between critical keywords and investigated their frequency of appearance to construe the trends in metabolic syndrome intervention measures used in studies conducted over 38 years (1979–2017). It examined a collection of keywords from 8,285 studies using text rank analyzer, NetMiner 4.0. The analysis revealed 5 groups of newly emerging keywords in the research. By examining the relationship between keywords with reference to their betweenness centrality, the following clusters were identified. Thus if new researchers refer to existing trends to establish the subject of their study and the direction of the development of future research on metabolic syndrome intervention can be predicted.

Keywords: intervention, metabolic syndrome, network analysis, research, the trend

Procedia PDF Downloads 188

281 Evidence of a Negativity Bias in the Keywords of Scientific Papers

Authors: Kseniia Zviagintseva, Brett Buttliere

Abstract:

Science is fundamentally a problem-solving enterprise, and scientists pay more attention to the negative things, that cause them dissonance and negative affective state of uncertainty or contradiction. While this is agreed upon by philosophers of science, there are few empirical demonstrations. Here we examine the keywords from those papers published by PLoS in 2014 and show with several sentiment analyzers that negative keywords are studied more than positive keywords. Our dataset is the 927,406 keywords of 32,870 scientific articles in all fields published in 2014 by the journal PLOS ONE (collected from Altmetric.com). Counting how often the 47,415 unique keywords are used, we can examine whether those negative topics are studied more than positive. In order to find the sentiment of the keywords, we utilized two sentiment analysis tools, Hu and Liu (2004) and SentiStrength (2014). The results below are for Hu and Liu as these are the less convincing results. The average keyword was utilized 19.56 times, with half of the keywords being utilized only 1 time and the maximum number of uses being 18,589 times. The keywords identified as negative were utilized 37.39 times, on average, with the positive keywords being utilized 14.72 times and the neutral keywords - 19.29, on average. This difference is only marginally significant, with an F value of 2.82, with a p of .05, but one must keep in mind that more than half of the keywords are utilized only 1 time, artificially increasing the variance and driving the effect size down. To examine more closely, we looked at those top 25 most utilized keywords that have a sentiment. Among the top 25, there are only two positive words, ‘care’ and ‘dynamics’, in position numbers 5 and 13 respectively, with all the rest being identified as negative. ‘Diseases’ is the most studied keyword with 8,790 uses, with ‘cancer’ and ‘infectious’ being the second and fourth most utilized sentiment-laden keywords. The sentiment analysis is not perfect though, as the words ‘diseases’ and ‘disease’ are split by taking 1st and 3rd positions. Combining them, they remain as the most common sentiment-laden keyword, being utilized 13,236 times. More than just splitting the words, the sentiment analyzer logs ‘regression’ and ‘rat’ as negative, and these should probably be considered false positives. Despite these potential problems, the effect is apparent, as even the positive keywords like ‘care’ could or should be considered negative, since this word is most commonly utilized as a part of ‘health care’, ‘critical care’ or ‘quality of care’ and generally associated with how to improve it. All in all, the results suggest that negative concepts are studied more, also providing support for the notion that science is most generally a problem-solving enterprise. The results also provide evidence that negativity and contradiction are related to greater productivity and positive outcomes.

Keywords: bibliometrics, keywords analysis, negativity bias, positive and negative words, scientific papers, scientometrics

Procedia PDF Downloads 166

280 Behavioral Finance in Hundred Keywords

Authors: Ramon Hernán, Maria Teresa Corzo

Abstract:

This study examines the impact and contribution of the main journals in the discipline of behavioral finance to determine the state of the art of the discipline and the growth lines and concepts studied to date. This is a unique and novel study given that a review of the discipline has not been carried out through the keywords of the articles that allows visualizing through this component of the research, which are the main topics of discussion and the relationships that arise between the concepts discussed. To carry out this study, 3,876 articles have been taken as a reference, which includes 15,859 keywords from the main journals responsible for the growth of the discipline.; Journal of Behavioral Finance, Review of Behavioral Finance, Journal of Behavioral and Experimental Economics, Journal of Behavioral and Experimental Economics and Review of Behavioral Finance. The results indicate which are the topics most covered in the discipline throughout the period from 2000 to 2020, how these concepts have been dealt with on a recurring basis along with others throughout the aforementioned period and how the different concepts have been grouped based on the keywords established by the authors for the classification of their articles with a network diagram to complete the analysis.

Keywords: behavioral finance, keywords, co-words, top journals, data visualization

Procedia PDF Downloads 176

279 Evaluate Existing Mental Health Intervention Programs Tailored for International Students in China

Authors: Nargiza Nuralieva

Abstract:

This meta-analysis investigates the effectiveness of mental health interventions tailored for international students in China, with a specific focus on Uzbek students and Silk Road scholarship recipients. The comprehensive literature review synthesizes existing studies, papers, and reports, evaluating the outcomes, limitations, and cultural considerations of these programs. Data selection targets mental health programs for international students, honing in on a subset analysis related to Uzbek students and Silk Road scholarship recipients. The analysis encompasses diverse outcome measures, such as reported stress levels, utilization rates of mental health services, academic performance, and more. Results reveal a consistent and statistically significant reduction in reported stress levels, emphasizing the positive impact of these interventions. Utilization rates of mental health services witness a significant increase, highlighting the accessibility and effectiveness of support. Retention rates show marked improvement, though academic performance yields mixed findings, prompting nuanced exploration. Psychological well-being, quality of life, and overall well-being exhibit substantial enhancements, aligning with the overarching goal of holistic student development. Positive outcomes are observed in increased help-seeking behavior, positive correlations with social support, and significant reductions in anxiety levels. Cultural adaptation and satisfaction with interventions both indicate positive outcomes, underscoring the effectiveness of culturally sensitive mental health support. The findings emphasize the importance of tailored mental health interventions for international students, providing novel insights into the specific needs of Uzbek students and Silk Road scholarship recipients. This research contributes to a nuanced understanding of the multifaceted impact of mental health programs on diverse student populations, offering valuable implications for the design and refinement of future interventions. As educational institutions continue to globalize, addressing the mental health needs of international students remains pivotal for fostering inclusive and supportive learning environments.

Keywords: international students, mental health interventions, cross-cultural support, silk road scholarship, meta-analysis

Procedia PDF Downloads 35

278 Lecture Video Indexing and Retrieval Using Topic Keywords

Authors: B. J. Sandesh, Saurabha Jirgi, S. Vidya, Prakash Eljer, Gowri Srinivasa

Abstract:

In this paper, we propose a framework to help users to search and retrieve the portions in the lecture video of their interest. This is achieved by temporally segmenting and indexing the lecture video using the topic keywords. We use transcribed text from the video and documents relevant to the video topic extracted from the web for this purpose. The keywords for indexing are found by applying the non-negative matrix factorization (NMF) topic modeling techniques on the web documents. Our proposed technique first creates indices on the transcribed documents using the topic keywords, and these are mapped to the video to find the start and end time of the portions of the video for a particular topic. This time information is stored in the index table along with the topic keyword which is used to retrieve the specific portions of the video for the query provided by the users.

Keywords: video indexing and retrieval, lecture videos, content based video search, multimodal indexing

Procedia PDF Downloads 234

277 Finding Related Scientific Documents Using Formal Concept Analysis

Authors: Nadeem Akhtar, Hira Javed

Abstract:

An important aspect of research is literature survey. Availability of a large amount of literature across different domains triggers the need for optimized systems which provide relevant literature to researchers. We propose a search system based on keywords for text documents. This experimental approach provides a hierarchical structure to the document corpus. The documents are labelled with keywords using KEA (Keyword Extraction Algorithm) and are automatically organized in a lattice structure using Formal Concept Analysis (FCA). This groups the semantically related documents together. The hierarchical structure, based on keywords gives out only those documents which precisely contain them. This approach open doors for multi-domain research. The documents across multiple domains which are indexed by similar keywords are grouped together. A hierarchical relationship between keywords is obtained. To signify the effectiveness of the approach, we have carried out the experiment and evaluation on Semeval-2010 Dataset. Results depict that the presented method is considerably successful in indexing of scientific papers.

Keywords: formal concept analysis, keyword extraction algorithm, scientific documents, lattice

Procedia PDF Downloads 310

276 Identification of Spam Keywords Using Hierarchical Category in C2C E-Commerce

Authors: Shao Bo Cheng, Yong-Jin Han, Se Young Park, Seong-Bae Park

Abstract:

Consumer-to-Consumer (C2C) E-commerce has been growing at a very high speed in recent years. Since identical or nearly-same kinds of products compete one another by relying on keyword search in C2C E-commerce, some sellers describe their products with spam keywords that are popular but are not related to their products. Though such products get more chances to be retrieved and selected by consumers than those without spam keywords, the spam keywords mislead the consumers and waste their time. This problem has been reported in many commercial services like e-bay and taobao, but there have been little research to solve this problem. As a solution to this problem, this paper proposes a method to classify whether keywords of a product are spam or not. The proposed method assumes that a keyword for a given product is more reliable if the keyword is observed commonly in specifications of products which are the same or the same kind as the given product. This is because that a hierarchical category of a product in general determined precisely by a seller of the product and so is the specification of the product. Since higher layers of the hierarchical category represent more general kinds of products, a reliable degree is differently determined according to the layers. Hence, reliable degrees from different layers of a hierarchical category become features for keywords and they are used together with features only from specifications for classification of the keywords. Support Vector Machines are adopted as a basic classifier using the features, since it is powerful, and widely used in many classification tasks. In the experiments, the proposed method is evaluated with a golden standard dataset from Yi-han-wang, a Chinese C2C e-commerce, and is compared with a baseline method that does not consider the hierarchical category. The experimental results show that the proposed method outperforms the baseline in F1-measure, which proves that spam keywords are effectively identified by a hierarchical category in C2C e-commerce.

Keywords: spam keyword, e-commerce, keyword features, spam ﬁltering

Procedia PDF Downloads 276

275 Analyzing Keyword Networks for the Identification of Correlated Research Topics

Authors: Thiago M. R. Dias, Patrícia M. Dias, Gray F. Moita

Abstract:

The production and publication of scientific works have increased significantly in the last years, being the Internet the main factor of access and distribution of these works. Faced with this, there is a growing interest in understanding how scientific research has evolved, in order to explore this knowledge to encourage research groups to become more productive. Therefore, the objective of this work is to explore repositories containing data from scientific publications and to characterize keyword networks of these publications, in order to identify the most relevant keywords, and to highlight those that have the greatest impact on the network. To do this, each article in the study repository has its keywords extracted and in this way the network is characterized, after which several metrics for social network analysis are applied for the identification of the highlighted keywords.

Keywords: bibliometrics, data analysis, extraction and data integration, scientometrics

Procedia PDF Downloads 227

274 Keyword Network Analysis on the Research Trends of Life-Long Education for People with Disabilities in Korea

Authors: Jakyoung Kim, Sungwook Jang

Abstract:

The purpose of this study is to examine the research trends of life-long education for people with disabilities using a keyword network analysis. For this purpose, 151 papers were selected from 594 papers retrieved using keywords such as 'people with disabilities' and 'life-long education' in the Korean Education and Research Information Service. The Keyword network analysis was constructed by extracting and coding the keyword used in the title of the selected papers. The frequency of the extracted keywords, the centrality of degree, and betweenness was analyzed by the keyword network. The results of the keyword network analysis are as follows. First, the main keywords that appeared frequently in the study of life-long education for people with disabilities were 'people with disabilities', 'life-long education', 'developmental disabilities', 'current situations', 'development'. The research trends of life-long education for people with disabilities are focused on the current status of the life-long education and the program development. Second, the keyword network analysis and visualization showed that the keywords with high frequency of occurrences also generally have high degree centrality and betweenness centrality. In terms of the keyword network diagram, it was confirmed that research trends of life-long education for people with disabilities are centered on six prominent keywords. Based on these results, it was discussed that life-long education for people with disabilities in the future needs to expand the subjects and the supporting areas of the life-long education, and the research needs to be further expanded into more detailed and specific areas.

Keywords: life-long education, people with disabilities, research trends, keyword network analysis

Procedia PDF Downloads 323

273 A Framework of Product Information Service System Using Mobile Image Retrieval and Text Mining Techniques

Authors: Mei-Yi Wu, Shang-Ming Huang

Abstract:

The online shoppers nowadays often search the product information on the Internet using some keywords of products. To use this kind of information searching model, shoppers should have a preliminary understanding about their interesting products and choose the correct keywords. However, if the products are first contact (for example, the worn clothes or backpack of passengers which you do not have any idea about the brands), these products cannot be retrieved due to insufficient information. In this paper, we discuss and study the applications in E-commerce using image retrieval and text mining techniques. We design a reasonable E-commerce application system containing three layers in the architecture to provide users product information. The system can automatically search and retrieval similar images and corresponding web pages on Internet according to the target pictures which taken by users. Then text mining techniques are applied to extract important keywords from these retrieval web pages and search the prices on different online shopping stores with these keywords using a web crawler. Finally, the users can obtain the product information including photos and prices of their favorite products. The experiments shows the efficiency of proposed system.

Keywords: mobile image retrieval, text mining, product information service system, online marketing

Procedia PDF Downloads 339

272 Artistic and Technological Features of Bukhara Copper Embossing in the 20th Century

Authors: Zebiniso Mukhsinova

Abstract:

This article discusses the dynamics of the historical development of the Bukhara school of copper-stamped products. Copper embossing is one of the leading crafts of Uzbek decorative and applied art. A critical and analytical assessment of innovative ideas, artistic and technological features, which arose as a result of the inter-regional synthesis of a local school, is presented. The article includes a detailed analysis of exhibits in museum collections, a research of the scientific papers of leading art critics and differs from previous studies in this area.

Keywords: applied art, copper embossing, metalwork, ewer, tray, Bukhara school

Procedia PDF Downloads 129

271 Combining Corpus Linguistics and Critical Discourse Analysis to Study Power Relations in Hindi Newspapers

Authors: Vandana Mishra, Niladri Sekhar Dash, Jayshree Charkraborty

Abstract:

This present paper focuses on the application of corpus linguistics techniques for critical discourse analysis (CDA) of Hindi newspapers. While Corpus linguistics is the study of language as expressed in corpora (samples) of 'real world' text, CDA is an interdisciplinary approach to the study of discourse that views language as a form of social practice. CDA has mainly been studied from a qualitative perspective. However, we can say that recent studies have begun combining corpus linguistics with CDA in analyzing large volumes of text for the study of existing power relations in society. The corpus under our study is also of a sizable amount (1 million words of Hindi newspaper texts) and its analysis requires an alternative analytical procedure. So, we have combined both the quantitative approach i.e. the use of corpus techniques with CDA’s traditional qualitative analysis. In this context, we have focused on the Keyword Analysis Sorting Concordance Lines of the selected Keywords and calculating collocates of the keywords. We have made use of the Wordsmith Tool for all these analysis. The analysis starts with identifying the keywords in the political news corpus when compared with the main news corpus. The keywords are extracted from the corpus based on their keyness calculated through statistical tests like chi-squared test and log-likelihood test on the frequent words of the corpus. Some of the top occurring keywords are मोदी (Modi), भाजपा (BJP), कांग्रेस (Congress), सरकार (Government) and पार्टी (Political party). This is followed by the concordance analysis of these keywords which generates thousands of lines but we have to select few lines and examine them based on our objective. We have also calculated the collocates of the keywords based on their Mutual Information (MI) score. Both concordance and collocation help to identify lexical patterns in the political texts. Finally, all these quantitative results derived from the corpus techniques will be subjectively interpreted in accordance to the CDA’s theory to examine the ways in which political news discourse produces social and political inequality, power abuse or domination.

Keywords: critical discourse analysis, corpus linguistics, Hindi newspapers, power relations

Procedia PDF Downloads 201

270 Design an Algorithm for Software Development in CBSE Envrionment Using Feed Forward Neural Network

Authors: Amit Verma, Pardeep Kaur

Abstract:

In software development organizations, Component based Software engineering (CBSE) is emerging paradigm for software development and gained wide acceptance as it often results in increase quality of software product within development time and budget. In component reusability, main challenges are the right component identification from large repositories at right time. The major objective of this work is to provide efficient algorithm for storage and effective retrieval of components using neural network and parameters based on user choice through clustering. This research paper aims to propose an algorithm that provides error free and automatic process (for retrieval of the components) while reuse of the component. In this algorithm, keywords (or components) are extracted from software document, after by applying k mean clustering algorithm. Then weights assigned to those keywords based on their frequency and after assigning weights, ANN predicts whether correct weight is assigned to keywords (or components) or not, otherwise it back propagates in to initial step (re-assign the weights). In last, store those all keywords into repositories for effective retrieval. Proposed algorithm is very effective in the error correction and detection with user base choice while choice of component for reusability for efficient retrieval is there.

Keywords: component based development, clustering, back propagation algorithm, keyword based retrieval

Procedia PDF Downloads 362

269 Chatbots in Education: Case of Development Using a Chatbot Development Platform

Authors: Dulani Jayasuriya

Abstract:

This study outlines the developmental steps of a chatbot for administrative purposes of a large undergraduate course. The chatbot is able to handle student queries about administrative details, including assessment deadlines, course documentation, how to navigate the course, group formation, etc. The development window screenshots are that of a free account on the Snatchbot platform such that this can be adopted by the wider public. While only one connection to an answer based on possible keywords is shown here, one needs to develop multiple connections leading to different answers based on different keywords for the actual chatbot to function. The overall flow of the chatbot showing connections between different interactions is depicted at the end.

Keywords: chatbots, education, technology, snatch bot, artificial intelligence

Procedia PDF Downloads 86

268 Information Extraction Based on Search Engine Results

Authors: Mohammed R. Elkobaisi, Abdelsalam Maatuk

Abstract:

The search engines are the large scale information retrieval tools from the Web that are currently freely available to all. This paper explains how to convert the raw resulted number of search engines into useful information. This represents a new method for data gathering comparing with traditional methods. When a query is submitted for a multiple numbers of keywords, this take a long time and effort, hence we develop a user interface program to automatic search by taking multi-keywords at the same time and leave this program to collect wanted data automatically. The collected raw data is processed using mathematical and statistical theories to eliminate unwanted data and converting it to usable data.

Keywords: search engines, information extraction, agent system

Procedia PDF Downloads 411

267 Searchable Encryption in Cloud Storage

Authors: Ren Junn Hwang, Chung-Chien Lu, Jain-Shing Wu

Abstract:

Cloud outsource storage is one of important services in cloud computing. Cloud users upload data to cloud servers to reduce the cost of managing data and maintaining hardware and software. To ensure data confidentiality, users can encrypt their files before uploading them to a cloud system. However, retrieving the target file from the encrypted files exactly is difficult for cloud server. This study proposes a protocol for performing multikeyword searches for encrypted cloud data by applying k-nearest neighbor technology. The protocol ranks the relevance scores of encrypted files and keywords, and prevents cloud servers from learning search keywords submitted by a cloud user. To reduce the costs of file transfer communication, the cloud server returns encrypted files in order of relevance. Moreover, when a cloud user inputs an incorrect keyword and the number of wrong alphabet does not exceed a given threshold; the user still can retrieve the target files from cloud server. In addition, the proposed scheme satisfies security requirements for outsourced data storage.

Keywords: fault-tolerance search, multi-keywords search, outsource storage, ranked search, searchable encryption

Procedia PDF Downloads 362

266 The Influence of Cognitive Load in the Acquisition of Words through Sentence or Essay Writing

Authors: Breno Barrreto Silva, Agnieszka Otwinowska, Katarzyna Kutylowska

Abstract:

Research comparing lexical learning following the writing of sentences and longer texts with keywords is limited and contradictory. One possibility is that the recursivity of writing may enhance processing and increase lexical learning; another possibility is that the higher cognitive load of complex-text writing (e.g., essays), at least when timed, may hinder the learning of words. In our study, we selected 2 sets of 10 academic keywords matched for part of speech, length (number of characters), frequency (SUBTLEXus), and concreteness, and we asked 90 L1-Polish advanced-level English majors to use the keywords when writing sentences, timed (60 minutes) or untimed essays. First, all participants wrote a timed Control essay (60 minutes) without keywords. Then different groups produced Timed essays (60 minutes; n=33), Untimed essays (n=24), or Sentences (n=33) using the two sets of glossed keywords (counterbalanced). The comparability of the participants in the three groups was ensured by matching them for proficiency in English (LexTALE), and for few measures derived from the control essay: VocD (assessing productive lexical diversity), normed errors (assessing productive accuracy), words per minute (assessing productive written fluency), and holistic scores (assessing overall quality of production). We measured lexical learning (depth and breadth) via an adapted Vocabulary Knowledge Scale (VKS) and a free association test. Cognitive load was measured in the three essays (Control, Timed, Untimed) using normed number of errors and holistic scores (TOEFL criteria). The number of errors and essay scores were obtained from two raters (interrater reliability Pearson’s r=.78-91). Generalized linear mixed models showed no difference in the breadth and depth of keyword knowledge after writing Sentences, Timed essays, and Untimed essays. The task-based measurements found that Control and Timed essays had similar holistic scores, but that Untimed essay had better quality than Timed essay. Also, Untimed essay was the most accurate, and Timed essay the most error prone. Concluding, using keywords in Timed, but not Untimed, essays increased cognitive load, leading to more errors and lower quality. Still, writing sentences and essays yielded similar lexical learning, and differences in the cognitive load between Timed and Untimed essays did not affect lexical acquisition.

Keywords: learning academic words, writing essays, cognitive load, english as an L2

Procedia PDF Downloads 55

265 A Cross-Dialect Statistical Analysis of Final Declarative Intonation in Tuvinian

Authors: D. Beziakina, E. Bulgakova

Abstract:

This study continues the research on Tuvinian intonation and presents a general cross-dialect analysis of intonation of Tuvinian declarative utterances, specifically the character of the tone movement in order to test the hypothesis about the prevalence of level tone in some Tuvinian dialects. The results of the analysis of basic pitch characteristics of Tuvinian speech (in general and in comparison with two other Turkic languages - Uzbek and Azerbaijani) are also given in this paper. The goal of our work was to obtain the ranges of pitch parameter values typical for Tuvinian speech. Such language-specific values can be used in speaker identification systems in order to get more accurate results of ethnic speech analysis. We also present the results of a cross-dialect analysis of declarative intonation in the poorly studied Tuvinian language.

Keywords: speech analysis, statistical analysis, speaker recognition, identification of person

Procedia PDF Downloads 451

264 A.T.O.M.- Artificial Intelligent Omnipresent Machine

Authors: R. Kanthavel, R. Yogesh Kumar, T. Narendrakumar, B. Santhosh, S. Surya Prakash

Abstract:

This paper primarily focuses on developing an affordable personal assistant and the implementation of it in the field of Artificial Intelligence (AI) to create a virtual assistant/friend. The problem in existing home automation techniques is that it requires the usage of exact command words present in the database to execute the corresponding task. Our proposed work is ATOM a.k.a ‘Artificial intelligence Talking Omnipresent Machine’. Our inspiration came from an unlikely source- the movie ‘Iron Man’ in which a character called J.A.R.V.I.S has omnipresence, and device controlling capability. This device can control household devices in real time and send the live information to the user. This device does not require the user to utter the exact commands specified in the database as it can capture the keywords from the uttered commands, correlates the obtained keywords and perform the specified task. This ability to compare and correlate the keywords gives the user the liberty to give commands which are not necessarily the exact words provided in the database. The proposed work has a higher flexibility (due to its keyword extracting ability from the user input) comparing to the existing work Intelligent Home automation System (IHAS), is more accurate, and is much more affordable as it makes use of WI-FI module and raspberry pi 2 instead of ZigBee and a computer respectively.

Keywords: home automation, speech recognition, voice control, personal assistant, artificial intelligence

Procedia PDF Downloads 321

263 Identity and Ethnic Conflicts in Afghanistan: Diversity as a Cultural Treasure

Authors: Morteza Azimi

Abstract:

In Afghanistan, as a multi-ethnic country, there have been ethnic conflicts, especially after 2001. These conflicts are more visible among the four main ethnicities Pashtun, Tajik, Hazara, and Uzbek. In this paper, such ethnic conflicts and their roles in the political sphere will be discussed. The distribution of personal electronic ID cards, for example, has been one of the most controversial and unsuccessful projects in Afghanistan. As a result, the lack of clear population statistics has led to several corrupted and unsuccessful presidential elections since 2001. The nation-building process in post-Taliban Afghanistan, as well as the Afghan government’s failure to build a nation, are discussed. By referring to the hybridity theory of Homi Bhabha, it is argued that the process of assimilation for nation-building has not only failed but has deepened ethnic divisions. In the end, some suggestions and solutions for making the most out of ethnic diversity rather than suffering from it will be provided. It will be argued that diversity or difference improves the freedom of choices for groups and individuals; it boosts agency in comparison with life in an assimilated, coherent, and homogeneous society.

Keywords: Afghan identity, ethnicity, nation-building, political system, self and other

Procedia PDF Downloads 214

262 A Corpus-Based Discourse Analysis of the Disappearance of MH370 in Malaysia and United Kingdom Newspapers: A Pilot Study

Authors: Theng Theng Ong

Abstract:

This pilot study adopts a corpus-based discourse analysis to explore the construction of Malaysia airline tragedy MH370 in the selected Malaysian and United Kingdom (UK) newspapers. Fairclough’s three-dimensional model is adopted in the study to support the corpus-based analysis. The analysis aims to determine the ways in which Malaysian Airline tragedy MH370 is linguistically defined and constructed in terms of keywords and collocation. The study also seeks to identify the types of discourse that are presented in the news articles. In addition, the differences or similarities in terms of keywords, topics or issues covered by the selected Malaysian and UK news media are examined.

Keywords: corpus, CDA, newspapers, airline tragedies

Procedia PDF Downloads 279

261 Research Related to the Academic Learning Stress, Reflected into PubMed Website Publications

Authors: Ramona-Niculina Jurcau, Ioana-Marieta Jurcau, Dong Hun Kwak, Nicolae-Alexandru Colceriu

Abstract:

Background: Academic environment led, in time, to the birth of some research subjects concluded with many publications. One of these issues is related to the learning stress. Thus far, the PubMed website displays an impressive number of papers related to the academic stress. Aims: Through this study, we aimed to evaluate the research concerning academic learning stress (ALS), by a retrospective analysis of PubMed publications. Methods: We evaluated the ALS, considering: a) different keywords as - ‘academic stress’ (AS), ‘academic stressors’ (ASs), ‘academic learning stress’ (ALS), ‘academic student stress’ (ASS), ‘academic stress college’ (ASC), ‘medical academic stress’ (MAS), ‘non-medical academic stress’ (NMAS), ‘student stress’ (SS), ‘nursing student stress’ (NS), ‘college student stress’ (CSS), ‘university student stress’ (USS), ‘medical student stress’ (MSS), ‘dental student stress’ (DSS), ‘non-medical student stress’ (NMSS), ‘learning students stress’ (LSS), ‘medical learning student stress’ (MLSS), ‘non-medical learning student stress’ (NMLSS); b) the year average for decades; c) some selection filters provided by PubMed website: Article types - Journal Article (JA), Clinical Trial (CT), Review (R); Species - Humans (H); Sex - Male (M) and Female (F); Ages - 13-18, 19-24, 19-44. Statistical evaluation was made on the basis of the Student test. Results: There were differences between keywords, referring to all filters. Nevertheless, for all keywords were noted the following: the majority of studies have indicated that subjects were humans; there were no important differences between the number of subjects M and F; the age of participants was mentioned only in some studies, predominating those with teenagers and subjects between 19-24 years. Conclusions: 1) PubMed publications document that concern for the research field of academic stress, lasts for 56 years and was materialized in more than 5.010 papers. 2) Number of publications in the field of academic stress varies depending on the selected keywords: those with a general framing (AS, ASs, ALS, ASS, SS, USS, LSS) are more numerous than those with a specific framing (ASC, MAS, NMAS, NS, CSS, MSS, DSS, NMSS, MLSS, NMLSS); those concerning the academic medical environment (MAS, NS, MSS, DSS, MLSS) prevailed compared to the non-medical environment (NMAS, NMSS, NMLSS). 3) Most of the publications are included at JA, of which a small percentage are CT and R. 4) Most of the academic stress studies were conducted with subjects both M and F, most aged under 19 years and between 19-24 years.

Keywords: academic stress, student stress, academic learning stress, medical student stress

Procedia PDF Downloads 541

260 Algorithm for Information Retrieval Optimization

Authors: Kehinde K. Agbele, Kehinde Daniel Aruleba, Eniafe F. Ayetiran

Abstract:

When using Information Retrieval Systems (IRS), users often present search queries made of ad-hoc keywords. It is then up to the IRS to obtain a precise representation of the user’s information need and the context of the information. This paper investigates optimization of IRS to individual information needs in order of relevance. The study addressed development of algorithms that optimize the ranking of documents retrieved from IRS. This study discusses and describes a Document Ranking Optimization (DROPT) algorithm for information retrieval (IR) in an Internet-based or designated databases environment. Conversely, as the volume of information available online and in designated databases is growing continuously, ranking algorithms can play a major role in the context of search results. In this paper, a DROPT technique for documents retrieved from a corpus is developed with respect to document index keywords and the query vectors. This is based on calculating the weight (

Keywords: information retrieval, document relevance, performance measures, personalization

Procedia PDF Downloads 222