Search results for: text mining
Commenced in January 2007
Frequency: Monthly
Edition: International
Paper Count: 2210

770 The Presence of Anglicisms in Italian Fashion Magazines and Fashion Blogs

Authors: Vivian Orsi

Abstract:

The present research investigates the lexicon of a fashion magazine, whose universe is very receptive to lexical loans, especially those from English, called Anglicisms. Specifically, we intend to discuss the presence of English items and expressions in the Vogue Italia fashion magazine. Besides, we aim to study the anglicisms used in an Italian fashion blog called The Blonde Salad. Within the discussion of fashion blogs and their contributions to scientific studies, we adopt the theories of Lexicology / Lexicography to define Anglicism (BIDERMAN, 2001), and the observation of its prestige in the Italian Language (ROGATO, 2008; BISETTO, 2003). According to the theoretical basis mentioned, we intend to make a brief analysis of the Anglicisms collected from posts of the first year of existence of such fashion blog, emphasizing also the keywords that have the role to encapsulate the content of the text, allowing the reader to retrieve information from the post of the blog. About the use of English in Italian magazines and blogs, we can affirm that it seems to represent sophistication, assuming the value of prerequisite to participate in the fashion centers of the world. Besides, we believe, as Barthes says (1990, p. 215), that “Fashion does not evolve, it changes: its lexicon is new each year, like that of a language which always keeps the same system but suddenly and regularly ‘changes’ the currency of its words”. Fashion is a mode of communication: it is present in man's interaction with the world, which means that such lexical universe is represented according to the particularities of each culture.

Keywords: anglicism, lexicology, magazines, blogs, fashion

Procedia PDF Downloads 308
769 The Potential of Sentiment Analysis to Categorize Social Media Comments Using German Libraries

Authors: Felix Boehnisch, Alexander Lutz

Abstract:

Based on the number of users and the amount of content posted daily, Facebook is considered the largest social network in the world. This content includes images and text posts from companies as well as private persons, which other users comment on in turn. However, it can be difficult for companies to keep track of all the posts and the reactions to them, especially when there are several posts a day that each attract hundreds to thousands of comments. To facilitate this, this paper examines possible applications of sentiment analysis to social media comments in order to support the work of social media marketing. In a first step, post comments were divided into positive and negative by a subjective rating; the same comments were then checked for their polarity value by the two German Python libraries TextBlobDE and SentiWS and grouped into positive, negative, or neutral. As a control, the subjective classifications were compared with the machine-generated ones using a confusion matrix, and relevant quality criteria were determined. The accuracy of both libraries, at 60% to 66%, was not very meaningful. However, many words or sentences were not evaluated at all, so there seems to be room for optimization to obtain more accurate results. In future studies, the use of these specific German libraries can be optimized to gain better insights, either by applying them to more strictly cleaned data or by adding a sentiment value to emojis, which were removed from the comments in advance because they are not contained in the libraries.
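The comparison step described in this abstract (human labels versus library labels, summarized in a confusion matrix and an accuracy figure) can be sketched in plain Python. This is only an illustration: the label names and the toy data are assumptions, and no TextBlobDE or SentiWS calls are shown.

```python
from collections import Counter

def confusion_matrix(human, machine, labels):
    """Count (human_label, machine_label) pairs for every label combination."""
    counts = Counter(zip(human, machine))
    return {(h, m): counts.get((h, m), 0) for h in labels for m in labels}

def accuracy(human, machine):
    """Share of comments where the machine label equals the human label."""
    matches = sum(1 for h, m in zip(human, machine) if h == m)
    return matches / len(human)

# Hypothetical toy data, not taken from the study.
human_labels = ["pos", "neg", "pos", "neg", "pos"]
machine_labels = ["pos", "neutral", "neg", "neg", "pos"]
LABELS = ["pos", "neg", "neutral"]

matrix = confusion_matrix(human_labels, machine_labels, LABELS)
acc = accuracy(human_labels, machine_labels)
```

Because the human rating has only two classes while the libraries can also return neutral, every neutral prediction counts as an error in this sketch; a study could equally choose to exclude such comments.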

Keywords: Facebook, German libraries, polarity, sentiment analysis, social media comments

Procedia PDF Downloads 157
768 A Translation Criticism of the Persian Translation of “A**Hole No More” Written by Xavier Crement

Authors: Mehrnoosh Pirhayati

Abstract:

Translation can be affected by different meta-textual factors of the target context, such as ideology, politics, and culture. As a result, the rule of fidelity, or being faithful to the source text, can be ignored by the translator. On the other hand, critical discourse analysis, derived from applied linguistics, has entered the field of translation studies and is used by scholars to reveal hidden deviations and possible roots of manipulation. This study focused on the famous Persian translation, by Mahmud Farjami, of the bestselling book “A**hole No More,” written by Xavier Crement (1990), in order to analyze it comparatively and critically against the English original. The researcher applied Pirhayati’s model and framework of translation criticism at the textual and semiotic levels for this qualitative study. It should be noted that Kress and Van Leeuwen’s semiotic model, along with Machin’s model of typographical analysis, was also used at the semiotic level. The results of the comparisons and analyses indicate that this Persian translation of the book is affected by the factors of ideology and economics and reveal that the Islamic attitude causes the translator to employ strategies such as substitution and deletion. Those who may benefit from this research are translation trainers, students of translation studies, critics, and scholars.

Keywords: farjami (2013), Ideology, manipulation, pirhayati's (2013) model of translation criticism, Xavier crement (1990)

Procedia PDF Downloads 195
767 Linguistic Features for Sentence Difficulty Prediction in Aspect-Based Sentiment Analysis

Authors: Adrian-Gabriel Chifu, Sebastien Fournier

Abstract:

One of the challenges of natural language understanding is to deal with the subjectivity of sentences, which may express opinions and emotions that add layers of complexity and nuance. Sentiment analysis is a field that aims to extract and analyze these subjective elements from text, and it can be applied at different levels of granularity, such as document, paragraph, sentence, or aspect. Aspect-based sentiment analysis is a well-studied topic with many available data sets and models. However, there is no clear definition of what makes a sentence difficult for aspect-based sentiment analysis. In this paper, we explore this question by conducting an experiment with three data sets: “Laptops”, “Restaurants”, and “MTSC” (Multi-Target-dependent Sentiment Classification), and a merged version of these three data sets. We study the impact of domain diversity and syntactic diversity on difficulty. We use a combination of classifiers to identify the most difficult sentences and analyze their characteristics. We employ two ways of defining sentence difficulty. The first one is binary and labels a sentence as difficult if the classifiers fail to correctly predict the sentiment polarity. The second one is a six-level scale based on how many of the top five best-performing classifiers can correctly predict the sentiment polarity. We also define 9 linguistic features that, combined, aim at estimating the difficulty at sentence level.
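The two difficulty definitions in this abstract can be illustrated with a short Python sketch. Two points are assumptions rather than statements from the abstract: the binary label is taken to mean that every classifier fails, and the six levels are numbered 0 (all five classifiers correct) to 5 (none correct). The function names are hypothetical.

```python
def binary_difficulty(gold, predictions):
    """True if no classifier predicts the gold polarity (assumed reading of 'fail')."""
    return all(p != gold for p in predictions)

def graded_difficulty(gold, top5_predictions):
    """Six-level scale: 0 means all five classifiers are correct, 5 means none is."""
    if len(top5_predictions) != 5:
        raise ValueError("expected predictions from exactly five classifiers")
    correct = sum(1 for p in top5_predictions if p == gold)
    return 5 - correct

# Hypothetical predictions of five classifiers for one sentence.
preds = ["positive", "negative", "positive", "neutral", "positive"]
is_difficult = binary_difficulty("positive", preds)
level = graded_difficulty("positive", preds)
```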

Keywords: sentiment analysis, difficulty, classification, machine learning

Procedia PDF Downloads 55
766 Sentiment Analysis of Ensemble-Based Classifiers for E-Mail Data

Authors: Muthukumarasamy Govindarajan

Abstract:

Detection of unwanted, unsolicited mail, called spam, is an interesting area of research. It is necessary to evaluate the performance of any new spam classifier using standard data sets. Recently, ensemble-based classifiers have gained popularity in this domain. In this research work, an efficient email filtering approach based on ensemble methods is presented for developing an accurate and sensitive spam classifier. The proposed approach employs Naive Bayes (NB), Support Vector Machine (SVM) and Genetic Algorithm (GA) as base classifiers along with different ensemble methods. The experimental results show that the ensemble classifier achieved greater accuracy than the individual classifiers, and that the hybrid model results are better than those of the combined models for the e-mail dataset. The proposed ensemble-based classifiers turn out to be good in terms of classification accuracy, which is considered to be an important criterion for building a robust spam classifier.
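As a rough illustration of how several base classifiers can be combined, the sketch below applies simple majority voting to the predictions of three classifiers. This is a deliberately simplified stand-in, not the bagging or arcing schemes named in the keywords, and the prediction lists are hypothetical.

```python
from collections import Counter

def majority_vote(votes):
    """Return the label that most classifiers voted for."""
    return Counter(votes).most_common(1)[0][0]

def ensemble_predict(per_classifier_predictions):
    """Combine one prediction list per classifier into one list of ensemble labels."""
    return [majority_vote(votes) for votes in zip(*per_classifier_predictions)]

# Hypothetical outputs of three base classifiers for four e-mails.
nb_preds = ["spam", "ham", "spam", "ham"]
svm_preds = ["spam", "spam", "ham", "ham"]
ga_preds = ["ham", "spam", "spam", "ham"]

combined = ensemble_predict([nb_preds, svm_preds, ga_preds])
```

With an odd number of classifiers and two classes, ties cannot occur; other setups would need an explicit tie-breaking rule.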

Keywords: accuracy, arcing, bagging, genetic algorithm, Naive Bayes, sentiment mining, support vector machine

Procedia PDF Downloads 116
765 [Keynote Talk]: Three Key Ideas to Undergraduate Thesis Project Tutoring

Authors: M. T. Becerra-Traver, M. Montanero, R. Alejo, A. Antúnez, P. Cañamero, M. J. Fernández, M. Gómez, A. L. Medialdea, J. D. Martínez, A. M. Piquer-Píriz, M. J. Rabazo

Abstract:

The introduction of new subjects at university level, brought about with the implementation of the European Higher Education Area (EHEA), has meant changes for students and lecturers that, in the case of the latter, have also revealed a need for further training. In our context, one of the main changes has been the introduction of Undergraduate Thesis Projects (UTPs) in the degrees taught in our faculty: Pre-Primary and Primary Education. The aim of this paper is to analyze UTPs and to provide some suggestions that can help both students and lecturers in the process. UTPs complete the university training cycle of the Degree Studies and entail the elaboration of a written piece of work, supervised by a professor and presented to a panel in order to ensure that students acquire the required competences of these Degrees to develop an autonomous, responsible and comprehensive activity. In addition, UTPs develop students’ abilities for oral presentations and to defend and argue their own ideas. One of the first difficulties in the supervision of UTPs is that most of the students do not know how to write an academic text. To solve this problem, we propose a three-phase model based on planning, textualization and review. The implementation of this model has enabled us to see a successful evolution in the correct development of the academic dissertations that students submit at the end of their degrees.

Keywords: academic task, student, tutoring, university

Procedia PDF Downloads 250
764 A Multimodal Dialogue Management System for Achieving Natural Interaction with Embodied Conversational Agents

Authors: Ozge Nilay Yalcin

Abstract:

Dialogue has been proposed to be the natural basis for the human-computer interaction, which is behaviorally rich and includes different modalities such as gestures, posture changes, gaze, para-linguistic parameters and linguistic context. However, equipping the system with these capabilities might have consequences on the usability of the system. One issue is to be able to find a good balance between rich behavior and fluent behavior, as planning and generating these behaviors is computationally expensive. In this work, we propose a multi-modal dialogue management system that automates the conversational flow from text-based dialogue examples and uses synchronized verbal and non-verbal conversational cues to achieve a fluent interaction. Our system is integrated with Smartbody behavior realizer to provide real-time interaction with embodied agent. The nonverbal behaviors are used according to turn-taking behavior, emotions, and personality of the user and linguistic analysis of the dialogue. The verbal behaviors are responsive to the emotional value of the utterance and the feedback from the user. Our system is aimed for online planning of these affective multi-modal components, in order to achieve enhanced user experience with richer and more natural interaction.

Keywords: affect, embodied conversational agents, human-agent interaction, multimodal interaction, natural interfaces

Procedia PDF Downloads 155
763 Utilization of Multi-Criteria Evaluation in Forensic Engineering and the Expertise outside Wall Subsystem

Authors: Tomas Barnak, Libor Matejka

Abstract:

The aim of this study is to create a standard application of multi-criteria evaluation in the field of forensic engineering. The need for it arises in professional assessment in several cases, for example when several variants of a structural subsystem must be compared according to several criteria in response to a court claim that requires expert advice. A problematic situation arises when it is necessary to clearly determine the ranking of the options according to established criteria and to reduce subjective evaluation. For procurement in the field of construction, the prepared text of the law provides that not only economic criteria but also technical, technological and environmental criteria will be determined. This fact substantially changes the style of evaluation of individual bids. For the above-mentioned needs of procurement, the unification of experts’ decisions and the use of multi-criteria assessment seem to be a reasonable option. In the experimental verification, the economic, technical, technological and environmental criteria are compared when multi-criteria evaluation is applied to alternative construction subsystems. The core of the solution is to compare a selected number of set criteria, application methods and evaluation weightings, based on the weighted values assigned to each of the criteria, in order to use multi-criteria evaluation methods. The ranking of the individual variants is determined by evaluating the importance of the values of the corresponding criteria, drawing on expertise in the field of outside wall construction subsystems.
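The idea of ranking variants by weighted criteria can be sketched as a simple weighted sum. The abstract does not state which multi-criteria method the authors use, so the weighted-sum rule, the weights, the variant names and the scores below are all assumptions chosen for illustration.

```python
def rank_variants(scores, weights):
    """Rank variants by the weighted average of their criterion scores (highest first)."""
    total_weight = sum(weights.values())
    totals = {
        name: sum(weights[criterion] * value for criterion, value in criteria.items()) / total_weight
        for name, criteria in scores.items()
    }
    return sorted(totals.items(), key=lambda item: item[1], reverse=True)

# Hypothetical weights for the four criterion groups named in the abstract.
weights = {"economic": 0.4, "technical": 0.3, "technological": 0.2, "environmental": 0.1}

# Hypothetical normalized scores (0 = worst, 1 = best) for two outside wall variants.
scores = {
    "wall_A": {"economic": 0.8, "technical": 0.6, "technological": 0.5, "environmental": 0.9},
    "wall_B": {"economic": 0.5, "technical": 0.9, "technological": 0.7, "environmental": 0.4},
}

ranking = rank_variants(scores, weights)
```

Fixing the weights before the variants are scored is one way to reduce the subjective element that the abstract mentions.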

Keywords: criteria, expertise, multi-criteria evaluation, outside wall subsystems

Procedia PDF Downloads 302
762 Comparing and Contrasting Western and Eastern Ways of War: Building a Universal Strategic Theory

Authors: Adam Kok Wey Leong

Abstract:

The comparison between the Western ways of war and Eastern ways of war has raised contemporary debates on the validity of these arguments. The Western way of war is popularly propounded by Victor Davis Hanson as originating from the Greek hoplite tactics, direct military maneuvers, democratic principles, and social freedom and cohesion that have continued to yield military success for the Western powers for centuries. On the other hand, the Eastern way of war has been deemed as relying on indirect tactics, deception, and ruses. This often accepted notion of a divide between Western and Eastern styles does not hold up in view of the available classical strategic texts from both sides and from the same period, which propose similar principles of warfare. This paper analyses the similarities between a classical strategic text on war from the Eastern perspective, namely Sun Tzu’s Art of War, and a strategic text of a similar period from the West, Sextus Iulius Frontinus’s Stratagematon, and deduces answers to this core research question: does the hypothesis of the existence of distinctive Western and Eastern ways of warfare stand? The main thesis advanced by this research is that ways of warfare share universal principles that transcend cultural and spatial boundaries. Warfare is a human endeavour, and the same moral actions guide humans from different geo-cultural spheres in warfare’s objectives, which are winning over an enemy in the most economical way and serving as a means to an end.

Keywords: ways of warfare, strategic culture, strategy, Sun Tzu, frontinus

Procedia PDF Downloads 413
761 A Linguistic Relativity Appraisal of an African Drama: The Lion and The Jewel

Authors: T. O. Adekunle, R. L. Makhubu, C. N. Ngwane

Abstract:

This research was designed to assess the validity of the Sapir-Whorf hypothesis in relation to the linguistic and cultural notions of Yoruba and Zulu language speakers via the evaluation of the culture-enriched dramatic text The Lion and The Jewel by Wole Soyinka. The study queried both the hypothesis’ strong version (language governs thought: linguistic classifications restrain and influence mental classifications) and its weak version (linguistic classifications and their use influence thought as well as some other classes of non-linguistic activities) and their possible reliability. Participants were purposively selected, and their ages ranged from 16 to 46 years. The participants were 38 students of DUT (18 Yoruba and 20 Zulu); the Zulu participants speak both English and Zulu, and the Yoruba participants speak both English and Yoruba. A mixed methods approach was used. With the use of a questionnaire and interviews, the research questions were answered, and the findings provided support for the validity of the linguistic relativity hypothesis: languages indeed influence thought. The findings also revealed that linguistic influence on cognition is not limited to users of different languages but also applies to speakers of the same language, depending on their level of exposure to other languages and concepts.

Keywords: culture, cognition, DUT, language, linguistic relativity hypothesis, Sapir-Whorf hypothesis, The Lion and The Jewel, thought, Wole Soyinka, Yoruba, Zulu

Procedia PDF Downloads 432
760 Product Features Extraction from Opinions According to Time

Authors: Kamal Amarouche, Houda Benbrahim, Ismail Kassou

Abstract:

Nowadays, e-commerce shopping websites have experienced noticeable growth. These websites have gained consumers’ trust. After purchasing a product, many consumers share comments in which opinions about the product are usually embedded. Research on the automatic management of opinions, which gives suggestions to potential consumers and portrays an image of the product to manufacturers, has been growing recently. Just after a product is launched, the reviews generated around it do not usually contain helpful information, only generic opinions about the product (e.g. telephone: great phone...), because the product is still in its launch phase. Over time, the product ages, and consumers come to perceive the advantages and disadvantages of each specific product feature. They then generate comments that contain their sentiments about these features. In this paper, we present an unsupervised method to extract the different product features hidden in the opinions that influence a purchase. The method combines Time Weighting (TW), which depends on the time at which opinions were expressed, with Term Frequency-Inverse Document Frequency (TF-IDF). We conduct several experiments using two different datasets about cell phones and hotels. The results show the effectiveness of our automatic feature extraction, as well as its domain-independent character.
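A minimal sketch of combining a time weight with TF-IDF is given below. The abstract does not give the authors' TW formula, so the linear weight used here (days since launch divided by the largest value in the corpus) is an assumption, as are the function name and the toy reviews.

```python
import math
from collections import Counter

def time_weighted_tfidf(reviews):
    """Score terms by TF-IDF, scaling each review's contribution by a time weight.

    reviews: list of (tokens, days_since_launch) pairs.
    The linear time weight is an illustrative assumption, not the published TW.
    """
    n_docs = len(reviews)
    max_age = max(days for _, days in reviews) or 1
    doc_freq = Counter()
    for tokens, _ in reviews:
        doc_freq.update(set(tokens))  # count each term once per review
    scores = Counter()
    for tokens, days in reviews:
        time_weight = days / max_age
        for term, count in Counter(tokens).items():
            tf = count / len(tokens)
            idf = math.log(n_docs / doc_freq[term])
            scores[term] += time_weight * tf * idf
    return dict(scores)

# Hypothetical tokenized reviews with the number of days since product launch.
reviews = [
    (["great", "phone"], 1),
    (["battery", "lasts", "long"], 90),
    (["battery", "drains", "fast", "phone"], 180),
]
scores = time_weighted_tfidf(reviews)
```

In this toy corpus the feature word "battery", which appears in the later reviews, outscores the generic launch-time word "great", which mirrors the intuition described in the abstract.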

Keywords: opinion mining, product feature extraction, sentiment analysis, SentiWordNet

Procedia PDF Downloads 377
759 Artificial Reproduction System and Imbalanced Dataset: A Mendelian Classification

Authors: Anita Kushwaha

Abstract:

We propose a new evolutionary computational model called Artificial Reproduction System, which is based on the complex process of meiotic reproduction occurring between male and female cells of living organisms. Artificial Reproduction System is an attempt towards a new computational intelligence approach inspired by the theoretical reproduction mechanism and by observed reproduction functions, principles and mechanisms. A reproductive organism is programmed by genes and can be viewed as an automaton, mapping and reducing so as to create copies of those genes in its offspring. In Artificial Reproduction System, the binding mechanism between male and female cells is studied, parameters are chosen, a network is constructed, and a feedback system for self-regularization is established. The model then applies Mendel’s law of inheritance and allele-allele associations, and can be used to perform data analysis of imbalanced, multivariate, multiclass and big data. In the experimental study, Artificial Reproduction System is compared with other state-of-the-art classifiers such as SVM, Radial Basis Function, neural networks and K-Nearest Neighbor on some benchmark datasets, and the comparison results indicate a good performance.

Keywords: bio-inspired computation, nature- inspired computation, natural computing, data mining

Procedia PDF Downloads 249
758 Critical Discourse Analysis of Political TV Talk Show of Pakistani Media

Authors: Sumaira Saleem, Sajjad Hussain, Asma Kashif Shahzad, Hina Shaheen

Abstract:

This study aims at exploring the relationship between language and ideology and how such relationships are represented in the analysis of spoken texts, following Van Dijk’s Socio Cognitive Model (2002). The study attempts to show that political talk shows broadcast by private TV channels are working apparatuses of ideology and store meanings which are not always obvious to readers. The analysis concerns the situation created by Arslan Iftikhar, the son of the ex-Chief Justice of Pakistan, Iftikhar Muhammad Chaudry, and PTI Chief Imran Khan. Arslan Iftikhar submitted an application against Imran Khan arguing that he is not eligible to become a member of the parliament of Pakistan. In the application, he demanded the documents that Imran Khan had submitted to the Election Commission of Pakistan at the time of the election. Murad Ali from PTI also submitted an application against PM Nawaz Sharif to the Election Commission of Pakistan, requesting that copies be provided. The study also suggests that these talk shows mystify the agency of processes by using various strategies. In other words, critical text analyses reveal how these choices enable speakers to manipulate the realizations of agency and power in the representation of action to produce particular meanings which are not always explicit for all readers.

Keywords: ECP, CDA, socio cognitive model, ideology, TV channels, power

Procedia PDF Downloads 710
757 Feature Weighting Comparison Based on Clustering Centers in the Detection of Diabetic Retinopathy

Authors: Kemal Polat

Abstract:

In this paper, three feature weighting methods have been used to improve the classification performance for diabetic retinopathy (DR). To classify diabetic retinopathy, features extracted from the output of several retinal image processing algorithms, such as image-level, lesion-specific and anatomical components, were fed into the classifier algorithms. The dataset used in this study was taken from the University of California, Irvine (UCI) machine learning repository. The feature weighting methods, namely fuzzy c-means clustering based feature weighting, subtractive clustering based feature weighting, and Gaussian mixture clustering based feature weighting, were used and compared with each other in the classification of DR. After feature weighting, five different classifier algorithms were used: multi-layer perceptron (MLP), k-nearest neighbor (k-NN), decision tree, support vector machine (SVM), and Naïve Bayes. The hybrid method combining subtractive clustering based feature weighting with the decision tree classifier obtained a classification accuracy of 100% in the screening of DR. These results demonstrate that the proposed hybrid scheme is very promising for medical data set classification.

Keywords: machine learning, data weighting, classification, data mining

Procedia PDF Downloads 309
756 The Buddha in Sophocles’ Tragedy, King Oedipus: An Intertextual Analysis

Authors: Newton Rathnasiri Ranaweera Kalu Arachchige

Abstract:

Academics argue that Greek myths and legends have had an influence on Buddhist stories: Jataka tales, Theri Gata (Psalms of older Buddhist nuns), and even Mahavansa (a Sri Lankan historical chronicle). However, this article asserts that there is evidence in Sophocles’ King Oedipus to argue that the Buddha’s life story and key Buddhist concepts have influenced pre-Christian Greek philosophy and literature, especially Sophocles’ King Oedipus. When reading the text with the notion that there could be intertextual relationships, or that new texts are built on existing texts and discourses, the reader may see that Sophocles’ play contains incidents that recall the special occasions of the Buddha’s life, his utterances, and key Buddhist concepts such as the truth of suffering, the cessation of suffering, the three poisons (greed, hatred, and delusion), and finding the truth within one’s own self. To remain focused, the present intertextual study explored only the special occasions of the Buddha’s life and found that Sophocles alludes to the Buddha’s life story in his attempt to raise a moral culprit to a moral hero with higher moral values. This article, however, acknowledges that one needs to cross-check the other historical and philosophical references when claiming that Sophocles was influenced by the Buddha’s life story in King Oedipus.

Keywords: Buddhism, the Buddha’s life story, King Oedipus, Greece, tragedy, Sri Lanka

Procedia PDF Downloads 85
755 Contemporary Malayalam Independent Cinema: Limited Location Storytelling and Its Prominence in the Pandemic Era

Authors: Krishnanunni S.

Abstract:

The COVID-19 pandemic has had an impact on every part of our lives, and the film industry is no exception. The restrictions the pandemic brought made filmmakers confine their films to limited spaces. In India, Malayalam cinema was the first to incorporate the pandemic into its stories and started producing films within the existing constraints. The purpose of this study was to examine how the limited location storytelling concept influenced Malayalam independent and lockdown films. To answer this question, three of the most popular films that were shot during the pandemic, The Great Indian Kitchen, Joji and Joyful Mystery, were dissected through text analysis, and in-depth interviews were conducted with the makers of The Great Indian Kitchen and Joyful Mystery. The study revealed that the pandemic influenced the way filmmakers visualize their stories and shoot them, especially while working within the restrictions of the pandemic. It was also observed that working with limited locations was the only way for filmmakers to make films during the pandemic. But rather than a hindrance to their work, filmmakers saw it as a new possibility to create in times of confinement. The findings of this study expand the work of previous researchers on films shot in limited locations and on the significant changes the pandemic has brought to the film industry.

Keywords: limited location storytelling, pandemic, pandemic restrictions, lockdown cinema, pandemic films, Malayalam cinema, OTT revolution, cinema, films

Procedia PDF Downloads 63
754 An Evaluation of Edible Plants for Remediation of Contaminated Soil- Can Edible Plants Be Used to Remove Heavy Metals on Soil?

Authors: Celia Marilia Martins, Sonia I. V. Guilundo, Iris M. Victorino, Antonio O. Quilambo

Abstract:

In Mozambique, rapid industrialization (mining, aluminium and cement activities) and urbanization have led to the accumulation of heavy metals in soil, degrading not only the quality of the environment but also affecting plants, animals and human health. Several methods have been used to remediate contaminated soils, but most of them are costly and rarely achieve optimum results. Currently, phytoremediation is an effective and affordable technological solution used to extract or remove inactive metals from contaminated soil. Phytoremediation is the use of plants to clean up contamination from soils, sediments, and water. This technology is environmentally friendly and potentially cost-effective. The present investigation summarises the potential of edible vegetables to grow under high levels of heavy metals such as lead and zinc. The plants used in these studies include tomatoes, lettuce and soya beans. The studies have shown that edible plants can be grown under high levels of heavy metals in the soil. Further investigations are identifying the mechanisms used by plants, to ensure their safe and sustainable use for the remediation of soils contaminated by heavy metals.

Keywords: contaminated soil, edible plants, heavy metals, phytoremediation

Procedia PDF Downloads 347
753 The International Monetary Fund’s Treatment Towards Argentina and Brazil During Financial Negotiations for Their First Adjustment Programs, 1958-64

Authors: Fernanda Conforto de Oliveira

Abstract:

The International Monetary Fund (IMF) has a central role in global financial governance as the world’s leading crisis lender. Its practice of conditional lending – conditioning loans on the implementation of economic policy adjustments – is the primary lever by which the institution interacts with and influences the policy choices of member countries and has been a key topic of interest to scholars and public opinion. However, empirical evidence about the economic and (geo)political determinants of IMF lending behavior remains inconclusive, and no model that explains IMF policies has been identified. This research moves beyond panel analysis to focus on financial negotiations for the first IMF programs in Argentina and Brazil in the early post-war period. It seeks to understand why negotiations achieved distinct objectives: Argentinean officials cooperated and complied with IMF policies, whereas their Brazilian counterparts hesitated. Using qualitative and automated text analysis, this paper analyses the hypothesis about whether a differential IMF treatment could help to explain these distinct outcomes. This paper contributes to historical studies on IMF-Latin America relations and the broader literature in international policy economy about IMF policies.

Keywords: international monetary fund, international history, financial history, Latin American economic history, natural language processing, sentiment analysis

Procedia PDF Downloads 36
752 Statistical Comparison of Machine and Manual Translation: A Corpus-Based Study of Gone with the Wind

Authors: Yanmeng Liu

Abstract:

This article analyzes and compares the linguistic differences between machine translation and manual translation through a case study of the book Gone with the Wind. As an important carrier of human feeling and thinking, literary translation poses a huge difficulty for machine translation, and machine output is expected to show translation features distinct from those of manual translation. In order to display linguistic features objectively, computerized and statistical evidence has been applied tentatively to the systematic investigation of large-scale translation corpora using quantitative methods. This study compiles a bilingual corpus with four Chinese translations of Gone with the Wind, namely Piao by Chunhai Fan, Piao by Huairen Huang, and the translations by Google Translation and Baidu Translation. After processing the corpus with Stanford Segmenter, Stanford Postagger, AntConc, and other software, the study analyzes the linguistic data and answers the following questions: 1. How does machine translation differ from manual translation linguistically? 2. Why do these deviances happen? This paper combines translation studies with corpus linguistics and makes divergent linguistic dimensions concrete in the analysis of translated text, in order to present the linguistic deviances between manual and machine translation. Consequently, this study provides a more accurate and more fine-grained understanding of machine translation products, and it also proposes several suggestions for the future development of machine translation.

Keywords: corpus-based analysis, linguistic deviances, machine translation, statistical evidence

Procedia PDF Downloads 121
751 Domain Adaptive Dense Retrieval with Query Generation

Authors: Rui Yin, Haojie Wang, Xun Li

Abstract:

Recently, mainstream dense retrieval methods have obtained state-of-the-art results on some datasets and tasks. However, they require large amounts of training data, which is not available in most domains. The severe performance degradation of dense retrievers on new data domains has limited the use of dense retrieval methods to only a few domains with large training datasets. In this paper, we propose an unsupervised domain-adaptive approach based on query generation. First, a generative model is used to generate relevant queries for each passage in the target corpus, and then, the generated queries are used for mining negative passages. Finally, the query-passage pairs are labeled with a cross-encoder and used to train a domain-adapted dense retriever. We also explore contrastive learning as a method for training domain-adapted dense retrievers and show that it leads to strong performance in various retrieval settings. Experiments show that our approach is more robust than previous methods in target domains that require less unlabeled data.

Keywords: dense retrieval, query generation, contrastive learning, unsupervised training

Procedia PDF Downloads 75
750 Aquatic and Marshy Flora from Fresh Water Wetlands on Quartz Sands in Pinar Del Río, Cuba

Authors: Vidal Pérez Hernández, Enrique González Pendás

Abstract:

Most of the aquatic and marshy flora in Cuba is located in quartzitic sand ecosystems, which are represented by a wide variety of freshwater wetlands spread across the whole southern and south-western plain of Pinar del Río. The survey carried out in these ecosystems offers an updated inventory of these species, showing their biological type, habit, distribution, and the degree of threat to which they are subjected, taking into account the categories assigned by the IUCN. A remarkable decrease in the total number of these species in this area is evident, due to deposition processes and deforestation caused by human activity and climate change. This decrease is linked to other threats, such as the unlimited use of the water reserves for irrigating groves, cattle raising, and intensive fishing. In addition, the sand, which is 99% pure crystalline quartz, is used for mining. The combination of all these factors has a negative influence on a flora that comprises more than 250 species, most of them herbaceous and hydrophytic. Endemism in these particular ecosystems reaches 40% of the total flora, and more than 80% of the species are placed in the most sensitive threat categories; some of them have already been declared extinct.

Keywords: aquatic flora, marshy flora, quartzitic sands, wetlands

Procedia PDF Downloads 201
749 Examining the Function of Containers and Determining Lexical Indices for the Shapes of Pottery and the Poems Written on Them from the End of the 3rd Century to the End of the 8th Century

Authors: Mohadese Sookhtesaraii, Abed Taghavi, Kosar Sookhtesaraii

Abstract:

Pottery has always attracted human attention because of its practical functions. With the passage of time, human development, and the progress of writing, people began to write on pottery vessels. Beyond their practical use, important issues concerning these vessels are their names and the relationship between their names and their functions. The names differ according to the appearance of the vessels and the way they were used, so the vessels can be classified by the dictionary meanings of these words. Poetic works contain many names of these vessels, which indicate their importance and their use. The vessels examined here were selected because their names occur most frequently in poems and written works. For a more precise analysis of pottery forms, the study relies on the meanings given in dictionaries and on the names that appear in the works of poets and writers. In addition, the vessels carry more poetry than prose text, so their aesthetic aspect can be studied separately from their meaning. Vessel names such as Chamaneh and Satgini are clearly defined in dictionaries as drinking vessels, while Khonb was used for storage. The use of a vessel can therefore serve as a basis for classification. Size and capacity also account for differences in naming: Khom and Khonb, for example, have the same form but differ in capacity and size. The meanings of the texts written on these vessels were also studied. In addition to prayer phrases, they express love, invite the reader to drink and enjoy, or reflect on the shortness of human life.

Keywords: pialeh, sajegni, khomre, pottery

Procedia PDF Downloads 47
748 Synchronous Versus Asynchronous Telecollaboration in Intercultural Communication

Authors: Vita Kalnberzina, Lauren Miller Anderson

Abstract:

The aim of the paper is to report on the results of the telecollaboration project carried out between the students of the University of Latvia, National Louis University in the US, and Austral University in Chile during the Intercultural Communication course. The objectives of the study are 1) to compare different forms of student telecollaboration and virtual exchange, 2) to collect and analyse the student feedback on the telecollaboration project, and 3) to evaluate the products (films) produced during the telecollaboration project. The research methods used are a survey of student feedback after the project, video text analysis of the films produced by the students, and interviews with the students participating in the project. We compare the results of a three-year collaboration project in which we tried out synchronous telecollaboration and asynchronous collaboration. The variables observed were the impact of the different time zones, the different language proficiency levels of the students, and the different curricula developed for collaboration. The main findings suggest that the effort spent by students to organize meetings in different time zones and to get to know each other diminishes the quality of the product developed and thus reduces the students' feeling of accomplishment. Therefore, we propose that asynchronous collaboration, where the national teams work on a film project specifically developed by the students of one university for the students of another university, ends up with a better-quality film, which in turn appeals more to the students of the other university and creates a deeper intercultural bond between the collaborating students.

Keywords: telecollaboration, intercultural communication, synchronous collaboration, asynchronous collaboration

Procedia PDF Downloads 77
747 Anthropocentric and Ecocentric Representation of Human-Environment Relationship in Paulo Coelho's The Alchemist

Authors: Tooba Sabir, Namra Sabir, Mohammad Amjad Sabir

Abstract:

The human-environment relationship has been depicted since the beginning of the literary tradition, i.e., the pastoral tradition; however, the interest of critics, writers, and poets in this theme has grown over the last couple of decades because of the increasing scope of environmental studies and growing environmental issues. One novel that depicts the human-environment relationship is ‘The Alchemist,’ one of Paulo Coelho’s most widely read novels. Its central theme is that the universe conspires to help a person achieve his destiny, which projects anthropocentrism and human domination by centralizing the human and devaluing the intrinsic worth of the ecosystem. However, an ecocritical analysis of the text reveals that the novel also contains ecocentrism in several instances, e.g., ‘everything on earth is being continuously transformed because earth is alive.’ This portrays the ecosphere as a living and dynamic entity rather than a mere instrument for humans to achieve their destiny. The idea that the universe shares the same language projects the unity of nature, showing the human and non-human aspects of the environment as one being, neither separate from nor superior to one another. It depicts the human as a part of the environment and not the lord of the world. Therefore, it can be concluded that the novel oscillates between the ecocentric and the anthropocentric phenomena. It is not suggested, however, that one phenomenon should be valued over the other, but that the complexities of both should be recognized, acknowledged, and valued in order to encourage interaction between literature and environment.

Keywords: anthropocentrism, ecocentrism, ecocritical analysis, human-environment relationship

Procedia PDF Downloads 288
746 Clinical Course and Prognosis of Cutaneous Manifestations of COVID-19: A Systematic Review of Reported Cases

Authors: Hilary Modir, Kyle Dutton, Michelle Swab, Shabnam Asghari

Abstract:

Since the emergence of COVID-19, its cutaneous manifestations have been documented in the literature. However, the majority of publications are case reports with significant limitations in appraisal quality, leaving the role of dermatological manifestations of COVID-19 underexplored. The primary aim of this review was to systematically examine clinical patterns of dermatological manifestations as reported in the literature. This study was designed as a systematic review of case reports. The inclusion criteria consisted of all published reports and articles regarding COVID-19 in English, from September 1st, 2019, until June 22nd, 2020. The population consisted of confirmed cases of COVID-19 with associated cutaneous signs and symptoms. Exclusion criteria included research in planning stages, protocols, book reviews, news articles, review studies, and policy analyses. With the collaboration of a librarian, a search strategy was created consisting of a mixture of keyword terms and controlled vocabulary. The electronic databases searched were MEDLINE via PubMed, EMBASE, CINAHL, Web of Science, LILACS, PsycINFO, WHO Global Literature on Coronavirus Disease, Cochrane Library, Campbell Collaboration, PROSPERO, WHO International Clinical Trials Registry Platform, Australian and New Zealand Clinical Trials Registry, U.S. Institutes of Health Ongoing Trials Register, AAD Registry, OSF Preprints, SSRN, medRxiv, and bioRxiv. The study selection featured an initial pre-screening of titles and abstracts by one independent reviewer. Results were verified by re-examining a random sample of 1% of excluded articles. Eligible studies progressed to full-text review by two calibrated independent reviewers. Covidence was used to store and extract data, such as citation information and findings pertaining to COVID-19 and cutaneous signs and symptoms.
Data analysis and summarization methodology reflect the framework proposed by PRISMA and the recommendations set out by Cochrane and the Joanna Briggs Institute for conducting systematic reviews. The Oxford Centre for Evidence-Based Medicine’s levels of evidence were used to appraise the quality of individual studies. The literature search revealed a total of 1221 articles. After abstract and full-text screening, only 95 studies met the eligibility criteria and proceeded to data extraction. Of these studies, 58% were case reports and 42% were case series. A total of 833 manifestations were reported in 723 confirmed COVID-19 cases. The most frequent lesions were maculopapular (23%), urticarial (15%), and pseudo-chilblains (13%); 46% of lesions were accompanied by pruritus, 16% by erythema, 14% by pain, 12% by a burning sensation, and 4% by edema. The most common lesion locations were the trunk (20%), lower limbs (19.5%), and upper limbs (17.7%). The time to resolution of lesions was between one and twenty-one days. In conclusion, over half of the reported cutaneous presentations in COVID-19-positive patients were maculopapular, urticarial, or pseudo-chilblains, with the majority of lesions distributed on the extremities and trunk. Because this review’s sample contained only confirmed COVID-19 cases with skin presentations, it is difficult to deduce a direct relationship between skin findings and COVID-19. However, the findings suggest that acute onset of skin lesions, such as chilblain-like lesions, may be associated with COVID-19 or may warrant its consideration as part of the differential diagnosis.
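The summary figures quoted in this abstract can be cross-checked with simple arithmetic. The short sketch below uses only the numbers stated above; the variable names are illustrative.

```python
# Shares of the three most frequent lesion types, in percent (from the abstract).
lesion_share = {"maculopapular": 23, "urticarial": 15, "pseudo-chilblains": 13}

# Combined share of the three types; "over half" requires this to exceed 50.
top_three_share = sum(lesion_share.values())

# Average number of reported manifestations per confirmed case.
manifestations, cases = 833, 723
manifestations_per_case = manifestations / cases

# Percentage of retrieved articles that met the eligibility criteria.
retrieved, included = 1221, 95
inclusion_rate = 100 * included / retrieved
```

The three leading lesion types together account for 51% of the reported lesions, which is consistent with the statement that they make up over half of the presentations.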

Keywords: COVID-19, cutaneous manifestations, cutaneous signs, general dermatology, medical dermatology, Sars-Cov-2, skin and infectious disease, skin findings, skin manifestations

Procedia PDF Downloads 165
745 Quantifying User-Related, System-Related, and Context-Related Patterns of Smartphone Use

Authors: Andrew T. Hendrickson, Liven De Marez, Marijn Martens, Gytha Muller, Tudor Paisa, Koen Ponnet, Catherine Schweizer, Megan Van Meer, Mariek Vanden Abeele

Abstract:

Quantifying and understanding the myriad ways people use their phones and how that impacts their relationships, cognitive abilities, mental health, and well-being is increasingly important in our phone-centric society. However, most studies on the patterns of phone use have focused on theory-driven tests of specific usage hypotheses using self-report questionnaires or analyses of smaller datasets. In this work we present a series of analyses from a large corpus of over 3000 users that combine data-driven and theory-driven analyses to identify reliable smartphone usage patterns and clusters of similar users. Furthermore, we compare the stability of user clusters across user- and system-initiated sessions, as well as during the hypothesized ritualized behavior times directly before and after sleeping. Our results indicate support for some hypothesized usage patterns but present a more complete and nuanced view of how people use smartphones.

Keywords: data mining, experience sampling, smartphone usage, health and well being

Procedia PDF Downloads 141
744 Types of Taboo Expressions in Igbo Society

Authors: Christian Nwaoha

Abstract:

This study investigates taboo expressions and their classification in Igbo discourse, together with the socio-cultural factors affecting their usage. The study classifies linguistic taboo expressions by their discourse into five categories: morality-related, veneration-related, decorum-related, religion-related, and fear-related taboo expressions. It argues that religion-related and decorum-related taboos are unmentionable and have no euphemistic synonyms because they are closely tied to various Igbo deities and objects, whereas morality-, veneration-, and fear-related taboos have permissible alternatives. A descriptive research design was adopted, and data were collected by questionnaire and oral interview. The results show that, aside from these categories of taboo, the styles of discourse in Igbo society carry gender-, age-, and class-connected taboos. In gender-connected taboos, for instance, Igbo women are forbidden to use styles of discourse connected with genital organs in social gatherings comprising men and women. The same applies to class-connected taboos: much younger men can use certain taboo expressions among themselves, but in a gathering of much older men such expressions would be considered forbidden. The study further reveals that there are occasions on which these taboos can be used with good reason. The research concludes that using these taboos in literary texts can give users and learners of the Igbo language a clearer understanding of Igbo taboos.

Keywords: taboo expressions, classifications, Igbo, socio-cultural factors, discourse

Procedia PDF Downloads 213
743 Internet of Things, Edge and Cloud Computing in Rock Mechanical Investigation for Underground Surveys

Authors: Esmael Makarian, Ayub Elyasi, Fatemeh Saberi, Olusegun Stanley Tomomewo

Abstract:

Rock mechanical investigation is one of the most crucial activities in underground operations, especially in surveys related to hydrocarbon exploration and production, geothermal reservoirs, energy storage, mining, and geotechnics. There is a wide range of traditional methods for deriving, collecting, and analyzing rock mechanics data. However, these approaches may not be suitable or work well in some situations, such as fractured zones. Cutting-edge technologies have been introduced to address these issues. Internet of Things (IoT), Edge Computing, and Cloud Computing technologies (ECt and CCt, respectively) are among the most widely used new artificial intelligence methods employed for geomechanical studies. IoT devices act as sensors and cameras for real-time monitoring and for collecting mechanical-geological data on rocks, such as temperature, movement, pressure, or stress levels. Other benefits of IoT technologies include the assessment of structural integrity, especially for cap rocks within hydrocarbon systems, and of rock mass behavior, in support of activities such as enhanced oil recovery (EOR) and underground gas storage (UGS), as well as improved safety risk management (SRM) and potential hazards identification (PHI). ECt can process, aggregate, and analyze data collected by IoT devices immediately and in real time, providing detailed insights into the behavior of rocks in various situations (e.g., stress, temperature, and pressure), establishing patterns quickly, and detecting trends. This technology can therefore support autonomous systems in rock mechanical surveys, such as drilling and production (in hydrocarbon wells) or excavation (in the mining and geotechnics industries). In addition, ECt allows all rock-related operations to be controlled remotely and enables operators to apply changes or make adjustments, a feature that is very important for environmental goals.
More often than not, rock mechanical studies draw on different kinds of data, such as laboratory tests, field operations, and indirect information like seismic or well-logging data. CCt provides a useful platform for storing and managing large volumes of heterogeneous information, which can be very useful in fractured zones. Additionally, CCt supplies powerful tools for predicting, modeling, and simulating rock mechanical information, especially in fractured zones within vast areas. It is also a suitable means of sharing extensive information on rock mechanics, such as the direction and size of fractures in a large oil field or mine. The findings of this comprehensive review demonstrate that digital transformation through integrated IoT, Edge, and Cloud solutions is revolutionizing traditional rock mechanical investigation. These advanced technologies have enabled real-time monitoring, predictive analysis, and data-driven decision-making, culminating in noteworthy improvements in safety, efficiency, and sustainability. By employing IoT, CCt, and ECt, underground operations have experienced a significant boost, allowing for timely and informed actions based on real-time data insights. The successful implementation of IoT, CCt, and ECt has led to safer operations, optimized processes, and environmentally conscious approaches in underground geological endeavors.

Keywords: rock mechanical studies, internet of things, edge computing, cloud computing, underground surveys, geological operations

Procedia PDF Downloads 38
742 Cost Sensitive Feature Selection in Decision-Theoretic Rough Set Models for Customer Churn Prediction: The Case of Telecommunication Sector Customers

Authors: Emel Kızılkaya Aydogan, Mihrimah Ozmen, Yılmaz Delice

Abstract:

The telecommunications sector in the global market has recently been undergoing continuous change and development. In this sector, churn analysis techniques are commonly used to analyse why some customers terminate their service subscriptions prematurely. Customer churn is of the utmost significance in this sector because it causes substantial business losses. Many companies conduct research in order to prevent losses while increasing customer loyalty. Although a large quantity of accumulated data is available in this sector, its usefulness is limited by data quality and relevance. In this paper, a cost-sensitive feature selection framework is developed to obtain the feature reducts used to predict customer churn. The framework is a cost-based, optional pre-processing stage that removes redundant features for churn management. In addition, this cost-based feature selection algorithm is applied in a telecommunication company in Turkey, and the results obtained with the algorithm are reported.

Keywords: churn prediction, data mining, decision-theoretic rough set, feature selection

Procedia PDF Downloads 425
741 A Method for the Extraction of the Character's Tendency from Korean Novels

Authors: Min-Ha Hong, Kee-Won Kim, Seung-Hoon Kim

Abstract:

Characters are among the core elements for understanding story-based content such as novels and movies. In particular, a character’s tendency is an important factor in analyzing story-based content because it has a significant influence on the storyline. If readers know the tendencies of the characters before reading a novel, it helps them understand the structure of conflicts, the episodes, and the relationships between characters in the novel. It may therefore help readers select the novel they want to read. In this paper, we propose a method of extracting the tendencies of characters from a novel written in Korean. In advance, we build a dictionary of paired emotion words in Korean and English, since the emotion words in the novel’s sentences express the characters’ feelings. We rate the degree of polarity (positive or negative) of the words in our emotion word dictionary based on SenticNet. Then we extract characters and emotion words from the sentences in a novel. Since the polarity of a word grows stronger or weaker due to sentence features such as quotations and modifiers, our proposed method considers these features when calculating the polarity of characters. The extracted character polarity information can be used in book search or book recommendation services.
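The polarity calculation described in this abstract can be sketched roughly as follows. This is a minimal illustration under stated assumptions: the lexicon scores, the modifier weights, the quotation weight, and the averaging scheme are all hypothetical and are not taken from SenticNet or from the authors' method, and English glosses stand in for the Korean emotion words.

```python
from collections import defaultdict

# Hypothetical polarity lexicon in [-1, 1]; illustrative values, not SenticNet scores.
LEXICON = {"happy": 0.8, "angry": -0.7, "sad": -0.6}
# Hypothetical modifier weights that strengthen, weaken, or flip the next emotion word.
MODIFIERS = {"very": 1.5, "slightly": 0.5, "not": -1.0}
# Assumed extra weight for emotion words inside a character's quotation.
QUOTE_WEIGHT = 1.2


def sentence_polarity(tokens, in_quotation):
    """Sum modifier-weighted polarities of the emotion words in one sentence."""
    score = 0.0
    weight = 1.0
    for tok in tokens:
        if tok in MODIFIERS:
            weight *= MODIFIERS[tok]  # accumulate modifiers preceding an emotion word
        elif tok in LEXICON:
            score += weight * LEXICON[tok]
            weight = 1.0  # modifiers apply only to the next emotion word
    return score * (QUOTE_WEIGHT if in_quotation else 1.0)


def character_polarity(sentences):
    """Average sentence polarity per character.

    sentences: iterable of (character, tokens, in_quotation) triples.
    """
    totals = defaultdict(float)
    counts = defaultdict(int)
    for character, tokens, in_quotation in sentences:
        totals[character] += sentence_polarity(tokens, in_quotation)
        counts[character] += 1
    return {c: totals[c] / counts[c] for c in totals}


# Toy input: character "A" has two narrated sentences, "B" one quoted sentence.
result = character_polarity([
    ("A", ["very", "happy"], False),
    ("A", ["sad"], False),
    ("B", ["not", "happy"], True),
])
```

A positive average suggests a character whose associated emotion words are predominantly positive; a book search or recommendation service could index characters by this value.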

Keywords: character tendency, data mining, emotion word, Korean novel

Procedia PDF Downloads 319