Search results for: document classifier
Commenced in January 2007
Frequency: Monthly
Edition: International
Paper Count: 1086

606 Prediction of Music Track Popularity: A Machine Learning Approach

Authors: Syed Atif Hassan, Luv Mehta, Syed Asif Hassan

Abstract:

Hit song science is a field of investigation in which machine learning techniques are applied to music tracks to extract features from audio signals that could explain the popularity of the respective tracks. Record companies invest huge amounts of money in recruiting fresh talent and churning out new music each year. Insight into why a song becomes popular would bring tremendous benefits to the music industry. This paper aims to extract basic musical features and more advanced acoustic features from songs while also taking into account external factors that play a role in making a particular song popular. We use a dataset derived from popular Spotify playlists divided by genre. We use ten genres (blues, classical, country, disco, hip-hop, jazz, metal, pop, reggae, rock), chosen on the basis of clear to ambiguous delineation in the typical sound of their genres. We feed these features into three different classifiers, namely an SVM with an RBF kernel, a deep neural network, and a recurrent neural network, to build separate predictive models, and choose the best-performing model at the end. Predicting song popularity is particularly important for the music industry, as it would allow record companies to produce better content for the masses, resulting in a more competitive market.

Keywords: classifier, machine learning, music tracks, popularity, prediction

Procedia PDF Downloads 631
605 Teaching Italian Sign Language in Higher Education

Authors: Maria Tagarelli De Monte

Abstract:

Since the formal recognition of Italian Sign Language (LIS) in 2021, LIS and interpreters’ education have become a topic for higher education in Italian universities. In April 2022, Italian universities were invited to present proposals for sign language courses to train interpreters in both LIS and tactile LIS. As a result, a few universities have presented a three-year course leading candidate students from the introductory level to interpreter level. In this context, there is an open debate not only on whether three years are enough to prepare skillful interpreters but also on the need to refer to international standards when defining the training path to follow. Among these are the Common European Framework of Reference (CEFR) for languages and the Dublin descriptors. This contribution will discuss the potential and the challenges of LIS training in academic settings by comparing traditional studies to the requests coming from universities. Particular attention will be given to the use of the CEFR as a reference document for the Italian Sign Language Curriculum. Its use has given me the chance to reflect on how LIS can be taught in higher education and on the adaptations needed to respect both the visual-gestural nature of sign language and the formal requirements of academic settings.

Keywords: Italian sign language, higher education, sign language curriculum, interpreters education, CEFR

Procedia PDF Downloads 27
604 From Electroencephalogram to Epileptic Seizures Detection by Using Artificial Neural Networks

Authors: Gaetano Zazzaro, Angelo Martone, Roberto V. Montaquila, Luigi Pavone

Abstract:

Seizures are the main factor affecting the quality of life of epileptic patients. The diagnosis of epilepsy, and hence the identification of the epileptogenic zone, is commonly made by continuous Electroencephalogram (EEG) signal monitoring. Seizure identification on EEG signals is performed manually by epileptologists, and this process is usually very long and error-prone. The aim of this paper is to describe an automated method able to detect seizures in EEG signals, using the knowledge discovery in databases process and data mining methods and algorithms, which can support physicians during the seizure detection process. Our detection method is based on an Artificial Neural Network classifier, trained by applying the multilayer perceptron algorithm, and on a software application called Training Builder, which has been developed for the massive extraction of features from EEG signals. This tool covers all the data preparation steps, ranging from signal processing to data analysis techniques, including the sliding window paradigm, dimensionality reduction algorithms, information theory, and feature selection measures. The final model shows excellent performance, reaching an accuracy of over 99% in tests on data from a single patient retrieved from a publicly available EEG dataset.

Keywords: artificial neural network, data mining, electroencephalogram, epilepsy, feature extraction, seizure detection, signal processing

Procedia PDF Downloads 169
603 Nyiragongo: An Active Volcano at Risk of Eruption without Precursor Signs

Authors: Emmanuel Havugimana

Abstract:

If there is a natural phenomenon that could endanger the lives of countless people in Central Africa, it is the possible eruption of the Nyiragongo volcano. It rises 3,470 m above sea level and has a summit formed by a crater 1.2 km in diameter. Its composite cone is made up of many layers of lava and tephra, and it lies in the Great Rift Valley in the Democratic Republic of Congo. It is located in the region of the volcanic mountains near the city of Goma in Congo and near the city of Gisenyi in Rwanda. Nyiragongo represents an imminent danger because its magma has a very low silica content and is thus quite fluid. Its slopes are also steep and slippery, which allows the lava to flow up to 100 km. Its most recent eruptions took place in May 2002 and May 2021, and they were faster than before. The volcano remains active today. All these factors make it one of the most dangerous volcanoes in the world. On top of that, no one knows when the next eruption will take place, especially since it can occur without any warning signs. Unfortunately, volcanological monitoring services in Congo are non-existent, and that is why this document concludes that Nyiragongo could, if nothing is done in this regard, ravage the two neighboring towns: Goma in Congo and Gisenyi in Rwanda. It also proposes solutions that may help prevent the expected dangers in this context.

Keywords: Nyiragongo, volcanic eruption, precursor signs, active volcano

Procedia PDF Downloads 76
602 Status of Herpetofauna of Trans-Himalayan Region of Ladakh, India

Authors: Dimpi A. Patel, Pankaj Raina, Ramesh Chinnasamy, Sunetro Ghosal

Abstract:

The herpetological fauna of Ladakh was surveyed only a few times up to 1999. In 2019, a rapid survey was undertaken to document the current herpetofaunal composition, in which a total of six species belonging to two orders and five families were recorded along with their altitudinal ranges. We present a revised checklist of reptiles found in the trans-Himalayan region of Ladakh based on historical records and recent field surveys. Records of species erroneously reported in the literature are discussed and recommended for removal from the list for this region. For several species, new elevation range records have been documented. This paper contributes to the present understanding of the richness of reptiles and amphibians in the region by documenting the composition and ecological distribution of the herpetofauna of unstudied sites. Species-specific temperature and humidity regimes were also recorded during the survey periods. Our study creates baseline information for future ecological and behavioral studies on the herpetofauna of the region by providing habitat preferences and distribution in detail.

Keywords: amphibians, distribution, diversity, reptiles, trans-Himalaya

Procedia PDF Downloads 142
601 Audit Committee Financial Expertise and Financial Reporting Timeliness in Emerging Market: The Role of Audit Committee Chair

Authors: Saeed Rabea Baatwah, Zalailah Salleh, Norsiah Ahmad

Abstract:

This study examines whether an audit committee chair with financial expertise enhances the audit committee's role in financial reporting quality in an emerging market. We investigate this influence by examining the direct effect and the moderating effect of an audit committee chair with financial expertise on financial reporting timeliness. Using Omani data and the panel data method with two proxies for financial reporting timeliness, we find that an audit committee chair with financial expertise enhances the timeliness of financial reporting by making the disclosure of annual reports timely. Further, we report evidence showing that both accounting and non-accounting financial expertise on the audit committee have a positive and significant influence on the timeliness of financial reporting. We also document that the association between financial expertise and the timeliness of financial reporting is more pronounced when the chair of the audit committee has financial expertise. This study is among the first to show comprehensively that an audit committee chair with financial expertise contributes to the quality of financial reporting in an emerging market.

Keywords: audit committee, chair with financial expertise, timeliness of financial reporting, Oman

Procedia PDF Downloads 252
600 Performance Assessment of Multi-Level Ensemble for Multi-Class Problems

Authors: Rodolfo Lorbieski, Silvia Modesto Nassar

Abstract:

Many supervised machine learning tasks require decision making across numerous different classes. Multi-class classification has several applications, such as face recognition, text recognition, and medical diagnostics. The objective of this article is to analyze an adapted Stacking method for multi-class problems, which combines ensembles within the ensemble itself. For this purpose, a training procedure similar to Stacking was used, but with three levels, where the final decision-maker (level 2) is trained by combining outputs from the tree-based pair of meta-classifiers (level 1) from Bayesian families. These are in turn trained by pairs of base classifiers (level 0) of the same family. This strategy seeks to promote diversity among the ensembles forming the level 2 meta-classifier. Three performance measures were used: (1) accuracy, (2) area under the ROC curve, and (3) time, for three factors: (a) datasets, (b) experiments, and (c) levels. To compare the factors, a three-way ANOVA test was run for each performance measure, considering 5 datasets by 25 experiments by 3 levels. A triple interaction between factors was observed only for time. Accuracy and area under the ROC curve presented similar results, showing a double interaction between level and experiment, as well as for the dataset factor. It was concluded that level 2 had an average performance above the other levels and that the proposed method is especially efficient for multi-class problems when compared to binary problems.
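
The Stacking idea described in this abstract can be illustrated with a minimal sketch. It is not the authors' three-level method: it uses only two levels, hypothetical nearest-centroid base learners, a simple lookup-table meta-learner, and toy data, and it trains the meta-learner on in-sample base predictions rather than on held-out folds. All function names and values are assumptions made for illustration.

```python
from collections import Counter, defaultdict

def train_centroid(X, y, cols):
    """Level-0 learner: nearest-centroid classifier that sees only the columns in `cols`."""
    sums, counts = {}, Counter(y)
    for row, label in zip(X, y):
        acc = sums.setdefault(label, [0.0] * len(cols))
        for i, c in enumerate(cols):
            acc[i] += row[c]
    centroids = {lab: [s / counts[lab] for s in acc] for lab, acc in sums.items()}

    def predict(row):
        # Pick the class whose centroid is closest on the selected columns.
        return min(
            centroids,
            key=lambda lab: sum((row[c] - m) ** 2 for c, m in zip(cols, centroids[lab])),
        )
    return predict

def train_stack(X, y, column_sets):
    """Train the base learners, then a meta-learner on the tuple of their outputs."""
    base = [train_centroid(X, y, cols) for cols in column_sets]
    table = defaultdict(Counter)  # tuple of base outputs -> counts of true labels
    for row, label in zip(X, y):
        table[tuple(p(row) for p in base)][label] += 1
    fallback = Counter(y).most_common(1)[0][0]

    def predict(row):
        key = tuple(p(row) for p in base)
        return table[key].most_common(1)[0][0] if key in table else fallback
    return predict

# Toy three-class data: each base learner sees one feature and cannot separate all classes.
X = [[0, 0], [0, 1], [1, 0], [5, 5], [5, 6], [6, 5], [0, 5], [1, 5], [0, 6]]
y = ["a", "a", "a", "b", "b", "b", "c", "c", "c"]
stack = train_stack(X, y, column_sets=[[0], [1]])
print(stack([0.5, 5.5]), stack([5.5, 5.5]), stack([0.2, 0.3]))
```

Neither single-feature base learner can recognise class "c" on its own, but the meta-learner picks it up from the combination of their outputs, which is the kind of gain a higher level is meant to provide.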

Keywords: stacking, multi-layers, ensemble, multi-class

Procedia PDF Downloads 253
599 Science Communication: A Possible Dialogue between Researchers and Agribusiness Farmers

Authors: Cristiane Hengler Corrêa Bernardo

Abstract:

Communication is an essential part of the process that characterizes scientific research. It should be present at every stage of research in a systemic way. However, this process is not always efficient and effective. Reports from researchers focused on agribusiness point to difficulties in communicating with farmers that negatively affect research results and may cause distortions and even quite significant inconsistencies. This research aims to identify the main sources of noise and the main barriers in communication between agribusiness researchers and farmers. It discusses the possibility of creating a specific strategy to correct or minimize such failures. The main research question is: which features of the communication process are decisive for communication between agribusiness researchers and farmers to occur with greater efficiency? It is expected that the research will result in processes that may correct or minimize such problems, promoting more efficient knowledge dialogues. The research will adopt a qualitative approach, using action research as a form of investigative action of a social and educational nature, aiming to promote understanding and interaction between researchers and members of the investigated situations. Document analysis, questionnaires, interviews, and content analysis will be used to collect and analyze data.

Keywords: agribusiness farmers, researchers, science communication, analysis

Procedia PDF Downloads 262
598 Pre-Operative Tool for Facial-Post-Surgical Estimation and Detection

Authors: Ayat E. Ali, Christeen R. Aziz, Merna A. Helmy, Mohammed M. Malek, Sherif H. El-Gohary

Abstract:

Goal: The purpose of the project was to predict the outcome of plastic surgery from patients' pre-operative images and to show this prediction on a screen so that the current appearance can be compared with the appearance after surgery. Methods: To this aim, we implemented software that uses data from the internet on facial skin diseases, skin burns, and pre- and post-operative images of plastic surgeries; the post-surgical prediction is then made using K-nearest neighbor (KNN). We designed and fabricated a smart mirror divided into two parts, a screen and a reflective mirror, so that the patient's pre- and post-operative appearance can be shown at the same time. Results: We worked on some skin conditions, namely vitiligo, skin burns, and wrinkles. We classified the three degrees of burns using a KNN classifier with an accuracy of 60%. We also succeeded in segmenting the area of vitiligo. Our future work will include working on more skin diseases, classifying them, and giving a prediction of the look after surgery. We will also go deeper into facial deformities and plastic surgeries such as nose reshaping and face slimming. Conclusion: Our project will give a prediction that relates strongly to the real look after surgery and will reduce differences in diagnosis among doctors. Significance: The mirror may have broad societal appeal, as it will narrow the gap between patient satisfaction and medical standards.
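
As a rough illustration of the KNN classification step mentioned in this abstract, the sketch below classifies a feature vector by majority vote among its nearest training examples. The two-dimensional features, their values, and the function name are hypothetical; the abstract does not say which image features were used.

```python
import math
from collections import Counter

def knn_predict(train, query, k=3):
    """Return the majority label among the k training points closest to `query`."""
    neighbours = sorted(train, key=lambda item: math.dist(item[0], query))[:k]
    votes = Counter(label for _, label in neighbours)
    return votes.most_common(1)[0][0]

# Hypothetical (feature vector, burn degree) pairs; the values are invented.
train = [
    ((0.10, 0.20), "first"), ((0.20, 0.10), "first"), ((0.15, 0.25), "first"),
    ((0.50, 0.50), "second"), ((0.55, 0.45), "second"), ((0.60, 0.50), "second"),
    ((0.90, 0.80), "third"), ((0.85, 0.90), "third"), ((0.95, 0.85), "third"),
]
print(knn_predict(train, (0.52, 0.48)))
```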

Keywords: k-nearest neighbor (knn), face detection, vitiligo, bone deformity

Procedia PDF Downloads 143
597 Logistic Regression Based Model for Predicting Students’ Academic Performance in Higher Institutions

Authors: Emmanuel Osaze Oshoiribhor, Adetokunbo MacGregor John-Otumu

Abstract:

In recent years, there has been a desire to forecast student academic achievement prior to graduation. This is to help students improve their grades, particularly those with poor performance. The goal of this study is to employ supervised learning techniques to construct a predictive model of student academic achievement. Many academics have already constructed models that predict student academic achievement based on factors such as smoking, demography, culture, social media, parents' educational background, parents' finances, and family background, to name a few. These features and the models employed may not have correctly classified students in terms of their academic performance. The model presented here is built using a logistic regression classifier with basic features, namely the previous semester's course score, class attendance, class participation, and the total number of course materials or resources the student is able to cover per semester, as prerequisites for predicting whether the student will perform well in future related courses. The model outperformed other classifiers such as Naive Bayes, Support Vector Machine (SVM), Decision Tree, Random Forest, and AdaBoost, returning a 96.7% accuracy. The model is available as a desktop application, allowing both instructors and students to benefit from user-friendly interfaces for predicting student academic achievement. As a result, it is recommended that both students and professors use this tool to better forecast outcomes.
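
A logistic regression classifier over the four features listed in this abstract could look roughly like the following sketch. It is a plain batch gradient-descent implementation, not the authors' model; the feature scaling to [0, 1], the learning rate, the number of epochs, and all data values are assumptions made for illustration.

```python
import math

def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))

def predict_proba(w, b, row):
    """Estimated probability that the student performs well."""
    return sigmoid(b + sum(wi * xi for wi, xi in zip(w, row)))

def train_logreg(X, y, lr=0.5, epochs=2000):
    """Fit weights and bias by batch gradient descent on the log-loss."""
    w, b, n = [0.0] * len(X[0]), 0.0, len(X)
    for _ in range(epochs):
        grad_w, grad_b = [0.0] * len(w), 0.0
        for row, target in zip(X, y):
            err = predict_proba(w, b, row) - target
            for j, xj in enumerate(row):
                grad_w[j] += err * xj
            grad_b += err
        w = [wi - lr * g / n for wi, g in zip(w, grad_w)]
        b -= lr * grad_b / n
    return w, b

# Hypothetical rows: [previous score, attendance, participation, materials covered],
# each scaled to [0, 1]; label 1 means the student later performed well.
X = [
    [0.90, 0.95, 0.80, 0.90], [0.80, 0.90, 0.70, 0.80], [0.85, 0.80, 0.90, 0.70],
    [0.40, 0.50, 0.30, 0.20], [0.30, 0.40, 0.20, 0.30], [0.50, 0.30, 0.40, 0.10],
]
y = [1, 1, 1, 0, 0, 0]
w, b = train_logreg(X, y)
print(round(predict_proba(w, b, [0.9, 0.9, 0.9, 0.9]), 3))
```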

Keywords: artificial intelligence, ML, logistic regression, performance, prediction

Procedia PDF Downloads 80
596 Fail Analysis of the Filter in a Land Dam

Authors: Guillermo Cardoso-Landa, Ana Julita Cuenca-Castro

Abstract:

This paper investigates the possible causes of the curtain failure of the "El Batan" dam in Querétaro, Mexico. The design fineness of the filter employed during the construction of the curtain was also verified, since the proper functioning of the filter depends greatly on it. To carry out the required analysis, it was necessary to gather documentation that provided an understanding of the composition and behavior of the land curtain and of the main types of failure in such curtains. The general characteristics of the "El Batan" dam curtain, the composition of the filter, and the possible causes that resulted in the failure were also analyzed. Once the initial data had been obtained, the analysis itself was carried out by reviewing the following possible causes of failure: a poor construction process of the curtain, hydraulic suppression, a wrong structural design, a wrong geotechnical design, a wrong hydraulic design, and an inadequate design of the curtain filter. It is concluded that the type of filter employed in the land dam curtain of "El Batan", located in the municipality of Querétaro, México, does not have adequate characteristics and falls outside the design ranges given by the following criteria curves: Terzaghi criteria, Sherard and Dunnigan criteria, UCSCS criteria, and Foster and Fell criteria.

Keywords: failure, dam, filter, curtain

Procedia PDF Downloads 478
595 A Dynamic Solution Approach for Heart Disease Prediction

Authors: Walid Moudani

Abstract:

The healthcare environment is generally perceived as being information rich yet knowledge poor: there is a lack of effective analysis tools to discover hidden relationships and trends in data. In fact, valuable knowledge can be discovered by applying data mining techniques in the healthcare system. This study presents a methodology for extracting significant patterns from coronary heart disease data warehouses for the prediction of heart attacks, which unfortunately continue to be a leading cause of mortality worldwide. For this purpose, we propose to enumerate dynamically the optimal subsets of the reduced features of high interest by using the rough sets technique combined with dynamic programming. We then propose to validate the classification using a Random Forest (RF) decision tree to identify the risky heart disease cases. This work is based on a large amount of data collected from several clinical institutions and based on the medical profiles of patients. Moreover, expert knowledge in this field has been taken into consideration in order to define the disease and its risk factors and to establish significant knowledge relationships among the medical factors. A computer-aided system is developed for this purpose based on a population of 525 adults. The performance of the proposed model is analyzed and evaluated against a set of benchmark techniques applied to this classification problem.

Keywords: multi-classifier decisions tree, features reduction, dynamic programming, rough sets

Procedia PDF Downloads 394
594 Connecting Students and Faculty Research Efforts through the Research and Projects Portal

Authors: Havish Nalapareddy, Mark V. Albert, Ranak Bansal, Avi Udash, Lin Lin

Abstract:

Students engage in many course projects during their degree programs. However, impactful projects often need a time frame longer than a single semester. Ideally, projects are documented and structured to be readily accessible to future students who may choose to continue the project, with features that emphasize the local community, university, or course structure. The Research and Project Portal (RAPP) is a place where students can post both their completed and ongoing projects with all the resources and tools used. This portal allows students to see what other students have done in the past, in the same university environment, related to their domain of interest. Computer science instructors or students selecting projects can use this portal to assign or choose an incomplete project. Additionally, this portal allows non-computer science faculty and industry collaborators to document their project ideas for students in courses to prototype directly, rather than directly soliciting the help of instructors in engaging students. RAPP serves as a platform linking students across classes and faculty both in and out of computer science courses on joint projects to encourage long-term project efforts across semesters or years.

Keywords: education, technology, research, academic portal

Procedia PDF Downloads 121
593 Experiences and Perceptions of Parents Raising Children with Autism

Authors: Tamene Keneni, Tibebu Yohannes

Abstract:

The prevalence of autism spectrum disorder (ASD) in general and autism in particular is on the rise globally, and the need for evidence-based intervention and care for children with autism has grown, too. However, evidence on autism is scanty in developing countries, including Ethiopia. To help fill this gap in research, the main purpose of this study is to explore, better understand, and document the experiences and perceptions of parents of children with autism. To this end, we used a qualitative survey to collect data from a convenience sample of parents raising a child with autism. The data collected were subjected to qualitative analysis that yielded several themes and subthemes, including late diagnosis; parents’ reactions to diagnosis; sources of information during and after diagnosis; differing reactions to having a child with autism from siblings, extended family members, and the larger community; attribution of autism to several causes by the community; lack of recognition and open discussion of autism; and lack of appropriate public educational and health care services for children with autism and their parents. The themes and subthemes identified were discussed in light of existing literature, and implications for practice were drawn.

Keywords: ASD, autism, children with autism, raising children with autism

Procedia PDF Downloads 136
592 A Palmprint Identification System Based Multi-Layer Perceptron

Authors: David P. Tantua, Abdulkader Helwan

Abstract:

Biometrics has recently been used in human identification systems that rely on biological traits such as fingerprints and iris scans. Biometrics-based identification systems show great efficiency and accuracy in such human identification applications. However, these systems have so far been based on image processing techniques only, which may decrease their efficiency. Thus, this paper aims to develop a human palmprint identification system using a multi-layer perceptron neural network, which has the capability to learn using a backpropagation learning algorithm. The developed system uses images obtained from a public database available on the internet (CASIA). The processing pipeline is as follows: image filtering using a median filter, image adjustment, image skeletonizing, edge detection using the Canny operator to extract features, and clearing of unwanted components from the image. The second phase is to feed the processed images into a neural network classifier, which adaptively learns and creates a class for each different image. 100 different images are used for training the system. Since this is an identification system, it should be tested with the same images. Therefore, the same 100 images are used for testing it, and any image outside the training set should be unrecognized. The experimental results show that the developed system reaches an accuracy of 100% and can be implemented in real-life applications.
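
The first preprocessing step named in this abstract, median filtering, can be sketched as below. This is a generic 3x3 median filter on a grayscale image stored as a list of lists, with windows clipped at the image borders; the window size, the border handling, and the toy image are assumptions, since the abstract does not specify them.

```python
from statistics import median_low

def median_filter(image, radius=1):
    """Replace each pixel by the median of its (2*radius+1)^2 neighbourhood.

    Windows are clipped at the borders, so edge pixels use fewer neighbours.
    """
    height, width = len(image), len(image[0])
    out = [[0] * width for _ in range(height)]
    for r in range(height):
        for c in range(width):
            window = [
                image[rr][cc]
                for rr in range(max(0, r - radius), min(height, r + radius + 1))
                for cc in range(max(0, c - radius), min(width, c + radius + 1))
            ]
            # median_low keeps the result an actual pixel value (no averaging).
            out[r][c] = median_low(window)
    return out

# Toy 3x3 image with a single bright "salt" pixel in the centre.
noisy = [[10, 10, 10], [10, 255, 10], [10, 10, 10]]
filtered = median_filter(noisy)
print(filtered)
```

A median filter is a common choice before edge detection because it removes isolated outlier pixels while preserving edges better than simple averaging.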

Keywords: biometrics, biological traits, multi-layer perceptron neural network, image skeletonizing, edge detection using canny operator

Procedia PDF Downloads 354
591 A Cross-Cultural Strategy for Managing an Organisation Located in a Diverse-Populated Community

Authors: Tsuu Faith Machingura, Daniel Madzanire, Doreen Nkala

Abstract:

Plentiful employment opportunities in various towns in Zimbabwe attracted linguistically diverse ethnic groups to settle there. This movement, which was largely economically induced, created diversely populated communities in towns and in surrounding areas. Service provision in such domains as education and business needs to be sensitive to diversity. Prompted by the prevalence of diversity in present-day business organisations, the study sought to suggest a cross-cultural strategy for managing an organisation located in a diverse-populated community. A case study research design was used. A sample of 10 participants consisting of five diverse business owners and five diverse clients was purposively drawn. Document analysis and key informant interviews were used to gather data. The study revealed that organisations located in diverse-populated communities were shaped by the prevailing ethos. A diversity-sensitive managerial strategy was suggested as a pertinent cross-cultural managerial tool.

Keywords: cross-cultural strategy, linguistic diversity, diverse-populated community, ethnic groups

Procedia PDF Downloads 46
590 Land Use/Land Cover Mapping Using Landsat 8 and Sentinel-2 in a Mediterranean Landscape

Authors: Moschos Vogiatzis, K. Perakis

Abstract:

Spatially explicit and up-to-date land use/land cover information is fundamental for spatial planning, land management, sustainable development, and sound decision-making. In the last decade, many satellite-derived land cover products at different spatial, spectral, and temporal resolutions have been developed, such as the European Copernicus Land Cover product. However, more efficient and detailed land use/land cover information is required at the regional or local scale. A typical Mediterranean basin with a complex landscape comprising various forest types, crops, artificial surfaces, and wetlands was selected to develop and test our approach. In this study, we investigate the improvement of the Copernicus Land Cover product (CLC2018) using Landsat 8 and Sentinel-2 pixel-based classification based on all available existing geospatial data (Forest Maps, LPIS, Natura2000 habitats, cadastral parcels, etc.). We examined and compared the performance of the Random Forest classifier for land use/land cover mapping. In total, 10 land use/land cover categories were recognized in Landsat 8 and 11 in Sentinel-2A. A comparison of the overall classification accuracies for 2018 shows that the Landsat 8 classification accuracy was slightly higher than that of Sentinel-2A (82.99% vs. 80.30%). We concluded that the main land use/land cover types of CLC2018, even within a heterogeneous area, can be successfully mapped and updated according to CLC nomenclature. Future research should be oriented toward integrating spatiotemporal information from seasonal bands and spectral indexes in the classification process.

Keywords: classification, land use/land cover, mapping, random forest

Procedia PDF Downloads 109
589 The Digital Video and Online Media Development for Integrated Marketing Communication and Tourism Promote in Taling Chan District, Bangkok

Authors: Somsak Klaysung

Abstract:

This study aims to develop a video to promote cultural tourism in Taling Chan District. For the qualitative research, the sample comprised 40 people from 5 groups of tourism entrepreneurs in Taling Chan District; content analysis of the key informants was conducted using focus groups and structured in-depth interviews with all stakeholders. Quota sampling was used for this research. The findings yielded a marketing and tourism video with a set length of 11.35 9 minutes. There is plenty of social capital in Taling Chan District, including local wisdom, knowledge, and ways of thinking related to nature, history, historic documents, occupations, administration, and the attributes of local people. Additional research found a new travel path along the water route of Khlong Bang Ramat, called Route 9 temples, which travelers can follow by boat and which is also available in the markets in four areas of Taling Chan.

Keywords: digital video, integrated marketing communication, online media development, Taling Chan district

Procedia PDF Downloads 331
588 Inadequate Requirements Engineering Process: A Key Factor for Poor Software Development in Developing Nations: A Case Study

Authors: K. Adu Michael, K. Alese Boniface

Abstract:

Developing reliable and sustainable software products is today a big challenge among upcoming software developers in Nigeria. The ability to develop the comprehensive problem statement needed to execute a proper requirements engineering process is missing. Describing the ‘what’ of a system in one document, written in a natural language, is a major step in the overall process of Software Engineering. Requirements Engineering is a process used to discover, analyze, and validate system requirements. This process is needed to reduce software errors at the early stage of software development. The importance of each of the steps in Requirements Engineering is clearly explained in the context of using a detailed problem statement from the client/customer to get an overview of an existing system along with expectations of the new system. This paper identifies inadequate Requirements Engineering practice as the major cause of poor software development in developing nations, using a case study of final-year computer science students of a tertiary education institution in Nigeria.

Keywords: client/customer, problem statement, requirements engineering, software developers

Procedia PDF Downloads 387
587 A Proposed Framework for Software Redocumentation Using Distributed Data Processing Techniques and Ontology

Authors: Laila Khaled Almawaldi, Hiew Khai Hang, Sugumaran A. l. Nallusamy

Abstract:

Legacy systems are crucial for organizations, but their intricacy and lack of documentation pose challenges for maintenance and enhancement. Redocumentation of legacy systems is vital for automatically or semi-automatically creating documentation for software lacking sufficient records. It aims to enhance system understandability, maintainability, and knowledge transfer. However, existing redocumentation methods need improvement in data processing performance and document generation efficiency. This stems from the necessity to efficiently handle the extensive and complex code of legacy systems. This paper proposes a method for semi-automatic legacy system re-documentation using semantic parallel processing and ontology. Leveraging parallel processing and ontology addresses current challenges by distributing the workload and creating documentation with logically interconnected data. The paper outlines challenges in legacy system redocumentation and suggests a method of redocumentation using parallel processing and ontology for improved efficiency and effectiveness.

Keywords: legacy systems, redocumentation, big data analysis, parallel processing

Procedia PDF Downloads 22
586 Product Features Extraction from Opinions According to Time

Authors: Kamal Amarouche, Houda Benbrahim, Ismail Kassou

Abstract:

E-commerce shopping websites have experienced noticeable growth and have gained consumers’ trust. After purchasing a product, many consumers share comments in which opinions about the product are usually embedded. Research on the automatic management of opinions, which gives suggestions to potential consumers and portrays an image of the product to manufacturers, has been growing recently. Just after a product is launched, the reviews generated around it usually contain little helpful information, only generic opinions about the product (e.g. telephone: great phone...), because the product is still in its launch phase in the market. With time, the product ages, and consumers come to perceive the advantages and disadvantages of each specific product feature. They then generate comments that contain their sentiments about these features. In this paper, we present an unsupervised method to extract the different product features hidden in the opinions that influence purchase, which combines Time Weighting (TW), which depends on the time at which opinions were expressed, with Term Frequency-Inverse Document Frequency (TF-IDF). We conduct several experiments using two different datasets about cell phones and hotels. The results show the effectiveness of our automatic feature extraction, as well as its domain-independent character.
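
One way to picture the combination of Time Weighting with TF-IDF described in this abstract is the sketch below. The abstract does not give the TW formula, so the linear weight used here (later reviews count more) is purely an assumption, as are the function name, the tokenised toy reviews, and the day numbers.

```python
import math
from collections import Counter

def time_weighted_tfidf(reviews, launch_day=0):
    """Score terms by TF-IDF scaled with a time weight.

    `reviews` is a list of (day, tokens). The time weight grows linearly from
    0 at launch to 1 at the most recent review (an assumed scheme).
    """
    n_docs = len(reviews)
    doc_freq = Counter()
    for _, tokens in reviews:
        doc_freq.update(set(tokens))
    span = max(max(day for day, _ in reviews) - launch_day, 1)

    scores = Counter()
    for day, tokens in reviews:
        time_weight = (day - launch_day) / span
        for term, count in Counter(tokens).items():
            tf = count / len(tokens)
            idf = math.log(n_docs / doc_freq[term])
            scores[term] += time_weight * tf * idf
    return scores

# Hypothetical cell-phone reviews: early ones are generic, later ones name features.
reviews = [
    (1, ["great", "phone"]),
    (2, ["great", "phone", "nice"]),
    (60, ["battery", "drains", "fast"]),
    (90, ["battery", "life", "short", "phone"]),
]
scores = time_weighted_tfidf(reviews)
print(scores.most_common(3))
```

Under this assumed weighting, terms from later, feature-specific reviews such as "battery" outrank generic early praise such as "great", which mirrors the motivation given in the abstract.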

Keywords: opinion mining, product feature extraction, sentiment analysis, SentiWordNet

Procedia PDF Downloads 380
585 A Supervised Approach for Detection of Singleton Spam Reviews

Authors: Atefeh Heydari, Mohammadali Tavakoli, Naomie Salim

Abstract:

In recent years, online reviews have become the most important source of customers’ opinions. They are increasingly used by individuals and organisations to make purchase and business decisions. Unfortunately, for profit or fame, fraudsters produce deceptive reviews to hoodwink potential customers. Their activities not only mislead potential customers trying to make appropriate purchasing decisions and organisations trying to reshape their business, but also prevent opinion mining techniques from reaching accurate results. Spam reviews can be divided into two main groups: multiple and singleton spam reviews. Detecting a singleton spam review, which is the only review written by a user ID, is extremely challenging because of the lack of clues for detection. Singleton spam reviews are very harmful, and the various features and evidence used in multiple spam review detection are not applicable in this case. The current research proposes a novel supervised technique to detect singleton spam reviews. To achieve this, various features are proposed in this study, combined with the most appropriate features extracted from the literature, and employed in a classifier. To compare the performance of different classifiers, SVM and naive Bayes classification algorithms were used for model building. The results revealed that SVM was more accurate than naive Bayes and that our proposed technique is capable of detecting singleton spam reviews effectively.

Keywords: classification algorithms, Naïve Bayes, opinion review spam detection, singleton review spam detection, support vector machine

Procedia PDF Downloads 286
584 Ethnomedicinal Uses of Plants in Bridim Village Development Committee in Langtang National Park, Nepal

Authors: Ila Shrestha

Abstract:

Bridim Village Development Committee (VDC) is one of the medicinal plant hot spots of Nepal. It is located on a ridge above the lower Langtang Khola, a steep and narrow spot between 1944 m and 4833 m altitude. The study area is homogeneously inhabited by Tamang communities. An investigation of folk herbal medicine, based on traditional uses of medicinal plants, was carried out in 2014. Local traditional healers, elder men and women, traders and teachers were consulted as key informants to document indigenous knowledge of the medicinal plants. Altogether, seventy-one medicinal plant species belonging to sixty genera and thirty-three families were found to be used by local people for twenty-seven diseases. Roots were the most frequently used plant part (thirty-four species), and the largest number of species (ten) was used to treat fever. Most medicines were prepared in the form of juice (forty species). The study aimed to document ethnomedicinal practices used to treat different diseases in the study area for the conservation of indigenous knowledge.

Keywords: Bridim village, ethnomedicine, national park, plants

Procedia PDF Downloads 278
583 Diagnosis of the Heart Rhythm Disorders by Using Hybrid Classifiers

Authors: Sule Yucelbas, Gulay Tezel, Cuneyt Yucelbas, Seral Ozsen

Abstract:

This study attempts to identify some heart rhythm disorders from electrocardiography (ECG) data taken from the MIT-BIH arrhythmia database by extracting the required features and presenting them to artificial neural network (ANN), artificial immune system (AIS), artificial neural network based on artificial immune system (AIS-ANN) and particle swarm optimization based artificial neural network (PSO-ANN) classifier systems. The main purpose of this study is to evaluate the performance of the hybrid AIS-ANN and PSO-ANN classifiers relative to ANN and AIS. For this purpose, the normal sinus rhythm (NSR), atrial premature contraction (APC), sinus arrhythmia (SA), ventricular trigeminy (VTI), ventricular tachycardia (VTK) and atrial fibrillation (AF) data for each of the RR intervals were found. These data were then combined into pairs (NSR-APC, NSR-SA, NSR-VTI, NSR-VTK and NSR-AF), the discrete wavelet transform was applied to each of the two groups of data in a pair, and two different data sets with 9 and 27 features were obtained from each of them after data reduction. Afterwards, the data were first shuffled randomly, and then the 4-fold cross-validation method was applied to create the training and testing data. The training and testing accuracy rates and the training times were compared. The performance of the hybrid classification systems, AIS-ANN and PSO-ANN, was seen to be close to that of the ANN system, and the results of the hybrid systems were much better than those of AIS. However, ANN had a much shorter training time than the other systems. In terms of training time, ANN was followed by PSO-ANN, AIS-ANN and AIS, respectively. The features extracted from the data also affected the classification results significantly.
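
The shuffle-then-split step described above can be sketched as follows. This is a generic illustration of 4-fold cross-validation over sample indices; the function name, the fixed seed, and the sample count are assumptions and do not come from the study.

```python
import random


def k_fold_indices(n_samples, k=4, seed=0):
    """Yield (train, test) index lists for k-fold cross-validation.

    The indices are shuffled once, then dealt into k folds; each fold
    serves as the test set exactly once.
    """
    indices = list(range(n_samples))
    random.Random(seed).shuffle(indices)  # seed is an assumed choice
    folds = [indices[i::k] for i in range(k)]
    for i in range(k):
        test = folds[i]
        train = [idx for j, fold in enumerate(folds) if j != i for idx in fold]
        yield train, test


# Hypothetical usage with 10 samples: every sample is tested once.
for train, test in k_fold_indices(10, k=4):
    print(len(train), len(test))
```

Each classifier would be trained on `train` and scored on `test` in every round, and the four accuracy values averaged.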

Keywords: AIS, ANN, ECG, hybrid classifiers, PSO

Procedia PDF Downloads 423
582 Development of Analytical Systems for Nurses in Kenya

Authors: Peris Wanjiku

Abstract:

The objective of this paper is to describe the development and implications of a national nursing workforce analytical system in Kenya. Findings: Creating a national electronic nursing workforce analytical system provides more reliable information on nurses’ national demographics, migration patterns, and workforce capacity and efficiency. Data analysis is most useful for human resources for health (HRH) planning when workforce capacity data can be linked to worksite staffing requirements. As a result of establishing this database, the Kenya Ministry of Health has improved its capability to assess its nursing workforce and document important workforce trends, such as out-migration. Current data identify the United States as the leading recipient country of Kenyan nurses. The overwhelming majority of Kenyan nurses who decide to out-migrate are amongst Kenya’s most qualified. Conclusions: The Kenya nursing database is a first step toward facilitating evidence-based decision-making in HRH. This database is unique to developing countries in sub-Saharan Africa. Establishing an electronic workforce database requires long-term investment and sustained support by national and global stakeholders.

Keywords: analytical, information, health, migration

Procedia PDF Downloads 80
581 Guide to the Development of the Intensive English Program for Graduate Students

Authors: Piyawan Sunasuan, Thiranan Pansuppawat, Mananya Manaratchasak, Maream Nillapun

Abstract:

This research aims to guide the development of the intensive English program for graduate students. The objectives are 1) to study the English skills needed by graduate students and 2) to study the potential of the current course relative to the expected proficiency level. The sample comprises 46 graduate students enrolled in the ENG 102 and ENG 103 courses in semester one of the 2019/2020 school year at Silpakorn University, Sanamchandra Palace Campus, and two teachers. The researchers use 1) a student survey, 2) teacher interviews, and 3) a focus group discussion among selected students. The data are analyzed by calculating the mean (x̅) and the standard deviation, and through document analysis. The findings show that nine skills need to be addressed in course development: 1) academic writing, 2) occupational purpose writing, 3) communicative reading, 4) occupational purpose reading, 5) academic speaking, 6) occupational purpose speaking, 7) occupational purpose listening, 8) academic listening, and 9) communicative listening. The current course does not meet expectations at a high level but has potential.

Keywords: English for academic purposes, English for communication, English for occupational purposes, intensive English

Procedia PDF Downloads 131
580 Multimedia Data Fusion for Event Detection in Twitter by Using Dempster-Shafer Evidence Theory

Authors: Samar M. Alqhtani, Suhuai Luo, Brian Regan

Abstract:

Data fusion technology can be the best way to extract useful information from multiple sources of data, and it has been widely applied in various applications. This paper presents a data fusion approach for multimedia data for event detection in Twitter using Dempster-Shafer evidence theory. The methodology applies a mining algorithm to detect the event. There are two types of data in the fusion. The first is features extracted from text using the bag-of-words method, weighted by term frequency-inverse document frequency (TF-IDF). The second is visual features extracted by applying the scale-invariant feature transform (SIFT). The Dempster-Shafer theory of evidence is applied to fuse the information from these two sources. Our experiments indicate that, compared with approaches using an individual data source, the proposed data fusion approach can increase the prediction accuracy for event detection. The proposed method achieved a high accuracy of 0.97, compared with 0.93 with texts only and 0.86 with images only.
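
The fusion step rests on Dempster's rule of combination, which can be sketched in a few lines. The two-hypothesis frame ("event" / "no_event") and the mass values below are hypothetical illustrations, not figures from the paper; how the authors derive masses from the TF-IDF and SIFT features is not stated in the abstract.

```python
def combine(m1, m2):
    """Combine two mass functions with Dempster's rule.

    Each mass function maps a frozenset of hypotheses to its mass.
    Products landing on an empty intersection count as conflict and
    are removed by normalisation.
    """
    combined = {}
    conflict = 0.0
    for a, mass_a in m1.items():
        for b, mass_b in m2.items():
            intersection = a & b
            if intersection:
                combined[intersection] = (
                    combined.get(intersection, 0.0) + mass_a * mass_b
                )
            else:
                conflict += mass_a * mass_b
    if conflict >= 1.0:
        raise ValueError("sources are in total conflict")
    return {s: v / (1.0 - conflict) for s, v in combined.items()}


# Hypothetical masses from a text source and an image source.
EVENT = frozenset({"event"})
NO_EVENT = frozenset({"no_event"})
UNKNOWN = EVENT | NO_EVENT  # mass left on the whole frame (ignorance)

text_mass = {EVENT: 0.7, NO_EVENT: 0.1, UNKNOWN: 0.2}
image_mass = {EVENT: 0.6, NO_EVENT: 0.2, UNKNOWN: 0.2}

fused = combine(text_mass, image_mass)
print(round(fused[EVENT], 2))
```

For these illustrative inputs the fused belief in "event" comes out at about 0.85, higher than either source alone, which is the qualitative effect the abstract reports for combining text and image evidence.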

Keywords: data fusion, Dempster-Shafer theory, data mining, event detection

Procedia PDF Downloads 391
579 Ethnobotanical Study of Spontaneous Medicinal Plants Used in the Treatment of Viral Respiratory Diseases in the Prerif, Morocco

Authors: El Amane Salma, Rahou Abdelilah

Abstract:

Viral respiratory infections (common cold, flu, sinusitis, bronchiolitis, etc.) are among the most common infections in the world and can cause severe symptoms. In Morocco, as elsewhere in the world and especially in developing countries, medicinal plants are widely used to treat several diseases, including those of the respiratory system. The objective of our study is to identify and document medicinal plants used in traditional medicine to treat viral respiratory infections and alleviate their symptoms, in order to generate interest in future studies verifying the efficacy of these traditional medicines and in their conservation. The information acquired from 81 questionnaires and the floristic identification allowed us to identify 19 spontaneous species belonging to 11 families, used as traditional therapies for viral respiratory diseases in the Prerif. Herbs are the most used life form. The results also showed that leaves were the most commonly used plant part and that most of the herbal medicines were prepared as infusions and administered orally. Documented data were evaluated using use value (UV), family importance value (FIV) and relative frequency of citation (RFC).
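
The abstract does not define its indices, so the sketch below uses the definitions commonly found in the ethnobotanical literature: relative frequency of citation as the share of informants citing a species, and use value as the total number of use reports for a species divided by the number of informants. Treating each of the 81 questionnaires as one informant is an assumption, and the citation counts are hypothetical.

```python
def relative_frequency_of_citation(citing_informants, total_informants):
    """Share of informants who cited the species (between 0 and 1)."""
    return citing_informants / total_informants


def use_value(use_reports_per_informant, total_informants):
    """Total use reports for a species divided by all informants."""
    return sum(use_reports_per_informant) / total_informants


TOTAL_INFORMANTS = 81  # assumes one informant per questionnaire

# Hypothetical species cited by 27 informants, with 40 use reports
# in total (13 informants reported two uses, 14 reported one).
rfc = relative_frequency_of_citation(27, TOTAL_INFORMANTS)
uv = use_value([2] * 13 + [1] * 14, TOTAL_INFORMANTS)
print(round(rfc, 3), round(uv, 3))
```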

Keywords: medicinal plants, ethnobotanical, ethnopharmacological, viral respiratory diseases, Morocco

Procedia PDF Downloads 166
578 Artificial Intelligence Assisted Sentiment Analysis of Hotel Reviews Using Topic Modeling

Authors: Sushma Ghogale

Abstract:

With the surge in user-generated content, feedback and reviews on the internet, it has become both possible and important to know consumers' opinions about products and services. This data is important for both potential customers and businesses providing the services. Data from social media is attracting significant attention, and social media has become the most prominent channel for expressing unregulated opinions. Prospective customers look for reviews from experienced customers before deciding to buy a product or service. Several websites provide a platform for users to post their feedback for the provider and potential customers. However, the biggest challenge in analyzing such data is extracting latent features and providing term-level analysis of the data. This paper proposes an approach that uses topic modeling to classify the reviews into topics and sentiment analysis to mine the opinions. The approach can analyse and classify latent topics mentioned by reviewers on business sites, review sites or social media, using topic modeling to identify the importance of each topic. It is followed by sentiment analysis to assess the satisfaction level for each topic. The approach classifies hotel reviews using multiple machine learning techniques and compares different classifiers for mining the opinions in user reviews through sentiment analysis. The experiment concludes that the Multinomial Naïve Bayes classifier produces higher accuracy than the other classifiers.
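
To make the classifier named above concrete, here is a minimal from-scratch sketch of multinomial Naïve Bayes with add-one (Laplace) smoothing. It is a textbook illustration, not the paper's pipeline: the function name, the smoothing parameter `alpha`, and the four toy reviews are assumptions.

```python
import math
from collections import Counter, defaultdict


def train_multinomial_nb(docs, alpha=1.0):
    """Fit multinomial Naive Bayes; docs is a list of (label, tokens).

    Returns a predict(tokens) function. alpha is the additive
    smoothing constant (1.0 gives Laplace smoothing).
    """
    class_counts = Counter(label for label, _ in docs)
    word_counts = defaultdict(Counter)
    vocab = set()
    for label, tokens in docs:
        word_counts[label].update(tokens)
        vocab.update(tokens)

    def predict(tokens):
        best_label, best_score = None, -math.inf
        for label, n_docs in class_counts.items():
            total = sum(word_counts[label].values())
            score = math.log(n_docs / len(docs))  # log prior
            for tok in tokens:
                if tok not in vocab:
                    continue  # ignore words never seen in training
                likelihood = (word_counts[label][tok] + alpha) / (
                    total + alpha * len(vocab)
                )
                score += math.log(likelihood)
            if score > best_score:
                best_label, best_score = label, score
        return best_label

    return predict


# Hypothetical toy hotel reviews.
training = [
    ("pos", ["clean", "room", "friendly", "staff"]),
    ("pos", ["great", "location", "clean"]),
    ("neg", ["dirty", "room", "rude", "staff"]),
    ("neg", ["noisy", "dirty", "bathroom"]),
]
predict = train_multinomial_nb(training)
print(predict(["clean", "friendly"]), predict(["dirty", "noisy"]))
```

Working in log space avoids numeric underflow when reviews contain many tokens.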

Keywords: latent Dirichlet allocation, topic modeling, text classification, sentiment analysis

Procedia PDF Downloads 86
577 Rank-Based Chain-Mode Ensemble for Binary Classification

Authors: Chongya Song, Kang Yen, Alexander Pons, Jin Liu

Abstract:

In the field of machine learning, ensembles are commonly employed to improve performance over multiple base classifiers. However, true predictions are often canceled out by false ones during consensus due to a phenomenon called the “curse of correlation”, which manifests as strong interference among the predictions produced by the base classifiers. In addition, existing practices are still not able to effectively mitigate the problem of imbalanced classification. Based on an analysis of our experimental results, we conclude that the two problems are caused by inherent deficiencies in the consensus approach. Therefore, we create an enhanced ensemble algorithm that adopts a purpose-designed rank-based chain-mode consensus to overcome the two problems. To evaluate the proposed ensemble algorithm, we employ the well-known benchmark data set NSL-KDD (the improved version of the KDDCup99 dataset produced by the University of New Brunswick) to compare the proposed algorithm with 8 common ensemble algorithms. In particular, each compared ensemble classifier uses the same 22 base classifiers, so that the differences in improvement in accuracy and reliability over the base classifiers can be truly revealed. As a result, the proposed rank-based chain-mode consensus proves to be a more effective ensemble solution than the traditional consensus approach, outperforming the 8 ensemble algorithms by 20% on almost all compared metrics, which include accuracy, precision, recall, F1-score and area under the receiver operating characteristic curve.

Keywords: consensus, curse of correlation, imbalance classification, rank-based chain-mode ensemble

Procedia PDF Downloads 117