Commenced in January 2007

Frequency: Monthly

Edition: International

Paper Count: 11024

Search results for: machine learning; medicinal plants

10664 Determination of Water Pollution and Water Quality with Decision Trees

Abstract:

With the increasing emphasis on water quality worldwide, the search for and expanding the market for new and intelligent monitoring systems has increased. The current method is the laboratory process, where samples are taken from bodies of water, and tests are carried out in laboratories. This method is time-consuming, a waste of manpower, and uneconomical. To solve this problem, we used machine learning methods to detect water pollution in our study. We created decision trees with the Orange3 software we used in our study and tried to determine all the factors that cause water pollution. An automatic prediction model based on water quality was developed by taking many model inputs such as water temperature, pH, transparency, conductivity, dissolved oxygen, and ammonia nitrogen with machine learning methods. The proposed approach consists of three stages: preprocessing of the data used, feature detection, and classification. We tried to determine the success of our study with different accuracy metrics and the results. We presented it comparatively. In addition, we achieved approximately 98% success with the decision tree.

Keywords: decision tree, water quality, water pollution, machine learning

Procedia PDF Downloads 72

10663 Benchmarking Machine Learning Approaches for Forecasting Hotel Revenue

Authors: Rachel Y. Zhang, Christopher K. Anderson

Abstract:

A critical aspect of revenue management is a firm’s ability to predict demand as a function of price. Historically hotels have used simple time series models (regression and/or pick-up based models) owing to the complexities of trying to build casual models of demands. Machine learning approaches are slowly attracting attention owing to their flexibility in modeling relationships. This study provides an overview of approaches to forecasting hospitality demand – focusing on the opportunities created by machine learning approaches, including K-Nearest-Neighbors, Support vector machine, Regression Tree, and Artificial Neural Network algorithms. The out-of-sample performances of above approaches to forecasting hotel demand are illustrated by using a proprietary sample of the market level (24 properties) transactional data for Las Vegas NV. Causal predictive models can be built and evaluated owing to the availability of market level (versus firm level) data. This research also compares and contrast model accuracy of firm-level models (i.e. predictive models for hotel A only using hotel A’s data) to models using market level data (prices, review scores, location, chain scale, etc… for all hotels within the market). The prospected models will be valuable for hotel revenue prediction given the basic characters of a hotel property or can be applied in performance evaluation for an existed hotel. The findings will unveil the features that play key roles in a hotel’s revenue performance, which would have considerable potential usefulness in both revenue prediction and evaluation.

Keywords: hotel revenue, k-nearest-neighbors, machine learning, neural network, prediction model, regression tree, support vector machine

Procedia PDF Downloads 119

10662 A Comparative Evaluation of Antioxidant Activity of in vivo and in vitro Raised Holarrhena antidysenterica Linn.

Authors: Gayatri Nahak, Satyajit Kanungo, Rajani Kanta Sahu

Abstract:

Holarrhena antidysenterica Linn. (Apocynaceae) is a typical Indian medicinal plant popularly known as “Indrajav”. Traditionally the plant has been considered a popular remedy for the treatment of dysentery, diarrhea, intestinal worms and the seeds of this plant are also used as an anti-diabetic remedy. In the present study axillary shoot multiplication, callus induction and shoot regeneration from callus culture were obtained on Murashige and Skoog (MS) medium supplemented with different concentrations and combinations of plant growth regulators. Then in vivo and in vitro grown healthy plants were selected for study of antioxidant activity through DPPH and OH methods. Significantly higher antioxidant activity and phenol contents were observed in vitro raised plant in comparison to in vivo plants. The findings indicated the greater amount of phenolic compounds leads to more potent radical scavenging effect as shown in in vitro raised plant in comparison to in vivo plants which showed the ability to utilize tissue culture techniques towards development of desired bioactive metabolites from in vitro culture as an alternative way to avoid using endangered plants in pharmaceutical purposes.

Keywords: Holarrhena antidysenterica, in vitro, in vivo, antioxidant activity

Procedia PDF Downloads 497

10661 Accelerating Quantum Chemistry Calculations: Machine Learning for Efficient Evaluation of Electron-Repulsion Integrals

Authors: Nishant Rodrigues, Nicole Spanedda, Chilukuri K. Mohan, Arindam Chakraborty

Abstract:

A crucial objective in quantum chemistry is the computation of the energy levels of chemical systems. This task requires electron-repulsion integrals as inputs, and the steep computational cost of evaluating these integrals poses a major numerical challenge in efficient implementation of quantum chemical software. This work presents a moment-based machine-learning approach for the efficient evaluation of electron-repulsion integrals. These integrals were approximated using linear combinations of a small number of moments. Machine learning algorithms were applied to estimate the coefficients in the linear combination. A random forest approach was used to identify promising features using a recursive feature elimination approach, which performed best for learning the sign of each coefficient but not the magnitude. A neural network with two hidden layers were then used to learn the coefficient magnitudes along with an iterative feature masking approach to perform input vector compression, identifying a small subset of orbitals whose coefficients are sufficient for the quantum state energy computation. Finally, a small ensemble of neural networks (with a median rule for decision fusion) was shown to improve results when compared to a single network.

Keywords: quantum energy calculations, atomic orbitals, electron-repulsion integrals, ensemble machine learning, random forests, neural networks, feature extraction

Procedia PDF Downloads 95

10660 Advancing Urban Sustainability through Data-Driven Machine Learning Solutions

Authors: Nasim Eslamirad, Mahdi Rasoulinezhad, Francesco De Luca, Sadok Ben Yahia, Kimmo Sakari Lylykangas, Francesco Pilla

Abstract:

With the ongoing urbanization, cities face increasing environmental challenges impacting human well-being. To tackle these issues, data-driven approaches in urban analysis have gained prominence, leveraging urban data to promote sustainability. Integrating Machine Learning techniques enables researchers to analyze and predict complex environmental phenomena like Urban Heat Island occurrences in urban areas. This paper demonstrates the implementation of data-driven approach and interpretable Machine Learning algorithms with interpretability techniques to conduct comprehensive data analyses for sustainable urban design. The developed framework and algorithms are demonstrated for Tallinn, Estonia to develop sustainable urban strategies to mitigate urban heat waves. Geospatial data, preprocessed and labeled with UHI levels, are used to train various ML models, with Logistic Regression emerging as the best-performing model based on evaluation metrics to derive a mathematical equation representing the area with UHI or without UHI effects, providing insights into UHI occurrences based on buildings and urban features. The derived formula highlights the importance of building volume, height, area, and shape length to create an urban environment with UHI impact. The data-driven approach and derived equation inform mitigation strategies and sustainable urban development in Tallinn and offer valuable guidance for other locations with varying climates.

Keywords: data-driven approach, machine learning transparent models, interpretable machine learning models, urban heat island effect

Procedia PDF Downloads 18

10659 Automatic Lead Qualification with Opinion Mining in Customer Relationship Management Projects

Authors: Victor Radich, Tania Basso, Regina Moraes

Abstract:

Lead qualification is one of the main procedures in Customer Relationship Management (CRM) projects. Its main goal is to identify potential consumers who have the ideal characteristics to establish a profitable and long-term relationship with a certain organization. Social networks can be an important source of data for identifying and qualifying leads since interest in specific products or services can be identified from the users’ expressed feelings of (dis)satisfaction. In this context, this work proposes the use of machine learning techniques and sentiment analysis as an extra step in the lead qualification process in order to improve it. In addition to machine learning models, sentiment analysis or opinion mining can be used to understand the evaluation that the user makes of a particular service, product, or brand. The results obtained so far have shown that it is possible to extract data from social networks and combine the techniques for a more complete classification.

Keywords: lead qualification, sentiment analysis, opinion mining, machine learning, CRM, lead scoring

Procedia PDF Downloads 65

10658 Housing Price Prediction Using Machine Learning Algorithms: The Case of Melbourne City, Australia

Authors: The Danh Phan

Abstract:

House price forecasting is a main topic in the real estate market research. Effective house price prediction models could not only allow home buyers and real estate agents to make better data-driven decisions but may also be beneficial for the property policymaking process. This study investigates the housing market by using machine learning techniques to analyze real historical house sale transactions in Australia. It seeks useful models which could be deployed as an application for house buyers and sellers. Data analytics show a high discrepancy between the house price in the most expensive suburbs and the most affordable suburbs in the city of Melbourne. In addition, experiments demonstrate that the combination of Stepwise and Support Vector Machine (SVM), based on the Mean Squared Error (MSE) measurement, consistently outperforms other models in terms of prediction accuracy.

Keywords: house price prediction, regression trees, neural network, support vector machine, stepwise

Procedia PDF Downloads 210

10657 Copyright Clearance for Artificial Intelligence Training Data: Challenges and Solutions

Authors: Erva Akin

Abstract:

– The use of copyrighted material for machine learning purposes is a challenging issue in the field of artificial intelligence (AI). While machine learning algorithms require large amounts of data to train and improve their accuracy and creativity, the use of copyrighted material without permission from the authors may infringe on their intellectual property rights. In order to overcome copyright legal hurdle against the data sharing, access and re-use of data, the use of copyrighted material for machine learning purposes may be considered permissible under certain circumstances. For example, if the copyright holder has given permission to use the data through a licensing agreement, then the use for machine learning purposes may be lawful. It is also argued that copying for non-expressive purposes that do not involve conveying expressive elements to the public, such as automated data extraction, should not be seen as infringing. The focus of such ‘copy-reliant technologies’ is on understanding language rules, styles, and syntax and no creative ideas are being used. However, the non-expressive use defense is within the framework of the fair use doctrine, which allows the use of copyrighted material for research or educational purposes. The questions arise because the fair use doctrine is not available in EU law, instead, the InfoSoc Directive provides for a rigid system of exclusive rights with a list of exceptions and limitations. One could only argue that non-expressive uses of copyrighted material for machine learning purposes do not constitute a ‘reproduction’ in the first place. Nevertheless, the use of machine learning with copyrighted material is difficult because EU copyright law applies to the mere use of the works. Two solutions can be proposed to address the problem of copyright clearance for AI training data. The first is to introduce a broad exception for text and data mining, either mandatorily or for commercial and scientific purposes, or to permit the reproduction of works for non-expressive purposes. The second is that copyright laws should permit the reproduction of works for non-expressive purposes, which opens the door to discussions regarding the transposition of the fair use principle from the US into EU law. Both solutions aim to provide more space for AI developers to operate and encourage greater freedom, which could lead to more rapid innovation in the field. The Data Governance Act presents a significant opportunity to advance these debates. Finally, issues concerning the balance of general public interests and legitimate private interests in machine learning training data must be addressed. In my opinion, it is crucial that robot-creation output should fall into the public domain. Machines depend on human creativity, innovation, and expression. To encourage technological advancement and innovation, freedom of expression and business operation must be prioritised.

Keywords: artificial intelligence, copyright, data governance, machine learning

Procedia PDF Downloads 73

10656 New Machine Learning Optimization Approach Based on Input Variables Disposition Applied for Time Series Prediction

Authors: Hervice Roméo Fogno Fotsoa, Germaine Djuidje Kenmoe, Claude Vidal Aloyem Kazé

Abstract:

One of the main applications of machine learning is the prediction of time series. But a more accurate prediction requires a more optimal model of machine learning. Several optimization techniques have been developed, but without considering the input variables disposition of the system. Thus, this work aims to present a new machine learning architecture optimization technique based on their optimal input variables disposition. The validations are done on the prediction of wind time series, using data collected in Cameroon. The number of possible dispositions with four input variables is determined, i.e., twenty-four. Each of the dispositions is used to perform the prediction, with the main criteria being the training and prediction performances. The results obtained from a static architecture and a dynamic architecture of neural networks have shown that these performances are a function of the input variable's disposition, and this is in a different way from the architectures. This analysis revealed that it is necessary to take into account the input variable's disposition for the development of a more optimal neural network model. Thus, a new neural network training algorithm is proposed by introducing the search for the optimal input variables disposition in the traditional back-propagation algorithm. The results of the application of this new optimization approach on the two single neural network architectures are compared with the previously obtained results step by step. Moreover, this proposed approach is validated in a collaborative optimization method with a single objective optimization technique, i.e., genetic algorithm back-propagation neural networks. From these comparisons, it is concluded that each proposed model outperforms its traditional model in terms of training and prediction performance of time series. Thus the proposed optimization approach can be useful in improving the accuracy of time series forecasts. This proves that the proposed optimization approach can be useful in improving the accuracy of time series prediction based on machine learning.

Keywords: input variable disposition, machine learning, optimization, performance, time series prediction

Procedia PDF Downloads 89

10655 In Vitro Antibacterial Activity of Selected Tanzania Medicinal Plants

Authors: Mhuji Kilonzo, Patrick Ndakidemi, Musa Chacha

Abstract:

Objective: To evaluate antibacterial activity from four selected medicinal plants namely Mystroxylon aethiopicum, Lonchocarpus capassa, Albizia anthelmentica and Myrica salicifolia used for management of bacterial infection in Tanzania. Methods: Minimum Inhibitory Concentration (MIC) of plants extracts against the tested bacterial species was determined by using 96 wells microdilution method. In this method, 50 μL of nutrient broth were loaded in each well followed by 50 μL of extract (100 mg/mL) to make a final volume of 100 μL. Subsequently, 50 μL were transferred from first rows of each well to the second rows and the process was repeated down the columns to the last wells from which 50 μL were discarded. Thereafter, 50 μL of the selected bacterial suspension were added to each well thus making a final volume of 100 μL. The lowest concentration which showed no bacterial growth was considered as MIC. Results: It was revealed that L. capassa leaf ethyl acetate extract exhibited antibacterial activity against Salmonella kisarawe and Salmonella typhi with MIC values of 0.39 and 0.781 mg/mL respectively. Likewise, L. capassa root bark ethyl acetate extracts inhibited growth of S. typhi and E. coli with MIC values of 0.39 and 0.781 mg/mL respectively. The M. aethiopicum leaf and root bark chloroform extracts displayed antibacterial activity against S. kisarawe and S. typhi respectively with MIC value of 0.781 mg/mL. The M. salicifolia stem bark ethyl acetate exhibited antibacterial activity against P. aeruginosa with MIC value of 0.39 mg/mL whereas the methanolic stem and root bark of the same plant inhibited the growth of Proteus mirabilis and Klebsiella pneumoniae with MIC value of 0.781 mg/mL. Conclusion: It was concluded that M. aethiopicum, L. capassa, A. anthelmentica and M. salicifolia are potential source of antibacterial agents. Further studies to establish structures of antibacterial and evaluate active ingredients are recommended.

Keywords: Albizia anthelmentica, Lonchocarpus capassa, Mystroxylon aethiopicum, Myrica salicifolia

Procedia PDF Downloads 210

10654 Design Consideration of a Plastic Shredder in Recycling Processes

Authors: Tolulope A. Olukunle

Abstract:

Plastic waste management has emerged as one of the greatest challenges facing developing countries. This paper describes the design of various components of a plastic shredder. This machine is widely used in industries and recycling plants. The introduction of plastic shredder machine will promote reduction of post-consumer plastic waste accumulation and serves as a system for wealth creation and empowerment through conversion of waste into economically viable products. In this design research, a 10 kW electric motor with a rotational speed of 500 rpm was chosen to drive the shredder. A pulley size of 400 mm is mounted on the electric motor at a distance of 1000 mm away from the shredder pulley. The shredder rotational speed is 300 rpm.

Keywords: design, machine, plastic waste, recycling

Procedia PDF Downloads 306

10653 Effect of Different Irrigation Intervals on Protein and Gel Production of Aloe Vera (Aloe Barbadensis M.) in Iran

Authors: Seyed Mohammad Hosein Al Omrani Nejad, Ali Rezvani Aghdam

Abstract:

This study was done in order to evaluation different irrigation intervals on amount of protein, and gel production in Aloe vera, a traditional medicinal plant. Plants was plnted in Greenhouse and irrigated according to Accumulative Pan Evaporation(APE). The treatments were included 20, 40, 60, 80, 100, 120, 140, 160, 180, and 200 mm APE which has been showed W1,W2, W3, W4, W5, W6, W7, W8,W9 and W10 respectively.The amount of protein and gel production was measured seperately. Results showed that highest protein and fresh weight of gel obtained plants which irrigated W6 and W7 respectively. According to these results can recomend which if plant irrigatedwhen APE reached 120 and 140 mm by Class A Evaporation Pan method gel production and protein would besuitable in north of khozestan province in limited irrigation conditions.

Keywords: irrigation, protein, gel, aloe vera, Iran

Procedia PDF Downloads 375

10652 Invitro Study of Anti-Leishmanial Property of Nigella Sativa Methanalic Black Seed Extract

Authors: Tawqeer Ali Syed, Prakash Chandra

Abstract:

This study aims to evaluate the antileishmanial activity of Nigella sativa black seed extract. This well-known plant extract was taken from the botanical garden of Kashmir. Materials and Methods: The methanolic extracts of these plants were screened for their antileishmanial activity against Leishmania major using 3‑(4.5‑dimethylthiazol‑2yl)‑2.5‑diphenyltetrazolium bromide assay or MTT assay. Results: The methanolic extract of Nigella sativa showed potential antileishmanial activity at an inhibition% value of 80.29% ± 0.65%. IC 50 was calculated after 48 hours to be 964.3 µg/ml. Conclusion: Considering these results, these medicinal plants from Kashmir could serve as potential drug sources for antileishmanial compounds.

Keywords: MTT assay, antileishmanial, cell viability, Nigella sativa

Procedia PDF Downloads 196

10651 AMF activates PDH 45 and G-proteins Genes to Alleviate Abiotic Stress in Tomato Plants

Authors: Deepak Bhardwaj, Narendra Tuteja

Abstract:

Global climate change is impacting large agrarian societies, especially those in countries located near the equator. Agriculture, and consequently, plant-based food, is the hardest hit in tropical and sub-tropical countries such as India due to an increased incidence of drought as well as an increase in soil salinity. One method that holds promise is AMF-rich biofertilizers which assist in activating proteins which in turn help alleviate abiotic stress in plants. In the present study, we identified two important species of (arbuscular mycorrhizal fungus) AMF belonging to Glomus and Gigaspora from the rhizosphere of the important medicinal plant Justicia adathoda. These two species have been found to be responsible for the abundance of Justicia adathoda in the semi-arid areas of the Jammu valley located in northern India, namely, the Union Territory of Jammu and Kashmir. We isolated the species of Glomus and Gigaspora from the rhizosphere of Justicia adathoda and used them as biofertilizers for the tomato plant. Significant improvements in the growth parameters were observed in the tomato plants inoculated with Glomus sp. and Gigaspora sp. in comparison with the tomato plants that were grown without AMF treatments. Tomato plants grown along with Glomus sp. and Gigaspora sp. have been observed to withstand 200 mM of salinity and 25% PEG stress. AMF also resulted in an increased concentration of proline and antioxidant enzymes in tomato plants. We also examined the expression levels of salinity and drought stress-inducible genes such as pea DNA helicase 45 (PDH 45) and genes of G-protein subunits of the tomato plants inoculated with and without AMF under stress and normal conditions. All the stress-inducible genes showed a significant increase in their gene expression under stress and AMF inoculation, while their levels were found to be normal under AMF inoculation without stress. We propose a model of abiotic stress alleviation in tomato plants with the help of external factors such as AMF and internally with the help of proteins like PDH 45 and G-proteins.

Keywords: AMF, abiotic stress, g-proteins, PDH-45

Procedia PDF Downloads 162

10650 Antibacterial Activity of Melaleuca Cajuputi Oil against Resistant Strain Bacteria

Authors: R. M. Noah, N. M. Nasir, M. R. Jais, M. S. S. Wahab, M. H. Abdullah, A. S. S. Raj

Abstract:

Infectious diseases are getting more difficult to treat due to the resistant strains of bacteria. Current generations of antibiotics are most likely ineffective against multi-drug resistant strains bacteria. Thus, there is an urgent need in search of natural antibiotics in particular from medicinal plants. One of the common medicinal plants, Melaleuca cajuputi, has been reported to possess antibacterial properties. The study was conducted to evaluate and justify the presence of antibacterial activity of Melaleuca cajuputi essential oil (EO) against the multi-drug resistant bacteria. Clinical isolates obtained from the teaching hospital were re-assessed to confirm the exact identity of the bacteria to be tested, namely methicillin-resistant staphylococcus aureus (MRSA), carbapenem-resistant enterobacteriaceae (CRE), and extended-spectrum beta-lactamases producer (ESBLs). A well diffusion method was done to observe the inhibition zones of the essential oil against the bacteria. Minimum inhibitory concentration (MIC) was determined using the microdilution method in 96-well flat microplate. The absorbance was measured using a microplate reader. Minimum bactericidal concentration (MBC) was performed using the agar medium method. The zones of inhibition produced by the EO against MRSA, CRE, and ESBL were comparable to that of generic antibiotics used, gentamicin and augmentin. The MIC and MBC results highlighted the antimicrobial efficacy of the EO. The outcome of this study indicated that the EO of Melaleuca cajuputi had antibacterial activity on the multi-drug resistant bacteria. This finding was eventually substantiated by electron microscopy work.

Keywords: melaleuca cajuputi, antibacterial, resistant bacteria, essential oil

Procedia PDF Downloads 109

10649 Comparison of Different Machine Learning Models for Time-Series Based Load Forecasting of Electric Vehicle Charging Stations

Authors: H. J. Joshi, Satyajeet Patil, Parth Dandavate, Mihir Kulkarni, Harshita Agrawal

Abstract:

As the world looks towards a sustainable future, electric vehicles have become increasingly popular. Millions worldwide are looking to switch to Electric cars over the previously favored combustion engine-powered cars. This demand has seen an increase in Electric Vehicle Charging Stations. The big challenge is that the randomness of electrical energy makes it tough for these charging stations to provide an adequate amount of energy over a specific amount of time. Thus, it has become increasingly crucial to model these patterns and forecast the energy needs of power stations. This paper aims to analyze how different machine learning models perform on Electric Vehicle charging time-series data. The data set consists of authentic Electric Vehicle Data from the Netherlands. It has an overview of ten thousand transactions from public stations operated by EVnetNL.

Keywords: forecasting, smart grid, electric vehicle load forecasting, machine learning, time series forecasting

Procedia PDF Downloads 90

10648 Assessment of Antioxidant Activities in Roots of Miswak (Salvadora persica) Plants Grown at Two Different Locations in Saudi Arabia

Authors: Mohamed M. Ibrahima, Abdul Aziz A.AL Sahli, Ibrahim A. Alaraidh, Ali A. Al-Homaidan, E.M. Mostafa, G.A. EL-Gaaly

Abstract:

Traditionally, in Middle Eastern countries, many cultures use chewing sticks of arak for medicinal purposes especially, for oral cleanliness care. It was used by Muslims for the treatment of teeth and highly recommended to be used by muslims during the whole day. Therefore, the present work aimed to determine the total phenolic content and total flavonoids in two Miswak extracts obtained from arak roots collected from two different localities in Saudi Arabia. They were extracted with aqueous ethanol (80%) and used to estimate in vitro their antioxidative abilities. The new findings showed that the two tested extracts contained significantly different amounts of both total phenolic content and total flavonoids. According to the increase of total phenolic contents and total flavonoids obtained from the two extracts, Miswak collected from the southern region was found to contain more contents than those collected from the middle region. The results of antioxidant activities of Miswak root extract obtained by using different in vitro methods were varied depending on the technique used. According to the malondialdehyde (MDA) method, hydrogen peroxide (H2O2) scavenging ability and 1,1-diphenyl-2-picrylhydrazyl (DPPH) methods, the two Miswak extracts exhibited to have high to very high antioxidant activities. Mostly, the values of antioxidant activities of Southern region have been shown to be always the highest.

Keywords: Arak, antioxidant, medicinal plants, Saudi Arabia

Procedia PDF Downloads 522

10647 ANOVA-Based Feature Selection and Machine Learning System for IoT Anomaly Detection

Authors: Muhammad Ali

Abstract:

Cyber-attacks and anomaly detection on the Internet of Things (IoT) infrastructure is emerging concern in the domain of data-driven intrusion. Rapidly increasing IoT risk is now making headlines around the world. denial of service, malicious control, data type probing, malicious operation, DDos, scan, spying, and wrong setup are attacks and anomalies that can affect an IoT system failure. Everyone talks about cyber security, connectivity, smart devices, and real-time data extraction. IoT devices expose a wide variety of new cyber security attack vectors in network traffic. For further than IoT development, and mainly for smart and IoT applications, there is a necessity for intelligent processing and analysis of data. So, our approach is too secure. We train several machine learning models that have been compared to accurately predicting attacks and anomalies on IoT systems, considering IoT applications, with ANOVA-based feature selection with fewer prediction models to evaluate network traffic to help prevent IoT devices. The machine learning (ML) algorithms that have been used here are KNN, SVM, NB, D.T., and R.F., with the most satisfactory test accuracy with fast detection. The evaluation of ML metrics includes precision, recall, F1 score, FPR, NPV, G.M., MCC, and AUC & ROC. The Random Forest algorithm achieved the best results with less prediction time, with an accuracy of 99.98%.

Keywords: machine learning, analysis of variance, Internet of Thing, network security, intrusion detection

Procedia PDF Downloads 106

10646 FlexPoints: Efficient Algorithm for Detection of Electrocardiogram Characteristic Points

Authors: Daniel Bulanda, Janusz A. Starzyk, Adrian Horzyk

Abstract:

The electrocardiogram (ECG) is one of the most commonly used medical tests, essential for correct diagnosis and treatment of the patient. While ECG devices generate a huge amount of data, only a small part of them carries valuable medical information. To deal with this problem, many compression algorithms and filters have been developed over the past years. However, the rapid development of new machine learning techniques poses new challenges. To address this class of problems, we created the FlexPoints algorithm that searches for characteristic points on the ECG signal and ignores all other points that do not carry relevant medical information. The conducted experiments proved that the presented algorithm can significantly reduce the number of data points which represents ECG signal without losing valuable medical information. These sparse but essential characteristic points (flex points) can be a perfect input for some modern machine learning models, which works much better using flex points as an input instead of raw data or data compressed by many popular algorithms.

Keywords: characteristic points, electrocardiogram, ECG, machine learning, signal compression

Procedia PDF Downloads 151

10645 WebAppShield: An Approach Exploiting Machine Learning to Detect SQLi Attacks in an Application Layer in Run-time

Authors: Ahmed Abdulla Ashlam, Atta Badii, Frederic Stahl

Abstract:

In recent years, SQL injection attacks have been identified as being prevalent against web applications. They affect network security and user data, which leads to a considerable loss of money and data every year. This paper presents the use of classification algorithms in machine learning using a method to classify the login data filtering inputs into "SQLi" or "Non-SQLi,” thus increasing the reliability and accuracy of results in terms of deciding whether an operation is an attack or a valid operation. A method Web-App auto-generated twin data structure replication. Shielding against SQLi attacks (WebAppShield) that verifies all users and prevents attackers (SQLi attacks) from entering and or accessing the database, which the machine learning module predicts as "Non-SQLi" has been developed. A special login form has been developed with a special instance of data validation; this verification process secures the web application from its early stages. The system has been tested and validated, up to 99% of SQLi attacks have been prevented.

Keywords: SQL injection, attacks, web application, accuracy, database

Procedia PDF Downloads 132

10644 A Machine Learning-Based Approach to Capture Extreme Rainfall Events

Authors: Willy Mbenza, Sho Kenjiro

Abstract:

Increasing efforts are directed towards a better understanding and foreknowledge of extreme precipitation likelihood, given the adverse effects associated with their occurrence. This knowledge plays a crucial role in long-term planning and the formulation of effective emergency response. However, predicting extreme events reliably presents a challenge to conventional empirical/statistics due to the involvement of numerous variables spanning different time and space scales. In the recent time, Machine Learning has emerged as a promising tool for predicting the dynamics of extreme precipitation. ML techniques enables the consideration of both local and regional physical variables that have a strong influence on the likelihood of extreme precipitation. These variables encompasses factors such as air temperature, soil moisture, specific humidity, aerosol concentration, among others. In this study, we develop an ML model that incorporates both local and regional variables while establishing a robust relationship between physical variables and precipitation during the downscaling process. Furthermore, the model provides valuable information on the frequency and duration of a given intensity of precipitation.

Keywords: machine learning (ML), predictions, rainfall events, regional variables

Procedia PDF Downloads 76

10643 Use Cloud-Based Watson Deep Learning Platform to Train Models Faster and More Accurate

Authors: Susan Diamond

Abstract:

Machine Learning workloads have traditionally been run in high-performance computing (HPC) environments, where users log in to dedicated machines and utilize the attached GPUs to run training jobs on huge datasets. Training of large neural network models is very resource intensive, and even after exploiting parallelism and accelerators such as GPUs, a single training job can still take days. Consequently, the cost of hardware is a barrier to entry. Even when upfront cost is not a concern, the lead time to set up such an HPC environment takes months from acquiring hardware to set up the hardware with the right set of firmware, software installed and configured. Furthermore, scalability is hard to achieve in a rigid traditional lab environment. Therefore, it is slow to react to the dynamic change in the artificial intelligent industry. Watson Deep Learning as a service, a cloud-based deep learning platform that mitigates the long lead time and high upfront investment in hardware. It enables robust and scalable sharing of resources among the teams in an organization. It is designed for on-demand cloud environments. Providing a similar user experience in a multi-tenant cloud environment comes with its own unique challenges regarding fault tolerance, performance, and security. Watson Deep Learning as a service tackles these challenges and present a deep learning stack for the cloud environments in a secure, scalable and fault-tolerant manner. It supports a wide range of deep-learning frameworks such as Tensorflow, PyTorch, Caffe, Torch, Theano, and MXNet etc. These frameworks reduce the effort and skillset required to design, train, and use deep learning models. Deep Learning as a service is used at IBM by AI researchers in areas including machine translation, computer vision, and healthcare.

Keywords: deep learning, machine learning, cognitive computing, model training

Procedia PDF Downloads 198

10642 Tibyan Automated Arabic Correction Using Machine-Learning in Detecting Syntactical Mistakes

Authors: Ashwag O. Maghraby, Nida N. Khan, Hosnia A. Ahmed, Ghufran N. Brohi, Hind F. Assouli, Jawaher S. Melibari

Abstract:

The Arabic language is one of the most important languages. Learning it is so important for many people around the world because of its religious and economic importance and the real challenge lies in practicing it without grammatical or syntactical mistakes. This research focused on detecting and correcting the syntactic mistakes of Arabic syntax according to their position in the sentence and focused on two of the main syntactical rules in Arabic: Dual and Plural. It analyzes each sentence in the text, using Stanford CoreNLP morphological analyzer and machine-learning approach in order to detect the syntactical mistakes and then correct it. A prototype of the proposed system was implemented and evaluated. It uses support vector machine (SVM) algorithm to detect Arabic grammatical errors and correct them using the rule-based approach. The prototype system has a far accuracy 81%. In general, it shows a set of useful grammatical suggestions that the user may forget about while writing due to lack of familiarity with grammar or as a result of the speed of writing such as alerting the user when using a plural term to indicate one person.

Keywords: Arabic language acquisition and learning, natural language processing, morphological analyzer, part-of-speech

Procedia PDF Downloads 140

10641 The Abundance and Distribution of Locally Important Species Along Different Altitude: The Case of Mountain Damota, Wolaita South Ethiopia

Authors: Tamirat Solomon, Tadesse Faltamo, Belete Limani

Abstract:

This study was conducted on the mountain Damota of Wolaita to assess the abundance and spatial distribution of two locally important indigenous medicinal plants on the mountain landscape. A total of 130 plots measuring 20x20m were established along eight systematically laid transect lines. In each plot, the abundance and distribution of Hagenia abyssinica (tree) and Pentas schiperiana Vatke (shrub) were evaluated. The abundance and distribution of H. abyssinica were evaluated by measuring height and DBH for mature trees and counting seedlings and saplings, whereas the P. schiperiana Vatke was assessed for its abundance and distribution by counting in each plot. In the entire study plots, a total of 485 H. abyssinica and 760 P. schiperiana vatake were recorded. It was observed that the distribution of the species increased while the altitude increased and the highest abundance of the species was recorded at an altitude range between 2332 and 2661m.a.s.l. However, at the altitudes below 2320 m.a.s.l., the species distributions and abundance was decreased, indicating either the ecological preference of the species or the extraction of the local community surrounding the mountain influenced the species. On average, only 28 seedlings/ha of H. abyssinica and 146/ha of P. schiperiana vatke were recorded in the study areas showing the tendency of decline in the abundance and distribution of both species. Finally, we recommend management intervention for the socially important species which are under threat on the mountain landscape.

Keywords: indigenous medicinal plants, H.abyssinic, P. schiperiana, distribution, abundance, socio-economic importance

Procedia PDF Downloads 109

10640 Machine Learning Driven Analysis of Kepler Objects of Interest to Identify Exoplanets

Authors: Akshat Kumar, Vidushi

Abstract:

This paper identifies 27 KOIs, 26 of which are currently classified as candidates and one as false positives that have a high probability of being confirmed. For this purpose, 11 machine learning algorithms were implemented on the cumulative kepler dataset sourced from the NASA exoplanet archive; it was observed that the best-performing model was HistGradientBoosting and XGBoost with a test accuracy of 93.5%, and the lowest-performing model was Gaussian NB with a test accuracy of 54%, to test model performance F1, cross-validation score and RUC curve was calculated. Based on the learned models, the significant characteristics for confirm exoplanets were identified, putting emphasis on the object’s transit and stellar properties; these characteristics were namely koi_count, koi_prad, koi_period, koi_dor, koi_ror, and koi_smass, which were later considered to filter out the potential KOIs. The paper also calculates the Earth similarity index based on the planetary radius and equilibrium temperature for each KOI identified to aid in their classification.

Keywords: Kepler objects of interest, exoplanets, space exploration, machine learning, earth similarity index, transit photometry

Procedia PDF Downloads 49

10639 Risk Factors of Becoming NEET Youth in Iran: A Machine Learning Approach

Authors: Hamed Rahmani, Wim Groot

Abstract:

The term "youth not in employment, education or training (NEET)" refers to a combination of youth unemployment and school dropout. This study investigates the variables that increase the risk of becoming NEET in Iran. A selection bias-adjusted Probit model was employed using machine learning to identify these risk factors. We used cross-sectional data obtained from the Statistical Centre of Iran and the Ministry of Cooperatives Labour and Social Welfare that was taken from the labour force survey conducted in the spring of 2021. We look at years of education, work experience, housework, the number of children under the age of six in the home, family education, birthplace, and the amount of land owned by households. Results show that hours spent performing domestic chores enhance the likelihood of youth becoming NEET, and years of education and years of potential work experience decrease the chance of being NEET. The findings also show that female youth born in cities were less likely than those born in rural regions to become NEET.

Keywords: NEET youth, probit, CART, machine learning, unemployment

Procedia PDF Downloads 96

10638 Development of Computational Approach for Calculation of Hydrogen Solubility in Hydrocarbons for Treatment of Petroleum

Authors: Abdulrahman Sumayli, Saad M. AlShahrani

Abstract:

For the hydrogenation process, knowing the solubility of hydrogen (H2) in hydrocarbons is critical to improve the efficiency of the process. We investigated the H2 solubility computation in four heavy crude oil feedstocks using machine learning techniques. Temperature, pressure, and feedstock type were considered as the inputs to the models, while the hydrogen solubility was the sole response. Specifically, we employed three different models: Support Vector Regression (SVR), Gaussian process regression (GPR), and Bayesian ridge regression (BRR). To achieve the best performance, the hyper-parameters of these models are optimized using the whale optimization algorithm (WOA). We evaluated the models using a dataset of solubility measurements in various feedstocks, and we compared their performance based on several metrics. Our results show that the WOA-SVR model tuned with WOA achieves the best performance overall, with an RMSE of 1.38 × 10− 2 and an R-squared of 0.991. These findings suggest that machine learning techniques can provide accurate predictions of hydrogen solubility in different feedstocks, which could be useful in the development of hydrogen-related technologies. Besides, the solubility of hydrogen in the four heavy oil fractions is estimated in different ranges of temperatures and pressures of 150 ◦C–350 ◦C and 1.2 MPa–10.8 MPa, respectively

Keywords: temperature, pressure variations, machine learning, oil treatment

Procedia PDF Downloads 57

10637 Deep Learning based Image Classifiers for Detection of CSSVD in Cacao Plants

Authors: Atuhurra Jesse, N'guessan Yves-Roland Douha, Pabitra Lenka

Abstract:

The detection of diseases within plants has attracted a lot of attention from computer vision enthusiasts. Despite the progress made to detect diseases in many plants, there remains a research gap to train image classifiers to detect the cacao swollen shoot virus disease or CSSVD for short, pertinent to cacao plants. This gap has mainly been due to the unavailability of high quality labeled training data. Moreover, institutions have been hesitant to share their data related to CSSVD. To fill these gaps, image classifiers to detect CSSVD-infected cacao plants are presented in this study. The classifiers are based on VGG16, ResNet50 and Vision Transformer (ViT). The image classifiers are evaluated on a recently released and publicly accessible KaraAgroAI Cocoa dataset. The best performing image classifier, based on ResNet50, achieves 95.39\% precision, 93.75\% recall, 94.34\% F1-score and 94\% accuracy on only 20 epochs. There is a +9.75\% improvement in recall when compared to previous works. These results indicate that the image classifiers learn to identify cacao plants infected with CSSVD.

Keywords: CSSVD, image classification, ResNet50, vision transformer, KaraAgroAI cocoa dataset

Procedia PDF Downloads 85

10636 Development of pm2.5 Forecasting System in Seoul, South Korea Using Chemical Transport Modeling and ConvLSTM-DNN

Authors: Ji-Seok Koo, Hee‑Yong Kwon, Hui-Young Yun, Kyung-Hui Wang, Youn-Seo Koo

Abstract:

This paper presents a forecasting system for PM2.5 levels in Seoul, South Korea, leveraging a combination of chemical transport modeling and ConvLSTM-DNN machine learning technology. Exposure to PM2.5 has known detrimental impacts on public health, making its prediction crucial for establishing preventive measures. Existing forecasting models, like the Community Multiscale Air Quality (CMAQ) and Weather Research and Forecasting (WRF), are hindered by their reliance on uncertain input data, such as anthropogenic emissions and meteorological patterns, as well as certain intrinsic model limitations. The system we've developed specifically addresses these issues by integrating machine learning and using carefully selected input features that account for local and distant sources of PM2.5. In South Korea, the PM2.5 concentration is greatly influenced by both local emissions and long-range transport from China, and our model effectively captures these spatial and temporal dynamics. Our PM2.5 prediction system combines the strengths of advanced hybrid machine learning algorithms, convLSTM and DNN, to improve upon the limitations of the traditional CMAQ model. Data used in the system include forecasted information from CMAQ and WRF models, along with actual PM2.5 concentration and weather variable data from monitoring stations in China and South Korea. The system was implemented specifically for Seoul's PM2.5 forecasting.

Keywords: PM2.5 forecast, machine learning, convLSTM, DNN

Procedia PDF Downloads 44

10635 Mental Health Diagnosis through Machine Learning Approaches

Authors: Md Rafiqul Islam, Ashir Ahmed, Anwaar Ulhaq, Abu Raihan M. Kamal, Yuan Miao, Hua Wang

Abstract:

Mental health of people is equally important as of their physical health. Mental health and well-being are influenced not only by individual attributes but also by the social circumstances in which people find themselves and the environment in which they live. Like physical health, there is a number of internal and external factors such as biological, social and occupational factors that could influence the mental health of people. People living in poverty, suffering from chronic health conditions, minority groups, and those who exposed to/or displaced by war or conflict are generally more likely to develop mental health conditions. However, to authors’ best knowledge, there is dearth of knowledge on the impact of workplace (especially the highly stressed IT/Tech workplace) on the mental health of its workers. This study attempts to examine the factors influencing the mental health of tech workers. A publicly available dataset containing more than 65,000 cells and 100 attributes is examined for this purpose. Number of machine learning techniques such as ‘Decision Tree’, ‘K nearest neighbor’ ‘Support Vector Machine’ and ‘Ensemble’, are then applied to the selected dataset to draw the findings. It is anticipated that the analysis reported in this study would contribute in presenting useful insights on the attributes contributing in the mental health of tech workers using relevant machine learning techniques.

Keywords: mental disorder, diagnosis, occupational stress, IT workplace

Procedia PDF Downloads 278