Search results for: wheat yield prediction
Commenced in January 2007
Frequency: Monthly
Edition: International
Paper Count: 4870

Search results for: wheat yield prediction

3460 BiFormerDTA: Structural Embedding of Protein in Drug Target Affinity Prediction Using BiFormer

Authors: Leila Baghaarabani, Parvin Razzaghi, Mennatolla Magdy Mostafa, Ahmad Albaqsami, Al Warith Al Rushaidi, Masoud Al Rawahi

Abstract:

Predicting the interaction between drugs and their molecular targets is pivotal for advancing drug development processes. Due to the time and cost limitations, computational approaches have emerged as an effective approach to drug-target interaction (DTI) prediction. Most of the introduced computational based approaches utilize the drug molecule and protein sequence as input. This study does not only utilize these inputs, it also introduces a protein representation developed using a masked protein language model. In this representation, for every individual amino acid residue within the protein sequence, there exists a corresponding probability distribution that indicates the likelihood of each amino acid being present at that particular position. Then, the similarity between each pair of amino acids is computed to create a similarity matrix. To encode the knowledge of the similarity matrix, Bi-Level Routing Attention (BiFormer) is utilized, which combines aspects of transformer-based models with protein sequence analysis and represents a significant advancement in the field of drug-protein interaction prediction. BiFormer has the ability to pinpoint the most effective regions of the protein sequence that are responsible for facilitating interactions between the protein and drugs, thereby enhancing the understanding of these critical interactions. Thus, it appears promising in its ability to capture the local structural relationship of the proteins by enhancing the understanding of how it contributes to drugprotein interactions, thereby facilitating more accurate predictions. To evaluate the proposed method, it was tested on two widely recognized datasets: Davis and KIBA. A comprehensive series of experiments was conducted to illustrate its effectiveness in comparison to cutting edge techniques.

Keywords: BiFormer, transformer, protein language processing, self-attention mechanism, binding affinity, drug target interaction, similarity matrix, protein masked representation, protein language model

Procedia PDF Downloads 5
3459 Gasification of Groundnut Shell in an Air Bubbling Fluidized Bed Gasifier

Authors: Dharminer Singh, Sanjeev Yadav, Pravakar Mohanty

Abstract:

In this work, gasification of groundnut shell was carried out in an air bubbling fluidized bed gasifier. Atmospheric air used as gasification agent in the gasifier. The groundnut shell used for gasification was in powder form and the locally available river sand was used as bed material. Conventional charcoal was used for heating sand bed. Two cyclones were used for proper segregation of char particles and for proper cleaning and cooling the product gas. Experiments were performed on different equivalence ratio (ER) 0.3 - 0.33 by varying feeding rate 36 - 32.8 kg/h of biomass and by keeping the air flow rate constant at bed temperature between 700 °C – 800 °C. Performance of gasifier was evaluated on the basis of different parameters such as cold gas efficiency, carbon conversion efficiency (CCE), Tar and Suspended particles matter (SPM) generation, gas yield, and Higher heating value (HHV) of gas. The optimal ER value for gasification of groundnut shell (GNS) powder in an air bubbling fluidized bed gasifier was found to be 0.31. Cold gas efficiency and CCE value at optimal ER was found to be 63.7 %, and 91 %, respectively. Concentration of Tar and SPM, HHV of gas, and gas yield at optimal ER was found to be 11.88 g/Nm3, 2.38 MJ/Nm3, and 2.01m3/kg, respectively. In the product gas, concentrations of CO, CO2, CH4 and H2 were found to be 12.94%, 13.5%, 5.74% and 13.77%, respectively. At ER 0.31, it was observed that bed temperature of gasifier was in steady state for long time at 714 °C with 5 – 10 °C fluctuation.

Keywords: air bubbling fluidized bed gasifier, groundnut shell powder, equivalence ratio (ER), cold gas efficiency, carbon conversion efficiency (CCE), high heating value (HHV)

Procedia PDF Downloads 278
3458 Revolutionizing RNA Extraction: A Unified, Sustainable, and Rapid Protocol for High-Quality Isolation from Diverse Tissues

Authors: Ying Qi Chan, Chunyu Li, Xu Rou Yoyo Ma, Yaya Li, Saber Khederzadeh

Abstract:

In the ever-evolving landscape of genome extraction protocols, the existing methodologies grapple with issues ranging from sub-optimal yields and compromised quality to time-intensive procedures and reliance on hazardous reagents, often necessitating substantial tissue quantities. This predicament is particularly challenging for scientists in developing countries, where resources are limited. Our investigation presents a protocol for the efficient extraction of high-yield RNA from various tissues such as muscle, insect, and plant samples. Noteworthy for its advantages, our protocol stands out as the safest, swiftest (completed in just 38 minutes), most cost-effective (coming in at a mere US$0.017), and highly efficient method in comparison to existing protocols. Notably, our method avoids the use of hazardous or toxic chemicals such as chloroform and phenol and enzymatic agents like RNase and Proteinase K. Our RNA extraction protocol has demonstrated clear advantages over other methods, including commercial kits, in terms of yield. This nucleic acid extraction protocol is more environmentally and research-friendly, suitable for a range of tissues, even in tiny volumes, hence facilitating various genetic diagnosis and researches across the globe.

Keywords: RNA extraction, rapid protocol, universal method, diverse tissues

Procedia PDF Downloads 73
3457 Optimal Design of RC Pier Accompanied with Multi Sliding Friction Damping Mechanism Using Combination of SNOPT and ANN Method

Authors: Angga S. Fajar, Y. Takahashi, J. Kiyono, S. Sawada

Abstract:

The structural system concept of RC pier accompanied with multi sliding friction damping mechanism was developed based on numerical analysis approach. However in the implementation, to make design for such kind of this structural system consumes a lot of effort in case high of complexity. During making design, the special behaviors of this structural system should be considered including flexible small deformation, sufficient elastic deformation capacity, sufficient lateral force resistance, and sufficient energy dissipation. The confinement distribution of friction devices has significant influence to its. Optimization and prediction with multi function regression of this structural system expected capable of providing easier and simpler design method. The confinement distribution of friction devices is optimized with SNOPT in Opensees, while some design variables of the structure are predicted using multi function regression of ANN. Based on the optimization and prediction this structural system is able to be designed easily and simply.

Keywords: RC Pier, multi sliding friction device, optimal design, flexible small deformation

Procedia PDF Downloads 365
3456 Identification of Candidate Gene for Root Development and Its Association With Plant Architecture and Yield in Cassava

Authors: Abiodun Olayinka, Daniel Dzidzienyo, Pangirayi Tongoona, Samuel Offei, Edwige Gaby Nkouaya Mbanjo, Chiedozie Egesi, Ismail Yusuf Rabbi

Abstract:

Cassava (Manihot esculenta Crantz) is a major source of starch for various industrial applications. However, the traditional cultivation and harvesting methods of cassava are labour-intensive and inefficient, limiting the supply of fresh cassava roots for industrial starch production. To achieve improved productivity and quality of fresh cassava roots through mechanized cultivation, cassava cultivars with compact plant architecture and moderate plant height are needed. Plant architecture-related traits, such as plant height, harvest index, stem diameter, branching angle, and lodging tolerance, are critical for crop productivity and suitability for mechanized cultivation. However, the genetics of cassava plant architecture remain poorly understood. This study aimed to identify the genetic bases of the relationships between plant architecture traits and productivity-related traits, particularly starch content. A panel of 453 clones developed at the International Institute of Tropical Agriculture, Nigeria, was genotyped and phenotyped for 18 plant architecture and productivity-related traits at four locations in Nigeria. A genome-wide association study (GWAS) was conducted using the phenotypic data from a panel of 453 clones and 61,238 high-quality Diversity Arrays Technology sequencing (DArTseq) derived Single Nucleotide Polymorphism (SNP) markers that are evenly distributed across the cassava genome. Five significant associations between ten SNPs and three plant architecture component traits were identified through GWAS. We found five SNPs on chromosomes 6 and 16 that were significantly associated with shoot weight, harvest index, and total yield through genome-wide association mapping. We also discovered an essential candidate gene that is co-located with peak SNPs linked to these traits in M. esculenta. A review of the cassava reference genome v7.1 revealed that the SNP on chromosome 6 is in proximity to Manes.06G101600.1, a gene that regulates endodermal differentiation and root development in plants. The findings of this study provide insights into the genetic basis of plant architecture and yield in cassava. Cassava breeders could leverage this knowledge to optimize plant architecture and yield in cassava through marker-assisted selection and targeted manipulation of the candidate gene.

Keywords: manihot esculenta crantz, plant architecture, dartseq, snp markers, genome-wide association study

Procedia PDF Downloads 94
3455 Prediction of Live Birth in a Matched Cohort of Elective Single Embryo Transfers

Authors: Mohsen Bahrami, Banafsheh Nikmehr, Yueqiang Song, Anuradha Koduru, Ayse K. Vuruskan, Hongkun Lu, Tamer M. Yalcinkaya

Abstract:

In recent years, we have witnessed an explosion of studies aimed at using a combination of artificial intelligence (AI) and time-lapse imaging data on embryos to improve IVF outcomes. However, despite promising results, no study has used a matched cohort of transferred embryos which only differ in pregnancy outcome, i.e., embryos from a single clinic which are similar in parameters, such as: morphokinetic condition, patient age, and overall clinic and lab performance. Here, we used time-lapse data on embryos with known pregnancy outcomes to see if the rich spatiotemporal information embedded in this data would allow the prediction of the pregnancy outcome regardless of such critical parameters. Methodology—We did a retrospective analysis of time-lapse data from our IVF clinic utilizing Embryoscope 100% of the time for embryo culture to blastocyst stage with known clinical outcomes, including live birth vs nonpregnant (embryos with spontaneous abortion outcomes were excluded). We used time-lapse data from 200 elective single transfer embryos randomly selected from January 2019 to June 2021. Our sample included 100 embryos in each group with no significant difference in patient age (P=0.9550) and morphokinetic scores (P=0.4032). Data from all patients were combined to make a 4th order tensor, and feature extraction were subsequently carried out by a tensor decomposition methodology. The features were then used in a machine learning classifier to classify the two groups. Major Findings—The performance of the model was evaluated using 100 random subsampling cross validation (train (80%) - test (20%)). The prediction accuracy, averaged across 100 permutations, exceeded 80%. We also did a random grouping analysis, in which labels (live birth, nonpregnant) were randomly assigned to embryos, which yielded 50% accuracy. Conclusion—The high accuracy in the main analysis and the low accuracy in random grouping analysis suggest a consistent spatiotemporal pattern which is associated with pregnancy outcomes, regardless of patient age and embryo morphokinetic condition, and beyond already known parameters, such as: early cleavage or early blastulation. Despite small samples size, this ongoing analysis is the first to show the potential of AI methods in capturing the complex morphokinetic changes embedded in embryo time-lapse data, which contribute to successful pregnancy outcomes, regardless of already known parameters. The results on a larger sample size with complementary analysis on prediction of other key outcomes, such as: euploidy and aneuploidy of embryos will be presented at the meeting.

Keywords: IVF, embryo, machine learning, time-lapse imaging data

Procedia PDF Downloads 91
3454 Neural Network and Support Vector Machine for Prediction of Foot Disorders Based on Foot Analysis

Authors: Monireh Ahmadi Bani, Adel Khorramrouz, Lalenoor Morvarid, Bagheri Mahtab

Abstract:

Background:- Foot disorders are common in musculoskeletal problems. Plantar pressure distribution measurement is one the most important part of foot disorders diagnosis for quantitative analysis. However, the association of plantar pressure and foot disorders is not clear. With the growth of dataset and machine learning methods, the relationship between foot disorders and plantar pressures can be detected. Significance of the study:- The purpose of this study was to predict the probability of common foot disorders based on peak plantar pressure distribution and center of pressure during walking. Methodologies:- 2323 participants were assessed in a foot therapy clinic between 2015 and 2021. Foot disorders were diagnosed by an experienced physician and then they were asked to walk on a force plate scanner. After the data preprocessing, due to the difference in walking time and foot size, we normalized the samples based on time and foot size. Some of force plate variables were selected as input to a deep neural network (DNN), and the probability of any each foot disorder was measured. In next step, we used support vector machine (SVM) and run dataset for each foot disorder (classification of yes or no). We compared DNN and SVM for foot disorders prediction based on plantar pressure distributions and center of pressure. Findings:- The results demonstrated that the accuracy of deep learning architecture is sufficient for most clinical and research applications in the study population. In addition, the SVM approach has more accuracy for predictions, enabling applications for foot disorders diagnosis. The detection accuracy was 71% by the deep learning algorithm and 78% by the SVM algorithm. Moreover, when we worked with peak plantar pressure distribution, it was more accurate than center of pressure dataset. Conclusion:- Both algorithms- deep learning and SVM will help therapist and patients to improve the data pool and enhance foot disorders prediction with less expense and error after removing some restrictions properly.

Keywords: deep neural network, foot disorder, plantar pressure, support vector machine

Procedia PDF Downloads 353
3453 Uncertainty in Building Energy Performance Analysis at Different Stages of the Building’s Lifecycle

Authors: Elham Delzendeh, Song Wu, Mustafa Al-Adhami, Rima Alaaeddine

Abstract:

Over the last 15 years, prediction of energy consumption has become a common practice and necessity at different stages of the building’s lifecycle, particularly, at the design and post-occupancy stages for planning and maintenance purposes. This is due to the ever-growing response of governments to address sustainability and reduction of CO₂ emission in the building sector. However, there is a level of uncertainty in the estimation of energy consumption in buildings. The accuracy of energy consumption predictions is directly related to the precision of the initial inputs used in the energy assessment process. In this study, multiple cases of large non-residential buildings at design, construction, and post-occupancy stages are investigated. The energy consumption process and inputs, and the actual and predicted energy consumption of the cases are analysed. The findings of this study have pointed out and evidenced various parameters that cause uncertainty in the prediction of energy consumption in buildings such as modelling, location data, and occupant behaviour. In addition, unavailability and insufficiency of energy-consumption-related inputs at different stages of the building’s lifecycle are classified and categorized. Understanding the roots of uncertainty in building energy analysis will help energy modellers and energy simulation software developers reach more accurate energy consumption predictions in buildings.

Keywords: building lifecycle, efficiency, energy analysis, energy performance, uncertainty

Procedia PDF Downloads 136
3452 Improve Safety Performance of Un-Signalized Intersections in Oman

Authors: Siham G. Farag

Abstract:

The main objective of this paper is to provide a new methodology for road safety assessment in Oman through the development of suitable accident prediction models. GLM technique with Poisson or NBR using SAS package was carried out to develop these models. The paper utilized the accidents data of 31 un-signalized T-intersections during three years. Five goodness-of-fit measures were used to assess the overall quality of the developed models. Two types of models were developed separately; the flow-based models including only traffic exposure functions, and the full models containing both exposure functions and other significant geometry and traffic variables. The results show that, traffic exposure functions produced much better fit to the accident data. The most effective geometric variables were major-road mean speed, minor-road 85th percentile speed, major-road lane width, distance to the nearest junction, and right-turn curb radius. The developed models can be used for intersection treatment or upgrading and specify the appropriate design parameters of T- intersections. Finally, the models presented in this thesis reflect the intersection conditions in Oman and could represent the typical conditions in several countries in the middle east area, especially gulf countries.

Keywords: accidents prediction models (APMs), generalized linear model (GLM), T-intersections, Oman

Procedia PDF Downloads 269
3451 A Serum- And Feeder-Free Culture System for the Robust Generation of Human Stem Cell-Derived CD19+ B Cells and Antibody-Secreting Cells

Authors: Kirsten Wilson, Patrick M. Brauer, Sandra Babic, Diana Golubeva, Jessica Van Eyk, Tinya Wang, Avanti Karkhanis, Tim A. Le Fevre, Andy I. Kokaji, Allen C. Eaves, Sharon A. Louis, , Nooshin Tabatabaei-Zavareh

Abstract:

Long-lived plasma cells are rare, non-proliferative B cells generated from antibody-secreting cells (ASCs) following an immune response to protect the host against pathogen re-exposure. Despite their therapeutic potential, the lack of in vitro protocols in the field makes it challenging to use B cells as a cellular therapeutic tool. As a result, there is a need to establish robust and reproducible methods for the generation of B cells. To address this, we have developed a culture system for generating B cells from hematopoietic stem and/or progenitor cells (HSPCs) derived from human umbilical cord blood (CB) or pluripotent stem cells (PSCs). HSPCs isolated from CB were cultured using the StemSpan™ B Cell Generation Kit and produced CD19+ B cells at a frequency of 23.2 ± 1.5% and 59.6 ± 2.3%, with a yield of 91 ± 11 and 196 ± 37 CD19+ cells per input CD34+ cell on culture days 28 and 35, respectively (n = 50 - 59). CD19+IgM+ cells were detected at a frequency of 31.2 ± 2.6% and were produced at a yield of 113 ± 26 cells per input CD34+ cell on culture day 35 (n = 50 - 59). The B cell receptor loci of CB-derived B cells were sequenced to confirm V(D)J gene rearrangement. ELISpot analysis revealed that ASCs were generated at a frequency of 570 ± 57 per 10,000 day 35 cells, with an average IgM+ ASC yield of 16 ± 2 cells per input CD34+ cell (n = 33 - 42). PSC-derived HSPCs were generated using the STEMdiff™ Hematopoietic - EB reagents and differentiated to CD10+CD19+ B cells with a frequency of 4 ± 0.8% after 28 days of culture (n = 37, 1 embryonic and 3 induced pluripotent stem cell lines tested). Subsequent culture of PSC-derived HSPCs increased CD19+ frequency and generated ASCs from 1 - 2 iPSC lines. This method is the first report of a serum- and feeder-free system for the generation of B cells from CB and PSCs, enabling further B lineage-specific research for potential future clinical applications.

Keywords: stem cells, B cells, immunology, hematopoiesis, PSC, differentiation

Procedia PDF Downloads 57
3450 Optimizing E-commerce Retention: A Detailed Study of Machine Learning Techniques for Churn Prediction

Authors: Saurabh Kumar

Abstract:

In the fiercely competitive landscape of e-commerce, understanding and mitigating customer churn has become paramount for sustainable business growth. This paper presents a thorough investigation into the application of machine learning techniques for churn prediction in e-commerce, aiming to provide actionable insights for businesses seeking to enhance customer retention strategies. We conduct a comparative study of various machine learning algorithms, including traditional statistical methods and ensemble techniques, leveraging a rich dataset sourced from Kaggle. Through rigorous evaluation, we assess the predictive performance, interpretability, and scalability of each method, elucidating their respective strengths and limitations in capturing the intricate dynamics of customer churn. We identified the XGBoost classifier to be the best performing. Our findings not only offer practical guidelines for selecting suitable modeling approaches but also contribute to the broader understanding of customer behavior in the e-commerce domain. Ultimately, this research equips businesses with the knowledge and tools necessary to proactively identify and address churn, thereby fostering long-term customer relationships and sustaining competitive advantage.

Keywords: customer churn, e-commerce, machine learning techniques, predictive performance, sustainable business growth

Procedia PDF Downloads 26
3449 Mechanical Response of Aluminum Foam Under Biaxial Combined Quasi-Static Compression-Torsional Loads

Authors: Solomon Huluka, Akrum Abdul-Latif, Rachid Baleh

Abstract:

Metal foams have been developed intensively as a new class of materials for the last two decades due to their unique structural and multifunctional properties. The aim of this experimental work was to characterize the effect of biaxial loading complexity (combined compression-torsion) on the plastic response of highly uniform architecture open-cell aluminum foams of spherical porous with a density of 80%. For foam manufacturing, the Kelvin cells model was used to generate the generally spherical shape with a cell diameter of 11 mm. A patented rig called ACTP (Absorption par Compression-Torsion Plastique), was used to investigate the foam response under quasi-static complex loading paths having different torsional components (i.e. 0°, 45° and 60°). The key mechanical responses to be examined are yield stress, stress plateau, and energy absorption capacity. The collapse mode was also investigated. It was concluded that the higher the loading complexity, the greater the yield strength and the greater energy absorption capacity of the foam. Experimentally, it was also noticed that there were large softening effects that occurred after the first pick stress for both biaxial-45° and biaxial-60° loading.

Keywords: aluminum foam, loading complexity, characterization, biaxial loading

Procedia PDF Downloads 141
3448 Preparation and Characterization of Maltodextrin Microcapsules Containing Walnut Green Husk Extract

Authors: Fatemeh Cheraghali, Saeedeh Shojaee-Aliabadi, Seyede Marzieh Hosseini, Leila Mirmoghtadaie

Abstract:

In recent years, the field of natural antimicrobial and antioxidant compounds is one of the main research topics in the food industry. Application of agricultural residues is mainly cheap, and available resources are receiving increased attention. Walnut green husk is one of the agricultural residues that is considered as natural compounds with biological properties because of phenolic compounds. In this study, maltodextrin 10% was used for microencapsulation of walnut green husk extract. At first, the extract was examined to consider extraction yield, total phenolic compounds, and antioxidant activation. The results showed the extraction yield of 81.43%, total phenolic compounds of 3997 [mg GAE/100 g], antioxidant activity [DPPH] of 84.85% for walnut green husk extract. Antioxidant activity is about 75%-81% and by DPPH. At the next stage, microencapsulation was done by spry-drying method. The microencapsulation efficiency was 72%-79%. The results of SEM tests confirmed this microencapsulation process. In addition, microencapsulated and free extract was more effective on gram-positive bacteria’s rather than the gram-negative ones. According to the study, walnut green husk can be used as a cheap antioxidant and antimicrobial compounds due to sufficient value of phenolic compounds.

Keywords: biopolymer, microencapsulation, spray-drying, walnut green husk

Procedia PDF Downloads 159
3447 Finite Element Modeling of Global Ti-6Al-4V Mechanical Behavior in Relationship with Microstructural Parameters

Authors: Fatna Benmessaoud, Mohammed Cheikh, Vencent Velay, Vanessa Vedal, Farhad Rezai-Aria, Christine Boher

Abstract:

The global mechanical behavior of materials is strongly linked to their microstructure, especially their crystallographic texture and their grains morphology. These material aspects determine the mechanical fields character (heterogeneous or homogeneous), thus, they give to the global behavior a degree of anisotropy according the initial microstructure. For these reasons, the prediction of global behavior of materials in relationship with the microstructure must be performed with a multi-scale approach. Therefore, multi-scale modeling in the context of crystal plasticity is widely used. In this present contribution, a phenomenological elasto-viscoplastic model developed in the crystal plasticity context and finite element method are used to investigate the effects of crystallographic texture and grains sizes on global behavior of a polycrystalline equiaxed Ti-6Al-4V alloy. The constitutive equations of this model are written on local scale for each slip system within each grain while the strain and stress mechanical fields are investigated at the global scale via finite element scale transition. The beta phase of Ti-6Al-4V alloy modeled is negligible; its percent is less than 10%. Three families of slip systems of alpha phase are considered: basal and prismatic families with a burgers vector and pyramidal family with a burgers vector. The twinning mechanism of plastic strain is not observed in Ti-6Al-4V, therefore, it is not considered in the present modeling. Nine representative elementary volumes (REV) are generated with Voronoi tessellations. For each individual equiaxed grain, the own crystallographic orientation vis-à-vis the loading is taken into account. The meshing strategy is optimized in a way to eliminate the meshing effects and at the same time to allow calculating the individual grain size. The stress and strain fields are determined in each Gauss point of the mesh element. A post-treatment is used to calculate the local behavior (in each grain) and then by appropriate homogenization, the macroscopic behavior is calculated. The developed model is validated by comparing the numerical simulation results with an experimental data reported in the literature. It is observed that the present model is able to predict the global mechanical behavior of Ti-6Al-4V alloy and investigate the microstructural parameters' effects. According to the simulations performed on the generated volumes (REV), the macroscopic mechanical behavior of Ti-6Al-4V is strongly linked to the active slip systems family (prismatic, basal or pyramidal). The crystallographic texture determines which family of slip systems can be activated; therefore it gives to the plastic strain a heterogeneous character thus an anisotropic macroscopic mechanical behavior. The average grains size influences also the Ti-6Al-4V mechanical proprieties, especially the yield stress; by decreasing of the average grains size, the yield strength increases according to Hall-Petch relationship. The grains sizes' distribution gives to the strain fields considerable heterogeneity. By increasing grain sizes, the scattering in the localization of plastic strain is observed, thus, in certain areas the stress concentrations are stronger than other regions.

Keywords: microstructural parameters, multi-scale modeling, crystal plasticity, Ti-6Al-4V alloy

Procedia PDF Downloads 125
3446 Optimization of Alkali Assisted Microwave Pretreatments of Sorghum Straw for Efficient Bioethanol Production

Authors: Bahiru Tsegaye, Chandrajit Balomajumder, Partha Roy

Abstract:

The limited supply and related negative environmental consequence of fossil fuels are driving researcher for finding sustainable sources of energy. Lignocellulose biomass like sorghum straw is considered as among cheap, renewable and abundantly available sources of energy. However, lignocellulose biomass conversion to bioenergy like bioethanol is hindered due to the reluctant nature of lignin in the biomass. Therefore, removal of lignin is a vital step for lignocellulose conversion to renewable energy. The aim of this study is to optimize microwave pretreatment conditions using design expert software to remove lignin and to release maximum possible polysaccharides from sorghum straw for efficient hydrolysis and fermentation process. Sodium hydroxide concentration between 0.5-1.5%, v/v, pretreatment time from 5-25 minutes and pretreatment temperature from 120-2000C were considered to depolymerize sorghum straw. The effect of pretreatment was studied by analyzing the compositional changes before and after pretreatments following renewable energy laboratory procedure. Analysis of variance (ANOVA) was used to test the significance of the model used for optimization. About 32.8%-48.27% of hemicellulose solubilization, 53% -82.62% of cellulose release, and 49.25% to 78.29% lignin solubilization were observed during microwave pretreatment. Pretreatment for 10 minutes with alkali concentration of 1.5% and temperature of 1400C released maximum cellulose and lignin. At this optimal condition, maximum of 82.62% of cellulose release and 78.29% of lignin removal was achieved. Sorghum straw at optimal pretreatment condition was subjected to enzymatic hydrolysis and fermentation. The efficiency of hydrolysis was measured by analyzing reducing sugars by 3, 5 dinitrisylicylic acid method. Reducing sugars of about 619 mg/g of sorghum straw were obtained after enzymatic hydrolysis. This study showed a significant amount of lignin removal and cellulose release at optimal condition. This enhances the yield of reducing sugars as well as ethanol yield. The study demonstrates the potential of microwave pretreatments for enhancing bioethanol yield from sorghum straw.

Keywords: cellulose, hydrolysis, lignocellulose, optimization

Procedia PDF Downloads 269
3445 Traffic Congestions Modeling and Predictions by Social Networks

Authors: Bojan Najdenov, Danco Davcev

Abstract:

Reduction of traffic congestions and the effects of pollution and waste of resources that come with them has been a big challenge in the past decades. Having reliable systems to facilitate the process of modeling and prediction of traffic conditions would not only reduce the environmental pollution, but will also save people time and money. Social networks play big role of people’s lives nowadays providing them means of communicating and sharing thoughts and ideas, that way generating huge knowledge bases by crowdsourcing. In addition to that, crowdsourcing as a concept provides mechanisms for fast and relatively reliable data generation and also many services are being used on regular basis because they are mainly powered by the public as main content providers. In this paper we present the Social-NETS-Traffic-Control System (SNTCS) that should serve as a facilitator in the process of modeling and prediction of traffic congestions. The main contribution of our system is to integrate data from social networks as Twitter and also implements a custom created crowdsourcing subsystem with which users report traffic conditions using an android application. Our first experience of the usage of the system confirms that the integrated approach allows easy extension of the system with other social networks and represents a very useful tool for traffic control.

Keywords: traffic, congestion reduction, crowdsource, social networks, twitter, android

Procedia PDF Downloads 480
3444 Establishment and Improvement of Oil Palm Liquid Culture for Clonal Propagation

Authors: Mohd Naqiuddin Bin Husri, Siti Rahmah Abd Rahman, Dalilah Abu Bakar, Dayang Izawati Abang Masli, Meilina Ong Abdullah

Abstract:

A serious shortage of prime agricultural land coupled with environmental concerns inland expansion has daunted efforts to increase the national yield average. To address this issue, maximising yield per unit hectare through quality planting material is of great importance. Breeding for improved planting materials has been a continuous effort since the early days of this industry, it is time-consuming, and the likelihood of segregation within the progenies further impedes progress in this area. Incorporation of the cloning technology in oil palm breeding programmes is therefore advantageous to expedite the development of commercial elite and high-yielding planting materials. After more than 22 years of research and development through this project, reliable protocols for liquid/suspension culture systems coupled with various innovative technologies which are effective at promoting proliferation and growth of oil palm culture have been established. Subsequently, clonal palms derived from the suspension culture system were extensively studied in the field, and the results have been encouraging. Clones such as CPS1, CPS2 and a few others recorded superior performance in comparison with D x P standard crosses.

Keywords: tissue culture, suspension culture, oil palm, Elaeis guineensis

Procedia PDF Downloads 189
3443 Alternate Furrow Irrigation and Potassium Fertilizer on Seed Yield, Water Use Efficiency and Fatty Acids of Rapeseed

Authors: A. Bahrani

Abstract:

In order to study the effect of restricted irrigation systems and different potassium fertilizer on water use efficiency and yield of rapeseed (Brassica napus L.), an experiment was conducted in an arid area in Khuzestan, Iran in 2013. The main plots consisted of three irrigation methods: FI (full irrigation), alternate furrow irrigation (AFI) and fixed furrow irrigation (FFI). Each subplot received three rates of K fertiliser application: 0, 150 or 300 kg ha-1. The results showed that the plots receiving the full irrigation resulted in significantly higher grain yields, 1000-kernel weight and grain number per pod than both alternate treatments. However, the highest WUE were obtained in alternate furrow irrigation and 300 kg K ha-1 and the lowest one was found in the FI treatment and 0 kg K ha-1. Potassium application increased RWC in alternate furrow irrigation and fixed furrow irrigation than FI treatment. Maximum oil content was observed in those treatments where full irrigation was applied while minimum oil content was produced in FFI irrigated treatments. Potassium fertilizer also increased grain oil by 15 % than control. Deficit irrigation reduced oleic acid and erucic acid. However, oleic acid and linoleic acid increased with increasing of potassium.

Keywords: erucic acid, irrigation methods, linoleic acid, oil percent, oleic acid

Procedia PDF Downloads 280
3442 An Approach for Pattern Recognition and Prediction of Information Diffusion Model on Twitter

Authors: Amartya Hatua, Trung Nguyen, Andrew Sung

Abstract:

In this paper, we study the information diffusion process on Twitter as a multivariate time series problem. Our model concerns three measures (volume, network influence, and sentiment of tweets) based on 10 features, and we collected 27 million tweets to build our information diffusion time series dataset for analysis. Then, different time series clustering techniques with Dynamic Time Warping (DTW) distance were used to identify different patterns of information diffusion. Finally, we built the information diffusion prediction models for new hashtags which comprise two phrases: The first phrase is recognizing the pattern using k-NN with DTW distance; the second phrase is building the forecasting model using the traditional Autoregressive Integrated Moving Average (ARIMA) model and the non-linear recurrent neural network of Long Short-Term Memory (LSTM). Preliminary results of performance evaluation between different forecasting models show that LSTM with clustering information notably outperforms other models. Therefore, our approach can be applied in real-world applications to analyze and predict the information diffusion characteristics of selected topics or memes (hashtags) in Twitter.

Keywords: ARIMA, DTW, information diffusion, LSTM, RNN, time series clustering, time series forecasting, Twitter

Procedia PDF Downloads 390
3441 Spillage Prediction Using Fluid-Structure Interaction Simulation with Coupled Eulerian-Lagrangian Technique

Authors: Ravi Soni, Irfan Pathan, Manish Pande

Abstract:

The current product development process needs simultaneous consideration of different physics. The performance of the product needs to be considered under both structural and fluid loads. Examples include ducts and valves where structural behavior affects fluid motion and vice versa. Simulation of fluid-structure interaction involves modeling interaction between moving components and the fluid flow. In these scenarios, it is difficult to calculate the damping provided by fluid flow because of dynamic motions of components and the transient nature of the flow. Abaqus Explicit offers general capabilities for modeling fluid-structure interaction with the Coupled Eulerian-Lagrangian (CEL) method. The Coupled Eulerian-Lagrangian technique has been used to simulate fluid spillage through fuel valves during dynamic closure events. The technique to simulate pressure drops across Eulerian domains has been developed using stagnation pressure. Also, the fluid flow is calculated considering material flow through elements at the outlet section of the valves. The methodology has been verified on Eaton products and shows a good correlation with the test results.

Keywords: Coupled Eulerian-Lagrangian Technique, fluid structure interaction, spillage prediction, stagnation pressure

Procedia PDF Downloads 377
3440 Improvement of Egyptian Vacuum Distillates by Solvent Dewaxing

Authors: Ehssan M. R. Nassef

Abstract:

De-waxing of vacuum distillates by using solvent was investigated in the present study. The present work deals with studying solvent dewaxing system which have been developed to give better dewaxing performance with respect to the important factors in the choice of solvents which are good solubility of oil in the solvent and low solubility of wax in the solvent. In this study, solvent dewaxing process using Methyl Ethyl Ketone (MEK) and toluene are used for Egyptian vacuum distillates using two types of distillates. The effect of varying the composition of(MEK to toluene) on the percent yield of the oil, percent of wax, pour point, refractive index at 20 and 70°C, viscosity at 40 and 100°C, viscosity index and specific gravity of the oil produced for the two types of distillates (I & II) were evaluated. In the present study, the operating conditions of solvent dewaxing using MEK toluene mixture achieved the best pour point at -15°C for distillate I at (1:1) solvent composition mixture. At the same ratio of MEK to toluene the best specific gravity of oil produced changed from 0.871 to 0.8802, with refractive index of 1.84. Percent yield of 65% for oil was obtained. The results for distillate II, of higher specific gravity, are comparatively higher than those for distillate I. The effect of temperature was also investigated and the best temperature was -20°C.

Keywords: dewaxing, solvent dewaxing, pour point, lubricating oil production, wax

Procedia PDF Downloads 531
3439 A Predictive Model for Turbulence Evolution and Mixing Using Machine Learning

Authors: Yuhang Wang, Jorg Schluter, Sergiy Shelyag

Abstract:

The high cost associated with high-resolution computational fluid dynamics (CFD) is one of the main challenges that inhibit the design, development, and optimisation of new combustion systems adapted for renewable fuels. In this study, we propose a physics-guided CNN-based model to predict turbulence evolution and mixing without requiring a traditional CFD solver. The model architecture is built upon U-Net and the inception module, while a physics-guided loss function is designed by introducing two additional physical constraints to allow for the conservation of both mass and pressure over the entire predicted flow fields. Then, the model is trained on the Large Eddy Simulation (LES) results of a natural turbulent mixing layer with two different Reynolds number cases (Re = 3000 and 30000). As a result, the model prediction shows an excellent agreement with the corresponding CFD solutions in terms of both spatial distributions and temporal evolution of turbulent mixing. Such promising model prediction performance opens up the possibilities of doing accurate high-resolution manifold-based combustion simulations at a low computational cost for accelerating the iterative design process of new combustion systems.

Keywords: computational fluid dynamics, turbulence, machine learning, combustion modelling

Procedia PDF Downloads 90
3438 Effect of Pre-Aging and Aging Parameters on Mechanical Behavior of Be-Treated 7075 Aluminum Alloys: Experimental Correlation using Minitab Software

Authors: M. Tash, S. Alkahtani

Abstract:

The present study was undertaken to investigate the effect of pre-aging and aging parameters (time and temperature) on the mechanical properties of Al-Mg-Zn (7075) alloys. Ultimate tensile strength, 0.5% offset yield strength and % elongation measurements were carried out on specimens prepared from cast and heat treated 7075 alloys. Duplex aging treatments were carried out for the as solution treated (SHT) specimens (pre-aged at different time and temperature followed by high temperature aging). A statistical design of experiments (DOE) approach using fractional factorial design was applied to determine the influence of controlling variables of pre-aging and aging treatment parameters and any interactions between them on the mechanical properties of 7075 alloys. A mathematical models are developed to relate the alloy ultimate tensile strength, yield strength and % elongation with the different pre-aging and aging parameters i.e. Pre-aging Temperature (PA T0C), Pre-aging time (PA t h), Aging temperature (AT0C), Aging time (At h), to acquire an understanding of the effects of these variables and their interactions on the mechanical properties of be-treated 7075 alloys.

Keywords: aging heat Treatment, tensile properties, be-treated cast Al-Mg-Zn (7075) alloys, experimental correlation

Procedia PDF Downloads 273
3437 The Prediction of Reflection Noise and Its Reduction by Shaped Noise Barriers

Authors: I. L. Kim, J. Y. Lee, A. K. Tekile

Abstract:

In consequence of the very high urbanization rate of Korea, the number of traffic noise damages in areas congested with population and facilities is steadily increasing. The current environmental noise levels data in major cities of the country show that the noise levels exceed the standards set for both day and night times. This research was about comparative analysis in search for optimal soundproof panel shape and design factor that can minimize sound reflection noise. In addition to the normal flat-type panel shape, the reflection noise reduction of swelling-type, combined swelling and curved-type, and screen-type were evaluated. The noise source model Nord 2000, which often provides abundant information compared to models for the similar purpose, was used in the study to determine the overall noise level. Based on vehicle categorization in Korea, the noise levels for varying frequency from different heights of the sound source (directivity heights of Harmonize model) have been calculated for simulation. Each simulation has been made using the ray-tracing method. The noise level has also been calculated using the noise prediction program called SoundPlan 7.2, for comparison. The noise level prediction was made at 15m (R1), 30 m (R2) and at middle of the road, 2m (R3) receiving the point. By designing the noise barriers by shape and running the prediction program by inserting the noise source on the 2nd lane to the noise barrier side, among the 6 lanes considered, the reflection noise slightly decreased or increased in all noise barriers. At R1, especially in the cases of the screen-type noise barriers, there was no reduction effect predicted in all conditions. However, the swelling-type showed a decrease of 0.7~1.2 dB at R1, performing the best reduction effect among the tested noise barriers. Compared to other forms of noise barriers, the swelling-type was thought to be the most suitable for reducing the reflection noise; however, since a slight increase was predicted at R2, further research based on a more sophisticated categorization of related design factors is necessary. Moreover, as swellings are difficult to produce and the size of the modules are smaller than other panels, it is challenging to install swelling-type noise barriers. If these problems are solved, its applicable region will not be limited to other types of noise barriers. Hence, when a swelling-type noise barrier is installed at a downtown region where the amount of traffic is increasing every day, it will both secure visibility through the transparent walls and diminish any noise pollution due to the reflection. Moreover, when decorated with shapes and design, noise barriers will achieve a visual attraction than a flat-type one and thus will alleviate any psychological hardships related to noise, other than the unique physical soundproofing functions of the soundproof panels.

Keywords: reflection noise, shaped noise barriers, sound proof panel, traffic noise

Procedia PDF Downloads 507
3436 Using Soil Texture Field Observations as Ordinal Qualitative Variables for Digital Soil Mapping

Authors: Anne C. Richer-De-Forges, Dominique Arrouays, Songchao Chen, Mercedes Roman Dobarco

Abstract:

Most of the digital soil mapping (DSM) products rely on machine learning (ML) prediction models and/or the use or pedotransfer functions (PTF) in which calibration data come from soil analyses performed in labs. However, many other observations (often qualitative, nominal, or ordinal) could be used as proxies of lab measurements or as input data for ML of PTF predictions. DSM and ML are briefly described with some examples taken from the literature. Then, we explore the potential of an ordinal qualitative variable, i.e., the hand-feel soil texture (HFST) estimating the mineral particle distribution (PSD): % of clay (0-2µm), silt (2-50µm) and sand (50-2000µm) in 15 classes. The PSD can also be measured by lab measurements (LAST) to determine the exact proportion of these particle-sizes. However, due to cost constraints, HFST are much more numerous and spatially dense than LAST. Soil texture (ST) is a very important soil parameter to map as it is controlling many of the soil properties and functions. Therefore, comes an essential question: is it possible to use HFST as a proxy of LAST for calibration and/or validation of DSM predictions of ST? To answer this question, the first step is to compare HFST with LAST on a representative set where both information are available. This comparison was made on ca 17,400 samples representative of a French region (34,000 km2). The accuracy of HFST was assessed, and each HFST class was characterized by a probability distribution function (PDF) of its LAST values. This enables to randomly replace HFST observations by LAST values while respecting the PDF previously calculated and results in a very large increase of observations available for the calibration or validation of PTF and ML predictions. Some preliminary results are shown. First, the comparison between HFST classes and LAST analyses showed that accuracies could be considered very good when compared to other studies. The causes of some inconsistencies were explored and most of them were well explained by other soil characteristics. Then we show some examples applying these relationships and the increase of data to several issues related to DSM. The first issue is: do the PDF functions that were established enable to use HSFT class observations to improve the LAST soil texture prediction? For this objective, we replaced all HFST for topsoil by values from the PDF 100 time replicates). Results were promising for the PTF we tested (a PTF predicting soil water holding capacity). For the question related to the ML prediction of LAST soil texture on the region, we did the same kind of replacement, but we implemented a 10-fold cross-validation using points where we had LAST values. We obtained only preliminary results but they were rather promising. Then we show another example illustrating the potential of using HFST as validation data. As in numerous countries, the HFST observations are very numerous; these promising results pave the way to an important improvement of DSM products in all the countries of the world.

Keywords: digital soil mapping, improvement of digital soil mapping predictions, potential of using hand-feel soil texture, soil texture prediction

Procedia PDF Downloads 222
3435 Prediction of Coronary Artery Stenosis Severity Based on Machine Learning Algorithms

Authors: Yu-Jia Jian, Emily Chia-Yu Su, Hui-Ling Hsu, Jian-Jhih Chen

Abstract:

Coronary artery is the major supplier of myocardial blood flow. When fat and cholesterol are deposit in the coronary arterial wall, narrowing and stenosis of the artery occurs, which may lead to myocardial ischemia and eventually infarction. According to the World Health Organization (WHO), estimated 740 million people have died of coronary heart disease in 2015. According to Statistics from Ministry of Health and Welfare in Taiwan, heart disease (except for hypertensive diseases) ranked the second among the top 10 causes of death from 2013 to 2016, and it still shows a growing trend. According to American Heart Association (AHA), the risk factors for coronary heart disease including: age (> 65 years), sex (men to women with 2:1 ratio), obesity, diabetes, hypertension, hyperlipidemia, smoking, family history, lack of exercise and more. We have collected a dataset of 421 patients from a hospital located in northern Taiwan who received coronary computed tomography (CT) angiography. There were 300 males (71.26%) and 121 females (28.74%), with age ranging from 24 to 92 years, and a mean age of 56.3 years. Prior to coronary CT angiography, basic data of the patients, including age, gender, obesity index (BMI), diastolic blood pressure, systolic blood pressure, diabetes, hypertension, hyperlipidemia, smoking, family history of coronary heart disease and exercise habits, were collected and used as input variables. The output variable of the prediction module is the degree of coronary artery stenosis. The output variable of the prediction module is the narrow constriction of the coronary artery. In this study, the dataset was randomly divided into 80% as training set and 20% as test set. Four machine learning algorithms, including logistic regression, stepwise regression, neural network and decision tree, were incorporated to generate prediction results. We used area under curve (AUC) / accuracy (Acc.) to compare the four models, the best model is neural network, followed by stepwise logistic regression, decision tree, and logistic regression, with 0.68 / 79 %, 0.68 / 74%, 0.65 / 78%, and 0.65 / 74%, respectively. Sensitivity of neural network was 27.3%, specificity was 90.8%, stepwise Logistic regression sensitivity was 18.2%, specificity was 92.3%, decision tree sensitivity was 13.6%, specificity was 100%, logistic regression sensitivity was 27.3%, specificity 89.2%. From the result of this study, we hope to improve the accuracy by improving the module parameters or other methods in the future and we hope to solve the problem of low sensitivity by adjusting the imbalanced proportion of positive and negative data.

Keywords: decision support, computed tomography, coronary artery, machine learning

Procedia PDF Downloads 228
3434 On-Farm Evaluation of Fast and Slow Growing Genotypes for Organic and Pasture Poultry Production Systems

Authors: Komala Arsi, Terrel Spencer, Casey M. Owens, Dan J. Donoghue, Ann M. Donoghue

Abstract:

Organic poultry production is becoming increasingly popular in the United States with approximately 17% increase in the sales of organic meat and poultry in 2016. As per the National Organic Program (NOP), organic poultry production system should operate according to specific standards, including access to outdoors. In the United States, organic poultry farmers are raising both fast growing and slow growing genotypes for alternative productive systems. Even though heritage breed birds grow much slower compared to commercial breeds, many free range producers believe that they are better suited for outdoor production systems. We conducted an on-farm trial on a working pasture poultry farm to compare the performance and meat quality characteristics of a slow-growing heritage breed (Freedom Rangers, FR), and two commonly used fast growing types of chickens (Cornish cross, CC and Naked Neck, NN), raised on pasture, in side by side pens segregated by breed (n=70/breed). CC and NN group birds were reared for eight weeks whereas FR group birds were reared for 10 weeks and all the birds were commercially processed. By the end of the rearing period, the final body weight of FR group birds was significantly lower than both the fast growing genotypes (CC and NN). Both CC and NN birds showed significantly higher live weight, carcass weight as well as fillet, tender and leg yield (P < 0.05). There was no difference in the wing and rack yield among the different groups. Color of the meat was measured using CEILAB method and expressed as lightness (L), redness (a*) and yellowness (b*). The breast meat from FR birds was much redder (higher a* values) and less yellow (lesser b* values) compared to both the fast growing type of chickens (P < 0.05). Overall, fast growing genotypes produced higher carcass weight and meat yield compared to slow growing genotypes and appear to be an economical option for alternative production systems.

Keywords: fast growing chickens, meat quality, pasture, slow growing chickens

Procedia PDF Downloads 385
3433 Machine Learning Approaches to Water Usage Prediction in Kocaeli: A Comparative Study

Authors: Kasim Görenekli, Ali Gülbağ

Abstract:

This study presents a comprehensive analysis of water consumption patterns in Kocaeli province, Turkey, utilizing various machine learning approaches. We analyzed data from 5,000 water subscribers across residential, commercial, and official categories over an 80-month period from January 2016 to August 2022, resulting in a total of 400,000 records. The dataset encompasses water consumption records, weather information, weekends and holidays, previous months' consumption, and the influence of the COVID-19 pandemic.We implemented and compared several machine learning models, including Linear Regression, Random Forest, Support Vector Regression (SVR), XGBoost, Artificial Neural Networks (ANN), Long Short-Term Memory (LSTM), and Gated Recurrent Units (GRU). Particle Swarm Optimization (PSO) was applied to optimize hyperparameters for all models.Our results demonstrate varying performance across subscriber types and models. For official subscribers, Random Forest achieved the highest R² of 0.699 with PSO optimization. For commercial subscribers, Linear Regression performed best with an R² of 0.730 with PSO. Residential water usage proved more challenging to predict, with XGBoost achieving the highest R² of 0.572 with PSO.The study identified key factors influencing water consumption, with previous months' consumption, meter diameter, and weather conditions being among the most significant predictors. The impact of the COVID-19 pandemic on consumption patterns was also observed, particularly in residential usage.This research provides valuable insights for effective water resource management in Kocaeli and similar regions, considering Turkey's high water loss rate and below-average per capita water supply. The comparative analysis of different machine learning approaches offers a comprehensive framework for selecting appropriate models for water consumption prediction in urban settings.

Keywords: mMachine learning, water consumption prediction, particle swarm optimization, COVID-19, water resource management

Procedia PDF Downloads 14
3432 In-situ Fabrication of a Metal-Intermetallic Composite: Microstructure Evolution and Mechanical Response

Authors: Monireh Azimi, Mohammad Reza Toroghinejad, Leo A. I. Kestens

Abstract:

The role of different metallic and intermetallic reinforcements on the microstructure and the associated mechanical response of a composite is of crucial importance. To investigate this issue, a multiphase metal-intermetallic composite was in-situ fabricated through reactive annealing and accumulative roll bonding (ARB) processes. EBSD results indicated that the lamellar grain structure of the Al matrix after the first cycle has evolved with increasing strain to a mixed structure consisting of equiaxed and lamellar grains, whereby the steady-state did not occur after the 3rd (last) cycle—applying a strain of 6.1 in the Al phase, the length and thickness of the grains reduced by 92.2% and 97.3%, respectively, compared to the annealed state. Intermetallic phases together with the metallic reinforcement of Ni influence grain fragmentation of the Al matrix and give rise to a specific texture evolution by creating heterogeneity in the strain and flow patterns. Mechanical properties of the multiphase composite demonstrated the yield and ultimate tensile strengths of 217.9 MPa and 340.1 MPa, respectively, compared to 48.7 MPa and 55.4 MPa in the metal-intermetallic laminated (MIL) sandwich before applying the ARB process, which corresponds to an increase of 347% and 514% of yield and tensile strength, respectively.

Keywords: accumulative roll bonding, mechanical properties, metal-intermetallic composite, severe plastic deformation, texture

Procedia PDF Downloads 194
3431 Stock Prediction and Portfolio Optimization Thesis

Authors: Deniz Peksen

Abstract:

This thesis aims to predict trend movement of closing price of stock and to maximize portfolio by utilizing the predictions. In this context, the study aims to define a stock portfolio strategy from models created by using Logistic Regression, Gradient Boosting and Random Forest. Recently, predicting the trend of stock price has gained a significance role in making buy and sell decisions and generating returns with investment strategies formed by machine learning basis decisions. There are plenty of studies in the literature on the prediction of stock prices in capital markets using machine learning methods but most of them focus on closing prices instead of the direction of price trend. Our study differs from literature in terms of target definition. Ours is a classification problem which is focusing on the market trend in next 20 trading days. To predict trend direction, fourteen years of data were used for training. Following three years were used for validation. Finally, last three years were used for testing. Training data are between 2002-06-18 and 2016-12-30 Validation data are between 2017-01-02 and 2019-12-31 Testing data are between 2020-01-02 and 2022-03-17 We determine Hold Stock Portfolio, Best Stock Portfolio and USD-TRY Exchange rate as benchmarks which we should outperform. We compared our machine learning basis portfolio return on test data with return of Hold Stock Portfolio, Best Stock Portfolio and USD-TRY Exchange rate. We assessed our model performance with the help of roc-auc score and lift charts. We use logistic regression, Gradient Boosting and Random Forest with grid search approach to fine-tune hyper-parameters. As a result of the empirical study, the existence of uptrend and downtrend of five stocks could not be predicted by the models. When we use these predictions to define buy and sell decisions in order to generate model-based-portfolio, model-based-portfolio fails in test dataset. It was found that Model-based buy and sell decisions generated a stock portfolio strategy whose returns can not outperform non-model portfolio strategies on test dataset. We found that any effort for predicting the trend which is formulated on stock price is a challenge. We found same results as Random Walk Theory claims which says that stock price or price changes are unpredictable. Our model iterations failed on test dataset. Although, we built up several good models on validation dataset, we failed on test dataset. We implemented Random Forest, Gradient Boosting and Logistic Regression. We discovered that complex models did not provide advantage or additional performance while comparing them with Logistic Regression. More complexity did not lead us to reach better performance. Using a complex model is not an answer to figure out the stock-related prediction problem. Our approach was to predict the trend instead of the price. This approach converted our problem into classification. However, this label approach does not lead us to solve the stock prediction problem and deny or refute the accuracy of the Random Walk Theory for the stock price.

Keywords: stock prediction, portfolio optimization, data science, machine learning

Procedia PDF Downloads 80