Search results for: multivariate regression tree
3832 Recommender Systems Using Ensemble Techniques
Authors: Yeonjeong Lee, Kyoung-jae Kim, Youngtae Kim
Abstract:
This study proposes a novel recommender system that uses data mining and multi-model ensemble techniques to enhance the recommendation performance through reflecting the precise user’s preference. The proposed model consists of two steps. In the first step, this study uses logistic regression, decision trees, and artificial neural networks to predict customers who have high likelihood to purchase products in each product group. Then, this study combines the results of each predictor using the multi-model ensemble techniques such as bagging and bumping. In the second step, this study uses the market basket analysis to extract association rules for co-purchased products. Finally, the system selects customers who have high likelihood to purchase products in each product group and recommends proper products from same or different product groups to them through above two steps. We test the usability of the proposed system by using prototype and real-world transaction and profile data. In addition, we survey about user satisfaction for the recommended product list from the proposed system and the randomly selected product lists. The results also show that the proposed system may be useful in real-world online shopping store.Keywords: product recommender system, ensemble technique, association rules, decision tree, artificial neural networks
Procedia PDF Downloads 2953831 Using Monte Carlo Model for Simulation of Rented Housing in Mashhad, Iran
Authors: Mohammad Rahim Rahnama
Abstract:
The study employs Monte Carlo method for simulation of rented housing in Mashhad second largest city in Iran. A total number of 334 rental residential units in Mashhad, including both apartments and houses (villa), were randomly selected from advertisements placed in Khorasan Newspapers during the months of July and August of 2015. In order to simulate the monthly rent price, the rent index was calculated through combining the mortgage and the rent price. In the next step, the relation between the variables of the floor area and that of the number of bedrooms for each unit, in both apartments and houses(villa), was calculated through multivariate regression using SPSS and was coded in XML. The initial model was called using simulation button in SPSS and was simulated using triangular and binominal algorithms. The findings revealed that the average simulated rental index was 548.5$ per month. Calculating the sensitivity of rental index to a number of bedrooms we found that firstly, 97% of units have three bedrooms, and secondly as the number of bedrooms increases from one to three, for the rent price of less than 200$, the percentage of units having one bedroom decreases from 10% to 0. Contrariwise, for units with the rent price of more than 571.4$, the percentage of bedrooms increases from 37% to 48%. In the light of these findings, it becomes clear that planning to build rental residential units, overseeing the rent prices, and granting subsidies to rental residential units, for apartments with two bedrooms, present a felicitous policy for regulating residential units in Mashhad.Keywords: Mashhad, Monte Carlo, simulation, rent price, residential unit
Procedia PDF Downloads 2773830 Prognostic and Predictive Value of Tumor: Infiltrating Lymphocytes in Triple Negative Breast Cancer
Authors: Wooseok Byon, Eunyoung Kim, Junseong Kwon, Byung Joo Song, Chan Heun Park
Abstract:
Background/Purpose: Previous preclinical and clinical data suggest that increased lymphocytic infiltration would be associated with good prognosis and benefit from immunogenic chemotherapy especially in triple-negative breast cancer (TNBC). We investigated a single-center experience of TNBC and relationship with lymphocytic infiltration. Methods: From January 2004 to December 2012, at the Department of Surgery, Kangbuk Samsung Hospital, Sungkyunkwan University, School of Medicine, we retrospectively reviewed 897 breast cancer patients-clinical outcomes, clinicopathological characteristics, breast cancer subtypes. And we reviewed lymphocytic infiltration of TNBC specimens by two pathologists. Statistical analysis of risk factors associated with recurrence was performed. Results: A total of 897 patients, 76 were TNBC (8.47%). Mean age of TNBC patients were 50.95 (SD10.42) years, mean follow-up periods was 40.06 months. We reviewed 49 slides, and there were 8 recurrent breast cancer patients (16.32%), and 4 patients were expired (8.16%). There were 9 lymphocytic predominant breast cancers (LPBC)-carcinomas with either intratumoral lymphocytes in >60% of tumor cell nests. 1 patient of LPBC was recurred and 8 were not. In multivariate logistic regression, the odds ratio of lymphocytic infiltration was 0.59 (p=0.643). Conclusion: In a single-center experience of TNBC, the lymphocytic infiltration in tumor cell nest might be a good trend on the prognosis but there was not statistically significant.Keywords: tumor-infiltrating lymphocytes, triple negative breast cancer, medical and health sciences
Procedia PDF Downloads 4083829 Conservation Studies on Endangered and Potential Native Ornamentals and Their Domestication for Novelty in Floriculture Industry
Authors: Puja Sharma, S. R. Dhiman, Bhararti Kashyap, Y. C. Gupta, Shabnam Pangtu
Abstract:
The experiments were carried out for mass multiplication and domestication of an endangered native tree spp, an orchid and an ornamental shrub having high medicinal value. Floriculture industry is novelty driven, hence the potential of these native ornamentals was assessed for their utilization as a novelty in the industry. For the mass propagation of endangered tree Oroxylum indicum, seed propagation and vegetative propagation techniques were successfully utilized. Highest seed germination was recorded in a medium containing cocopeat and perlite (1:1 v/v). Semi hard wood cuttings treated with IBA 2000 ppm planted in cocopeat+ sand+ perlite medium and maintained at 80% RH has resulted in about 90% rooting. The low growing tree was successfully domestication and has potential to be utilized in landscape industry. In the present study, cutting propagation and division of clump were used as methods for multiplication of Aerides multiflora, a native orchid spp. Soft wood cuttings treated with IBA 500 ppm planted in cocopeat medium was found to be the most suitable vegetative method resulting in 90 % rooting. It was domesticated as pot plant and for making hanging baskets. Propagation through seeds and cuttings was carried out for Pyracantha crenulata, a native ornamental shrub which is a cardiovascular medicine. For vegetative propagation, treatment of basal end of semi- hardwood cuttings of Pyracantha with IBA 3000 ppm (quick dip) and planting in cocopeat under mist chamber maintained at a relative humidity of 70-80% resulted in about 90% rooting out of all applied treatments in the study. For seed propagation, treatment of seeds in boiling water for 20 minutes and planting in cocopeat resulted in 82.55 % germination. The shrub was domesticated for its use as pot plant, protective hedge and for making bonsai.Keywords: native, endangered, multiplication, domestication, oroxylum, aerides, pyracantha
Procedia PDF Downloads 813828 Effects of Adding Condensed Tannin from Shrub and Tree Leaves in Concentrate on Sheep Production Fed on Elephant Grass as a Basal Diet
Authors: Kusmartono, Siti Chuzaemi, Hartutik dan Mashudi
Abstract:
Two studies were conducted involving an in vitro (Expt 1) and in vivo (Expt 2) measurements. Expt 1. aimed to evaluate effects of adding CT extracts on gas production and efficiency of microbial protein synthesis (EMPS), Expt 2 aimed to evaluate effects of supplementing shrub/tree leaves as CT source on feed consumption, digestibility, N retention, body weight gain and dressing percentage of growing sheep fed on elephant grass (EG) as a basal diet.Ten shrub and tree leaves used as CT sources were wild sunflower (Tithonia diversifolia), mulberry (Morus macroura), cassava (Manihot utilissima), avicienna (Avicennia marina), calliandra (Calliandra calothyrsus), sesbania (Sesbania grandiflora), acacia (acacia vilosa), glyricidia (Glyricidia sepium), jackfruit (Artocarpus heterophyllus), moringa (Moringa oleifera). The treatments applied in Expt 1 were: T1=Elephant grass (60%)+concentrate (40%); T2 = T1 + CT (3% DM); T3= T2 + PEG; T4 = T1 + CT (3.5% DM); T5 = T4 + PEG; T6 = T1 + CT (4% DM) and T7 = T6 + PEG. Data obtained were analysed using Randomized Block Design. Statistical analyses showed that treatments significanty affected (P<0.05) total gas production and EMPS. The lowest values of total gas production (45.9 ml/500 mg DM) and highest value of EMPS (64.6 g/kg BOTR) were observed in the treatment T4 (3.5% CT from cassava leave extract). Based on this result it was concluded that this treatment was the best and was chosen for further investigation using in vivo method. The treatmets applied for in vivo trial were: T1 = EG (60%) + concentrate (40%); T2 = T1 + dried cassava leave (equivalent to 3.5% CT); T3 = T2 + PEG. 18 growing sheep aging of 8-9 months and weighing of 23.67kg ± 1.23 were used in Expt 2. Results of in vivo study showed that treatments significanty affected (P<0.05) nutrients intake and digestibility (DM, OM and CP). N retention for sheep receiving treatment T2 were significantly higher (P<0.05; 15.6 g/d) than T1 (9.1 g/d) and T3 (8.53 g/d). Similar results were obtained for daily weight gain where T2 were the highest (62.79 g/d), followed by T1 (51.9 g/d) and T3 (52.85 g/d). Dressing percentage of T2 was the highest (51.54%) followed by T1 (49.61%) and T3 (49.32%). It can be concluded that adding adding dried cassava leaves did not reduce palatability due to CT, but rather increased OM digestibility and hence feed consumption was improved. N retention was increased due to the action of CT in the cassava leaves and this may have explained a higher input of N into duodenum which was further led to higer daily weight gain and dressing percentage.Keywords: in vitro gas production, sheep, shrub and tree leaves, condensed tannin
Procedia PDF Downloads 2663827 Influence of Maturation Degree of Arbutus (Arbutus unedo L.) Fruits in Spirit Composition and Quality
Authors: Goreti Botelho, Filomena Gomes, Fernanda M. Ferreira, Ilda Caldeira
Abstract:
The strawberry tree (Arbutus unedo L.) is a small tree or shrub from botanical Ericaceae family that grows spontaneously nearby the Mediterranean basin and produce edible red fruits. A traditional processed fruit application, in Mediterranean countries, is the production of a spirit (known as aguardente de medronho, in Portugal) obtained from the fermented fruit. The main objective of our study was to contribute to the knowledge about the influence of the degree of maturation of fruits in the volatile composition and quality of arbutus spirit. The major volatiles in the three distillates fractions (head, heart and tail) obtained from fermentation of two different fruit maturation levels were quantified by GC-FID analysis and ANOVA one-way was performed. Additionally, the total antioxidant capacity and total phenolic compounds of both arbutus fruit spirits were determined, by ABTS and Folin-Ciocalteau method, respectively. The methanol concentration is superior (1022.39 g/hL a.a.) in the spirit made from fruits with highest total soluble solids, which is a value above the legal limit (1000 g/hL a.a.). Overall, our study emphasizes, for the first time, the influence of maturation degree of arbutus fruits in the spirit volatile composition and quality.Keywords: arbutus fruit, maturation, quality, spirit
Procedia PDF Downloads 3823826 Ownership, Management Responsibility and Corporate Performance of the Listed Firms in Kazakhstan
Authors: Gulnara Moldasheva
Abstract:
The research explores the relationship between management responsibility and corporate governance of listed companies in Kazakhstan. This research employs firm level data of randomly selected listed non-financial firms and firm level data “operational” financial sector, consisted from banking sector, insurance companies and accumulated pension funds using multivariate regression analysis under fixed effect model approach. Ownership structure includes institutional ownership, managerial ownership and private investor’s ownership. Management responsibility of the firm is expressed by the decision of the firm on amount of leverage. Results of the cross sectional panel study for non-financial firms showed that only institutional shareholding is significantly negatively correlated with debt to equity ratio. Findings from “operational” financial sector show that leverage is significantly affected only by the CEO/Chair duality and the size of financial institutions, and insignificantly affected by ownership structure. Also, the findings show, that there is a significant negative relationship between profitability and the debt to equity ratio for non-financial firms, which is consistent with pecking order theory. Generally, the found results suggest that corporate governance and a management responsibility play important role in corporate performance of listed firms in Kazakhstan.Keywords: ownership, corporate governance, debt to equity ratio, corporate performance
Procedia PDF Downloads 3443825 Development of Computational Approach for Calculation of Hydrogen Solubility in Hydrocarbons for Treatment of Petroleum
Authors: Abdulrahman Sumayli, Saad M. AlShahrani
Abstract:
For the hydrogenation process, knowing the solubility of hydrogen (H2) in hydrocarbons is critical to improve the efficiency of the process. We investigated the H2 solubility computation in four heavy crude oil feedstocks using machine learning techniques. Temperature, pressure, and feedstock type were considered as the inputs to the models, while the hydrogen solubility was the sole response. Specifically, we employed three different models: Support Vector Regression (SVR), Gaussian process regression (GPR), and Bayesian ridge regression (BRR). To achieve the best performance, the hyper-parameters of these models are optimized using the whale optimization algorithm (WOA). We evaluated the models using a dataset of solubility measurements in various feedstocks, and we compared their performance based on several metrics. Our results show that the WOA-SVR model tuned with WOA achieves the best performance overall, with an RMSE of 1.38 × 10− 2 and an R-squared of 0.991. These findings suggest that machine learning techniques can provide accurate predictions of hydrogen solubility in different feedstocks, which could be useful in the development of hydrogen-related technologies. Besides, the solubility of hydrogen in the four heavy oil fractions is estimated in different ranges of temperatures and pressures of 150 ◦C–350 ◦C and 1.2 MPa–10.8 MPa, respectivelyKeywords: temperature, pressure variations, machine learning, oil treatment
Procedia PDF Downloads 693824 Re-Stating the Origin of Tetrapod Using Measures of Phylogenetic Support for Phylogenomic Data
Authors: Yunfeng Shan, Xiaoliang Wang, Youjun Zhou
Abstract:
Whole-genome data from two lungfish species, along with other species, present a valuable opportunity to re-investigate the longstanding debate regarding the evolutionary relationships among tetrapods, lungfishes, and coelacanths. However, the use of bootstrap support has become outdated for large-scale phylogenomic data. Without robust phylogenetic support, the phylogenetic trees become meaningless. Therefore, it is necessary to re-evaluate the phylogenies of tetrapods, lungfishes, and coelacanths using novel measures of phylogenetic support specifically designed for phylogenomic data, as the previous phylogenies were based on 100% bootstrap support. Our findings consistently provide strong evidence favoring lungfish as the closest living relative of tetrapods. This conclusion is based on high internode certainty, relative gene support, and high gene concordance factor. The evidence stems from five previous datasets derived from lungfish transcriptomes. These results yield fresh insights into the three hypotheses regarding the phylogenies of tetrapods, lungfishes, and coelacanths. Importantly, these hypotheses are not mere conjectures but are substantiated by a significant number of genes. Analyzing real biological data further demonstrates that the inclusion of additional taxa leads to more diverse tree topologies. Consequently, gene trees and species trees may not be identical even when whole-genome sequencing data is utilized. However, it is worth noting that many gene trees can accurately reflect the species tree if an appropriate number of taxa, typically ranging from six to ten, are sampled. Therefore, it is crucial to carefully select the number of taxa and an appropriate outgroup, such as slow-evolving species, while excluding fast-evolving taxa as outgroups to mitigate the adverse effects of long-branch attraction and achieve an accurate reconstruction of the species tree. This is particularly important as more whole-genome sequencing data becomes available.Keywords: novel measures of phylogenetic support for phylogenomic data, gene concordance factor confidence, relative gene support, internode certainty, origin of tetrapods
Procedia PDF Downloads 603823 Detection and Identification of Antibiotic Resistant Bacteria Using Infra-Red-Microscopy and Advanced Multivariate Analysis
Authors: Uraib Sharaha, Ahmad Salman, Eladio Rodriguez-Diaz, Elad Shufan, Klaris Riesenberg, Irving J. Bigio, Mahmoud Huleihel
Abstract:
Antimicrobial drugs have an important role in controlling illness associated with infectious diseases in animals and humans. However, the increasing resistance of bacteria to a broad spectrum of commonly used antibiotics has become a global health-care problem. Rapid determination of antimicrobial susceptibility of a clinical isolate is often crucial for the optimal antimicrobial therapy of infected patients and in many cases can save lives. The conventional methods for susceptibility testing like disk diffusion are time-consuming and other method including E-test, genotyping are relatively expensive. Fourier transform infrared (FTIR) microscopy is rapid, safe, and low cost method that was widely and successfully used in different studies for the identification of various biological samples including bacteria. The new modern infrared (IR) spectrometers with high spectral resolution enable measuring unprecedented biochemical information from cells at the molecular level. Moreover, the development of new bioinformatics analyses combined with IR spectroscopy becomes a powerful technique, which enables the detection of structural changes associated with resistivity. The main goal of this study is to evaluate the potential of the FTIR microscopy in tandem with machine learning algorithms for rapid and reliable identification of bacterial susceptibility to antibiotics in time span of few minutes. The bacterial samples, which were identified at the species level by MALDI-TOF and examined for their susceptibility by the routine assay (micro-diffusion discs), are obtained from the bacteriology laboratories in Soroka University Medical Center (SUMC). These samples were examined by FTIR microscopy and analyzed by advanced statistical methods. Our results, based on 550 E.coli samples, were promising and showed that by using infrared spectroscopic technique together with multivariate analysis, it is possible to classify the tested bacteria into sensitive and resistant with success rate higher than 85% for eight different antibiotics. Based on these preliminary results, it is worthwhile to continue developing the FTIR microscopy technique as a rapid and reliable method for identification antibiotic susceptibility.Keywords: antibiotics, E. coli, FTIR, multivariate analysis, susceptibility
Procedia PDF Downloads 2663822 Representativity Based Wasserstein Active Regression
Authors: Benjamin Bobbia, Matthias Picard
Abstract:
In recent years active learning methodologies based on the representativity of the data seems more promising to limit overfitting. The presented query methodology for regression using the Wasserstein distance measuring the representativity of our labelled dataset compared to the global distribution. In this work a crucial use of GroupSort Neural Networks is made therewith to draw a double advantage. The Wasserstein distance can be exactly expressed in terms of such neural networks. Moreover, one can provide explicit bounds for their size and depth together with rates of convergence. However, heterogeneity of the dataset is also considered by weighting the Wasserstein distance with the error of approximation at the previous step of active learning. Such an approach leads to a reduction of overfitting and high prediction performance after few steps of query. After having detailed the methodology and algorithm, an empirical study is presented in order to investigate the range of our hyperparameters. The performances of this method are compared, in terms of numbers of query needed, with other classical and recent query methods on several UCI datasets.Keywords: active learning, Lipschitz regularization, neural networks, optimal transport, regression
Procedia PDF Downloads 813821 A Machine Learning Approach for Earthquake Prediction in Various Zones Based on Solar Activity
Authors: Viacheslav Shkuratskyy, Aminu Bello Usman, Michael O’Dea, Saifur Rahman Sabuj
Abstract:
This paper examines relationships between solar activity and earthquakes; it applied machine learning techniques: K-nearest neighbour, support vector regression, random forest regression, and long short-term memory network. Data from the SILSO World Data Center, the NOAA National Center, the GOES satellite, NASA OMNIWeb, and the United States Geological Survey were used for the experiment. The 23rd and 24th solar cycles, daily sunspot number, solar wind velocity, proton density, and proton temperature were all included in the dataset. The study also examined sunspots, solar wind, and solar flares, which all reflect solar activity and earthquake frequency distribution by magnitude and depth. The findings showed that the long short-term memory network model predicts earthquakes more correctly than the other models applied in the study, and solar activity is more likely to affect earthquakes of lower magnitude and shallow depth than earthquakes of magnitude 5.5 or larger with intermediate depth and deep depth.Keywords: k-nearest neighbour, support vector regression, random forest regression, long short-term memory network, earthquakes, solar activity, sunspot number, solar wind, solar flares
Procedia PDF Downloads 743820 A Hybrid Fuzzy Clustering Approach for Fertile and Unfertile Analysis
Authors: Shima Soltanzadeh, Mohammad Hosain Fazel Zarandi, Mojtaba Barzegar Astanjin
Abstract:
Diagnosis of male infertility by the laboratory tests is expensive and, sometimes it is intolerable for patients. Filling out the questionnaire and then using classification method can be the first step in decision-making process, so only in the cases with a high probability of infertility we can use the laboratory tests. In this paper, we evaluated the performance of four classification methods including naive Bayesian, neural network, logistic regression and fuzzy c-means clustering as a classification, in the diagnosis of male infertility due to environmental factors. Since the data are unbalanced, the ROC curves are most suitable method for the comparison. In this paper, we also have selected the more important features using a filtering method and examined the impact of this feature reduction on the performance of each methods; generally, most of the methods had better performance after applying the filter. We have showed that using fuzzy c-means clustering as a classification has a good performance according to the ROC curves and its performance is comparable to other classification methods like logistic regression.Keywords: classification, fuzzy c-means, logistic regression, Naive Bayesian, neural network, ROC curve
Procedia PDF Downloads 3403819 Short Answer Grading Using Multi-Context Features
Authors: S. Sharan Sundar, Nithish B. Moudhgalya, Nidhi Bhandari, Vineeth Vijayaraghavan
Abstract:
Automatic Short Answer Grading is one of the prime applications of artificial intelligence in education. Several approaches involving the utilization of selective handcrafted features, graphical matching techniques, concept identification and mapping, complex deep frameworks, sentence embeddings, etc. have been explored over the years. However, keeping in mind the real-world application of the task, these solutions present a slight overhead in terms of computations and resources in achieving high performances. In this work, a simple and effective solution making use of elemental features based on statistical, linguistic properties, and word-based similarity measures in conjunction with tree-based classifiers and regressors is proposed. The results for classification tasks show improvements ranging from 1%-30%, while the regression task shows a stark improvement of 35%. The authors attribute these improvements to the addition of multiple similarity scores to provide ensemble of scoring criteria to the models. The authors also believe the work could reinstate that classical natural language processing techniques and simple machine learning models can be used to achieve high results for short answer grading.Keywords: artificial intelligence, intelligent systems, natural language processing, text mining
Procedia PDF Downloads 1333818 Sensitivity Based Robust Optimization Using 9 Level Orthogonal Array and Stepwise Regression
Authors: K. K. Lee, H. W. Han, H. L. Kang, T. A. Kim, S. H. Han
Abstract:
For the robust optimization of the manufacturing product design, there are design objectives that must be achieved, such as a minimization of the mean and standard deviation in objective functions within the required sensitivity constraints. The authors utilized the sensitivity of objective functions and constraints with respect to the effective design variables to reduce the computational burden associated with the evaluation of the probabilities. The individual mean and sensitivity values could be estimated easily by using the 9 level orthogonal array based response surface models optimized by the stepwise regression. The present study evaluates a proposed procedure from the robust optimization of rubber domes that are commonly used for keyboard switching, by using the 9 level orthogonal array and stepwise regression along with a desirability function. In addition, a new robust optimization process, i.e., the I2GEO (Identify, Integrate, Generate, Explore and Optimize), was proposed on the basis of the robust optimization in rubber domes. The optimized results from the response surface models and the estimated results by using the finite element analysis were consistent within a small margin of error. The standard deviation of objective function is decreasing 54.17% with suggested sensitivity based robust optimization. (Business for Cooperative R&D between Industry, Academy, and Research Institute funded Korea Small and Medium Business Administration in 2017, S2455569)Keywords: objective function, orthogonal array, response surface model, robust optimization, stepwise regression
Procedia PDF Downloads 2883817 Linear Regression Estimation of Tactile Comfort for Denim Fabrics Based on In-Plane Shear Behavior
Authors: Nazli Uren, Ayse Okur
Abstract:
Tactile comfort of a textile product is an essential property and a major concern when it comes to customer perceptions and preferences. The subjective nature of comfort and the difficulties regarding the simulation of human hand sensory feelings make it hard to establish a well-accepted link between tactile comfort and objective evaluations. On the other hand, shear behavior of a fabric is a mechanical parameter which can be measured by various objective test methods. The principal aim of this study is to determine the tactile comfort of commercially available denim fabrics by subjective measurements, create a tactile score database for denim fabrics and investigate the relations between tactile comfort and shear behavior. In-plane shear behaviors of 17 different commercially available denim fabrics with a variety of raw material and weave structure were measured by a custom design shear frame and conventional bias extension method in two corresponding diagonal directions. Tactile comfort of denim fabrics was determined via subjective customer evaluations as well. Aforesaid relations were statistically investigated and introduced as regression equations. The analyses regarding the relations between tactile comfort and shear behavior showed that there are considerably high correlation coefficients. The suggested regression equations were likewise found out to be statistically significant. Accordingly, it was concluded that the tactile comfort of denim fabrics can be estimated with a high precision, based on the results of in-plane shear behavior measurements.Keywords: denim fabrics, in-plane shear behavior, linear regression estimation, tactile comfort
Procedia PDF Downloads 3033816 Survival Outcomes Related to Treatment Modalities in Patients with Oropharyngeal Squamous Cell Carcinoma
Authors: Danni Cheng
Abstract:
Purpose:Surgicallyinclusive treatment(SIT)isthemajor treatment fororopharyngealsquamouscellcarcinoma (OPSCC) in Eastern countries, while nonsurgical treatments(NSTs) are the priority treatment in Western countries. The preferred treatmentsforOPSCC patients remaindebated. Methods:Atotalof 153 consecutive OPSCC casesdiagnosed between 2009 and 2019inWCH, and 15,400 OPSCC cases from SEER database (2000-2017) were obtained. Clinical characteristics, treatments, and survival outcomes were retrospectively collected. We conductedKaplan-Meier curves univariate and multivariate analysis to compare the prognosis of OPSCC patients in WCH, SEER Asian, and SEER all ethnic population by different treatment modalities,HPVstatus, ages, and TNM stages. Results: The 5-year overall survival rate was 59% in WCH, 64% in the SEER all ethnic and 67% in SEER Asian group. In both univariate and multivariate analysis, SIT was observed as a consistent benefit factor for OPSCC patients in all three populations when classified by genders, tumor stages, and HPV status. Patients who underwent SIT had significantly better survival outcomes than those who received NSTsin WCH, SEER Asian, and SEER all ethnic groups. HPV positive status was the beneficial factor of OPSCC patients in all three groups. Besides, male patients had worse survival outcomes in both WCH and SEER Asian group, whereas male patients had better outcomes in the SEER all ethnic group. Conclusion: In contrast to nowadaysNSTs are the first-line therapiesfor OPSCC, our ten-year real-world data and SEER data indicated that OPSCC patients who underwent SIT had better prognosis than NSTs.Keywords: OPSCC, survival outcome, SEER, treatment modalities
Procedia PDF Downloads 1773815 A Statistical Approach to Predict and Classify the Commercial Hatchability of Chickens Using Extrinsic Parameters of Breeders and Eggs
Authors: M. S. Wickramarachchi, L. S. Nawarathna, C. M. B. Dematawewa
Abstract:
Hatchery performance is critical for the profitability of poultry breeder operations. Some extrinsic parameters of eggs and breeders cause to increase or decrease the hatchability. This study aims to identify the affecting extrinsic parameters on the commercial hatchability of local chicken's eggs and determine the most efficient classification model with a hatchability rate greater than 90%. In this study, seven extrinsic parameters were considered: egg weight, moisture loss, breeders age, number of fertilised eggs, shell width, shell length, and shell thickness. Multiple linear regression was performed to determine the most influencing variable on hatchability. First, the correlation between each parameter and hatchability were checked. Then a multiple regression model was developed, and the accuracy of the fitted model was evaluated. Linear Discriminant Analysis (LDA), Classification and Regression Trees (CART), k-Nearest Neighbors (kNN), Support Vector Machines (SVM) with a linear kernel, and Random Forest (RF) algorithms were applied to classify the hatchability. This grouping process was conducted using binary classification techniques. Hatchability was negatively correlated with egg weight, breeders' age, shell width, shell length, and positive correlations were identified with moisture loss, number of fertilised eggs, and shell thickness. Multiple linear regression models were more accurate than single linear models regarding the highest coefficient of determination (R²) with 94% and minimum AIC and BIC values. According to the classification results, RF, CART, and kNN had performed the highest accuracy values 0.99, 0.975, and 0.972, respectively, for the commercial hatchery process. Therefore, the RF is the most appropriate machine learning algorithm for classifying the breeder outcomes, which are economically profitable or not, in a commercial hatchery.Keywords: classification models, egg weight, fertilised eggs, multiple linear regression
Procedia PDF Downloads 883814 Ordinary Differentiation Equations (ODE) Reconstruction of High-Dimensional Genetic Networks through Game Theory with Application to Dissecting Tree Salt Tolerance
Authors: Libo Jiang, Huan Li, Rongling Wu
Abstract:
Ordinary differentiation equations (ODE) have proven to be powerful for reconstructing precise and informative gene regulatory networks (GRNs) from dynamic gene expression data. However, joint modeling and analysis of all genes, essential for the systematical characterization of genetic interactions, are challenging due to high dimensionality and a complex pattern of genetic regulation including activation, repression, and antitermination. Here, we address these challenges by unifying variable selection and game theory through ODE. Each gene within a GRN is co-expressed with its partner genes in a way like a game of multiple players, each of which tends to choose an optimal strategy to maximize its “fitness” across the whole network. Based on this unifying theory, we designed and conducted a real experiment to infer salt tolerance-related GRNs for Euphrates poplar, a hero tree that can grow in the saline desert. The pattern and magnitude of interactions between several hub genes within these GRNs were found to determine the capacity of Euphrates poplar to resist to saline stress.Keywords: gene regulatory network, ordinary differential equation, game theory, LASSO, saline resistance
Procedia PDF Downloads 6403813 Evaluation of Invasive Tree Species for Production of Phosphate Bonded Composites
Authors: Stephen Osakue Amiandamhen, Schwaller Andreas, Martina Meincken, Luvuyo Tyhoda
Abstract:
Invasive alien tree species are currently being cleared in South Africa as a result of the forest and water imbalances. These species grow wildly constituting about 40% of total forest area. They compete with the ecosystem for natural resources and are considered as ecosystem engineers by rapidly changing disturbance regimes. As such, they are harvested for commercial uses but much of it is wasted because of their form and structure. The waste is being sold to local communities as fuel wood. These species can be considered as potential feedstock for the production of phosphate bonded composites. The presence of bark in wood-based composites leads to undesirable properties, and debarking as an option can be cost implicative. This study investigates the potentials of these invasive species processed without debarking on some fundamental properties of wood-based panels. Some invasive alien tree species were collected from EC Biomass, Port Elizabeth, South Africa. They include Acacia mearnsii (Black wattle), A. longifolia (Long-leaved wattle), A. cyclops (Red-eyed wattle), A. saligna (Golden-wreath wattle) and Eucalyptus globulus (Blue gum). The logs were chipped as received. The chips were hammer-milled and screened through a 1 mm sieve. The wood particles were conditioned and the quantity of bark in the wood was determined. The binding matrix was prepared using a reactive magnesia, phosphoric acid and class S fly ash. The materials were mixed and poured into a metallic mould. The composite within the mould was compressed at room temperature at a pressure of 200 KPa. After initial setting which took about 5 minutes, the composite board was demoulded and air-cured for 72 h. The cured product was thereafter conditioned at 20°C and 70% relative humidity for 48 h. Test of physical and strength properties were conducted on the composite boards. The effect of binder formulation and fly ash content on the properties of the boards was studied using fitted response surface technology, according to a central composite experimental design (CCD) at a fixed wood loading of 75% (w/w) of total inorganic contents. The results showed that phosphate/magnesia ratio of 3:1 and fly ash content of 10% was required to obtain a product of good properties and sufficient strength for intended applications. The proposed products can be used for ceilings, partitioning and insulating wall panels.Keywords: invasive alien tree species, phosphate bonded composites, physical properties, strength
Procedia PDF Downloads 2953812 Sex Work Practice and Health Seeking Behavior among Hiv Positive Female Sex Workers in Rural Karnataka, India
Authors: Rajeshwari Biradar
Abstract:
Background: The anecdotal evidences indicate that utilization of HIV services especially in Government facilities is affected by stigma and discrimination among HIV positive female sex workers (FSWs) in Karnataka. To our knowledge, there is no quantitative study on this issue. In this study an attempt is made to examine these aspects among positive FSWs exposed to prevention programs. Methods: This is a cross‐ sectional quantitative survey of HIV positive FSWs in the 3 districts of northern Karnataka using a structured questionnaire. The list of HIV Positive FSWs was organized by stratification, and 607 positive FSWs were selected using a systematic random selection. The data were analyzed using both bivariate and multivariate statistical techniques. Results: Half of the sex workers (52%) are traditional (devadasi, dedicated to the temple), 22% are widowed and the mean age is 33 years. The FSWs practice sex work on an average 13 days a month with 2.3 clients per day and was in sex work for about 13 years. Almost all of them (97%) used condom with the clients they had on the last day of sex work. About 74% were ever registered in the ART center and 47% of them reported being ever on ART, of which 6% dropped out. Multivariate results support the hypothesis that the interventions addressing stigma and discrimination enabled accessing health services in the government facilities (AOR=1.37; p=0.17). Conclusions: Based on the results of the study, programs addressing stigma, discrimination and positive prevention can be implemented in places where government health services are not utilized by HIV positive FSWs. However, the study may be limited by the fact that majority of the FSWs entered into sex work through the traditional devadasi system, which may not be the case in other parts of India.Keywords: sex work, HIV/AIDS, female sex workers, health
Procedia PDF Downloads 1883811 Supplier Risk Management: A Multivariate Statistical Modelling and Portfolio Optimization Based Approach for Supplier Delivery Performance Development
Authors: Jiahui Yang, John Quigley, Lesley Walls
Abstract:
In this paper, the authors develop a stochastic model regarding the investment in supplier delivery performance development from a buyer’s perspective. The authors propose a multivariate model through a Multinomial-Dirichlet distribution within an Empirical Bayesian inference framework, representing both the epistemic and aleatory uncertainties in deliveries. A closed form solution is obtained and the lower and upper bound for both optimal investment level and expected profit under uncertainty are derived. The theoretical properties provide decision makers with useful insights regarding supplier delivery performance improvement problems where multiple delivery statuses are involved. The authors also extend the model from a single supplier investment into a supplier portfolio, using a Lagrangian method to obtain a theoretical expression for an optimal investment level and overall expected profit. The model enables a buyer to know how the marginal expected profit/investment level of each supplier changes with respect to the budget and which supplier should be invested in when additional budget is available. An application of this model is illustrated in a simulation study. Overall, the main contribution of this study is to provide an optimal investment decision making framework for supplier development, taking into account multiple delivery statuses as well as multiple projects.Keywords: decision making, empirical bayesian, portfolio optimization, supplier development, supply chain management
Procedia PDF Downloads 2893810 Non-Methane Hydrocarbons Emission during the Photocopying Process
Authors: Kiurski S. Jelena, Aksentijević M. Snežana, Kecić S. Vesna, Oros B. Ivana
Abstract:
The prosperity of electronic equipment in photocopying environment not only has improved work efficiency, but also has changed indoor air quality. Considering the number of photocopying employed, indoor air quality might be worse than in general office environments. Determining the contribution from any type of equipment to indoor air pollution is a complex matter. Non-methane hydrocarbons are known to have an important role of air quality due to their high reactivity. The presence of hazardous pollutants in indoor air has been detected in one photocopying shop in Novi Sad, Serbia. Air samples were collected and analyzed for five days, during 8-hr working time in three-time intervals, whereas three different sampling points were determined. Using multiple linear regression model and software package STATISTICA 10 the concentrations of occupational hazards and micro-climates parameters were mutually correlated. Based on the obtained multiple coefficients of determination (0.3751, 0.2389, and 0.1975), a weak positive correlation between the observed variables was determined. Small values of parameter F indicated that there was no statistically significant difference between the concentration levels of non-methane hydrocarbons and micro-climates parameters. The results showed that variable could be presented by the general regression model: y = b0 + b1xi1+ b2xi2. Obtained regression equations allow to measure the quantitative agreement between the variation of variables and thus obtain more accurate knowledge of their mutual relations.Keywords: non-methane hydrocarbons, photocopying process, multiple regression analysis, indoor air quality, pollutant emission
Procedia PDF Downloads 3783809 Principal Component Regression in Amylose Content on the Malaysian Market Rice Grains Using Near Infrared Reflectance Spectroscopy
Authors: Syahira Ibrahim, Herlina Abdul Rahim
Abstract:
The amylose content is an essential element in determining the texture and taste of rice grains. This paper evaluates the use of VIS-SWNIRS in estimating the amylose content for seven varieties of rice grains available in the Malaysian market. Each type consists of 30 samples and all the samples are scanned using the spectroscopy to obtain a range of values between 680-1000nm. The Savitzky-Golay (SG) smoothing filter is applied to each sample’s data before the Principal Component Regression (PCR) technique is used to examine the data and produce a single value for each sample. This value is then compared with reference values obtained from the standard iodine colorimetric test in terms of its coefficient of determination, R2. Results show that this technique produced low R2 values of less than 0.50. In order to improve the result, the range should include a wavelength range of 1100-2500nm and the number of samples processed should also be increased.Keywords: amylose content, diffuse reflectance, Malaysia rice grain, principal component regression (PCR), Visible and Shortwave near-infrared spectroscopy (VIS-SWNIRS)
Procedia PDF Downloads 3823808 Correlates of Comprehensive HIV/AIDS Knowledge and Acceptance Attitude Towards People Living with HIV/AIDS: A Cross-Sectional Study among Unmarried Young Women in Uganda
Authors: Tesfaldet Mekonnen Estifanos, Chen Hui, Afewerki Weldezgi
Abstract:
Background: Youth in general and young females in particular, remain at the center of the HIV/AIDS epidemic. Sexual risk-taking among young unmarried women is relatively high and are the most vulnerable and highly exposed to HIV/AIDS. Improvements in the status of HIV/AIDS knowledge and acceptance attitude towards people living with HIV (PLWHIV) plays a great role in averting the incidence of HIV/AIDS. Thus, the aim of the study was to explore the level and correlates of HIV/AIDS knowledge and accepting attitude toward PLWHIV. Methods: A cross-sectional study was conducted using data from the Uganda Demographic Health Survey 2016 (UDHS-2016). National level representative household surveys using a multistage cluster probability sampling method, face to face interviews with standard questionnaires were performed. Unmarried women aged 15-24 years with a sample size of 2019 were selected from the total sample of 8674 women aged 15-49 years and were analyzed using SPSS version 23. Independent variables such as age, religion, educational level, residence, and wealth index were included. Two binary outcome variables (comprehensive HIV/AIDS knowledge and acceptance attitude toward PLWHIV) were utilized. We used the chi-square test as well as multivariate regression analysis to explore correlations of explanatory variables with the outcome variables. The results were reported by odds ratios (OR) with 95% confidence interval (95% CI), taking a p-value less than 0.05 as significant. Results: Almost all (99.3%) of the unmarried women aged 15-24 years were aware of HIV/AIDS, but only 51.2% had adequate comprehensive knowledge on HIV/AIDS. Only 69.4% knew both methods: using a condom every time had sex, and having only one faithful uninfected partner can prevent HIV/AIDS transmission. About 66.6% of the unmarried women reject at least two common local misconceptions about HIV/AIDS. Moreover, an alarmingly few (20.3%) of the respondents had a positive acceptance attitude to PLWHIV. On multivariate analysis, age (20-24 years), living in urban, being educated and wealthier, were predictors of having adequate comprehensive HIV/AIDS knowledge. On the other hand, research participants with adequate comprehensive knowledge about HIV/AIDS were highly likely (OR, 1.94 95% CI, 1.52-2.46) to have a positive acceptance attitude to PLWHIV than those with inadequate knowledge. Respondents with no education, Muslim, and Pentecostal religion were emerged less likely to have a positive acceptance attitude to PLWHIV. Conclusion: This study found out the highly accepted level of awareness, but the knowledge and positive acceptance attitude are not encouraging. Thus, expanding access to comprehensive sexuality and strengthening educational campaigns on HIV/AIDS in communities, health facilities, and schools is needed with a greater focus on disadvantaged women having low educational level, poor socioeconomic status, and those residing in rural areas. Sexual risk behaviors among the most affected people - young women have also a role in the spread of HIV/AIDS. Hence, further research assessing the significant contributing factors for sexual risk-taking might have a positive impact on the fight against HIV/AIDS.Keywords: acceptance attitude, HIV/AIDS, knowledge, unmarried women
Procedia PDF Downloads 1553807 Indicators of Value of Life in Children with Colorectal Illness
Authors: Enkelejda Shkurti, Diamant Shtiza
Abstract:
Background: Health-related quality of life (HRQoL) is a significant consequence in health care. The objective of our study was to recognize features related to lower HRQoL scores in children with anorectal malformation (ARM) and Hirschsprung disease (HD). Methods: Children younger than 18 years, with HD or ARM, that were assessed at our private clinic in Tirana, Albania, from December 2018 to October 2019, were acknowledged. The outcomes of broad questionnaires concerning diagnosis, symptoms, and preceding health/surgical history and authenticated tools to measure urinary status, stooling grade, and HRQoL were appraised. Results: In patients aged 0-6 years, vomiting and abdominal enlargement were allied with a substantial decrease in total HRQoL scores. In children > 6 years of age, vomiting, abdominal swelling, and abdominal discomfort were also linked to a considerably lower HRQoL. The main indicator of lower HRQoL scores on regression tree analysis in all age clusters was the occurrence of psychosomatic, behavioral, or progressive comorbidity. Conclusion: Children with both HD or ARM that have a psychosomatic, behavioral, or growing problem experience considerably lower HRQoL than patients deprived of such problems, proposing that establishment of behavioral/growing sustenance as part of the care of these patients may have a considerable influence on their HRQoL.Keywords: anorectal malformation, Hirsch Sprung disease, quality of life, Albania
Procedia PDF Downloads 1753806 Determining Optimal Number of Trees in Random Forests
Authors: Songul Cinaroglu
Abstract:
Background: Random Forest is an efficient, multi-class machine learning method using for classification, regression and other tasks. This method is operating by constructing each tree using different bootstrap sample of the data. Determining the number of trees in random forests is an open question in the literature for studies about improving classification performance of random forests. Aim: The aim of this study is to analyze whether there is an optimal number of trees in Random Forests and how performance of Random Forests differ according to increase in number of trees using sample health data sets in R programme. Method: In this study we analyzed the performance of Random Forests as the number of trees grows and doubling the number of trees at every iteration using “random forest” package in R programme. For determining minimum and optimal number of trees we performed Mc Nemar test and Area Under ROC Curve respectively. Results: At the end of the analysis it was found that as the number of trees grows, it does not always means that the performance of the forest is better than forests which have fever trees. In other words larger number of trees only increases computational costs but not increases performance results. Conclusion: Despite general practice in using random forests is to generate large number of trees for having high performance results, this study shows that increasing number of trees doesn’t always improves performance. Future studies can compare different kinds of data sets and different performance measures to test whether Random Forest performance results change as number of trees increase or not.Keywords: classification methods, decision trees, number of trees, random forest
Procedia PDF Downloads 3963805 Prevalence and Correlates of Anemia in Adolescents in Riyadh City, Kingdom of Saudi Arabia
Authors: Aljohara M. Alquaiz, Tawfik A. M. Khoja, Abdullah Alsharif, Ambreen Kazi, Ashry Gad Mohamed, Hamad Al Mane, Abdullah Aldiris, Shaffi Ahamed Shaikh
Abstract:
Objective: To determine the prevalence and correlates of anemia in male and female adolescents in Riyadh, Kingdom of Saudi Arabia. Design: A cross-sectional community based study setting: Five primary health care centers in Riyadh. Subjects: We invited 203 male and 292 female adolescents aged 13-18 years for interview, anthropometric measurements and complete blood count. Blood hemoglobin was measured with coulter cellular analysis system using light scatter method. Results: Using the WHO cut-off of Hb < 12gms/dl, 16.7%(34) males and 34%(100) females were suffering from anemia. The mean Hb (±SD) in males and females was 13.5(±1.4) and 12.3(±1.2) mg/dl, respectively. Mean(±SD) MCV, MCH, MCHC and RDW in male and female adolescents were 77.8(±6.2) vs76.4(±10.3)fL, 26.1(±2.7) vs25.5(±2.6)pg, 32.7(±2.4) vs32.2(±2.6)g/dL, 13.9(±1.4) vs13.6(±1.3)%, respectively. Multivariate logistic regression revealed that positive family history of iron deficiency anemia(IDA)(OR 4.7,95%CI 1.7–12.2), infrequent intake (OR 3.7,95%CI 1.3–10.0) and never intake of fresh juices(OR 3.5,95%CI 1.4–9.5), 13 to 14 years age (OR 3.1,95%CI 1.2–9.3) were significantly associated with anemia in male adolescents; whereas in females: family history of IDA (OR 3.4, 95%CI 1.5–7.6), being over-weight(OR 3.0,95%CI 1.4–6.1), no intake of fresh juice (OR 2.6,95%CI 1.4–5.1), living in an apartment (OR 2.0, 95%CI 1.1-3.8) or living in small house (OR 2.5, 95%CI 1.2-5.3) were significantly associated with anemia. Conclusion: Anemia is more prevalent among Saudi female adolescents as compared to males. Important factors like positive family history of IDA, overweight, lack of fresh juice intake and low socioeconomic status are significantly associated with anemia in adolescents.Keywords: adolescents, anemia, correlates, obesity
Procedia PDF Downloads 3523804 Ranking of Provinces in Iran for Capital Formation in Spatial Planning with Numerical Taxonomy Technique (An Improvement) Case Study: Agriculture Sector
Authors: Farhad Nouparast
Abstract:
For more production we need more capital formation. Capital formation in each country should be based on comparative advantages in different economic sectors due to the different production possibility curves. In regional planning, recognizing the relative advantages and consequently investing in more production requires identifying areas with the necessary capabilities and location of each region compared to other regions. In this article, ranking of Iran's provinces is done according to the specific and given variables as the best investment position in agricultural activity. So we can provide the necessary background for investment analysis in different regions of the country to formulate national and regional planning and execute investment projects. It is used factor analysis technique and numerical taxonomy analysis to do this in thisarticle. At first, the provinces are homogenized and graded according to the variables using cross-sectional data obtained from the agricultural census and population and housing census of Iran as data matrix. The results show that which provinces have the most potential for capital formation in agronomy sub-sector. Taxonomy classifies organisms based on similar genetic traits in biology and botany. Numerical taxonomy using quantitative methods controls large amounts of information and get the number of samples and categories and take them based on inherent characteristics and differences indirectly accommodates. Numerical taxonomy is related to multivariate statistics.Keywords: Capital Formation, Factor Analysis, Multivariate statistics, Numerical Taxonomy Analysis, Production, Ranking, Spatial Planning
Procedia PDF Downloads 1423803 Automatic Adult Age Estimation Using Deep Learning of the ResNeXt Model Based on CT Reconstruction Images of the Costal Cartilage
Authors: Ting Lu, Ya-Ru Diao, Fei Fan, Ye Xue, Lei Shi, Xian-e Tang, Meng-jun Zhan, Zhen-hua Deng
Abstract:
Accurate adult age estimation (AAE) is a significant and challenging task in forensic and archeology fields. Attempts have been made to explore optimal adult age metrics, and the rib is considered a potential age marker. The traditional way is to extract age-related features designed by experts from macroscopic or radiological images followed by classification or regression analysis. Those results still have not met the high-level requirements for practice, and the limitation of using feature design and manual extraction methods is loss of information since the features are likely not designed explicitly for extracting information relevant to age. Deep learning (DL) has recently garnered much interest in imaging learning and computer vision. It enables learning features that are important without a prior bias or hypothesis and could be supportive of AAE. This study aimed to develop DL models for AAE based on CT images and compare their performance to the manual visual scoring method. Chest CT data were reconstructed using volume rendering (VR). Retrospective data of 2500 patients aged 20.00-69.99 years were obtained between December 2019 and September 2021. Five-fold cross-validation was performed, and datasets were randomly split into training and validation sets in a 4:1 ratio for each fold. Before feeding the inputs into networks, all images were augmented with random rotation and vertical flip, normalized, and resized to 224×224 pixels. ResNeXt was chosen as the DL baseline due to its advantages of higher efficiency and accuracy in image classification. Mean absolute error (MAE) was the primary parameter. Independent data from 100 patients acquired between March and April 2022 were used as a test set. The manual method completely followed the prior study, which reported the lowest MAEs (5.31 in males and 6.72 in females) among similar studies. CT data and VR images were used. The radiation density of the first costal cartilage was recorded using CT data on the workstation. The osseous and calcified projections of the 1 to 7 costal cartilages were scored based on VR images using an eight-stage staging technique. According to the results of the prior study, the optimal models were the decision tree regression model in males and the stepwise multiple linear regression equation in females. Predicted ages of the test set were calculated separately using different models by sex. A total of 2600 patients (training and validation sets, mean age=45.19 years±14.20 [SD]; test set, mean age=46.57±9.66) were evaluated in this study. Of ResNeXt model training, MAEs were obtained with 3.95 in males and 3.65 in females. Based on the test set, DL achieved MAEs of 4.05 in males and 4.54 in females, which were far better than the MAEs of 8.90 and 6.42 respectively, for the manual method. Those results showed that the DL of the ResNeXt model outperformed the manual method in AAE based on CT reconstruction of the costal cartilage and the developed system may be a supportive tool for AAE.Keywords: forensic anthropology, age determination by the skeleton, costal cartilage, CT, deep learning
Procedia PDF Downloads 74