Search results for: multivariate regression analysis
28927 Partial Least Square Regression for High-Dimentional and High-Correlated Data
Authors: Mohammed Abdullah Alshahrani
Abstract:
The research focuses on investigating the use of partial least squares (PLS) methodology for addressing challenges associated with high-dimensional correlated data. Recent technological advancements have led to experiments producing data characterized by a large number of variables compared to observations, with substantial inter-variable correlations. Such data patterns are common in chemometrics, where near-infrared (NIR) spectrometer calibrations record chemical absorbance levels across hundreds of wavelengths, and in genomics, where thousands of genomic regions' copy number alterations (CNA) are recorded from cancer patients. PLS serves as a widely used method for analyzing high-dimensional data, functioning as a regression tool in chemometrics and a classification method in genomics. It handles data complexity by creating latent variables (components) from original variables. However, applying PLS can present challenges. The study investigates key areas to address these challenges, including unifying interpretations across three main PLS algorithms and exploring unusual negative shrinkage factors encountered during model fitting. The research presents an alternative approach to addressing the interpretation challenge of predictor weights associated with PLS. Sparse estimation of predictor weights is employed using a penalty function combining a lasso penalty for sparsity and a Cauchy distribution-based penalty to account for variable dependencies. The results demonstrate sparse and grouped weight estimates, aiding interpretation and prediction tasks in genomic data analysis. High-dimensional data scenarios, where predictors outnumber observations, are common in regression analysis applications. Ordinary least squares regression (OLS), the standard method, performs inadequately with high-dimensional and highly correlated data. Copy number alterations (CNA) in key genes have been linked to disease phenotypes, highlighting the importance of accurate classification of gene expression data in bioinformatics and biology using regularized methods like PLS for regression and classification.Keywords: partial least square regression, genetics data, negative filter factors, high dimensional data, high correlated data
Procedia PDF Downloads 4928926 Testing the Change in Correlation Structure across Markets: High-Dimensional Data
Authors: Malay Bhattacharyya, Saparya Suresh
Abstract:
The Correlation Structure associated with a portfolio is subjected to vary across time. Studying the structural breaks in the time-dependent Correlation matrix associated with a collection had been a subject of interest for a better understanding of the market movements, portfolio selection, etc. The current paper proposes a methodology for testing the change in the time-dependent correlation structure of a portfolio in the high dimensional data using the techniques of generalized inverse, singular valued decomposition and multivariate distribution theory which has not been addressed so far. The asymptotic properties of the proposed test are derived. Also, the performance and the validity of the method is tested on a real data set. The proposed test performs well for detecting the change in the dependence of global markets in the context of high dimensional data.Keywords: correlation structure, high dimensional data, multivariate distribution theory, singular valued decomposition
Procedia PDF Downloads 12528925 A Multinomial Logistic Regression Analysis of Factors Influencing Couples' Fertility Preferences in Kenya
Authors: Naomi W. Maina
Abstract:
Fertility preference is a subject of great significance in developing countries. Studies reveal that the preferences of fertility are actually significant in determining the society’s fertility levels because the fertility behavior of the future has a high likelihood of falling under the effect of currently observed fertility inclinations. The objective of this study was to establish the factors associated with fertility preference amongst couples in Kenya by fitting a multinomial logistic regression model against 5,265 couple data obtained from Kenya demographic health survey 2014. Results revealed that the type of place of residence, the region of residence, age and spousal age gap significantly influence desire for additional children among couples in Kenya. There was the notable high likelihood of couples living in rural settlements having similar fertility preference compared to those living in urban settlements. Moreover, geographical disparities such as in northern Kenya revealed significant differences in a couples desire to have additional children compared to Nairobi. The odds of a couple’s desire for additional children were further observed to vary dependent on either the wife or husbands age and to a large extent the spousal age gap. Evidenced from the study, was the fact that as spousal age gap increases, the desire for more children amongst couples decreases. Insights derived from this study would be attractive to demographers, health practitioners, policymakers, and non-governmental organizations implementing fertility related interventions in Kenya among other stakeholders. Moreover, with the adoption of devolution, there is a clear need for adoption of population policies that are County specific as opposed to a national population policy as is the current practice in Kenya. Additionally, researchers or students who have little understanding in the application of multinomial logistic regression, both theoretical understanding and practical analysis in SPSS as well as application on real datasets, will find this article useful.Keywords: couples' desire, fertility, fertility preference, multinomial regression analysis
Procedia PDF Downloads 18128924 The Association of Smoking and Body Mass Index with Acne Vulgaris in Adolescents and Young Adults
Authors: Almutazballlah Qablan, Jihan M. Muhaidat, Bana Abu Rajab
Abstract:
Background: Acne vulgaris is the most common skin condition that general practitioners and dermatologists encounter. It represents a chronic inflammatory disease affecting the pilosebaceous unit. Although acne vulgaris is not a life-threatening condition, it has a considerable psychological impact on the affected person. Acne patients have poor body image, low self-esteem, social isolation, and restricted activities. As part of the emotional impact, increased levels of anxiety, anger, depression, and frustration have also been observed in acne patients. (1) In this study, we want to assess the association between two modifiable risk factors; BMI and smoking, regarding acne vulgaris. Methods: A case-control study was conducted at King Abdullah University Hospital in Irbid, north Jordan in 2019/2020. A total number of 163 Acne cases were collected and interviewed by the author; on the other hand, there were 162 control cases. Anthropometric measures for Acne patients and control individuals were taken, and BMI was calculated. Both groups were asked about smoking habits. Data on subjects between 14 and 33 years of age were extracted. The characteristics of people who reported acne were compared with those with no acne using univariate and multivariate analysis. The Statistical Package for Social Sciences (SPSS) was relied on to analyze the collected data. The crosstabs methods (chi-square) and odd ratios were relied on to test the study hypothesis. Results: Cigarette smoking was highly associated with no-acne, with an odds ratio of 0.4 (95% CI: 0.2–0.9), P-value = 0.018. BMI and waterpipe smoking were not significantly associated with acne in the multivariate analysis. Conclusion: Cigarette smoking was found to be protective from Acne. No significant relation between BMI nor waterpipe smoking and the development of Acne Vulgaris.Keywords: acne, BMI, smoking, case-control
Procedia PDF Downloads 9628923 Two-Phase Sampling for Estimating a Finite Population Total in Presence of Missing Values
Authors: Daniel Fundi Murithi
Abstract:
Missing data is a real bane in many surveys. To overcome the problems caused by missing data, partial deletion, and single imputation methods, among others, have been proposed. However, problems such as discarding usable data and inaccuracy in reproducing known population parameters and standard errors are associated with them. For regression and stochastic imputation, it is assumed that there is a variable with complete cases to be used as a predictor in estimating missing values in the other variable, and the relationship between the two variables is linear, which might not be realistic in practice. In this project, we estimate population total in presence of missing values in two-phase sampling. Instead of regression or stochastic models, non-parametric model based regression model is used in imputing missing values. Empirical study showed that nonparametric model-based regression imputation is better in reproducing variance of population total estimate obtained when there were no missing values compared to mean, median, regression, and stochastic imputation methods. Although regression and stochastic imputation were better than nonparametric model-based imputation in reproducing population total estimates obtained when there were no missing values in one of the sample sizes considered, nonparametric model-based imputation may be used when the relationship between outcome and predictor variables is not linear.Keywords: finite population total, missing data, model-based imputation, two-phase sampling
Procedia PDF Downloads 13028922 Determinants of Aggregate Electricity Consumption in Ghana: A Multivariate Time Series Analysis
Authors: Renata Konadu
Abstract:
In Ghana, electricity has become the main form of energy which all sectors of the economy rely on for their businesses. Therefore, as the economy grows, the demand and consumption of electricity also grow alongside due to the heavy dependence on it. However, since the supply of electricity has not increased to match the demand, there has been frequent power outages and load shedding affecting business performances. To solve this problem and advance policies to secure electricity in Ghana, it is imperative that those factors that cause consumption to increase be analysed by considering the three classes of consumers; residential, industrial and non-residential. The main argument, however, is that, export of electricity to other neighbouring countries should be included in the electricity consumption model and considered as one of the significant factors which can decrease or increase consumption. The author made use of multivariate time series data from 1980-2010 and econometric models such as Ordinary Least Squares (OLS) and Vector Error Correction Model. Findings show that GDP growth, urban population growth, electricity exports and industry value added to GDP were cointegrated. The results also showed that there is unidirectional causality from electricity export and GDP growth and Industry value added to GDP to electricity consumption in the long run. However, in the short run, there was found to be a directional causality among all the variables and electricity consumption. The results have useful implication for energy policy makers especially with regards to electricity consumption, demand, and supply.Keywords: electricity consumption, energy policy, GDP growth, vector error correction model
Procedia PDF Downloads 43728921 An Automated Stock Investment System Using Machine Learning Techniques: An Application in Australia
Authors: Carol Anne Hargreaves
Abstract:
A key issue in stock investment is how to select representative features for stock selection. The objective of this paper is to firstly determine whether an automated stock investment system, using machine learning techniques, may be used to identify a portfolio of growth stocks that are highly likely to provide returns better than the stock market index. The second objective is to identify the technical features that best characterize whether a stock’s price is likely to go up and to identify the most important factors and their contribution to predicting the likelihood of the stock price going up. Unsupervised machine learning techniques, such as cluster analysis, were applied to the stock data to identify a cluster of stocks that was likely to go up in price – portfolio 1. Next, the principal component analysis technique was used to select stocks that were rated high on component one and component two – portfolio 2. Thirdly, a supervised machine learning technique, the logistic regression method, was used to select stocks with a high probability of their price going up – portfolio 3. The predictive models were validated with metrics such as, sensitivity (recall), specificity and overall accuracy for all models. All accuracy measures were above 70%. All portfolios outperformed the market by more than eight times. The top three stocks were selected for each of the three stock portfolios and traded in the market for one month. After one month the return for each stock portfolio was computed and compared with the stock market index returns. The returns for all three stock portfolios was 23.87% for the principal component analysis stock portfolio, 11.65% for the logistic regression portfolio and 8.88% for the K-means cluster portfolio while the stock market performance was 0.38%. This study confirms that an automated stock investment system using machine learning techniques can identify top performing stock portfolios that outperform the stock market.Keywords: machine learning, stock market trading, logistic regression, cluster analysis, factor analysis, decision trees, neural networks, automated stock investment system
Procedia PDF Downloads 15728920 A Novel Approach towards Test Case Prioritization Technique
Authors: Kamna Solanki, Yudhvir Singh, Sandeep Dalal
Abstract:
Software testing is a time and cost intensive process. A scrutiny of the code and rigorous testing is required to identify and rectify the putative bugs. The process of bug identification and its consequent correction is continuous in nature and often some of the bugs are removed after the software has been launched in the market. This process of code validation of the altered software during the maintenance phase is termed as Regression testing. Regression testing ubiquitously considers resource constraints; therefore, the deduction of an appropriate set of test cases, from the ensemble of the entire gamut of test cases, is a critical issue for regression test planning. This paper presents a novel method for designing a suitable prioritization process to optimize fault detection rate and performance of regression test on predefined constraints. The proposed method for test case prioritization m-ACO alters the food source selection criteria of natural ants and is basically a modified version of Ant Colony Optimization (ACO). The proposed m-ACO approach has been coded in 'Perl' language and results are validated using three examples by computation of Average Percentage of Faults Detected (APFD) metric.Keywords: regression testing, software testing, test case prioritization, test suite optimization
Procedia PDF Downloads 33828919 Developing Countries and the Entrepreneurial Intention of Postgraduates: A Study of Nigerian Postgraduates in UUM
Authors: Mahmoud Ahmad Mahmoud
Abstract:
The surge in unemployment among nations and the understanding of the important role played by entrepreneurship in job creation by researchers and policy makers have steered to the postulation that entrepreneurship activities can be spurred through the development of entrepreneurial intentions. Notwithstanding, entrepreneurial intention studies are very scarce in the developing world especially in the African continent. Even among the developed countries, studies of entrepreneurial intention were mostly focused on the undergraduate candidates. This paper therefore, aimed at filling the gap by employing the descriptive quantitative survey method to examine the entrepreneurial intention of 158 Nigerian postgraduate candidates of Universiti Utara Malaysia (UUM), comprising 46 Masters and 112 PhD candidates who are studying in the College of Business (COB), College of Arts and Sciences (CAS) and College of Legal, Government and International Studies (COLGIS), the theory of planned behaviour (TPB) model was used due its reputable validity, with attitudes, subjective norms and perceived behavioural control as the independent variables. Preliminary analysis and data screening were conducted which qualifies the data to the multivariate analysis assumptions. The reliability test was performed using the Cronbach Alpha method which shows all variables as reliable with a value of >0.70. However, the data is free from the multicollinearity issue with all factors in the Pearson correlation having <0.9 value and the VIF having <10. Regression analysis has shown the sufficiency and predictive capability of the TPB model to entrepreneurship intention with attitude, subjective norms and perceived behavioural control being positively and significantly related to the entrepreneurial intention of Nigerian postgraduates. Considering the Beta values, perceived behavioural control emerged as the strongest factor that influences the postgraduates entrepreneurial intention. Developing countries are therefore, recommended to make efforts in redesigning their entrepreneurship development policies to fit candidates of the highest level of academia. Further studies should replicate in a larger sample that comprises more than one university and more than one developing country.Keywords: attitude, entrepreneurial intention, Nigeria, perceived behavioral control, postgraduates, subjective norms
Procedia PDF Downloads 43228918 Blood Pressure Level, Targeted Blood Pressure Control Rate, and Factors Related to Blood Pressure Control in Post-Acute Ischemic Stroke Patients
Authors: Nannapus Saramad, Rewwadee Petsirasan, Jom Suwanno
Abstract:
Background: This retrospective study design was to describe average blood pressure, blood pressure level, target blood pressure control rate post-stroke BP control in the year following discharge from Sichon hospital, Sichon District, Nakhon Si Thammarat province. The secondary data analysis was employed from the patient’s health records with patient or caregiver interview. A total of 232 eligible post-acute ischemic strokes in the year following discharge (2017-2018) were recruited. Methods: Data analyses were applied to identify the relationship values of single variables were determined through univariate analyses: The Chi-square test, Fisher exact test, the variables found to have a p-value < 0.2 were analyzed by the binary logistic regression Results: Most of the patients in this study were men 61.6%, an average age of 65.4 ± 14.8 years. Systolic blood pressure levels were in the grade 1-2 hypertension and diastolic pressure at optimal and normal at all times during the initial treatment through the present. The results revealed 25% among the groups under the age of 60 achieved BP control; 36.3% for older than 60 years group; and 27.9% for diabetic group. The multivariate analysis revealed the final relationship of four significant variables: 1) receiving calcium-channel blocker (p =.027); 2) medication adherence of antihypertensive (p = .024) 3) medication adherence of antiplatelet ( p = .020); and 4) medication behavior ( p = . 010) . Conclusion: The medical nurse and health care provider should promote their adherence to behavior to improve their blood pressure control.Keywords: acute ischemic stroke, target blood pressure control, medication adherence, recurrence stroke
Procedia PDF Downloads 12228917 Correlates of Comprehensive HIV/AIDS Knowledge and Acceptance Attitude Towards People Living with HIV/AIDS: A Cross-Sectional Study among Unmarried Young Women in Uganda
Authors: Tesfaldet Mekonnen Estifanos, Chen Hui, Afewerki Weldezgi
Abstract:
Background: Youth in general and young females in particular, remain at the center of the HIV/AIDS epidemic. Sexual risk-taking among young unmarried women is relatively high and are the most vulnerable and highly exposed to HIV/AIDS. Improvements in the status of HIV/AIDS knowledge and acceptance attitude towards people living with HIV (PLWHIV) plays a great role in averting the incidence of HIV/AIDS. Thus, the aim of the study was to explore the level and correlates of HIV/AIDS knowledge and accepting attitude toward PLWHIV. Methods: A cross-sectional study was conducted using data from the Uganda Demographic Health Survey 2016 (UDHS-2016). National level representative household surveys using a multistage cluster probability sampling method, face to face interviews with standard questionnaires were performed. Unmarried women aged 15-24 years with a sample size of 2019 were selected from the total sample of 8674 women aged 15-49 years and were analyzed using SPSS version 23. Independent variables such as age, religion, educational level, residence, and wealth index were included. Two binary outcome variables (comprehensive HIV/AIDS knowledge and acceptance attitude toward PLWHIV) were utilized. We used the chi-square test as well as multivariate regression analysis to explore correlations of explanatory variables with the outcome variables. The results were reported by odds ratios (OR) with 95% confidence interval (95% CI), taking a p-value less than 0.05 as significant. Results: Almost all (99.3%) of the unmarried women aged 15-24 years were aware of HIV/AIDS, but only 51.2% had adequate comprehensive knowledge on HIV/AIDS. Only 69.4% knew both methods: using a condom every time had sex, and having only one faithful uninfected partner can prevent HIV/AIDS transmission. About 66.6% of the unmarried women reject at least two common local misconceptions about HIV/AIDS. Moreover, an alarmingly few (20.3%) of the respondents had a positive acceptance attitude to PLWHIV. On multivariate analysis, age (20-24 years), living in urban, being educated and wealthier, were predictors of having adequate comprehensive HIV/AIDS knowledge. On the other hand, research participants with adequate comprehensive knowledge about HIV/AIDS were highly likely (OR, 1.94 95% CI, 1.52-2.46) to have a positive acceptance attitude to PLWHIV than those with inadequate knowledge. Respondents with no education, Muslim, and Pentecostal religion were emerged less likely to have a positive acceptance attitude to PLWHIV. Conclusion: This study found out the highly accepted level of awareness, but the knowledge and positive acceptance attitude are not encouraging. Thus, expanding access to comprehensive sexuality and strengthening educational campaigns on HIV/AIDS in communities, health facilities, and schools is needed with a greater focus on disadvantaged women having low educational level, poor socioeconomic status, and those residing in rural areas. Sexual risk behaviors among the most affected people - young women have also a role in the spread of HIV/AIDS. Hence, further research assessing the significant contributing factors for sexual risk-taking might have a positive impact on the fight against HIV/AIDS.Keywords: acceptance attitude, HIV/AIDS, knowledge, unmarried women
Procedia PDF Downloads 15128916 The Impact of Global Financial Crises and Corporate Financial Crisis (Bankruptcy Risk) on Corporate Tax Evasion: Evidence from Emerging Markets
Authors: Seyed Sajjad Habibi
Abstract:
The aim of this study is to investigate the impact of global financial crises and corporate financial crisis on tax evasion of companies listed on the Tehran Stock Exchange. For this purpose, panel data in the periods of financial crisis period (2007 to 2012) and without a financial crisis (2004, 2005, 2006, 2013, 2014, and 2015) was analyzed using multivariate linear regression. The results indicate a significant relationship between the corporate financial crisis (bankruptcy risk) and tax evasion in the global financial crisis period. The results also showed a significant relationship between the corporate bankruptcy risk and tax evasion in the period with no global financial crisis. A significant difference was found between the bankruptcy risk and tax evasion in the period of the global financial crisis and that with no financial crisis so that tax evasion increased in the financial crisis period.Keywords: global financial crisis, corporate financial crisis, bankruptcy risk, tax evasion risk, emerging markets
Procedia PDF Downloads 28028915 Prediction of the Thermodynamic Properties of Hydrocarbons Using Gaussian Process Regression
Authors: N. Alhazmi
Abstract:
Knowing the thermodynamics properties of hydrocarbons is vital when it comes to analyzing the related chemical reaction outcomes and understanding the reaction process, especially in terms of petrochemical industrial applications, combustions, and catalytic reactions. However, measuring the thermodynamics properties experimentally is time-consuming and costly. In this paper, Gaussian process regression (GPR) has been used to directly predict the main thermodynamic properties - standard enthalpy of formation, standard entropy, and heat capacity -for more than 360 cyclic and non-cyclic alkanes, alkenes, and alkynes. A simple workflow has been proposed that can be applied to directly predict the main properties of any hydrocarbon by knowing its descriptors and chemical structure and can be generalized to predict the main properties of any material. The model was evaluated by calculating the statistical error R², which was more than 0.9794 for all the predicted properties.Keywords: thermodynamic, Gaussian process regression, hydrocarbons, regression, supervised learning, entropy, enthalpy, heat capacity
Procedia PDF Downloads 22228914 Application of Multilinear Regression Analysis for Prediction of Synthetic Shear Wave Velocity Logs in Upper Assam Basin
Authors: Triveni Gogoi, Rima Chatterjee
Abstract:
Shear wave velocity (Vs) estimation is an important approach in the seismic exploration and characterization of a hydrocarbon reservoir. There are varying methods for prediction of S-wave velocity, if recorded S-wave log is not available. But all the available methods for Vs prediction are empirical mathematical models. Shear wave velocity can be estimated using P-wave velocity by applying Castagna’s equation, which is the most common approach. The constants used in Castagna’s equation vary for different lithologies and geological set-ups. In this study, multiple regression analysis has been used for estimation of S-wave velocity. The EMERGE module from Hampson-Russel software has been used here for generation of S-wave log. Both single attribute and multi attributes analysis have been carried out for generation of synthetic S-wave log in Upper Assam basin. Upper Assam basin situated in North Eastern India is one of the most important petroleum provinces of India. The present study was carried out using four wells of the study area. Out of these wells, S-wave velocity was available for three wells. The main objective of the present study is a prediction of shear wave velocities for wells where S-wave velocity information is not available. The three wells having S-wave velocity were first used to test the reliability of the method and the generated S-wave log was compared with actual S-wave log. Single attribute analysis has been carried out for these three wells within the depth range 1700-2100m, which corresponds to Barail group of Oligocene age. The Barail Group is the main target zone in this study, which is the primary producing reservoir of the basin. A system generated list of attributes with varying degrees of correlation appeared and the attribute with the highest correlation was concerned for the single attribute analysis. Crossplot between the attributes shows the variation of points from line of best fit. The final result of the analysis was compared with the available S-wave log, which shows a good visual fit with a correlation of 72%. Next multi-attribute analysis has been carried out for the same data using all the wells within the same analysis window. A high correlation of 85% has been observed between the output log from the analysis and the recorded S-wave. The almost perfect fit between the synthetic S-wave and the recorded S-wave log validates the reliability of the method. For further authentication, the generated S-wave data from the wells have been tied to the seismic and correlated them. Synthetic share wave log has been generated for the well M2 where S-wave is not available and it shows a good correlation with the seismic. Neutron porosity, density, AI and P-wave velocity are proved to be the most significant variables in this statistical method for S-wave generation. Multilinear regression method thus can be considered as a reliable technique for generation of shear wave velocity log in this study.Keywords: Castagna's equation, multi linear regression, multi attribute analysis, shear wave logs
Procedia PDF Downloads 22928913 Microstructural Characterization and Mechanical Properties of Al-2Mn-5Fe Ternary Eutectic Alloy
Authors: Emin Çadirli, Izzettin Yilmazer, Uğur Büyük, Hasan Kaya
Abstract:
Al-2Mn-5Fe eutectic alloy (wt.%) was prepared in a graphite crucible under vacuum atmosphere. The samples were directionally solidified upward at a constant temperature gradient in four different of growth rates by using a Bridgman method. The values of eutectic spacing were measured from longitudinal and transverse sections of the samples. The dependence of eutectic spacing on the growth rate was determined by using linear regression analysis. The microhardness and tensile strength of the studied alloy also were measured from directionally solidified samples. The dependency of the microhardness and tensile strength for directionally solidified Al-2Mn-5Fe eutectic alloy on the growth rate were investigated and the relationships between them were experimentally obtained by using regression analysis. The results obtained in present work were compared with the previous similar experimental results obtained for binary and ternary alloys.Keywords: eutectic alloy, microhardness, microstructure, tensile strength
Procedia PDF Downloads 47328912 The Effect of Non-Surgical Periodontal Therapy on Metabolic Control in Children
Authors: Areej Al-Khabbaz, Swapna Goerge, Majedah Abdul-Rasoul
Abstract:
Introduction: The most prevalent periodontal disease among children is gingivitis, and it usually becomes more severe in adolescence. A number of intervention studies suggested that resolution of periodontal inflammation can improve metabolic control in patients diagnosed with diabetes mellitus. Aim: to assess the effect of non-surgical periodontal therapy on glycemic control of children diagnosed with diabetes mellitus. Method: Twenty-eight children diagnosed with diabetes mellitus were recruited with established diagnosis diabetes for at least 1 year. Informed consent and child assent form were obtained from children and parents prior to enrolment. The dental examination for the participants was performed on the same week directly following their annual medical assessment. All patients had their glycosylated hemoglobin (HbA1c%) test one week prior to their annual medical and dental visit and 3 months following non-surgical periodontal therapy. All patients received a comprehensive periodontal examination The periodontal assessment included clinical attachment loss, bleeding on probing, plaque score, plaque index and gingival index. All patients were referred for non-surgical periodontal therapy, which included oral hygiene instruction and motivation followed by supra-gingival and subg-ingival scaling using ultrasonic and hand instruments. Statistical Analysis: Data were entered and analyzed using the Statistical Package for Social Science software (SPSS, Chicago, USA), version 18. Statistical analysis of clinical findings was performed to detect differences between the two groups in term of periodontal findings and HbA1c%. Binary logistic regression analysis was performed in order to examine which factors were significant in multivariate analysis after adjusting for confounding between effects. The regression model used the dependent variable ‘Improved glycemic control’, and the independent variables entered in the model were plaque index, gingival index, bleeding %, plaque Statistical significance was set at p < 0.05. Result: A total of 28 children. The mean age of the participants was 13.3±1.92 years. The study participants were divided into two groups; Compliant group (received dental scaling) and non-complaints group (received oral hygiene instructions only). No statistical difference was found between compliant and non-compliant group in age, gender distribution, oral hygiene practice and the level of diabetes control. There was a significant difference between compliant and non-compliant group in term of improvement of HBa1c before and after periodontal therapy. Mean gingival index was the only significant variable associated with improved glycemic control level. In conclusion, this study has demonstrated that non-surgical mechanical periodontal therapy can improve HbA1c% control. The result of this study confirmed that children with diabetes mellitus who are compliant to dental care and have routine professional scaling may have better metabolic control compared to diabetic children who are erratic with dental care.Keywords: children, diabetes, metabolic control, periodontal therapy
Procedia PDF Downloads 16128911 A Hybrid Fuzzy Clustering Approach for Fertile and Unfertile Analysis
Authors: Shima Soltanzadeh, Mohammad Hosain Fazel Zarandi, Mojtaba Barzegar Astanjin
Abstract:
Diagnosis of male infertility by the laboratory tests is expensive and, sometimes it is intolerable for patients. Filling out the questionnaire and then using classification method can be the first step in decision-making process, so only in the cases with a high probability of infertility we can use the laboratory tests. In this paper, we evaluated the performance of four classification methods including naive Bayesian, neural network, logistic regression and fuzzy c-means clustering as a classification, in the diagnosis of male infertility due to environmental factors. Since the data are unbalanced, the ROC curves are most suitable method for the comparison. In this paper, we also have selected the more important features using a filtering method and examined the impact of this feature reduction on the performance of each methods; generally, most of the methods had better performance after applying the filter. We have showed that using fuzzy c-means clustering as a classification has a good performance according to the ROC curves and its performance is comparable to other classification methods like logistic regression.Keywords: classification, fuzzy c-means, logistic regression, Naive Bayesian, neural network, ROC curve
Procedia PDF Downloads 33628910 Competitors’ Influence Analysis of a Retailer by Using Customer Value and Huff’s Gravity Model
Authors: Yepeng Cheng, Yasuhiko Morimoto
Abstract:
Customer relationship analysis is vital for retail stores, especially for supermarkets. The point of sale (POS) systems make it possible to record the daily purchasing behaviors of customers as an identification point of sale (ID-POS) database, which can be used to analyze customer behaviors of a supermarket. The customer value is an indicator based on ID-POS database for detecting the customer loyalty of a store. In general, there are many supermarkets in a city, and other nearby competitor supermarkets significantly affect the customer value of customers of a supermarket. However, it is impossible to get detailed ID-POS databases of competitor supermarkets. This study firstly focused on the customer value and distance between a customer's home and supermarkets in a city, and then constructed the models based on logistic regression analysis to analyze correlations between distance and purchasing behaviors only from a POS database of a supermarket chain. During the modeling process, there are three primary problems existed, including the incomparable problem of customer values, the multicollinearity problem among customer value and distance data, and the number of valid partial regression coefficients. The improved customer value, Huff’s gravity model, and inverse attractiveness frequency are considered to solve these problems. This paper presents three types of models based on these three methods for loyal customer classification and competitors’ influence analysis. In numerical experiments, all types of models are useful for loyal customer classification. The type of model, including all three methods, is the most superior one for evaluating the influence of the other nearby supermarkets on customers' purchasing of a supermarket chain from the viewpoint of valid partial regression coefficients and accuracy.Keywords: customer value, Huff's Gravity Model, POS, Retailer
Procedia PDF Downloads 12328909 GIS-Based Spatial Distribution and Evaluation of Selected Heavy Metals Contamination in Topsoil around Ecton Mining Area, Derbyshire, UK
Authors: Zahid O. Alibrahim, Craig D. Williams, Clive L. Roberts
Abstract:
The study area (Ecton mining area) is located in the southern part of the Peak District in Derbyshire, England. It is bounded by the River Manifold from the west. This area has been mined for a long period. As a result, huge amounts of potentially toxic metals were released into the surrounding area and are most likely to be a significant source of heavy metal contamination to the local soil, water and vegetation. In order to appraise the potential heavy metal pollution in this area, 37 topsoil samples (5-20 cm depth) were collected and analysed for their total content of Cu, Pb, Zn, Mn, Cr, Ni and V using ICP (Inductively Coupled Plasma) optical emission spectroscopy. Multivariate Geospatial analyses using the GIS technique were utilised to draw geochemical maps of the metals of interest over the study area. A few hotspot points, areas of elevated concentrations of metals, were specified, which are presumed to be the results of anthropogenic activities. In addition, the soil’s environmental quality was evaluated by calculating the Mullers’ Geoaccumulation index (I geo), which suggests that the degree of contamination of the investigated heavy metals has the following trend: Pb > Zn > Cu > Mn > Ni = Cr = V. Furthermore, the potential ecological risk, using the enrichment factor (EF), was also specified. On the basis of the calculated amount or the EF, the levels of pollution for the studied metals in the study area have the following order: Pb>Zn>Cu>Cr>V>Ni>Mn.Keywords: enrichment factor, geoaccumulation index, GIS, heavy metals, multivariate analysis
Procedia PDF Downloads 35828908 Solving Single Machine Total Weighted Tardiness Problem Using Gaussian Process Regression
Authors: Wanatchapong Kongkaew
Abstract:
This paper proposes an application of probabilistic technique, namely Gaussian process regression, for estimating an optimal sequence of the single machine with total weighted tardiness (SMTWT) scheduling problem. In this work, the Gaussian process regression (GPR) model is utilized to predict an optimal sequence of the SMTWT problem, and its solution is improved by using an iterated local search based on simulated annealing scheme, called GPRISA algorithm. The results show that the proposed GPRISA method achieves a very good performance and a reasonable trade-off between solution quality and time consumption. Moreover, in the comparison of deviation from the best-known solution, the proposed mechanism noticeably outperforms the recently existing approaches.Keywords: Gaussian process regression, iterated local search, simulated annealing, single machine total weighted tardiness
Procedia PDF Downloads 30928907 Acne Vulgaris Association with Smoking and Body Mass Index in Jordanian Young Adults
Authors: Almutazballlah Bassam Qablan, Jihan M. Muhaidat, bana Abu Rajab
Abstract:
Background: Acne vulgaris is considered one of the most common skin conditions encountered by dermatologists. It is a chronic inflammation affecting the pilosebaceous unit. Although acne vulgaris is not fatal, it leads to permanent scarring and disfigurement, and even without scarring, it has a huge effect on patients, causing negative health outcomes. Acne vulgaris patients experience psychological, and emotional ramifications as those with chronic health problems; they feel depressed, angry, anxious, and confused. Although acne is a popular disease, many thoughts and myths are still discussed about its origins and triggering factors. These myths can make you feel guilt as if you were somehow responsible for your acne. In this case control study, we want to define the relationship between two modifiable risk factors ;BMI and smoking, with acne vulgaris. Methods: A case-control study was conducted at King Abdullah University Hospital in Ramtha, Jordan in 2019/2020. A total number of 325 participants between 14 and 33 years of age were interviewed by the authors; including 163 acne vulgaris cases and 162 controls without acne vulgaris. Anthropometric measures and smoking for Acne patients and control participants were the independent variables used to assess acne. Univariate and multivariate analysis were used to compare the characteristics of people who reported acne with those with no acne. The collected data analyzed by using the Statistical Package for Social Sciences (SPSS). Results: Cigarette smoking was highly associated with controls; odds ratio 0.4 (95% CI: 0.2–0.9) , P-value = 0.018. BMI and waterpipe smoking were statistically insignificant with acne in the multivariate analysis. Conclusion: We found that cigarette smoking was protective against Acne. There was a statistically insignificant relation between BMI, waterpipe smoking and the development of Acne Vulgaris.Keywords: acne, adolescents, BMI, smoking, case-control, risk factors
Procedia PDF Downloads 9428906 The Profit Trend of Cosmetics Products Using Bootstrap Edgeworth Approximation
Authors: Edlira Donefski, Lorenc Ekonomi, Tina Donefski
Abstract:
Edgeworth approximation is one of the most important statistical methods that has a considered contribution in the reduction of the sum of standard deviation of the independent variables’ coefficients in a Quantile Regression Model. This model estimates the conditional median or other quantiles. In this paper, we have applied approximating statistical methods in an economical problem. We have created and generated a quantile regression model to see how the profit gained is connected with the realized sales of the cosmetic products in a real data, taken from a local business. The Linear Regression of the generated profit and the realized sales was not free of autocorrelation and heteroscedasticity, so this is the reason that we have used this model instead of Linear Regression. Our aim is to analyze in more details the relation between the variables taken into study: the profit and the finalized sales and how to minimize the standard errors of the independent variable involved in this study, the level of realized sales. The statistical methods that we have applied in our work are Edgeworth Approximation for Independent and Identical distributed (IID) cases, Bootstrap version of the Model and the Edgeworth approximation for Bootstrap Quantile Regression Model. The graphics and the results that we have presented here identify the best approximating model of our study.Keywords: bootstrap, edgeworth approximation, IID, quantile
Procedia PDF Downloads 15928905 Spatial Time Series Models for Rice and Cassava Yields Based on Bayesian Linear Mixed Models
Authors: Panudet Saengseedam, Nanthachai Kantanantha
Abstract:
This paper proposes a linear mixed model (LMM) with spatial effects to forecast rice and cassava yields in Thailand at the same time. A multivariate conditional autoregressive (MCAR) model is assumed to present the spatial effects. A Bayesian method is used for parameter estimation via Gibbs sampling Markov Chain Monte Carlo (MCMC). The model is applied to the rice and cassava yields monthly data which have been extracted from the Office of Agricultural Economics, Ministry of Agriculture and Cooperatives of Thailand. The results show that the proposed model has better performance in most provinces in both fitting part and validation part compared to the simple exponential smoothing and conditional auto regressive models (CAR) from our previous study.Keywords: Bayesian method, linear mixed model, multivariate conditional autoregressive model, spatial time series
Procedia PDF Downloads 39528904 Regional Flood Frequency Analysis in Narmada Basin: A Case Study
Authors: Ankit Shah, R. K. Shrivastava
Abstract:
Flood and drought are two main features of hydrology which affect the human life. Floods are natural disasters which cause millions of rupees’ worth of damage each year in India and the whole world. Flood causes destruction in form of life and property. An accurate estimate of the flood damage potential is a key element to an effective, nationwide flood damage abatement program. Also, the increase in demand of water due to increase in population, industrial and agricultural growth, has let us know that though being a renewable resource it cannot be taken for granted. We have to optimize the use of water according to circumstances and conditions and need to harness it which can be done by construction of hydraulic structures. For their safe and proper functioning of hydraulic structures, we need to predict the flood magnitude and its impact. Hydraulic structures play a key role in harnessing and optimization of flood water which in turn results in safe and maximum use of water available. Mainly hydraulic structures are constructed on ungauged sites. There are two methods by which we can estimate flood viz. generation of Unit Hydrographs and Flood Frequency Analysis. In this study, Regional Flood Frequency Analysis has been employed. There are many methods for estimating the ‘Regional Flood Frequency Analysis’ viz. Index Flood Method. National Environmental and Research Council (NERC Methods), Multiple Regression Method, etc. However, none of the methods can be considered universal for every situation and location. The Narmada basin is located in Central India. It is drained by most of the tributaries, most of which are ungauged. Therefore it is very difficult to estimate flood on these tributaries and in the main river. As mentioned above Artificial Neural Network (ANN)s and Multiple Regression Method is used for determination of Regional flood Frequency. The annual peak flood data of 20 sites gauging sites of Narmada Basin is used in the present study to determine the Regional Flood relationships. Homogeneity of the considered sites is determined by using the Index Flood Method. Flood relationships obtained by both the methods are compared with each other, and it is found that ANN is more reliable than Multiple Regression Method for the present study area.Keywords: artificial neural network, index flood method, multi layer perceptrons, multiple regression, Narmada basin, regional flood frequency
Procedia PDF Downloads 41828903 Analytical Modelling of Surface Roughness during Compacted Graphite Iron Milling Using Ceramic Inserts
Authors: Ş. Karabulut, A. Güllü, A. Güldaş, R. Gürbüz
Abstract:
This study investigates the effects of the lead angle and chip thickness variation on surface roughness during the machining of compacted graphite iron using ceramic cutting tools under dry cutting conditions. Analytical models were developed for predicting the surface roughness values of the specimens after the face milling process. Experimental data was collected and imported to the artificial neural network model. A multilayer perceptron model was used with the back propagation algorithm employing the input parameters of lead angle, cutting speed and feed rate in connection with chip thickness. Furthermore, analysis of variance was employed to determine the effects of the cutting parameters on surface roughness. Artificial neural network and regression analysis were used to predict surface roughness. The values thus predicted were compared with the collected experimental data, and the corresponding percentage error was computed. Analysis results revealed that the lead angle is the dominant factor affecting surface roughness. Experimental results indicated an improvement in the surface roughness value with decreasing lead angle value from 88° to 45°.Keywords: CGI, milling, surface roughness, ANN, regression, modeling, analysis
Procedia PDF Downloads 44828902 Agriculture Yield Prediction Using Predictive Analytic Techniques
Authors: Nagini Sabbineni, Rajini T. V. Kanth, B. V. Kiranmayee
Abstract:
India’s economy primarily depends on agriculture yield growth and their allied agro industry products. The agriculture yield prediction is the toughest task for agricultural departments across the globe. The agriculture yield depends on various factors. Particularly countries like India, majority of agriculture growth depends on rain water, which is highly unpredictable. Agriculture growth depends on different parameters, namely Water, Nitrogen, Weather, Soil characteristics, Crop rotation, Soil moisture, Surface temperature and Rain water etc. In our paper, lot of Explorative Data Analysis is done and various predictive models were designed. Further various regression models like Linear, Multiple Linear, Non-linear models are tested for the effective prediction or the forecast of the agriculture yield for various crops in Andhra Pradesh and Telangana states.Keywords: agriculture yield growth, agriculture yield prediction, explorative data analysis, predictive models, regression models
Procedia PDF Downloads 31328901 Analyzing the Influence of Principals’ Cultural Intelligence on Teachers’ Perceived Diversity Climate
Authors: Meghry Nazarian, Ibrahim Duyar
Abstract:
Effective management of a diverse workforce in the United Arab Emirates (UAE) presents peculiar importance as two-thirds of residents are expatriates who have diverse ethnic and cultural backgrounds. Like any other organization in the country, UAE schools have become upmost diverse settings in the world. The purpose of this study was to examine whether principals’ cultural intelligence has direct and indirect (moderating) influences on teachers’ perceived diversity climate. A quantitative causal-comparative research design was employed to analyze the data. Participants included random samples of principals and teachers working in the private and charter schools in the Emirate of Abu Dhabi. The data-gathering online questionnaires included previously developed and validated scales as the measures of study variables. More specifically, the multidimensional short-form measure of Cultural Intelligence (CQ) and the diversity climate scale were used to measure the study variables. Multivariate statistics, including the analysis of multivariate analysis of variance (MANCOVA) and structural equation modeling (SEM), were employed to examine the relationships between the study variables. The preliminary analyses of data showed that principals and teachers have differing views of diversity management and climate in schools. Findings also showed that principals’ cultural intelligence has both direct and moderating influences on teachers’ perceived diversity climate. The study findings are expected to inform policymakers and practicing educational leaders in addressing diversity management in a country where the majority of the residents are the minority who have diverse ethnic and cultural backgrounds.Keywords: diversity management, united arab emirates, school principals’ cultural intelligence (CQ), teachers’ perceived diversity climate
Procedia PDF Downloads 11228900 A Multivariate Analysis of Patent Price Variations in the Emerging United States Patent Auction Market: Role of Patent, Seller, and Bundling Related Characteristics
Authors: Pratheeba Subramanian, Anjula Gurtoo, Mary Mathew
Abstract:
Transaction of patents in emerging patent markets is gaining momentum. Pricing patents for a transaction say patent sale remains a challenge. Patents vary in their pricing with some patents fetching higher prices than others. Sale of patents in portfolios further complicates pricing with multiple patents playing a role in pricing a bundle. In this paper, a set of 138 US patents sold individually as single invention lots and 462 US patents sold in bundles of 120 portfolios are investigated to understand the dynamics of selling prices of singletons and portfolios and their determinants. Firstly, price variations when patents are sold individually as singletons and portfolios are studied. Multivariate statistical techniques are used for analysis both at the lot level as well as at the individual patent level. The results show portfolios fetching higher prices than singletons at the lot level. However, at the individual patent level singletons show higher prices than per patent price of individual patent members within the portfolio. Secondly, to understand the price determinants, the effect of patent, seller, and bundling related characteristics on selling prices is studied separately for singletons and portfolios. The results show differences in the set of characteristics determining prices of singletons and portfolios. Selling prices of singletons are found to be dependent on the patent related characteristics, unlike portfolios whose prices are found to be dependent on all three aspects – patent, seller, and bundling. The specific patent, seller and bundling characteristics influencing selling price are discussed along with the implications.Keywords: auction, patents, portfolio bundling, seller type, selling price, singleton
Procedia PDF Downloads 32828899 The Role of Self-Confidence, Adversity Quotient, and Self-Efficacy Critical Thinking: Path Model
Authors: Bayu Dwi Cahyo, Ekohariadi, Theodorus Wiyanto Wibowo, I. G. P. Asto Budithahjanto, Eppy Yundra
Abstract:
The objective of this study is to examine the effects of self-confidence, adversity quotient, and self-efficacy variables on critical thinking. This research's participants are 137 cadets of Aviation Polytechnics of Surabaya with the sampling technique that was purposive sampling. In this study, the data collection method used a questionnaire with Linkert-scale and distributed or given to respondents by the specified number of samples. The SPSS AMOS v23 was used to test a number of a priori multivariate growth curve models and examining relationships between the variables via path analysis. The result of path analysis was (χ² = 88.463, df= 71, χ² /df= 1.246, GFI= .914, CFI= .988, P= .079, AGFI= .873, TLI= .985, RMSEA= .043). According to the analysis, there is a positive and significant relationship between self-confidence, adversity quotient, and self-efficacy variables on critical thinking.Keywords: self-confidence, adversity quotient, self-efficacy variables, critical thinking
Procedia PDF Downloads 14428898 Antecedents of Sport Commitment: A Comparison Based on Demographic Factors
Authors: Navodita Mishra, T. J. Kamalanabhan
Abstract:
Purpose: The primary purpose of this study was to identify the antecedents of sports commitment among cricket players and to understand demographic variables that may impact these factors. Commitment towards one’s sports plays a crucial role in determining discipline and efforts of the player. Moreover, demographic variables would seem to play an important role in determining which factors or predictors have the greatest impact on commitment level. Design /methodology/approach: This study hypothesized the effect of demographic factors on sports commitment among cricket players. It attempts to examine the extent to which demographic factors can differentially motivate players to exhibit commitment towards their respective sport. Questionnaire survey method was adopted using purposive sampling technique. Using Multiple Regression, ANOVA, and t-test, the hypotheses were tested based on a sample of 350 players from Cricket Academy. Findings: Our main results from the multivariate analysis indicated that enjoyment and leadership of coach and peer affect the level of commitment to a greater extent whereas personal investment is a significant predictor of commitment among rural background players Moreover, level of sport commitment among players is positively related to household income, the rural background players participate in sports to a greater extent than the urban players, there is no evidence of regional differentials in commitment but age differences (i.e. U-19 vs. U-25) play an important role in the decision to continue the participation in sports.Keywords: Individual Sports Commitment, demographic indicators, cricket, player motivation
Procedia PDF Downloads 481