**Commenced**in January 2007

**Frequency:**Monthly

**Edition:**International

**Paper Count:**718

# Search results for: regression.

##### 718 Relationship between Sums of Squares in Linear Regression and Semi-parametric Regression

**Authors:**
Dursun Aydın,
Bilgin Senel

**Abstract:**

**Keywords:**
Semi-parametric regression,
Penalized LeastSquares,
Residuals,
Deviance,
Smoothing Spline.

##### 717 A Comparison of the Sum of Squares in Linear and Partial Linear Regression Models

**Authors:**
Dursun Aydın

**Abstract:**

**Keywords:**
Partial Linear Regression Model,
Linear RegressionModel,
Residuals,
Deviance,
Smoothing Spline.

##### 716 A Comparison of the Nonparametric Regression Models using Smoothing Spline and Kernel Regression

**Authors:**
Dursun Aydin

**Abstract:**

**Keywords:**
Kernel regression,
Nonparametric models,
Prediction,
Smoothing spline.

##### 715 Orthogonal Regression for Nonparametric Estimation of Errors-in-Variables Models

**Authors:**
Anastasiia Yu. Timofeeva

**Abstract:**

Two new algorithms for nonparametric estimation of errors-in-variables models are proposed. The first algorithm is based on penalized regression spline. The spline is represented as a piecewise-linear function and for each linear portion orthogonal regression is estimated. This algorithm is iterative. The second algorithm involves locally weighted regression estimation. When the independent variable is measured with error such estimation is a complex nonlinear optimization problem. The simulation results have shown the advantage of the second algorithm under the assumption that true smoothing parameters values are known. Nevertheless the use of some indexes of fit to smoothing parameters selection gives the similar results and has an oversmoothing effect.

**Keywords:**
Grade point average,
orthogonal regression,
penalized regression spline,
locally weighted regression.

##### 714 On the outlier Detection in Nonlinear Regression

**Authors:**
Hossein Riazoshams,
Midi Habshah,
Jr.,
Mohamad Bakri Adam

**Abstract:**

**Keywords:**
Nonlinear Regression,
outliers,
Gradient,
LeastSquare,
M-estimate,
MM-estimate.

##### 713 Robust Regression and its Application in Financial Data Analysis

**Authors:**
Mansoor Momeni,
Mahmoud Dehghan Nayeri,
Ali Faal Ghayoumi,
Hoda Ghorbani

**Abstract:**

This research is aimed to describe the application of robust regression and its advantages over the least square regression method in analyzing financial data. To do this, relationship between earning per share, book value of equity per share and share price as price model and earning per share, annual change of earning per share and return of stock as return model is discussed using both robust and least square regressions, and finally the outcomes are compared. Comparing the results from the robust regression and the least square regression shows that the former can provide the possibility of a better and more realistic analysis owing to eliminating or reducing the contribution of outliers and influential data. Therefore, robust regression is recommended for getting more precise results in financial data analysis.

**Keywords:**
Financial data analysis,
Influential data,
Outliers,
Robust regression.

##### 712 Regression Test Selection Technique for Multi-Programming Language

**Authors:**
Walid S. Abd El-hamid,
Sherif S. El-Etriby,
Mohiy M. Hadhoud

**Abstract:**

**Keywords:**
Regression testing,
testing,
test selection,
softwareevolution,
software maintenance.

##### 711 Model-Based Software Regression Test Suite Reduction

**Authors:**
Shiwei Deng,
Yang Bao

**Abstract:**

**Keywords:**
Dependence analysis,
EFSM model,
greedy
algorithm,
regression test.

##### 710 Stock Market Prediction by Regression Model with Social Moods

**Authors:**
Masahiro Ohmura,
Koh Kakusho,
Takeshi Okadome

**Abstract:**

This paper presents a regression model with autocorrelated errors in which the inputs are social moods obtained by analyzing the adjectives in Twitter posts using a document topic model, where document topics are extracted using LDA. The regression model predicts Dow Jones Industrial Average (DJIA) more precisely than autoregressive moving-average models.

**Keywords:**
Regression model,
social mood,
stock market
prediction,
Twitter.

##### 709 A Fuzzy Linear Regression Model Based on Dissemblance Index

**Authors:**
Shih-Pin Chen,
Shih-Syuan You

**Abstract:**

**Keywords:**
Dissemblance index,
fuzzy linear regression,
graded
mean integration,
mathematical programming.

##### 708 Segmentation of Piecewise Polynomial Regression Model by Using Reversible Jump MCMC Algorithm

**Authors:**
Suparman

**Abstract:**

Piecewise polynomial regression model is very flexible model for modeling the data. If the piecewise polynomial regression model is matched against the data, its parameters are not generally known. This paper studies the parameter estimation problem of piecewise polynomial regression model. The method which is used to estimate the parameters of the piecewise polynomial regression model is Bayesian method. Unfortunately, the Bayes estimator cannot be found analytically. Reversible jump MCMC algorithm is proposed to solve this problem. Reversible jump MCMC algorithm generates the Markov chain that converges to the limit distribution of the posterior distribution of piecewise polynomial regression model parameter. The resulting Markov chain is used to calculate the Bayes estimator for the parameters of piecewise polynomial regression model.

**Keywords:**
Piecewise,
Bayesian,
reversible jump MCMC,
segmentation.

##### 707 Fuzzy Logic Approach to Robust Regression Models of Uncertain Medical Categories

**Authors:**
Arkady Bolotin

**Abstract:**

Dichotomization of the outcome by a single cut-off point is an important part of various medical studies. Usually the relationship between the resulted dichotomized dependent variable and explanatory variables is analyzed with linear regression, probit regression or logistic regression. However, in many real-life situations, a certain cut-off point dividing the outcome into two groups is unknown and can be specified only approximately, i.e. surrounded by some (small) uncertainty. It means that in order to have any practical meaning the regression model must be robust to this uncertainty. In this paper, we show that neither the beta in the linear regression model, nor its significance level is robust to the small variations in the dichotomization cut-off point. As an alternative robust approach to the problem of uncertain medical categories, we propose to use the linear regression model with the fuzzy membership function as a dependent variable. This fuzzy membership function denotes to what degree the value of the underlying (continuous) outcome falls below or above the dichotomization cut-off point. In the paper, we demonstrate that the linear regression model of the fuzzy dependent variable can be insensitive against the uncertainty in the cut-off point location. In the paper we present the modeling results from the real study of low hemoglobin levels in infants. We systematically test the robustness of the binomial regression model and the linear regression model with the fuzzy dependent variable by changing the boundary for the category Anemia and show that the behavior of the latter model persists over a quite wide interval.

**Keywords:**
Categorization,
Uncertain medical categories,
Binomial regression model,
Fuzzy dependent variable,
Robustness.

##### 706 The Relative Efficiency of Parameter Estimation in Linear Weighted Regression

**Authors:**
Baoguang Tian,
Nan Chen

**Abstract:**

A new relative efficiency in linear model in reference is instructed into the linear weighted regression, and its upper and lower bound are proposed. In the linear weighted regression model, for the best linear unbiased estimation of mean matrix respect to the least-squares estimation, two new relative efficiencies are given, and their upper and lower bounds are also studied.

**Keywords:**
Linear weighted regression,
Relative efficiency,
Mean matrix,
Trace.

##### 705 Internet Purchases in European Union Countries: Multiple Linear Regression Approach

**Authors:**
Ksenija Dumičić,
Anita Čeh Časni,
Irena Palić

**Abstract:**

This paper examines economic and Information and Communication Technology (ICT) development influence on recently increasing Internet purchases by individuals for European Union member states. After a growing trend for Internet purchases in EU27 was noticed, all possible regression analysis was applied using nine independent variables in 2011. Finally, two linear regression models were studied in detail. Conducted simple linear regression analysis confirmed the research hypothesis that the Internet purchases in analyzed EU countries is positively correlated with statistically significant variable Gross Domestic Product *per capita *(GDPpc). Also, analyzed multiple linear regression model with four regressors, showing ICT development level, indicates that ICT development is crucial for explaining the Internet purchases by individuals, confirming the research hypothesis.

**Keywords:**
European Union,
Internet purchases,
multiple linear regression model,
outlier

##### 704 Extended Least Squares LS–SVM

**Authors:**
József Valyon,
Gábor Horváth

**Abstract:**

**Keywords:**
Function estimation,
Least–Squares Support VectorMachines,
Regression,
System Modeling

##### 703 Optimization of Slider Crank Mechanism Using Design of Experiments and Multi-Linear Regression

**Authors:**
Galal Elkobrosy,
Amr M. Abdelrazek,
Bassuny M. Elsouhily,
Mohamed E. Khidr

**Abstract:**

Crank shaft length, connecting rod length, crank angle, engine rpm, cylinder bore, mass of piston and compression ratio are the inputs that can control the performance of the slider crank mechanism and then its efficiency. Several combinations of these seven inputs are used and compared. The throughput engine torque predicted by the simulation is analyzed through two different regression models, with and without interaction terms, developed according to multi-linear regression using LU decomposition to solve system of algebraic equations. These models are validated. A regression model in seven inputs including their interaction terms lowered the polynomial degree from 3^{rd} degree to 1^{st }degree and suggested valid predictions and stable explanations.

**Keywords:**
Design of experiments,
regression analysis,
SI Engine,
statistical modeling.

##### 702 Churn Prediction: Does Technology Matter?

**Authors:**
John Hadden,
Ashutosh Tiwari,
Rajkumar Roy,
Dymitr Ruta

**Abstract:**

**Keywords:**
Churn,
Decision Trees,
Neural Networks,
Regression.

##### 701 Categorical Data Modeling: Logistic Regression Software

**Authors:**
Abdellatif Tchantchane

**Abstract:**

A Matlab based software for logistic regression is developed to enhance the process of teaching quantitative topics and assist researchers with analyzing wide area of applications where categorical data is involved. The software offers an option of performing stepwise logistic regression to select the most significant predictors. The software includes a feature to detect influential observations in data, and investigates the effect of dropping or misclassifying an observation on a predictor variable. The input data may consist either as a set of individual responses (yes/no) with the predictor variables or as grouped records summarizing various categories for each unique set of predictor variables' values. Graphical displays are used to output various statistical results and to assess the goodness of fit of the logistic regression model. The software recognizes possible convergence constraints when present in data, and the user is notified accordingly.

**Keywords:**
Logistic regression,
Matlab,
Categorical data,
Influential observation.

##### 700 Research on the Problems of Housing Prices in Qingdao from a Macro Perspective

**Authors:**
Liu Zhiyuan,
Sun Zongdi,
Liu Zhiyuan,
Sun Zongdi

**Abstract:**

Qingdao is a seaside city. Taking into account the characteristics of Qingdao, this article established a multiple linear regression model to analyze the impact of macroeconomic factors on housing prices. We used stepwise regression method to make multiple linear regression analysis, and made statistical analysis of F test values and T test values. According to the analysis results, the model is continuously optimized. Finally, this article obtained the multiple linear regression equation and the influencing factors, and the reliability of the model was verified by F test and T test.

**Keywords:**
Housing prices,
multiple linear regression model,
macroeconomic factors,
Qingdao City.

##### 699 Estimating Regression Parameters in Linear Regression Model with a Censored Response Variable

**Authors:**
Jesus Orbe,
Vicente Nunez-Anton

**Abstract:**

In this work we study the effect of several covariates X on a censored response variable T with unknown probability distribution. In this context, most of the studies in the literature can be located in two possible general classes of regression models: models that study the effect the covariates have on the hazard function; and models that study the effect the covariates have on the censored response variable. Proposals in this paper are in the second class of models and, more specifically, on least squares based model approach. Thus, using the bootstrap estimate of the bias, we try to improve the estimation of the regression parameters by reducing their bias, for small sample sizes. Simulation results presented in the paper show that, for reasonable sample sizes and censoring levels, the bias is always smaller for the new proposals.

**Keywords:**
Censored response variable,
regression,
bias.

##### 698 Adjusted Ratio and Regression Type Estimators for Estimation of Population Mean when some Observations are missing

**Authors:**
Nuanpan Nangsue

**Abstract:**

Ratio and regression type estimators have been used by previous authors to estimate a population mean for the principal variable from samples in which both auxiliary x and principal y variable data are available. However, missing data are a common problem in statistical analyses with real data. Ratio and regression type estimators have also been used for imputing values of missing y data. In this paper, six new ratio and regression type estimators are proposed for imputing values for any missing y data and estimating a population mean for y from samples with missing x and/or y data. A simulation study has been conducted to compare the six ratio and regression type estimators with a previous estimator of Rueda. Two population sizes N = 1,000 and 5,000 have been considered with sample sizes of 10% and 30% and with correlation coefficients between population variables X and Y of 0.5 and 0.8. In the simulations, 10 and 40 percent of sample y values and 10 and 40 percent of sample x values were randomly designated as missing. The new ratio and regression type estimators give similar mean absolute percentage errors that are smaller than the Rueda estimator for all cases. The new estimators give a large reduction in errors for the case of 40% missing y values and sampling fraction of 30%.

**Keywords:**
Auxiliary variable,
missing data,
ratio and regression
type estimators.

##### 697 Quality of Service Evaluation using a Combination of Fuzzy C-Means and Regression Model

**Authors:**
Aboagela Dogman,
Reza Saatchi,
Samir Al-Khayatt

**Abstract:**

**Keywords:**
Fuzzy C-means; regression model,
network quality
of service

##### 696 Defect Cause Modeling with Decision Tree and Regression Analysis

**Authors:**
B. Bakır,
İ. Batmaz,
F. A. Güntürkün,
İ. A. İpekçi,
G. Köksal,
N. E. Özdemirel

**Abstract:**

**Keywords:**
Casting industry,
decision tree algorithm C5.0,
logistic regression,
quality improvement.

##### 695 Performance Analysis of Adaptive LMS Filter through Regression Analysis using SystemC

**Authors:**
Hyeong-Geon Lee,
Jae-Young Park,
Suk-ki Lee,
Jong-Tae Kim

**Abstract:**

The LMS adaptive filter has several parameters which can affect their performance. From among these parameters, most papers handle the step size parameter for controlling the performance. In this paper, we approach three parameters: step-size, filter tap-size and filter form. The regression analysis is used for defining the relation between parameters and performance of LMS adaptive filter with using the system level simulation results. The results present that all parameters have performance trends in each own particular form, which can be estimated from equations drawn by regression analysis.

**Keywords:**
System level model,
adaptive LMS FIR filter,
regression analysis,
systemC.

##### 694 Density Estimation using Generalized Linear Model and a Linear Combination of Gaussians

**Authors:**
Aly Farag,
Ayman El-Baz,
Refaat Mohamed

**Abstract:**

In this paper we present a novel approach for density estimation. The proposed approach is based on using the logistic regression model to get initial density estimation for the given empirical density. The empirical data does not exactly follow the logistic regression model, so, there will be a deviation between the empirical density and the density estimated using logistic regression model. This deviation may be positive and/or negative. In this paper we use a linear combination of Gaussian (LCG) with positive and negative components as a model for this deviation. Also, we will use the expectation maximization (EM) algorithm to estimate the parameters of LCG. Experiments on real images demonstrate the accuracy of our approach.

**Keywords:**
Logistic regression model,
Expectationmaximization,
Segmentation.

##### 693 Multiple Regression based Graphical Modeling for Images

**Authors:**
Pavan S.,
Sridhar G.,
Sridhar V.

**Abstract:**

Super resolution is one of the commonly referred inference problems in computer vision. In the case of images, this problem is generally addressed using a graphical model framework wherein each node represents a portion of the image and the edges between the nodes represent the statistical dependencies. However, the large dimensionality of images along with the large number of possible states for a node makes the inference problem computationally intractable. In this paper, we propose a representation wherein each node can be represented as acombination of multiple regression functions. The proposed approach achieves a tradeoff between the computational complexity and inference accuracy by varying the number of regression functions for a node.

**Keywords:**
Belief propagation,
Graphical model,
Regression,
Super resolution.

##### 692 Empirical Statistical Modeling of Rainfall Prediction over Myanmar

**Authors:**
Wint Thida Zaw,
Thinn Thu Naing

**Abstract:**

**Keywords:**
Polynomial Regression,
Rainfall Forecasting,
Statistical forecasting.

##### 691 Estimation of Time -Varying Linear Regression with Unknown Time -Volatility via Continuous Generalization of the Akaike Information Criterion

**Authors:**
Elena Ezhova,
Vadim Mottl,
Olga Krasotkina

**Abstract:**

**Keywords:**
Time varying regression,
time-volatility of regression
coefficients,
Akaike Information Criterion (AIC),
Kullback information
maximization principle.

##### 690 Comparison of Neural Network and Logistic Regression Methods to Predict Xerostomia after Radiotherapy

**Authors:**
Hui-Min Ting,
Tsair-Fwu Lee,
Ming-Yuan Cho,
Pei-Ju Chao,
Chun-Ming Chang,
Long-Chang Chen,
Fu-Min Fang

**Abstract:**

To evaluate the ability to predict xerostomia after radiotherapy, we constructed and compared neural network and logistic regression models. In this study, 61 patients who completed a questionnaire about their quality of life (QoL) before and after a full course of radiation therapy were included. Based on this questionnaire, some statistical data about the condition of the patients’ salivary glands were obtained, and these subjects were included as the inputs of the neural network and logistic regression models in order to predict the probability of xerostomia. Seven variables were then selected from the statistical data according to Cramer’s V and point-biserial correlation values and were trained by each model to obtain the respective outputs which were 0.88 and 0.89 for AUC, 9.20 and 7.65 for SSE, and 13.7% and 19.0% for MAPE, respectively. These parameters demonstrate that both neural network and logistic regression methods are effective for predicting conditions of parotid glands.

**Keywords:**
NPC,
ANN,
logistic regression,
xerostomia.

##### 689 Bioprocess Optimization Based On Relevance Vector Regression Models and Evolutionary Programming Technique

**Authors:**
R. Simutis,
V. Galvanauskas,
D. Levisauskas,
J. Repsyte

**Abstract:**

This paper proposes a bioprocess optimization procedure based on Relevance Vector Regression models and evolutionary programming technique. Relevance Vector Regression scheme allows developing a compact and stable data-based process model avoiding time-consuming modeling expenses. The model building and process optimization procedure could be done in a half-automated way and repeated after every new cultivation run. The proposed technique was tested in a simulated mammalian cell cultivation process. The obtained results are promising and could be attractive for optimization of industrial bioprocesses.

**Keywords:**
Bioprocess optimization,
Evolutionary
programming,
Relevance Vector Regression.