Categorical Data Modeling: Logistic Regression Software
Authors: Abdellatif Tchantchane
Abstract:
A Matlab based software for logistic regression is developed to enhance the process of teaching quantitative topics and assist researchers with analyzing wide area of applications where categorical data is involved. The software offers an option of performing stepwise logistic regression to select the most significant predictors. The software includes a feature to detect influential observations in data, and investigates the effect of dropping or misclassifying an observation on a predictor variable. The input data may consist either as a set of individual responses (yes/no) with the predictor variables or as grouped records summarizing various categories for each unique set of predictor variables' values. Graphical displays are used to output various statistical results and to assess the goodness of fit of the logistic regression model. The software recognizes possible convergence constraints when present in data, and the user is notified accordingly.
Keywords: Logistic regression, Matlab, Categorical data, Influential observation.
Digital Object Identifier (DOI): doi.org/10.5281/zenodo.1077399
Procedia APA BibTeX Chicago EndNote Harvard JSON MLA RIS XML ISO 690 PDF Downloads 1887References:
[1] Mark A. Huselid and Nancy E. Day , "Organizational commitment, job involvement, and turnover: A substantive and methodological analysis", Journal of Applied Psychology 1991, vol. 76, N0 3 380-391).
[2] Elizabeth N. King and Thomas P. Ryan "A preliminary investigation of maximum likelihood logistic regression versus exact logistic regression", The American Statistician, August 2002, vol. 56 No 3.
[3] Yasunari Takagi, Osamu Mizuno and Tohru Kikuno, Empirical Software Engineering, 10, 495-515, 2005.
[4] (Santner T.J. and Duffy, E.D. (1986), "A note on A. Albert and J.A. Anderson's conditions for the existence of maximum likelihood estimates in logistic regression models." Biometrika, 73, pp. 755-758
[5] D. R. Cox and Nanny Wermuth, "A comment on the coefficient of determination for binary responses." The American Statistician, February 1992, vol. 46 N0 1.
[6] Marija J. Norusis, "SPSS 16.0 Advanced Statistical Procedures Companion", Prentice Hall Inc (2008). ISBN- 13:978-0-13-606140-3.
[7] Michael P.Fay, "Measuring a binary response's range of influential in logistic regression", The American Statistician, February 2002, vol. 56. N0 1.
[8] Iain Pardoe and R. Dennis Cook "A graphical method for assessing the fit of a logistic regression model", The American Statistician, November 2002, vol. 56, No 4.
[9] Nicholas J. Horton and Stuart R. Lipsitz , "Review of software to fit generalized estimation equation regression models", The American Statistician, May 1999, vol. 53.
[10] Jeffrey S. Simonoff, "Logistic regression, categorical predictors, and goodness-of-fit: It depends on who you ask", The American Statistician, February 1998, vol. 52 N0 1.