Identifying Factors Contributing to the Spread of Lyme Disease: A Regression Analysis of Virginia’s Data
Commenced in January 2007
Frequency: Monthly
Edition: International
Paper Count: 33093
Identifying Factors Contributing to the Spread of Lyme Disease: A Regression Analysis of Virginia’s Data

Authors: Fatemeh Valizadeh Gamchi, Edward L. Boone

Abstract:

This research focuses on Lyme disease, a widespread infectious condition in the United States caused by the bacterium Borrelia burgdorferi sensu stricto. It is critical to identify environmental and economic elements that are contributing to the spread of the disease. This study examined data from Virginia to identify a subset of explanatory variables significant for Lyme disease case numbers. To identify relevant variables and avoid overfitting, linear poisson, and regularization regression methods such as ridge, lasso, and elastic net penalty were employed. Cross-validation was performed to acquire tuning parameters. The methods proposed can automatically identify relevant disease count covariates. The efficacy of the techniques was assessed using four criteria on three simulated datasets. Finally, using the Virginia Department of Health’s Lyme disease dataset, the study successfully identified key factors, and the results were consistent with previous studies.

Keywords: Lyme disease, Poisson generalized linear model, Ridge regression, Lasso Regression, elastic net regression.

Procedia APA BibTeX Chicago EndNote Harvard JSON MLA RIS XML ISO 690 PDF Downloads 122

References:


[1] R. Murphree Bacon, K. J. Kugeler, and P. S. Mead, “Surveillance for lyme disease–united states, 1992-2006,” 2008.
[2] R. J. Brinkerhoff, W. F. Gilliam, and D. Gaines, “Lyme disease, virginia, usa, 2000–2011,” Emerging infectious diseases, vol. 20, no. 10, p. 1661, 2014.
[3] E. Maes, P. Lecomte, and N. Ray, “A cost-of-illness study of lyme disease in the united states,” Clinical therapeutics, vol. 20, no. 5, pp. 993–1008, 1998.
[4] B. F. Allan, F. Keesing, and R. S. Ostfeld, “Effect of forest fragmentation on lyme disease risk,” Conservation Biology, vol. 17, no. 1, pp. 267–272, 2003.
[5] F. Valizadeh Gamchi, O¨ . Gu¨ru¨nlu¨ Alma, and R. Arabi Belaghi, “Classical and bayesian inference for burr type-iii distribution based on progressive type-ii hybrid censored data,” Mathematical Sciences, vol. 13, pp. 79–95, 2019.
[6] L. E. Jackson, E. D. Hilborn, and J. C. Thomas, “Towards landscape design guidelines for reducing lyme disease risk,” International journal of epidemiology, vol. 35, no. 2, pp. 315–322, 2006.
[7] P. Consul and F. Famoye, “Generalized poisson regression model,” Communications in Statistics-Theory and Methods, vol. 21, no. 1, pp. 89–109, 1992.
[8] J. A. Nelder and R. W. Wedderburn, “Generalized linear models,” Journal of the Royal Statistical Society: Series A (General), vol. 135, no. 3, pp. 370–384, 1972.
[9] A. F. Zuur, E. N. Ieno, N. Walker, A. A. Saveliev, G. M. Smith, A. F. Zuur, E. N. Ieno, N. J. Walker, A. A. Saveliev, and G. M. Smith, “Zero-truncated and zero-inflated models for count data,” Mixed effects models and extensions in ecology with R, pp. 261–293, 2009.
[10] A. Agresti, “An introduction to categorical data analysis,” 1996.
[11] G. Rodrıguez, “Models for count data with overdispersion,” Addendum to the WWS, vol. 509, 2013.
[12] J. Mwikali, S. Mwalili, and A. Wanjoya, “Penalized poisson regression model using elastic net and least absolute shrinkage and selection operator (lasso) penality,” Int. J. Data Sci. Anal, vol. 5, no. 5, pp. 99–103, 2019.
[13] C. Flexeder, “Generalized lasso regularization for regression models,” Ph.D. dissertation, Institut f¨ur Statistik, 2010.
[14] A. E. Hoerl and R. W. Kennard, “Ridge regression: Biased estimation for nonorthogonal problems,” Technometrics, vol. 42, no. 1, pp. 80–86, 2000.
[15] K. M˚ansson and G. Shukur, “A poisson ridge regression estimator,” Economic Modelling, vol. 28, no. 4, pp. 1475–1481, 2011.
[16] R. H. Myers and R. H. Myers, Classical and modern regression with applications. Duxbury press Belmont, CA, 1990, vol. 2.
[17] R. Tibshirani, “Regression shrinkage and selection via the lasso,” Journal of the Royal Statistical Society: Series B (Methodological), vol. 58, no. 1, pp. 267–288, 1996.
[18] M. Y. Park and T. Hastie, “L1-regularization path algorithm for generalized linear models,” Journal of the Royal Statistical Society: Series B (Statistical Methodology), vol. 69, no. 4, pp. 659–677, 2007.
[19] P. Tseng and S. Yun, “A coordinate gradient descent method for nonsmooth separable minimization,” Mathematical Programming, vol. 117, pp. 387–423, 2009.
[20] Z. Qin, K. Scheinberg, and D. Goldfarb, “Efficient block-coordinate descent algorithms for the group lasso,” Mathematical Programming Computation, vol. 5, no. 2, pp. 143–169, 2013.
[21] R. Arabi Belaghi, F. Valizadeh Gamchi, and H. Bevrani, “Likelihood based inference on progressive type-ii hybrid-censored data for burr type iii distribution,” Reliability Theory and its Applications, p. 194, 2016.
[22] T. T. Wu and K. Lange, “Coordinate descent algorithms for lasso penalized regression,” 2008.
[23] L. Kantorovitch, “The method of successive approximation for functional equations,” Acta Mathematica, vol. 71, no. 1, pp. 63–97, 1939.
[24] S. Hossain and E. Ahmed, “Shrinkage and penalty estimators of a poisson regression model,” Australian & New Zealand Journal of Statistics, vol. 54, no. 3, pp. 359–373, 2012.
[25] H. Zou and T. Hastie, “Regularization and variable selection via the elastic net,” Journal of the royal statistical society: series B (statistical methodology), vol. 67, no. 2, pp. 301–320, 2005.
[26] T. Hastie and J. Qian, “Glmnet vignette,” Retrieved June, vol. 9, no. 2016, pp. 1–30, 2014.
[27] B. Ripley, B. Venables, D. M. Bates, K. Hornik, A. Gebhardt, D. Firth, and M. B. Ripley, “Package ‘mass’,” Cran r, vol. 538, pp. 113–120, 2013.
[28] Y. Xie, L. Xu, J. Li, X. Deng, Y. Hong, K. Kolivras, and D. N. Gaines, “Spatial variable selection and an application to virginia lyme disease emergence,” Journal of the American statistical association, vol. 114, no. 528, pp. 1466–1480, 2019.
[29] Z. W. Almquist, “Us census spatial and demographic data in r: the uscensus2000 suite of packages,” Journal of Statistical Software, vol. 37, pp. 1–31, 2010.
[30] J. A. Fry, G. Xian, S. Jin, J. A. Dewitz, C. G. Homer, L. Yang, C. A. Barnes, N. D. Herold, J. D. Wickham et al., “Completion of the 2006 national land cover database for the conterminous united states.” PE&RS, Photogrammetric Engineering & Remote Sensing, vol. 77, no. 9, pp. 858–864, 2011.
[31] S. E. Seukep, K. N. Kolivras, Y. Hong, J. Li, S. P. Prisley, J. B. Campbell, D. N. Gaines, and R. L. Dymond, “An examination of the demographic and environmental variables correlated with lyme disease emergence in virginia,” Ecohealth, vol. 12, pp. 634–644, 2015.
[32] H. J. Kilpatrick and A. M. LaBonte, Managing urban deer in Connecticut: a guide for residents and communities. Connecticut Department of Environmental Protection, Bureau of Natural . . . , 2007.