The Effect of Outliers on the Economic and Social Survey on Income and Living Conditions
Authors: Encarnación Álvarez, Rosa M. García-Fernández, Francisco J. Blanco-Encomienda, Juan F. Muñoz
Abstract:
The European Union Survey on Income and Living Conditions (EU-SILC) is a popular survey which provides information on income, poverty, social exclusion and living conditions of households and individuals in the European Union. The EU-SILC contains variables which may contain outliers. The presence of outliers can have an impact on the measures and indicators used by the EU-SILC. In this paper, we used data sets from various countries to analyze the presence of outliers. In addition, we obtain some indicators after removing these outliers, and a comparison between both situations can be observed. Finally, some conclusions are obtained.
Keywords: Headcount index, poverty line, risk of poverty, skewness coefficient.
Digital Object Identifier (DOI): doi.org/10.5281/zenodo.1096499
Procedia APA BibTeX Chicago EndNote Harvard JSON MLA RIS XML ISO 690 PDF Downloads 2561References:
[1] T.N. Achia, A. Wangombe and N. Khadioli, "A logistic regression model to identify key determinants of poverty using demographic and health survey data”. European Journal of Social Sciences, 13(1), pp. 38–45, 2010.
[2] E. A´ lvarez, R.M. Garc´ıa-Ferna´ndez, J.F. Mun˜oz and F.J. Blanco-Encomienda, "On estimating the headcount index by using the logistic regression estimator”. International Journal of Mathematical, Computational, Physical and Quantum Engineering, 8(8), pp. 1039–1041, 2014.
[3] A.B. Atkinson, "On the measurement of poverty”. Econometrica, 55(4), pp. 749-764, 1987.
[4] C.R. Blyth and H.A. Still, "Binomial confidence intervals”. Journal of the American Statistical Association, 78, pp. 108–116, 1983.
[5] R.L. Chambers and R. Dunstan, "Estimating distribution functions from survey data”. Biometrika, 73, pp. 597–604. 1986.
[6] J. Chen and R.R. Sitter, "A pseudo empirical likelihood approach to the effective use of auxiliary information in complex surveys”. Statistica Sinica, 9, pp. 385-406, 1999.
[7] F.A. Cowell and M.P. Victoria-Feser, "Welfare ranking in the presence of contaminated data”. Econometrica, 70, pp. 1221–1233, 2002.
[8] E. Crettaz and C. Suter, "The impact of adaptive preferences on subjective indicators: an analysis of poverty indicators”. Social Indicators Research, 114, pp. 139-152, 2013.
[9] J.C. Deville and C.E. S¨arndal, "Calibration estimators in survey sampling”. Journal of the American Statistical Association, 87, pp. 376-382, 1992.
[10] A.H. Dorfman, "Inference on distribution functions and quantiles, in Handbook of Statistics 29B Sample suveys: Inference and Analysis, D. Pffefermann and C.R. Rao, Eds. Amsterdam: North-Holland, 2009, pp. 371-395.
[11] P. Duchesne, "Estimation of a proportion with survey data”. Journal of Statistics Education, 11, pp. 1-24, 2003.
[12] EUROSTAT, "Laeken” indicators-detailed calculation methodology, Directorate E: Social Statistics, Unit E-2: Living Conditions, DOC.E2/IPSE/2003. http://www.cso.ie/en/media/csoie/eusilc/documents/Laeken%20Indicators %20-%20calculation%20algorithm.pdf, 2003.
[13] J.E. Foster, "Absolute versus relative poverty”. The American Economic Review, 88, pp. 335-341, 1998.
[14] J.E. Foster, J. Greer and E. Thorbecke, "A class of decomposable poverty measures”. Econometrica, 52, pp. 761-766, 1984.
[15] J.R. Frick, M.M. Grabka and O. Groh-Samberg, "Dealing with incomplete household panel data in inequality research”. Sociological Methods and Reserch, 41, pp. 89-123, 2012.
[16] F. Giambona and E. Vassallo, "Composite indicator of social inclusion for European countries”. Social Indicators Research, 116, pp. 269-293, 2014.
[17] H. Gravelle and M. Sutton, "Income relative income, and seft-reporter health in Britain 1979-2000”. Center for Health Economics Research Paper, 10, 2006.
[18] J. Haughton and S.R. Khandker, Handbook on poverty and inequality. Washington, DC: The World Bank, 2009.
[19] D. Jolliffe, "Measuring absolute and relative poverty. The sensitivity of estimated household consumption to survey design”. Journal of Economics and Social Measurement, 27, pp. 1-23, 2001.
[20] S.R. Khandker, Introduction to Poverty Analysis. Washington, DC: World Bank Institute, 2005.
[21] R. Lehtonen and A. Veijanen, "On multinomial logistic generalized regression estimators”. Survey Methodology, 24, pp. 51-55, 1998.
[22] M. Medeiros, "The rich and the poor: the construction of an affluence line from the poverty line”. Social Indicators Research, 78, pp. 1-18, 2006.
[23] I. Molina and J.N.K. Rao, "Small area estimation of poverty indicators”. The Canadian Journal of Statistics, 38, pp. 369-385, 2010.
[24] R.G. Newcombe, "Two-sided confidence intervals for the single proportion: comparison of seven methods”. Statistic in Medicine, 17, pp. 857–872, 1998.
[25] J.N.K. Rao, J.G. Kovar and H.J. Mantel, "On estimating distribution function and quantiles from survey data using auxiliary information”. Biometrika, 77, pp. 365-375, 1990.
[26] C.E. S¨arndal, B. Swensson and J. Wretman, Model assisted survey sampling. New York: Springer Verlag, 1992.
[27] P.L.D. Silva and C.J. Skinner, "Estimating distribution function with auxiliary information using poststratification”. Journal of Official Statistics, 11, pp. 277-294, 1995.
[28] A. Tarozzi and A. Deaton, "Using census and survey data to estimate poverty and inequality for small areas”. Review of Economics and Statistics, 91(4), pp. 773–792, 2009.
[29] S.E. Vollset, "Confidence interval for a binomial proportion”. Statistic in Medicine, 12, pp. 809–824, 1993.
[30] S. Weich, G. Lewis and S.P. Jenkins, "Income inequality and self-rated health in Britain”. Journal of Epidemiology and Community Health, 56, pp. 436–441, 2002.
[31] E.B. Wilson, "Probable inference, the law of succession, and statistical inference”. Journal of the American Statisitical Association, 22, pp. 209–212, 1927.
[32] B. Zheng, "Statistical inference for poverty measures with relative poverty lines”. Journal of Econometrics, 101, pp. 337-356, 2001.