Volume 8 - Isuue 3 | November

Review of how the generalized regression estimators contribute to estimating the financial and economic data with missing observations under unequal probability sampling

Nuanpan Lawson

Abstract:

Purpose
Knowing financial and economic information beforehand benefits in planning and developing policies for every country especially for a developing country like Thailand and for other Asian countries. Unfortunately, missing data or non-response plays an essential role in many areas of studies including finance and economics. Eradication of missing data in a proper way before further analysis can gain remarkable outcomes and can be effective for planning policies. This review on the generalized regression estimators for population total can be applied to financial, economic and other data when missing data are present.

Design/methodology/approach
The generalized regression estimators for estimating population total, including the variance estimators under unequal probability sampling without replacement with missing data are explored under the reverse framework. Applications to financial and economic data in Thailand are also reviewed.

Findings
The review of literatures related to the proposed estimator shows the best performance, giving smaller variances in all scenarios.

Originality/value

The generalized regression estimators can assist in estimating financial and economic data that contain missing values with different missing mechanisms and can be used in other applications which help gain more superior estimators.

References:

  1. Berger, Y.G., Tirari, E.H.M. and Till´, Y. (2003), “Towards optimal regression estimation in sample surveys”, Australian and New Zealand Journal of Statistics, Vol. 45 No. 3, pp. 319-329, doi: 10.1111/1467-842x.00286.
  2. Bethlehem, J.G. and Keller, W.J. (1987), “Linear weighting of sample survey data”, Journal of Official Statistics, Vol. 3 No. 2, pp. 141-153.
  3. Brewer, K.R.W. (2002), Combined Survey Sampling Inference: Weighing Basu's Elephants, Arnold, London.
  4. Brewer, K.R.W. and Donadio, M.E. (2003), “The high entropy variance of the Horvitz-Thompson estimator”, Survey Methodology, Vol. 29 No. 2, pp. 189-196.
  5. Deville, J.C. and Särndal, C.E. (1994), “Variance estimation for the regression imputed Horvitz Thompson estimator”, Journal of Official Statistics, Vol. 10 No. 4, pp. 381-394.
  6. Estevao, V.M. and Särndal, C.E. (2003), “A new perspective on calibration estimators”, JSM- Section on Survey Research Methods, pp. 1346-1356.
  7. Fay, R.E. (1991), “A design-based perspective on missing data variance”, Proceedings of the 1991 Annual Research Conference, US Bureau of the Census, pp. 429-440.
  8. Hajek, J. (1964), “Asymptotic theory of rejective sampling with varying probabilities from a finite population”, Annals of Mathematical Statistics, Vol. 35 No. 4, pp. 1491-1523, doi: 10.1214/aoms/1177700375.
  9. Hajek, J. (1981), Sampling from Finite Population, Marcel Dekker, New York.
  10. Hansen, M.H. and Hurwitz, W.N. (1946), “The problem of nonresponse in sample surveys”, Journal of the American Statistical Association, Vol. 41 No. 236, pp. 517-529, doi: 10.1080/01621459.1946.10501894.
  11. Hartley, H.O. and Rao, J.N.K. (1962), “Sampling with unequal probability and without replacement”, The Annals of Mathematical Statistics, Vol. 33 No. 2, pp. 350-374, doi: 10.1214/aoms/1177704564.
  12. Haziza, D. (2010), “Resampling methods for variance estimation in the presence of missing survey data”, Proceedings of the Annual Conference of the Italian Statistical Society.
  13. Haziza, D. and Rao, J.N.K. (2006), “A nonresponse model approach to inference under imputation for missing survey data”, Survey Methodology, Vol. 32 No. 1, pp. 53-64.
  14. Horvitz, D.F. and Thompson, D.J. (1952), “A generalization of sampling without replacement from a finite universe”, Journal of the American Statistical Association, Vol. 47 No. 260, pp. 663-685, doi: 10.1080/01621459.1952.10483446.
  15. Lawson, N. (2017), “Variance estimation in the presence of nonresponse under probability proportional to size sampling”, Proceedings of the 6th Annual International Conference on Computational Mathematics, Computational Geometry and Statistics (CMCGS 2017), Singapore, doi: 10.5176/2251-1911_cmcgs17.32.
  16. Lawson, N. and Ponkaew, C. (2019), “New generalized regression estimator in the presence of nonresponse under unequal probability sampling”, Communications in Statistics -Theory and Methods, Vol. 48 No. 10, pp. 2483-2498, doi: 10.1080/03610926.2018.1465091.
  17. Lawson, N. and Siripanich, P. (2022), “A new generalized regression estimator and variance estimation for unequal probability sampling without replacement for missing data”, Communications in Statistics -Theory and Methods, Vol. 51 No. 18, pp. 6296-6318, doi: 10.1080/03610926.2020.1860224.
  18. Midzuno, H. (1952), “On the sampling system with probability proportional to sum of sizes”, Annals of the Institute of Statistical Mathematics, Vol. 55 No. 3, pp. 99-107.
  19. Montanari, G. (1987), “Post sampling efficient qr-prediction in large sample survey”, International Statistics, Vol. 55 No. 2, pp. 191-202, doi: 10.2307/1403195.
  20. Ponkaew, C. and Lawson, L. (2023), “New generalized regression estimators using a ratio method and its variance estimation for unequal probability sampling without replacement in the presence of nonresponse”, Current Applied Science and Technology, Vol. 23 No. 2, doi: 10.55003/cast.2022.02.23.007.
  21. Rao, J.N.K. (1990), “Variance estimation under imputation for missing data”, Technical report, Statistics Canada, Ottawa, pp. 599-608.
  22. Särndal, C.E. (1992), “Method for estimating the precision of survey estimateswhen imputation has been used”, Survey Methodology, Vol. 18, pp. 241-252.
  23. Särndal, C.E. (2007), “The calibration approach in survey theory and practice”, Survey Methodology, Vol. 33 No. 2, pp. 99-119.
  24. Särndal, C.E. and Lundström, S. (2005), Estimation in Surveys with Nonresponse, John Wiley & Sons, New York.
  25. Särndal, C.E., Swensson, B. and Wretman, J. (1992), Model Assisted Survey Sampling, Springer- Verlag, New York.
  26. Sen, A.R. (1953), “On the estimate of the variance in sampling with varying probabilities”, Journal of the Indian Society of Agricultural Statistics, Vol. 5, pp. 119-127.
  27. Shao, J. and Steel, P. (1999), “Variance estimation for survey data with composite imputation and nonnegligible sampling fractions”, Journal of the American Statistical Association, Vol. 94 No. 445, pp. 254-265, doi: 10.2307/2669700.
  28. Yates, F. and Grundy, P.M. (1953), “Selection without replacement from within strata with probability proportional to size”, Journal of the Royal Statistical Society: Series B, Vol. 15 No. 2, pp. 235-261, doi: 10.1111/j.2517-6161.1953.tb00140.x.

JEL classification: C83