Using Survey-to-Survey Imputation to Fill Poverty Data Gaps at a Low Cost

Survey data on household consumption are often unavailable or incomparable over time in many low- and middle-income countries. Based on a unique randomized survey experiment implemented in Tanzania, this study offers new and rigorous evidence demonstrating that survey-to-survey imputation can fill consumption data gaps and provide low-cost and reliable poverty estimates. Basic imputation models featuring utility expenditures, together with a modest set of predictors on demographics, employment, household assets, and housing, yield accurate predictions. Imputation accuracy is robust to varying the survey questionnaire length, the choice of base surveys for estimating the imputation model, different poverty lines, and alternative (quarterly or monthly) Consumer Price Index deflators. The proposed approach to imputation also performs better than multiple imputation and a range of machine learning techniques. In the case of a target survey with modified (shortened or aggregated) food or non-food consumption modules, imputation models including food or non-food consumption as predictors do well only if the distributions of the predictors are standardized vis-à-vis the base survey. For the best-performing models to reach acceptable levels of accuracy, the minimum required sample size should be 1,000 for both the base and target surveys. The discussion expands on the implications of the findings for the design of future surveys.

Saved in:
Bibliographic Details
Main Authors: Dang, Hai-Anh, Kilic, Talip, Hlasny, Vladimir, Abanokova, Kseniya, Carletto, Calogero
Format: Working Paper biblioteca
Language:English
en_US
Published: Washington, DC: World Bank 2024-03-26
Subjects:CONSUMPTION, POVERTY, SURVEY-TO-SURVEY IMPUTATION, HOUSEHOLD SURVEYS, TANZANIA, NO POVERTY, SDG 1,
Online Access:http://documents.worldbank.org/curated/en/099851203262433827/IDU1dce7aeb018793142f91aae715d889956cee9
https://hdl.handle.net/10986/41291
Tags: Add Tag
No Tags, Be the first to tag this record!
id dig-okr-1098641291
record_format koha
spelling dig-okr-10986412912024-04-24T19:46:50Z Using Survey-to-Survey Imputation to Fill Poverty Data Gaps at a Low Cost Evidence from a Randomized Survey Experiment Dang, Hai-Anh Kilic, Talip Hlasny, Vladimir Abanokova, Kseniya Carletto, Calogero CONSUMPTION POVERTY SURVEY-TO-SURVEY IMPUTATION HOUSEHOLD SURVEYS TANZANIA NO POVERTY SDG 1 Survey data on household consumption are often unavailable or incomparable over time in many low- and middle-income countries. Based on a unique randomized survey experiment implemented in Tanzania, this study offers new and rigorous evidence demonstrating that survey-to-survey imputation can fill consumption data gaps and provide low-cost and reliable poverty estimates. Basic imputation models featuring utility expenditures, together with a modest set of predictors on demographics, employment, household assets, and housing, yield accurate predictions. Imputation accuracy is robust to varying the survey questionnaire length, the choice of base surveys for estimating the imputation model, different poverty lines, and alternative (quarterly or monthly) Consumer Price Index deflators. The proposed approach to imputation also performs better than multiple imputation and a range of machine learning techniques. In the case of a target survey with modified (shortened or aggregated) food or non-food consumption modules, imputation models including food or non-food consumption as predictors do well only if the distributions of the predictors are standardized vis-à-vis the base survey. For the best-performing models to reach acceptable levels of accuracy, the minimum required sample size should be 1,000 for both the base and target surveys. The discussion expands on the implications of the findings for the design of future surveys. 2024-03-27T13:59:09Z 2024-03-27T13:59:09Z 2024-03-26 Working Paper http://documents.worldbank.org/curated/en/099851203262433827/IDU1dce7aeb018793142f91aae715d889956cee9 https://hdl.handle.net/10986/41291 English en_US Policy Research Working Paper; 10738 CC BY 3.0 IGO https://creativecommons.org/licenses/by/3.0/igo/ World Bank application/pdf text/plain Washington, DC: World Bank
institution Banco Mundial
collection DSpace
country Estados Unidos
countrycode US
component Bibliográfico
access En linea
databasecode dig-okr
tag biblioteca
region America del Norte
libraryname Biblioteca del Banco Mundial
language English
en_US
topic CONSUMPTION
POVERTY
SURVEY-TO-SURVEY IMPUTATION
HOUSEHOLD SURVEYS
TANZANIA
NO POVERTY
SDG 1
CONSUMPTION
POVERTY
SURVEY-TO-SURVEY IMPUTATION
HOUSEHOLD SURVEYS
TANZANIA
NO POVERTY
SDG 1
spellingShingle CONSUMPTION
POVERTY
SURVEY-TO-SURVEY IMPUTATION
HOUSEHOLD SURVEYS
TANZANIA
NO POVERTY
SDG 1
CONSUMPTION
POVERTY
SURVEY-TO-SURVEY IMPUTATION
HOUSEHOLD SURVEYS
TANZANIA
NO POVERTY
SDG 1
Dang, Hai-Anh
Kilic, Talip
Hlasny, Vladimir
Abanokova, Kseniya
Carletto, Calogero
Using Survey-to-Survey Imputation to Fill Poverty Data Gaps at a Low Cost
description Survey data on household consumption are often unavailable or incomparable over time in many low- and middle-income countries. Based on a unique randomized survey experiment implemented in Tanzania, this study offers new and rigorous evidence demonstrating that survey-to-survey imputation can fill consumption data gaps and provide low-cost and reliable poverty estimates. Basic imputation models featuring utility expenditures, together with a modest set of predictors on demographics, employment, household assets, and housing, yield accurate predictions. Imputation accuracy is robust to varying the survey questionnaire length, the choice of base surveys for estimating the imputation model, different poverty lines, and alternative (quarterly or monthly) Consumer Price Index deflators. The proposed approach to imputation also performs better than multiple imputation and a range of machine learning techniques. In the case of a target survey with modified (shortened or aggregated) food or non-food consumption modules, imputation models including food or non-food consumption as predictors do well only if the distributions of the predictors are standardized vis-à-vis the base survey. For the best-performing models to reach acceptable levels of accuracy, the minimum required sample size should be 1,000 for both the base and target surveys. The discussion expands on the implications of the findings for the design of future surveys.
format Working Paper
topic_facet CONSUMPTION
POVERTY
SURVEY-TO-SURVEY IMPUTATION
HOUSEHOLD SURVEYS
TANZANIA
NO POVERTY
SDG 1
author Dang, Hai-Anh
Kilic, Talip
Hlasny, Vladimir
Abanokova, Kseniya
Carletto, Calogero
author_facet Dang, Hai-Anh
Kilic, Talip
Hlasny, Vladimir
Abanokova, Kseniya
Carletto, Calogero
author_sort Dang, Hai-Anh
title Using Survey-to-Survey Imputation to Fill Poverty Data Gaps at a Low Cost
title_short Using Survey-to-Survey Imputation to Fill Poverty Data Gaps at a Low Cost
title_full Using Survey-to-Survey Imputation to Fill Poverty Data Gaps at a Low Cost
title_fullStr Using Survey-to-Survey Imputation to Fill Poverty Data Gaps at a Low Cost
title_full_unstemmed Using Survey-to-Survey Imputation to Fill Poverty Data Gaps at a Low Cost
title_sort using survey-to-survey imputation to fill poverty data gaps at a low cost
publisher Washington, DC: World Bank
publishDate 2024-03-26
url http://documents.worldbank.org/curated/en/099851203262433827/IDU1dce7aeb018793142f91aae715d889956cee9
https://hdl.handle.net/10986/41291
work_keys_str_mv AT danghaianh usingsurveytosurveyimputationtofillpovertydatagapsatalowcost
AT kilictalip usingsurveytosurveyimputationtofillpovertydatagapsatalowcost
AT hlasnyvladimir usingsurveytosurveyimputationtofillpovertydatagapsatalowcost
AT abanokovakseniya usingsurveytosurveyimputationtofillpovertydatagapsatalowcost
AT carlettocalogero usingsurveytosurveyimputationtofillpovertydatagapsatalowcost
AT danghaianh evidencefromarandomizedsurveyexperiment
AT kilictalip evidencefromarandomizedsurveyexperiment
AT hlasnyvladimir evidencefromarandomizedsurveyexperiment
AT abanokovakseniya evidencefromarandomizedsurveyexperiment
AT carlettocalogero evidencefromarandomizedsurveyexperiment
_version_ 1798164700928671744