Back to search

High-Dimensional Imputation for the Social Sciences: A Comparison of State-of-The-Art Methods

DSEID
DSEID-001-2658125
DOI
10.1177/00491241231200194
Journal
Sociological Methods & Research
Publisher
SAGE Publications
Published
2025-5
Status
failed

Abstract

Including a large number of predictors in the imputation model underlying a multiple imputation (MI) procedure is one of the most challenging tasks imputers face. A variety of high-dimensional MI techniques can help, but there has been limited research on their relative performance. In this study, we investigated a wide range of extant high-dimensional MI techniques that can handle a large number of predictors in the imputation models and general missing data patterns. We assessed the relative performance of seven high-dimensional MI methods with a Monte Carlo simulation study and a resampling study based on real survey data. The performance of the methods was defined by the degree to which they facilitate unbiased and confidence-valid estimates of the parameters of complete data analysis models. We found that using lasso penalty or forward selection to select the predictors used in the MI model and using principal component analysis to reduce the dimensionality of auxiliary data produce the best results.

Ingestion failed: Traceback (most recent call last): File "/srv/app/app/worker.py", line 85, in run_once process_job(db, job) File "/srv/app/app/worker.py", line 39, in process_job pdf_path, info = fetch_pdf_temp(candidate["url"]) ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ File "/srv/app/app/downloader.py", line 129, in fetch_pdf_temp raise ValueError(f"PDF source returned HTTP {response.status_code}.") ValueError: PDF source returned HTTP 403.

PDF

No local PDF is available.

GROBID Extracted text; discontinued.

This text is generated from TEI extraction for accessibility, search, and TTS. Formulas, tables, figures, page layout, and references may not perfectly match the original PDF.

No accessible text representation is available. The text extraction service has been discontinued for the time being. If you require this service, for accessibility or any other reason, please submit an issue/request on this page.

Metadata

Title
High-Dimensional Imputation for the Social Sciences: A Comparison of State-of-The-Art Methods
Delta ID
DSEID-001-2658125
Authors
Edoardo Costantini, Kyle M. Lang, Tim Reeskens, Klaas Sijtsma
Abstract source
crossref
Source URL
https://journals.sagepub.com/doi/pdf/10.1177/00491241231200194
Access
open
Licence
cc-by
PDF SHA-256
TEI SHA-256
GROBID

Issues

No public issues have been filed for this DOI.

Submit an issue

Record history

WhenEventFieldOldNew
2026-06-18 19:37:53.011249+00:00identifier_assignedDSEIDDSEID-001-2658125