Download full text
(external source)
Citation Suggestion
Please use the following Persistent Identifier (PID) to cite this document:
https://doi.org/10.1177/0282423X241312775
Exports for your reference manager
Averaging Non-Probability Online Surveys to Avoid Maximal Estimation Error
[journal article]
Abstract Data from online non-probability samples are often analyzed as if they were based on a simple random sample drawn from the general population. As the exact sampling frame for these non-probability samples are usually unknown, there is no general method to construct unbiased estimators. This raises t... view more
Data from online non-probability samples are often analyzed as if they were based on a simple random sample drawn from the general population. As the exact sampling frame for these non-probability samples are usually unknown, there is no general method to construct unbiased estimators. This raises the question of whether estimates based on online non-probability samples are consistent across sample vendors and concerning estimates based on probability samples. To address this question, we analyze data collected from eight different online non-probability sample vendors and one online probability-based sample. We find that estimates from the different non-probability samples can be very inconsistent. We suggest averaging estimates across multiple vendor samples to avoid the risk of a maximum estimation error. We evaluate several averaging approaches, including a LASSO regression procedure which identifies a subset of vendors that, when averaged, produce estimates that are more consistent with the reference probability-based estimates, compared to any single vendor. Our results show that estimates based on different vendors' samples display different selection biases, but there is also some commonality among some vendor-specific estimates, thus there could be strong gains in estimation precision by averaging across a selection of multiple non-probability sample vendors.... view less
Keywords
survey research; online survey; sample; probability; estimation; sampling error
Classification
Methods and Techniques of Data Collection and Data Analysis, Statistical Methods, Computer Methods
Free Keywords
LASSO; model-averaging; non-probability samples; regression model; sample selection bias
Document language
English
Publication Year
2025
Page/Pages
p. 700-724
Journal
Journal of Official Statistics, 41 (2025) 2
Issue topic
Special Issue on Integrating data from multiple sources for production of official statistics
ISSN
2001-7367
Status
Published Version; peer reviewed