Download full text
(external source)
Citation Suggestion
Please use the following Persistent Identifier (PID) to cite this document:
https://doi.org/10.1111/rssa.12846
Exports for your reference manager
Understanding political news media consumption with digital trace data and natural language processing
[journal article]
Abstract Augmenting survey data with digital traces is a promising direction for combining the advantages of active and passive data collection. However, extracting interpretable measurements from digital traces for social science research is challenging. In this study, we demonstrate how to obtain measureme... view more
Augmenting survey data with digital traces is a promising direction for combining the advantages of active and passive data collection. However, extracting interpretable measurements from digital traces for social science research is challenging. In this study, we demonstrate how to obtain measurements of news media consumption from survey respondents' web browsing data using Bidirectional Encoder Representations from Transformers, a powerful natural language processing algorithm that estimates contextual word embeddings from text data. Our approach is particularly relevant for political scientists and communication researchers studying exposure to online news content but can easily be adapted to projects in other disciplines working with similar data sets.... view less
Keywords
data capture; media consumption; news; online media; political interest; voting behavior
Classification
Methods and Techniques of Data Collection and Data Analysis, Statistical Methods, Computer Methods
Free Keywords
digital trace data; news consumption; NLP; political preferences; web tracking
Document language
English
Publication Year
2022
Page/Pages
S246-S269
Journal
Journal of the Royal Statistical Society, Series A (Statistics in Society), 185 (2022) Suppl. 2
ISSN
1467-985X
Status
Published Version; peer reviewed