Download full text
(1.863Mb)
Citation Suggestion
Please use the following Persistent Identifier (PID) to cite this document:
https://doi.org/10.12759/hsr.45.2020.3.288-313
Exports for your reference manager
Potential and Limits of Automated Classification of Big Data: A Case Study
Potentiale und Grenzen der automatischen Klassifikation von Big Data: Eine Fallstudie
[journal article]
Abstract This case study highlights the potentials and limits of big-data analyses of media sources compared to conventional, quantitative content analysis. In an FFG-funded multidisciplinary project in Austria (based on the KIRAS security research program), the software tool WebLyzard was used for an automa... view more
This case study highlights the potentials and limits of big-data analyses of media sources compared to conventional, quantitative content analysis. In an FFG-funded multidisciplinary project in Austria (based on the KIRAS security research program), the software tool WebLyzard was used for an automated analysis of online news and social media sources (comments on articles, Facebook postings, and Twitter statements) in order to analyze the media representation of pressing societal issues and citizens’ perceptions of security. Frequency and sentiment analyses were carried out by two independent observers in parallel to the automated WebLyzard results. Specific articles on selected key topics like technology or Muslims in two major online newspapers in Austria (Der Standard and Kronen Zeitung) were counted, as were user comments, and both were evaluated according to different sentiment categories. The results indicate various weaknesses of the software leading to misinterpretations, and the automated analyses yield substantially different results compared to the sentiment analysis carried out by the two raters, especially for cynical or irrelevant statements. From a social-sciences methodological perspective, the results clearly show that methodology in our discipline should promote theory-based research, should counteract the attraction of superficial analyses of complex social issues, and should emphasize not only the potentials but also the dangers and risks associated with big data.... view less
Keywords
automation; sense of security; Austria; attitude; population; social media; comparison of methods; case study; software; online media; content analysis; domestic security
Classification
Methods and Techniques of Data Collection and Data Analysis, Statistical Methods, Computer Methods
Interactive, electronic Media
Free Keywords
Security perceptions; social media; big data; evaluation study; automated analysis
Document language
English
Publication Year
2020
Page/Pages
p. 288-313
Journal
Historical Social Research, 45 (2020) 3
ISSN
0172-6404
Status
Published Version; peer reviewed