Show simple item record

[journal article]

dc.contributor.authorHe, Zhoushanyuede
dc.contributor.authorSchonlau, Matthiasde
dc.date.accessioned2021-02-16T12:43:58Z
dc.date.available2021-02-16T12:43:58Z
dc.date.issued2021de
dc.identifier.issn2190-4936de
dc.identifier.urihttps://www.ssoar.info/ssoar/handle/document/71619
dc.description.abstractText answers to open-ended questions are often manually coded into one of several predefined categories or classes. More recently, researchers have begun to employ statistical models to automatically classify such text responses. It is unclear whether such automated coders and human coders find the same type of observations difficult to code or whether humans and models might be able to compensate for each other’s weaknesses. We analyze correlations between estimated error probabilities of human and automated coders and find: 1) Statistical models have higher error rates than human coders 2) Automated coders (models) and human coders tend to make similar coding mistakes. Specifically, the correlation between the estimated coding error of a statistical model and that of a human is comparable to that of two humans. 3) Two very different statistical models give highly correlated estimated coding errors. Therefore, a) the choice of statistical model does not matter, and b) having a second automated coder would be redundant.de
dc.languageende
dc.subject.ddcSozialwissenschaften, Soziologiede
dc.subject.ddcSocial sciences, sociology, anthropologyen
dc.subject.otheropen-ended question; manual coding; automatic coding; text classification; text answerde
dc.titleCoding Text Answers to Open-ended Questions: Human Coders and Statistical Learning Algorithms Make Similar Mistakesde
dc.description.reviewbegutachtet (peer reviewed)de
dc.description.reviewpeer revieweden
dc.source.journalMethods, data, analyses : a journal for quantitative methods and survey methodology (mda)
dc.source.volume15de
dc.publisher.countryDEU
dc.source.issue1de
dc.subject.classozErhebungstechniken und Analysetechniken der Sozialwissenschaftende
dc.subject.classozMethods and Techniques of Data Collection and Data Analysis, Statistical Methods, Computer Methodsen
dc.subject.thesozCodierungde
dc.subject.thesozstatistische Methodede
dc.subject.thesozsurvey researchen
dc.subject.thesozerroren
dc.subject.thesozstatistical methoden
dc.subject.thesozFehlerde
dc.subject.thesozAnalysede
dc.subject.thesozUmfrageforschungde
dc.subject.thesozanalysisen
dc.subject.thesozdata preparationen
dc.subject.thesozDatenaufbereitungde
dc.subject.thesozcodingen
dc.identifier.urnurn:nbn:de:0168-ssoar-71619-9
dc.rights.licenceCreative Commons - Attribution 4.0en
dc.rights.licenceCreative Commons - Namensnennung 4.0de
internal.statusformal und inhaltlich fertig erschlossende
internal.identifier.thesoz10040714
internal.identifier.thesoz10034712
internal.identifier.thesoz10043384
internal.identifier.thesoz10040334
internal.identifier.thesoz10040524
internal.identifier.thesoz10052184
dc.type.stockarticlede
dc.type.documentjournal articleen
dc.type.documentZeitschriftenartikelde
dc.source.pageinfo103-120de
internal.identifier.classoz10105
internal.identifier.journal614
internal.identifier.document32
internal.identifier.ddc300
dc.source.issuetopicThe Use of Open-ended Questions in Surveysen
dc.identifier.doihttps://doi.org/10.12758/mda.2020.10de
dc.description.pubstatusPublished Versionen
dc.description.pubstatusVeröffentlichungsversionde
internal.identifier.licence16
internal.identifier.pubstatus1
internal.identifier.review1
internal.pdf.wellformedtrue
internal.pdf.ocrnull Page_18
internal.pdf.encryptedfalse


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record