Online data collection to address language sampling bias: lessons from the COVID-19 pandemic

Garcia, R; Roeser, J; Kidd, E

NTU > IRep

IRep

Online data collection to address language sampling bias: lessons from the COVID-19 pandemic

Tools

Garcia, R ORCID: https://orcid.org/0000-0003-1363-542X, Roeser, J ORCID: https://orcid.org/0000-0002-4463-0923 and Kidd, E, 2022. Online data collection to address language sampling bias: lessons from the COVID-19 pandemic. Linguistics Vanguard.

Preview

Text
1601644_Roeser.pdf - Post-print
Download (336kB) | Preview

Official URL: https://doi.org/10.1515/lingvan-2021-0040

Abstract

The COVID-19 pandemic has massively limited how linguists can collect data, and out of necessity, researchers across several disciplines have moved data collection online. Here we argue that this rising popularity of remote web-based experiments also provides an opportunity for widening the context of linguistic research by facilitating data collection from understudied populations. We discuss collecting production data from adult native speakers of Tagalog using an unsupervised web-based experiment. Compared to equivalent lab experiments, data collection went quicker, and the sample was more diverse, without compromising data quality. However, there were also technical and human issues that come with this method. We discuss these challenges and provide suggestions on how to overcome them .

Item Type:	Journal article
Publication Title:	Linguistics Vanguard
Creators:	Garcia, R., Roeser, J. and Kidd, E.
Publisher:	De Gruyter Open
Date:	13 October 2022
Identifiers:	Number Type 10.1515/lingvan-2021-0040 DOI 1601644 Other
Divisions:	Schools > School of Social Sciences
Record created by:	Linda Sullivan
Date Added:	26 Sep 2022 12:44
Last Modified:	13 Oct 2023 03:00
URI:	https://irep.ntu.ac.uk/id/eprint/47120