Online data collection to address language sampling bias: lessons from the COVID-19 pandemic

Garcia, R. ORCID: 0000-0003-1363-542X, Roeser, J. ORCID: 0000-0002-4463-0923 and Kidd, E., 2022. Online data collection to address language sampling bias: lessons from the COVID-19 pandemic. Linguistics Vanguard.

[img] Text
1601644_Roeser.pdf - Post-print
Full-text access embargoed until 13 October 2023.

Download (336kB)


The COVID-19 pandemic has massively limited how linguists can collect data, and out of necessity, researchers across several disciplines have moved data collection online. Here we argue that this rising popularity of remote web-based experiments also provides an opportunity for widening the context of linguistic research by facilitating data collection from understudied populations. We discuss collecting production data from adult native speakers of Tagalog using an unsupervised web-based experiment. Compared to equivalent lab experiments, data collection went quicker, and the sample was more diverse, without compromising data quality. However, there were also technical and human issues that come with this method. We discuss these challenges and provide suggestions on how to overcome them .

Item Type: Journal article
Publication Title: Linguistics Vanguard
Creators: Garcia, R., Roeser, J. and Kidd, E.
Publisher: De Gruyter Open
Date: 13 October 2022
Divisions: Schools > School of Social Sciences
Record created by: Linda Sullivan
Date Added: 26 Sep 2022 12:44
Last Modified: 02 Nov 2022 14:44

Actions (login required)

Edit View Edit View


Views per month over past year


Downloads per month over past year