Online data collection to address language sampling bias: lessons from the COVID-19 pandemic

Garcia, R (ORCID: https://orcid.org/0000-0003-1363-542X), Roeser, J (ORCID: https://orcid.org/0000-0002-4463-0923) and Kidd, E, 2022. Online data collection to address language sampling bias: lessons from the COVID-19 pandemic. Linguistics Vanguard.

Text: 1601644_Roeser.pdf - Post-print (336kB)

Abstract

The COVID-19 pandemic has massively limited how linguists can collect data, and out of necessity, researchers across several disciplines have moved data collection online. Here we argue that this rising popularity of remote web-based experiments also provides an opportunity for widening the context of linguistic research by facilitating data collection from understudied populations. We discuss collecting production data from adult native speakers of Tagalog using an unsupervised web-based experiment. Compared to equivalent lab experiments, data collection went quicker, and the sample was more diverse, without compromising data quality. However, there were also technical and human issues that come with this method. We discuss these challenges and provide suggestions on how to overcome them.

Item Type: Journal article
Publication Title: Linguistics Vanguard
Creators: Garcia, R., Roeser, J. and Kidd, E.
Publisher: De Gruyter Open
Date: 13 October 2022
Identifiers:
DOI: 10.1515/lingvan-2021-0040
Other: 1601644
Divisions: Schools > School of Social Sciences
Record created by: Linda Sullivan
Date Added: 26 Sep 2022 12:44
Last Modified: 13 Oct 2023 03:00
URI: https://irep.ntu.ac.uk/id/eprint/47120
