Discriminating the origin of fish from closely related water bodies by combining NMR spectroscopy with statistical analysis and machine learning

Kuhn, S., Reitel, K., Homapour, E. ORCID: 0000-0001-9756-2744, Kork, K., Vaino, V., Arula, T., Bernotas, P. and Reile, I., 2024. Discriminating the origin of fish from closely related water bodies by combining NMR spectroscopy with statistical analysis and machine learning. Ecological Informatics, 83: 102753. ISSN 1574-9541

Full text not available from this repository.

Abstract

Pikeperch, perch and bream are among the most traded and valued fish species in North-Eastern Europe. Therefore, it is necessary to be able to distinguish fish from different lakes and coastal sea regions to ensure a good traceability of products in the fish market and to protect both consumers and fish stocks. Untargeted metabolomics using nuclear magnetic resonance (NMR) spectroscopy is a suitable tool for this purpose. It is an established method for determining various properties of biological and living systems, such as health, origin, type, etc. Statistical methods including principal component analysis (PCA) and linear discriminant analysis (LDA) are typically applied to NMR data to correlate spectra with a particular research question.

Herein we examine fish from three closely related water bodies and demonstrate that reliable determination of the water body that a particular fish originates from by traditional statistical analysis (PCA and LDA) of fish NMR spectra is not possible. In contrast, determining the fish species is possible. We proceed to show that machine learning methods perform better and that a combination of statistical analysis (LDA) and random forest (RF), a supervised machine learning technique, allows reliable determination of the originating water body, while being also tolerant to seasonal variations. This is an improvement over prior work, which has dealt with more clearly distinguished origins of fish. Exceptional accuracy was achieved in correctly assigning fish to their origin even in a scenario where two of the water bodies are connected by a river through which the fish are known to migrate. Since determining the origin of fish is important in environmental protection, we recommend following up this approach and using it as the basis of a robust tool for environmental protection and other monitoring purposes.

Item Type: Journal article
Publication Title: Ecological Informatics
Creators: Kuhn, S., Reitel, K., Homapour, E., Kork, K., Vaino, V., Arula, T., Bernotas, P. and Reile, I.
Publisher: Elsevier
Date: November 2024
Volume: 83
ISSN: 1574-9541
Identifiers:
NumberType
10.1016/j.ecoinf.2024.102753DOI
2259420Other
Rights: This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/bync-nd/4.0/).
Divisions: Schools > Nottingham Business School
Record created by: Jonathan Gallacher
Date Added: 23 Oct 2024 08:57
Last Modified: 23 Oct 2024 08:57
URI: https://irep.ntu.ac.uk/id/eprint/52457

Actions (login required)

Edit View Edit View

Views

Views per month over past year

Downloads

Downloads per month over past year