A novel deep mining model for effective knowledge discovery from omics data

Alzubaidi, A ORCID logoORCID: https://orcid.org/0000-0002-5977-564X, Tepper, J ORCID logoORCID: https://orcid.org/0000-0001-7339-0132 and Lotfi, A ORCID logoORCID: https://orcid.org/0000-0002-5139-6565, 2020. A novel deep mining model for effective knowledge discovery from omics data. Artificial Intelligence in Medicine, 104: 101821. ISSN 0933-3657

[thumbnail of 1297625_Lotfi.pdf]
Preview
Text
1297625_Lotfi.pdf - Post-print

Download (666kB) | Preview

Abstract

Knowledge discovery from omics data has become a common goal of current approaches to personalised cancer medicine and understanding cancer genotype and phenotype. However, high-throughput biomedical datasets are characterised by high dimensionality and relatively small sample sizes with small signal-to-noise ratios. Extracting and interpreting relevant knowledge from such complex datasets therefore remains a significant challenge for the fields of machine learning and data mining. In this paper, we exploit recent advances in deep learning to mitigate against these limitations on the basis of automatically capturing enough of the meaningful abstractions latent with the available biological samples. Our deep feature learning model is proposed based on a set of non-linear sparse Auto-Encoders that are deliberately constructed in an under-complete manner to detect a small proportion of molecules that can recover a large proportion of variations underlying the data. However, since multiple projections are applied to the input signals, it is hard to interpret which phenotypes were responsible for deriving such predictions. Therefore, we also introduce a novel weight interpretation technique that helps to deconstruct the internal state of such deep learning models to reveal key determinants underlying its latent representations. The outcomes of our experiment provide strong evidence that the proposed deep mining model is able to discover robust biomarkers that are positively and negatively associated with cancers of interest. Since our deep mining model is problem-independent and data-driven, it provides further potential for this research to extend beyond its cognate disciplines.

Item Type: Journal article
Publication Title: Artificial Intelligence in Medicine
Creators: Alzubaidi, A., Tepper, J. and Lotfi, A.
Publisher: Elsevier
Date: April 2020
Volume: 104
ISSN: 0933-3657
Identifiers:
Number
Type
10.1016/j.artmed.2020.101821
DOI
S0933365719309935
Publisher Item Identifier
1297625
Other
Divisions: Schools > School of Science and Technology
Record created by: Jonathan Gallacher
Date Added: 26 Feb 2020 11:15
Last Modified: 27 Aug 2021 09:55
URI: https://irep.ntu.ac.uk/id/eprint/39292

Actions (login required)

Edit View Edit View

Statistics

Views

Views per month over past year

Downloads

Downloads per month over past year