Deep-FS: a feature selection algorithm for deep Boltzmann machines

Taherkhani, A (ORCID: https://orcid.org/0000-0002-3627-6362), Cosma, G (ORCID: https://orcid.org/0000-0002-4663-6907) and McGinnity, TM (ORCID: https://orcid.org/0000-0002-9897-4748), 2018. Deep-FS: a feature selection algorithm for deep Boltzmann machines. Neurocomputing, 322, pp. 22-37. ISSN 0925-2312

11719_1136a_Cosma.pdf - Published version

Abstract

A Deep Boltzmann Machine is a type of Deep Neural Network formed from multiple layers of neurons with nonlinear activation functions. Its structure enables it to learn very complex relationships between features and to learn high-level representations of features better than conventional Artificial Neural Networks. Feature selection at the input level of Deep Neural Networks has not been well studied, despite its importance in reducing the number of input features processed by the deep learning model, which facilitates understanding of the data. This paper proposes a novel algorithm, Deep Feature Selection (Deep-FS), which removes irrelevant features from large datasets in order to reduce the number of inputs that are modelled during the learning process. The proposed Deep-FS algorithm utilizes a Deep Boltzmann Machine and uses knowledge acquired during training to remove features at the beginning of the learning process. Reducing inputs is important because it prevents the network from learning associations between irrelevant features, which degrade the network's acquired knowledge of the overall distribution of the data. The Deep-FS method embeds feature selection in a Restricted Boltzmann Machine which is used for training a Deep Boltzmann Machine. The generative property of the Restricted Boltzmann Machine is used to reconstruct eliminated features and calculate reconstruction errors, in order to evaluate the impact of eliminating features. The performance of the proposed approach was evaluated with experiments conducted using the MNIST, MIR-Flickr, GISETTE, MADELON and PANCAN datasets. The results showed that the proposed Deep-FS method selects features without loss of accuracy on the MIR-Flickr dataset, where it removed 775 input features with no reduction in performance.
On the MNIST dataset, Deep-FS reduced the number of input features by more than 45%, reduced the network error from 0.97% to 0.90%, and reduced processing and classification time by more than 5.5%. Additionally, when compared to classical feature selection methods, Deep-FS returned higher accuracy. The experimental results on GISETTE, MADELON and PANCAN showed that Deep-FS reduced the number of input features by 81%, 57% and 77%, respectively. Moreover, the proposed feature selection method reduced the classifier training time by 82%, 70% and 85% on the GISETTE, MADELON and PANCAN datasets, respectively. Experiments with various datasets, comprising a large number of features and samples, showed that the proposed Deep-FS algorithm overcomes the main limitations of classical feature selection algorithms. More specifically, most classical methods require the number of features to retain to be specified in advance; in Deep-FS, this number is identified automatically. Deep-FS also performs the feature selection task faster than classical feature selection algorithms, which makes it suitable for deep learning tasks. In addition, Deep-FS is suitable for finding features in large datasets, which are normally stored in data batches for faster and more efficient processing.
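
The reconstruction-based idea described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: it assumes a small binary Restricted Boltzmann Machine trained with one step of contrastive divergence, assumes that "eliminating" a feature means setting it to zero at the input, and assumes a simple rule that drops a feature when the reconstruction error without it is no worse than the baseline. All function names (train_rbm, reconstruct, select_features), the hyperparameters and the synthetic data are hypothetical.

```python
import numpy as np


def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))


def train_rbm(data, n_hidden=8, epochs=50, lr=0.1, seed=0):
    """Train a binary RBM with one-step contrastive divergence (CD-1)."""
    rng = np.random.default_rng(seed)
    n_samples, n_visible = data.shape
    W = 0.01 * rng.standard_normal((n_visible, n_hidden))
    b_v = np.zeros(n_visible)
    b_h = np.zeros(n_hidden)
    for _ in range(epochs):
        # Positive phase: hidden probabilities given the data.
        h_prob = sigmoid(data @ W + b_h)
        h_sample = (rng.random(h_prob.shape) < h_prob).astype(float)
        # Negative phase: one reconstruction step.
        v_recon = sigmoid(h_sample @ W.T + b_v)
        h_recon = sigmoid(v_recon @ W + b_h)
        # CD-1 parameter updates, averaged over the batch.
        W += lr * (data.T @ h_prob - v_recon.T @ h_recon) / n_samples
        b_v += lr * (data - v_recon).mean(axis=0)
        b_h += lr * (h_prob - h_recon).mean(axis=0)
    return W, b_v, b_h


def reconstruct(v, W, b_v, b_h):
    """Deterministic visible -> hidden -> visible pass."""
    h = sigmoid(v @ W + b_h)
    return sigmoid(h @ W.T + b_v)


def select_features(data, W, b_v, b_h):
    """Assumed rule: drop feature j if zeroing it at the input does not
    increase the reconstruction error of the full data."""
    baseline = np.mean((data - reconstruct(data, W, b_v, b_h)) ** 2)
    keep = np.ones(data.shape[1], dtype=bool)
    for j in range(data.shape[1]):
        masked = data.copy()
        masked[:, j] = 0.0  # eliminate feature j at the input
        err = np.mean((data - reconstruct(masked, W, b_v, b_h)) ** 2)
        if err <= baseline:
            keep[j] = False
    return keep, baseline


# Synthetic binary data: six columns driven by a shared latent bit,
# six columns of independent noise (purely illustrative).
rng = np.random.default_rng(42)
latent = (rng.random((200, 1)) < 0.5).astype(float)
data = np.hstack([np.repeat(latent, 6, axis=1),
                  (rng.random((200, 6)) < 0.5).astype(float)])

W, b_v, b_h = train_rbm(data)
keep, baseline = select_features(data, W, b_v, b_h)
print("baseline reconstruction error:", baseline)
print("features kept:", np.flatnonzero(keep))
```

Note that in this sketch the number of retained features is not fixed in advance; it falls out of the error comparison, which mirrors the property the abstract attributes to Deep-FS.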

Item Type: Journal article
Publication Title: Neurocomputing
Creators: Taherkhani, A., Cosma, G. and McGinnity, T.M.
Publisher: Elsevier
Date: 17 December 2018
Volume: 322
ISSN: 0925-2312
Identifiers: DOI 10.1016/j.neucom.2018.09.040
Divisions: Schools > School of Science and Technology
Record created by: Jonathan Gallacher
Date Added: 08 Aug 2018 08:59
Last Modified: 30 Oct 2018 12:05
URI: https://irep.ntu.ac.uk/id/eprint/34259
