Enhancing multi-user activity recognition in an indoor environment with augmented Wi-Fi channel state information and transformer architectures

Kobir, MI, Machado, P ORCID logoORCID: https://orcid.org/0000-0003-1760-3871, Lotfi, A ORCID logoORCID: https://orcid.org/0000-0002-5139-6565, Haider, D ORCID logoORCID: https://orcid.org/0000-0002-9302-871X and Ihianle, IK ORCID logoORCID: https://orcid.org/0000-0001-7445-8573, 2025. Enhancing multi-user activity recognition in an indoor environment with augmented Wi-Fi channel state information and transformer architectures. Sensors, 25 (13): 3955. ISSN 1424-8220

[thumbnail of 2460540_Ihianle.pdf]
Preview
Text
2460540_Ihianle.pdf - Published version

Download (2MB) | Preview

Abstract

Human Activity Recognition (HAR) is crucial for understanding human behaviour through sensor data, with applications in healthcare, smart environments, and surveillance. While traditional HAR often relies on ambient sensors, wearable devices or vision-based systems, these approaches can face limitations in dynamic settings and raise privacy concerns. Device-free HAR systems, utilising Wi-Fi Channel State Information (CSI) to human movements, have emerged as a promising privacy-preserving alternative for next-generation health activity monitoring and smart environments, particularly for multi-user scenarios. However, current research faces challenges such as the need for substantial annotated training data, class imbalance, and poor generalisability in complex, multi-user environments where labelled data is often scarce. This paper addresses these gaps by proposing a hybrid deep learning approach which integrates signal preprocessing, targeted data augmentation, and a customised integration of CNN and Transformer models, designed to address the challenges of multi-user recognition and data scarcity. A random transformation technique to augment real CSI data, followed by hybrid feature extraction involving statistical, spectral, and entropy-based measures to derive suitable representations from temporal sensory input, is employed. Experimental results show that the proposed model outperforms several baselines in single-user and multi-user contexts. Our findings demonstrate that combining real and augmented data significantly improves model generalisation in scenarios with limited labelled data.

Item Type: Journal article
Publication Title: Sensors
Creators: Kobir, M.I., Machado, P., Lotfi, A., Haider, D. and Ihianle, I.K.
Publisher: MDPI AG
Date: 25 June 2025
Volume: 25
Number: 13
ISSN: 1424-8220
Identifiers:
Number
Type
10.3390/s25133955
DOI
2460540
Other
Rights: © 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Divisions: Schools > School of Science and Technology
Record created by: Jonathan Gallacher
Date Added: 30 Jun 2025 16:09
Last Modified: 30 Jun 2025 16:09
URI: https://irep.ntu.ac.uk/id/eprint/53858

Actions (login required)

Edit View Edit View

Statistics

Views

Views per month over past year

Downloads

Downloads per month over past year