Hand Gesture Classification Using Time-Frequency Images and Transfer Learning Based on Cnn

Loading...
Publication Logo

Date

2022

Journal Title

Journal ISSN

Volume Title

Publisher

Elsevier Sci Ltd

Open Access Color

Green Open Access

No

OpenAIRE Downloads

OpenAIRE Views

Publicly Funded

No
Impulse
Top 1%
Influence
Top 10%
Popularity
Top 1%

Research Projects

Journal Issue

Abstract

Hand gesture-based systems are one of the most effective technological advances and continue to develop with improvements in the field of human-computer interaction. Surface electromyography (sEMG) is utilized as a popular input data for gesture classification with elevated accuracy and advanced control capability. This paper presents a comparative hand gesture classification approach using time-frequency (TF) images of the spontaneous sEMG signals and the transfer learning method. 4-channel sEMG signals are collected from 30 subjects performing 7 specific hand gestures. After the required pre-processing, segmentation, and windowing steps, three TF analysis methods, namely Short-Time Fourier Transform (STFT), Continuous Wavelet Transform (CWT), and Hilbert-Huang Transform (HHT), are applied to EMG signals to obtain TF images. Spectrograms from STFT, scalograms from CWT, and Hilbert-Huang spectra (HHS) from HHT obtained from multi-channel sEMG data are separately fused. TF images are then utilized to extract distinct features using seven state-of-the-art, pre-trained Convolutional Neural Network (CNN) architectures and classify seven hand gestures. Two different robust crossvalidation strategies are conducted to evaluate the proposed method; stratified k-fold cross-validation (SKCV) and leave-one-subject-out cross-validation (LOOCV). We also investigate the effect of window size and the combination of Intrinsic Mode Functions (IMFs) on classification performance. The results demonstrated that the HHT utilizing IMFs obtained by Empirical Mode Decomposition (EMD) provided improved TF resolution and better results than STFT and CWT in the classification of sEMG signals. Finally, the best average accuracies (93.75% for SKCV) and (94.41% for LOOCV) are obtained by the HHT method with the pre-trained ResNet-50 model.

Description

Keywords

Hand Gesture Classification, Convolutional Neural Networks (CNN), Electromyogram (EMG), STFT, CWT, Hilbert-Huang Transform (HHT), Hilbert Spectrum, Emg, Recognition

Fields of Science

0206 medical engineering, 0202 electrical engineering, electronic engineering, information engineering, 02 engineering and technology

Citation

WoS Q

Q2

Scopus Q

Q1
OpenCitations Logo
OpenCitations Citation Count
59

Source

Bıomedıcal Sıgnal Processıng And Control

Volume

77

Issue

Start Page

End Page

PlumX Metrics
Citations

CrossRef : 80

Scopus : 85

Captures

Mendeley Readers : 66

SCOPUS™ Citations

85

checked on Mar 15, 2026

Web of Science™ Citations

69

checked on Mar 15, 2026

Page Views

5

checked on Mar 15, 2026

Google Scholar Logo
Google Scholar™
OpenAlex Logo
OpenAlex FWCI
8.4733

Sustainable Development Goals

SDG data is not available