Please use this identifier to cite or link to this item:
https://hdl.handle.net/20.500.14365/1089
Title: | Hand gesture classification using time-frequency images and transfer learning based on CNN | Authors: | Ozdemir, Mehmet Akif Kisa, Deniz Hande Guren, Onan Akan, Aydin |
Keywords: | Hand Gesture Classification Convolutional Neural Networks (CNN) Electromyogram (EMG) STFT CWT Hilbert-Huang Transform (HHT) Hilbert Spectrum Emg Recognition |
Publisher: | Elsevier Sci Ltd | Abstract: | Hand gesture-based systems are one of the most effective technological advances and continue to develop with improvements in the field of human-computer interaction. Surface electromyography (sEMG) is utilized as a popular input data for gesture classification with elevated accuracy and advanced control capability. This paper presents a comparative hand gesture classification approach using time-frequency (TF) images of the spontaneous sEMG signals and the transfer learning method. 4-channel sEMG signals are collected from 30 subjects performing 7 specific hand gestures. After the required pre-processing, segmentation, and windowing steps, three TF analysis methods, namely Short-Time Fourier Transform (STFT), Continuous Wavelet Transform (CWT), and Hilbert-Huang Transform (HHT), are applied to EMG signals to obtain TF images. Spectrograms from STFT, scalograms from CWT, and Hilbert-Huang spectra (HHS) from HHT obtained from multi-channel sEMG data are separately fused. TF images are then utilized to extract distinct features using seven state-of-the-art, pre-trained Convolutional Neural Network (CNN) architectures and classify seven hand gestures. Two different robust crossvalidation strategies are conducted to evaluate the proposed method; stratified k-fold cross-validation (SKCV) and leave-one-subject-out cross-validation (LOOCV). We also investigate the effect of window size and the combination of Intrinsic Mode Functions (IMFs) on classification performance. The results demonstrated that the HHT utilizing IMFs obtained by Empirical Mode Decomposition (EMD) provided improved TF resolution and better results than STFT and CWT in the classification of sEMG signals. Finally, the best average accuracies (93.75% for SKCV) and (94.41% for LOOCV) are obtained by the HHT method with the pre-trained ResNet-50 model. | URI: | https://doi.org/10.1016/j.bspc.2022.103787 https://hdl.handle.net/20.500.14365/1089 |
ISSN: | 1746-8094 1746-8108 |
Appears in Collections: | Scopus İndeksli Yayınlar Koleksiyonu / Scopus Indexed Publications Collection WoS İndeksli Yayınlar Koleksiyonu / WoS Indexed Publications Collection |
Files in This Item:
File | Size | Format | |
---|---|---|---|
99.pdf Restricted Access | 6.04 MB | Adobe PDF | View/Open Request a copy |
CORE Recommender
SCOPUSTM
Citations
51
checked on Nov 20, 2024
WEB OF SCIENCETM
Citations
39
checked on Nov 20, 2024
Page view(s)
136
checked on Nov 18, 2024
Download(s)
6
checked on Nov 18, 2024
Google ScholarTM
Check
Altmetric
Items in GCRIS Repository are protected by copyright, with all rights reserved, unless otherwise indicated.