Speech Emotion Recognition: Emotional Models, Databases, Features, Preprocessing Methods, Supporting Modalities, and Classifiers
Loading...
Files
Date
2020
Authors
Oguz, Kaya
Journal Title
Journal ISSN
Volume Title
Publisher
Elsevier
Open Access Color
Green Open Access
No
OpenAIRE Downloads
OpenAIRE Views
Publicly Funded
No
Abstract
Speech is the most natural way of expressing ourselves as humans. It is only natural then to extend this communication medium to computer applications. We define speech emotion recognition (SER) systems as a collection of methodologies that process and classify speech signals to detect the embedded emotions. SER is not a new field, it has been around for over two decades, and has regained attention thanks to the recent advancements. These novel studies make use of the advances in all fields of computing and technology, making it necessary to have an update on the current methodologies and techniques that make SER possible. We have identified and discussed distinct areas of SER, provided a detailed survey of current literature of each, and also listed the current challenges.
Description
Keywords
Speech emotion recognition, Survey, Speech features, Classification, Speech databases, Voice Quality, Communicating Emotion, Spectral Features, Neural-Networks, Classification, Valence, Expression, Arousal, Adversarial, Audio
Fields of Science
0202 electrical engineering, electronic engineering, information engineering, 02 engineering and technology
Citation
WoS Q
Q1
Scopus Q
Q2

OpenCitations Citation Count
481
Source
Speech Communıcatıon
Volume
116
Issue
Start Page
56
End Page
76
PlumX Metrics
Citations
CrossRef : 534
Scopus : 623
Captures
Mendeley Readers : 658
SCOPUS™ Citations
623
checked on Feb 14, 2026
Web of Science™ Citations
406
checked on Feb 14, 2026
Page Views
3
checked on Feb 14, 2026
Google Scholar™


