Enhancing Text Embeddings for Emotion Detection: A Study on Dimensionality Reduction and Lexicon Filtering

Date

2026

Authors

Uymaz, H.
Kumova Metin, S.

Publisher

Springer Science and Business Media Deutschland GmbH

Green Open Access

No

Publicly Funded

No

Abstract

Emotion detection in textual data is a crucial task in natural language processing (NLP), yet standard word embeddings often fail to capture emotional nuances. This study explores an emotion-enrichment approach that refines text representations by integrating emotional information into word embeddings. It addresses two key challenges: the limitations of emotion lexicons, which may include ambiguous or misclassified words, and high-dimensional vector representations, which may increase computational complexity. Because lexicons are an important data source in emotion-enrichment studies, a filtering mechanism is introduced to improve lexicon quality: it removes words with inconsistent emotional associations, which increases lexicon precision. Additionally, a sliding window-based dimensionality reduction method is applied to BERT embeddings to identify emotion-rich vector segments, reducing computational cost while preserving emotional information. Experiments are conducted in both English and Turkish to evaluate the impact of lexicon filtering and dimensionality reduction on emotion detection. Results show that filtering improves the accuracy of emotion-enriched representations, while sub-vector selection makes it possible to identify the vector segments that best represent emotional content. By focusing on emotion-relevant vector dimensions, the proposed method outperforms full-dimensional embeddings. This research contributes to multilingual emotion representation by refining lexicon-based enrichment strategies and optimizing embedding spaces for emotion detection. The findings highlight the importance of structured lexicon filtering and targeted dimensionality reduction in improving sentiment and emotion analysis models. © 2025 Elsevier B.V., All rights reserved.
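
The sliding-window sub-vector selection mentioned in the abstract could be sketched roughly as follows. This is a minimal illustration and not the authors' implementation: the abstract does not say how a window is judged "emotion-rich", so the scoring rule here (a Fisher-style ratio of between-class to within-class variance) is an assumption, as are the function names `window_score`, `best_window`, and `select_segment` and the tiny 6-dimensional toy vectors that stand in for BERT embeddings.

```python
from statistics import mean, pvariance


def window_score(vectors, labels, start, width):
    """Score one window of dimensions (assumed criterion, not from the paper):
    average per-dimension ratio of between-class to within-class variance."""
    classes = sorted(set(labels))
    ratios = []
    for d in range(start, start + width):
        # Values of dimension d, grouped by emotion label.
        groups = [[v[d] for v, y in zip(vectors, labels) if y == c] for c in classes]
        between = pvariance([mean(g) for g in groups])
        within = mean(pvariance(g) for g in groups)
        ratios.append(between / (within + 1e-9))  # epsilon avoids division by zero
    return mean(ratios)


def best_window(vectors, labels, width, stride=1):
    """Slide a fixed-width window over the dimensions; return the best start index."""
    dim = len(vectors[0])
    starts = range(0, dim - width + 1, stride)
    return max(starts, key=lambda s: window_score(vectors, labels, s, width))


def select_segment(vectors, start, width):
    """Keep only the chosen contiguous segment of every vector."""
    return [v[start:start + width] for v in vectors]


# Toy data (hypothetical): dimensions 2 and 3 carry the emotion signal.
vectors = [
    [0.1, 0.9, 1.0, 1.1, 0.5, 0.2],
    [0.8, 0.2, 0.9, 1.0, 0.4, 0.7],
    [0.3, 0.6, -1.0, -0.9, 0.6, 0.1],
    [0.7, 0.4, -1.1, -1.0, 0.5, 0.8],
]
labels = ["joy", "joy", "anger", "anger"]

start = best_window(vectors, labels, width=2)
reduced = select_segment(vectors, start, width=2)
```

On this toy input the window starting at dimension 2 is chosen, so each vector shrinks from six values to two. With real embeddings, the window width and stride would be tuning parameters, and the score would more plausibly come from a downstream emotion classifier than from a variance ratio.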

Keywords

BERT, Dimensionality Reduction, Emotion Detection, Emotion Enrichment, Emotion Lexicon, Natural Language Processing, NLP

Scopus Q

Q4

Source

Communications in Computer and Information Science

Volume

2703 CCIS

Start Page

37

End Page

56