Exploring the Effectiveness of LLM-Generated Context on Emotion Lexicon Word Vectorization: A Comparative Study on Turkish and English

Date
2025
Authors
Kumova Metin, Senem
Aka Uymaz, Hande
Publisher
IEEE Computer Soc
Abstract
This study explores the impact of large language models (LLMs) on emotion lexicon word vectorization in Turkish and English. Emotion analysis involves extracting affective information from various data sources, with text being a primary medium. While traditional vectorization methods do not capture semantic meaning, contextual vectors, such as those produced by bidirectional encoder representations from transformers (BERT), aim to capture the context of words, leading to improved performance in natural language processing tasks. We investigate the efficacy of two sources of context sentences for creating word vectors: sentences from human-annotated datasets and sentences generated by the Gemini-Pro LLM. Additionally, we introduce a manually annotated Turkish emotion and sentiment lexicon (TES-Lex). Performance is evaluated for both Turkish and English using BERT vectors with two approaches: cosine similarity and machine learning. Our findings indicate that LLM-generated context sentences significantly enhance the quality of word vectors, especially in Turkish, underscoring the potential of LLMs in augmenting emotion lexicon resources in low-resource languages.
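The cosine-similarity evaluation mentioned in the abstract can be illustrated with a minimal sketch. The abstract does not specify how context vectors are pooled or how labels are assigned, so everything here is an assumption made for illustration: the mean pooling over context sentences, the nearest-centroid rule, the helper names (`mean_vector`, `cosine_similarity`, `nearest_emotion`), the emotion labels, and the toy 3-dimensional vectors standing in for BERT embeddings.

```python
import math


def mean_vector(vectors):
    """Average equal-length vectors element-wise (assumed pooling step)."""
    n = len(vectors)
    return [sum(column) / n for column in zip(*vectors)]


def cosine_similarity(a, b):
    """Cosine of the angle between two non-zero vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


def nearest_emotion(word_vector, emotion_centroids):
    """Return the emotion whose centroid is most similar to the word vector."""
    return max(
        emotion_centroids,
        key=lambda emotion: cosine_similarity(word_vector, emotion_centroids[emotion]),
    )


# Hypothetical data: one toy vector per context sentence containing the word.
# In practice these would be contextual embeddings of the word in each sentence.
context_vectors = [[0.9, 0.1, 0.0], [0.7, 0.3, 0.1]]
word_vector = mean_vector(context_vectors)

# Hypothetical emotion centroids; the labels are placeholders, not TES-Lex categories.
emotion_centroids = {"joy": [1.0, 0.0, 0.0], "anger": [0.0, 1.0, 0.0]}
predicted = nearest_emotion(word_vector, emotion_centroids)
```

Under this sketch, adding more context sentences for a word, whether human-annotated or LLM-generated, only changes the list passed to `mean_vector`, which makes the two context sources easy to compare side by side.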
Keywords
Performance Evaluation, Soft Sensors, Semantics, Lexicon, Bidirectional Control, Transformers, Encoding, Robustness, Natural Language Processing, Large Language Models, Pareto Optimization
WoS Q
Q2
Scopus Q
Q2

Source
IT Professional
Volume
27
Issue
5
Start Page
52
End Page
58