Exploring the Effectiveness of LLM-Generated Context on Emotion Lexicon Word Vectorization: A Comparative Study on Turkish and English

Kumova Metin, Senem; Aka Uymaz, Hande

Exploring the Effectiveness of LLM-Generated Context on Emotion Lexicon Word Vectorization: A Comparative Study on Turkish and English

Files

6597.pdf (548.73 KB)

Date

2025-09

Authors

Kumova Metin, Senem

Aka Uymaz, Hande

Publisher

IEEE Computer Soc

Impulse

Average

Influence

Average

Popularity

Average

Abstract

This study explores the impact of large language models (LLMs) on emotion lexicon word vectorization on Turkish and English. Emotion analysis involves extracting affective information from various data sources, with text being a primary medium. While traditional vectorization methods lack semantic meaning, contextual vectors, such as bidirectional encoder representations from transformers (BERT), aim to capture the context of words, leading to improved performance in natural language processing tasks. We investigate the efficacy of context sentences from human-annotated datasets and sentences generated by Gemini-Pro LLM in creating word vectors. Additionally, we introduce a manually annotated Turkish emotion and sentiment lexicon (TES-Lex). Performance evaluation is conducted for both Turkish and English using BERT vectors with two approaches: cosine similarity and machine learning. Our findings indicate that LLM-generated context sentences significantly enhance the quality of word vectors, especially in Turkish, underscoring the potential of LLMs in augmenting emotion lexicon resources in low-resourced languages.

ORCID

0000-0002-9606-3625

Aka Uymaz, Hande

Keywords

Performance Evaluation, Soft Sensors, Semantics, Lexicon, Bidirectional Control, Transformers, Encoding, Robustness, Natural Language Processing, Large Language Models, Pareto Optimization

WoS Q

Q2

Scopus Q

Q2

OpenCitations Citation Count

N/A

Source

IT Professional

Volume

27

Issue

5

Start Page

52

End Page

58

URI

https://doi.org/10.1109/MITP.2025.3572550
https://hdl.handle.net/20.500.14365/6597

Collections

WoS İndeksli Yayınlar Koleksiyonu / WoS Indexed Publications Collection
Scopus İndeksli Yayınlar Koleksiyonu / Scopus Indexed Publications Collection

PlumX Metrics

Citations

Scopus : 1

Captures

Mendeley Readers : 1

Full item page

Google Scholar™

Check

Exploring the Effectiveness of LLM-Generated Context on Emotion Lexicon Word Vectorization: A Comparative Study on Turkish and English

Files

Date

Authors

Journal Title

Journal ISSN

Volume Title

Publisher

Open Access Color

OpenAIRE Downloads

OpenAIRE Views

BIP! Indicators

Research Projects

Journal Issue

Abstract

Description

ORCID

Keywords

Fields of Science

Citation

WoS Q

Scopus Q

OpenCitations Citation Count

Source

Volume

Issue

Start Page

End Page

URI

Collections

PlumX Metrics

Citations

Captures

Google Scholar™

OpenAlex FWCI

0.0

Sustainable Development Goals