Exploring the Effectiveness of LLM-Generated Context on Emotion Lexicon Word Vectorization: A Comparative Study on Turkish and English
| dc.contributor.author | Kumova Metin, Senem | |
| dc.contributor.author | Aka Uymaz, Hande | |
| dc.date.accessioned | 2025-11-25T15:25:14Z | |
| dc.date.available | 2025-11-25T15:25:14Z | |
| dc.date.issued | 2025 | |
| dc.description.abstract | This study explores the impact of large language models (LLMs) on emotion lexicon word vectorization on Turkish and English. Emotion analysis involves extracting affective information from various data sources, with text being a primary medium. While traditional vectorization methods lack semantic meaning, contextual vectors, such as bidirectional encoder representations from transformers (BERT), aim to capture the context of words, leading to improved performance in natural language processing tasks. We investigate the efficacy of context sentences from human-annotated datasets and sentences generated by Gemini-Pro LLM in creating word vectors. Additionally, we introduce a manually annotated Turkish emotion and sentiment lexicon (TES-Lex). Performance evaluation is conducted for both Turkish and English using BERT vectors with two approaches: cosine similarity and machine learning. Our findings indicate that LLM-generated context sentences significantly enhance the quality of word vectors, especially in Turkish, underscoring the potential of LLMs in augmenting emotion lexicon resources in low-resourced languages. | en_US |
| dc.description.sponsorship | Izmir University of Economics Coordinatorship of Scientific Research Projects [BAP2022-6] | en_US |
| dc.description.sponsorship | This work is carried out under the grant of Izmir University of Economics Coordinatorship of Scientific Research Projects, Project BAP2022-6, Building a Turkish Dataset for Emotion Enriched Vector Space Models. | en_US |
| dc.identifier.doi | 10.1109/MITP.2025.3572550 | |
| dc.identifier.issn | 1520-9202 | |
| dc.identifier.issn | 1941-045X | |
| dc.identifier.scopus | 2-s2.0-105020371802 | |
| dc.identifier.uri | https://doi.org/10.1109/MITP.2025.3572550 | |
| dc.identifier.uri | https://hdl.handle.net/20.500.14365/6597 | |
| dc.language.iso | en | en_US |
| dc.publisher | IEEE Computer Soc | en_US |
| dc.relation.ispartof | IT Professional | en_US |
| dc.rights | info:eu-repo/semantics/closedAccess | en_US |
| dc.subject | Performance Evaluation | en_US |
| dc.subject | Soft Sensors | en_US |
| dc.subject | Semantics | en_US |
| dc.subject | Lexicon | en_US |
| dc.subject | Bidirectional Control | en_US |
| dc.subject | Transformers | en_US |
| dc.subject | Encoding | en_US |
| dc.subject | Robustness | en_US |
| dc.subject | Natural Language Processing | en_US |
| dc.subject | Large Language Models | en_US |
| dc.subject | Pareto Optimization | en_US |
| dc.title | Exploring the Effectiveness of LLM-Generated Context on Emotion Lexicon Word Vectorization: A Comparative Study on Turkish and English | en_US |
| dc.type | Article | en_US |
| dspace.entity.type | Publication | |
| gdc.author.id | 0000-0002-9606-3625 | |
| gdc.author.scopusid | 24471923700 | |
| gdc.author.scopusid | 57195217693 | |
| gdc.author.wosid | Aka Uymaz, Hande/Jzt-3644-2024 | |
| gdc.bip.impulseclass | C5 | |
| gdc.bip.influenceclass | C5 | |
| gdc.bip.popularityclass | C5 | |
| gdc.coar.access | metadata only access | |
| gdc.coar.type | text::journal::journal article | |
| gdc.collaboration.industrial | false | |
| gdc.description.department | İzmir Ekonomi Üniversitesi | en_US |
| gdc.description.departmenttemp | [Kumova Metin, Senem; Aka Uymaz, Hande] Izmir Univ Econ, Dept Software Engn, TR-35330 Izmir, Turkiye | en_US |
| gdc.description.endpage | 58 | en_US |
| gdc.description.issue | 5 | en_US |
| gdc.description.publicationcategory | Makale - Uluslararası Hakemli Dergi - Kurum Öğretim Elemanı | en_US |
| gdc.description.scopusquality | Q2 | |
| gdc.description.startpage | 52 | en_US |
| gdc.description.volume | 27 | en_US |
| gdc.description.woscitationindex | Science Citation Index Expanded | |
| gdc.description.wosquality | Q2 | |
| gdc.identifier.openalex | W4415593943 | |
| gdc.identifier.wos | WOS:001606200700002 | |
| gdc.index.type | WoS | |
| gdc.index.type | Scopus | |
| gdc.oaire.impulse | 0.0 | |
| gdc.oaire.influence | 2.5349236E-9 | |
| gdc.oaire.popularity | 2.8669784E-9 | |
| gdc.openalex.collaboration | National | |
| gdc.openalex.fwci | 0.0 | |
| gdc.openalex.normalizedpercentile | 0.17 | |
| gdc.openalex.toppercent | TOP 10% | |
| gdc.opencitations.count | 0 | |
| gdc.plumx.mendeley | 1 | |
| gdc.plumx.newscount | 1 | |
| gdc.plumx.scopuscites | 1 | |
| gdc.scopus.citedcount | 1 | |
| gdc.virtual.author | Aka Uymaz, Hande | |
| gdc.virtual.author | Kumova Metin, Senem | |
| gdc.wos.citedcount | 0 | |
| relation.isAuthorOfPublication | e6d36a8f-f3ca-479f-9d2e-8c8e6afe1b2a | |
| relation.isAuthorOfPublication | 81d6fcea-c590-42aa-8443-7459c9eab7fa | |
| relation.isAuthorOfPublication.latestForDiscovery | e6d36a8f-f3ca-479f-9d2e-8c8e6afe1b2a | |
| relation.isOrgUnitOfPublication | 805c60d5-b806-4645-8214-dd40524c388f | |
| relation.isOrgUnitOfPublication | 26a7372c-1a5e-42d9-90b6-a3f7d14cad44 | |
| relation.isOrgUnitOfPublication | e9e77e3e-bc94-40a7-9b24-b807b2cd0319 | |
| relation.isOrgUnitOfPublication.latestForDiscovery | 805c60d5-b806-4645-8214-dd40524c388f |
