Optimizing High-Dimensional Text Embeddings in Emotion Identification: a Sliding Window Approach

dc.contributor.author Uymaz, H.A.
dc.contributor.author Metin, S.K.
dc.date.accessioned 2025-01-25T17:06:40Z
dc.date.available 2025-01-25T17:06:40Z
dc.date.issued 2024
dc.description Institute for Systems and Technologies of Information, Control and Communication (INSTICC) en_US
dc.description.abstract Natural language processing (NLP) is an interdisciplinary field that enables machines to understand and generate human language. One of the crucial steps in several NLP tasks, such as emotion and sentiment analysis, text similarity, summarization, and classification, is transforming textual data sources into numerical form, a process called vectorization. This process can be grouped into traditional, semantic, and contextual vectorization methods. Despite their advantages, these high-dimensional vectors pose memory and computational challenges. To address these issues, we employed a sliding window technique to partition high-dimensional vectors, aiming not only to enhance computational efficiency but also to detect emotional information within specific vector dimensions. Our experiments utilized emotion lexicon words and emotionally labeled sentences in both English and Turkish. By systematically analyzing the vectors, we identified consistent patterns with emotional clues. Our findings suggest that focusing on specific sub-vectors rather than entire high-dimensional BERT vectors can capture emotional information effectively, without performance loss. With this approach, we examined an increase in pairwise cosine similarity scores within emotion categories when using only sub-vectors. The results highlight the potential of the use of sub-vector techniques, offering insights into the nuanced integration of emotions in language and the applicability of these methods across different languages. © 2024 by SCITEPRESS – Science and Technology Publications, Lda. en_US
dc.identifier.doi 10.5220/0012899300003838
dc.identifier.isbn 978-989758716-0
dc.identifier.issn 2184-3228
dc.identifier.scopus 2-s2.0-85215266075
dc.identifier.uri https://doi.org/10.5220/0012899300003838
dc.identifier.uri https://hdl.handle.net/20.500.14365/5851
dc.language.iso en en_US
dc.publisher Science and Technology Publications, Lda en_US
dc.relation.ispartof International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management, IC3K - Proceedings -- 16th International Conference on Knowledge Discovery and Information Retrieval, KDIR 2024 as part of 16th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management, IC3K 2024 -- 17 November 2024 through 19 November 2024 -- Porto -- 205736 en_US
dc.rights info:eu-repo/semantics/closedAccess en_US
dc.subject Emotion en_US
dc.subject Large Language Models en_US
dc.subject Natural Language Processing en_US
dc.subject Vector Space Models en_US
dc.title Optimizing High-Dimensional Text Embeddings in Emotion Identification: a Sliding Window Approach en_US
dc.type Conference Object en_US
dspace.entity.type Publication
gdc.author.scopusid 57195217693
gdc.author.scopusid 24471923700
gdc.bip.impulseclass C5
gdc.bip.influenceclass C5
gdc.bip.popularityclass C5
gdc.coar.access metadata only access
gdc.coar.type text::conference output
gdc.collaboration.industrial false
gdc.description.department İzmir Ekonomi Üniversitesi en_US
gdc.description.departmenttemp Uymaz H.A., Izmir University of Economics, Department of Software Engineering, Izmir, Turkey; Metin S.K., Izmir University of Economics, Department of Software Engineering, Izmir, Turkey en_US
gdc.description.endpage 266 en_US
gdc.description.publicationcategory Konferans Öğesi - Uluslararası - Kurum Öğretim Elemanı en_US
gdc.description.scopusquality Q4
gdc.description.startpage 258 en_US
gdc.description.volume 1 en_US
gdc.description.wosquality N/A
gdc.identifier.openalex W4404579388
gdc.index.type Scopus
gdc.oaire.diamondjournal false
gdc.oaire.impulse 1.0
gdc.oaire.influence 2.5853488E-9
gdc.oaire.isgreen false
gdc.oaire.popularity 3.1488547E-9
gdc.oaire.publicfunded false
gdc.openalex.collaboration National
gdc.openalex.fwci 0.3626
gdc.openalex.normalizedpercentile 0.69
gdc.opencitations.count 0
gdc.plumx.mendeley 1
gdc.plumx.scopuscites 1
gdc.scopus.citedcount 1
gdc.virtual.author Aka Uymaz, Hande
gdc.virtual.author Kumova Metin, Senem
relation.isAuthorOfPublication e6d36a8f-f3ca-479f-9d2e-8c8e6afe1b2a
relation.isAuthorOfPublication 81d6fcea-c590-42aa-8443-7459c9eab7fa
relation.isAuthorOfPublication.latestForDiscovery e6d36a8f-f3ca-479f-9d2e-8c8e6afe1b2a
relation.isOrgUnitOfPublication 805c60d5-b806-4645-8214-dd40524c388f
relation.isOrgUnitOfPublication 26a7372c-1a5e-42d9-90b6-a3f7d14cad44
relation.isOrgUnitOfPublication e9e77e3e-bc94-40a7-9b24-b807b2cd0319
relation.isOrgUnitOfPublication.latestForDiscovery 805c60d5-b806-4645-8214-dd40524c388f

Files