Optimizing High-Dimensional Text Embeddings in Emotion Identification: a Sliding Window Approach
| dc.contributor.author | Uymaz, H.A. | |
| dc.contributor.author | Metin, S.K. | |
| dc.date.accessioned | 2025-01-25T17:06:40Z | |
| dc.date.available | 2025-01-25T17:06:40Z | |
| dc.date.issued | 2024 | |
| dc.description | Institute for Systems and Technologies of Information, Control and Communication (INSTICC) | en_US |
| dc.description.abstract | Natural language processing (NLP) is an interdisciplinary field that enables machines to understand and generate human language. One of the crucial steps in several NLP tasks, such as emotion and sentiment analysis, text similarity, summarization, and classification, is transforming textual data sources into numerical form, a process called vectorization. This process can be grouped into traditional, semantic, and contextual vectorization methods. Despite their advantages, these high-dimensional vectors pose memory and computational challenges. To address these issues, we employed a sliding window technique to partition high-dimensional vectors, aiming not only to enhance computational efficiency but also to detect emotional information within specific vector dimensions. Our experiments utilized emotion lexicon words and emotionally labeled sentences in both English and Turkish. By systematically analyzing the vectors, we identified consistent patterns with emotional clues. Our findings suggest that focusing on specific sub-vectors rather than entire high-dimensional BERT vectors can capture emotional information effectively, without performance loss. With this approach, we examined an increase in pairwise cosine similarity scores within emotion categories when using only sub-vectors. The results highlight the potential of the use of sub-vector techniques, offering insights into the nuanced integration of emotions in language and the applicability of these methods across different languages. © 2024 by SCITEPRESS – Science and Technology Publications, Lda. | en_US |
| dc.identifier.doi | 10.5220/0012899300003838 | |
| dc.identifier.isbn | 978-989758716-0 | |
| dc.identifier.issn | 2184-3228 | |
| dc.identifier.scopus | 2-s2.0-85215266075 | |
| dc.identifier.uri | https://doi.org/10.5220/0012899300003838 | |
| dc.identifier.uri | https://hdl.handle.net/20.500.14365/5851 | |
| dc.language.iso | en | en_US |
| dc.publisher | Science and Technology Publications, Lda | en_US |
| dc.relation.ispartof | International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management, IC3K - Proceedings -- 16th International Conference on Knowledge Discovery and Information Retrieval, KDIR 2024 as part of 16th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management, IC3K 2024 -- 17 November 2024 through 19 November 2024 -- Porto -- 205736 | en_US |
| dc.rights | info:eu-repo/semantics/closedAccess | en_US |
| dc.subject | Emotion | en_US |
| dc.subject | Large Language Models | en_US |
| dc.subject | Natural Language Processing | en_US |
| dc.subject | Vector Space Models | en_US |
| dc.title | Optimizing High-Dimensional Text Embeddings in Emotion Identification: a Sliding Window Approach | en_US |
| dc.type | Conference Object | en_US |
| dspace.entity.type | Publication | |
| gdc.author.scopusid | 57195217693 | |
| gdc.author.scopusid | 24471923700 | |
| gdc.bip.impulseclass | C5 | |
| gdc.bip.influenceclass | C5 | |
| gdc.bip.popularityclass | C5 | |
| gdc.coar.access | metadata only access | |
| gdc.coar.type | text::conference output | |
| gdc.collaboration.industrial | false | |
| gdc.description.department | İzmir Ekonomi Üniversitesi | en_US |
| gdc.description.departmenttemp | Uymaz H.A., Izmir University of Economics, Department of Software Engineering, Izmir, Turkey; Metin S.K., Izmir University of Economics, Department of Software Engineering, Izmir, Turkey | en_US |
| gdc.description.endpage | 266 | en_US |
| gdc.description.publicationcategory | Konferans Öğesi - Uluslararası - Kurum Öğretim Elemanı | en_US |
| gdc.description.scopusquality | Q4 | |
| gdc.description.startpage | 258 | en_US |
| gdc.description.volume | 1 | en_US |
| gdc.description.wosquality | N/A | |
| gdc.identifier.openalex | W4404579388 | |
| gdc.index.type | Scopus | |
| gdc.oaire.diamondjournal | false | |
| gdc.oaire.impulse | 1.0 | |
| gdc.oaire.influence | 2.5853488E-9 | |
| gdc.oaire.isgreen | false | |
| gdc.oaire.popularity | 3.1488547E-9 | |
| gdc.oaire.publicfunded | false | |
| gdc.openalex.collaboration | National | |
| gdc.openalex.fwci | 0.3626 | |
| gdc.openalex.normalizedpercentile | 0.69 | |
| gdc.opencitations.count | 0 | |
| gdc.plumx.mendeley | 1 | |
| gdc.plumx.scopuscites | 1 | |
| gdc.scopus.citedcount | 1 | |
| gdc.virtual.author | Aka Uymaz, Hande | |
| gdc.virtual.author | Kumova Metin, Senem | |
| relation.isAuthorOfPublication | e6d36a8f-f3ca-479f-9d2e-8c8e6afe1b2a | |
| relation.isAuthorOfPublication | 81d6fcea-c590-42aa-8443-7459c9eab7fa | |
| relation.isAuthorOfPublication.latestForDiscovery | e6d36a8f-f3ca-479f-9d2e-8c8e6afe1b2a | |
| relation.isOrgUnitOfPublication | 805c60d5-b806-4645-8214-dd40524c388f | |
| relation.isOrgUnitOfPublication | 26a7372c-1a5e-42d9-90b6-a3f7d14cad44 | |
| relation.isOrgUnitOfPublication | e9e77e3e-bc94-40a7-9b24-b807b2cd0319 | |
| relation.isOrgUnitOfPublication.latestForDiscovery | 805c60d5-b806-4645-8214-dd40524c388f |
