Hybrid Emotion Detection With Word Embeddings in a Low Resourced Language: Turkish

Metin, S.K.; Giraz, H.E.

Hybrid Emotion Detection With Word Embeddings in a Low Resourced Language: Turkish

Files

5475.pdf (171.77 KB)

Date

2024

Authors

Metin, S.K.

Giraz, H.E.

Publisher

Science and Information Organization

Open Access Color

GOLD

Green Open Access

No

Publicly Funded

No

Impulse

Average

Influence

Average

Popularity

Average

Abstract

Through natural language processing, subjective information can be obtained from written sources such as suggestions, reviews, and social media publications. Understanding and knowing the user experience or in other words the feelings/emotions of user on any type of product or situation directly affects the decisions to be taken on the regarding product or service. In this study, we focus on a hybrid approach of textbased emotion detection. We combined keyword and lexiconbased approaches by the use of word embeddings. In emotion detection, simply lexicon words/keywords and text units are compared in several different ways and the comparison results are used in emotion identification experiments. As this identification procedure is examined, it is explicit that the performance depends mainly on two actors: the lexicon/keyword list and the representation of text unit. We propose to employ word vectors/embeddings on both actors. Firstly, we propose a hybrid approach that uses word vector similarities in order to determine lexicon words, on contrary to traditional approaches that employs all arbitrary words in given text. By our approach, the overall effort in emotion identification is to be reduced by decreasing the number of arbitrary words that do not carry the emotive content. Moreover, the hybrid approach will decrease the need for crowdsourcing in lexicon word labelling. Secondly, we propose to build the representations of text units by measuring their word vector similarities to given lexicon. We built up two lexicons by our approach and presented three different comparison metrics based on embedding similarities. Emotion identification experiments are performed employing both unsupervised and supervised methods on Turkish text. The experimental results showed that employing the hybrid approach that involves word embeddings is promising on Turkish texts and also due to its flexible and languageindependent structure it can be improved and used in studies on different languages. © (2024), (Science and Information Organization). All Rights Reserved.

Keywords

Emotion detection, Turkish, vector similarity, word embedding, Emotion Recognition, Vectors, Embeddings, Emotion detection, Emotion identifications, Hybrid approach, Lexicon words, Turkish texts, Turkishs, Vector similarity, Word embedding, Word vectors, Embeddings

WoS Q

Q3

Scopus Q

Q3

OpenCitations Citation Count

N/A

Source

International Journal of Advanced Computer Science and Applications

Volume

15

Issue

6

Start Page

1449

End Page

1457

URI

https://doi.org/10.14569/IJACSA.2024.01506145
https://hdl.handle.net/20.500.14365/5475

Collections

Scopus İndeksli Yayınlar Koleksiyonu / Scopus Indexed Publications Collection

PlumX Metrics

Citations

Scopus : 2

Captures

Mendeley Readers : 1

Full item page

Google Scholar™

Check

Hybrid Emotion Detection With Word Embeddings in a Low Resourced Language: Turkish

Files

Date

Authors

Journal Title

Journal ISSN

Volume Title

Publisher

Open Access Color

Green Open Access

OpenAIRE Downloads

OpenAIRE Views

Publicly Funded

BIP! Indicators

Research Projects

Journal Issue

Abstract

Description

Keywords

Fields of Science

Citation

WoS Q

Scopus Q

OpenCitations Citation Count

Source

Volume

Issue

Start Page

End Page

URI

Collections

PlumX Metrics

Citations

Captures

Google Scholar™

OpenAlex FWCI

0.0

Sustainable Development Goals