The Impact of Sentence Embeddings in Turkish Paraphrase Detection

Loading...
Publication Logo

Date

2019

Journal Title

Journal ISSN

Volume Title

Publisher

Institute of Electrical and Electronics Engineers Inc.

Open Access Color

Green Open Access

Yes

OpenAIRE Downloads

OpenAIRE Views

Publicly Funded

No
Impulse
Average
Influence
Average
Popularity
Average

Research Projects

Journal Issue

Abstract

In recent studies, it is shown that word embeddings achieve in several natural language processing (NLP) tasks. Though paraphrase identification in Turkish is well-studied by traditional statistical NLP methods, to the best of our knowledge there exists no study where word and/or sentence embeddings are employed. In this paper, three methods, which are well-known as 'using average vector for word embeddings' (AWE), 'concatenated vectors for word embeddings' (CWE) and 'word mover's distance word embeddings' (WMDWE) to build sentence embeddings from word embeddings are examined and their effect in performance of paraphrase identification is measured. The results are presented comparatively for English (MSRP) and Turkish (PARDER and TuPC) paraphrase corpora. The study doesn't cover the optimization of parameters used in training of word embeddings and also the features specific to Turkish langauge are not considered. Despite this naive approach, the test results obtained from PARDER corpus are inspiring that a more detailed study that involves such improvements may result with more convincing performance values. © 2019 IEEE.

Description

27th Signal Processing and Communications Applications Conference, SIU 2019 -- 24 April 2019 through 26 April 2019 -- 151073

Keywords

Paraphrasing, Praphrase corpus, Sentence embedding, Word embedding, Linguistics, Natural language processing systems, Signal processing, NAtural language processing, Optimization of parameters, Paraphrase corpus, Paraphrase identifications, Paraphrasing, Praphrase corpus, Sentence embedding, Word embedding, Embeddings

Fields of Science

Citation

WoS Q

N/A

Scopus Q

N/A
OpenCitations Logo
OpenCitations Citation Count
N/A

Source

27th Signal Processing and Communications Applications Conference, SIU 2019

Volume

Issue

Start Page

1

End Page

4
PlumX Metrics
Citations

Scopus : 0

Captures

Mendeley Readers : 3

Page Views

2

checked on Mar 16, 2026

Google Scholar Logo
Google Scholar™
OpenAlex Logo
OpenAlex FWCI
0.0

Sustainable Development Goals

SDG data is not available