Please use this identifier to cite or link to this item: https://hdl.handle.net/20.500.14365/3614
Title: The impact of sentence embeddings in Turkish paraphrase detection
Other Titles: Türkçe Eşanlatim Tespitinde Cümle Temsillerinin Etkisi
Authors: Karaoglan B.
Yorgancioglu H.E.
Kisla T.
Kumova Metin S.
Keywords: Paraphrasing
Praphrase corpus
Sentence embedding
Word embedding
Linguistics
Natural language processing systems
Signal processing
NAtural language processing
Optimization of parameters
Paraphrase corpus
Paraphrase identifications
Paraphrasing
Praphrase corpus
Sentence embedding
Word embedding
Embeddings
Publisher: Institute of Electrical and Electronics Engineers Inc.
Abstract: In recent studies, it is shown that word embeddings achieve in several natural language processing (NLP) tasks. Though paraphrase identification in Turkish is well-studied by traditional statistical NLP methods, to the best of our knowledge there exists no study where word and/or sentence embeddings are employed. In this paper, three methods, which are well-known as 'using average vector for word embeddings' (AWE), 'concatenated vectors for word embeddings' (CWE) and 'word mover's distance word embeddings' (WMDWE) to build sentence embeddings from word embeddings are examined and their effect in performance of paraphrase identification is measured. The results are presented comparatively for English (MSRP) and Turkish (PARDER and TuPC) paraphrase corpora. The study doesn't cover the optimization of parameters used in training of word embeddings and also the features specific to Turkish langauge are not considered. Despite this naive approach, the test results obtained from PARDER corpus are inspiring that a more detailed study that involves such improvements may result with more convincing performance values. © 2019 IEEE.
Description: 27th Signal Processing and Communications Applications Conference, SIU 2019 -- 24 April 2019 through 26 April 2019 -- 151073
URI: https://doi.org/10.1109/SIU.2019.8806506
https://hdl.handle.net/20.500.14365/3614
ISBN: 9.78173E+12
Appears in Collections:Scopus İndeksli Yayınlar Koleksiyonu / Scopus Indexed Publications Collection
WoS İndeksli Yayınlar Koleksiyonu / WoS Indexed Publications Collection

Files in This Item:
File SizeFormat 
2704.pdf
  Restricted Access
3.09 MBAdobe PDFView/Open    Request a copy
Show full item record



CORE Recommender

Page view(s)

72
checked on Sep 30, 2024

Download(s)

6
checked on Sep 30, 2024

Google ScholarTM

Check




Altmetric


Items in GCRIS Repository are protected by copyright, with all rights reserved, unless otherwise indicated.