Please use this identifier to cite or link to this item:
https://hdl.handle.net/20.500.14365/2600
Full metadata record
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Kumova Metin, Senem | - |
dc.date.accessioned | 2023-06-16T14:41:20Z | - |
dc.date.available | 2023-06-16T14:41:20Z | - |
dc.date.issued | 2018 | - |
dc.identifier.issn | 1300-0632 | - |
dc.identifier.issn | 1303-6203 | - |
dc.identifier.uri | https://doi.org/10.3906/elk-1709-185 | - |
dc.identifier.uri | https://hdl.handle.net/20.500.14365/2600 | - |
dc.description.abstract | In multiword expressions (MWEs), multiple words unite to build a new unit in language. When MWE identification is accepted as a binary classification task, one of the most important factors in performance is to train the classifier with enough number of labelled samples. Since manual labelling is a time-consuming task, the performances of MWE recognition studies are limited with the size of the training sets. In this study, we propose the comparison-based and common-decision co-training approaches in order to enlarge the MWE dataset. In the experiments, the performances of the proposed approaches were compared to those of the standard co-training [1] and manual labelling where statistical and linguistic features are employed as two different views of the MWE dataset [2]. A number of tests with different settings were performed on a Turkish MWE dataset. Ten different classifiers were utilized in the experiments and the best performing classifier pair was observed to be the SMO-SMO pair. The experimental results showed that the common-decision co-training approach is an alternative to hand-labeling of large MWE datasets and both newly proposed approaches outperform the standard co-training [2] when the training set is to be enlarged in MWE classification. | en_US |
dc.description.sponsorship | Scientific and Technological Research Council of Turkey [115E469] | en_US |
dc.description.sponsorship | This work was carried out under the grant of The Scientific and Technological Research Council of Turkey (Project No. 115E469, Identification of Multiword Expressions in Turkish Texts). Further information/statistics on the MWE dataset is available on the project web page (http://app.ieu-nlpteam.com:8000). | en_US |
dc.language.iso | en | en_US |
dc.publisher | Scientific Technical Research Council Turkey-Tubitak | en_US |
dc.relation.ispartof | Turkısh Journal of Electrıcal Engıneerıng And Computer Scıences | en_US |
dc.rights | info:eu-repo/semantics/closedAccess | en_US |
dc.subject | Multiword expression | en_US |
dc.subject | classification | en_US |
dc.subject | training set | en_US |
dc.subject | co-training | en_US |
dc.title | Enlarging multiword expression dataset by co-training | en_US |
dc.type | Article | en_US |
dc.identifier.doi | 10.3906/elk-1709-185 | - |
dc.identifier.scopus | 2-s2.0-85054525652 | en_US |
dc.department | İzmir Ekonomi Üniversitesi | en_US |
dc.authorid | Kumova Metin, Senem/0000-0002-9606-3625 | - |
dc.authorscopusid | 24471923700 | - |
dc.identifier.volume | 26 | en_US |
dc.identifier.issue | 5 | en_US |
dc.identifier.startpage | 2583 | en_US |
dc.identifier.endpage | 2594 | en_US |
dc.identifier.wos | WOS:000448109200034 | en_US |
dc.relation.publicationcategory | Makale - Uluslararası Hakemli Dergi - Kurum Öğretim Elemanı | en_US |
dc.identifier.trdizinid | 323563 | en_US |
dc.identifier.scopusquality | Q3 | - |
dc.identifier.wosquality | Q4 | - |
item.grantfulltext | embargo_20300101 | - |
item.openairetype | Article | - |
item.openairecristype | http://purl.org/coar/resource_type/c_18cf | - |
item.fulltext | With Fulltext | - |
item.languageiso639-1 | en | - |
item.cerifentitytype | Publications | - |
crisitem.author.dept | 05.04. Software Engineering | - |
Appears in Collections: | Scopus İndeksli Yayınlar Koleksiyonu / Scopus Indexed Publications Collection TR Dizin İndeksli Yayınlar Koleksiyonu / TR Dizin Indexed Publications Collection WoS İndeksli Yayınlar Koleksiyonu / WoS Indexed Publications Collection |
Files in This Item:
File | Size | Format | |
---|---|---|---|
2600.pdf Until 2030-01-01 | 506.55 kB | Adobe PDF | View/Open Request a copy |
CORE Recommender
Page view(s)
98
checked on Nov 18, 2024
Download(s)
6
checked on Nov 18, 2024
Google ScholarTM
Check
Altmetric
Items in GCRIS Repository are protected by copyright, with all rights reserved, unless otherwise indicated.