Please use this identifier to cite or link to this item:
https://hdl.handle.net/20.500.14365/823
Title: | Standard Co-training in Multiword Expression Detection | Authors: | Metin, Senem Kumova | Keywords: | Multiword expression Classification Co-training |
Publisher: | Springer International Publishing Ag | Abstract: | Multiword expressions (MWEs) are units in language where multiple words unite without an obvious/known reason. Since MWEs occupy a prominent amount of space in both written and spoken language materials, identification of MWEs is accepted to be an important task in natural language processing. In this paper, considering MWE detection as a binary classification task, we propose to use a semi-supervised learning algorithm, standard co-training [1] Co-training is a semi-supervised method that employs two classifiers with two different views to label unlabeled data iteratively in order to enlarge the training sets of limited size. In our experiments, linguistic and statistical features that distinguish MWEs from random word combinations are utilized as two different views. Two different pairs of classifiers are employed with a group of experimental settings. The tests are performed on a Turkish MWE data set of 3946 positive and 4230 negative MWE candidates. The results showed that the classifier where statistical view is considered succeeds in MWE detection when the training set is enlarged by co-training. | Description: | 9th International Conference on Intelligent Human Computer Interaction (IHCI) -- DEC 11-13, 2017 -- Evry, FRANCE | URI: | https://doi.org/10.1007/978-3-319-72038-8_14 https://hdl.handle.net/20.500.14365/823 |
ISBN: | 978-3-319-72038-8 978-3-319-72037-1 |
ISSN: | 0302-9743 1611-3349 |
Appears in Collections: | Scopus İndeksli Yayınlar Koleksiyonu / Scopus Indexed Publications Collection WoS İndeksli Yayınlar Koleksiyonu / WoS Indexed Publications Collection |
Show full item record
CORE Recommender
SCOPUSTM
Citations
2
checked on Nov 20, 2024
Page view(s)
72
checked on Nov 18, 2024
Download(s)
14
checked on Nov 18, 2024
Google ScholarTM
Check
Altmetric
Items in GCRIS Repository are protected by copyright, with all rights reserved, unless otherwise indicated.