Standard Co-Training in Multiword Expression Detection
Loading...
Files
Date
2017
Authors
Metin, Senem Kumova
Journal Title
Journal ISSN
Volume Title
Publisher
Springer International Publishing Ag
Open Access Color
HYBRID
Green Open Access
Yes
OpenAIRE Downloads
2
OpenAIRE Views
2
Publicly Funded
No
Abstract
Multiword expressions (MWEs) are units in language where multiple words unite without an obvious/known reason. Since MWEs occupy a prominent amount of space in both written and spoken language materials, identification of MWEs is accepted to be an important task in natural language processing. In this paper, considering MWE detection as a binary classification task, we propose to use a semi-supervised learning algorithm, standard co-training [1] Co-training is a semi-supervised method that employs two classifiers with two different views to label unlabeled data iteratively in order to enlarge the training sets of limited size. In our experiments, linguistic and statistical features that distinguish MWEs from random word combinations are utilized as two different views. Two different pairs of classifiers are employed with a group of experimental settings. The tests are performed on a Turkish MWE data set of 3946 positive and 4230 negative MWE candidates. The results showed that the classifier where statistical view is considered succeeds in MWE detection when the training set is enlarged by co-training.
Description
9th International Conference on Intelligent Human Computer Interaction (IHCI) -- DEC 11-13, 2017 -- Evry, FRANCE
Keywords
Multiword expression, Classification, Co-training
Fields of Science
Citation
WoS Q
N/A
Scopus Q
Q3

OpenCitations Citation Count
1
Source
Intellıgent Human Computer Interactıon, Ihcı 2017
Volume
10688
Issue
Start Page
178
End Page
188
PlumX Metrics
Citations
Scopus : 2
Captures
Mendeley Readers : 4
Google Scholar™


