Please use this identifier to cite or link to this item:
https://hdl.handle.net/20.500.14365/1218
Title: | Feature selection in multiword expression recognition | Authors: | Metin, Senem Kumova | Keywords: | Multiword expression Multiword expression recognition Learning algorithms Feature selection Named Entity Recognition |
Publisher: | Pergamon-Elsevier Science Ltd | Abstract: | In multiword expression (MWE) recognition, there exist many studies where different learning methods are employed to decide whether given word combination is a multiword expression. The recognition methods commonly utilize a number of features that are extracted from a data source, frequently from the given text. Though the recognition methods and the features are well studied, we believe that to achieve the best possible performance with a learning method, different subsets of features should also be considered and the best performing subset must be selected. In this paper, we propose a procedure that covers the performance comparison of well-known feature selection methods to obtain the best feature subset in MWE recognition. The evaluation tests are performed on a Turkish MWE data set and the performance is measured by precision, recall and Fl values. The highest Fl value =0.731 is obtained by C4.5 classifier employing either wrapper or filtering method in feature selection. In the regarding setting(s), it is examined that the performance is increased by 1.11% compared to the setting where all features are employed in classification. Based on the experimental results, it may be stated that feature selection improves the performance of MWE recognition by eliminating the noisy/non-effective features. Moreover, it is obvious that proposed feature selection method contributes to the overall MWE recognition system by reducing the measurement and storage requirements due to the lower number of features in classification, providing a faster and more -cost effective learning model. (C) 2017 Elsevier Ltd. All rights reserved. | URI: | https://doi.org/10.1016/j.eswa.2017.09.047 https://hdl.handle.net/20.500.14365/1218 |
ISSN: | 0957-4174 1873-6793 |
Appears in Collections: | Scopus İndeksli Yayınlar Koleksiyonu / Scopus Indexed Publications Collection WoS İndeksli Yayınlar Koleksiyonu / WoS Indexed Publications Collection |
Files in This Item:
File | Size | Format | |
---|---|---|---|
241.pdf Restricted Access | 1.91 MB | Adobe PDF | View/Open Request a copy |
CORE Recommender
SCOPUSTM
Citations
13
checked on Nov 20, 2024
WEB OF SCIENCETM
Citations
13
checked on Nov 20, 2024
Page view(s)
86
checked on Nov 18, 2024
Download(s)
6
checked on Nov 18, 2024
Google ScholarTM
Check
Altmetric
Items in GCRIS Repository are protected by copyright, with all rights reserved, unless otherwise indicated.