Please use this identifier to cite or link to this item: https://hdl.handle.net/20.500.14365/1218
Full metadata record
DC FieldValueLanguage
dc.contributor.authorMetin, Senem Kumova-
dc.date.accessioned2023-06-16T12:59:26Z-
dc.date.available2023-06-16T12:59:26Z-
dc.date.issued2018-
dc.identifier.issn0957-4174-
dc.identifier.issn1873-6793-
dc.identifier.urihttps://doi.org/10.1016/j.eswa.2017.09.047-
dc.identifier.urihttps://hdl.handle.net/20.500.14365/1218-
dc.description.abstractIn multiword expression (MWE) recognition, there exist many studies where different learning methods are employed to decide whether given word combination is a multiword expression. The recognition methods commonly utilize a number of features that are extracted from a data source, frequently from the given text. Though the recognition methods and the features are well studied, we believe that to achieve the best possible performance with a learning method, different subsets of features should also be considered and the best performing subset must be selected. In this paper, we propose a procedure that covers the performance comparison of well-known feature selection methods to obtain the best feature subset in MWE recognition. The evaluation tests are performed on a Turkish MWE data set and the performance is measured by precision, recall and Fl values. The highest Fl value =0.731 is obtained by C4.5 classifier employing either wrapper or filtering method in feature selection. In the regarding setting(s), it is examined that the performance is increased by 1.11% compared to the setting where all features are employed in classification. Based on the experimental results, it may be stated that feature selection improves the performance of MWE recognition by eliminating the noisy/non-effective features. Moreover, it is obvious that proposed feature selection method contributes to the overall MWE recognition system by reducing the measurement and storage requirements due to the lower number of features in classification, providing a faster and more -cost effective learning model. (C) 2017 Elsevier Ltd. All rights reserved.en_US
dc.description.sponsorshipTUBITAK - The Scientific and Technological Research Council of Turkey [115E469]en_US
dc.description.sponsorshipThis work is carried under the grant of TUBITAK - The Scientific and Technological Research Council of Turkey to Project No: 115E469, Identification of Multi-word Expressions in Turkish Texts.en_US
dc.language.isoenen_US
dc.publisherPergamon-Elsevier Science Ltden_US
dc.relation.ispartofExpert Systems Wıth Applıcatıonsen_US
dc.rightsinfo:eu-repo/semantics/closedAccessen_US
dc.subjectMultiword expressionen_US
dc.subjectMultiword expression recognitionen_US
dc.subjectLearning algorithmsen_US
dc.subjectFeature selectionen_US
dc.subjectNamed Entity Recognitionen_US
dc.titleFeature selection in multiword expression recognitionen_US
dc.typeArticleen_US
dc.identifier.doi10.1016/j.eswa.2017.09.047-
dc.identifier.scopus2-s2.0-85029703177en_US
dc.departmentİzmir Ekonomi Üniversitesien_US
dc.authorscopusid24471923700-
dc.identifier.volume92en_US
dc.identifier.startpage106en_US
dc.identifier.endpage123en_US
dc.identifier.wosWOS:000414107100009en_US
dc.relation.publicationcategoryMakale - Uluslararası Hakemli Dergi - Kurum Öğretim Elemanıen_US
dc.identifier.scopusqualityQ1-
dc.identifier.wosqualityQ1-
item.grantfulltextreserved-
item.openairetypeArticle-
item.openairecristypehttp://purl.org/coar/resource_type/c_18cf-
item.fulltextWith Fulltext-
item.languageiso639-1en-
item.cerifentitytypePublications-
crisitem.author.dept05.04. Software Engineering-
Appears in Collections:Scopus İndeksli Yayınlar Koleksiyonu / Scopus Indexed Publications Collection
WoS İndeksli Yayınlar Koleksiyonu / WoS Indexed Publications Collection
Files in This Item:
File SizeFormat 
241.pdf
  Restricted Access
1.91 MBAdobe PDFView/Open    Request a copy
Show simple item record



CORE Recommender

SCOPUSTM   
Citations

13
checked on Nov 20, 2024

WEB OF SCIENCETM
Citations

13
checked on Nov 20, 2024

Page view(s)

86
checked on Nov 18, 2024

Download(s)

6
checked on Nov 18, 2024

Google ScholarTM

Check




Altmetric


Items in GCRIS Repository are protected by copyright, with all rights reserved, unless otherwise indicated.