Feature Selection in Multiword Expression Recognition

Loading...
Publication Logo

Date

2018

Authors

Metin, Senem Kumova

Journal Title

Journal ISSN

Volume Title

Publisher

Pergamon-Elsevier Science Ltd

Open Access Color

Green Open Access

Yes

OpenAIRE Downloads

2

OpenAIRE Views

5

Publicly Funded

No
Impulse
Top 10%
Influence
Top 10%
Popularity
Top 10%

Research Projects

Journal Issue

Abstract

In multiword expression (MWE) recognition, there exist many studies where different learning methods are employed to decide whether given word combination is a multiword expression. The recognition methods commonly utilize a number of features that are extracted from a data source, frequently from the given text. Though the recognition methods and the features are well studied, we believe that to achieve the best possible performance with a learning method, different subsets of features should also be considered and the best performing subset must be selected. In this paper, we propose a procedure that covers the performance comparison of well-known feature selection methods to obtain the best feature subset in MWE recognition. The evaluation tests are performed on a Turkish MWE data set and the performance is measured by precision, recall and Fl values. The highest Fl value =0.731 is obtained by C4.5 classifier employing either wrapper or filtering method in feature selection. In the regarding setting(s), it is examined that the performance is increased by 1.11% compared to the setting where all features are employed in classification. Based on the experimental results, it may be stated that feature selection improves the performance of MWE recognition by eliminating the noisy/non-effective features. Moreover, it is obvious that proposed feature selection method contributes to the overall MWE recognition system by reducing the measurement and storage requirements due to the lower number of features in classification, providing a faster and more -cost effective learning model. (C) 2017 Elsevier Ltd. All rights reserved.

Description

Keywords

Multiword expression, Multiword expression recognition, Learning algorithms, Feature selection, Named Entity Recognition

Fields of Science

0202 electrical engineering, electronic engineering, information engineering, 02 engineering and technology

Citation

WoS Q

Q1

Scopus Q

Q1
OpenCitations Logo
OpenCitations Citation Count
11

Source

Expert Systems Wıth Applıcatıons

Volume

92

Issue

Start Page

106

End Page

123
PlumX Metrics
Citations

CrossRef : 11

Scopus : 14

Captures

Mendeley Readers : 39

SCOPUS™ Citations

14

checked on Mar 15, 2026

Web of Science™ Citations

14

checked on Mar 15, 2026

Page Views

5

checked on Mar 15, 2026

Google Scholar Logo
Google Scholar™
OpenAlex Logo
OpenAlex FWCI
1.3652

Sustainable Development Goals

SDG data is not available