Please use this identifier to cite or link to this item: https://hdl.handle.net/20.500.14365/3148
Full metadata record
DC Field | Value | Language
dc.contributor.author | Metin, Senem Kumova | -
dc.date.accessioned | 2023-06-16T14:55:18Z | -
dc.date.available | 2023-06-16T14:55:18Z | -
dc.date.issued | 2016 | -
dc.identifier.issn | 0267-6192 | -
dc.identifier.uri | https://hdl.handle.net/20.500.14365/3148 | -
dc.description.abstract | In all natural languages, some recurrent combinations of words form multiword expressions (MWEs) because of the strong cohesive ties between their component words. The extraction of MWEs from text plays an important role in natural language processing applications and information retrieval. In this study, we introduce a method of MWE extraction that ranks candidates by the weakness of the outer ties between each candidate and its neighbouring words in the text. The method defines a measure of the weakness of outer ties, based on the degree of unpredictability of the surrounding words, in order to distinguish MWEs from other recurrent groups of consecutive words. Put simply, if the words preceding and following an MWE candidate are unpredictable because the candidate has a relatively large number of different neighbouring words, the candidate is considered to have strong evidence of being a real MWE. The method generates a single normalized score of unpredictability, which enables not only the comparison of MWE candidates with different occurrence frequencies but also the comparison of MWE candidates with different numbers of component words (such as the comparison of two-word candidates with three-word candidates). Comparisons with different groups of well-known methods (statistical measures of association and termhood, vector space models of composition, and supervised learning methods) illustrate the effectiveness of the proposed method on two-word MWE candidates and on the merged set of two- and three-word candidates. | en_US
dc.language.iso | en | en_US
dc.publisher | C R L Publishing Ltd | en_US
dc.relation.ispartof | Computer Systems Science and Engineering | en_US
dc.rights | info:eu-repo/semantics/closedAccess | en_US
dc.subject | Multiword expression | en_US
dc.subject | predictability | en_US
dc.subject | association measures | en_US
dc.subject | term hood measures | en_US
dc.subject | compositionality | en_US
dc.subject | supervised learning | en_US
dc.subject | Automatic Extraction | en_US
dc.title | Neighbour unpredictability measure in multiword expression extraction | en_US
dc.type | Article | en_US
dc.identifier.scopus | 2-s2.0-84991725989 | en_US
dc.department | İzmir Ekonomi Üniversitesi | en_US
dc.identifier.volume | 31 | en_US
dc.identifier.issue | 3 | en_US
dc.identifier.startpage | 209 | en_US
dc.identifier.endpage | 221 | en_US
dc.identifier.wos | WOS:000383926100003 | en_US
dc.relation.publicationcategory | Article - International Refereed Journal - Institutional Faculty Member | en_US
dc.identifier.scopusquality | Q2 | -
item.grantfulltext | reserved | -
item.openairetype | Article | -
item.openairecristype | http://purl.org/coar/resource_type/c_18cf | -
item.fulltext | With Fulltext | -
item.languageiso639-1 | en | -
item.cerifentitytype | Publications | -
crisitem.author.dept | 05.04. Software Engineering | -
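
Illustrative sketch (not part of the record): the abstract above describes ranking MWE candidates by how unpredictable their neighbouring words are, but it does not give the formula. The Python below is only a rough illustration of that idea under stated assumptions. The function name neighbour_unpredictability, the min_count parameter, and the distinct-neighbour ratio used as the score are all hypothetical choices made here and should not be read as the measure defined in the article.

```python
from collections import defaultdict


def neighbour_unpredictability(tokens, n, min_count=2):
    """Score each n-gram by the variety of its left and right neighbours.

    ASSUMPTION: the score is the number of distinct left neighbours plus the
    number of distinct right neighbours, divided by twice the n-gram's
    frequency. This is a stand-in chosen for illustration, not the
    article's actual measure. It always lies in [0, 1].
    """
    left = defaultdict(set)    # n-gram -> distinct preceding words
    right = defaultdict(set)   # n-gram -> distinct following words
    freq = defaultdict(int)    # n-gram -> occurrence count

    for i in range(len(tokens) - n + 1):
        gram = tuple(tokens[i:i + n])
        freq[gram] += 1
        if i > 0:
            left[gram].add(tokens[i - 1])
        if i + n < len(tokens):
            right[gram].add(tokens[i + n])

    scores = {}
    for gram, count in freq.items():
        if count < min_count:
            continue  # skip n-grams that are not recurrent
        scores[gram] = (len(left[gram]) + len(right[gram])) / (2 * count)
    return scores


tokens = ("we saw new york city today they love new york city now "
          "in new york city again").split()
two_word = neighbour_unpredictability(tokens, 2)
three_word = neighbour_unpredictability(tokens, 3)
merged = {**two_word, **three_word}
best = max(merged, key=merged.get)
```

In this toy text, "new york" is always followed by "city" and "york city" is always preceded by "new", so both two-word fragments score lower than the three-word candidate "new york city", whose neighbours differ at every occurrence. Because the score is normalized by frequency, two- and three-word candidates can be ranked in one merged list, which is the kind of comparison the abstract mentions.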
Appears in Collections:
Scopus İndeksli Yayınlar Koleksiyonu / Scopus Indexed Publications Collection
WoS İndeksli Yayınlar Koleksiyonu / WoS Indexed Publications Collection
Files in This Item:
File | Size | Format
2277.pdf (Restricted Access) | 513.14 kB | Adobe PDF

Scopus citations: 6 (checked on Nov 20, 2024)
Web of Science citations: 5 (checked on Nov 20, 2024)
Page views: 86 (checked on Nov 18, 2024)
Downloads: 6 (checked on Nov 18, 2024)

Items in GCRIS Repository are protected by copyright, with all rights reserved, unless otherwise indicated.