Please use this identifier to cite or link to this item: https://hdl.handle.net/20.500.14365/1654
Full metadata record
DC FieldValueLanguage
dc.contributor.authorMetin, Senem Kumova-
dc.contributor.authorKaraoglan, Bahar-
dc.date.accessioned2023-06-16T14:19:02Z-
dc.date.available2023-06-16T14:19:02Z-
dc.date.issued2011-
dc.identifier.issn0929-6174-
dc.identifier.issn1744-5035-
dc.identifier.urihttps://doi.org/10.1080/09296174.2011.556005-
dc.identifier.urihttps://hdl.handle.net/20.500.14365/1654-
dc.description.abstractIn all natural languages, some words collocate with other words to create multi-worded blocks of meaning - the collocations. Since identification of collocations is vital for information retrieval, language learning, psycholinguistics, authorship determination and translation, collocation extraction is an important issue in natural language processing. In this paper we present a method which is designed to improve current statistical methods that generate ranked lists of collocation candidates. Due to meaning integrity, any word in a collocation must suggest or at least imply the subsequent words composing the collocation. As a result, we may state that the words in a random text differ in the tendency to facilitate the prediction of the next word. If a word helps the prediction then it tends to collocate, otherwise it does not. In this paper, an attempt has been made to extract collocations by measuring collocation tendency of words and word combinations. The method used is to filter out free word pairs (the words that do not facilitate the prediction of the next word or those in which meaning integrity has not been completed yet) in the lists of candidate pairs. Collocation tendency method is tested on a base data set extracted by some statistical collocation extraction techniques (frequency of occurrence, point-wise mutual information, the t-test, chi-square techniques) and is evaluated by precision and recall measures. We have found that collocation tendency method brings a remarkable improvement on frequency of occurrence and the t-test techniques.en_US
dc.language.isoenen_US
dc.publisherRoutledge Journals, Taylor & Francis Ltden_US
dc.relation.ispartofJournal of Quantıtatıve Lınguıstıcsen_US
dc.rightsinfo:eu-repo/semantics/closedAccessen_US
dc.titleMeasuring Collocation Tendency of Wordsen_US
dc.typeArticleen_US
dc.identifier.doi10.1080/09296174.2011.556005-
dc.identifier.scopus2-s2.0-79957991723en_US
dc.departmentİzmir Ekonomi Üniversitesien_US
dc.authorscopusid24471923700-
dc.authorscopusid22334152300-
dc.identifier.volume18en_US
dc.identifier.issue2en_US
dc.identifier.startpage174en_US
dc.identifier.endpage187en_US
dc.identifier.wosWOS:000295585600003en_US
dc.relation.publicationcategoryMakale - Uluslararası Hakemli Dergi - Kurum Öğretim Elemanıen_US
dc.identifier.scopusqualityQ1-
dc.identifier.wosqualityQ2-
item.grantfulltextreserved-
item.openairecristypehttp://purl.org/coar/resource_type/c_18cf-
item.cerifentitytypePublications-
item.openairetypeArticle-
item.fulltextWith Fulltext-
item.languageiso639-1en-
crisitem.author.dept05.04. Software Engineering-
Appears in Collections:Scopus İndeksli Yayınlar Koleksiyonu / Scopus Indexed Publications Collection
WoS İndeksli Yayınlar Koleksiyonu / WoS Indexed Publications Collection
Files in This Item:
File SizeFormat 
1654.pdf
  Restricted Access
382.32 kBAdobe PDFView/Open    Request a copy
Show simple item record



CORE Recommender

SCOPUSTM   
Citations

6
checked on Sep 25, 2024

WEB OF SCIENCETM
Citations

3
checked on Sep 25, 2024

Page view(s)

76
checked on Sep 30, 2024

Download(s)

8
checked on Sep 30, 2024

Google ScholarTM

Check




Altmetric


Items in GCRIS Repository are protected by copyright, with all rights reserved, unless otherwise indicated.