Measuring Collocation Tendency of Words

Metin, Senem Kumova; Karaoglan, Bahar

Please use this identifier to cite or link to this item: https://hdl.handle.net/20.500.14365/1654

Full metadata record

DC Field	Value	Language
dc.contributor.author	Metin, Senem Kumova	-
dc.contributor.author	Karaoglan, Bahar	-
dc.date.accessioned	2023-06-16T14:19:02Z	-
dc.date.available	2023-06-16T14:19:02Z	-
dc.date.issued	2011	-
dc.identifier.issn	0929-6174	-
dc.identifier.issn	1744-5035	-
dc.identifier.uri	https://doi.org/10.1080/09296174.2011.556005	-
dc.identifier.uri	https://hdl.handle.net/20.500.14365/1654	-
dc.description.abstract	In all natural languages, some words collocate with other words to create multi-worded blocks of meaning - the collocations. Since identification of collocations is vital for information retrieval, language learning, psycholinguistics, authorship determination and translation, collocation extraction is an important issue in natural language processing. In this paper we present a method which is designed to improve current statistical methods that generate ranked lists of collocation candidates. Due to meaning integrity, any word in a collocation must suggest or at least imply the subsequent words composing the collocation. As a result, we may state that the words in a random text differ in the tendency to facilitate the prediction of the next word. If a word helps the prediction then it tends to collocate, otherwise it does not. In this paper, an attempt has been made to extract collocations by measuring collocation tendency of words and word combinations. The method used is to filter out free word pairs (the words that do not facilitate the prediction of the next word or those in which meaning integrity has not been completed yet) in the lists of candidate pairs. Collocation tendency method is tested on a base data set extracted by some statistical collocation extraction techniques (frequency of occurrence, point-wise mutual information, the t-test, chi-square techniques) and is evaluated by precision and recall measures. We have found that collocation tendency method brings a remarkable improvement on frequency of occurrence and the t-test techniques.	en_US
dc.language.iso	en	en_US
dc.publisher	Routledge Journals, Taylor & Francis Ltd	en_US
dc.relation.ispartof	Journal of Quantıtatıve Lınguıstıcs	en_US
dc.rights	info:eu-repo/semantics/closedAccess	en_US
dc.title	Measuring Collocation Tendency of Words	en_US
dc.type	Article	en_US
dc.identifier.doi	10.1080/09296174.2011.556005	-
dc.identifier.scopus	2-s2.0-79957991723	-
dc.department	İzmir Ekonomi Üniversitesi	en_US
dc.authorscopusid	24471923700	-
dc.authorscopusid	22334152300	-
dc.identifier.volume	18	en_US
dc.identifier.issue	2	en_US
dc.identifier.startpage	174	en_US
dc.identifier.endpage	187	en_US
dc.identifier.wos	WOS:000295585600003	-
dc.relation.publicationcategory	Makale - Uluslararası Hakemli Dergi - Kurum Öğretim Elemanı	en_US
dc.identifier.scopusquality	Q1	-
dc.identifier.wosquality	Q2	-
item.cerifentitytype	Publications	-
item.openairetype	Article	-
item.languageiso639-1	en	-
item.fulltext	With Fulltext	-
item.openairecristype	http://purl.org/coar/resource_type/c_18cf	-
item.grantfulltext	reserved	-
crisitem.author.dept	05.04. Software Engineering	-
Appears in Collections:	Scopus İndeksli Yayınlar Koleksiyonu / Scopus Indexed Publications Collection WoS İndeksli Yayınlar Koleksiyonu / WoS Indexed Publications Collection

Files in This Item:

File	Size	Format
1654.pdf Restricted Access	382.32 kB	Adobe PDF	View/Open

Show simple item record

CORE Recommender

SCOPUS^TM
Citations

6

checked on Oct 29, 2025

WEB OF SCIENCE^TM
Citations

3

checked on Oct 29, 2025

Page view(s)

372

checked on Oct 27, 2025

Download(s)

14

checked on Oct 27, 2025

Google Scholar^TM

Check

Files in This Item:

SCOPUSTM Citations

WEB OF SCIENCETM Citations

Page view(s)

Download(s)

Google ScholarTM

Altmetric

SCOPUS^TM
Citations

WEB OF SCIENCE^TM
Citations

Google Scholar^TM