Measuring Collocation Tendency of Words


Date

2011

Authors

Metin, Senem Kumova

Publisher

Routledge Journals, Taylor & Francis Ltd

Green Open Access

No

Publicly Funded

No

Impulse

Average

Influence

Average

Popularity

Average

Abstract

In all natural languages, some words collocate with other words to form multi-word units of meaning: collocations. Because identifying collocations is vital for information retrieval, language learning, psycholinguistics, authorship determination and translation, collocation extraction is an important problem in natural language processing. In this paper we present a method designed to improve current statistical methods that generate ranked lists of collocation candidates. Because a collocation has meaning integrity, each word in it must suggest, or at least imply, the subsequent words that complete the collocation. As a result, the words in an arbitrary text differ in how much they facilitate prediction of the next word: a word that helps the prediction tends to collocate, and a word that does not help tends not to. We extract collocations by measuring the collocation tendency of words and word combinations. The method filters free word pairs out of the lists of candidate pairs, that is, pairs whose words do not facilitate prediction of the next word or in which meaning integrity has not yet been completed. The collocation tendency method is tested on a base data set extracted by several statistical collocation extraction techniques (frequency of occurrence, pointwise mutual information, the t-test and chi-square) and is evaluated by precision and recall. We found that the collocation tendency method yields a remarkable improvement over the frequency of occurrence and t-test techniques.
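
As a rough illustration of the baseline measures named in the abstract, the sketch below scores adjacent word pairs by frequency of occurrence, pointwise mutual information and a t-score, and computes precision and recall for the top-k candidates. It is not the collocation tendency method itself, which the abstract does not specify in enough detail to reproduce. The function names `bigram_scores` and `precision_recall`, the toy sentence, the gold set, and the variance approximation used in the t-score (sample variance taken as roughly equal to the observed bigram probability) are all assumptions made for this example.

```python
import math
from collections import Counter


def bigram_scores(tokens):
    """Score every adjacent word pair in `tokens`.

    Returns {(w1, w2): (frequency, pmi, t_score)}. Hypothetical helper,
    not taken from the paper.
    """
    total = len(tokens)      # number of word positions
    n = total - 1            # number of bigram positions
    unigrams = Counter(tokens)
    bigrams = Counter(zip(tokens, tokens[1:]))
    scores = {}
    for (w1, w2), f12 in bigrams.items():
        p1 = unigrams[w1] / total
        p2 = unigrams[w2] / total
        p12 = f12 / n
        # Pointwise mutual information: log2 of observed vs. expected probability.
        pmi = math.log2(p12 / (p1 * p2))
        # t-score under independence, assuming variance is approximately p12.
        t_score = (p12 - p1 * p2) / math.sqrt(p12 / n)
        scores[(w1, w2)] = (f12, pmi, t_score)
    return scores


def precision_recall(ranked_pairs, gold, k):
    """Precision and recall of the top-k ranked pairs against a gold set."""
    hits = sum(1 for pair in ranked_pairs[:k] if pair in gold)
    return hits / k, hits / len(gold)


# Toy data, invented for illustration only.
tokens = "new york is big and new york is old".split()
scores = bigram_scores(tokens)
# Rank candidates by frequency of occurrence (index 0 of each score tuple).
ranked = sorted(scores, key=lambda pair: -scores[pair][0])
precision, recall = precision_recall(ranked, {("new", "york")}, k=2)
```

Ranking by a different measure only requires changing the index in the sort key (1 for pointwise mutual information, 2 for the t-score). A filtering step such as the one the abstract describes would then operate on the ranked list before evaluation.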

Fields of Science

0202 electrical engineering, electronic engineering, information engineering, 02 engineering and technology

WoS Q

Q1

Scopus Q

Q1

OpenCitations Citation Count

3

Source

Journal of Quantitative Linguistics

Volume

18

Issue

2

Start Page

174

End Page

187
PlumX Metrics
Citations

CrossRef : 3

Scopus : 6

Captures

Mendeley Readers : 32

SCOPUS™ Citations

6

checked on Mar 25, 2026

Web of Science™ Citations

3

checked on Mar 25, 2026

Page Views

6

checked on Mar 25, 2026

OpenAlex FWCI
0.4276