Identifying Collocations in Turkish Using Statistical Methods

dc.contributor.author Metin, S.K.
dc.contributor.author Karaoğlan, B.
dc.date.accessioned 2023-06-16T18:52:05Z
dc.date.available 2023-06-16T18:52:05Z
dc.date.issued 2016
dc.description.abstract A collocation is a combination of words that appear together more often than chance would predict and that together form a unit of meaning. Because collocation extraction benefits the automatic processing and translation of Turkish texts as well as the learning of Turkish, it is an important problem in Turkish natural language processing. In this study, several statistical techniques, including occurrence frequency, pointwise mutual information and hypothesis tests, are applied to a Turkey Turkish corpus to identify collocations automatically. We use both stemmed and surface forms of words to explore the effect of stemming on collocation extraction. The techniques are evaluated using the F-measure. The chi-square hypothesis test and pointwise mutual information produced better results than the other methods. In addition, we observed that when words are stemmed, the methods that can be considered successful in collocation extraction can be more clearly distinguished from the others. © 2016, Ahmet Yesevi University. All rights reserved. en_US
dc.identifier.issn 1301-0549
dc.identifier.scopus 2-s2.0-84982952392
dc.identifier.uri https://hdl.handle.net/20.500.14365/4562
dc.language.iso tr en_US
dc.publisher Ahmet Yesevi University en_US
dc.relation.ispartof Bilig en_US
dc.rights info:eu-repo/semantics/closedAccess en_US
dc.subject Collocation en_US
dc.subject Corpus en_US
dc.subject Natural language processing en_US
dc.subject Turkey Turkish en_US
dc.title Identifying Collocations in Turkish Using Statistical Methods en_US
dc.title.alternative Türkiye Türkçesinde Eşdizimlerin İstatistiksel Yöntemlerle Belirlenmesi en_US
dc.type Article en_US
dspace.entity.type Publication
gdc.author.scopusid 24471923700
gdc.coar.access metadata only access
gdc.coar.type text::journal::journal article
gdc.description.departmenttemp Metin, S.K., İzmir University of Economics, Faculty of Engineering and Computer Science, Department of Software Engineering, İzmir, Turkey; Karaoğlan, B., Ege University, International Computer Institute, İzmir, Turkey en_US
gdc.description.endpage 286 en_US
gdc.description.publicationcategory Article - International Refereed Journal - Institutional Faculty Member en_US
gdc.description.scopusquality Q4
gdc.description.startpage 253 en_US
gdc.description.volume 78 en_US
gdc.description.wosquality Q3
gdc.identifier.wos WOS:000390423500010
gdc.index.type WoS
gdc.index.type Scopus
gdc.scopus.citedcount 6
gdc.virtual.author Kumova Metin, Senem
gdc.wos.citedcount 3
relation.isAuthorOfPublication 81d6fcea-c590-42aa-8443-7459c9eab7fa
relation.isAuthorOfPublication.latestForDiscovery 81d6fcea-c590-42aa-8443-7459c9eab7fa
relation.isOrgUnitOfPublication 805c60d5-b806-4645-8214-dd40524c388f
relation.isOrgUnitOfPublication 26a7372c-1a5e-42d9-90b6-a3f7d14cad44
relation.isOrgUnitOfPublication e9e77e3e-bc94-40a7-9b24-b807b2cd0319
relation.isOrgUnitOfPublication.latestForDiscovery 805c60d5-b806-4645-8214-dd40524c388f
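
The abstract names occurrence frequency, pointwise mutual information, the chi-square test and the F-measure but gives no formulas. The following is a minimal sketch of the standard textbook definitions of these measures for adjacent word pairs, not the authors' implementation. The function names bigram_scores and f_measure, the restriction to adjacent bigrams, and the absence of any stemming or frequency threshold are all assumptions made for illustration.

```python
import math
from collections import Counter


def bigram_scores(tokens):
    """Map each adjacent word pair to (frequency, PMI, chi-square).

    Hypothetical helper for illustration; uses the standard 2x2
    contingency-table formulas, not the paper's exact setup.
    """
    n = len(tokens) - 1  # number of adjacent bigram positions
    if n < 1:
        return {}
    bigrams = Counter(zip(tokens, tokens[1:]))
    first = Counter(tokens[:-1])   # how often a word opens a bigram
    second = Counter(tokens[1:])   # how often a word closes a bigram
    scores = {}
    for (w1, w2), o11 in bigrams.items():
        c1, c2 = first[w1], second[w2]
        # PMI: log2 of the observed joint probability over the
        # probability expected if w1 and w2 were independent.
        pmi = math.log2(o11 * n / (c1 * c2))
        # Remaining cells of the 2x2 contingency table.
        o12 = c1 - o11            # w1 followed by something other than w2
        o21 = c2 - o11            # w2 preceded by something other than w1
        o22 = n - c1 - c2 + o11   # neither w1 first nor w2 second
        denom = c1 * c2 * (n - c1) * (n - c2)
        chi2 = n * (o11 * o22 - o12 * o21) ** 2 / denom if denom else 0.0
        scores[(w1, w2)] = (o11, pmi, chi2)
    return scores


def f_measure(predicted, gold):
    """Balanced F-measure (F1) of a predicted set against a gold set."""
    tp = len(predicted & gold)
    if tp == 0:
        return 0.0
    precision = tp / len(predicted)
    recall = tp / len(gold)
    return 2 * precision * recall / (precision + recall)
```

In a setup like the one the abstract describes, candidate pairs would be ranked by one of these scores, the top-ranked pairs taken as predicted collocations, and f_measure used to compare them against a manually labelled set. Passing stemmed tokens instead of surface forms would be one way to examine the effect of stemming.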

Files

Original bundle

Name: 4562.pdf
Size: 930.97 KB
Format: Adobe Portable Document Format