Stop Word Detection as a Binary Classification Problem

dc.contributor.author Karaoğlan, Bahar
dc.contributor.author Metin, Senem Kumova
dc.date.accessioned 2023-06-16T17:51:44Z
dc.date.available 2023-06-16T17:51:44Z
dc.date.issued 2017
dc.description.abstract In a wide group of languages, the stop words, which have only grammatical roles and not contributing to information content, may be simply exposed by their relatively higher occurrence frequencies. But, in agglutinative or inflectional languages, a stop word may be observed in several different surface forms due to the inflection producing noise. In this study, some of the well-known binary classification methods are employed to overcome the inflectional noise problem in stop word detection. The experiments are conducted on corpora of an agglutinative language, Turkish, in which the amount of inflection is high and a non-agglutinative language, English, in which the inflection is lower for stop words. The evaluations demonstrated that in Turkish corpus, the classification methods improve stop word detection with respect to frequency-based method. On the other hand, the classification methods applied on English corpora showed no improvement in the performance of stop word detection. en_US
dc.identifier.issn 2146-0205
dc.identifier.issn 1302-3160
dc.identifier.uri https://search.trdizin.gov.tr/yayin/detay/245749
dc.identifier.uri https://hdl.handle.net/20.500.14365/4314
dc.identifier.uri https://search.trdizin.gov.tr/en/yayin/detay/245749
dc.language.iso en en_US
dc.relation.ispartof Anadolu Üniversitesi Bilim ve Teknoloji Dergisi :A-Uygulamalı Bilimler ve Mühendislik en_US
dc.rights info:eu-repo/semantics/openAccess en_US
dc.subject Bilgisayar Bilimleri, Yazılım Mühendisliği
dc.subject Bilgi, Belge Yönetimi
dc.subject Matematik
dc.subject İstatistik Ve Olasılık
dc.subject Bilgisayar Bilimleri, Teori Ve Metotlar
dc.subject Dil Ve Dil Bilim
dc.title Stop Word Detection as a Binary Classification Problem en_US
dc.type Article en_US
dspace.entity.type Publication
gdc.author.id 0000-0001-9338-7491
gdc.coar.access open access
gdc.coar.type text::journal::journal article
gdc.description.department İEÜ, Mühendislik Fakültesi, Yazılım Mühendisliği Bölümü en_US
gdc.description.departmenttemp Ege University, International Computer Institute (ICI), 35100 Bornova-İzmir, Turkey Department of Software Engineering, Faculty of Engineering, İzmir University of Economics en_US
gdc.description.endpage 359 en_US
gdc.description.issue 2 en_US
gdc.description.publicationcategory Makale - Ulusal Hakemli Dergi - Kurum Öğretim Elemanı en_US
gdc.description.scopusquality N/A
gdc.description.startpage 346 en_US
gdc.description.volume 18 en_US
gdc.description.wosquality N/A
gdc.identifier.trdizinid 245749
gdc.index.type TR-Dizin
gdc.virtual.author Kumova Metin, Senem
relation.isAuthorOfPublication 81d6fcea-c590-42aa-8443-7459c9eab7fa
relation.isAuthorOfPublication.latestForDiscovery 81d6fcea-c590-42aa-8443-7459c9eab7fa
relation.isOrgUnitOfPublication 805c60d5-b806-4645-8214-dd40524c388f
relation.isOrgUnitOfPublication 26a7372c-1a5e-42d9-90b6-a3f7d14cad44
relation.isOrgUnitOfPublication e9e77e3e-bc94-40a7-9b24-b807b2cd0319
relation.isOrgUnitOfPublication.latestForDiscovery 805c60d5-b806-4645-8214-dd40524c388f

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
3365.pdf
Size:
1.12 MB
Format:
Adobe Portable Document Format