A New Local Covariance Matrix Estimation for the Classification of Gene Expression Profiles in High Dimensional Rna-Seq Data

Loading...
Publication Logo

Date

2021

Authors

Kochan, Necla
Tütüncü, Gözde Yazgı

Journal Title

Journal ISSN

Volume Title

Publisher

Pergamon-Elsevier Science Ltd

Open Access Color

HYBRID

Green Open Access

Yes

OpenAIRE Downloads

OpenAIRE Views

Publicly Funded

No
Impulse
Top 10%
Influence
Average
Popularity
Top 10%

Research Projects

Journal Issue

Abstract

Recent developments in the next-generation sequencing based on RNA-sequencing (RNA-Seq) allow researchers to measure the expression levels of thousands of genes for multiple samples simultaneously. In order to analyze these kinds of data sets, many classification models have been proposed in the literature. Most of the existing classifiers assume that genes are independent; however, this is not a realistic approach for real RNA-Seq classification problems. For this reason, some other classification methods, which incorporates the dependence structure between genes into a model, are proposed. Quantile transformed Quadratic Discriminant Analysis (qtQDA) proposed recently is one of those classifiers, which estimates covariance matrix by Maximum Likelihood Estimator. However, MLE may not reflect the real dependence between genes. For this reason, we propose a new approach based on local dependence function to estimate the covariance matrix to be used in the qtQDA classification model. This new approach assumes the dependencies between genes are locally defined rather than complete dependency. The performances of qtQDA classifier based on two different covariance matrix estimates are compared over two real RNA-Seq data sets, in terms of classification error rates. The results show that using local dependence function approach yields a better estimate of covariance matrix and increases the performance of qtQDA classifier.

Description

Keywords

RNA-seq, Gene expression, Local Covariance matrix, Classification, Quadratic Discriminant Analysis, Logistic-Regression

Fields of Science

0301 basic medicine, 0303 health sciences, 03 medical and health sciences

Citation

WoS Q

Q1

Scopus Q

Q1
OpenCitations Logo
OpenCitations Citation Count
7

Source

Expert Systems Wıth Applıcatıons

Volume

167

Issue

Start Page

End Page

PlumX Metrics
Citations

CrossRef : 8

Scopus : 10

Captures

Mendeley Readers : 8

SCOPUS™ Citations

10

checked on Mar 06, 2026

Web of Science™ Citations

6

checked on Mar 06, 2026

Page Views

4

checked on Mar 06, 2026

Downloads

23

checked on Mar 06, 2026

Google Scholar Logo
Google Scholar™
OpenAlex Logo
OpenAlex FWCI
0.58

Sustainable Development Goals

SDG data is not available