Gene set correlation enrichment analysis for interpreting and annotating gene expression profiles

Lan Yun Chang, Meng Zhan Lee, Yujia Wu, Wen Kai Lee, Chia Liang Ma, Jun Mao Chang, Ciao Wen Chen, Tzu Chun Huang, Chia Hwa Lee, Jih Chin Lee, Yu Yao Tseng, Chun Yu Lin

Research output: Contribution to journalArticlepeer-review

1 Citation (Scopus)

Abstract

Pathway analysis, including nontopology-based (non-TB) and topology-based (TB) methods, is widely used to interpret the biological phenomena underlying differences in expression data between two phenotypes. By considering dependencies and interactions between genes, TB methods usually perform better than non-TB methods in identifying pathways that include closely relevant or directly causative genes for a given phenotype. However, most TB methods may be limited by incomplete pathway data used as the reference network or by difficulties in selecting appropriate reference networks for different research topics. Here, we propose a gene set correlation enrichment analysis method, Gscore, based on an expression dataset-derived coexpression network to examine whether a differentially expressed gene (DEG) list (or each of its DEGs) is associated with a known gene set. Gscore is better able to identify target pathways in 89 human disease expression datasets than eight other state-of-the-art methods and offers insight into how disease-wide and pathway-wide associations reflect clinical outcomes. When applied to RNA-seq data from COVID-19-related cells and patient samples, Gscore provided a means for studying how DEGs are implicated in COVID-19-related pathways. In summary, Gscore offers a powerful analytical approach for annotating individual DEGs, DEG lists, and genome-wide expression profiles based on existing biological knowledge.

Original languageEnglish
Pages (from-to)E17
JournalNucleic Acids Research
Volume52
Issue number3
DOIs
Publication statusPublished - Feb 9 2024

ASJC Scopus subject areas

  • Genetics

Fingerprint

Dive into the research topics of 'Gene set correlation enrichment analysis for interpreting and annotating gene expression profiles'. Together they form a unique fingerprint.

Cite this