Statistical Methods Applied to Gene Expression Data to Explore Cancer Features

Hannah Simpson-Clancy, Ann Smith

Research output: Contribution to journalArticlepeer-review

Abstract

A small number of epithelial ovarian cancer cases are deemed preventable
and its overall survival rates are low. The developments in omics data analysis paves a way for biomarker discovery for epithelial ovarian cancer in order to improve survival rates and prevent its development. This report provides analysis of gene expression data for epithelial ovarian cancer to compare the gene expression of $99$ epithelial ovarian cancer samples and $4$ non-cancerous ovary samples. Serous and endometrioid epithelial ovarian cancer subtypes were most similar based on hierarchical clustering. Serous was the subtype with the most differentially expressed genes when compared with normal ovary samples whereas mucinous had the least by the Wilcoxon Rank-Sum test (Benjamini Hochberg, $pThe number of down-regulated genes exceeded the number of up-regulated genes when comparing each cancer subtype with normal ovary samples. In this case, the clear cell subtype had the greatest number of dysregulated genes when compared to normal ovaries whereas endometrioid had the least. The dysregulated genes were found by fold change analysis ($FC>2$ or $FCwere suggested due to $11,181$ differentially expressed genes identified when comparing expression levels in all sample groups by the Kruskal-Wallis test (Benjamini Hochberg, $p
Original languageEnglish
Pages (from-to)208-227
Number of pages20
JournalSIAM Undergraduate Research Online
Volume17
Early online date27 Jun 2024
DOIs
Publication statusPublished - 27 Jun 2024

Cite this