Comparison of Regression Methods to Identify Differential Expression in RNA-Sequencing Count Data from the Serial Analysis of Gene Expression

dc.contributor.authorArreola, Ivan
dc.contributor.authorHan, David
dc.date.accessioned2020-06-09T15:31:29Z
dc.date.available2020-06-09T15:31:29Z
dc.date.issued2019
dc.description.abstractComparative RNA-sequencing analysis for the Serial Analysis of Gene Expression (SAGE) can help identify changes in gene expression which are characteristic to human diseases. Since the RNA-sequencing experiment measures gene expressions in the form of counts, usually with a large degree of skewness, the analysis methods based on continuous probability distributions are generally inappropriate for modeling this type of data. Currently, the parametric regression techniques for solving this problem are based on the well-known discrete probability distributions such as Poisson and negative binomial. In order to overcome this modeling challenge with higher flexibilities to account for a wide range of dispersion levels, here we introduce an alternative Generalized Linear Model (GLM) based on the Conway-Maxwell-Poisson distribution, also known as COM-Poisson or CMP distribution. The CMP regression model generalizes the standard Poisson and negative binomial regressions, and it is suitable for fitting count data with varying degrees of over- and under-dispersions. Using simulated and real SAGE datasets, the performance of the proposed method is assessed in comparison to the Poisson- and negative binomial-based regression models.en_US
dc.description.departmentManagement Science and Statistics
dc.identifier.issn2470-3958
dc.identifier.urihttps://hdl.handle.net/20.500.12588/72
dc.language.isoen_USen_US
dc.publisherOffice of the Vice President for Researchen_US
dc.relation.ispartofseriesThe UTSA Journal of Undergraduate Research and Scholarly Work;Volume 5
dc.subjectConway-Maxwell-Poisson Regressionen_US
dc.subjectGeneralized Linear Modelsen_US
dc.subjectRNA-Sequencingen_US
dc.subjectSerial Analysis of Gene Expressionen_US
dc.titleComparison of Regression Methods to Identify Differential Expression in RNA-Sequencing Count Data from the Serial Analysis of Gene Expressionen_US
dc.typeArticleen_US

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
JURSW.5.Arreola.1.pdf
Size:
153.05 KB
Format:
Adobe Portable Document Format
Description:

License bundle

Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.71 KB
Format:
Item-specific license agreed upon to submission
Description: