Authors

Lizhen Peng

Type

Text

Type

Dissertation

Advisor

Wang, Xuefeng | Kuan, Pei Fen | Zhu, Wei | Wu, Song | Bahou, Wadie.

Date

2015-05-01

Keywords

Statistics | Integrative Analysis, Statistical Frameworks

Department

Department of Applied Mathematics and Statistics.

Language

en_US

Source

This work is sponsored by the Stony Brook University Graduate School in compliance with the requirements for completion of degree.

Identifier

http://hdl.handle.net/11401/77466

Publisher

The Graduate School, Stony Brook University: Stony Brook, NY.

Format

application/pdf

Abstract

We studied three major interconnected projects focused on establishing statistical frameworks for integrative analysis of genetic data, based on Cox proportional hazard models, kernel Cox regressions, multiple kernel learning models, regularized regression models, and support vector machines regressions, incorporated with dimensionality reduction and feature selections. (1) Our first project employed several machine learning algorithms for clinical predictions utilizing omics data across tumor types, to explore the potential benefits of including genetic measurements with traditional clinical information, for supporting doctor decisions. To predict survival of patients with tumors, our study focused on two objectives. First we applied multivariate Cox proportional hazard (Cox) models with univariate Cox screen or correlation screen, plus L1 penalized log partial likelihood (LASSO) for feature selection. Second, we also examined the factors that could affect prediction of dichotomized survival data by different machine learning algorithms, especially the MKL algorithms for it's capable of data fusion. Our analysis results indicate that incorporating Omics data with clinical information, can significantly improve predictions. Our also provided a well-established frameworks and resources, for reliable prognostic modeling and therapeutic decision making. (2) Our second project involved comprehensively assessing, by using genome-wide DNA methylation data as markers, the contribution of epigenetic effects on asthma and blood related quantitative traits. To evaluate the clinical utility of epigenetic markers, we constructed and compared various prediction models by including top ranked methylation loci from the genome-wide association scan, together with selected sets of known genetic markers from published genome-wide association studies. We observed a significant increase of correlation coefficient between actual and predicted IgE level when methylation markers were included. We also assessed the performance of cross platform prediction using methylation markers. Taken together, results from our assessment suggest that methylation has great potential in prediction of clinical phenotype. (3) Our third project explored kernel Cox regression models to improve the prediction accuracy of patients with metastatic castrate resistant prostate cancer (mCRPC) that treated by docetaxel. We proposed a future direction of utilizing clinical kernels in kernel Cox regression, to potentially obtain better results than linear and Gaussian kernels, with clinical variables only for prognostic modeling. | 92 pages

Share

COinS
 
 

To view the content in your browser, please download Adobe Reader or, alternately,
you may Download the file to your hard drive.

NOTE: The latest versions of Adobe Reader do not support viewing PDF files within Firefox on Mac OS and if you are using a modern (Intel) Mac, there is no official plugin for viewing PDF files within the browser window.