Type
Text
Type
Dissertation
Advisor
Gouma, Perena | Nancy R. Mendell | Haipeng Xing | Eli Hatchwell.
Date
2010-12-01
Keywords
Statistics -- Genetics | beta distribution, case-control study, exome sequencing, false positives, rare variant, whole genome sequencing
Department
Department of Applied Mathematics and Statistics
Language
en_US
Source
This work is sponsored by the Stony Brook University Graduate School in compliance with the requirements for completion of degree.
Identifier
http://hdl.handle.net/11401/72711
Publisher
The Graduate School, Stony Brook University: Stony Brook, NY.
Format
application/pdf
Abstract
Whole genome sequencing and whole exome sequencing are developing techniques to explore the associations between rare variants and complex diseases. The number of variants that are expected to appear in a randomly selected group that do not appear in a different group randomly selected from the same population has unknown mean and variance. Expressions for these quantities are derived here. Numerical values are calculated assuming that the frequency of a rare variant has a beta distribution using parameters estimated for four populations. Extensions to the number of variants that appear in r ( r >1) members of a randomly selected group with none in the comparison group are given. These calculations suggest that a genome wide study of rare variants would generate an extremely large number of false positives. Similarly, an exome wide search would also generate a smaller but still overwhelming number of false positives. A search restricted to variants in a specified gene would not generate excessive numbers of false positives. The expectations using the beta model fit a SNP database well when the underlying beta distribution was restricted to variant frequencies greater than 0.001.
Recommended Citation
Xu, Wenjie, "Distribution of Number of Rare Variants Appearing in Cases but Not Controls in Genome-wide Studies" (2010). Stony Brook Theses and Dissertations Collection, 2006-2020 (closed to submissions). 1914.
https://commons.library.stonybrook.edu/stony-brook-theses-and-dissertations-collection/1914