Authors

Loretta Au

Type

Text

Type

Dissertation

Advisor

Green, David F | MacCarthy, Thomas | Rizzo, Robert C | Glynn, Steven.

Date

2015-08-01

Keywords

computational chemistry, large-scale mutagenesis, protein evolvability | Applied mathematics

Department

Department of Applied Mathematics and Statistics.

Language

en_US

Source

This work is sponsored by the Stony Brook University Graduate School in compliance with the requirements for completion of degree.

Identifier

http://hdl.handle.net/11401/77510

Publisher

The Graduate School, Stony Brook University: Stony Brook, NY.

Format

application/pdf

Abstract

Proteins from the same family often share many structural and functional motifs. Variations in primary structure, however, allow a single protein family to modulate a broad range of biological processes. Several methods can be used to identify sequence conservation, but complementary mutagenesis experiments are often needed to understand the multiple roles that a given amino acid may have in maintaining overall fitness. Limitations related to financial and temporal costs generally constrain these experiments to smaller proteins or peptides, or to only partial sampling of sequence spaces, and we have devised a computational protocol for large-scale mutagenesis to circumvent these obstacles using a G-protein heterotrimer as a model system. The dead-end elimination and A* search (DEE/A*) algorithms are typically used to find a small number of sequences that may enhance the current level of fitness or introduce novel functions into a protein. These algorithms were adapted to find all low-energy sequences and their corresponding structures, allowing us to disentangle protein fitness, defined here as a combination of structural stability and binding interactions. We demonstrate the effectiveness of DEE/A* in capturing the biophysical features of amino-acid substitutions, and in quantifying the extent that individual positions are affected by mutagenesis, based on all low-energy single mutants. A modified version of this protocol was also used to explore double and triple mutants in the context of the β subunit in the G-protein heterotrimer, a representative repeat protein from the β -propeller family. Repeat proteins are subject to the same experimental challenges as non-repeating ones, but with the addition of having to properly define corresponding repeats in primary structure; DEE/A* is leveraged here to identify patterns of important interaction using a structure-based approach, and reveal a deeply connected interaction motif. | 153 pages

Share

COinS
 
 

To view the content in your browser, please download Adobe Reader or, alternately,
you may Download the file to your hard drive.

NOTE: The latest versions of Adobe Reader do not support viewing PDF files within Firefox on Mac OS and if you are using a modern (Intel) Mac, there is no official plugin for viewing PDF files within the browser window.