Binary Patterning, Synthetic Biology, & The Hecht Lab at Princeton University
From Proteopedia
Modern proteomes found in nature have been molded by billions of years of evolution in response to biological and environmental selective pressures. While these evolutionary forces have created modern proteins with exquisite specificity and functionality, the majority of sequence space was likely never sampled. This leaves open the possibility that protein structures never sampled by evolution could be more efficacious than their modern analogues. Unfortunately, such versions are unlikely ever to arise in nature, as evolution long ago set out on an irrevocable path toward modern proteomes. A proteome unbiased by evolutionary forces would be an extremely useful tool for examining sequence space unexplored by evolution. Out of such exploration, a better understanding of the true potential of sequence space could arise and provide insight into the "logic" of the evolutionary forces that created modern proteomes.
It has been proposed that primitive enzymes were promiscuous, possessing a broad range of specificities for substrates. [1] This would have allowed early organisms to perform the plethora of catalytic functions necessary for survival with a smaller repertoire of proteins, albeit at slower rates than modern proteomes achieve. This hypothesis, put forward by Jensen in 1976, is nearly impossible to test directly, as evolution has already biased every protein found in nature. In order to test Jensen’s hypothesis and to begin exploring the vast expanse of sequence space overlooked by evolution, researchers such as Prof. Michael Hecht and colleagues have developed libraries of de novo proteins, unbiased by evolutionary artifacts. [2]
Binary Patterning
An ideal unbiased library would be created via purely stochastic combinatorial methods. However, most of sequence space does not fold into protein-like tertiary structures and instead tends to aggregate, so purely stochastic methods do not work. [3] Instead, methods were developed to direct sequences toward those that assume stable tertiary structures. One such method is called "binary patterning," and its use to develop vast, unbiased protein libraries is a primary focus of the Hecht Lab at Princeton University. Binary patterning is guided by two fundamental themes: (1) natural protein structures are predominantly composed of secondary structure, and (2) polar side chains are typically exposed on the surface while hydrophobic residues are buried. [3] The method relies on the premise that exact residue identity is less important than the overall placement of polar and non-polar residues. Sequences that comply with these rules can be designed by constraining the periodicity of polar and non-polar residues to match the typical periodicity of the desired secondary structure. [4] In the case of an alpha-helical design, a non-polar amino acid must appear at every 3rd or 4th position to approximate the 3.6 residues/turn found in alpha-helices. A helix that follows this design will have one polar face and one non-polar face (See Diagram). [3] Through this method, a combinatorial library can be directed toward a region of sequence space that contains a higher proportion of folded structures, while diversity is maintained by specifying only the polarity of each residue, not its identity. [3]
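The periodicity argument above can be illustrated with a minimal Python sketch. It is not the Hecht Lab's actual design procedure: the 140-degree non-polar face (`FACE_HALF_WIDTH`), the residue alphabets `POLAR` and `NONPOLAR`, and the function names are all assumptions made for illustration. The only value taken from the text is the 3.6 residues/turn periodicity, which gives 100 degrees of rotation per residue.

```python
import random

DEGREES_PER_RESIDUE = 100  # 360 degrees / 3.6 residues per turn
FACE_HALF_WIDTH = 70       # assumed half-width (degrees) of the non-polar face

# Assumed residue alphabets; the article specifies only polar vs. non-polar.
POLAR = "KHEQDN"
NONPOLAR = "MLIVF"


def helix_pattern(length):
    """Return a binary pattern: 'N' = non-polar position, 'P' = polar position."""
    pattern = []
    for i in range(length):
        angle = (i * DEGREES_PER_RESIDUE) % 360
        # Angular distance from the centre of the non-polar face (0 degrees).
        distance = min(angle, 360 - angle)
        pattern.append("N" if distance <= FACE_HALF_WIDTH else "P")
    return "".join(pattern)


def sample_sequence(pattern, rng):
    """Fill a binary pattern with residues chosen at random from each class."""
    return "".join(rng.choice(NONPOLAR if p == "N" else POLAR) for p in pattern)


pattern = helix_pattern(14)
rng = random.Random(0)  # fixed seed so the sketch is reproducible
library = [sample_sequence(pattern, rng) for _ in range(5)]

print(pattern)
for seq in library:
    print(seq)
```

Every sampled sequence shares the same polar/non-polar pattern but differs in residue identity, which is the sense in which a binary-patterned library stays diverse while being biased toward amphiphilic helices.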
The Libraries
The First Generation Library
The Hecht Lab developed the first generation library using a degenerate DNA codon system to create a 74-residue template designed to form a 4-helix bundle structure. [3] While NMR screening of the first generation library identified several proteins that exhibited native-like structures, most proteins in the first generation formed fluctuating molten globule structures. [5] [6]
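How a degenerate codon restricts a position to one polarity class can be sketched by expanding the codon against the standard genetic code. The article does not say which degenerate codons were used; `NTN` (non-polar) and `VAN` (polar) below are assumptions chosen for illustration, and the helper `encoded_residues` is hypothetical. The codon table is the standard genetic code, and `N` and `V` are the usual IUPAC ambiguity codes.

```python
from itertools import product

# Standard genetic code, with bases ordered T, C, A, G at each codon position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# IUPAC ambiguity codes needed for this sketch.
IUPAC = {"A": "A", "C": "C", "G": "G", "T": "T", "N": "ACGT", "V": "ACG"}


def encoded_residues(degenerate_codon):
    """Return the set of residues ('*' = stop) a degenerate codon can encode."""
    options = [IUPAC[base] for base in degenerate_codon]
    return {CODON_TABLE["".join(codon)] for codon in product(*options)}


# Assumed codons, used only for illustration.
print("NTN ->", sorted(encoded_residues("NTN")))
print("VAN ->", sorted(encoded_residues("VAN")))
print("NAN ->", sorted(encoded_residues("NAN")))
```

Under these assumptions, a central T yields only hydrophobic residues and a central A yields only polar ones. Excluding T from the first base of the polar codon (`VAN` rather than `NAN`) removes the stop codons and tyrosine from the encoded set.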
The Second Generation Library
It was predicted that the predominant reason the first generation library did not form well-folded proteins, as would be predicted by numerous other studies [7] [8], was that the helices were not long enough. Longer helices give rise to larger interhelical interfaces, which add stability through additional van der Waals interactions. [9]
The solved structure of S-824 revealed that the protein is . Since 86% (88 of 102) of the residues in this protein were not explicitly specified, the library contains significant diversity. The are in the chain termini and interhelical turns. [3] Over the entire library, a significant percentage (~80%) of proteins are well ordered, [11] validating the binary patterning method. A second well-folded structure, S-836, was solved in 2008, supporting the conclusion that S-824 is a reasonable representative of the library as a whole.
Structure of S-836
S-836 is a left-turning 4-helix bundle. [2] Since the identity of residues was not specified beyond polarity restrictions, a diversity of hydrophobic packing is expected. In S-836, . [2] That being said, likely due to their (i.e., residues Trp23, Phe47, Phe64, and Phe93). The hydrophobic core . It is believed that the . This assertion is supported by the fact that all well-folded proteins characterized had a bulky tryptophan side chain at this position, while molten globule library members did not. [2] While stable, S-836 , including residues 14, 23, 24, 34, 71, 74, 75, 76, 77, 87, and 100. Although internal cavities that lead to dynamic conformational mobility are not ideal for evolved, precise proteins, these properties are advantageous for primordial proteins, providing the catalytic versatility necessary for early organisms functioning with a minimal enzyme repertoire. [2] In fact, several of the second generation proteins screened exhibited promiscuous enzymatic activity, including esterase, lipase, and peroxidase activities when bound to heme. [3]
The Third Generation Library
Since the second generation library was small and of limited diversity due to its single starting point (Sequence #86), a third generation library was created. This library , but did nor the . [10] The third generation library was considerably more diverse, containing ~10^6 sequences. Heme binding assays determined that nearly 66% of the third generation library binds heme (nearly 100% of those proteins that express well), with over half of these binding heme at high levels. The catalytic ability of library proteins was assessed both in the presence and in the absence of heme. Nearly 80% of the proteins that bound heme exhibited peroxidase activity (up to 10^6-fold faster than the uncatalyzed reaction), ~60% exhibited hydrolase activity (~10^3-fold faster than the uncatalyzed reaction), and 36% exhibited lipase activity (up to 10^3-fold faster than the uncatalyzed reaction). [10] Also of note, nearly 30% of the proteins that bound heme exhibited some level of activity for all of these functions, highlighting the promiscuity of unevolved libraries. Even in the absence of cofactor, 30% of the third generation library exhibits esterase activity and 20% exhibits lipase activity, although at considerably lower rates than natural, evolved enzymes. [10]
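Because the heme-dependent activities are reported as fractions of the heme-binding proteins, it can help to convert them into rough fractions of the whole library. The short sketch below does this arithmetic. It assumes that each activity percentage is conditional on heme binding and treats the rounded figures quoted above ("nearly 66%", "nearly 80%", and so on) as exact, so the results are approximate. The variable names are illustrative.

```python
# Fraction of the third generation library that binds heme (rounded, from the text).
HEME_BINDING = 0.66

# Fractions of heme-binding proteins showing each activity (rounded, from the text).
ACTIVITY_GIVEN_HEME = {
    "peroxidase": 0.80,
    "hydrolase": 0.60,
    "lipase": 0.36,
    "all tested functions": 0.30,
}

# Overall fraction = P(binds heme) * P(activity | binds heme).
overall = {
    name: HEME_BINDING * fraction
    for name, fraction in ACTIVITY_GIVEN_HEME.items()
}

for name, fraction in overall.items():
    print(f"{name}: roughly {fraction:.0%} of the whole library")
```

On these assumptions, about half of the entire library shows heme-dependent peroxidase activity, and roughly a fifth shows all of the tested heme-dependent activities.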
Conclusion
Using an unevolved combinatorial library created with a binary design, Hecht et al. created a library of primitive enzymes, untouched by evolution. The enzymes in this library form stable folded structures, but contain several dynamic residues and show promiscuous substrate specificity. These results appear to support Jensen’s hypothesis from 1976 that "primitive enzymes possessed a very broad specificity, permitting them to react with a wide range of related substrates." [1] The eventual development of cells adapted to a particular environment and the differentiation of cells with specific functions in higher organisms required that proteins shed their promiscuous specificities. [10] Although modern proteomes are well adapted to the functions they carry out, the vastness of unexplored sequence space likely holds structures that would be better suited to those functions. Hecht et al. continue their research on binary combinatorial libraries, especially toward finding biologically active proteins that are able to support cellular life in vivo, a critical step toward synthetic biology.
Additional Structures
Additional Resources
The Hecht Lab Website
References
- [1] Jensen RA. Enzyme recruitment in evolution of new function. Annu Rev Microbiol. 1976;30:409-25. PMID:791073 doi:10.1146/annurev.mi.30.100176.002205
- [2] Go A, Kim S, Baum J, Hecht MH. Structure and dynamics of de novo proteins from a designed superfamily of 4-helix bundles. Protein Sci. 2008 May;17(5):821-32. PMID:18436954 doi:10.1110/ps.073377908
- [3] Hecht MH, Das A, Go A, Bradley LH, Wei Y. De novo proteins from designed combinatorial libraries. Protein Sci. 2004 Jul;13(7):1711-23. PMID:15215517 doi:10.1110/ps.04690804
- [4] Xiong H, Buckwalter BL, Shieh HM, Hecht MH. Periodicity of polar and nonpolar amino acids is the major determinant of secondary structure in self-assembling oligomeric peptides. Proc Natl Acad Sci U S A. 1995 Jul 3;92(14):6349-53. PMID:7603994
- [5] Roy S, Helmer KJ, Hecht MH. Detecting native-like properties in combinatorial libraries of de novo proteins. Fold Des. 1997;2(2):89-92. PMID:9135980
- [6] Roy S, Hecht MH. Cooperative thermal denaturation of proteins designed by binary patterning of polar and nonpolar amino acids. Biochemistry. 2000 Apr 25;39(16):4603-7. PMID:10769115
- [7] Chothia C, Lesk AM. The evolution of protein structures. Cold Spring Harb Symp Quant Biol. 1987;52:399-405. PMID:3454269
- [8] Bromberg S, Dill KA. Side-chain entropy and packing in proteins. Protein Sci. 1994 Jul;3(7):997-1009. PMID:7920265 doi:10.1002/pro.5560030702
- [9] Betz SF, DeGrado WF. Controlling topology and native-like behavior of de novo-designed peptides: design and characterization of antiparallel four-stranded coiled coils. Biochemistry. 1996 May 28;35(21):6955-62. PMID:8639647 doi:10.1021/bi960095a
- [10] Patel SC, Bradley LH, Jinadasa SP, Hecht MH. Cofactor binding and enzymatic activity in an unevolved superfamily of de novo designed 4-helix bundle proteins. Protein Sci. 2009 Jul;18(7):1388-400. PMID:19544578 doi:10.1002/pro.147
- [11] Wei Y, Kim S, Fela D, Baum J, Hecht MH. Solution structure of a de novo protein from a designed combinatorial library. Proc Natl Acad Sci U S A. 2003 Nov 11;100(23):13270-3. Epub 2003 Oct 30. PMID:14593201 doi:10.1073/pnas.1835644100