Intrinsically Disordered Protein

From Proteopedia

Jump to: navigation, search

Contents

It has long been taught that proteins must be properly folded in order to perform their functions. This paradigm derives from work by Christian B. Anfinsen and coworkers. In the 1960's, they showed that RNAse, when denatured so that 99% of its enzymatic activity was lost, could regain enzymatic activity within seconds when the denaturing agent was removed under proper conditions[1][2][3]. They concluded that the amino acid sequence is sufficient for a protein to fold into its functional, lowest energy conformation. This work won the 1972 Nobel Prize, and was subsequently confirmed and extended by many researchers.

Beginning around 2000, it was recognized that not all proteins function in a folded state[4][5][6][7][8][9][10]. Some proteins must be unfolded or disordered in order to perform their functions, and others fold only in complex with target structures[11][12][13]. These are termed intrinsically disordered protein (IDP), intrinsically unstructured protein (IDP), or natively unfolded protein.

By some estimates, about 10% of all proteins are fully disordered, and about 40% of eukaryotic proteins have at least one long (>50 amino acids) disordered loop[7]. Such sequences, under physiological conditions in vitro, display physicochemical characteristics resembling those of random coils. They possess little or no ordered structure, having instead an extended conformation with high intra-molecular flexibility, lacking any tightly packed core.

At left is an animation of a heat shock/chaperonin protein fragment. Residues 1-70 are disordered; 71-109 are alpha helical. This animates 20 models from an NMR experiment (2ljl). Charges are colored blue=positive and red=negative. A high charge density prevents folding. For comparison, at right is an animation of 20 NMR models of a protein of similar length that folds into a stable domain (2n5a).

Animations will stop after 25 cycles. Shift+Re-load this page to restart the animations. Internet Explorer and Edge: toggle spinning off to speed up animation. Click on an animation to see it larger.

Many crystallographic structures have missing loops -- that is, ranges of amino acids with no atomic coordinates in the model. These "gaps" in the model are often thought to be artifacts of inadvertant disorder in the crystal. In some cases, these gaps may be alerting us to the presence of intrinsically disordered loops in an otherwise folded protein[14]. Such gaps are the basis for the DISOPRED2 disorder prediction server. FirstGlance in Jmol offers one method for locating and visualizing such gaps.

Despite the existence of compelling evidence for IDPs and intrinsically disordered loops beginning in 1990[15][16][17], many current textbooks of biochemistry and even some monographs on protein structure fail to mention intrinsic disorder and its importance for protein function[18][19]. In 2011, Chouard provided a readable and informative overview of IDPs and how some of them function[20].


Examples of IDPs

Examples cover a wide variety of cellular systems and it has been predicted that eukaryotes have more IDPs than other kingdoms [21]. Of course, there are no PDB codes for fully disordered proteins in isolation. However, there are some crystallographic results for IDP that undergo disorder-order transition when they complex with another folded protein domain, such as 1jsu, 1g3j, and 1oct[7]. Other examples are at Globular_Proteins. See further information about 1jsu and other cases below.

IDPs play roles in processes such as:

  • Cell signaling and cell cycle regulation, e.g. cyclin dependent kinase inhibitor p21Waf1/Cip1/Sdi1[22]
  • Oncogene, e.g. P53 contains large unstructured regions in its native state[23]
  • Assembly of cytoskeletal proteins, e.g. Tau protein[24]
  • Protein folding: some intrinsically disordered regions function as chaperones[25].
  • Membrane fusion and membrane transport, e.g. isolated components of the SNARE complex[26]
  • DNA recognition molecules, e.g. the basic DNA-binding region of the leucine zipper protein, GCN4[15]
  • Transcriptional activation domains, e.g. NF-kb[29], Glucocorticoid receptor, 77-262 fragment [30]. There is "widespread importance of structural disorder in gene regulatory proteins", such as Lacl/GalR and Hox[31].
  • Amyloid formation, e.g. prion protein, N terminal part[32], NACP precursor of the non-Ab component of the amyloid plaque[33]
  • Evasion of immune responses by parasites. Highly flexible disordered proteins are poor antigens [34].

Molecular Shields

It appears that hundreds of IDPs that remain soluble after boiling protect folded proteins against heat-denaturation, aggregation, and loss of activity from dessication or organic solvents[35]. They also appear to suppress neurodegeneration and extend lifespan[35]. They have been termed "heat-resistant obscure" (hero) proteins[35]. Their isoelectric pH's (pI's) form a bimodal distribution, so that most are negatively or positively charged at neutral pH[35]. Examples include six human proteins that were studied in detail: SERF2 (length 59), C9orf16 (length 83), C19ofr53 (length 99), BEX3 (length 111), C11orf58 (length 183), and SERBP1 (length 408)[35]. Estimated isoelectric points[36] are 10.5, 4.2, 11.6, 5.5, 4.7, and 8.6 respectively. In several test cases, scrambling the sequences of these proteins did not diminish their protective effects[35]. Their protective activity appears to depend on their high charge density and length, but not on a specific sequence.

Protein disorder predictors

Principles Used in Prediction

FoldIndex output for three protein sequences (a) Cat-Muscle Pyruvate Kinase (b) The human p53 tumor suppressor protein (c) Chicken gizzard caldesmon; green is folded and red is unfolded
FoldIndex[37] output for three protein sequences (a) Cat-Muscle Pyruvate Kinase (b) The human p53 tumor suppressor protein (c) Chicken gizzard caldesmon; green is folded and red is unfolded
Content of order-promoting and disorder-promoting amino acids in the Drosophila proteome (black) and in the cytoplasmic domain of gliotactin that was shown to be IDP (gray)
Content of order-promoting and disorder-promoting amino acids in the Drosophila proteome (black) and in the cytoplasmic domain of gliotactin that was shown to be IDP (gray) [38]

Led by the assumption that “since amino acid sequence determines 3-D structure, amino acid sequence should also determine lack of 3-D structure” [39] specific sequence features shared by IDPs have been evaluated and algorithms for their identification formulated.

The low hydrophobicity and high net charge of natively unfolded proteins result in a difference in amino acid composition between them and natively folded proteins [40].

Compared to sequences of ordered proteins, disordered protein sequences are substantially depleted in I, L, V, W, F, Y, and C, which were therefore designated as “order promoting” amino acids, and enriched in E, K, R, G, Q, S, P, and A, which have been designated as “disorder promoting”. The under representation of hydrophobic amino acids in a protein diminishes one of the basic thermodynamic forces known to be important for protein folding, namely, the hydrophobic interaction. Because a hydrophobic core does not form, such proteins have large hydrodynamic dimensions.

Prediction Servers

The quality of predictions by various algorithms have been evaluated beginning in CASP5 (2002). The assessment of disorder predictions for CASP8 (2008) has been published[41].

Prediction Meta-Servers

Meta-servers gather the predictions from other servers into a single report.

  • MobiDB. The Structure section in entries at UniProt.Org offers MobiDB. MobiDB also includes manually curated disorder data along with derived and predicted data.
  • D2P2: "pre-computed disorder predictions on a large library of proteins from completely-sequenced genomes. ... statistical comparisons of the various prediction methods ...."


Single Algorithm Prediction Servers

  • DISOPRED2 (Jones Group, University College London, UK). "DISOPRED2 was trained on a set of around 750 non-redundant sequences with high resolution X-ray structures. Disorder was identified with those residues that appear in the sequence records but with coordinates missing from the electron density map. This is an imperfect means for identifying disordered residues as missing co-ordinates can also arise as an artifact of the crystalization process. False assignment of order can also occur as a result of stabilizing interactions by ligands or other macromolecules in the complex. However, this is the simplest means for defining disorder in the absence of further experimental investigation of the protein." (Quoted from the DISOPRED2 website.)
  • flDPnn2 (putative function- and linker based Disorder Prediction using deep neural network)[42]. In 2021, flDPnn, was selected as the best disorder predictor in the first Critical Assessment of Protein Intrinsic Disorder Prediction (CAID) [43].
  • FoldIndex[37] (Sussman Group, Weizmann Institute, Rehovot, Israel). FoldIndex makes predictions based on the observation that IDPs occupy the low hydrophobicity/ high net-charge portion of charge-hydrophobicity phase space. (See Figure above.)
  • IUPred2a (Dosztányi, Csizmók, Tompa and Simon: Budapest, Hungary). "IUPred recognized intrinsically unstructured regions from the amino acid sequence based on the estimated pairwise energy content. The underlying assumption is that globular proteins are composed of amino acids which have the potential to form a large number of favorable interactions, whereas intrinsically disorered proteins (IDPs) adopt no stable structure because their amino acid composition does not allow sufficient favorable interactions to form." (Quoted from the IUPred website.)
  • PONDR (Dunker Group, Indiana University and Molecular Kinetics, Inc., Indianapolis IN USA; Obradovic Group, Temple Univ., Philadelphia PA USA). "PONDR® functions from primary sequence data alone. The predictors are feedforward neural networks that use sequence information from windows of generally 21 amino acids. Attributes, such as the fractional composition of particular amino acids or hydropathy, are calculated over this window, and these values are used as inputs for the predictor. The neural network, which has been trained on a specific set of ordered and disordered sequences, then outputs a value for the central amino acid in the window. The predictions are then smoothed over a sliding window of 9 amino acids. If a residue value exceeds a threshold of 0.5 (the threshold used for training) the residue is considered disordered." (Quoted from the PONDR website.)
  • RONN (Esnouf Group, University of Oxford, UK). "We have developed the regional order neural network (RONN) software as an application of our recently developed ‘bio- basis function neural network’ pattern recognition algorithm to the detection of natively disordered regions in proteins. The results of blind-testing a panel of nine disorder prediction tools (including RONN) against 80 protein sequences derived from the Protein Data Bank shows that, based on the probability excess measure, RONN performed the best."[44]
  • WinDiso[45] (Grishin Lab, Dallas, Texas USA). "WinDiso is a linear, sequence- and alignment-based predictor of disordered/unfolded regions in proteins. It has the capability of adjusting for the increased tendency for disorder at protein termini. The simple weighted window-based algorithm and careful optimization technique make this a good predictor to use when trying to avoid bias toward special cases." (Quoted from the Grishin lab website.)

The above list is incomplete. Addition of other servers is welcome, and summaries of methods, pros and cons for each server would be useful.


Curated Collections

Because their very nature makes them difficult to categorize and study by standard means, several groups have set up curated listings of intrinsically disordered proteins and intrinsically disordered regions.

  • PED: Protein Ensemble Database: " an openly accessible database for the deposition of structural ensembles of intrinsically disordered proteins (IDPs) and of denatured proteins based on nuclear magnetic resonance spectroscopy, small-angle X-ray scattering and other data measured in solution." The database of conformational ensembles describing flexible proteins. This database seems to have stopped taking submissions in 2016.
  • Disprot: " a database of experimental evidences of disorder manually collected from literature."


Biological implications of IDPs

It was proposed that the unfolded nature of the IDPs provides them with advantages in recognition and binding. Although their large hydrodynamic dimensions slow down diffusion, their size provides a large target for initial molecular collisions, and the lack of rigid binding pockets permits multiple approach orientations for a binding partner, which may increase the probability of productive interactions [46][39]. In addition, IDPs allow molecular plasticity by adopting more than one conformation and binding diversity by binding to several proteins and thus many of the known hub proteins are IDPs. IDPs rapid turnover in the cell allow their tight regulation as many times needed in cell signaling and cell cycle.

Evolution of IDPs

In p53, the folded DNA-binding domain is conserved, while the intrinsically disordered regions display a higher rate of mutations[47].

Many IDPs undergo disorder-order transition

Binding of natural ligands such as a variety of small molecules, substrates, cofactors, other proteins, nucleic acids or membranes may induce unstructured proteins to adopt stable structures bound to the partner, or even a secondary structure bound to the partner. In addition to the cases detailed below, other examples include 1g3j, 1oct[7], and the Lac repressor.

Some IDP sequences are able to bind to multiple partners that have <25% sequence identity, and in some cases even different folds[48]. For example, the C-terminal portion of p53 is known to bind to four different protein partners each with different folds[48]; and the N-terminus of histone H3 binds to nine different protein partners with distinct folds[48].

The human p27Kip1 kinase inhibitory domain [49]

The cyclin-dependent kinases (CDKs) have a central role in coordinating the eukaryotic cell division cycle. CDKs are controlled through several different processes involving the binding of activating cyclin subunits. Complexes of cyclins with CDKs play a central role in the control of the eukaryotic cell cycle. These complexes are inhibited by other proteins termed in general cyclin-CDK inhibitors (CKIs). One example of CKIs is p27Kip1. p27Kip1 is an IDP and it binds to phosphorylated cyclin/CDK complex in an extended conformation interacting with both cyclin A and CDK2 (1jsu). On cyclin A, it binds in a groove formed by conserved cyclin box residues. On CDK2, it binds and rearranges the amino-terminal lobe and also inserts into the catalytic cleft, mimicking ATP. [[1]]

The transcriptional activator GCN4 [50]

The structure of GCN4 bound to a DNA fragment contains the perfectly symmetrical binding site (1dgc). A homodimer of parallel alpha-helices form an interhelix coiled-coil region via the leucine zipper, and the two N-terminal basic regions fit into the major groove of half sites on opposite sides of the DNA double helix.

The yeast transcriptional activator GCN4 belongs to a large family of eukaryotic transcription factors including Fos, Jun and CREB. All family members have a DNA recognition motif consists of a coiled-coil dimerization element, the leucine-zipper, and an adjoining basic region, which mediates DNA binding. This basic region is largely unstructured in the absence of DNA, addition of DNA containing a GCN4 binding site induce the transition of this region from unstructured to α-helical[51].

Practical Implications of IDPs

There is evidence that large intrinsically unstructured regions interfere with crystallization[14]. Oldfield et al., 2013[14], concluded:

The limited amount of intrinsic disorder present as missing density regions agrees with the idea that intrinsically disordered regions, particularly long disordered regions, inhibits successful determination of crystal structures, and suggests that avoiding or tailoring disordered proteins may aid in the determination of crystal structures.

Human CDK2 (blue) complex with cyclin-A (green) and P27 (pink) 1jsu: see p27kip1 below. P27 has undergone a disorder to order transition upon encountering these partners.

Drag the structure with the mouse to rotate


References and Notes

  1. For the sake of brevity, this description is oversimplified. RNAse needed to be reduced to break disulfide bonds, as well as using 8 M urea, for denaturation. Oxidation without the denaturant then left an inactive enzyme because the disulfide bonds formed randomly, precluding proper folding except very slowly (many hours). Only when protein disulfide isomerase was added did the re-folding occur at a physiological rate (about a minute). The fact that RNAse could thus be trapped in an inactive conformation under physiological conditions contributed to the insights developed by Anfinsen and his team. Proteins lacking disulfides renatured in seconds. For details, see Anfinsen's Nobel Lecture.
  2. A similar observation was made around the same time by then graduate student Lisa Steiner in the lab of Fred Richards at Yale University. Neither Richards nor advisor Joseph Fruton thought the observation interesting enough to publish. It was an answer to a question not yet asked. This story is recounted by David Eisenberg, see the next citation.
  3. Eisenberg DS. How Hard It Is Seeing What Is in Front of Your Eyes. Cell. 2018 Jun 28;174(1):8-11. doi: 10.1016/j.cell.2018.06.027. PMID:29958112 doi:http://dx.doi.org/10.1016/j.cell.2018.06.027
  4. Wright PE, Dyson HJ. Intrinsically unstructured proteins: re-assessing the protein structure-function paradigm. J Mol Biol. 1999 Oct 22;293(2):321-31. PMID:10550212 doi:10.1006/jmbi.1999.3110
  5. Dunker AK, Lawson JD, Brown CJ, Williams RM, Romero P, Oh JS, Oldfield CJ, Campen AM, Ratliff CM, Hipps KW, Ausio J, Nissen MS, Reeves R, Kang C, Kissinger CR, Bailey RW, Griswold MD, Chiu W, Garner EC, Obradovic Z. Intrinsically disordered protein. J Mol Graph Model. 2001;19(1):26-59. PMID:11381529
  6. Uversky VN. What does it mean to be natively unfolded? Eur J Biochem. 2002 Jan;269(1):2-12. PMID:11784292
  7. 7.0 7.1 7.2 7.3 Tompa P. Intrinsically unstructured proteins. Trends Biochem Sci. 2002 Oct;27(10):527-33. PMID:12368089
  8. Summary of the previous paper (Tompa, 2002): The disorder of intrinsically disordered proteins (IDP's) is crucial to their functions. They may adopt defined but extended structures when bound to cognate ligands. Their amino acid compositions are less hydrophobic than those of soluble proteins. They lack hydrophobic cores, and hence do not become insoluble when heated. About 40% of eukaryotic proteins have at least one long (>50 residues) disordered region. Roughly 10% of proteins in various genomes have been predicted to be fully disordered. Presently over 100 IDP's have been identified; none are enzymes. Obviously, IDP's are greatly underrepresented in the Protein Data Bank, although there are a few cases of an IDP bound to a folded (intrinsically structured) protein. Here, Tompa suggests five functional categories for intrinsically unstructured proteins and domains: entropic chains (bristles to ensure spacing, springs, flexible spacers/linkers), effectors (inhibitors and disassemblers), scavengers, assemblers, and display sites. (Summary by Eric Martz.)
  9. Dunker AK, Silman I, Uversky VN, Sussman JL. Function and structure of inherently disordered proteins. Curr Opin Struct Biol. 2008 Dec;18(6):756-64. Epub 2008 Nov 17. PMID:18952168 doi:10.1016/j.sbi.2008.10.002
  10. Tompa P, Csermely P. The role of structural disorder in the function of RNA and protein chaperones. FASEB J. 2004 Aug;18(11):1169-75. PMID:15284216 doi:10.1096/fj.04-1584rev
  11. Gunasekaran K, Tsai CJ, Kumar S, Zanuy D, Nussinov R. Extended disordered proteins: targeting function with less scaffold. Trends Biochem Sci. 2003 Feb;28(2):81-5. PMID:12575995
  12. Summary of the previous paper (Gunasekaran et al., 2003): Argues that proteins involved in extensive protein-protein interactions can function effectively despite having their structure depend upon such interactions, so that as monomers they are natively disordered. Dispensing with the structural framework (scaffold) needed to maintain a stable fold in the monomer increases efficiency by reducing size. This may account for the large percentage (roughly half) of all proteins that are predicted to be natively disordered. (Summary by Eric Martz.)
  13. Dyson HJ, Wright PE. Intrinsically unstructured proteins and their functions. Nat Rev Mol Cell Biol. 2005 Mar;6(3):197-208. PMID:15738986 doi:10.1038/nrm1589
  14. 14.0 14.1 14.2 Oldfield CJ, Xue B, Van YY, Ulrich EL, Markley JL, Dunker AK, Uversky VN. Utilization of protein intrinsic disorder knowledge in structural proteomics. Biochim Biophys Acta. 2013 Feb;1834(2):487-98. doi: 10.1016/j.bbapap.2012.12.003., Epub 2012 Dec 8. PMID:23232152 doi:http://dx.doi.org/10.1016/j.bbapap.2012.12.003
  15. 15.0 15.1 Weiss MA, Ellenberger T, Wobbe CR, Lee JP, Harrison SC, Struhl K. Folding transition in the DNA-binding domain of GCN4 on specific binding to DNA. Nature. 1990 Oct 11;347(6293):575-8. PMID:2145515 doi:http://dx.doi.org/10.1038/347575a0
  16. Pontius BW, Berg P. Renaturation of complementary DNA strands mediated by purified mammalian heterogeneous nuclear ribonucleoprotein A1 protein: implications for a mechanism for rapid molecular assembly. Proc Natl Acad Sci U S A. 1990 Nov;87(21):8403-7. PMID:2236048
  17. For the unstructured domain interpretation of early work by Pontius and Berg, see the 2004 review by Tompa and Csermley, PMID: 15284216
  18. Dunker AK, Oldfield CJ, Meng J, Romero P, Yang JY, Chen JW, Vacic V, Obradovic Z, Uversky VN. The unfoldomics decade: an update on intrinsically disordered proteins. BMC Genomics. 2008 Sep 16;9 Suppl 2:S1. PMID:18831774 doi:10.1186/1471-2164-9-S2-S1
  19. Martz, E. Book review of Introduction to protein science—architecture, function, and genomics: Lesk, Arthur M.. Biochem. Mol. Biol. Educ. 33:144-5 (2006). DOI: 10.1002/bmb.2005.494033022442
  20. Chouard T. Structural biology: Breaking the protein rules. Nature. 2011 Mar 10;471(7337):151-3. PMID:21390105 doi:10.1038/471151a
  21. Dunker AK, Obradovic Z, Romero P, Garner EC, Brown CJ. Intrinsic protein disorder in complete genomes. Genome Inform Ser Workshop Genome Inform. 2000;11:161-71. PMID:11700597
  22. Kriwacki RW, Hengst L, Tennant L, Reed SI, Wright PE. Structural studies of p21Waf1/Cip1/Sdi1 in the free and Cdk2-bound state: conformational disorder mediates binding diversity. Proc Natl Acad Sci U S A. 1996 Oct 15;93(21):11504-9. PMID:8876165
  23. Bell S, Klein C, Muller L, Hansen S, Buchner J. p53 contains large unstructured regions in its native state. J Mol Biol. 2002 Oct 4;322(5):917-27. PMID:12367518
  24. Schweers O, Schonbrunn-Hanebeck E, Marx A, Mandelkow E. Structural studies of tau protein and Alzheimer paired helical filaments show no evidence for beta-structure. J Biol Chem. 1994 Sep 30;269(39):24290-7. PMID:7929085
  25. Tompa P, Kovacs D. Intrinsically disordered chaperones in plants and animals. Biochem Cell Biol. 2010 Apr;88(2):167-74. doi: 10.1139/o09-163. PMID:20453919 doi:http://dx.doi.org/10.1139/o09-163
  26. Fiebig KM, Rice LM, Pollock E, Brunger AT. Folding intermediates of SNARE complex assembly. Nat Struct Biol. 1999 Feb;6(2):117-23. PMID:10048921 doi:10.1038/5803
  27. Markus MA, Hinck AP, Huang S, Draper DE, Torchia DA. High resolution solution structure of ribosomal protein L11-C76, a helical protein with a flexible loop that becomes structured upon binding to RNA. Nat Struct Biol. 1997 Jan;4(1):70-7. PMID:8989327
  28. Ban N, Nissen P, Hansen J, Moore PB, Steitz TA. The complete atomic structure of the large ribosomal subunit at 2.4 A resolution. Science. 2000 Aug 11;289(5481):905-20. PMID:10937989
  29. Schmitz ML, dos Santos Silva MA, Altmann H, Czisch M, Holak TA, Baeuerle PA. Structural and functional analysis of the NF-kappa B p65 C terminus. An acidic and modular transactivation domain with the potential to adopt an alpha-helical conformation. J Biol Chem. 1994 Oct 14;269(41):25613-20. PMID:7929265
  30. Baskakov IV, Kumar R, Srinivasan G, Ji YS, Bolen DW, Thompson EB. Trimethylamine N-oxide-induced cooperative folding of an intrinsically unfolded transcription-activating fragment of human glucocorticoid receptor. J Biol Chem. 1999 Apr 16;274(16):10693-6. PMID:10196139
  31. Bondos SE, Swint-Kruse L, Matthews KS. Flexibility and Disorder in Gene Regulation: LacI/GalR and Hox Proteins. J Biol Chem. 2015 Oct 9;290(41):24669-77. doi: 10.1074/jbc.R115.685032. Epub 2015, Sep 4. PMID:26342073 doi:http://dx.doi.org/10.1074/jbc.R115.685032
  32. Donne DG, Viles JH, Groth D, Mehlhorn I, James TL, Cohen FE, Prusiner SB, Wright PE, Dyson HJ. Structure of the recombinant full-length hamster prion protein PrP(29-231): the N terminus is highly flexible. Proc Natl Acad Sci U S A. 1997 Dec 9;94(25):13452-7. PMID:9391046
  33. Weinreb PH, Zhen W, Poon AW, Conway KA, Lansbury PT Jr. NACP, a protein implicated in Alzheimer's disease and learning, is natively unfolded. Biochemistry. 1996 Oct 29;35(43):13709-15. PMID:8901511 doi:10.1021/bi961799n
  34. Dunker AK. Another disordered chameleon: the Micro-Exon Gene 14 protein from Schistosomiasis. Biophys J. 2013 Jun 4;104(11):2326-8. doi: 10.1016/j.bpj.2013.04.018. PMID:23746503 doi:http://dx.doi.org/10.1016/j.bpj.2013.04.018
  35. 35.0 35.1 35.2 35.3 35.4 35.5 Tsuboyama K, Osaki T, Matsuura-Suzuki E, Kozuka-Hata H, Okada Y, Oyama M, Ikeuchi Y, Iwasaki S, Tomari Y. A widespread family of heat-resistant obscure (Hero) proteins protect against protein instability and aggregation. PLoS Biol. 2020 Mar 12;18(3):e3000632. doi: 10.1371/journal.pbio.3000632., eCollection 2020 Mar. PMID:32163402 doi:http://dx.doi.org/10.1371/journal.pbio.3000632
  36. Isoelectric points were estimated with the Protein Calculator.
  37. 37.0 37.1 Prilusky J, Felder CE, Zeev-Ben-Mordehai T, Rydberg EH, Man O, Beckmann JS, Silman I, Sussman JL. FoldIndex: a simple tool to predict whether a given protein sequence is intrinsically unfolded. Bioinformatics. 2005 Aug 15;21(16):3435-8. Epub 2005 Jun 14. PMID:15955783 doi:http://dx.doi.org/10.1093/bioinformatics/bti537
  38. Zeev-Ben-Mordehai T, Rydberg EH, Solomon A, Toker L, Auld VJ, Silman I, Botti S, Sussman JL. The intracellular domain of the Drosophila cholinesterase-like neural adhesion protein, gliotactin, is natively unfolded. Proteins. 2003 Nov 15;53(3):758-67. PMID:14579366 doi:10.1002/prot.10471
  39. 39.0 39.1 Dunker AK, Obradovic Z. The protein trinity--linking function and disorder. Nat Biotechnol. 2001 Sep;19(9):805-6. PMID:11533628 doi:10.1038/nbt0901-805
  40. Romero P, Obradovic Z, Li X, Garner EC, Brown CJ, Dunker AK. Sequence complexity of disordered protein. Proteins. 2001 Jan 1;42(1):38-48. PMID:11093259
  41. Noivirt-Brik O, Prilusky J, Sussman JL. Assessment of disorder predictions in CASP8. Proteins. 2009 Aug 21. PMID:19774619 doi:10.1002/prot.22586
  42. Hu G, Katuwawala A, Wang K, Wu Z, Ghadermarzi S, Gao J, Kurgan L. flDPnn: Accurate intrinsic disorder prediction with putative propensities of disorder functions. Nat Commun. 2021 Jul 21;12(1):4438. PMID:34290238 doi:10.1038/s41467-021-24773-7
  43. Necci M, Piovesan D, Tosatto SCE. Critical assessment of protein intrinsic disorder prediction. Nat Methods. 2021 May;18(5):472-481. PMID:33875885 doi:10.1038/s41592-021-01117-3
  44. Yang ZR, Thomson R, McNeil P, Esnouf RM. RONN: the bio-basis function neural network technique applied to the detection of natively disordered regions in proteins. Bioinformatics. 2005 Aug 15;21(16):3369-76. Epub 2005 Jun 9. PMID:15947016 doi:http://dx.doi.org/10.1093/bioinformatics/bti534
  45. Holladay NB, Kinch LN, Grishin NV. Optimization of linear disorder predictors yields tight association between crystallographic disorder and hydrophobicity. Protein Sci. 2007 Oct;16(10):2140-52. PMID:17893360 doi:16/10/2140
  46. Denning DP, Uversky V, Patel SS, Fink AL, Rexach M. The Saccharomyces cerevisiae nucleoporin Nup2p is a natively unfolded protein. J Biol Chem. 2002 Sep 6;277(36):33447-55. Epub 2002 Jun 13. PMID:12065587 doi:10.1074/jbc.M203499200
  47. Xue B, Brown CJ, Dunker AK, Uversky VN. Intrinsically disordered regions of p53 family are highly diversified in evolution. Biochim Biophys Acta. 2013 Apr;1834(4):725-38. doi: 10.1016/j.bbapap.2013.01.012., Epub 2013 Jan 22. PMID:23352836 doi:http://dx.doi.org/10.1016/j.bbapap.2013.01.012
  48. 48.0 48.1 48.2 Hsu WL, Oldfield CJ, Xue B, Meng J, Huang F, Romero P, Uversky VN, Dunker AK. Exploring the binding diversity of intrinsically disordered proteins involved in one-to-many binding. Protein Sci. 2013 Mar;22(3):258-73. doi: 10.1002/pro.2207. Epub 2013 Jan 27. PMID:23233352 doi:http://dx.doi.org/10.1002/pro.2207
  49. Russo AA, Jeffrey PD, Patten AK, Massague J, Pavletich NP. Crystal structure of the p27Kip1 cyclin-dependent-kinase inhibitor bound to the cyclin A-Cdk2 complex. Nature. 1996 Jul 25;382(6589):325-31. PMID:8684460 doi:10.1038/382325a0
  50. Konig P, Richmond TJ. The X-ray structure of the GCN4-bZIP bound to ATF/CREB site DNA shows the complex depends on DNA flexibility. J Mol Biol. 1993 Sep 5;233(1):139-54. PMID:8377181 doi:http://dx.doi.org/10.1006/jmbi.1993.1490
  51. Hollenbeck JJ, McClain DL, Oakley MG. The role of helix stabilizing residues in GCN4 basic region folding and DNA binding. Protein Sci. 2002 Nov;11(11):2740-7. PMID:12381856 doi:http://dx.doi.org/10.1110/ps.0211102

See Also


Authorship

The bulk of this article was written by Tzviya Zeev-Ben-Mordehai. Contributions by Eric Martz were minor -- his name is listed first due to a technicality.

Personal tools