SARS-CoV-2 ORF8
Following the discussion by Rubiera[2],
our first example will be SARS-CoV-2 protein ORF8, a protein that contributes to virulence in COVID-19[3]. CASP 14 classified ORF8 as a "free modeling" (FM) target[4], meaning that there were no adequate empirical templates for homology modeling. This was easily confirmed. When the amino acid sequence of ORF8 is submitted to Swiss Model, it reports the best templates for homology modeling. When the two empirical models that were not available during CASP 14 are excluded (7jtl and 7jx6), the best template offered, chain B of 3afc, covers only 36% of the length of ORF8 at 13.2% sequence identity, with a 4-residue untemplated gap in the sequence alignment. This template would not be adequate for constructing a useful model.
X-Ray Structures for ORF8
The quality of predictions for the structure of ORF8 are judged by comparison with X-ray crystallographic empirical models which were not available to the groups making predictions. Shortly after the CASP 14 competition (summer 2020), two X-ray crystal structures were reported for ORF8: 7jtl released August 26, 2020, and 7jx6, released September 23, 2020. The resolutions are 2.0 and 1.6 Å respectively, and both have worse than average Rfree values.
Click the green links below to change the molecular scene. Drag to rotate. Zoom the molecule with your mouse wheel, or Shift-Drag up/down.
|
. The only substantial disagreement is for a large surface loop, sequence range 48-57. See the TABLE below for RMSD values.
AlphaFold2 Prediction for ORF8
The quality of a prediction in CASP is judged, in large part, by the Global Distance Test Total Score, GDT_TS. AlphaFold2's predicted structure[5] has a GDT_TS score of 87. (A score of 0 is meaningless, and a score of 100 means perfect agreement with an X-ray crystal structure.) 87 means . The structure predicted by AlphaFold2 is almost as close to the X-ray crystallographic model 7jx6 as is the independently-determined X-ray structure 7jtl. AlphaFold2 predicted the positions of 92 amino acids. (CASP 14 excluded residues 48-59, a 12-residue surface loop, from the target residues[4].) See TABLE below for RMSD values.
ORF8 Alignments With Chain A of 7jx6
| Model | GDT_TS | Disulfde Bonds | Cα RMSD, Å | Cα Aligned | RMSD Including Sidechains, Å | Atoms Aligned
|
| 7jtl:A | | 3 | 4.02 0.66 | 102/102 (100%) 87/102 (85%) | 4.3 1.58 | 829/829 (100%) 709/829 (86%)
|
| AlphaFold2 | 87 | 3 | 2.58 1.25 | 92/92 (100%) 83/92* (90%) | 3.23 1.91 | 747/748 (100%) 679/748 (91%)
|
| 2nd Best* | 43 | 0 | 5.33 1.71 | 92/92 (100%) 38/92 (41%) | 6.54 5.86 | 747/748 (100%) 324/748 (43%)
|
| 3rd Best§ | 33 | 0 | 13.37 † | 92/92 (100%) † | 14.50 † | 747/748 (100%) †
|
- Alignments by "Magic Fit" of Swiss-PdbViewer 4.1.
- Alignments by "Iterative Magic Fit" of Swiss-PdbViewer 4.1.
- *Second best: Group of Xian Ming Pan.
- §Third best: Group of Perez Lab.
- † Iterative Magic Fit was unable to align.
Second Best Prediction for ORF8
In CASP 14, 70 research groups and 42 automated servers predicted structures for ORF8. The median GDT_TS score for all 112 predictions was 26. AlphaFold2 made the best prediction (GDT_TS 87). The second best prediction was by the group of Xian Ming Pan, with GDT_TS 43 (see TABLE above).
Third Best Prediction for ORF8