Background
Gal4p is a transcriptional activator in Saccharomyces cerevisiae that regulates the expression of genes to coordinate the response to the carbon source galactose.
Gal4p is an 881-amino-acid protein with a Zn cluster-type DNA-binding domain, a linker domain, a dimerization domain, and two acidic activation domains.
The structure of the DNA-binding domain(1d66), the dimerization domain (1hbw), and the activation domain (3bts) has been solved separately. There is a structure of the DNA-binding and dimerization domains as one unit as well (3coq).
Pioneering studies on Gal4p from the Ptashne laboratory shaped our understanding of eukaryotic transcriptional activation, and Gal4p derivatives, along with their cognate promoters, continue to be used as tools by investigators preforming research in yeast.
DNA Recognition by Gal4p
of the protein dimer has 3 alpha helices.
The protein . Specifically, the consensus Gal4p-binding site is a 17-mer of sequence conforming to the motif below, which has the key feature of CGG triplets at the 5' ends, separated by 11 bps, or 5′-CGG-N11-CCG-3′.
5'-CGGNNNNNNNNNNNCCG-3'
|||||||||||||||||
3'-CGGNNNNNNNNNNNGGC-5'
The DNA binding domain . (In the crystal structure 1d66, the zinc ions are actually represented by other members of the same group on the periodic table, the heavier cadmium ions.) The Zn-binding motif lets the structure accomplish more with fewer amino acids, the metals lending an "economy of structure", arranging two small helices to recognize a specific DNA site. .
A C G T
Despite only making base-specific contacts at the ends,
The third, large helix of each dimer subunit interacts with the other . Note the abundance of gray residues in the central stem protruding from a site roughly above the center of the DNA binding site. The dimer interface is more fully explored in a later publication with an expanded structure, see 3coq.
The conformation of the linker connecting each dimerization helix and the metal-binding domains determines the preferred total number of bases between the triplets.