Friday, 24 March 2017

molecular biology - How do DNA-binding proteins recognize the correct DNA base pairs?


My professor posed this question to the class today - "How do DNA binding proteins specifically bind to base pairs?"


He alluded to the different arrangements of hydrogen-bond donor and acceptors in A-T, T-A, C-G, and G-C base pairs. However, they are not always significantly different in some pairs (i.e. the minor groove of the C-G and G-C base pairs have the same acceptor - donor - acceptor pattern).


So how would a binding protein know the difference between C-G and G-C on the minor groove of a DNA base pair? How does a DNA binding protein know it's reached the right base pair, more generally?


Thank you!



Answer




There are different kinds of DNA binding domains; the ones involved in base identification in the major groove can differentiate between different base pairs of the same nucleotides i.e. AT vs TA because they bind to functional groups on a nucleotide and not the base-pair per se. The stereochemistry i.e. which nucleotide is in the major groove, is important. The location of the nucleotides in the major groove also is a factor (Figure 1).


enter image description here



Figure 1: DNA sequence identification by Helix-Turn-Helix transcription factor. This is the example case of the Engrailed protein in Drosophila. Reproduced from Paro and Sauer (1992) [1].



You can see that Adenine is recognised differently than Thymine. In many cases such as that of zinc finger proteins (ZF), sequence identification usually happens for dinucleotide pairs such that each finger (a modular unit) can identify a di/tri-nucleotide (Figure 2,3). An array of fingers in case of ZF and helices in case of Helix-Turn-Helix motifs can recognize a unique stretch of DNA sequence (which is certainly more unique than individual base pairs). Usually the binding of the fingers are co-operative which leads to increased affinity and specificity of the protein-DNA binding [2,3]. Analogous mechanism may exist for other kinds of DBD too (I haven't specifically looked for that). Moreover, many DNA motif binding proteins oligomerize, which is again co-operative. This increases the specificity and the overall binding kinetics becomes sigmoidal and thresholded (see Hill kinetics).


Binding mechanisms are best understood for ZF proteins that led to the development of zinc finger nucleases which was one of the first modern methods of targeted genome editing [4] (though they have now been obsoleted by CRISPR-Cas system).


enter image description here



Figure 2: DNA sequence identification by zinc finger proteins. This example considers the Zif268 protein. Reproduced from Paro and Sauer (1992) [1].




enter image description here



Figure 3: A more detailed depiction of Zif268-DNA interaction. Reproduced from Durai et al. (2005) [4].





References:



  1. Pabo, Carl O., and Robert T. Sauer. "Transcription factors: structural families and principles of DNA recognition." Annual review of biochemistry 61.1 (1992): 1053-1095.

  2. Krishna, S. Sri, Indraneel Majumdar, and Nick V. Grishin. "Structural classification of zinc fingers SURVEY AND SUMMARY." Nucleic acids research 31.2 (2003): 532-550.


  3. Lee, Juyong, Jin-Soo Kim, and Chaok Seok. "Cooperativity and Specificity of Cys2His2 Zinc Finger Protein− DNA Interactions: A Molecular Dynamics Simulation Study." The Journal of Physical Chemistry B 114.22 (2010): 7662-7671.

  4. Durai, Sundar, et al. "Zinc finger nucleases: custom-designed molecular scissors for genome engineering of plant and mammalian cells." Nucleic acids research 33.18 (2005): 5978-5990.


No comments:

Post a Comment

evolution - Are there any multicellular forms of life which exist without consuming other forms of life in some manner?

The title is the question. If additional specificity is needed I will add clarification here. Are there any multicellular forms of life whic...