Journal of Molecular Biology
Regular articleThe solution structure of the domain from MeCP2 that binds to methylated DNA1
Introduction
The major covalent modification of the eukaryotic genome is the addition of a methyl group at the 5-position of cytosine in cytidine-guanosine (CpG) dinucleotide pairs. Because of its ability to repress transcription, methylation of CpG sequences has been implicated in stable modulations of gene expression in a cell-type-specific manner during development (Bartolomei & Tilghman, 1997). Mice lacking maintenance DNA methyltransferase (DNMT) are developmentally retarded and die at mid-gestation (Li et al., 1992).
While certain invertebrate genomes have a pattern of DNA methylation characterised by methylation-rich and methylation-free regions, the somatic genomes of vertebrates are consistently methylated except for regions known as “CpG islands” (Bird, 1986). CpG islands are coincident with the promoters of nearly two-thirds of human genes transcribed by RNA polymerase II (Antequera & Bird, 1994). DNA methylation can repress transcription at a distance Cedar 1988, Kass et al 1993, and repression of methylated transgenes requires prior assembly of chromatin (Buschhausen et al., 1987). Silencing of genes by methylation at CpG islands has been observed in the inactive X-chromosome (Riggs & Pfeifer, 1992), in genomic imprinting Neumann and Barlow 1996, Razin and Cedar 1994, and in transformed cell-lines and tumours (e.g. Bird, 1996).
A family of five mammalian proteins, MeCP2, MBD1, MBD2, MBD3 and MBD4, has been identified (Hendrich & Bird, 1998, and references therein) as being of likely importance in interpreting the signal that methylation of DNA represents. Each member has a stretch of 60–80 residues displaying a high level of similarity (50–70 %) between all five proteins (Figure 1). In MeCP2, this region has been identified as necessary for binding directly to methyl-CpG (mCpG, Nan et al., 1993), and it has been termed the mCpG-binding domain (MBD). MBD2 and MBD4 bind specifically to methylated DNA in vitro (Hendrich & Bird, 1998). Both proteins co-localise with foci of heavily methylated satellite DNA in mouse cells and localisation is disrupted in cells deficient in CpG methylation (Hendrich & Bird, 1998). MBD2 has been reported to possess demethylase activity (Bhattacharya et al., 1999) and MBD4 is probably a novel repair enzyme (Bellacosa et al., 1999). MBD3 apparently lacks specific methylated DNA-binding activity, although it has a high degree of sequence similarity with MBD2 (Figure 1). MBD1 and MeCP2 bind specifically to methylated DNA in any sequence context and inhibit transcription Meehan et al 1989, Lewis et al 1992, Boyes and Bird 1991, Nan et al 1997.
MeCP2 is capable of binding to a single, symmetrically methylated CpG dinucleotide and binds to chromosomes at sites known to contain methylated DNA (Nan et al., 1996). MeCP2 (Figure 1) contains both an MBD and a transcriptional repression domain. It is able to recruit histone deacetylases Nan et al 1998, Jones et al 1998, and to displace histone H1 from preassembled chromatin that contains methylated DNA. Its localisation is disrupted in embryonic stem cells (ES) lacking a functional DNMT gene (Nan et al., 1996). Like DNMT-deficient ES, MeCP2-deficient ES grow normally in culture but are incapable of supporting embryonic development (Tate et al., 1996). On the other hand, MeCP2-deficient somatic cells are viable and MeCP2 may be responsible for genome-wide transcriptional repression or “transcriptional noise reduction” (Bird, 1995), but may also effect suppression of specific genes.
The boundaries of the minimal domain of MeCP2 necessary and sufficient for binding mCpG were defined empirically using mutagenesis and band-shift assays (Nan et al., 1993). While residues 98–161 encompass the consensus MBD residues in MeCP2 (Figure 1), additional residues within MeCP2 at either the COOH or the NH2 terminus of the consensus sequence are required for binding. The minimum length mCpG-binding sequence in MeCP2 consists of amino acid residues 78–162. A recombinant fragment corresponding to this sequence was shown to bind to a single CpG site with an affinity in the order of 1 nM, and as a monomer (Nan et al., 1993). Symmetrically methylated CpG in the context of double-stranded DNA, flanked by non-specific sequences of no less than six base-pairs is a requirement of the target (Nan et al., 1993).
Given that the MBD is pivotal to the biological manifestation of the methylation signal in eukaryotic DNA, its structure-function relationships are of interest. Here, the solution structure of a recombinant fragment of MeCP2 encompassing its MBD, and capable of recognising mCpG is described, and on the basis of chemical shift perturbations, the DNA-binding surface of the domain is identified.
Section snippets
Secondary structure and conformational mobility
A recombinant fragment of MeCP2 encompassing residues 77–164, and having the non-native sequence MHHHHHHAM at its NH2 terminus and GSGC at its COOH terminus, was soluble at 1 mM (at pH 6.0), and appeared to be stable for many days providing the temperature of the sample remained below 20 °C; therefore NMR data were collected at 18 °C. Using a 13C, 15N-labelled sample it was possible to assign backbone atoms in all residues of the native sequence, some residues of the NH2-terminal poly-His tag,
Discussion
On the basis of single crystal X-ray studies (e.g. Mayer-Jung et al., 1997) the uncomplexed DNA is likely to be B-form, with the methyl groups of methyl-C in the major groove, at a distance of approximately 6.5 Å from one another. The MBD from MeCP2 has been reported (Nan et al., 1993) to bind as a monomer to a symmetrically methylated CpG site flanked by non-specific sequences of at least six base-pairs.
It was noticed previously that within the sequence of MeCP2 MBD there are a pair of
Protein expression
Recombinant, His-tagged MBD from MeCP2 (MHHHHHHAM-77–164-SGSGC) was expressed in Escherichia coli BL21(DE3)LysS from plasmid pET6HMBD (Cross et al., 1994). The sequence was based on a previous study (Nan et al., 1993) that defined the minimal protein fragment required to recognise methylated-CpG. The poly-His tag does not interfere with binding. An overnight culture was diluted 100 times and incubated at 37 °C to A600 = 0.3, before induction with isopropyl β-thiogalactoside. Cells were grown in
Acknowledgements
This work was supported by the MRC (Grant number G9504114MB), the Wellcome Trust (Grant number 031581) and the Edinburgh Centre for Protein Technology. A.F. is supported by a Wellcome Trust Advanced Training Fellowship (Grant number 052284/Z/97/Z).
References (50)
Gene number, noise reduction and biological complexity
Trends Genet.
(1995)- et al.
DNA methylation inhibits transcription indirectly via a methyl-CpG binding protein
Cell
(1991) DNA methylation and gene activity
Cell
(1988)- et al.
Improved 3D triple-resonance nmr techniques applied to a 31-kDa protein
J. Magn. Reson.
(1992) - et al.
Correlation of backbone amide and aliphatic side-chain resonances in 13C/15N-enriched proteins by isotropic mixing of 13C magnetization
J. Magn. Reson. ser. B
(1993) - et al.
Protein structure comparison by alignment of distance matrices
J. Mol. Biol.
(1993) - et al.
A gradient-enhanced HCCH TOCSY experiment for recording side-chain 1H and 13C correlations in H2O samples of proteins
J. Magn. Reson. ser. B
(1993) - et al.
Enhanced-sensitivity triple-resonance spectroscopy with minimal H2O saturation
J. Magn. Reson. ser. A
(1994) ANSIG-a program for the assignment of protein 1H 2D-NMR spectra by interactive computer-graphics
J. Magn. Reson.
(1989)- et al.
Purification, sequence and cellular localisation of a novel chromosomal protein that binds to methylated DNA
Cell
(1992)
Effect of cytosine methylation on DNA-DNA recognition at CpG steps
J. Mol. Biol.
Identification of a mammalian protein that binds specifically to DNA containing methylated CpGs
Cell
Gradient-enhanced triple-resonance 3-dimensional NMR experiments with improved sensitivity
J. Magn. Reson. ser. B
MeCP2 is a transcriptional repressor with abundant binding sites in genetic chromatin
Cell
Multiple roles for DNA methylation in genetic imprinting
Curr. Opin. Genet. Dev.
Automated NOESY interpretation with ambiguous distance restraintsThe refined NMR solution structure of the pleckstrin homology domain from beta-spectrin
J. Mol. Biol.
DNA methylation and genetic imprinting
Cell
X-chromosome inactivation and cell memory
Trends Genet.
Homeodomain interactions
Curr. Opin. Struct. Biol.
A novel mode of DNA recognition by a β-sheet revealed by the solution structure of the GCC-box binding domain in complex with DNA
EMBO J.
Number of CpG islands and genes in human and mouse
Proc. Natl Acad. Sci. USA
Genomic imprinting in mammals
Annu. Rev. Genet.
MED1, a novel human methyl-CpG-binding endonuclease, interacts with DNA mismatch repair protein MLH1
Proc. Natl Acad. Sci. USA
A mammalian protein with specific demethylase activity for mCpG DNA
Nature
CpG-rich islands and the function of DNA methylation
Nature
Cited by (173)
Quantitative investigation of the effects of DNA modifications and protein mutations on MeCP2-MBD-DNA interactions
2023, International Journal of Biological MacromoleculesTAT-MeCP2 protein variants rescue disease phenotypes in human and mouse models of Rett syndrome
2022, International Journal of Biological MacromoleculesInfluence of the disordered domain structure of MeCP2 on its structural stability and dsDNA interaction
2021, International Journal of Biological MacromoleculesThe Methyl-CpG–Binding Domain 2 and 3 Proteins and Formation of the Nucleosome Remodeling and Deacetylase Complex
2020, Journal of Molecular Biology
- 1
Edited by P. E. Wright
- 2
Present address: X. Nan, MRC Human Genetics Unit, Western General Hospital, Crewe Rd, Edinburgh EH4 2XU, UK.