AT-hook
Encyclopedia
The AT-hook is a DNA-binding
motif
present in many proteins, including the high mobility group
(HMG) protein
s, DNA-binding proteins
from plants and hBRG1 protein, a central ATPase of the human switching/sucrose non-fermenting (SWI/SNF) remodeling complex
.
This motif consists of a conserved, palindromic, core sequence of proline
-arginine
-glycine
-arginine
-proline
, although some AT-hooks contain only a single proline in the core sequence. AT-hooks also include a variable number of positively charged lysine
and arginine
residues on either side of the core sequence. The AT-hook binds to the minor groove of adenine
-thymine
(AT) rich DNA, hence the AT in the name. The rest of the name derives from a predicted asparagine
/aspartate "hook" in the earliest AT-hooks reported in 1990. In 1997 structural studies using NMR
determined that a DNA-bound AT-hook adopted a crescent or hook shape around the minor groove of a target DNA strand (pictured at right). HMGA proteins contain three AT-hooks, although some proteins contain as many as 30. The optimal binding sequences for AT-hook proteins are repeats of the form (ATAA)n or (TATT)n, although the optimal binding sequences for the core sequence of the AT-hook are AAAT and AATT.
DNA-binding domain
A DNA-binding domain is an independently folded protein domain that contains at least one motif that recognizes double- or single-stranded DNA. A DBD can recognize a specific DNA sequence or have a general affinity to DNA...
motif
Structural motif
In a chain-like biological molecule, such as a protein or nucleic acid, a structural motif is a supersecondary structure, which appears also in a variety of other molecules...
present in many proteins, including the high mobility group
High mobility group
High-Mobility Group or HMG is a group of chromosomal proteins that help withtranscription, replication, recombination, and DNA repair.-Families:The HMG proteins are subdivided into 3 superfamilies each containing a characteristic functional domain:...
(HMG) protein
Protein
Proteins are biochemical compounds consisting of one or more polypeptides typically folded into a globular or fibrous form, facilitating a biological function. A polypeptide is a single linear polymer chain of amino acids bonded together by peptide bonds between the carboxyl and amino groups of...
s, DNA-binding proteins
Protein
Proteins are biochemical compounds consisting of one or more polypeptides typically folded into a globular or fibrous form, facilitating a biological function. A polypeptide is a single linear polymer chain of amino acids bonded together by peptide bonds between the carboxyl and amino groups of...
from plants and hBRG1 protein, a central ATPase of the human switching/sucrose non-fermenting (SWI/SNF) remodeling complex
Protein complex
A multiprotein complex is a group of two or more associated polypeptide chains. If the different polypeptide chains contain different protein domain, the resulting multiprotein complex can have multiple catalytic functions...
.
This motif consists of a conserved, palindromic, core sequence of proline
Proline
Proline is an α-amino acid, one of the twenty DNA-encoded amino acids. Its codons are CCU, CCC, CCA, and CCG. It is not an essential amino acid, which means that the human body can synthesize it. It is unique among the 20 protein-forming amino acids in that the α-amino group is secondary...
-arginine
Arginine
Arginine is an α-amino acid. The L-form is one of the 20 most common natural amino acids. At the level of molecular genetics, in the structure of the messenger ribonucleic acid mRNA, CGU, CGC, CGA, CGG, AGA, and AGG, are the triplets of nucleotide bases or codons that codify for arginine during...
-glycine
Glycine
Glycine is an organic compound with the formula NH2CH2COOH. Having a hydrogen substituent as its 'side chain', glycine is the smallest of the 20 amino acids commonly found in proteins. Its codons are GGU, GGC, GGA, GGG cf. the genetic code.Glycine is a colourless, sweet-tasting crystalline solid...
-arginine
Arginine
Arginine is an α-amino acid. The L-form is one of the 20 most common natural amino acids. At the level of molecular genetics, in the structure of the messenger ribonucleic acid mRNA, CGU, CGC, CGA, CGG, AGA, and AGG, are the triplets of nucleotide bases or codons that codify for arginine during...
-proline
Proline
Proline is an α-amino acid, one of the twenty DNA-encoded amino acids. Its codons are CCU, CCC, CCA, and CCG. It is not an essential amino acid, which means that the human body can synthesize it. It is unique among the 20 protein-forming amino acids in that the α-amino group is secondary...
, although some AT-hooks contain only a single proline in the core sequence. AT-hooks also include a variable number of positively charged lysine
Lysine
Lysine is an α-amino acid with the chemical formula HO2CCH4NH2. It is an essential amino acid, which means that the human body cannot synthesize it. Its codons are AAA and AAG....
and arginine
Arginine
Arginine is an α-amino acid. The L-form is one of the 20 most common natural amino acids. At the level of molecular genetics, in the structure of the messenger ribonucleic acid mRNA, CGU, CGC, CGA, CGG, AGA, and AGG, are the triplets of nucleotide bases or codons that codify for arginine during...
residues on either side of the core sequence. The AT-hook binds to the minor groove of adenine
Adenine
Adenine is a nucleobase with a variety of roles in biochemistry including cellular respiration, in the form of both the energy-rich adenosine triphosphate and the cofactors nicotinamide adenine dinucleotide and flavin adenine dinucleotide , and protein synthesis, as a chemical component of DNA...
-thymine
Thymine
Thymine is one of the four nucleobases in the nucleic acid of DNA that are represented by the letters G–C–A–T. The others are adenine, guanine, and cytosine. Thymine is also known as 5-methyluracil, a pyrimidine nucleobase. As the name suggests, thymine may be derived by methylation of uracil at...
(AT) rich DNA, hence the AT in the name. The rest of the name derives from a predicted asparagine
Asparagine
Asparagine is one of the 20 most common natural amino acids on Earth. It has carboxamide as the side-chain's functional group. It is not an essential amino acid...
/aspartate "hook" in the earliest AT-hooks reported in 1990. In 1997 structural studies using NMR
NMR
NMR may refer to:Applications of Nuclear Magnetic Resonance:* Nuclear magnetic resonance* NMR spectroscopy* Solid-state nuclear magnetic resonance* Protein nuclear magnetic resonance spectroscopy* Proton NMR* Carbon-13 NMR...
determined that a DNA-bound AT-hook adopted a crescent or hook shape around the minor groove of a target DNA strand (pictured at right). HMGA proteins contain three AT-hooks, although some proteins contain as many as 30. The optimal binding sequences for AT-hook proteins are repeats of the form (ATAA)n or (TATT)n, although the optimal binding sequences for the core sequence of the AT-hook are AAAT and AATT.