Central dogma of molecular biology
The central dogma of molecular biology is an explanation of the flow of genetic information within a biological system. It was first stated by Francis Crick in 1956 and re-stated in a Nature paper published in 1970:
- The central dogma of molecular biology deals with the detailed residue-by-residue transfer of sequential information. It states that such information cannot be transferred back from protein to either protein or nucleic acid.
The central dogma has also been described as "DNA makes RNA and RNA makes protein," a positive statement which was originally termed the sequence hypothesis by Crick. However, this simplification does not make it clear that the central dogma as stated by Crick does not preclude the reverse flow of information from RNA to DNA, only ruling out the flow from protein to RNA or DNA. Crick's use of the word dogma was idiosyncratic, and has been controversial.
The dogma is a framework for understanding the transfer of DNA and RNA (both nucleic acids), and protein. There are 3×3 = 9 conceivable direct transfers of information that can occur between these. The dogma classes these into 3 groups of 3: 3 general transfers (believed to occur normally in most cells), 3 special transfers (known to occur, but only under specific conditions in case of some viruses or in a laboratory), and 3 unknown transfers (believed never to occur). The general transfers describe the normal flow of biological information: DNA can be copied to DNA (DNA replication), DNA information can be copied into mRNA (transcription), and proteins can be synthesized using the information in mRNA as a template (translation).
- 1 Biological sequence information
- 2 General transfers of biological sequential information
- 3 Special transfers of biological sequential information
- 4 Transfers of information not explicitly covered in the theory
- 5 Use of the term "dogma"
- 6 See also
- 7 References
- 8 External links
Biological sequence information
The biopolymers that comprise DNA, RNA and (poly)peptides are linear polymers (i.e.: each monomer is connected to at most two other monomers). The sequence of their monomers effectively encodes information. The transfers of information described by the central dogma ideally are faithful, deterministic transfers, wherein one biopolymer's sequence is used as a template for the construction of another biopolymer with a sequence that is entirely dependent on the original biopolymer's sequence.
General transfers of biological sequential information
Table of the 3 classes of information transfer suggested by the dogma General Special Unknown DNA → DNA RNA → DNA protein → DNA DNA → RNA RNA → RNA protein → RNA RNA → protein DNA → protein protein → protein
In the sense that DNA replication must occur if genetic material is to be provided for the progeny of any cell, whether somatic or reproductive, the copying from DNA to DNA arguably is the fundamental step in the central dogma. A complex group of proteins called the replisome performs the replication of the information from the parent strand to the complementary daughter strand.
The replisome comprises:
- a helicase that unwinds the superhelix as well as the double-stranded DNA helix to create a replication fork
- SSB protein that binds open the double-stranded DNA to prevent it from reassociating
- RNA primase that adds a complementary RNA primer to each template strand as a starting point for replication
- DNA polymerase III that reads the existing template chain from its 3' end to its 5' end and adds new complementary nucleotides from the 5' end to the 3' end of the daughter chain
- DNA polymerase I that removes the RNA primers and replaces them with DNA.
- DNA ligase that joins the two Okazaki fragments with phosphodiester bonds to produce a continuous chain.
Transcription is the process by which the information contained in a section of DNA is replicated in the form of a newly assembled piece of messenger RNA (mRNA). Enzymes facilitating the process include RNA polymerase and transcription factors. In eukaryotic cells the primary transcript is (pre-mRNA). Pre-mRNA must be processed for translation to proceed. Processing includes the addition of a 5' cap and a poly-A tail to the pre-mRNA chain, followed by splicing. Alternative splicing occurs when appropriate, increasing the diversity of the proteins that any single mRNA can produce. The product of the entire transcription process that began with the production of the pre-mRNA chain, is a mature mRNA chain.
The mature mRNA finds its way to a ribosome, where it gets translated. In prokaryotic cells, which have no nuclear compartment, the processes of transcription and translation may be linked together without clear separation. In eukaryotic cells, the site of transcription (the cell nucleus) is usually separated from the site of translation (the cytoplasm), so the mRNA must be transported out of the nucleus into the cytoplasm, where it can be bound by ribosomes. The ribosome reads the mRNA triplet codons, usually beginning with an AUG (adenine−uracil−guanine), or initiator methionine codon downstream of the ribosome binding site. Complexes of initiation factors and elongation factors bring aminoacylated transfer RNAs (tRNAs) into the ribosome-mRNA complex, matching the codon in the mRNA to the anti-codon on the tRNA. Each tRNA bears the appropriate amino acid residue to add to the polypeptide chain being synthesised. As the amino acids get linked into the growing peptide chain, the chain begins folding into the correct conformation. Translation ends with a stop codon which may be a UAA, UGA, or UAG triplet.
The mRNA does not contain all the information for specifying the nature of the mature protein. The nascent polypeptide chain released from the ribosome commonly requires additional processing before the final product emerges. For one thing, the correct folding process is complex and vitally important. For most proteins it requires other chaperone proteins to control the form of the product. Some proteins then excise internal segments from their own peptide chains, splicing the free ends that border the gap; in such processes the inside "discarded" sections are called inteins. Other proteins must be split into multiple sections without splicing. Some polypeptide chains need to be cross-linked, and others must be attached to cofactors such as haem (heme) before they become functional.
Special transfers of biological sequential information
Reverse transcription is the transfer of information from RNA to DNA (the reverse of normal transcription). This is known to occur in the case of retroviruses, such as HIV, as well as in eukaryotes, in the case of retrotransposons and telomere synthesis. It is the process by which genetic information from RNA gets transcribed into new DNA.
RNA replication is the copying of one RNA to another. Many viruses replicate this way. The enzymes that copy RNA to new RNA, called RNA-dependent RNA polymerases, are also found in many eukaryotes where they are involved in RNA silencing.
RNA editing, in which an RNA sequence is altered by a complex of proteins and a "guide RNA", could also be seen as an RNA-to-RNA transfer.
Direct translation from DNA to protein
Direct translation from DNA to protein has been demonstrated in a cell-free system (i.e. in a test tube), using extracts from neomycin was found to enhance this effect. However, it was unclear whether this mechanism of translation corresponded specifically to the genetic code.
Transfers of information not explicitly covered in the theory
After protein amino acid sequences have been translated from nucleic acid chains, they can be edited by appropriate enzymes. Although this is a form of protein affecting protein sequence, not explicitly covered by the central dogma, there are not many clear examples where the associated concepts of the two fields have much to do with each other.
An intein is a "parasitic" segment of a protein that is able to excise itself from the chain of amino acids as they emerge from the ribosome and rejoin the remaining portions with a peptide bond in such a manner that the main protein "backbone" does not fall apart. This is a case of a protein changing its own primary sequence from the sequence originally encoded by the DNA of a gene. Additionally, most inteins contain a homing endonuclease or HEG domain which is capable of finding a copy of the parent gene that does not include the intein nucleotide sequence. On contact with the intein-free copy, the HEG domain initiates the DNA double-stranded break repair mechanism. This process causes the intein sequence to be copied from the original source gene to the intein-free gene. This is an example of protein directly editing DNA sequence, as well as increasing the sequence's heritable propagation.
Variation in methylation states of DNA can alter gene expression levels significantly. Methylation variation usually occurs through the action of DNA methylases. When the change is heritable, it is considered epigenetic. When the change in information status is not heritable, it would be a somatic epitype. The effective information content has been changed by means of the actions of a protein or proteins on DNA, but the primary DNA sequence is not altered.
Prions are proteins of particular amino acid sequences in particular conformations. They propagate themselves in host cells by making conformational changes in other molecules of protein with the same amino acid sequence, but with a different conformation that is functionally important to the cell. Once the cell protein has been re-conformed to the prion configuration it no longer functions in the way that the cell requires and in its turn it can infect cells and reconfigure more functional molecules of that sequence into the harmful form. In some types of prion in fungi this change is continuous and direct; the information flow is Protein → Protein.
This does represents a transfer of information, but the prion interactions leave the sequence of the protein unchanged.
Natural genetic engineering
James A. Shapiro argues that a superset of these examples should be classified as natural genetic engineering and are sufficient to falsify the central dogma. While Shapiro has received a respectful hearing for his view, his critics have not been convinced that his reading of the central dogma is in line with what Crick intended. 
Use of the term "dogma"
"I called this idea the central dogma, for two reasons, I suspect. I had already used the obvious word hypothesis in the sequence hypothesis, and in addition I wanted to suggest that this new assumption was more central and more powerful. ... As it turned out, the use of the word dogma caused almost more trouble than it was worth. Many years later Jacques Monod pointed out to me that I did not appear to understand the correct use of the word dogma, which is a belief that cannot be doubted. I did apprehend this in a vague sort of way but since I thought that all religious beliefs were without foundation, I used the word the way I myself thought about it, not as most of the world does, and simply applied it to a grand hypothesis that, however plausible, had little direct experimental support."
"My mind was, that a dogma was an idea for which there was no reasonable evidence. You see?!" And Crick gave a roar of delight. "I just didn't know what dogma meant. And I could just as well have called it the 'Central Hypothesis,' or — you know. Which is what I meant to say. Dogma was just a catch phrase."
It is becoming increasingly clear that in reality, the concept of the central dogma of molecular biology is not entirely accurate insofar as it puts emphasis on proteins as the mediator of biological function. We know that 80% of the human genome is transcribed even though only 1% codes for proteins. While it is possible this may be simple transcriptional noise, it seems to be an unlikely waste of cellular energy resources, and considering the major role played by RNA in regulation of gene expression, it may well have a role. Current research focuses on investigating the function of non-coding RNA, that is, RNA that does not follow the dogma trend and does not code for polypeptides.
Moreover, the precise meaning of "information" in this framework is often overlooked.
- Crick, F.H.C. (1956): On Protein Synthesis. Symp. Soc. Exp. Biol. XII, 139-163. (pdf, early draft of original article)
- Crick, F (August 1970). "Central dogma of molecular biology.". Nature 227 (5258): 561–3.
- Leavitt, Sarah A. (June 2010). "Deciphering the Genetic Code: Marshall Nirenberg". Office of NIH History.
- Ahlquist P (May 2002). "RNA-dependent RNA polymerases, viruses, and RNA silencing". Science 296 (5571): 1270–3.
- B. J. McCarthy and J. J. Holland (September 15, 1965). Protein Synthesis"in vitro"Denatured DNA as a Direct Template for . Proceedings of the National Academy of Sciences of the United States 54 (3): 880–886.
- .T. Uzawa, A. Yamagishi, T. Oshima (2002-04-09). "Polypeptide Synthesis Directed by DNA as a Messenger in Cell-Free Polypeptide Synthesis by Extreme Thermophiles, Thermus thermophilus HB27 and Sulfolobus tokodaii Strain 7". The Journal of Biochemistry 131 (6): 849–853.
- Wilkins, Adam S. (January 2012). "(Review) Evolution: A View from the 21st Century". Genome Biology and Evolution.
- Moran, Laurence A (May–June 2011). "(Review) Evolution: A View from the 21st Century". Reports of the National Center for Science Education 32.3 (9): 1–4.
- Horace Freeland Judson (1996). "Chapter 6: My mind was, that a dogma was an idea for which there was no reasonable evidence. You see?!". The Eighth Day of Creation: Makers of the Revolution in Biology (25th anniversary edition). Cold Spring Harbor, NY: Cold Spring Harbor Laboratory Press.
- Animation: The Central Dogma - Nature video
- The Elaboration of the Central Dogma - Scitable: By Nature Education
- Discussion on challenges to the "Central Dogma of Molecular Biology"
- Explanation of the central dogma using a musical analogy
- "Francis Harry Compton Crick (1916-2004)" by A. Andrei at the Embryo Project Encyclopedia
- Central dogma as a case of knowledge shielding