Representation of intron and exons within a simple gene containing a single intron.

An intron is any proteins, ribosomal RNA (rRNA), and transfer RNA (tRNA). When proteins are generated from intron-containing genes, RNA splicing takes place as part of the RNA processing pathway that follows transcription and precedes translation.

The word intron is derived from the term intragenic region, i.e. a region inside a gene. Although introns are sometimes called intervening sequences, the term "intervening sequence" can refer to any of several families of internal nucleic acid sequences that are not present in the final gene product, including inteins, untranslated sequences (UTR), and nucleotides removed by RNA editing, in addition to introns.


  • Introduction 1
  • Classification 2
  • Biological functions and evolution 3
  • Introns as mobile genetic elements 4
  • Name 5
  • Homotypic Synonyms 6
  • Heterotypic Synonyms 7
  • References 8
  • Vernacular names 9
  • See also 10
  • References 11
  • External links 12


Introns were first discovered in protein-coding genes of adenovirus,[4][5] and were subsequently identified in genes encoding transfer RNA and ribosomal RNA genes. Introns are now known to occur within a wide variety of genes throughout organisms and viruses within all of the biological kingdoms.

The fact that genes were split or interrupted by introns was discovered independently in 1977 by Phillip Allen Sharp and Richard J. Roberts, for which they shared the Nobel Prize in Physiology or Medicine in 1993.[6] The term intron was introduced by American biochemist Walter Gilbert:[7]

"The notion of the cistron [...] must be replaced by that of a transcription unit containing regions which will be lost from the mature messenger - which I suggest we call introns (for intragenic regions) - alternating with regions which will be expressed - exons." (Gilbert 1978)

The frequency of introns within different genomes is observed to vary widely across the spectrum of biological organisms. For example, introns are extremely common within the nuclear genome of higher vertebrates (e.g. humans and mice), where protein-coding genes almost always contain multiple introns, while introns are rare within the nuclear genes of some eukaryotic microorganisms,[8] for example baker's/brewer's yeast (Saccharomyces cerevisiae). In contrast, the mitochondrial genomes of vertebrates are entirely devoid of introns, while those of eukaryotic microorganisms may contain many introns. Introns are well known in bacterial and archaeal genes, but occur more rarely than in most eukaryotic genomes.

Simple illustration of an unspliced mRNA precursor, with two introns and three exons (top). After the introns have been removed via splicing, the mature mRNA sequence is ready for translation (bottom).


Splicing of all intron-containing RNA molecules is superficially similar, as described above. However, different types of introns were identified through the examination of intron structure by DNA sequence analysis, together with genetic and biochemical analysis of RNA is splicing reactions.

At least four distinct classes of introns have been identified.[9]

Group III introns are proposed to be a fifth family, but little is known about the biochemical apparatus that mediates their splicing. They appear to be related to group II introns, and possibly to spliceosomal introns.[10]

Nuclear pre-mRNA introns (spliceosomal introns) are characterized by specific intron sequences located at the boundaries between introns and exons.[11] These sequences are recognized by spliceosomal RNA molecules when the splicing reactions are initiated.[12] In addition, they contain a branch point, a particular nucleotide sequence near the 3' end of the intron that becomes covalently linked to the 5' end of the intron during the splicing process, generating a branched (lariat) intron. Apart from these three short conserved elements, nuclear pre-mRNA intron sequences are highly variable. Nuclear pre-mRNA introns are often much longer than their surrounding exons.

Group I and group II introns are found in genes encoding proteins (

  • A search engine for exon/intron sequences defined by NCBI
  • Bruce Alberts, Alexander Johnson, Julian Lewis, Martin Raff, Keith Roberts, and Peter Walter Molecular Biology of the Cell, 2007, ISBN 978-0-8153-4105-5. Fourth edition is available online through the NCBI Bookshelf: link
  • Jeremy M Berg, John L Tymoczko, and Lubert Stryer, Biochemistry 5th edition, 2002, W H Freeman. Available online through the NCBI Bookshelf: link
  • Intron finding tool for plant genomic sequences
  • Exon-intron graphic maker

External links

  1. ^ Alberts, Bruce (2008). Molecular biology of the cell. New York: Garland Science.  
  2. ^ Stryer, Lubert; Berg, Jeremy Mark; Tymoczko, John L. (2007). Biochemistry. San Francisco: W.H. Freeman.  
  3. ^ Kinniburgh, Alan; mertz, j. and Ross, J. (July 1978). "The precursor of mouse β-globin messenger RNA contains two intervening RNA sequences". Cell 14 (3): 681–693.  
  4. ^ Chow LT, Gelinas RE, Broker TR, Roberts RJ (September 1977). "An amazing sequence arrangement at the 5' ends of adenovirus 2 messenger RNA". Cell 12 (1): 1–8. PMID 902310. doi:10.1016/0092-8674(77)90180-5. 
  5. ^ Berget SM, Moore C, Sharp PA (August 1977). "Spliced segments at the 5' terminus of adenovirus 2 late mRNA". Proc. Natl. Acad. Sci. U.S.A. 74 (8): 3171–5.  
  6. ^
  7. ^  
  8. ^ Stajich JE, Dietrich FS, Roy SW (2007). "Comparative genomic analysis of fungal genomes reveals intron-rich ancestors". Genome Biol. 8 (10): R223.  
  9. ^
  10. ^ Copertino DW, Hallick RB (December 1993). "Group II and group III introns of twintrons: potential relationships with nuclear pre-mRNA introns". Trends Biochem. Sci. 18 (12): 467–71.  
  11. ^ Padgett RA, Grabowski PJ, Konarska MM, Seiler S, Sharp PA (1986). "Splicing of messenger RNA precursors". Annu. Rev. Biochem. 55: 1119–50.  
  12. ^ Guthrie C, Patterson B (1988). "Spliceosomal snRNAs". Annu. Rev. Genet. 22: 387–419.  
  13. ^ Cech TR (1990). "Self-splicing of group I introns". Annu. Rev. Biochem. 59: 543–68.  
  14. ^
  15. ^ Greer CL, Peebles CL, Gegenheimer P, Abelson J (February 1983). "Mechanism of action of a yeast RNA ligase in tRNA splicing". Cell 32 (2): 537–46.  
  16. ^ Reinhold-Hurek B, Shub DA (May 1992). "Self-splicing introns in tRNA genes of widely divergent bacteria". Nature 357 (6374): 173–6.  
  17. ^ Rearick D, Prakash A, McSweeny A, Shepard SS, Fedorova L, Fedorov A (March 2011). "Critical association of ncRNA with introns". Nucleic Acids Res. 39 (6): 2357–66.  
  18. ^ Lambowitz AM, Belfort M (1993). "Introns as mobile genetic elements". Annu. Rev. Biochem. 62: 587–622.  
  19. ^ Penny D, Hoeppner MP, Poole AM, Jeffares DC (November 2009). "An overview of the introns-first theory". Journal of Molecular Evolution 69 (5): 527–40.  
  20. ^ Rodríguez-Trelles F, Tarrío R, Ayala FJ (2006). "Origins and evolution of spliceosomal introns". Annu. Rev. Genet 40: 47–76.  
  21. ^ Mourier T, Jeffares DC (May 2003). "Eukaryotic intron loss". Science 300 (5624): 1393–1393.  
  22. ^ Roy SW, Gilbert W (March 2006). "The evolution of spliceosomal introns: patterns, puzzles and progress". Nature Reviews Genetics 7 (3): 211–21.  
  23. ^ de Souza SJ (July 2003). "The emergence of a synthetic theory of intron evolution". Genetica 118 (2–3): 117–21.  
  24. ^ Lynch M (April 2002). "Intron evolution as a population-genetic process". Proceedings of the National Academy of Sciences 99 (9): 6118–23.  
  25. ^ Jeffares DC, Mourier T, Penny D (January 2006). "The biology of intron gain and loss". Trends in Genetics 22 (1): 16–22.  
  26. ^ Jeffares DC, Penkett CJ, Bähler J (August 2008). "Rapidly regulated genes are intron poor". Trends in Genetics 24 (8): 375–8.  
  27. ^ Castillo-Davis CI, Mekhedov SL, Hartl DL, Koonin EV, Kondrashov FA (August 2002). "Selection for short introns in highly expressed genes". Nature Genetics 31 (4): 415–8.  
  28. ^ Rogozin, I. B.; Carmel, L.; Csuros, M.; Koonin, E. V. (2012). "Origin and evolution of spliceosomal introns". Biology Direct 7: 11.  
  29. ^ Derr, L. K.; Strathern, J. N. (1993). "A role for reverse transcripts in gene conversion". Nature 361 (6408): 170–173.  
  30. ^ a b c d Yenerall, P.; Zhou, L. (2012). "Identifying the mechanisms of intron gain: Progress and trends". Biology Direct 7: 29.  
  31. ^
  32. ^ Cech, T. R. (1986). "The generality of self-splicing RNA: Relationship to nuclear mRNA splicing". Cell 44 (2): 207–210.  
  33. ^ ==Taxonavigation== Species: Platycodon grandiflorus


    Platycodon grandiflorus (Jacq.) A.DC. Monogr. Campan. 125 1830.

    Homotypic Synonyms

    • Campanula grandiflora Jacq., Hort. Bot. Vindob. 3: 4 (1777). Basionym/Replaced Synonym
    • Campanula gentianoides Lam., Encycl. 1: 581 (1785), nom. superfl.

    Heterotypic Synonyms

    • Campanula glauca Thunb. in J.A.Murray, Syst. Veg. ed. 14: 211 (1784).
    • Platycodon grandiflorus var. glaucus (Thunb.) Siebold & Zucc., Abh. Math. Phys. Cl. Königl. Bayer. Akad. Wiss. 4(3): 179 (1846).
    • Platycodon autumnalis Decne., Rev. Hort., III, 2: 361 (1848).
    • Platycodon chinensis Lindl. & Paxton, Paxton's Fl. Gard. 2: 121 (185onanus H.Hara, Enum. Sperm. Jap. 2: 102 (1952).
    • Platycodon mariesii f. striatus (Stubenrauch) H.Hara, Enum. Sperm. Jap. 2: 101 (1952).
    • Platycodon grandiflorus f. monanthus (Nakai) Hyun S.Kim, Fl. Coreana 6: 97 (1976).


    Candolle, A.L.P. Pyramus de (1830) Monographie des Campanulees ... avec Vingt Planches: 125. IPNI
    Denisov, N. (2008). Addition to Vascular flora of the Kozlov island (Peter the Great Gulf, Japanese sea). Turczaninowia 11(4): 29-42.
    Lammers, T.G.. & R.Govaerts, R.(2012) World Checklist of Selected Plant Families; Platycodon grandiflorus (Jacq.) A.DC., Facilitated by the Royal Botanic Gardens, Kew. Published on the Internet; Platycodon grandiflorus Accessed 27th July 2012.
    Lammers, T.G. (2007). World checklist and bibliography of Campanulaceae: 1-675. The Board of Trustees of the Royal Botanic Gardens, Kew.
    The Plant List Version 1. (2010) Platycodon grandiflorus, Royal Botanic Gardens, Kew and Missouri Botanical Garden. Published on the Internet; Platycodon Grandiflorus. Accessed 27th July 2012.

    Vernacular names

    • Platycodon sinensis Lem., Jard. Fleur. 3: t. 250 (1853).
    • Platycodon grandiflorus var. mariesii Lynch, Garden (London) 27: 216 (1885).
    • Platycodon mariesii (Lynch) Wittm., Gartenflora 41: 655 (1892).
    • Platycodon mariesii var. albus Wittm., Gartenflora 41: 655 (1892).
    • Platycodon grandiflorus var. autumnalis (Decne.) Voss, Vilm. Blumengärtn. ed. 3, 1: 572 (1894).
    • Platycodon grandiflorus var. albus Stubenrauch in L.H.Bailey, Cycl. Amer. Hort.: 1370 (1901).
    • Platycodon grandiflorus var. japonicus Stubenrauch in L.H.Bailey, Cycl. Amer. Hort.: 1370 (1901).
    • Platycodon grandiflorus var. semiplenus Stubenrauch in L.H.Bailey, Cycl. Amer. Hort.: 1370 (1901).
    • Platycodon grandiflorus var. striatus Stubenrauch in L.H.Bailey, Cycl. Amer. Hort.: 1370 (1901).
    • Platycodon grandiflorus var. duplex Makino, Bot. Mag. (Tokyo) 22: 157 (1908).
    • Platycodon grandiflorus var. pentapetalus Makino, Bot. Mag. (Tokyo) 23: 21 (1909).
    • Platycodon glaucus (Thunb.) Nakai, Bot. Mag. (Tokyo) 38: 301 (1924).
    • Platycodon glaucus f. albus Makino, J. Jap. Bot. 3: 43 (1926).
    • Platycodon glaucus f. bicolor Makino, J. Jap. Bot. 3: 43 (1926).
    • Platycodon glaucus var. pentapetalus (Makino) Makino, J. Jap. Bot. 3: 44 (1926).
    • Platycodon glaucus f. violaceus Makino, J. Jap. Bot. 3: 44 (1926).
    • Platycodon glaucus f. albiflorus Honda, Bot. Mag. (Tokyo) 51: 858 (1937).
    • Platycodon glaucus f. subasepalus Honda, Bot. Mag. (Tokyo) 52: 517 (1938).
    • Platycodon glaucus var. monanthus Nakai, J. Jap. Bot. 15: 186 (1939).
    • Platycodon glaucus var. subasepalus (Honda) Nakai, J. Jap. Bot. 15: 186 (1939).
    • Platycodon glaucus var. planicorollatus Makino, Zissai-Engei 26: 461 (1950).
    • Platycodon glaucus var. rugosus Makino, Zissai-Engei 26: 461 (1950).
    • Platycodon grandiflorus f. albiflorus (Honda) H.Hara, Enum. Sperm. Jap. 2: 101 (1952).
    • Platycodon grandiflorus f. bicolor (Makino) H.Hara, Enum. Sperm. Jap. 2: 102 (1952).
    • Platycodon grandiflorus f. leucanthus H.Hara, Enum. Sperm. Jap. 2: 102 (1952).
    • Platycodon grandiflorus var. planicorollatus (Makino) Nakai, Enum. Sperm. Jap. 2: 102 (1952).
    • Platycodon grandiflorus var. rugosus (Makino) Nakai, Enum. Sperm. Jap. 2: 102 (1952).
    • Platycodon grandiflorus f. subasepalus (Honda) H.Hara, Enum. Sperm. Jap. 2: 101 (1952).
    • Platycodon mariesii f. alb
  34. ^ Hellsten, U.; Aspden, J. L.; Rio, D. C.; Rokhsar, D. S. (2011). "A segmental genomic duplication generates a functional intron". Nature Communications 2: 454–.  






See also

The only hypothesized mechanism of recent intron gain lacking any direct evidence is that of group II intron insertion, which when demonstrated in vivo, abolishes gene expression.[31] Group II introns are therefore likely the presumed ancestors of spliceosomal introns, acting as site-specific retroelements, and are no longer responsible for intron gain.[32][33] Tandem genomic duplication is the only proposed mechanism with supporting in vivo experimental evidence: a short intragenic tandem duplication can insert a novel intron into a protein-coding gene, leaving the corresponding peptide sequence unchanged.[34] This mechanism also has extensive indirect evidence lending support to the idea that tandem genomic duplication is a prevalent mechanism for intron gain. The testing of other proposed mechanisms in vivo, particularly intron gain during DSBR, intron transfer, and intronization, is possible, although these mechanisms must be demonstrated in vivo to solidify them as actual mechanisms of intron gain. Further genomic analyses, especially when executed at the population level, may then quantify the relative contribution of each mechanism, possibly identifying species-specific biases that may shed light on varied rates of intron gain amongst different species.[30]

Intron transfer has been hypothesized to result in intron gain when a paralog or pseudogene gains an intron and then transfers this intron via recombination to an intron-absent location in its sister paralog. Intronization is the process by which mutations create novel introns from formerly exonic sequence. Thus, unlike other proposed mechanisms of intron gain, this mechanism does not require the insertion or generation of DNA to create a novel intron.[30]

In intron transposition, the most commonly purported intron gain mechanism, a spliced intron is thought to reverse splice into either its own mRNA or another mRNA at a previously intron-less position. This intron-containing mRNA is then reverse transcribed and the resulting intron-containing cDNA may then cause intron gain via complete or partial recombination with its original genomic locus. Transposon insertions can also result in intron creation. Such an insertion could intronize the transposon without disrupting the coding sequence when a transposon inserts into the sequence AGGT, resulting in the duplication of this sequence on each side of the transposon. It is not yet understood why these elements are spliced, whether by chance, or by some preferential action by the transposon. In tandem genomic duplication, due to the similarity between consensus donor and acceptor splice sites, which both closely resemble AGGT, the tandem genomic duplication of an exonic segment harboring an AGGT sequence generates two potential splice sites. When recognized by the spliceosome, the sequence between the original and duplicated AGGT will be spliced, resulting in the creation of an intron without alteration of the coding sequence of the gene. Double-stranded break repair via non-homologous end joining was recently identified as a source of intron gain when researchers identified short direct repeats flanking 43% of gained introns in Daphnia.[30] These numbers must be compared to the number of conserved introns flanked by repeats in other organisms, though, for statistical relevance. For group II intron insertion, the retrohoming of a group II intron into a nuclear gene was proposed to cause recent spliceosomal intron gain.

Introns may be lost or gained over evolutionary time, as shown by many comparative studies of orthologous genes. Subsequent analyses have identified thousands of examples of intron loss and gain events, and it has been proposed that the emergence of eukaryotes, or the initial stages of eukaryotic evolution, involved an intron invasion.[28] Two definitive mechanisms of intron loss, Reverse Transcriptase-Mediated Intron Loss (RTMIL) and genomic deletions, have been identified, and are known to occur.[29] The definitive mechanisms of intron gain, however, remain elusive and controversial. At least seven mechanisms of intron gain have been reported thus far: Intron Transposition, Transposon Insertion, Tandem Genomic Duplication, Intron Transfer, Intron Gain during Double-Strand Break Repair (DSBR), Insertion of a Group II Intron, and Intronization. In theory it should be easiest to deduce the origin of recently gained introns due to the lack of host-induced mutations, yet even introns gained recently did not arise from any of the aforementioned mechanisms. These findings thus raise the question of whether or not the proposed mechanisms of intron gain fail to describe the mechanistic origin of many novel introns because they are not accurate mechanisms of intron gain, or if there are other, yet to be discovered, processes generating novel introns.[30]

Introns as mobile genetic elements

Introns contain several short sequences that are important for efficient splicing, such as acceptor and donor sites at either end of the intron as well as a branch point site, which are required for proper splicing by the spliceosome. Some introns are known to enhance the expression of the gene that they are contained in by a process known as intron-mediated enhancement (IME).

Alternative splicing of introns within a gene acts to introduce greater variability of protein sequences translated from a single gene, allowing multiple related proteins to be generated from a single gene and a single precursor mRNA transcript. The control of alternative RNA splicing is performed by a complex network of signaling molecules that respond to a wide range of intracellular and extracellular signals.

Early studies of genomic DNA sequences from a wide range of organisms show that the intron-exon structure of homologous genes in different organisms can vary widely.[20] More recent studies of entire eukaryotic genomes have now shown that the lengths and density (introns/gene) of introns varies considerably between related species. For example, while the human genome contains an average of 8.4 introns/gene (139,418 in the genome), the unicellular fungus Encephalitozoon cuniculi contains only 0.0075 introns/gene (15 introns in the genome).[21] Since eukaryotes arose from a common ancestor (Common descent), there must have been extensive gain or loss of introns during evolutionary time.[22][23] This process is thought to be subject to selection, with a tendency towards intron gain in larger species due to their smaller population sizes, and the converse in smaller (particularly unicellular) species.[24] Biological factors also influence which genes in a genome lose or accumulate introns.[25][26][27]

The biological origins of introns are obscure. After the initial discovery of introns in protein-coding genes of the eukaryotic nucleus, there was significant debate as to whether introns in modern-day organisms were inherited from a common ancient ancestor (termed the introns-early hypothesis), or whether they appeared in genes rather recently in the evolutionary process (termed the introns-late hypothesis). Another theory is that the spliceosome and the intron-exon structure of genes is a relic of the RNA world (the introns-first hypothesis).[19] There is still considerable debate about the extent to which of these hypotheses is most correct. The popular consensus at the moment is that introns arose within the eukaryote lineage as selfish elements .

As a first approximation, it is possible to view introns as unimportant sequences whose only function is to be removed from an unspliced precursor RNA in order to generate the functional mRNA, rRNA or tRNA product. However, it is now well-established that some introns themselves encode specific proteins or can be further processed after splicing to generate noncoding RNA molecules.[17] Alternative splicing is widely used to generate multiple proteins from a single gene. Furthermore, some introns represent mobile genetic elements and may be regarded as examples of selfish DNA.[18]

Biological functions and evolution

Transfer RNA introns that depend upon proteins for removal occur at a specific location within the anticodon loop of unspliced tRNA precursors, and are removed by a tRNA splicing endonuclease. The exons are then linked together by a second protein, the tRNA splicing ligase.[15] Note that self-splicing introns are also sometimes found within tRNA genes.[16]

, that is, the intron-containing RNA molecule can rearrange its own covalent structure so as to precisely remove the intron and link the exons together in the correct order. In some cases, particular intron-binding proteins are involved in splicing, acting in such a way that they assist the intron in folding into the three-dimensional structure that is necessary for self-splicing activity. Group I and group II introns are distinguished by different sets of internal conserved sequences and folded structures, and by the fact that splicing of RNA molecules containing group II introns generates branched introns (like those of spliceosomal RNAs), while group I introns use a non-encoded guanosine nucleotide (typically GTP) to initiate splicing, adding it on to the 5'-end of the excised intron. self-splicing. These complex architectures allow some group I and group II introns to be three-dimensional architecture Following transcription into RNA, group I and group II introns also make extensive internal interactions that allow them to fold into a specific, complex [14][13]