Reading frame

In molecular biology, a reading frame is a way of dividing the sequence of nucleotides in a nucleic acid (DNA or RNA) molecule into a set of consecutive, non-overlapping triplets. Where these triplets equate to amino acids or stop signals during translation, they are called codons.

A single strand of a nucleic acid molecule has a phosphoryl end, called the 5′-end, and a hydroxyl or 3′-end. These define the 5′→3′ direction. There are three reading frames that can be read in this 5′→3′ direction, each beginning from a different nucleotide in a triplet. In a double stranded nucleic acid, an additional three reading frames may be read from the other, complementary strand in the 5′→3′ direction along this strand. As the two strands of a double-stranded nucleic acid molecule are antiparallel, the 5′→3′ direction on the second strand corresponds to the 3′→5′ direction along the first strand.^[1]^[2]

In general, at the most, one reading frame in a given section of a nucleic acid, is biologically relevant (open reading frame). Some viral transcripts can be translated using multiple, overlapping reading frames. There is one known example of overlapping reading frames in mammalian mitochondrial DNA: coding portions of genes for 2 subunits of ATPase overlap.

Genetic code

DNA encodes protein sequence by a series of three-nucleotide codons. Any given sequence of DNA can therefore be read in six different ways: Three reading frames in one direction (starting at different nucleotides) and three in the opposite direction. During transcription, the RNA polymerase read the template DNA strand in the 3′→5′ direction, but the mRNA is formed in the 5′ to 3′ direction.^[3] The mRNA is single-stranded and therefore only contains three possible reading frames, of which only one is translated. The codons of the mRNA reading frame are translated in the 5′→3′ direction into amino acids by a ribosome to produce a polypeptide chain.

Open reading frame

An open reading frame (ORF) is a reading frame that has the potential to be transcribed into RNA and translated into protein. It requires a continuous sequence of DNA which may include a start codon, through a subsequent region which has a length that is a multiple of 3 nucleotides, to a stop codon in the same reading frame.^[4]

When a putative amino acid sequence resulting from the translation of an ORF remained unknown in mitochondrial and chloroplast genomes, the corresponding open reading frame was called an unidentified reading frame (URF). For example, the MT-ATP8 gene was first described as URF A6L when the complete human mitochondrial genome was sequenced.^[5]

Multiple reading frames

The usage of multiple reading frames leads to the possibility of overlapping genes; there may be many of these in viral, prokaryote, and mitochondrial genomes.^[6] Some viruses, e.g. hepatitis B virus and BYDV, use several overlapping genes in different reading frames.

In rare cases, a ribosome may shift from one frame to another during translation of an mRNA (translational frameshift). This causes the first part of the mRNA to be translated in one reading frame, and the latter part to be translated in a different reading frame. This is distinct from a frameshift mutation, as the nucleotide sequence (DNA or RNA) is not altered—only the frame in which it is read.

References

^ Rainey S, Repka J. "Quantitative sequence and open reading frame analysis based on codon bias" (PDF). Systemics, Cybernetics and Informatics. 4 (1): 65–72.
^ Badger JH, Olsen GJ (April 1999). "CRITICA: Coding Region Identification Tool Invoking Comparative Analysis". Mol Biol Evol. 16 (4): 512–24. doi:10.1093/oxfordjournals.molbev.a026133. PMID 10331277.
^ Lodish (2007). Molecular Cell Biology (6th ed.). W. H. Freeman. p. 121. ISBN 978-1429203142.
^ Benjamin C. Pierce (2012). Genetics: a conceptual approach. W. H. Freeman. ISBN 9781429232500.
^ Anderson S, Bankier AT, Barrell BG, de Bruijn MH, Coulson AR, Drouin J, Eperon IC, Nierlich DP, Roe BA, Sanger F, Schreier PH, Smith AJ, Staden R, Young IG (April 1981). "Sequence and organization of the human mitochondrial genome". Nature. 290 (5806): 457–65. Bibcode:1981Natur.290..457A. doi:10.1038/290457a0. PMID 7219534. S2CID 4355527.
^ Johnson Z, Chisholm S (2004). "Properties of overlapping genes are conserved across microbial genomes". Genome Res. 14 (11): 2268–72. doi:10.1101/gr.2433104. PMC 525685. PMID 15520290.

[1] Rainey S, Repka J. "Quantitative sequence and open reading frame analysis based on codon bias" (PDF). Systemics, Cybernetics and Informatics. 4 (1): 65–72.

[pmid10331277-2] Badger JH, Olsen GJ (April 1999). "CRITICA: Coding Region Identification Tool Invoking Comparative Analysis". Mol Biol Evol. 16 (4): 512–24. doi:10.1093/oxfordjournals.molbev.a026133. PMID 10331277.

[3] Lodish (2007). Molecular Cell Biology (6th ed.). W. H. Freeman. p. 121. ISBN 978-1429203142.

[pierce2012b12-4] Benjamin C. Pierce (2012). Genetics: a conceptual approach. W. H. Freeman. ISBN 9781429232500.

[pmid7219534-5] Anderson S, Bankier AT, Barrell BG, de Bruijn MH, Coulson AR, Drouin J, Eperon IC, Nierlich DP, Roe BA, Sanger F, Schreier PH, Smith AJ, Staden R, Young IG (April 1981). "Sequence and organization of the human mitochondrial genome". Nature. 290 (5806): 457–65. Bibcode:1981Natur.290..457A. doi:10.1038/290457a0. PMID 7219534. S2CID 4355527.

[Johnson_2004-6] Johnson Z, Chisholm S (2004). "Properties of overlapping genes are conserved across microbial genomes". Genome Res. 14 (11): 2268–72. doi:10.1101/gr.2433104. PMC 525685. PMID 15520290.

[1]

[2]

[3]

[4]

[5]

[6]

Reading frame

Genetic code

Open reading frame

Multiple reading frames

See also

References