Thanks to visit codestin.com
Credit goes to rnajournal.cshlp.org

HAMR: high-throughput annotation of modified ribonucleotides

  1. Li-San Wang1,2,5,6,8
  1. 1Genomics and Computational Biology Graduate Group, University of Pennsylvania, Philadelphia, Pennsylvania 19104, USA
  2. 2Department of Pathology and Laboratory Medicine, University of Pennsylvania, Philadelphia, Pennsylvania 19104, USA
  3. 3Cell and Molecular Biology Graduate Group, University of Pennsylvania, Philadelphia, Pennsylvania 19104, USA
  4. 4Department of Biology, University of Pennsylvania, Philadelphia, Pennsylvania 19104, USA
  5. 5Penn Genome Frontiers Institute, University of Pennsylvania, Philadelphia, Pennsylvania 19104, USA
  6. 6Institute for Biomedical Informatics, University of Pennsylvania, Philadelphia, Pennsylvania 19104, USA
    1. 7 These authors contributed equally to this work.

    Abstract

    RNA is often altered post-transcriptionally by the covalent modification of particular nucleotides; these modifications are known to modulate the structure and activity of their host RNAs. The recent discovery that an RNA methyl-6 adenosine demethylase (FTO) is a risk gene in obesity has brought to light the significance of RNA modifications to human biology. These noncanonical nucleotides, when converted to cDNA in the course of RNA sequencing, can produce sequence patterns that are distinguishable from simple base-calling errors. To determine whether these modifications can be detected in RNA sequencing data, we developed a method that can not only locate these modifications transcriptome-wide with single nucleotide resolution, but can also differentiate between different classes of modifications. Using small RNA-seq data we were able to detect 92% of all known human tRNA modification sites that are predicted to affect RT activity. We also found that different modifications produce distinct patterns of cDNA sequence, allowing us to differentiate between two classes of adenosine and two classes of guanine modifications with 98% and 79% accuracy, respectively. To show the robustness of this method to sample preparation and sequencing methods, as well as to organismal diversity, we applied it to a publicly available yeast data set and achieved similar levels of accuracy. We also experimentally validated two novel and one known 3-methylcytosine (3mC) sites predicted by HAMR in human tRNAs. Researchers can now use our method to identify and characterize RNA modifications using only RNA-seq data, both retrospectively and when asking questions specifically about modified RNA.

    Keywords

    Footnotes

    • 8 Corresponding authors

      E-mail bdgregor{at}sas.upenn.edu

      E-mail lswang{at}mail.med.upenn.edu

    • Received October 10, 2012.
    • Accepted August 23, 2013.

    This article is distributed exclusively by the RNA Society for the first 12 months after the full-issue publication date (see http://rnajournal.cshlp.org/site/misc/terms.xhtml). After 12 months, it is available under a Creative Commons License (Attribution-NonCommercial 3.0 Unported), as described at http://creativecommons.org/licenses/by-nc/3.0/.

    | Table of Contents