Thanks to visit codestin.com
Credit goes to www.nature.com

Skip to main content

Thank you for visiting nature.com. You are using a browser version with limited support for CSS. To obtain the best experience, we recommend you use a more up to date browser (or turn off compatibility mode in Internet Explorer). In the meantime, to ensure continued support, we are displaying the site without styles and JavaScript.

  • Article
  • Published:

Hotspots of aberrant epigenomic reprogramming in human induced pluripotent stem cells

A Corrigendum to this article was published on 01 October 2014

Abstract

Induced pluripotent stem cells (iPSCs) offer immense potential for regenerative medicine and studies of disease and development. Somatic cell reprogramming involves epigenomic reconfiguration, conferring iPSCs with characteristics similar to embryonic stem (ES) cells. However, it remains unknown how complete the reestablishment of ES-cell-like DNA methylation patterns is throughout the genome. Here we report the first whole-genome profiles of DNA methylation at single-base resolution in five human iPSC lines, along with methylomes of ES cells, somatic cells, and differentiated iPSCs and ES cells. iPSCs show significant reprogramming variability, including somatic memory and aberrant reprogramming of DNA methylation. iPSCs share megabase-scale differentially methylated regions proximal to centromeres and telomeres that display incomplete reprogramming of non-CG methylation, and differences in CG methylation and histone modifications. Lastly, differentiation of iPSCs into trophoblast cells revealed that errors in reprogramming CG methylation are transmitted at a high frequency, providing an iPSC reprogramming signature that is maintained after differentiation.

This is a preview of subscription content, access via your institution

Access options

Buy this article

Prices may be subject to local taxes which are calculated during checkout

Figure 1: Global trends of human iPSC and ES cell DNA methylomes.
Figure 2: Partially methylated domains become highly methylated on induction of pluripotency.
Figure 3: CG-DMRs identified between pluripotent cells.
Figure 4: Characterization of CG-DMRs in iPSCs.
Figure 5: Failure to restore megabase-scale regions of non-CG methylation is a hallmark of iPSC reprogramming.

Similar content being viewed by others

Accession codes

Primary accessions

Sequence Read Archive

Data deposits

Analysed datasets can be browsed and downloaded from http://neomorph.salk.edu/ips_methylomes. Sequence data for MethylC-Seq, RNA-Seq and Chip-Seq experiments have been submitted to the NCBI SRA database under the accession numbers SRA023829.2 and SRP000941.

References

  1. Soldner, F. et al. Parkinson’s disease patient-derived induced pluripotent stem cells free of viral reprogramming factors. Cell 136, 964–977 (2009)

    Article  CAS  Google Scholar 

  2. Yamanaka, S. A fresh look at iPS cells. Cell 137, 13–17 (2009)

    Article  CAS  Google Scholar 

  3. Takahashi, K. & Yamanaka, S. Induction of pluripotent stem cells from mouse embryonic and adult fibroblast cultures by defined factors. Cell 126, 663–676 (2006)

    Article  CAS  Google Scholar 

  4. Takahashi, K. et al. Induction of pluripotent stem cells from adult human fibroblasts by defined factors. Cell 131, 861–872 (2007)

    Article  CAS  Google Scholar 

  5. Yu, J. et al. Induced pluripotent stem cell lines derived from human somatic cells. Science 318, 1917–1920 (2007)

    Article  ADS  CAS  Google Scholar 

  6. Park, I. et al. Reprogramming of human somatic cells to pluripotency with defined factors. Nature 451, 141–146 (2008)

    Article  ADS  CAS  Google Scholar 

  7. Yu, J. et al. Human induced pluripotent stem cells free of vector and transgene sequences. Science 324, 797–801 (2009)

    Article  ADS  CAS  Google Scholar 

  8. Zhao, X. Y. et al. iPS cells produce viable mice through tetraploid complementation. Nature 461, 86–90 (2009)

    Article  ADS  CAS  Google Scholar 

  9. Boland, M. J. et al. Adult mice generated from induced pluripotent stem cells. Nature 461, 91–94 (2009)

    Article  ADS  CAS  Google Scholar 

  10. Guenther, M. G. et al. Chromatin structure and gene expression programs of human embryonic and induced pluripotent stem cells. Cell Stem Cell 7, 249–257 (2010)

    Article  CAS  Google Scholar 

  11. Deng, J. et al. Targeted bisulfite sequencing reveals changes in DNA methylation associated with nuclear reprogramming. Nature Biotechnol. 27, 353–360 (2009)

    Article  CAS  Google Scholar 

  12. Doi, A. et al. Differential methylation of tissue- and cancer-specific CpG island shores distinguishes human induced pluripotent stem cells, embryonic stem cells and fibroblasts. Nature Genet. 41, 1350–1353 (2009)

    Article  CAS  Google Scholar 

  13. Kim, K. et al. Epigenetic memory in induced pluripotent stem cells. Nature 467, 285–290 (2010)

    Article  ADS  CAS  Google Scholar 

  14. Polo, J. M. et al. Cell type of origin influences the molecular and functional properties of mouse induced pluripotent stem cells. Nature Biotechnol. 28, 848–855 (2010)

    Article  CAS  Google Scholar 

  15. Stadtfeld, M. et al. Aberrant silencing of imprinted genes on chromosome 12qF1 in mouse induced pluripotent stem cells. Nature 465, 175–181 (2010)

    Article  ADS  CAS  Google Scholar 

  16. Miura, K. et al. Variation in the safety of induced pluripotent stem cell lines. Nature Biotechnol. 27, 743–745 (2009)

    Article  CAS  Google Scholar 

  17. Hu, B. Y. et al. Neural differentiation of human induced pluripotent stem cells follows developmental principles but with variable potency. Proc. Natl Acad. Sci. USA 107, 4335–4340 (2010)

    Article  ADS  CAS  Google Scholar 

  18. Lister, R. et al. Human DNA methylomes at base resolution show widespread epigenomic differences. Nature 462, 315–322 (2009)

    Article  ADS  CAS  Google Scholar 

  19. Sugii, S. et al. Human and mouse adipose-derived cells support feeder-independent induction of pluripotent stem cells. Proc. Natl Acad. Sci. USA 108, 3558–3563 (2010)

    Article  ADS  Google Scholar 

  20. Daley, G. et al. Broader implications of defining standards for the pluripotency of iPSCs. Cell Stem Cell 4, 200–201 (2009)

    Article  CAS  Google Scholar 

  21. Xu, R. H. et al. BMP4 initiates human embryonic stem cell differentiation to trophoblast. Nature Biotechnol. 20, 1261–1264 (2002)

    Article  CAS  Google Scholar 

  22. Cedar, H. & Bergman, Y. Linking DNA methylation and histone modification: patterns and paradigms. Nature Rev. Genet. 10, 295–304 (2009)

    Article  CAS  Google Scholar 

  23. Chodavarapu, R. K. et al. Relationship between nucleosome positioning and DNA methylation. Nature 466, 388–392 (2010)

    Article  ADS  CAS  Google Scholar 

  24. Laurent, L. et al. Dynamic changes in the human methylome during differentiation. Genome Res. 20, 320–331 (2010)

    Article  CAS  Google Scholar 

  25. Hawkins, R. D. et al. Distinct epigenomic landscapes of pluripotent and lineage-committed human cells. Cell Stem Cell 6, 479–491 (2010)

    Article  CAS  Google Scholar 

  26. Erhardt, A. et al. TMEM132D, a new candidate for anxiety phenotypes: evidence from human and mouse studies. Mol. Psychiatry advance online publication. 10.1038/mp.2010.4 (6 April 2010)

  27. Yilmaz, G., Alexander, J. S., Erkuran Yilmaz, C. & Granger, D. N. Induction of neuro-protective/regenerative genes in stem cells infiltrating post-ischemic brain tissue. Exp. Transl. Stroke Med. 2, 11 (2010)

    Article  Google Scholar 

  28. Li, M. Z. et al. Molecular mapping of developing dorsal horn-enriched genes by microarray and dorsal/ventral subtractive screening. Dev. Biol. 292, 555–564 (2006)

    Article  CAS  Google Scholar 

  29. Chin, M. H. et al. Induced pluripotent stem cells and embryonic stem cells are distinguished by gene expression signatures. Cell Stem Cell 5, 111–123 (2009)

    Article  CAS  Google Scholar 

  30. Langmead, B., Trapnell, C., Pop, M. & Salzberg, S. L. Ultrafast and memory-efficient alignment of short DNA sequences to the human genome. Genome Biol. 10, R25 (2009)

    Article  Google Scholar 

  31. Trapnell, C., Pachter, L. & Salzberg, S. L. TopHat: discovering splice junctions with RNA-Seq. Bioinformatics 25, 1105–1111 (2009)

    Article  CAS  Google Scholar 

  32. Trapnell, C. et al. Transcript assembly and quantification by RNA-Seq reveals unannotated transcripts and isoform switching during cell differentiation. Nature Biotechnol. 28, 511–515 (2010)

    Article  CAS  Google Scholar 

  33. Ludwig, T. et al. Feeder-independent culture of human embryonic stem cells. Nature Methods 3, 637–646 (2006)

    Article  CAS  Google Scholar 

  34. Ludwig, T. et al. Derivation of human embryonic stem cells in defined conditions. Nature Biotechnol. 24, 185–187 (2006)

    Article  CAS  Google Scholar 

  35. O’Malley, R. C., Alonso, J. M., Kim, C. J., Leisse, T. J. & Ecker, J. R. An adapter ligation-mediated PCR method for high-throughput mapping of T-DNA inserts in the Arabidopsis genome. Nature Protocols 2, 2910–2917 (2007)

    Article  Google Scholar 

  36. Chen, P., Cokus, S. J. & Pellegrini, M. B. S. Seeker: precise mapping for bisulfite sequencing. BMC Bioinformatics 11, 203 (2010)

    Article  CAS  Google Scholar 

  37. Lister, R. et al. Highly integrated single-base resolution maps of the epigenome in Arabidopsis. Cell 133, 523–536 (2008)

    Article  CAS  Google Scholar 

Download references

Acknowledgements

We thank L. Zhang and G. Schroth for assistance with MethylC-Seq library sequencing. R.L. is supported by a California Institute for Regenerative Medicine Training Grant. M.P. is supported by a Catharina Foundation postdoctoral fellowship. R.D.H. is supported by an American Cancer Society Postdoctoral Fellowship. Y.K. is supported by the Japan Society for the Promotion of Science. This work was supported by grants from the following: Mary K. Chapman Foundation, the National Science Foundation (NSF) (NSF 0726408), the National Institutes of Health (NIH) (U01 ES017166, U01 1U01ES017166-01, DK062434), the California Institute for Regenerative Medicine (RB2-01530), the Morgridge Institute for Research and the Howard Hughes Medical Institute. We thank the NIH Roadmap Reference Epigenome Consortium (http://www.roadmapepigenomics.org/). This study was carried out as part of the NIH Roadmap Epigenomics Program.

Author information

Authors and Affiliations

Authors

Contributions

Experiments were designed by R.L., J.R.E., R.M.E., B.R., J.A.T., Y.S.K., R.Y., M.D. and R.D.H. Cells were grown by J.A.-B. and Y.S.K. MethylC-Seq and RNA-Seq experiments were conducted by R.L. and J.R.N. ChIP-Seq experiments were conducted by R.D.H. ChIP-Seq data analysis was performed by G.H., S.K. and R.D.H. Retroviral insertion site localization experiments were performed by R.O’M. and R.C. Sequencing data processing was performed by R.L. and G.H. Bioinformatic and statistical analyses were conducted by M.P., R.L. and G.H. R.S. performed data interpretation analyses. The manuscript was prepared by R.L., M.P. and J.R.E.

Corresponding author

Correspondence to Joseph R. Ecker.

Ethics declarations

Competing interests

The authors declare no competing financial interests.

Supplementary information

Supplementary Figures

The file contains Supplementary Figures 1-17 with legends (PDF 9437 kb)

Supplementary Tables

The file contains Supplementary Tables 1-7 (XLS 178 kb)

PowerPoint slides

Rights and permissions

Reprints and permissions

About this article

Cite this article

Lister, R., Pelizzola, M., Kida, Y. et al. Hotspots of aberrant epigenomic reprogramming in human induced pluripotent stem cells. Nature 471, 68–73 (2011). https://doi.org/10.1038/nature09798

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue date:

  • DOI: https://doi.org/10.1038/nature09798

This article is cited by

Comments

Commenting on this article is now closed.

  1. THE COMMENT BELOW WAS SUBMITTED TO NATURE FOR BRIEF COMMENTS ARISING, BUT DECLINED FOR PUBLICATION DUE TO TECHNICAL CONCERNS. HENCE, I POST IT HERE.

    Fictive replicates in epigenomic analysis

    Arising from Lister et al. Nature 471, 68-73 (2011)

    For genome sequencing studies on cell lines reseachers might be tempted to not spend resources on replication, given the presumably small variation between replicates. Replication, however, is an important condition for any statistical inference [need reference]. Such replicates are absent for the comparisons of epigenomes of several embryonic stem (ES) cell lines and induced pluripotent stem cell (iPSC) lines in Lister et al. Yet, the authors make ubiquitous use of (adjusted) p-values for the detection of differentially methylated regions (DMRs): Lister et al. create fictive replicates by scanning the genome using 10 consecutive genomic windows as independent samples and computing (adjusted) p-values from those ?replicates?. We argue that such practice is likely to lead to many false discoveries.

    Lister et al. perform a number of comparisons for both CG and non-CG methylation. We focus on the comparison between the ES cell line H1 to an iPSC line, ADS-iPSC , but the arguments are equally valid for other comparisons. Lister et al. first smooth the methylation levels across the genome, then average methylation levels within windows of 100 base pairs (5000 for non-CG) and compare nW = 10 consecutive windows using a Wilcoxon two-sample test. Resulting p-values are adjusted using Benjamini- Hochberg False Discovery Rate (BH- FDR). The author?s choice nW = 10 immediately raises the question: why not 5 consecutive windows, or 20? In addition, it seems evident that observations from consecutive genomic windows are inherently dependent (correlated), in particular after smoothing, which is similar to computing averages for overlapping windows. Smoothing is useful for visualization, but is certainly not appropriate when genomic windows are used as ?replicates?. The resulting underestimated variation between ?replicates? leads to too optimistic (adjusted) p-values. For example, if the standard deviation (sd) is underestimated by a factor 2 with respect to truly independent replicates, Wilcoxon p-values (based on 2x10 fictive replicates) smaller than 0.01 are 10-100 fold too low with respect to those corresponding to true replicates. Given that BH-FDR adjusted p-values are proportional to raw p-values, the bias of the reported FDR may be huge.

    Following the Methods in Lister et al. we re-analyzed the data using nW= 5, 10 and 20 consecutive windows and applied the analysis also to the non-smoothed data for nW = 10. The results (Table 1) justify two conclusions: a) the fictive sample size has a large effect on the coverage of the detected DMRs, and for CG and CHH no DMRs are detected for nW= 5; b) without smoothing no DMRs are detected when using nW=10. Given that smoothing is inappropriate here and the sample size is fictive why should we then believe that the reported FDR = 0.01 by Lister et al. is indeed anywhere near 1%? The true proportion of false discoveries is likely to be several orders of magnitudes larger.

    Lister et al. use additional fold change criteria, which may prevent some degree of over-selection. Note, however, that the applied criterion varies tremendously between comparisons: from 2-fold to 8-fold. Given the absence of a biological reason for this difference, one can only speculate that the fold change criterion is used to produce a list of DMRs that is of a convenient size.
    The consequences of the methodological errors are two-fold. First, many of the DMRs will not validate in independent studies. Some of these may have been validated in other studies, but note that invalidated results are often not published [add reference]. Second, others may be tempted to use the same analysis for other studies. We hope to prevent the latter and emphasize that there is only one solution for obtaining proper p-values: replication, replication, replication.

    Table 1
    nW=5 nW =10 nW =20 nW =10
    Smoothed&#009Yes Yes Yes No
    CG # &#009 0&#009 902918&#009 825187 &#009 0
    CG %&#009 0&#009 31.4&#009 57.4&#009 0
    CHH #&#009 0&#009 50823&#009 25453&#009 0
    CHH %&#009 0&#009 88.4&#009 88.5&#009 0
    CHG #&#009 98214&#009 52229&#009 26142&#009 0
    CHG %&#009 85.4&#009 90.8&#009 90.9&#009 0
    Number (#) of detected DMRs and genomic coverage of those (%) for CG, CHH and CHG methylation

Search

Quick links

Nature Briefing

Sign up for the Nature Briefing newsletter — what matters in science, free to your inbox daily.

Get the most important science stories of the day, free in your inbox. Sign up for Nature Briefing