Acknowledgements
This work was supported by (BD2K) the National Human Genome Research Institute of the National Institutes of Health award no. 5U54HG007990 and (Cloud Pilot) the National Cancer Institute of the National Institutes of Health under the Broad Institute subaward no. 5417071-5500000716. The UCSC Genome Browser work was supported by the NHGRI award 5U41HG002371 (Corporate Sponsors). The content is solely the responsibility of the authors and does not necessarily represent the official views of the National Institutes of Health or our corporate sponsors.
Ethics declarations
Competing interests
The authors received support from AWS, Microsoft, and Google.
Supplementary information
Supplementary Figures and Texts
Supplementary Notes 1–9, Supplementary Figures 1–4 and Supplementary Table 1 (PDF 2647 kb)
About this article
Cite this article
Vivian, J., Rao, A., Nothaft, F. et al. Toil enables reproducible, open source, big biomedical data analyses. Nat Biotechnol 35, 314–316 (2017). https://doi.org/10.1038/nbt.3772
DOI: https://doi.org/10.1038/nbt.3772