Species identification via Nanopore sequencing of barcoded amplicons
NanoAmp was tested on Linux but should also work on macOS. The dependencies can be installed in a separate conda environment with the following command:
conda create -n nanoamp -c bioconda -c conda-forge canu=2.2 parallel medaka=1.6.0 longshot tabix perl-vcftools-vcf bbmap ucsc-blat
conda activate nanoamp
Afterwards, this repository must be cloned:
git clone https://github.com/edgardomortiz/NanoAmp
Finally, add the directory where you cloned NanoAmp to your system $PATH.
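As a minimal sketch of the $PATH step, assuming the repository was cloned into your home directory (the location `$HOME/NanoAmp` and the variable name `NANOAMP_DIR` are illustrative, not part of NanoAmp):

```shell
# Hypothetical clone location; replace with the directory where you cloned NanoAmp.
NANOAMP_DIR="$HOME/NanoAmp"

# Make the NanoAmp scripts visible in the current shell session.
export PATH="$PATH:$NANOAMP_DIR"

# To keep the change across sessions, the same export line can be appended
# to your shell startup file (for example ~/.bashrc when using bash):
# echo 'export PATH="$PATH:$HOME/NanoAmp"' >> ~/.bashrc
```

The directory is appended rather than prepended so that it does not shadow any tool already on your $PATH.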
All steps can be run with a single command:
nanoamp.sh RUN_NAME CORES
Where RUN_NAME is the name of the sequencing run, as assigned in MinKNOW, and CORES is the number of processors to use.
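For illustration, a call might look like the sketch below. The run name `my_run` is hypothetical, and using every available processor is only one possible choice for CORES; `getconf _NPROCESSORS_ONLN` is used here to query the processor count.

```shell
# Hypothetical run name; use the name assigned to your run in MinKNOW.
RUN_NAME="my_run"

# Number of processors currently online (an assumption: use all of them).
CORES="$(getconf _NPROCESSORS_ONLN)"

# Run the full pipeline only if nanoamp.sh can be found in $PATH.
if command -v nanoamp.sh >/dev/null 2>&1; then
    nanoamp.sh "$RUN_NAME" "$CORES"
else
    echo "nanoamp.sh not found in PATH" >&2
fi
```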
However, if you want to run the steps manually, the order of execution is:
- Basecalling:
basecall_demultiplex_guppy.sh
- Quality filtering:
filter_quality_bbduk.sh
- Assembly:
assemble_canu.sh
- Polish assembly:
polish_raw_medaka.sh
- Separate haplotypes:
haplotype_polished_longshot.sh
- Identify samples:
match_db.py
For help on a given script, run it without any arguments.
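A small sketch of how the help text of every step could be printed in pipeline order, relying only on the behavior described above (each script shows its help when run without arguments). The variable name `STEPS` is illustrative, and the loop assumes the scripts are on your $PATH; missing ones are reported and skipped.

```shell
# Pipeline scripts in their order of execution.
STEPS="basecall_demultiplex_guppy.sh filter_quality_bbduk.sh assemble_canu.sh polish_raw_medaka.sh haplotype_polished_longshot.sh match_db.py"

for script in $STEPS; do
    echo "== $script =="
    if command -v "$script" >/dev/null 2>&1; then
        # Run without arguments to show the help; ignore the exit status,
        # since a help message may be returned with a non-zero code.
        "$script" || true
    else
        echo "not found in PATH" >&2
    fi
done
```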