Custom nf-core/nanoseq pipeline for Oxford Nanopore long-read sequencing, adapted for the UPWASZAK lab at EPFL: https://www.epfl.ch/labs/upwaszak

🔧 How to Run the Pipeline

We provide four wrapper scripts for common use cases:

1. `run_pipeline.sh`

- Runs basecalling locally on a GPU-enabled machine (BMO).
- Transfers the data and the pipeline to a remote server (TREX) via rsync.
- SSHes into TREX, starts a tmux session, and runs the remaining steps (alignment, variant calling, annotation, etc.).
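
The transfer and remote launch described above can be pictured with the following sketch. It is not the contents of `run_pipeline.sh`: the helpers `run` and `handoff`, the tmux session name, and the remote command `./remote_steps.sh` are hypothetical names chosen for illustration. By default the sketch only prints the commands so they can be reviewed first.

```shell
#!/usr/bin/env bash
# Sketch only, not the contents of run_pipeline.sh.
# run, handoff, the session name and ./remote_steps.sh are hypothetical.

# Print the command when DRY_RUN=1 (the default here), otherwise execute it.
run() {
    if [ "${DRY_RUN:-1}" = "1" ]; then
        printf '%s\n' "$*"
    else
        "$@"
    fi
}

# handoff USER HOST DEST SESSION REMOTE_CMD PATH...
handoff() {
    local user="$1" host="$2" dest="$3" session="$4" remote_cmd="$5"
    shift 5
    # -a keeps permissions and timestamps, -z compresses data in transit.
    run rsync -az "$@" "${user}@${host}:${dest}/"
    # A detached tmux session keeps the job alive after the SSH connection closes.
    run ssh "${user}@${host}" \
        "tmux new-session -d -s ${session} 'cd ${dest} && ${remote_cmd}'"
}

DRY_RUN=1 handoff "${USER:-user}" "upwaszaksrv1.epfl.ch" "sampleName" \
    "nanoseq_sampleName" "./remote_steps.sh" \
    "basecalled.fastq.gz" "EPFL-nanoseq"
```

Setting `DRY_RUN=0` would execute the same two commands instead of printing them. Running the remote part inside tmux is what lets you disconnect from TREX while the remaining steps continue.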

2. `run_pipeline_local.sh`

Runs the full pipeline locally (basecalling + variant calling)

3. `run_pipeline_only_basecalling.sh`

Runs only the basecalling step locally

4. `run_pipeline_skip_basecalling.sh`

Skips basecalling and runs all other steps locally (expects existing .fastq.gz input)

Each script has a configuration section that you can customize. You must provide a valid input file (.pod5 or .fastq.gz) and adjust the parameters to your run.

Example configuration section:

### === CONFIGURATION (edit these only) ===
REMOTE_HOST="upwaszaksrv1.epfl.ch"
REMOTE_USER="${USER}"
SAMPLE_NAME="sampleName"
REMOTE_DEST_DIR="${SAMPLE_NAME}"

INPUT_PATH="small_NA12878_DNA.pod5"  # .pod5 or .fastq.gz
GTF_PATH=""
DORADO_MODEL="hac"
DORADO_MODIFICATION="5mCG_5hmCG"
CALL_VARIANTS=true
VARIANT_CALLER="clair3"
CLAIR_MODEL="dorado_model"
STRUCTURAL_VARIANT_CALLER="longcalld"
PHASE_WHATSHAP=true
ANNOTATE_VCF=true
### =======================================
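
Since the input must be a .pod5 or .fastq.gz file, it can help to check `INPUT_PATH` before launching anything. The following is a minimal sketch of such a check; `input_kind` is a hypothetical helper and is not part of the wrapper scripts.

```shell
#!/usr/bin/env bash
# Sketch only: classify INPUT_PATH by its extension before starting a run.
# input_kind is a hypothetical helper, not part of the wrapper scripts.

input_kind() {
    case "$1" in
        *.pod5)     echo "pod5" ;;
        *.fastq.gz) echo "fastq" ;;
        *)          echo "unsupported input: $1 (expected .pod5 or .fastq.gz)" >&2
                    return 1 ;;
    esac
}

INPUT_PATH="small_NA12878_DNA.pod5"
if kind="$(input_kind "$INPUT_PATH")"; then
    echo "input type: $kind"
fi
```

A .fastq.gz input pairs with `run_pipeline_skip_basecalling.sh`, the wrapper described above as expecting existing .fastq.gz input.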

🛠 How to Modify the Pipeline

Change Parameters

Most parameters (e.g., model type, input paths, tool settings) can be changed directly in the wrapper scripts.
For more advanced configuration (e.g., `max_cpus`, `max_memory`, `max_time`, and default resources), edit the nextflow.config file.
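
Before editing resource limits, it can be useful to see which values the config file currently sets. The sketch below assumes the limits appear in nextflow.config under names like those mentioned above; `show_resource_caps` is a hypothetical helper, and the pattern accepts both a dotted and an underscore spelling because the exact names in this fork are not confirmed here.

```shell
#!/usr/bin/env bash
# Sketch only: list the resource-cap lines of a Nextflow config file.
# show_resource_caps is a hypothetical helper. The pattern matches both
# "max.cpus" and "max_cpus" style names (same for memory and time).

show_resource_caps() {
    local config="${1:-nextflow.config}"
    # -n prefixes each match with its line number, -E enables alternation.
    grep -nE 'max[._](cpus|memory|time)' "$config"
}

if [ -f nextflow.config ]; then
    show_resource_caps nextflow.config || echo "no resource caps found"
fi
```

The line numbers in the output point to the places to edit.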

Modify Pipeline Logic

Main workflow: workflows/nanoseq.nf
Organized into:
- subworkflows/: groups of related steps
- modules/: individual tool processes

To change the behavior of a specific tool (e.g., Clair3):

1. Locate where it is called in workflows/nanoseq.nf.
2. Follow the call into its subworkflow, here subworkflows/local/short_variant_calling.nf.
3. Modify the logic in its module, here modules/local/clair3.nf.
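
The same trace from workflow to module can be done with a recursive search. This is a minimal sketch; `find_tool_usage` is a hypothetical helper, and it assumes it is run from the repository root with the `workflows/`, `subworkflows/`, and `modules/` directories present.

```shell
#!/usr/bin/env bash
# Sketch only: list every .nf file that mentions a tool, to trace it from the
# main workflow down to its module. find_tool_usage is a hypothetical helper.

find_tool_usage() {
    local tool="$1" root="${2:-.}"
    # -r recurses, -i ignores case (CLAIR3 vs clair3), -l prints file names only.
    grep -ril --include='*.nf' -- "$tool" \
        "$root/workflows" "$root/subworkflows" "$root/modules" 2>/dev/null | sort
}

find_tool_usage clair3 .
```

Any file in the output that is not one of the three named in the steps above is another place where the tool is referenced and may need the same change.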

Custom Docker Containers

Each module in the pipeline runs inside a container that bundles the tools needed for that process (e.g., Clair3, longcallD).

If you need a custom setup (e.g., additional tools or modified versions), assign a custom container directly inside the module with the `container` directive:

container 'docker.io/ff1997/methylasso:latest'

Build & Push a Custom Container

To create your own container and push it to Docker Hub:

1. Write a Dockerfile with the required tools.
2. Build the image and push it to Docker Hub:

docker buildx build --platform linux/amd64 -t ff1997/methylasso:latest --load .
docker push ff1997/methylasso:latest

All Dockerfiles used in this pipeline are stored under the /containers/ directory.
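
For repeated builds, the two Docker commands above can be wrapped in a small function. This is a sketch under assumptions: `run` and `build_and_push` are hypothetical helpers, and the build context path `containers/mytool` is an invented example of how a Dockerfile directory might be laid out, not a confirmed path in this repository. It prints the commands by default instead of running them.

```shell
#!/usr/bin/env bash
# Sketch only: parameterized version of the build and push commands.
# run, build_and_push and the example paths are hypothetical.

# Print the command when DRY_RUN=1 (the default here), otherwise execute it.
run() {
    if [ "${DRY_RUN:-1}" = "1" ]; then
        printf '%s\n' "$*"
    else
        "$@"
    fi
}

# build_and_push DOCKERHUB_USER IMAGE_NAME [TAG] [BUILD_CONTEXT]
build_and_push() {
    local repo="$1" name="$2" tag="${3:-latest}" context="${4:-.}"
    local image="${repo}/${name}:${tag}"
    # --platform pins the target architecture, --load stores the result in
    # the local image store so that docker push can find it.
    run docker buildx build --platform linux/amd64 -t "$image" --load "$context"
    run docker push "$image"
}

DRY_RUN=1 build_and_push myuser mytool latest containers/mytool
```

Using a version tag rather than `latest` makes it easier to tell which image a given pipeline run used.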

👤 Credits

Francesco Feher
📧 [email protected]
