RNAkinet is a project dedicated to detecting 5EU-modified reads from the raw nanopore sequencing signal. This repository contains the code for tha analysis of the RNAkinet publication. For the actual RNAkinet code refer to https://github.com/maragkakislab/rnakinet
- Activate a conda environment with snakemake installed (you can use the
snakemake.yamlfile to create it) - Navigate to the
rnakinet/workflowfolder - Change the paths and parameters in
config/training_setup_custom.pyto reflect your data and requirements (provide genome fasta file, paths to fast5s etc...) - Open the
Snakemakefile and make sure the experiment name is the same one you specified in theconfig/training_setup_custom.pyfile - While in the workflow folder, run
snakemake --cores 32 --use-conda -npto get a plan for execution. Once ready remove the-npflag to run training - Once training is finished, the model checkpoint will be available in the
checkpoints_plfolder - You can use the checkpoint to run the
scripts/inference.pyto use it for prediction on other fast5 files