Thanks to visit codestin.com
Credit goes to github.com

Skip to content

artorias111/telsearch

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

23 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Telsearch works on raw sequencing reads of any kind (Illumina, PacBio HiFi, ONT, etc.)

This is a rookie project to practice Rust, and as of now, I've implemented it to run well on a cluster with a lot of memory (>1TB (not a typo)). So use it with caution.

The default mode that currently runs without breaking uses exact string matching to detect a telomere sequence (and its reverse complement). That's a horrible way to detect biological sequences, I plan to include better pairwise alignment algorithms in the future.

Returns the frequency of a canonical telomere seqeunce in each read.

Example run: (The fastq file should be gz compressed)

## run with cargo
cargo run -- --reads path/to/fastq.gz --telomere TTAGGC

example output:

sequence  telomere_count
SEQ004  6
SEQ009  5
SEQ002  5
.
.
.

About

Find telomere sequence frequency in sequencing reads

Resources

Stars

Watchers

Forks

Packages

No packages published

Languages