- 
          
- 
                Notifications
    You must be signed in to change notification settings 
- Fork 183
Home
        Benjamin Buchfink edited this page Sep 14, 2025 
        ·
        36 revisions
      
    # downloading the tool
wget http://github.com/bbuchfink/diamond/releases/download/v2.1.14/diamond-linux64.tar.gz
tar xzf diamond-linux64.tar.gz
# creating a diamond-formatted database file
./diamond makedb --in reference.fasta -d reference
# running a search in blastp mode
./diamond blastp -d reference -q queries.fasta -o matches.tsv
# running a search in blastx mode
./diamond blastx -d reference -q reads.fasta -o matches.tsv
# downloading and using a BLAST database (use DIAMOND >= v2.1.14)
update_blastdb.pl --decompress --blastdb_version 5 swissprot
./diamond blastp -d swissprot -q queries.fasta -o matches.tsv
# running fast clustering with linear scaling (30% identity threshold)
diamond linclust -d reference.fasta -o clusters.tsv --approx-id 30 -M 64G
# running sensitive clustering using all-vs-all alignment (30% identity threshold)
diamond cluster -d reference.fasta -o clusters.tsv --approx-id 30 -M 64GImportant
- 
Repeat masking is applied to the query and reference
sequences by default. To disable it, use --masking 0.
- DIAMOND is optimized for large input files of >1 million proteins. Naturally the tool can be used for smaller files as well, but the algorithm will not reach its full efficiency.
- The program may use quite a lot of memory and also temporary
disk space. Should the program fail due to running out of either
one, you need to set a lower value for the
block size parameter
-b.
- The sensitivity can be adjusted using the options --faster,--fast,--mid-sensitive,--sensitive,--more-sensitive,--very-sensitiveand--ultra-sensitive.