A Python implementation of the OrthoANI algorithm for nucleotide identity measurement.
OrthoANI is a metric proposed by Lee et al.[1] in 2016 to improve computation of Average Nucleotide Identity. It uses BLASTn to find orthologous blocks in a pair of sequences, and then computes the average identity only considering alignments of reciprocal orthologs.
PyOrthoANI is a reimplementation of the closed-source Java implementation
provided by the authors on ezbiocloud.net.
It relies on Biopython to handle the I/O, and calls
the BLAST+ binaries using the subprocess module of the Python standard
library.
Installing with pip is the easiest:
$ pip install pyorthoaniPyOrthoANI also requires the BLAST+ binaries to be installed on your machine
and available somewhere in your $PATH.
Use Biopython to load two FASTA files, and then orthoani.orthoani to compute
the OrthoANI metric between them:
import pyorthoani
from Bio.SeqIO import read
genome_1 = read("sequence1.fa", "fasta")
genome_2 = read("sequence2.fa", "fasta")
ani = pyorthoani.orthoani(genome_1, genome_2)pyorthoani can also be used from the CLI using a very simple command-line
interface mimicking the original Java tool:
$ pyorthoani -q sequence1.fa -r sequence2.fa
57.25orthoani uses the machine temporary folder to handle BLAST+ input and output
files, which is configurable through
tempfile.tempdir.
On some systems (like ArchLinux), this filesystem can reside in memory, which means
that your computer could have trouble processing very large files. If this
happens, try changing the value of the tempfile.tempdir to a directory that
is actually located on physical storage.
Values computed by this package and the original Java implementation may differ slightly because in Java the authors perform rounding of floating-point values at the sub-percent level, while this library uses the full values.
If you found PyOrthoANI useful, please cite our paper, as well as the original OrthoANI paper.
To cite PyOrthoANI:
Martin Larralde, Georg Zeller, Laura M. Carroll. 2025. PyOrthoANI, PyFastANI, and Pyskani: a suite of Python libraries for computation of average nucleotide identity. NAR Genomics and Bioinformatics 7(3):lqaf095. doi:10.1093/nargab/lqaf095.
To cite OrthoANI:
Imchang Lee, Yeong Ouk Kim, Sang-Cheol Park, Jongsik Chun. 2016. OrthoANI: An improved algorithm and software for calculating average nucleotide identity. International Journal of Systematic and Evolutionary Microbiology 66(2):1100-1103. doi:10.1099/ijsem.0.000760.
This library is provided under the open-source MIT license.
This project is in no way not affiliated, sponsored, or otherwise endorsed by the original OrthoANI authors. It was developed by Martin Larralde during his PhD project at the European Molecular Biology Laboratory in the Zeller team.
- [1] Imchang Lee, Yeong Ouk Kim, Sang-Cheol Park and Jongsik Chun. OrthoANI: An improved algorithm and software for calculating average nucleotide identity (2016). International Journal of Systematic and Evolutionary Microbiology. doi:10.1099/ijsem.0.000760. PMID:26585518.