Releases: pachterlab/gget
Releases · pachterlab/gget
v0.29.3 - BLAT access
Version ≥ 0.29.3 (Sep 11, 2025):
gget blat: Updated API request to new permissions.gget pdb: Added wwpdb mirror; falls back to rcsb if wwpdb fails.gget cellxgene: Improved argument handling; frontend unchanged. Fixes issue 181.gget setup/gget alphafold: Fixed pip_cmd bug in gget.setup("alphafold")
v0.29.2 - uv pip install gget
Version ≥ 0.29.2 (Jul 03, 2025):
ggetcan now be installed usinguv pip install gget- All package metadata (version, author, description, etc.) is now managed in setup.cfg for full compatibility with modern tools like uv, pip, and PyPI
- gget now uses a minimal setup.py and is fully PEP 517/518 compatible
gget setupwill now try to useuv pip installfirst for speed and modern dependency resolution, and fall back ontopip installif uv fails or is not available- Users are informed at each step which installer is being used and if a retry is happening
- Note: Some scientific dependencies (e.g., cellxgene-census) may not yet support Python 3.12. If you encounter installation errors, try using Python 3.9 or 3.10. (The pip installation might also still succeed in these cases.)
- All required dependencies are now listed in setup.cfg under install_requires -> Installing gget with
pip install .oruv pip install .will automatically install all dependencies
v0.29.1 - mutate and cosmic overhaul
gget mutate:- gget mutate has been simplified to focus on taking as input a list of mutations and associated reference genome with corresponding annotation information, and produce as output the sequences with the mutation incorporated and a short region of surrounding context. For the full functionality of the previous version and how it integrates in the context of a novel variant screening pipeline, visit the varseek repository being developed by members of the gget team at https://github.com/pachterlab/varseek.git.
- Added additional information to returned data frames as described here: #169
gget cosmic:- Major restructuring of the
gget cosmicmodule to adhere to new login requirements set by COSMIC - New arguments
emailandpasswordwere added to allow the user to manually enter their login credentials without required input for data download - Default changed:
gget_mutate=False - Deprecated argument:
entity - Argument
mutation_classis nowcosmic_project
- Major restructuring of the
gget bgee:type="orthologs"is now the default, removing the need to specify thetypeargument when calling orthologs- Allow querying multiple genes at once.
gget diamond:- Now supports translated alignment of nucleotide sequences to amino acid reference sequences using the
--translatedflag.
- Now supports translated alignment of nucleotide sequences to amino acid reference sequences using the
gget elm:- Improved server error handling.
v0.29.0 - cbio, opentargets, bgee and more
- New modules:
gget enrichrnow also supports species other than human and mouse (fly, yeast, worm, and fish) via modEnrichRgget mutate:
gget mutatewill now merge identical sequences in the final file by default. Mutation creation was vectorized to decrease runtime. Improved flanking sequence check for non-substitution mutations to make sure no wildtype kmer is retained in the mutation-containing sequence. Addition of several new arguments to customize sequence generation and output.gget cosmic:
Added support for targeted as well as gene screens. The CSV file created for gget mutate now also contains protein mutation info.gget ref:
Added out file option.gget infoandgget seq:
Switched to Ensembl POST API to increase speed (nothing changes in front end).- Other "behind the scenes" changes:
- Unit tests reorganized to increase speed and decrease code
- Requirements updated to allow newer mysql-connector versions
- Support Numpy>= 2.0
v0.28.6 - gget mutate, download_cosmic, fixes for Ensembl v112
- New module:
gget mutate gget cosmic: You can now download entire COSMIC databases using the argumentdownload_cosmicargumentgget ref: Can now fetch the GRCh37 genome assembly usingspecies='human_grch37'gget search: Adjust access of human data to the structure of Ensembl release 112 (fixes issue 129)
v0.28.4 - Fix Windows bug in gget elm setup
Fix Windows bug in gget elm setup
v0.28.3 - cosmic, invertebrates for ref and search, elm improvements
gget searchandgget refnow also support fungi 🍄, protists 🌝, and invertebrate metazoa 🐝 🐜 🐌 🐙 (in addition to vertebrates and plants)- New module:
gget cosmic gget enrichr: Fix duplicate scatter dots in plot when pathway names are duplicatedgget elm:- Changed ortho results column name 'Ortholog_UniProt_ID' to 'Ortholog_UniProt_Acc' to correctly reflect the column contents, which are UniProt Accessions. 'UniProt ID' was changed to 'UniProt Acc' in the documentation for all
ggetmodules. - Changed ortho results column name 'motif_in_query' to 'motif_inside_subject_query_overlap'.
- Added interaction domain information to results (new columns: "InteractionDomainId", "InteractionDomainDescription", "InteractionDomainName").
- The regex string for regular expression matches was encapsulated as follows: "(?=(regex))" (instead of directly passing the regex string "regex") to enable capturing all occurrences of a motif when the motif length is variable and there are repeats in the sequence (https://regex101.com/r/HUWLlZ/1).
- Changed ortho results column name 'Ortholog_UniProt_ID' to 'Ortholog_UniProt_Acc' to correctly reflect the column contents, which are UniProt Accessions. 'UniProt ID' was changed to 'UniProt Acc' in the documentation for all
gget setup: Use theoutargument to specify a directory the ELM database will be downloaded into. Completes this feature request.gget diamond: The DIAMOND command is now run with--ignore-warningsflag, allowing niche sequences such as amino acid sequences that only contain nucleotide characters and repeated sequences. This is also true for DIAMOND alignments performed withingget elm.gget refandgget searchback-end change: the current Ensembl release is fetched from the new release file on the Ensembl FTP site to avoid errors during uploads of new releases.gget search:- FTP link results (
--ftp) are saved in txt file format instead of json. - Fix URL links to Ensembl gene summary for species with a subspecies name and invertebrates.
- FTP link results (
gget ref:- Back-end changes to increase speed
- New argument:
list_iv_speciesto list all available invertebrate species (can be combined with thereleaseargument to fetch all species available from a specific Ensembl release)
v0.28.2 - NCBI server issues and gget elm expand
gget info: Return a logging error message when the NCBI server fails for a reason other than a fetch fail (this is an error on the server side rather than an error withgget)- Replace deprecated 'text' argument to find()-type methods whenever used with dependency
BeautifulSoup gget elm: Remove false positive and true negative instances from returned resultsgget elm: Addexpandargument
v0.28.0 - gget elm + gget diamond
- Updated documentation of
gget muscleto add a tutorial on how to visualize sequences with sequence name lengths + slight change to returned visualization so it's a bit more robust to varying sequence names gget musclenow also allows a list of sequences as input (as an alternative to providing the path to a FASTA file)- Allow missing gene filter for
gget cellxgene(fixes bug) gget seq: Allow missing gene names (fixes #107)- New arguments for
gget enrichr: Use argumentskegg_outandkegg_rankto create an image of the KEGG pathway with the genes from the enrichment analysis highlighted (thanks to this PR by Noriaki Sato) - New modules:
gget elmandgget diamond
co-authored-by: @anhchi172