Stars
An end-to-end pipeline to fine-tune and use text-to-audio diffusion models
A user interface toolkit mainly for audio plug-ins
A bio-inspired memory architecture for multi-modal reasoning and recall in AI systems
A collection of basic effects, available as open-source (MIT) C++ classes.
Simple Online Realtime Tracking with a Deep Association Metric
The repository provides code for running inference with the Segment Anything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
One-step image-to-image with Stable Diffusion turbo: sketch2image, day2night, and more
Image-to-Image Translation in PyTorch
A New Padding Scheme: Partial Convolution based Padding
The official repo for “Dolphin: Document Image Parsing via Heterogeneous Anchor Prompting”, ACL, 2025.
This repository contains the official implementation of the research paper "FastViT: A Fast Hybrid Vision Transformer using Structural Reparameterization" (ICCV 2023).
Free and Open Source Machine Translation API. Self-hosted, offline-capable, and easy to set up.
Official repository for the paper - SLAP: Siamese Language-Audio Pretraining without negative samples for Music Understanding
Training, validation, and inference code for various SSL approaches and architectures.
Zammad is a web-based, open-source helpdesk/customer support system.
Trae Agent is an LLM-based agent for general-purpose software engineering tasks.
📥 An inbox UX for interacting with human-in-the-loop agents.
An easy-to-use Python wrapper for NSAppleScript, allowing Python scripts to communicate with AppleScripts and AppleScriptable applications.
Community plugins list, theme list, and releases of Obsidian.
Template for Obsidian community plugins with build configuration and development best practices.
Lightweight, flexible HTTP server framework written in Swift
🦔 PostHog provides open-source web & product analytics, session recording, feature flagging and A/B testing that you can self-host. Get started - free.