Stars
PyTorch implementation of FractalGen https://arxiv.org/abs/2502.17437
"Deep Generative Modeling": Introductory Examples
Tools to Design or Visualize Architecture of Neural Network
Efficient vision foundation models for high-resolution generation and perception.
Infinite Photorealistic Worlds using Procedural Generation
Machine Learning From Scratch. Bare bones NumPy implementations of machine learning models and algorithms with a focus on accessibility. Aims to cover everything from linear regression to deep lear…
PyTorch implementations of Generative Adversarial Networks.
FULL Augment Code, Claude Code, Cluely, CodeBuddy, Comet, Cursor, Devin AI, Junie, Kiro, Leap.new, Lovable, Manus, NotionAI, Orchids.app, Perplexity, Poke, Qoder, Replit, Same.dev, Trae, Traycer AI…
Official PyTorch implementation of One-Minute Video Generation with Test-Time Training
Official implementation of Inductive Moment Matching
Easily turn large sets of image URLs into an image dataset. Can download, resize, and package 100M URLs in 20 hours on one machine.
A Conversational Speech Generation Model
A Python package that makes it easy for developers to create AI apps powered by various AI providers.
A PyTorch library for implementing flow matching algorithms, featuring continuous and discrete flow matching implementations. It includes practical examples for both text and image modalities.
SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer
Code for NeurIPS 2024 paper - The GAN is dead; long live the GAN! A Modern Baseline GAN - by Huang et al.
Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.
Font files available from Google Fonts, and a public issue tracker for all things Google Fonts
Simple, unified interface to multiple Generative AI providers
A simple screen-parsing tool towards a pure vision-based GUI agent
[ICLR 2025] Hallo2: Long-Duration and High-Resolution Audio-driven Portrait Image Animation
A Lightweight Face Recognition and Facial Attribute Analysis (Age, Gender, Emotion and Race) Library for Python
Entropy Based Sampling and Parallel CoT Decoding
Notebooks to fine-tune the `bert-small-amharic`, `bert-mini-amharic`, and `xlm-roberta-base` models using an Amharic text classification dataset and the transformers library
Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation
Morphological processing for languages of the Horn of Africa
Refine high-quality datasets and visual AI models
The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (V…