Stars
Easily compute clip embeddings and build a clip retrieval system with them
TextGrad: Automatic ''Differentiation'' via Text -- using large language models to backpropagate textual gradients. Published in Nature.
This repository is a curated collection of the most exciting and influential CVPR 2024 papers. 🔥 [Paper + Code + Demo]
A playbook for systematically maximizing the performance of deep learning models.
[ICLR 2024] Fine-tuning LLaMA to follow Instructions within 1 Hour and 1.2M Parameters
A deep learning library for video understanding research.
A collection of reference Jupyter notebooks and demo AI/ML applications for enterprise use cases: marketing, pricing, supply chain, smart manufacturing, and more.
CNN-based audio segmentation toolkit. Allows to detect speech, music, noise and speaker gender. Has been designed for large scale gender equality studies based on speech time per gender.
This repository includes the notebook of „Understanding Videos at Scale: How to Extract Insights for Business Research“
Python scripts for modelling timbral attributes
pix2tex: Using a ViT to convert images of equations into LaTeX code.
The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.
Multi agent system for AI-driven software development. Combine LLM with DevOps tools to convert natural language requirements into working software. Supports any development language and extends th…
[AAAI 2024 Oral] AnomalyGPT: Detecting Industrial Anomalies Using Large Vision-Language Models
AI companions with memory: a lightweight stack to create and host your own AI companions
A collection of tutorials on state-of-the-art computer vision models and techniques. Explore everything from foundational architectures like ResNet to cutting-edge models like YOLO11, RT-DETR, SAM …
JARVIS, a system to connect LLMs with ML community. Paper: https://arxiv.org/pdf/2303.17580.pdf
LlamaIndex is the leading framework for building LLM-powered agents over your data.
Code for the paper Fine-Tuning Language Models from Human Preferences
Code for "Learning to summarize from human feedback"
This repo includes ChatGPT prompt curation to use ChatGPT and other LLM tools better.
🐙 Guides, papers, lessons, notebooks and resources for prompt engineering, context engineering, RAG, and AI Agents.
A feature-rich command-line audio/video downloader
Python library to download YouTube content and retrieve metadata