Stars
The Triton Inference Server provides an optimized cloud and edge inferencing solution.
Development repository for the Triton language and compiler
🚀🚀 「大模型」2小时完全从0训练26M的小参数GPT!🌏 Train a 26M-parameter GPT from scratch in just 2h!
An alignment auditing agent capable of quickly exploring alignment hypothesis
This repository showcases various advanced techniques for Retrieval-Augmented Generation (RAG) systems. RAG systems combine information retrieval with generative models to provide accurate and cont…
Interactive roadmaps, guides and other educational content to help developers grow in their careers.
Official code repo for the O'Reilly Book - "Hands-On Large Language Models"
Python ETL framework for stream processing, real-time analytics, LLM pipelines, and RAG.
NVSentinel is a cross-platform fault remediation service designed to rapidly remediate runtime node-level issues in GPU-accelerated computing environments
Tensors and Dynamic neural networks in Python with strong GPU acceleration
Claude Code superpowers: core skills library
Anthropic's Interactive Prompt Engineering Tutorial
💯 Curated coding interview preparation materials for busy software engineers
Collection of awesome LLM apps with AI Agents and RAG using OpenAI, Anthropic, Gemini and opensource models.
Open-source platform to build and deploy AI agent workflows.
Find, verify, and analyze leaked credentials
Sample code and notebooks for Generative AI on Google Cloud, with Gemini on Vertex AI
LLM agents built for control. Designed for real-world use. Deployed in minutes.
In-depth tutorials on LLMs, RAGs and real-world AI agent applications.
Free, local, open-source AI app builder ✨ v0 / lovable / Bolt alternative 🌟 Star if you like it!
Test your prompts, agents, and RAGs. AI Red teaming, pentesting, and vulnerability scanning for LLMs. Compare performance of GPT, Claude, Gemini, Llama, and more. Simple declarative configs with co…
Collection of extracted System Prompts from popular chatbots like ChatGPT, Claude & Gemini
Google Gen AI Python SDK provides an interface for developers to integrate Google's generative models into their Python applications.
MCP Toolbox for Databases is an open source MCP server for databases.
Trae Agent is an LLM-based agent for general purpose software engineering tasks.
Jan is an open source alternative to ChatGPT that runs 100% offline on your computer.