Change the repository type filter
All
Repositories list
79 repositories
- binutils-gdbPublic
- glibcPublic
- WSLPublic
- et-manPublic
- cogent-sdkPublic
- bashPublic
- miaPublic
- orgPublic
- RWKV-RunnerPublic
- frida-corePublic
- fridaPublic
- frida-toolsPublic
- rwkv_ai00_serverPublic
- web-rwkvPublic
- android-unpinnerPublic
- skypilotPublic
- sglangPublic
- dspyPublic
- llama-cookbookPublicWelcome to the Llama Cookbook! This is your go to guide for Building with Llama: Getting started with Inference, Fine-Tuning, RAG. We also show you how to solve end to end problems using Llama model family and using them on various provider services
- acceleratePublic
- sgl-kernel-npuPublic
- omePublic
- aompPublic
- TensorRT-LLMPublicTensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and support state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that orchestrate the inference execution in performant way.
- git-lfsPublic
- dinov3Public
- llama.rnPublic
- llama.nodePublic