    Repositories list

    • MDX
      103541327 Updated Oct 24, 2025
    • Official inference library for pre-processing of Mistral models
      Python
      10980414 Updated Oct 24, 2025
    • cookbook

      Public
      Jupyter Notebook
      4472k413 Updated Oct 24, 2025
    • Python client library for Mistral AI platform
      Python
      151660235 Updated Oct 21, 2025
    • Gateway API Inference Extension
      Go
      184000 Updated Oct 9, 2025
    • Inference scheduler for llm-d
      Go
      85000 Updated Oct 9, 2025
    • Distributed KV cache coordinator
      Go
      51000 Updated Oct 7, 2025
    • client-ts

      Public
      TS Client library for Mistral AI platform
      TypeScript
      28101275 Updated Sep 2, 2025
    • Python
      137731 Updated Aug 20, 2025
    • Mistral AI documentation for SageMaker
      Jupyter Notebook
      4101 Updated Apr 18, 2025
    • Official inference library for Mistral models
      Jupyter Notebook
      97911k12932 Updated Mar 20, 2025
    • client-js

      Public archive
      JS Client library for Mistral AI platform
      JavaScript
      4719400 Updated Oct 10, 2024
    • Python
      2983k3511 Updated Sep 13, 2024
    • TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.
      C++
      1.8k1400 Updated Feb 12, 2024
    • vllm-release

      Public archive
      A high-throughput and memory-efficient inference and serving engine for LLMs
      Python
      11k5200 Updated Dec 11, 2023
    • megablocks-public

      Public archive
      Python
      21586500 Updated Dec 8, 2023
    • FastChat-release

      Public archive
      An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
      Python
      4.8k4700 Updated Oct 2, 2023
    • 🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
      Python
      31k3000 Updated Sep 27, 2023