Thanks to visit codestin.com
Credit goes to www.libhunt.com

Python llama

Open-source Python projects categorized as llama

Top 23 Python llama Projects

  1. vllm

    A high-throughput and memory-efficient inference and serving engine for LLMs

    Project mention: DeepSeek-OCR: When a Picture Is Actually Worth 10 Fewer Tokens | dev.to | 2025-10-26

    One gotcha: if you're using vLLM, you'll need the 0.8.5 wheel for CUDA 11.8. Download it from vLLM releases before installing.

  2. Stream

    Stream - Scalable APIs for Chat, Feeds, Moderation, & Video. Stream helps developers build engaging apps that scale to millions with performant and flexible Chat, Feeds, Moderation, and Video APIs and SDKs powered by a global edge network and enterprise-grade infrastructure.

    Stream logo
  3. LLaMA-Factory

    Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

    Project mention: Llama-Factory: Unified, Efficient Fine-Tuning for 100 Open LLMs | news.ycombinator.com | 2025-09-18
  4. unsloth

    Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, DeepSeek-R1, Qwen3, Gemma 3, TTS 2x faster with 70% less VRAM.

    Project mention: Why ML Needs a New Programming Language | news.ycombinator.com | 2025-09-05
  5. aider

    aider is AI pair programming in your terminal

    Project mention: Show HN: I built Solveig, it turns any LLM into an assistant in your terminal | news.ycombinator.com | 2025-11-13

    See Usage for more: https://github.com/FSilveiraa/solveig/blob/main/docs/usage.m...

    ---

    FEATURES

    AI Terminal Assistant - Automate task planning, file management, code analysis and system management using natural language in your terminal.

    Safe by Design - Granular controls with pattern-based permissions. File operations prioritized, and shell commands can be disabled.

    Plugin Architecture - Extend capabilities through drop-in plugins. Add SQL queries, web scraping or block dangerous commands with 100 lines of Python.

    Modern CLI - Clear interface with task planning and listing, file content previews, diff editing, API usage tracking, code linting, waiting animations and rich tree displays for informed user decisions.

    Provider Independence - Works with any OpenAI-compatible API, including local models.

    tl;dr: similar idea to Claude Code (https://claude.com/product/claude-code) or Aider (https://aider.chat/), focusing on providing explicit user consent, granular configuration, drop-in plugins and the ability to integrate any model, backend or API.

    See the Features for more: https://github.com/FSilveiraa/solveig/blob/main/docs/about.m...

    ---

    TYPICAL TASKS

    - "Find and list all the duplicate files inside ~/Documents/"

  6. fish-speech

    SOTA Open Source TTS

    Project mention: 2025 Voice AI Guide: How to Make Your Own Real-Time Voice Agent (Part-1) | dev.to | 2025-09-20

    FishSpeech — Natural dialogue flow

  7. LLaVA

    [NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

    Project mention: OpenAI Open Models | news.ycombinator.com | 2025-08-05
  8. sglang

    SGLang is a fast serving framework for large language models and vision language models.

    Project mention: I Want Everything Local – Building My Offline AI Workspace | news.ycombinator.com | 2025-08-08

    Agreed that this is a huge limit. There's a lot of examples actually of "tool calling" but it's all bespoke code-it-yourself: very few of these systems have MCP integration.

    I have a ton of respect for SGLang as a runtime. I'm hoping something can be done there. https://github.com/sgl-project/sglang/discussions/4461 . As noted in that thread, it is really great that Qwen3-Coder has a tool-parser built-in: hopefully can be some kind useful reference/start. https://huggingface.co/Qwen/Qwen3-Coder-480B-A35B-Instruct/b...

  9. InfluxDB

    InfluxDB – Built for High-Performance Time Series Workloads. InfluxDB 3 OSS is now GA. Transform, enrich, and act on time series data directly in the database. Automate critical tasks and eliminate the need to move data externally. Download now.

    InfluxDB logo
  10. Chinese-LLaMA-Alpaca

    中文LLaMA&Alpaca大语言模型+本地CPU/GPU训练部署 (Chinese LLaMA & Alpaca LLMs)

  11. ChuanhuChatGPT

    GUI for ChatGPT API and many LLMs. Supports agents, file-based QA, GPT finetuning and query with web search. All with a neat UI.

  12. AstrBot

    ✨ 一站式 LLM 聊天机器人平台及开发框架 ✨ 支持 QQ、QQ频道、Telegram、企微、飞书、钉钉 | 知识库、MCP 服务器、OpenAI、DeepSeek、Gemini、硅基流动、月之暗面、Ollama、OneAPI、Dify

    Project mention: AstrBot: Revolutionizing Chatbot Development with Ease and Flexibility | dev.to | 2025-03-26

    View the Project on GitHub

  13. PaddleNLP

    Easy-to-use and powerful LLM and SLM library with awesome model zoo.

  14. OpenLLM

    Run any open-source LLMs, such as DeepSeek and Llama, as OpenAI compatible API endpoint in the cloud.

    Project mention: Your 2025 Roadmap to Becoming an AI Engineer for Free for Vue.js Developers | dev.to | 2025-08-06

    REST APIs to connect AI models to Vue.js apps (example 1, example 2).

  15. ludwig

    Low-code framework for building custom LLMs, neural networks, and other AI models

  16. shell_gpt

    A command-line productivity tool powered by AI large language models like GPT-4, will help you accomplish your tasks faster and more efficiently.

    Project mention: Supercharge Your Terminal: ShellGPT + ChromaDB + LangChain for Context-Aware Automation | dev.to | 2025-09-01

    🗃 To explore ShellGPT in depth, including installation instructions, usage examples, and advanced configuration options, head over to the official ShellGPT GitHub repository.

  17. petals

    🌸 Run LLMs at home, BitTorrent-style. Fine-tuning and inference up to 10x faster than offloading

    Project mention: Petals: Run large language models at home, BitTorrent‑style | news.ycombinator.com | 2025-05-27
  18. inference

    Replace OpenAI GPT with another LLM in your app by changing a single line of code. Xinference gives you the freedom to use any LLM you need. With Xinference, you're empowered to run inference with any open-source language models, speech recognition models, and multimodal models, whether in the cloud, on-premises, or even on your laptop.

  19. GPTCache

    Semantic cache for LLMs. Fully integrated with LangChain and llama_index.

  20. lmdeploy

    LMDeploy is a toolkit for compressing, deploying, and serving LLMs.

    Project mention: LLM Benchmarking: Cost-Efficient Performance | dev.to | 2025-04-08

    As an LLM serving framework, we experimented with both vLLM and LMdeploy. vLLM is one of the most popular frameworks and is frequently mentioned by our prospective clients. LMdeploy is a highly optimized framework and has shown the highest inference speed in recent benchmarking research. When using these frameworks, we used the out-of-the-gate inference configurations for both the baseline and experimental benchmark.

  21. free-llm-api-resources

    A list of free LLM inference resources accessible via API.

    Project mention: I Recreated Tom Riddle’s Diary, But Used My Soul Instead 👻 | dev.to | 2025-02-06

    Found a GitHub list of free LLM APIs 🏆

  22. mergekit

    Tools for merging pretrained large language models.

  23. Liger-Kernel

    Efficient Triton Kernels for LLM Training

    Project mention: Why ML Needs a New Programming Language | news.ycombinator.com | 2025-09-05
  24. Baichuan-7B

    A large-scale 7B pretraining language model developed by BaiChuan-Inc.

  25. Huatuo-Llama-Med-Chinese

    Repo for BenTsao [original name: HuaTuo (华驼)], Instruction-tuning Large Language Models with Chinese Medical Knowledge. 本草(原名:华驼)模型仓库,基于中文医学知识的大语言模型指令微调

  26. SaaSHub

    SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives

    SaaSHub logo
NOTE: The open source projects on this list are ordered by number of github stars. The number of mentions indicates repo mentiontions in the last 12 Months or since we started tracking (Dec 2020).

Python llama discussion

Log in or Post with

Python llama related posts

  • Structured Outputs on the Claude Developer Platform (API)

    7 projects | news.ycombinator.com | 14 Nov 2025
  • Tongyi DeepResearch – open-source 30B MoE Model that rivals OpenAI DeepResearch

    4 projects | news.ycombinator.com | 2 Nov 2025
  • Ask HN: Who uses open LLMs and coding assistants locally? Share setup and laptop

    12 projects | news.ycombinator.com | 31 Oct 2025
  • Meltdown Version Pi

    1 project | news.ycombinator.com | 19 Oct 2025
  • Claude Code vs. Codex: I Built a Sentiment Dashboard from 500 Reddit Comments

    2 projects | news.ycombinator.com | 18 Oct 2025
  • LoRA Without Regret

    2 projects | news.ycombinator.com | 4 Oct 2025
  • Amazon Bedrock AgentCore Runtime - Part 6 Using AgentCore short-term Memory with Strands Agents SDK

    3 projects | dev.to | 29 Sep 2025
  • A note from our sponsor - InfluxDB
    www.influxdata.com | 15 Nov 2025
    InfluxDB 3 OSS is now GA. Transform, enrich, and act on time series data directly in the database. Automate critical tasks and eliminate the need to move data externally. Download now. Learn more →

Index

What are some of the best open-source llama projects in Python? This list will help you:

# Project Stars
1 vllm 62,592
2 LLaMA-Factory 62,169
3 unsloth 48,261
4 aider 38,394
5 fish-speech 24,035
6 LLaVA 23,909
7 sglang 20,068
8 Chinese-LLaMA-Alpaca 18,945
9 ChuanhuChatGPT 15,431
10 AstrBot 13,227
11 PaddleNLP 12,843
12 OpenLLM 11,928
13 ludwig 11,616
14 shell_gpt 11,528
15 petals 9,787
16 inference 8,736
17 GPTCache 7,827
18 lmdeploy 7,266
19 free-llm-api-resources 6,541
20 mergekit 6,443
21 Liger-Kernel 5,836
22 Baichuan-7B 5,680
23 Huatuo-Llama-Med-Chinese 4,887

Sponsored
Stream - Scalable APIs for Chat, Feeds, Moderation, & Video.
Stream helps developers build engaging apps that scale to millions with performant and flexible Chat, Feeds, Moderation, and Video APIs and SDKs powered by a global edge network and enterprise-grade infrastructure.
getstream.io

Did you know that Python is
the 2nd most popular programming language
based on number of references?