SaaSHub helps you find the best software and product alternatives Learn more →
Top 23 Python llama Projects
-
Project mention: DeepSeek-OCR: When a Picture Is Actually Worth 10 Fewer Tokens | dev.to | 2025-10-26
One gotcha: if you're using vLLM, you'll need the 0.8.5 wheel for CUDA 11.8. Download it from vLLM releases before installing.
-
Stream
Stream - Scalable APIs for Chat, Feeds, Moderation, & Video. Stream helps developers build engaging apps that scale to millions with performant and flexible Chat, Feeds, Moderation, and Video APIs and SDKs powered by a global edge network and enterprise-grade infrastructure.
-
Project mention: Llama-Factory: Unified, Efficient Fine-Tuning for 100 Open LLMs | news.ycombinator.com | 2025-09-18
-
unsloth
Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, DeepSeek-R1, Qwen3, Gemma 3, TTS 2x faster with 70% less VRAM.
-
Project mention: Show HN: I built Solveig, it turns any LLM into an assistant in your terminal | news.ycombinator.com | 2025-11-13
See Usage for more: https://github.com/FSilveiraa/solveig/blob/main/docs/usage.m...
---
FEATURES
AI Terminal Assistant - Automate task planning, file management, code analysis and system management using natural language in your terminal.
Safe by Design - Granular controls with pattern-based permissions. File operations prioritized, and shell commands can be disabled.
Plugin Architecture - Extend capabilities through drop-in plugins. Add SQL queries, web scraping or block dangerous commands with 100 lines of Python.
Modern CLI - Clear interface with task planning and listing, file content previews, diff editing, API usage tracking, code linting, waiting animations and rich tree displays for informed user decisions.
Provider Independence - Works with any OpenAI-compatible API, including local models.
tl;dr: similar idea to Claude Code (https://claude.com/product/claude-code) or Aider (https://aider.chat/), focusing on providing explicit user consent, granular configuration, drop-in plugins and the ability to integrate any model, backend or API.
See the Features for more: https://github.com/FSilveiraa/solveig/blob/main/docs/about.m...
---
TYPICAL TASKS
- "Find and list all the duplicate files inside ~/Documents/"
-
Project mention: 2025 Voice AI Guide: How to Make Your Own Real-Time Voice Agent (Part-1) | dev.to | 2025-09-20
FishSpeech — Natural dialogue flow
-
LLaVA
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
-
Project mention: I Want Everything Local – Building My Offline AI Workspace | news.ycombinator.com | 2025-08-08
Agreed that this is a huge limit. There's a lot of examples actually of "tool calling" but it's all bespoke code-it-yourself: very few of these systems have MCP integration.
I have a ton of respect for SGLang as a runtime. I'm hoping something can be done there. https://github.com/sgl-project/sglang/discussions/4461 . As noted in that thread, it is really great that Qwen3-Coder has a tool-parser built-in: hopefully can be some kind useful reference/start. https://huggingface.co/Qwen/Qwen3-Coder-480B-A35B-Instruct/b...
-
InfluxDB
InfluxDB – Built for High-Performance Time Series Workloads. InfluxDB 3 OSS is now GA. Transform, enrich, and act on time series data directly in the database. Automate critical tasks and eliminate the need to move data externally. Download now.
-
-
ChuanhuChatGPT
GUI for ChatGPT API and many LLMs. Supports agents, file-based QA, GPT finetuning and query with web search. All with a neat UI.
-
AstrBot
✨ 一站式 LLM 聊天机器人平台及开发框架 ✨ 支持 QQ、QQ频道、Telegram、企微、飞书、钉钉 | 知识库、MCP 服务器、OpenAI、DeepSeek、Gemini、硅基流动、月之暗面、Ollama、OneAPI、Dify
Project mention: AstrBot: Revolutionizing Chatbot Development with Ease and Flexibility | dev.to | 2025-03-26View the Project on GitHub
-
-
OpenLLM
Run any open-source LLMs, such as DeepSeek and Llama, as OpenAI compatible API endpoint in the cloud.
Project mention: Your 2025 Roadmap to Becoming an AI Engineer for Free for Vue.js Developers | dev.to | 2025-08-06REST APIs to connect AI models to Vue.js apps (example 1, example 2).
-
-
shell_gpt
A command-line productivity tool powered by AI large language models like GPT-4, will help you accomplish your tasks faster and more efficiently.
Project mention: Supercharge Your Terminal: ShellGPT + ChromaDB + LangChain for Context-Aware Automation | dev.to | 2025-09-01🗃 To explore ShellGPT in depth, including installation instructions, usage examples, and advanced configuration options, head over to the official ShellGPT GitHub repository.
-
petals
🌸 Run LLMs at home, BitTorrent-style. Fine-tuning and inference up to 10x faster than offloading
Project mention: Petals: Run large language models at home, BitTorrent‑style | news.ycombinator.com | 2025-05-27 -
inference
Replace OpenAI GPT with another LLM in your app by changing a single line of code. Xinference gives you the freedom to use any LLM you need. With Xinference, you're empowered to run inference with any open-source language models, speech recognition models, and multimodal models, whether in the cloud, on-premises, or even on your laptop.
-
-
As an LLM serving framework, we experimented with both vLLM and LMdeploy. vLLM is one of the most popular frameworks and is frequently mentioned by our prospective clients. LMdeploy is a highly optimized framework and has shown the highest inference speed in recent benchmarking research. When using these frameworks, we used the out-of-the-gate inference configurations for both the baseline and experimental benchmark.
-
Found a GitHub list of free LLM APIs 🏆
-
-
-
-
Huatuo-Llama-Med-Chinese
Repo for BenTsao [original name: HuaTuo (华驼)], Instruction-tuning Large Language Models with Chinese Medical Knowledge. 本草(原名:华驼)模型仓库,基于中文医学知识的大语言模型指令微调
-
SaaSHub
SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives
Python llama discussion
Python llama related posts
-
Structured Outputs on the Claude Developer Platform (API)
-
Tongyi DeepResearch – open-source 30B MoE Model that rivals OpenAI DeepResearch
-
Ask HN: Who uses open LLMs and coding assistants locally? Share setup and laptop
-
Meltdown Version Pi
-
Claude Code vs. Codex: I Built a Sentiment Dashboard from 500 Reddit Comments
-
LoRA Without Regret
-
Amazon Bedrock AgentCore Runtime - Part 6 Using AgentCore short-term Memory with Strands Agents SDK
-
A note from our sponsor - SaaSHub
www.saashub.com | 15 Nov 2025
Index
What are some of the best open-source llama projects in Python? This list will help you:
| # | Project | Stars |
|---|---|---|
| 1 | vllm | 62,592 |
| 2 | LLaMA-Factory | 62,169 |
| 3 | unsloth | 48,261 |
| 4 | aider | 38,394 |
| 5 | fish-speech | 24,035 |
| 6 | LLaVA | 23,909 |
| 7 | sglang | 20,068 |
| 8 | Chinese-LLaMA-Alpaca | 18,945 |
| 9 | ChuanhuChatGPT | 15,431 |
| 10 | AstrBot | 13,227 |
| 11 | PaddleNLP | 12,843 |
| 12 | OpenLLM | 11,928 |
| 13 | ludwig | 11,616 |
| 14 | shell_gpt | 11,528 |
| 15 | petals | 9,787 |
| 16 | inference | 8,736 |
| 17 | GPTCache | 7,827 |
| 18 | lmdeploy | 7,266 |
| 19 | free-llm-api-resources | 6,541 |
| 20 | mergekit | 6,443 |
| 21 | Liger-Kernel | 5,836 |
| 22 | Baichuan-7B | 5,680 |
| 23 | Huatuo-Llama-Med-Chinese | 4,887 |