InfluxDB 3 OSS is now GA. Transform, enrich, and act on time series data directly in the database. Automate critical tasks and eliminate the need to move data externally. Download now. Learn more →
Top 23 Python llama Projects
-
Project mention: DeepSeek-OCR: When a Picture Is Actually Worth 10 Fewer Tokens | dev.to | 2025-10-26
One gotcha: if you're using vLLM, you'll need the 0.8.5 wheel for CUDA 11.8. Download it from vLLM releases before installing.
-
InfluxDB
InfluxDB – Built for High-Performance Time Series Workloads. InfluxDB 3 OSS is now GA. Transform, enrich, and act on time series data directly in the database. Automate critical tasks and eliminate the need to move data externally. Download now.
-
Project mention: Llama-Factory: Unified, Efficient Fine-Tuning for 100 Open LLMs | news.ycombinator.com | 2025-09-18
-
unsloth
Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, DeepSeek-R1, Qwen3, Gemma 3, TTS 2x faster with 70% less VRAM.
-
Project mention: Show HN: I built Solveig, it turns any LLM into an assistant in your terminal | news.ycombinator.com | 2025-11-13
See Usage for more: https://github.com/FSilveiraa/solveig/blob/main/docs/usage.m...
---
FEATURES
AI Terminal Assistant - Automate task planning, file management, code analysis and system management using natural language in your terminal.
Safe by Design - Granular controls with pattern-based permissions. File operations prioritized, and shell commands can be disabled.
Plugin Architecture - Extend capabilities through drop-in plugins. Add SQL queries, web scraping or block dangerous commands with 100 lines of Python.
Modern CLI - Clear interface with task planning and listing, file content previews, diff editing, API usage tracking, code linting, waiting animations and rich tree displays for informed user decisions.
Provider Independence - Works with any OpenAI-compatible API, including local models.
tl;dr: similar idea to Claude Code (https://claude.com/product/claude-code) or Aider (https://aider.chat/), focusing on providing explicit user consent, granular configuration, drop-in plugins and the ability to integrate any model, backend or API.
See the Features for more: https://github.com/FSilveiraa/solveig/blob/main/docs/about.m...
---
TYPICAL TASKS
- "Find and list all the duplicate files inside ~/Documents/"
-
Project mention: 2025 Voice AI Guide: How to Make Your Own Real-Time Voice Agent (Part-1) | dev.to | 2025-09-20
FishSpeech — Natural dialogue flow
-
LLaVA
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
-
Project mention: I Want Everything Local – Building My Offline AI Workspace | news.ycombinator.com | 2025-08-08
Agreed that this is a huge limit. There's a lot of examples actually of "tool calling" but it's all bespoke code-it-yourself: very few of these systems have MCP integration.
I have a ton of respect for SGLang as a runtime. I'm hoping something can be done there. https://github.com/sgl-project/sglang/discussions/4461 . As noted in that thread, it is really great that Qwen3-Coder has a tool-parser built-in: hopefully can be some kind useful reference/start. https://huggingface.co/Qwen/Qwen3-Coder-480B-A35B-Instruct/b...
-
Stream
Stream - Scalable APIs for Chat, Feeds, Moderation, & Video. Stream helps developers build engaging apps that scale to millions with performant and flexible Chat, Feeds, Moderation, and Video APIs and SDKs powered by a global edge network and enterprise-grade infrastructure.
-
-
ChuanhuChatGPT
GUI for ChatGPT API and many LLMs. Supports agents, file-based QA, GPT finetuning and query with web search. All with a neat UI.
-
AstrBot
✨ 一站式 LLM 聊天机器人平台及开发框架 ✨ 支持 QQ、QQ频道、Telegram、企微、飞书、钉钉 | 知识库、MCP 服务器、OpenAI、DeepSeek、Gemini、硅基流动、月之暗面、Ollama、OneAPI、Dify
Project mention: AstrBot: Revolutionizing Chatbot Development with Ease and Flexibility | dev.to | 2025-03-26View the Project on GitHub
-
-
OpenLLM
Run any open-source LLMs, such as DeepSeek and Llama, as OpenAI compatible API endpoint in the cloud.
Project mention: Your 2025 Roadmap to Becoming an AI Engineer for Free for Vue.js Developers | dev.to | 2025-08-06REST APIs to connect AI models to Vue.js apps (example 1, example 2).
-
-
shell_gpt
A command-line productivity tool powered by AI large language models like GPT-4, will help you accomplish your tasks faster and more efficiently.
Project mention: Supercharge Your Terminal: ShellGPT + ChromaDB + LangChain for Context-Aware Automation | dev.to | 2025-09-01🗃 To explore ShellGPT in depth, including installation instructions, usage examples, and advanced configuration options, head over to the official ShellGPT GitHub repository.
-
petals
🌸 Run LLMs at home, BitTorrent-style. Fine-tuning and inference up to 10x faster than offloading
Project mention: Petals: Run large language models at home, BitTorrent‑style | news.ycombinator.com | 2025-05-27 -
inference
Replace OpenAI GPT with another LLM in your app by changing a single line of code. Xinference gives you the freedom to use any LLM you need. With Xinference, you're empowered to run inference with any open-source language models, speech recognition models, and multimodal models, whether in the cloud, on-premises, or even on your laptop.
-
-
As an LLM serving framework, we experimented with both vLLM and LMdeploy. vLLM is one of the most popular frameworks and is frequently mentioned by our prospective clients. LMdeploy is a highly optimized framework and has shown the highest inference speed in recent benchmarking research. When using these frameworks, we used the out-of-the-gate inference configurations for both the baseline and experimental benchmark.
-
Found a GitHub list of free LLM APIs 🏆
-
-
-
-
Huatuo-Llama-Med-Chinese
Repo for BenTsao [original name: HuaTuo (华驼)], Instruction-tuning Large Language Models with Chinese Medical Knowledge. 本草(原名:华驼)模型仓库,基于中文医学知识的大语言模型指令微调
-
SaaSHub
SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives
Python llama discussion
Python llama related posts
-
Structured Outputs on the Claude Developer Platform (API)
-
Tongyi DeepResearch – open-source 30B MoE Model that rivals OpenAI DeepResearch
-
Ask HN: Who uses open LLMs and coding assistants locally? Share setup and laptop
-
Meltdown Version Pi
-
Claude Code vs. Codex: I Built a Sentiment Dashboard from 500 Reddit Comments
-
LoRA Without Regret
-
Amazon Bedrock AgentCore Runtime - Part 6 Using AgentCore short-term Memory with Strands Agents SDK
-
A note from our sponsor - InfluxDB
www.influxdata.com | 15 Nov 2025
Index
What are some of the best open-source llama projects in Python? This list will help you:
| # | Project | Stars |
|---|---|---|
| 1 | vllm | 62,592 |
| 2 | LLaMA-Factory | 62,169 |
| 3 | unsloth | 48,261 |
| 4 | aider | 38,394 |
| 5 | fish-speech | 24,035 |
| 6 | LLaVA | 23,909 |
| 7 | sglang | 20,068 |
| 8 | Chinese-LLaMA-Alpaca | 18,945 |
| 9 | ChuanhuChatGPT | 15,431 |
| 10 | AstrBot | 13,227 |
| 11 | PaddleNLP | 12,843 |
| 12 | OpenLLM | 11,928 |
| 13 | ludwig | 11,616 |
| 14 | shell_gpt | 11,528 |
| 15 | petals | 9,787 |
| 16 | inference | 8,736 |
| 17 | GPTCache | 7,827 |
| 18 | lmdeploy | 7,266 |
| 19 | free-llm-api-resources | 6,541 |
| 20 | mergekit | 6,443 |
| 21 | Liger-Kernel | 5,836 |
| 22 | Baichuan-7B | 5,680 |
| 23 | Huatuo-Llama-Med-Chinese | 4,887 |