Top 23 Python large-language-model Projects

langflow

1 47 138,041 10.0 Python

Langflow is a powerful tool for building and deploying AI-powered agents and workflows.

Project mention: Building Custom Components in Langflow 🛠️ | dev.to | 2025-11-05

Langflow: https://www.langflow.org/
gpt_academic

2 2 69,658 8.5 Python

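A practical interactive interface for LLMs such as GPT and GLM, specially optimized for reading, polishing, and writing papers. Modular design with support for custom shortcut buttons and function plugins; project analysis and self-translation for Python, C++, and other codebases; PDF/LaTeX paper translation and summarization; parallel querying of multiple LLMs; and local models such as chatglm3. Integrates Tongyi Qianwen (Qwen), deepseekcoder, iFlytek Spark, ERNIE Bot, llama2, rwkv, claude2, moss, and more.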
LLaMA-Factory

3 8 62,169 9.7 Python

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Project mention: Llama-Factory: Unified, Efficient Fine-Tuning for 100 Open LLMs | news.ycombinator.com | 2025-09-18
storm

4 7 27,602 5.8 Python

An LLM-powered knowledge curation system that researches a topic and generates a full-length report with citations.

Project mention: Code Explanation: "STORM: Synthesis of Topic Outlines through Retrieval and Multi-perspective Question Asking" | dev.to | 2025-03-08

Note: this explanation only covers the knowledge_storm in the storm repo because it aligns with my interests.
LightRAG

5 10 22,597 10.0 Python

[EMNLP2025] "LightRAG: Simple and Fast Retrieval-Augmented Generation"

Project mention: 🍥 Hands-on Experience with LightRAG | dev.to | 2025-10-27

LightRAG examples: https://github.com/HKUDS/LightRAG/tree/main/examples
Qwen

6 8 19,710 6.0 Python

The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.

Project mention: Running Qwen, Nearly as Powerful as DeepSeek, on a MacBook Pro | dev.to | 2025-02-05

Qwen (Qwen GitHub Repository) has been gaining attention recently as a powerful open-source large language model (LLM). I decided to give it a spin on my MacBook Pro using Ollama, a platform designed for running local LLMs. While Qwen2.5-Max boasts the highest performance, my setup could only handle the smaller Qwen2.5 (32B) model. Here's what I found!
Chinese-LLaMA-Alpaca

7 4 18,945 2.3 Python

Chinese LLaMA & Alpaca large language models, with local CPU/GPU training and deployment
langextract

8 3 16,867 9.1 Python

A Python library for extracting structured information from unstructured text using LLMs with precise source grounding and interactive visualization.

Project mention: All Data and AI Weekly #203: 18-Aug-2025 | dev.to | 2025-08-18

langextract: A tool for extracting language information.
NeMo

9 31 16,065 9.9 Python

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

Project mention: FFmpeg 8.0 adds Whisper support | news.ycombinator.com | 2025-08-13

git clone https://github.com/NVIDIA/NeMo.git nemo
ChatGLM2-6B

10 4 15,703 5.0 Python

ChatGLM2-6B: An Open Bilingual Chat LLM
ml-engineering

11 12 15,734 9.3 Python

Machine Learning Engineering Open Book

Project mention: Real-time Nvidia GPU dashboard | news.ycombinator.com | 2025-10-06

For kernel-level performance tuning you can use the occupancy calculator as pointed out by jplusqualt or you can profile your kernel with Nsight compute which will give you a ton of info.
But for model-wide performance, you basically have to come up with your own calculation to estimate the FLOPs required by your model and based on that figure out how well your model is maxing out the GPU capabilities (MFU/HFU).
Here is a more in-depth example on how you might do this: https://github.com/stas00/ml-engineering/tree/master/trainin...
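As a rough illustration of the model-wide calculation described in the comment above, here is a minimal sketch of a Model FLOPs Utilization (MFU) estimate. It assumes the common approximation of about 6 FLOPs per parameter per token for training a dense transformer (forward plus backward pass, attention cost ignored). The function names and all numbers in the usage example are hypothetical, and this is not taken from the linked ml-engineering chapter.

```python
def training_flops_per_token(n_params: float) -> float:
    """Approximate training FLOPs per token for a dense transformer.

    Assumption: ~6 FLOPs per parameter per token (2 for the forward pass,
    4 for the backward pass); attention FLOPs are ignored.
    """
    return 6.0 * n_params


def mfu(n_params: float, tokens_per_second: float, peak_flops_per_second: float) -> float:
    """Return MFU as a fraction: achieved model FLOPs/s over hardware peak FLOPs/s."""
    if peak_flops_per_second <= 0:
        raise ValueError("peak_flops_per_second must be positive")
    achieved = training_flops_per_token(n_params) * tokens_per_second
    return achieved / peak_flops_per_second


if __name__ == "__main__":
    # Hypothetical run: a 7B-parameter model processing 3,000 tokens/s on one
    # GPU whose peak dense throughput is taken to be 312 TFLOP/s.
    value = mfu(n_params=7e9, tokens_per_second=3_000, peak_flops_per_second=312e12)
    print(f"MFU = {value:.1%}")
```

HFU (hardware FLOPs utilization) would additionally count recomputed work such as activation checkpointing, so for the same run it is at least as large as MFU.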
camel

12 16 14,781 9.9 Python

🐫 CAMEL: The first and the best multi-agent framework. Finding the Scaling Law of Agents. https://www.camel-ai.org

Project mention: Revisiting Minsky's Society of Mind in 2025 | news.ycombinator.com | 2025-06-18

It seems like you might be confusing "research programs" with things like "branding" and superficial terminology. Here, enjoy this thing clearly building on SoM and edited earlier this week: ideas https://github.com/camel-ai/camel/blob/master/camel/societie...
Megatron-LM

13 20 14,189 9.9 Python

Ongoing research training transformer models at scale
agentscope

14 4 13,842 8.7 Python

AgentScope: Agent-Oriented Programming for Building LLM Applications

Project mention: All Data and AI Weekly #207: 15 Sept 2025 | dev.to | 2025-09-15

GitHub Link: https://github.com/agentscope-ai/agentscope Summary: Agentscope is an agent-oriented programming library that makes it easier to build LLM applications. It's designed to be "developer-centric" with features like asynchronous execution, parallel tool calls, and real-time steering. It offers a transparent approach where prompt engineering and API invocation are fully visible and controllable. Why it's important: Agentscope, along with its related libraries like agentscope-runtime and agentscope-studio, provides a comprehensive toolkit for not only developing but also deploying and visualizing agent-based applications.
litgpt

15 2 12,918 9.2 Python

20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.

Project mention: Lightning.ai – an enterprise managed inference platform for AI | news.ycombinator.com | 2025-10-09

After making model training simpler with PyTorch Lightning, Lightning.AI is now tackling the next bottleneck — inference. Their new managed service targets enterprises deploying LLMs and deep learning models at scale, emphasizing performance, cost-efficiency, and developer-friendly tooling.
Platform: https://lightning.ai/
MOSS

16 4 12,049 4.7 Python

An open-source tool-augmented conversational language model from Fudan University
LLMSurvey

17 3 11,956 7.3 Python

The official GitHub page for the survey paper "A Survey of Large Language Models".
txtai

18 392 11,819 9.4 Python

💡 All-in-one open-source AI framework for semantic search, LLM orchestration and language model workflows

Project mention: The AI-Native GraphDB + GraphRAG + Graph Memory Landscape & Market Catalog | dev.to | 2025-10-26

GitHub: https://github.com/neuml/txtai
HRM

19 13 11,683 5.3 Python

Hierarchical Reasoning Model Official Release

Project mention: Is chain-of-thought AI reasoning a mirage? | news.ycombinator.com | 2025-08-14

This paper I read from here has an interesting mathematical model for reasoning based on cognitive science. https://arxiv.org/abs/2506.21734 (there is also code here https://github.com/sapientinc/HRM) I think we will see dramatic performance increases on "reasoning" problems when this is worked into existing AI architectures.
petals

20 102 9,787 7.2 Python

🌸 Run LLMs at home, BitTorrent-style. Fine-tuning and inference up to 10x faster than offloading

Project mention: Petals: Run large language models at home, BitTorrent‑style | news.ycombinator.com | 2025-05-27
PentestGPT

21 18 9,054 4.6 Python

A GPT-empowered penetration testing tool
deeplake

22 15 8,894 6.3 Python

Database for AI. Store Vectors, Images, Texts, Videos, etc. Use with LLMs/LangChain. Store, query, version, & visualize any AI data. Stream data in real-time to PyTorch/TensorFlow. https://activeloop.ai

Project mention: What I Learned Comparing Zilliz Cloud and Deep Lake for Scalable Vector Search | dev.to | 2025-06-09

As I scaled up a semantic search engine for multi-modal content, I found myself at a fork in the road. Should I lean into a purpose-built vector database like Zilliz Cloud, or embrace a more flexible data lake approach with Deep Lake? These tools promise vector search at scale—but they come from fundamentally different architectural philosophies.
optimate

23 105 8,366 5.8 Python

A collection of libraries to optimise AI model performances

NOTE: The open source projects on this list are ordered by number of GitHub stars. The number of mentions indicates repo mentions in the last 12 months or since we started tracking (Dec 2020).

Python large-language-models related posts

Building Custom Components in Langflow 🛠️

2 projects | dev.to | 5 Nov 2025
🍥 Hands-on Experience with LightRAG

1 project | dev.to | 27 Oct 2025
5 Ways to Detect AI Agent Hallucinations

1 project | dev.to | 26 Oct 2025
Evalúa y Mejora Tus Agentes: Evaluación Automatizada con RAGAS para Agentes de Producción

2 projects | dev.to | 15 Oct 2025
Real-time Nvidia GPU dashboard

2 projects | news.ycombinator.com | 6 Oct 2025
Lightning.ai – an enterprise managed inference platform for AI

1 project | news.ycombinator.com | 9 Oct 2025
My very first hands-on experience with Langflow

2 projects | dev.to | 21 Sep 2025

Index

What are some of the best open-source large-language-model projects in Python? This list will help you:

#	Project	Stars
1	langflow	138,041
2	gpt_academic	69,658
3	LLaMA-Factory	62,169
4	storm	27,602
5	LightRAG	22,597
6	Qwen	19,710
7	Chinese-LLaMA-Alpaca	18,945
8	langextract	16,867
9	NeMo	16,065
10	ChatGLM2-6B	15,703
11	ml-engineering	15,734
12	camel	14,781
13	Megatron-LM	14,189
14	agentscope	13,842
15	litgpt	12,918
16	MOSS	12,049
17	LLMSurvey	11,956
18	txtai	11,819
19	HRM	11,683
20	petals	9,787
21	PentestGPT	9,054
22	deeplake	8,894
23	optimate	8,366

Did you know that Python is the 2nd most popular programming language based on number of references?
