-
fastllm Public
Forked from ztxz16/fastllm纯c++的全平台llm加速库,支持python调用,chatglm-6B级模型单卡可达10000+token / s,支持glm, llama, moss基座,手机端流畅运行
C++ Apache License 2.0 UpdatedAug 20, 2025 -
mcp-deep-research Public
An mcp server designed for local-deployed deep research.
Python MIT License UpdatedAug 16, 2025 -
ktransformers Public
Forked from kvcache-ai/ktransformersA Flexible Framework for Experiencing Cutting-edge LLM Inference Optimizations
Python Apache License 2.0 UpdatedMay 7, 2025 -
FreeBeyond Public
A simple bash script to automatically deploy Xray(Vless) or NaiveProxy service.
HTML GNU General Public License v3.0 UpdatedNov 11, 2024 -
KaiWu Public
An LLM PDF chat framework optimized for research articles and local deployment.
-
-
-
AemConvertor Public
A simple python script to convert .aem to .obj or .obj to .aem
-
text-generation-webui Public
Forked from oobabooga/text-generation-webuiA gradio web UI for running Large Language Models like LLaMA, llama.cpp, GPT-J, Pythia, OPT, and GALACTICA.
Python GNU Affero General Public License v3.0 UpdatedJun 7, 2023 -