Official large language model (LLM) server for the narrator and NPCs in the Raymond Maarloeve game project.

A lightweight REST API for managing the local language models used by NPCs and the narrator in the game. It supports loading multiple models simultaneously, chat-style response generation, and dynamic resource management.
Full project documentation is available at:
🔗 https://raymondmaarloeve.github.io/LLMServer/
Main repo:
🔗 https://github.com/RaymondMaarloeve/RaymondMaarloeve
- 🔁 Supports multiple LLMs simultaneously (keyed by `model_id`)
- 🔌 Simple `/chat` endpoint with full conversation history handling
- 🚦 Automatic response termination detection using special tags (`<npc>`, `<human>`, etc.; see the sketch below)
- 🧹 Ability to unload models from memory (`/unload`)
- 📂 File browsing via API (`/list-files`)
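One plausible way to implement that termination detection is llama-cpp-python's `stop` parameter: generation halts as soon as the model emits one of the tags, so an NPC line never bleeds into the next speaker's turn. A minimal sketch of the idea (the model path is reused from the `/load` example below; this is an illustration, not the project's actual code):

```python
from llama_cpp import Llama

# Illustrative model path, borrowed from the /load example below.
llm = Llama(model_path="models/ggml-npc-q4.bin", n_ctx=2048)

# Generation stops the moment one of these tags is emitted,
# which cuts the reply off cleanly at a turn boundary.
result = llm.create_chat_completion(
    messages=[{"role": "user", "content": "Hello there!"}],
    stop=["<npc>", "<human>"],
)
print(result["choices"][0]["message"]["content"])
```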
- Python 3.12
- Flask – REST API framework
- llama-cpp-python – Python bindings for llama.cpp, used to run local LLaMA models
- PyInstaller – packages the server into a single binary
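To show how these pieces fit together, here is a heavily simplified sketch of a Flask server wrapping llama-cpp-python. It mirrors the `/load` and `/chat` endpoints described below, but error handling and locking are omitted and the port is an assumption; the real implementation lives in `main.py`:

```python
from flask import Flask, jsonify, request
from llama_cpp import Llama

app = Flask(__name__)
models = {}  # model_id -> loaded Llama instance

@app.route("/load", methods=["POST"])
def load():
    body = request.get_json()
    # n_ctx and n_gpu_layers map directly onto llama-cpp-python's
    # Llama constructor, matching the /load payload shown below.
    models[body["model_id"]] = Llama(
        model_path=body["model_path"],
        n_ctx=body.get("n_ctx", 2048),
        n_gpu_layers=body.get("n_gpu_layers", 0),
    )
    return jsonify({"status": "loaded", "model_id": body["model_id"]})

@app.route("/chat", methods=["POST"])
def chat():
    body = request.get_json()
    llm = models[body["model_id"]]
    result = llm.create_chat_completion(messages=body["messages"])
    return jsonify(result["choices"][0]["message"])

if __name__ == "__main__":
    app.run(port=5000)  # assumed port; the README does not pin one down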
- Run the server:

  ```bash
  python main.py
  ```

- Load a model:

  ```
  POST /load
  {
    "model_id": "npc_village",
    "model_path": "models/ggml-npc-q4.bin",
    "n_ctx": 2048,
    "n_gpu_layers": 16
  }
  ```

- Send a chat request:

  ```
  POST /chat
  {
    "model_id": "npc_village",
    "messages": [
      {"role": "system", "content": "You are a grumpy blacksmith."},
      {"role": "user", "content": "Hello there!"},
      {"role": "assistant", "content": "Hmph. What do you want?"},
      {"role": "user", "content": "Got any gossip?"}
    ]
  }
  ```

- Receive the response and display it in-game (see the client sketch below).
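From the game's side (or any other client), this flow is a pair of HTTP calls. A quick sketch with Python's `requests`; the host and port are assumptions, since the README does not specify them:

```python
import requests

BASE = "http://127.0.0.1:5000"  # assumed address of the running server

# Load the model once, e.g. at game startup.
requests.post(f"{BASE}/load", json={
    "model_id": "npc_village",
    "model_path": "models/ggml-npc-q4.bin",
    "n_ctx": 2048,
    "n_gpu_layers": 16,
}).raise_for_status()

# Request a reply, sending the full conversation history each time.
response = requests.post(f"{BASE}/chat", json={
    "model_id": "npc_village",
    "messages": [
        {"role": "system", "content": "You are a grumpy blacksmith."},
        {"role": "user", "content": "Got any gossip?"},
    ],
})
print(response.json())
```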
To build a standalone version:

```bash
CMAKE_ARGS="-DGGML_VULKAN=on" uv pip install llama-cpp-python --no-cache
uv run pyinstaller --onefile --additional-hooks-dir hooks main.py
```

The first command rebuilds llama-cpp-python with its Vulkan backend enabled, so the packaged binary can use the GPU.

| Endpoint | Description |
|---|---|
| `/load` | Load a model into memory |
| `/chat` | Generate a response in chat style |
| `/unload` | Release model resources |
| `/status` | Check available models and GPU status |
| `/list-files` | List files in a specified directory |
| `/register` | Register a model for lazy-loading |
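`/register` differs from `/load` in that, presumably, the model file is only read into memory on first use. A plausible request shape, mirroring the `/load` payload; the field names here are an assumption, so check the generated docs linked above for the real schema:

```python
import requests

# Hypothetical payload mirroring /load; the real schema may differ.
requests.post("http://127.0.0.1:5000/register", json={
    "model_id": "narrator",
    "model_path": "models/ggml-narrator-q4.bin",
})
```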
The `LLMServer` project is the foundation of narration and NPC behavior in the world of Raymond Maarloeve.