Areg Edge AI Agent

An Edge AI agent service provider and consumer built on top of the Areg SDK for distributed, event-driven systems.

Important

This repository is under active construction and may not be fully functional yet. Follow the repository to receive updates as development progresses.

Why Edge AI on Areg

Areg SDK is a framework with tools for building distributed applications where communication, concurrency, and service lifecycle are first class concepts. It is designed for systems that require predictability, clear ownership, and strong runtime observability.

This repository demonstrates how Edge AI agents can be modeled as Areg services. Each agent acts as a service provider, a service consumer, or both, and communicates through well-defined interfaces instead of ad hoc message passing. Areg manages discovery, message routing, threading, and fault isolation, so the AI logic stays focused, testable, and reusable.

A key strength of this approach is observability. Areg provides structured logging that captures service interactions, message flow, and per-method performance metrics. These logs can be inspected in real time or offline using Lusan, the official UI toolset for debugging and monitoring Areg-based systems.

In practice, this means Edge AI systems built on Areg are transparent, traceable, and ready for production from the start.

Project Building

Projects in this repository are built with the Areg SDK, Qt for UI development, and the llama.cpp Edge AI engine. CMake is the primary build system, and both the Areg SDK and llama.cpp are downloaded and built automatically during the build. Building with Qt Creator is recommended, though command-line builds are fully supported if the required Qt packages are installed.

Important

To run the applications, you must have AI models compatible with llama.cpp. Compatible models are available on Hugging Face; filter by llama.cpp and TextGeneration. For convenience, place the models in the ./models/llama/text directory so they are automatically copied to the application working directory during build.
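
Before building, it can help to check which model files would be picked up from the models directory. The snippet below is a minimal sketch, not part of this repository: `list_models` is a hypothetical helper, and it assumes the models are stored as GGUF files (the format llama.cpp loads) directly inside the directory.

```python
from pathlib import Path


def list_models(model_dir):
    """Return the names of GGUF files found directly in model_dir, sorted.

    Hypothetical helper for illustration; the applications use their own
    logic to list models.
    """
    root = Path(model_dir)
    if not root.is_dir():
        return []  # a missing directory simply means no models
    return sorted(
        p.name for p in root.iterdir()
        if p.is_file() and p.suffix.lower() == ".gguf"
    )


# Directory named in the note above; prints an empty list if it is missing.
print(list_models("./models/llama/text"))
```

An empty list suggests that no compatible model has been placed in the directory yet.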

Requirements

- C++17 compatible compiler: GCC, Clang, MSVC, or MinGW.
- CMake 3.20 or newer: required to build the applications.
- Java 17 or newer: required for the code generation tools.
- Qt 5.12 or newer: required for UI-based projects.
- Supported Edge AI engine: currently, only llama.cpp is supported.
- Other tools: for advanced log or service interface visualization, the optional Lusan tool can be built and used alongside these projects.

Supported Platforms

- Operating systems: Linux and Windows
- Hardware architectures: x86, x86_64, ARM, and AArch64

Planned Use Cases

The following use cases illustrate how Areg can be applied to Edge AI systems. These cases represent the intended direction of the project.

Use Case 1: One AI Agent Serving Multiple Clients

Note

This use case is fully functional and can be tested by running the aiagent application as the AI Service Provider and one or more edgedevice applications as Service Consumers (device simulations).

A single AI agent processes text requests from multiple clients connected over a local network. Clients may join or leave at any time. When the AI agent is online, it accepts requests, queues them internally, and returns each response to the correct client without mixing results.

Key capabilities demonstrated:

No startup order dependency -- the AI Service Provider and Service Consumers can start, stop, join, or leave the network at any time without affecting system stability.
Request queuing and isolation -- all requests are queued on the AI agent side, and each response is routed back to the originating client.
Automatic service discovery -- when the AI agent becomes available, all connected clients are notified and can immediately start communication.
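
The queuing and isolation behavior described above can be illustrated with a small, language-neutral sketch. It is not the Areg API or the aiagent implementation: `Request`, `AgentQueue`, and `fake_infer` are hypothetical names, and the inference call is a placeholder for a real llama.cpp invocation. The point is only that every queued request carries the identity of its sender, so each reply can be routed back without mixing results.

```python
from collections import deque
from dataclasses import dataclass


@dataclass
class Request:
    client_id: str   # which consumer sent the request
    request_id: int  # per-client sequence number
    prompt: str


def fake_infer(prompt):
    # Placeholder for a real model call.
    return "echo: " + prompt


class AgentQueue:
    """Hypothetical stand-in for the AI agent's internal request queue."""

    def __init__(self, infer):
        self._infer = infer      # callable that maps a prompt to a reply
        self._pending = deque()  # FIFO queue of waiting requests

    def submit(self, request):
        self._pending.append(request)

    def process_all(self):
        """Drain the queue in arrival order; group replies by client."""
        replies = {}
        while self._pending:
            req = self._pending.popleft()
            reply = self._infer(req.prompt)
            replies.setdefault(req.client_id, []).append(
                (req.request_id, reply)
            )
        return replies


# Two clients interleave their requests; replies stay separated per client.
queue = AgentQueue(fake_infer)
queue.submit(Request("device-a", 1, "hello"))
queue.submit(Request("device-b", 1, "status"))
queue.submit(Request("device-a", 2, "bye"))
replies = queue.process_all()
```

In the real system the routing is handled by Areg's request and response mechanism rather than by a dictionary; the sketch only makes the bookkeeping visible.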

Steps to Run the Demo

Build the project
Build using Qt Creator or the CMake command line. Optionally, build the lusan application for log visualization.
Start mtrouter (multi-target router)
Run mtrouter as a console application or system service on any machine in the network.
- Ensure the ./config/areg.init file is present in the working directory.
- Verify that the router IP address and port are correctly configured using
  router::*::address::tcpip and
  router::*::port::tcpip.
(Optional) Start logcollector
If you plan to visualize logs in Lusan, start logcollector as a console application or system service.
- Ensure ./config/areg.init is available.
- Configure the logger IP address and port using
  logger::*::address::tcpip and
  logger::*::port::tcpip.
Start aiagent (AI Service Provider)
Run aiagent on a sufficiently powerful machine.
- Ensure ./config/areg.init is present and correctly configured for mtrouter (and optionally logcollector).
- The router configuration is automatically loaded from areg.init and displayed on the Router Connection page. The screenshot shows this page after the connection is established and a model is activated:
- AI models located in the ./models/llama/text/ folder of the working directory are automatically listed.
- Select a model, choose the desired Reply Quality, and optionally adjust parameters such as Text Length and Threads Use.
- Click Connect to connect to mtrouter and activate the model.
- Models and parameters can be changed at runtime using the Activate button.
- If models are stored elsewhere, use Browse... to select a different model directory.
- Once connected, the application automatically switches to the AI Agent Chat page.
Start one or more edgedevice instances (Service Consumers)
Run edgedevice on one or more less powerful machines.
- Ensure each instance has its own ./config/areg.init file with correct mtrouter (and optional logcollector) settings.
- The router configuration is automatically displayed on the Router Connection page. The screenshot shows this page after the connection is established, with a model already activated:
- Click Connect to join the network.
- Once connected, the application automatically switches to the AI Agent Chat page.
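
For the configuration checks in the steps above, the relevant part of ./config/areg.init might look roughly like the fragment below. This is a sketch under assumptions: only the four key names come from the steps; the `key = value` layout and `#` comments are assumed, and the addresses and ports are placeholders that must be replaced with the values of your own network.

```
# Where mtrouter can be reached (placeholder address and port)
router::*::address::tcpip = 192.168.1.10
router::*::port::tcpip    = 8181

# Where logcollector can be reached (optional, placeholder values)
logger::*::address::tcpip = 192.168.1.10
logger::*::port::tcpip    = 8282
```

Every aiagent and edgedevice instance should point to the same router address and port, otherwise the instances cannot discover each other.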

Example: Multiple edgedevice instances communicating simultaneously with a single aiagent:

Tip

mtrouter, logcollector, aiagent, and edgedevice can be started in any order. Services are discovered automatically as they become available. Multiple requests can be sent without waiting for previous responses: each edgedevice instance and each request receives its own independent reply from the AI agent.
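
The order-independent startup described in the tip can be pictured with the following sketch. It is not the Areg discovery mechanism: `Registry` is a hypothetical stand-in for the role the router plays, and the callbacks stand in for the availability notifications that consumers receive. The idea is that a consumer is notified whether it starts before or after the provider.

```python
class Registry:
    """Hypothetical stand-in for the router's service discovery role."""

    def __init__(self):
        self._providers = set()  # names of services currently online
        self._watchers = {}      # service name -> list of callbacks

    def watch(self, service, callback):
        """Register a consumer; notify at once if the provider is online."""
        self._watchers.setdefault(service, []).append(callback)
        if service in self._providers:
            callback(service, True)

    def provider_up(self, service):
        self._providers.add(service)
        for callback in self._watchers.get(service, []):
            callback(service, True)

    def provider_down(self, service):
        self._providers.discard(service)
        for callback in self._watchers.get(service, []):
            callback(service, False)


events = []
registry = Registry()

# One consumer starts before the provider, another one after it.
registry.watch("aiagent", lambda name, up: events.append(("early", up)))
registry.provider_up("aiagent")
registry.watch("aiagent", lambda name, up: events.append(("late", up)))
registry.provider_down("aiagent")
```

Both consumers end up with the same view of the provider: available first, then unavailable once it leaves the network.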

Use Case 2: Multiple AI Agents Managed by a Central Service

This use case is part of the current implementation.

A central service provider receives requests from many clients. For each request, the provider starts a dedicated thread or process hosting an AI engine instance. Processing runs in parallel and results are returned to the provider, which forwards them to the originating clients.
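
The flow described above can be sketched as follows. This is an illustration only, not the design of this repository: `CentralProvider` and `fake_engine` are hypothetical names, the engine is a placeholder for a real AI engine instance, and threads are used where the real service might equally start processes.

```python
import threading


def fake_engine(prompt):
    # Placeholder for a dedicated AI engine instance.
    return prompt.upper()


class CentralProvider:
    """Hypothetical central service running one worker thread per request."""

    def __init__(self, engine):
        self._engine = engine
        self._lock = threading.Lock()
        self._results = {}  # client id -> list of (request id, reply)
        self._workers = []

    def handle(self, client_id, request_id, prompt):
        """Start a dedicated worker for this request and return at once."""
        worker = threading.Thread(
            target=self._run, args=(client_id, request_id, prompt)
        )
        self._workers.append(worker)
        worker.start()

    def _run(self, client_id, request_id, prompt):
        reply = self._engine(prompt)
        with self._lock:  # results arrive from several threads
            self._results.setdefault(client_id, []).append(
                (request_id, reply)
            )

    def wait(self):
        """Join all workers and return the replies grouped by client."""
        for worker in self._workers:
            worker.join()
        # Completion order is not deterministic, so sort per client.
        return {cid: sorted(items) for cid, items in self._results.items()}


provider = CentralProvider(fake_engine)
provider.handle("device-a", 1, "hello")
provider.handle("device-b", 1, "status")
provider.handle("device-a", 2, "bye")
results = provider.wait()
```

Because every worker records the client and request identifiers together with its reply, the provider can forward each result to the originating client even when workers finish out of order.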