The repository offers a user-friendly web interface for interacting with Large Language Models (LLMs). Users can install it by cloning the GitHub repository and running it locally. The app supports multiple LLMs such as Llama and Mistral, accessible through its chat interface.
Features include:
- Regenerating LLM responses
- Branching conversations
- Saving and loading chat sessions
- Customizable generation settings
- Customizable system messages
- Integration of user-defined APIs/tools
- Various pre-installed tools for LLMs (coming later)
Users can tailor the application's behavior to their needs, and integrating external tools and APIs gives LLMs extra capabilities during conversations.
This project is still in early development; it may have bugs and limitations, so use it at your own risk. The web-based GUI is not yet complete: some important GUI elements are missing, and certain features are experimental and may be removed later.
To get started with the application, follow the steps below to install it in your local environment. You will need:
- Docker Engine or Docker Desktop
- docker-compose
First, clone the repository:
git clone https://github.com/X-rayLaser/penpal.git
cd penpal
Create a webapp.env file storing the environment variables used by the app containers:
LLM_HOST=<Host name or IP address of your LLM server>
LLM_PORT=<Port on which LLM server is listening> # 9100 by default
NGINX_HOST=<Host name or IP address of NGINX server> # localhost by default
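For example, if your LLM server runs on another machine on your local network (the address 192.168.0.42 below is just a placeholder), webapp.env might look like this:
LLM_HOST=192.168.0.42
LLM_PORT=9100
NGINX_HOST=localhost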
Make the bash scripts below executable:
chmod +x scripts/build_assets.sh
chmod +x scripts/run_django_server.sh
Build the Docker image(s):
docker-compose -f docker-compose.production.yml build
Generate a secret key by executing the command below:
docker-compose -f docker-compose.production.yml run --no-deps webapp python scripts/generate_key.py
Apply Django migrations:
docker-compose -f docker-compose.production.yml run --no-deps webapp python manage.py migrate
Run the app:
docker-compose -f docker-compose.production.yml up
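To keep the app running in the background, add the standard docker-compose -d (detached) flag:
docker-compose -f docker-compose.production.yml up -d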
Stop the app (when launched with the -d flag):
docker-compose -f docker-compose.production.yml stop
To start using the web app to interact with Large Language Models (LLMs), follow the steps below:
- Clone and build the llama.cpp project, which includes an LLM server.
- Obtain a Large Language Model in GGUF format. You may download models from sources like Hugging Face or convert existing models to this format using utilities provided in the llama.cpp project.
Clone the llama.cpp project:
git clone https://github.com/ggerganov/llama.cpp.git
cd llama.cpp
Build it by running make:
make
Run the server and test it. On Unix-based systems (Linux, macOS, etc.):
./server -m models/7B/ggml-model.gguf -c 2048 --port 9000
On Windows:
server.exe -m models\7B\ggml-model.gguf -c 2048 --port 9000
Try generating a few tokens for a prompt:
curl --request POST \
--url http://localhost:9000/completion \
--header "Content-Type: application/json" \
--data '{"prompt": "Building a website can be done in 10 simple steps:","n_predict": 128}'
Refer to the llama.cpp repository for more information on build options (including how to build with CUDA support) and for details about the server app.
After you make sure that it works, you can stop it and change the working directory back:
cd ..
Finally, run llm_services/llamacpp.py:
python llm_services/llamacpp.py <model> --port 9000 -c 512 -ngl 0
Replace <model> with the path to your model file in GGUF format. Use the --port option to specify the port for the LLM server to listen on, the -c option to set the context size, and the -ngl option to set the number of layers to offload to the GPU.
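For instance, assuming the model file downloaded earlier sits at models/7B/ggml-model.gguf (adjust the path, context size, and GPU layer count to your setup), a typical invocation might look like this:
python llm_services/llamacpp.py models/7B/ggml-model.gguf --port 9000 -c 2048 -ngl 0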
- Create a new file named local_settings.py in the project directory of your web app (mysite).
- Add the following configuration to the file:
LLM_SETTINGS = {
"generator": {
"class": "llm_utils.generators.RemoteLLM",
"kwargs": {
"host": "localhost",
"port": 9000
}
}
}
Now your web app is configured to communicate with your local LLM server and use it for token generation.
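The "class"/"kwargs" pair follows the common dotted-path configuration pattern: a class is located by its import path and instantiated with the given keyword arguments. The snippet below only illustrates that pattern; get_generator and its use of importlib are hypothetical and not part of the project's code base:

import importlib

# Hypothetical helper illustrating how a dotted-path config entry
# is typically resolved into an object.
def get_generator(settings):
    generator_conf = settings["generator"]
    module_path, class_name = generator_conf["class"].rsplit(".", 1)
    cls = getattr(importlib.import_module(module_path), class_name)
    return cls(**generator_conf["kwargs"])

# With the LLM_SETTINGS above, this would import llm_utils.generators.RemoteLLM
# and construct it with host="localhost" and port=9000:
# generator = get_generator(LLM_SETTINGS)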
Run the Django development server:
python manage.py runserver 8000
If you are running the code in Vagrant or inside a virtual machine, make sure to forward port 8000 and run the server this way:
python manage.py runserver 0.0.0.0:8000
Open a browser and navigate to http://localhost:8000. You should see the web app.