Gen AI LLM API

API (Fastapi application) to explore Generative AI and Large Language Models using Ollama.

Features

Retrieval Augmented Generation (RAG)

Requirements

Python 3.12
Fastapi
Postgres as vector database (pgai)
Pytest
Docker

Installation

Clone Project

git clone https://github.com/taiyeoguns/gen-ai-llm-api.git

Add details in `.env` file

Create .env file from example file and maintain necessary details in it.

cp .env.example .env

The following environment variables should be set in the .env file even if they do not 'exist', the docker postgres image will use them for setting up the container - POSTGRES_USER, POSTGRES_PASSWORD, POSTGRES_DB.

OLLAMA_GENERATION_MODEL and OLLAMA_EMBEDDING_MODEL should be set to values that exist in the Ollama Registry for generation model and embedding model respectively. For example, you can set OLLAMA_GENERATION_MODEL to llama3.2:1b and OLLAMA_EMBEDDING_MODEL to nomic-embed-text.

Run application with Docker

It is advisable to run the entire application with Docker to ensure all components needed are set up correctly. Ensure database details are added to .env file from earlier.

Note: Make sure system to run application has adequate resources e.g. CPU, GPU to run the models.

To run the application with GPU support, install NVIDIA container toolkit from here: https://hub.docker.com/r/ollama/ollama

Also update the docker-compose.yml file, ollama_service section with:

deploy:
    resources:
    reservations:
        devices:
        - driver: nvidia
        capabilities: ["gpu"]
        count: all

Full ollama_service in docker compose file will look like:

ollama_service:
    container_name: ollama_container
    build:
      context: ./.docker/ollama
    ports:
      - "11434:11434"
    deploy:
      resources:
        reservations:
          devices:
          - driver: nvidia
            capabilities: ["gpu"]
            count: all
    healthcheck:
      test: ["CMD-SHELL", "curl -f http://localhost:11434"]
      interval: 10s
      timeout: 5s
      retries: 5
    volumes:
      - ollama_data:/root/.ollama

With Docker and Docker Compose set up, run:

make docker-run

Thereafter, application should be available at http://localhost:8000

OpenAPI documentation page should also be available at http://localhost:8000/docs

Retrieval Augmented Generation (RAG)

To test the RAG implementation, create new page content by navigating to /pages/create in the browser or send a POST request to the API endpoint /v1/pages.

After creating page content, test the chat with the AI model by sending a POST HTTP request to the /v1/chat endpoint and ask questions about the created content.

Tests

In command prompt, run:

make test

Name		Name	Last commit message	Last commit date
Latest commit History 38 Commits
.devcontainer		.devcontainer
.docker		.docker
.github/workflows		.github/workflows
.vscode		.vscode
app		app
logs		logs
migrations		migrations
tests		tests
.dockerignore		.dockerignore
.env.example		.env.example
.gitignore		.gitignore
.pre-commit-config.yaml		.pre-commit-config.yaml
.python-version		.python-version
Dockerfile		Dockerfile
Makefile		Makefile
alembic.ini		alembic.ini
config.py		config.py
docker-compose.yml		docker-compose.yml
readme.md		readme.md
requirements-dev.txt		requirements-dev.txt
requirements-test.txt		requirements-test.txt
requirements.txt		requirements.txt
run.py		run.py
start.sh		start.sh

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Gen AI LLM API

Features

Requirements

Installation

Clone Project

Add details in `.env` file

Run application with Docker

Retrieval Augmented Generation (RAG)

Tests

About

Uh oh!

Releases

Packages

Uh oh!

Languages

taiyeoguns/gen-ai-llm-api

Folders and files

Latest commit

History

Repository files navigation

Gen AI LLM API

Features

Requirements

Installation

Clone Project

Add details in .env file

Run application with Docker

Retrieval Augmented Generation (RAG)

Tests

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Languages

Add details in `.env` file

Packages