
ai-training-project

A comprehensive project structure for AI model development, training, evaluation, and deployment.

Project Structure

This repository provides a well-organized structure for developing AI projects:

ai-training-project/
├── configs/       # Configuration files
├── data/          # Dataset storage
├── notebooks/     # Jupyter notebooks
├── performance/   # API load testing
├── scripts/       # Utility scripts
├── src/           # Main source code
├── api/           # API implementation
├── ui/            # Gradio UI
├── export/        # Model export utilities
├── evaluation/    # Evaluation code
├── tests/         # Test suite
├── weights/       # Model weights
├── logs/          # Training logs
└── docs/          # Documentation

Getting Started

Prerequisites

  • Python 3.8+ (the setup below creates a Python 3.10 environment)
  • uv package manager

Setup

  1. Clone this repository:

    git clone https://github.com/MetythornPenn/ai-stack.git konai
    cd konai
  2. Create and activate a virtual environment using uv:

    make env 
    # or 
    uv venv -p 3.10
    
    source .venv/bin/activate  # On Windows: .venv\Scripts\activate
  3. Install required packages:

    make install
    # or 
    uv sync

Usage

The Makefile provides shortcuts for common operations:

Development

  • Run tests: make test
  • Format code: make format
  • Run linters: make lint
  • Clean temporary files: make clean

Training and Evaluation

  • Train a model: make train
  • Evaluate a model: make eval
  • Export a model: make export

Deployment

  • Run API server: make api
  • Run Gradio UI: make ui

Configuration

The project uses YAML configuration files located in the configs/ directory:

  • configs/model/ - Model architecture configurations
  • configs/training/ - Training configurations
  • configs/inference/ - Inference configurations

Example:

# configs/training/default.yaml
project:
  name: "ai_training_project"
  
data:
  dataset: "custom_dataset"
  batch_size: 32

model:
  architecture: "resnet50"
  num_classes: 10

training:
  epochs: 100
  optimizer:
    name: "adam"
    learning_rate: 0.001
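
For reference, a config like this can be read with PyYAML (a minimal sketch; the project's own loader, if any, may live elsewhere in src/):

    # Minimal config loader sketch; assumes PyYAML is installed (uv add pyyaml).
    import yaml

    def load_config(path: str) -> dict:
        """Read a YAML config file into a nested dict."""
        with open(path, "r", encoding="utf-8") as f:
            return yaml.safe_load(f)

    cfg = load_config("configs/training/default.yaml")
    print(cfg["model"]["architecture"])                   # -> resnet50
    print(cfg["training"]["optimizer"]["learning_rate"])  # -> 0.001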

Data Management

Place your datasets in the appropriate directories:

  • data/raw/ - Raw, unprocessed data
  • data/processed/ - Processed, ready-to-use data
  • data/interim/ - Intermediate data processing results
  • data/external/ - External data sources
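
To keep these paths consistent across scripts, a small helper module can centralize them (an illustrative sketch; the project may organize this differently):

    # src/data/paths.py (hypothetical) - central data-directory constants.
    from pathlib import Path

    DATA_DIR = Path("data")
    RAW_DIR = DATA_DIR / "raw"
    PROCESSED_DIR = DATA_DIR / "processed"
    INTERIM_DIR = DATA_DIR / "interim"
    EXTERNAL_DIR = DATA_DIR / "external"

    def ensure_dirs() -> None:
        """Create the data directories if they do not exist yet."""
        for d in (RAW_DIR, PROCESSED_DIR, INTERIM_DIR, EXTERNAL_DIR):
            d.mkdir(parents=True, exist_ok=True)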

Model Development

  1. Define your model architecture in src/models/
  2. Implement data loading in src/data/
  3. Configure training in configs/
  4. Run training with make train
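
As an example of step 1, an architecture in src/models/ might look like the following (a PyTorch sketch, assuming the project uses torch; the class and file names are illustrative):

    # src/models/architectures/simple_cnn.py (hypothetical)
    import torch
    import torch.nn as nn

    class SimpleCNN(nn.Module):
        """Small CNN whose output size matches num_classes in the config."""

        def __init__(self, num_classes: int = 10):
            super().__init__()
            self.features = nn.Sequential(
                nn.Conv2d(3, 32, kernel_size=3, padding=1),
                nn.ReLU(),
                nn.AdaptiveAvgPool2d(1),  # -> (N, 32, 1, 1)
            )
            self.classifier = nn.Linear(32, num_classes)

        def forward(self, x: torch.Tensor) -> torch.Tensor:
            return self.classifier(self.features(x).flatten(1))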

Evaluation and Export

  1. Evaluate your model with make eval
  2. Export your model to different formats:
    python scripts/export/export_model.py --weights weights/checkpoints/model.pt --format onnx,tflite
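
For ONNX specifically, such an export typically reduces to torch.onnx.export (a sketch assuming a PyTorch checkpoint; scripts/export/export_model.py may handle this differently):

    # Illustrative ONNX export; model class, paths, and input shape are assumptions.
    import torch
    from src.models.architectures.simple_cnn import SimpleCNN  # hypothetical module

    model = SimpleCNN(num_classes=10)
    state = torch.load("weights/checkpoints/model.pt", map_location="cpu")
    model.load_state_dict(state)
    model.eval()

    dummy = torch.randn(1, 3, 224, 224)  # assumed input shape
    torch.onnx.export(model, dummy, "weights/model.onnx", opset_version=17)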

API and UI

  • Start the FastAPI server: make api
  • Start the Gradio UI: make ui

The API server will be available at http://localhost:8000 with documentation at http://localhost:8000/docs.
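
A stripped-down version of such a server might look like this (a sketch; the project's actual api/app.py defines its own routes and payloads):

    # api/app.py (illustrative) - minimal FastAPI app; the /predict payload is made up.
    from typing import List

    from fastapi import FastAPI
    from pydantic import BaseModel

    app = FastAPI(title="ai-training-project API")

    class PredictRequest(BaseModel):
        inputs: List[float]

    @app.post("/predict")
    def predict(req: PredictRequest) -> dict:
        # Placeholder logic; replace with real model inference.
        return {"prediction": sum(req.inputs) / max(len(req.inputs), 1)}

    # Run with: uvicorn api.app:app --host 0.0.0.0 --port 8000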

Adding New Components

Adding a New Model

  1. Create a new file in src/models/architectures/
  2. Implement the model class
  3. Register the model in src/models/__init__.py
  4. Create a configuration in configs/model/
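
A common way to implement step 3 is a small registry keyed by the name used in the config (a sketch; src/models/__init__.py may use a different mechanism):

    # src/models/__init__.py (hypothetical registry pattern)
    _MODEL_REGISTRY = {}

    def register_model(name: str):
        """Class decorator mapping a config name to a model class."""
        def wrap(cls):
            _MODEL_REGISTRY[name] = cls
            return cls
        return wrap

    def build_model(name: str, **kwargs):
        """Instantiate a registered model, e.g. build_model("resnet50", num_classes=10)."""
        return _MODEL_REGISTRY[name](**kwargs)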

Adding a New Dataset

  1. Create a new dataset class in src/data/dataset.py
  2. Implement data loading and preprocessing
  3. Update the data configuration in configs/training/
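
Assuming the project uses PyTorch, step 1 usually means subclassing torch.utils.data.Dataset (an illustrative sketch; the file layout and tensor keys are assumptions):

    # src/data/dataset.py (illustrative custom dataset)
    from pathlib import Path

    import torch
    from torch.utils.data import Dataset

    class CustomDataset(Dataset):
        """Loads preprocessed .pt samples from data/processed/."""

        def __init__(self, root: str = "data/processed"):
            self.files = sorted(Path(root).glob("*.pt"))

        def __len__(self) -> int:
            return len(self.files)

        def __getitem__(self, idx: int):
            sample = torch.load(self.files[idx])
            return sample["input"], sample["label"]  # assumed keys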

Project Organization

├── LICENSE            <- License file
├── Makefile           <- Makefile with commands like `make data` or `make train`
├── README.md          <- The top-level README for developers using this project
├── configs/           <- Configuration files
│   ├── model/         <- Model configurations
│   ├── training/      <- Training configurations
│   └── inference/     <- Inference configurations
│
├── data/              <- Data storage
│   ├── raw/           <- The original, immutable data dump
│   ├── processed/     <- The final, canonical data sets for modeling
│   ├── interim/       <- Intermediate data that has been transformed
│   └── external/      <- Data from third party sources
│
├── notebooks/         <- Jupyter notebooks
│   ├── exploratory/   <- Initial data exploration
│   ├── preprocessing/ <- Data preprocessing notebooks
│   ├── model_development/ <- Model development and experimentation
│   └── results_analysis/  <- Analysis of results
│
├── scripts/           <- Utility scripts
│   ├── data_processing/ <- Scripts for data processing
│   ├── training/      <- Scripts for training models
│   ├── evaluation/    <- Scripts for model evaluation
│   ├── export/        <- Scripts for model export
│   └── deployment/    <- Scripts for model deployment
│
├── src/               <- Source code for use in this project
│   ├── __init__.py    <- Makes src a Python package
│   ├── data/          <- Code to process data
│   ├── models/        <- Model implementations
│   ├── training/      <- Training implementations
│   ├── utils/         <- Utility functions
│   └── cli/           <- Command line interface
│
├── api/               <- API implementation (FastAPI)
│   ├── app.py         <- Main API application
│   ├── routes/        <- API routes
│   ├── models/        <- API models (Pydantic)
│   └── services/      <- Business logic services
│
├── ui/                <- UI implementation (Gradio)
│   ├── app.py         <- Main UI application
│   ├── components/    <- UI components
│   └── pages/         <- UI pages
│
├── export/            <- Code for model export
│
├── evaluation/        <- Code for model evaluation
│   ├── metrics/       <- Evaluation metrics
│   ├── visualizations/ <- Evaluation visualizations
│   └── benchmarks/    <- Benchmarking tools
│
├── tests/             <- Test files
│   ├── unit/          <- Unit tests
│   ├── integration/   <- Integration tests
│   └── fixtures/      <- Test fixtures
│
├── weights/           <- Model weights
│   ├── pretrained/    <- Pretrained weights
│   └── checkpoints/   <- Training checkpoints
│
├── logs/              <- Log files
│   ├── tensorboard/   <- TensorBoard logs
│   ├── training/      <- Training logs
│   └── evaluation/    <- Evaluation logs
│
└── docs/              <- Documentation
