Use an LLM to create training labels, then distill that knowledge into a smaller, faster model.
This repo shows how to use Gemini 2.5 Flash to label images from CIFAR-100, then train a small MobileNet model on those labels. The idea is that you get most of the LLM's accuracy but with much faster inference.
```bash
git clone <this-repo>
cd llm-as-labelers
uv sync
export OPENROUTER_API_KEY="your_key_here"
```

Run the whole pipeline:

```bash
uv run cifar_distill.py --step all
```

Or run individual steps:

```bash
uv run cifar_distill.py --step prep   # Download and prepare CIFAR data
uv run cifar_distill.py --step label  # Get LLM labels
uv run cifar_distill.py --step train  # Train student model
```

The pipeline:

- Takes CIFAR-100 and filters it down to 5 classes: apple, mushroom, orange, pear, sweet_pepper
- Sends images to Gemini 2.5 Flash for labeling, with dual-pass consistency checking (see the sketch after this list)
- Trains a MobileNet v3 Small on those labels
- Evaluates on the original CIFAR test set
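The dual-pass check means each image is labeled twice and the label is kept only when the two answers agree, which filters out the LLM's least reliable labels. A minimal sketch of the idea, assuming the OpenAI client pointed at OpenRouter's API and the `google/gemini-2.5-flash` model ID (the repo's actual prompt and retry logic live in `cifar_distill.py`):

```python
import base64
import os

from openai import OpenAI

CLASSES = ["apple", "mushroom", "orange", "pear", "sweet_pepper"]

# Assumption: OpenRouter's OpenAI-compatible endpoint and this Gemini model ID.
client = OpenAI(
    base_url="https://openrouter.ai/api/v1",
    api_key=os.environ["OPENROUTER_API_KEY"],
)

def ask_once(png_bytes: bytes) -> str:
    """Ask the LLM for a single one-word label for one image."""
    b64 = base64.b64encode(png_bytes).decode()
    resp = client.chat.completions.create(
        model="google/gemini-2.5-flash",
        messages=[{
            "role": "user",
            "content": [
                {"type": "text",
                 "text": f"Classify this image as one of: {', '.join(CLASSES)}. "
                         "Reply with the class name only."},
                {"type": "image_url",
                 "image_url": {"url": f"data:image/png;base64,{b64}"}},
            ],
        }],
    )
    return resp.choices[0].message.content.strip().lower()

def dual_pass_label(png_bytes: bytes) -> str | None:
    """Label the image twice; keep the label only if both passes agree."""
    first, second = ask_once(png_bytes), ask_once(png_bytes)
    return first if first == second and first in CLASSES else None
```

Images that fail the agreement check are simply dropped from the training set rather than labeled by a tie-break.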
The student model gets about 87% accuracy and runs at 900+ images/second on Apple Silicon. The model file is only 6MB.
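`throughput.py` benchmarks the student's inference speed; a rough sketch of such a benchmark, assuming a checkpoint saved as `student.pt` (a hypothetical filename) and PyTorch's MPS backend on Apple Silicon:

```python
import time

import torch
from torchvision.models import mobilenet_v3_small

# Assumptions: a 5-class head and weights saved by the training step.
device = "mps" if torch.backends.mps.is_available() else "cpu"
model = mobilenet_v3_small(num_classes=5)
model.load_state_dict(torch.load("student.pt", map_location=device))
model.to(device).eval()

batch = torch.randn(256, 3, 32, 32, device=device)  # CIFAR-sized inputs
with torch.no_grad():
    model(batch)  # warm-up pass

    start = time.perf_counter()
    n_batches = 20
    for _ in range(n_batches):
        model(batch)
    if device == "mps":
        torch.mps.synchronize()  # wait for queued GPU work before stopping the clock
    elapsed = time.perf_counter() - start

print(f"{n_batches * batch.shape[0] / elapsed:.0f} images/second")
```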
Files:

- `cifar_distill.py` - Main script that does everything
- `plot_confusion.py` - Visualize the confusion matrix
- `throughput.py` - Test inference speed
- `blog.md` - Longer explanation of the approach
Instead of calling an expensive LLM API for every prediction, you can:
- Use the LLM once to create a training set
- Train a small model that captures most of the LLM's knowledge
- Deploy the small model for fast, cheap inference
- Fall back to the LLM only for uncertain cases (see the sketch below)
This is especially useful when you need to classify thousands of items quickly or want to run inference locally.
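The fallback in the last bullet can be a simple confidence gate on the student's softmax output. A hypothetical sketch (the threshold and the `ask_llm` helper are illustrative, not part of this repo):

```python
import torch
import torch.nn.functional as F

CLASSES = ["apple", "mushroom", "orange", "pear", "sweet_pepper"]
THRESHOLD = 0.9  # illustrative; tune on a held-out set

def classify(model: torch.nn.Module, image: torch.Tensor, ask_llm) -> str:
    """Use the student when it's confident, else fall back to the LLM."""
    with torch.no_grad():
        probs = F.softmax(model(image.unsqueeze(0)), dim=1).squeeze(0)
    conf, idx = probs.max(dim=0)
    if conf.item() >= THRESHOLD:
        return CLASSES[idx.item()]  # fast path: local student model
    return ask_llm(image)  # rare, slow path: defer to the LLM
```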