CURRENTLY UNDER CONSTRUCTION. THE CODE HAS NOT BEEN FULLY REVIEWED OR VERIFIED. PLEASE COME BACK A LITTLE LATER.
Welcome to the Becoming Full-Stack AI Researchers working group at Yale University! This repository contains comprehensive tutorials, minimal working examples (MWEs), and educational materials covering the essential packages, frameworks, and tools for end-to-end AI development and research.
- Equip researchers with skills to go beyond narrow, single-aspect AI work toward holistic, end-to-end AI project capability
- Build reusable onboarding materials for Yale members interested in AI research
- Create a community of Explorers and Builders in AI
- GitHub Repository: Minimal working examples, demos, and slides
- Tutorial Paper: Comprehensive co-authored guide for all modules
- Presentations: In-depth framework introductions
- Modules
- Getting Started
- Repository Structure
- Installation
- Usage
- Contributing
- Resources
- Citation
- Acknowledgments
Our curriculum is organized into six interconnected modules, each covering critical aspects of the AI research and engineering pipeline.
Topics: HuggingFace, Quantization (BitsAndBytes), Datasets (Parquet, PyArrow), Benchmarking (lm-eval, inspect-ai)
Materials:
- MWEs/LLM_Evaluation_Alignment/ - Evaluation and alignment presentation
Learning Objectives:
- Load and save sharded HuggingFace model checkpoints
- Quantize models for efficient deployment
- Store and load datasets in efficient formats
- Benchmark model performance
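As a taste of this module, here is a minimal, unverified sketch of loading a causal LM in 4-bit with BitsAndBytes and saving it as a sharded checkpoint. The model id, shard size, and output paths are placeholders, and a CUDA GPU plus the bitsandbytes package are assumed:

```python
# Sketch only: 4-bit quantized load + sharded save with HuggingFace transformers.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer, BitsAndBytesConfig

model_id = "meta-llama/Llama-3.2-1B"  # placeholder: any HuggingFace causal LM id

# NF4 4-bit quantization (requires a CUDA GPU and bitsandbytes)
bnb_config = BitsAndBytesConfig(
    load_in_4bit=True,
    bnb_4bit_quant_type="nf4",
    bnb_4bit_compute_dtype=torch.bfloat16,
)

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    quantization_config=bnb_config,
    device_map="auto",
)

# Save as sharded safetensors files, one shard per ~2 GB
# (serializing 4-bit weights needs a recent transformers/bitsandbytes)
model.save_pretrained("checkpoints/llama-4bit", max_shard_size="2GB")
tokenizer.save_pretrained("checkpoints/llama-4bit")
```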
Topics: OpenRouter, vLLM (PagedAttention), FastAPI, GEPA, Tools/MCP, TransformerLens
Materials:
- MWEs/Inference/ - Complete inference tutorial with API usage, tools, and GEPA
- MWEs/vllm+deepspeed/ - vLLM tutorial with PagedAttention deep dive
Learning Objectives:
- Use APIs for LLM inference (OpenRouter, OpenAI)
- Understand model selection tradeoffs (cost, performance, latency)
- Implement tool calling and MCP integration
- Optimize prompts with GEPA
- Deploy models with vLLM for efficient serving
Tutorial Paper: overleaf/sections/vllm.tex
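For a concrete starting point, the sketch below issues a chat completion through OpenRouter using the OpenAI-compatible Python client. The model id and environment variable name are illustrative, and the call incurs API cost:

```python
# Sketch only: chat completion via OpenRouter with the openai client (v1 API).
import os
from openai import OpenAI

client = OpenAI(
    base_url="https://openrouter.ai/api/v1",
    api_key=os.environ["OPENROUTER_API_KEY"],  # assumed env var
)

response = client.chat.completions.create(
    model="openai/gpt-4o-mini",  # placeholder: any model id listed on OpenRouter
    messages=[
        {"role": "system", "content": "You are a concise research assistant."},
        {"role": "user", "content": "Explain PagedAttention in two sentences."},
    ],
)
print(response.choices[0].message.content)
```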
Topics: LoRA/QLoRA with PEFT, PyTorch Lightning
Materials:
- MWEs/LoRA_tutorials/ - Comprehensive LoRA tutorial with single-cell biology demo
- MWEs/pytorch/ - PyTorch fundamentals
Learning Objectives:
- Understand parameter-efficient fine-tuning (PEFT)
- Implement LoRA from scratch
- Compare LoRA with full fine-tuning
- Optimize rank selection and hyperparameters
- Orchestrate training with PyTorch Lightning
Tutorial Paper: overleaf/sections/lora.tex, overleaf/sections/sft.tex
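The rough sketch below shows the core PEFT pattern this module builds on: wrapping a base model with LoRA adapters so only a small fraction of weights train. The hyperparameters and target modules are illustrative, and GPT-2 is used only because it is small:

```python
# Sketch only: attach LoRA adapters with PEFT; train as usual afterwards.
from transformers import AutoModelForCausalLM
from peft import LoraConfig, get_peft_model

model = AutoModelForCausalLM.from_pretrained("gpt2")

lora_config = LoraConfig(
    r=8,                        # LoRA rank (a key hyperparameter to sweep)
    lora_alpha=16,              # scaling factor
    lora_dropout=0.05,
    target_modules=["c_attn"],  # attention projection layers in GPT-2
    task_type="CAUSAL_LM",
)

model = get_peft_model(model, lora_config)
model.print_trainable_parameters()  # typically well under 1% of weights are trainable

# ...train with Trainer or PyTorch Lightning, then save only the adapter weights:
model.save_pretrained("adapters/gpt2-lora")
```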
Topics: Docker/Apptainer, VERL, Ray, JAX, Weights & Biases
Materials:
- MWEs/verl/ - VERL tutorial for PPO training on GSM8K
- MWEs/ray_train/ - Distributed training with Ray (data parallel, ZeRO, model parallel)
- MWEs/vllm+deepspeed/ - DeepSpeed integration
Learning Objectives:
- Container workflows (Docker, Apptainer)
- Reinforcement learning with VERL (PPO)
- Distributed training strategies (Ray, DeepSpeed ZeRO)
- Experiment tracking (W&B)
Tutorial Paper: overleaf/sections/ray.tex, overleaf/sections/deepspeed.tex
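As a minimal illustration of the distributed-training piece, the sketch below runs a toy data-parallel loop with Ray Train's TorchTrainer. The model, synthetic data, and worker count are placeholders standing in for the CIFAR example in MWEs/ray_train/, and the reported metrics are a natural place to hook in W&B:

```python
# Sketch only: data-parallel training with Ray Train (2 CPU workers).
import torch
import torch.nn as nn
from ray import train
from ray.train import ScalingConfig
from ray.train.torch import TorchTrainer, prepare_model

def train_loop_per_worker(config):
    model = prepare_model(nn.Linear(32, 2))  # wraps with DDP, moves to the worker's device
    optimizer = torch.optim.SGD(model.parameters(), lr=config["lr"])
    for _ in range(config["epochs"]):
        x, y = torch.randn(64, 32), torch.randint(0, 2, (64,))  # toy batch
        loss = nn.functional.cross_entropy(model(x), y)
        optimizer.zero_grad()
        loss.backward()
        optimizer.step()
        train.report({"loss": loss.item()})  # hook for experiment tracking (e.g. W&B)

trainer = TorchTrainer(
    train_loop_per_worker,
    train_loop_config={"lr": 1e-2, "epochs": 2},
    scaling_config=ScalingConfig(num_workers=2, use_gpu=False),
)
result = trainer.fit()
```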
Topics: LangChain, ReAct, MemGPT, OpenVLA
Materials:
- MWEs/agentic_rl_workshop.ipynb - Agentic RL workshop
- MWEs/Robotics/ - Vision-Language-Action frameworks
Learning Objectives:
- Build multi-step reasoning workflows
- Implement agent frameworks
- Apply Vision-Language-Action models to robotics
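To make the ReAct pattern concrete without committing to a specific framework, here is a hand-rolled sketch of the think/act/observe loop. `call_llm` and the calculator tool are hypothetical stand-ins; LangChain and MemGPT provide production-grade versions of this pattern:

```python
# Sketch only: a minimal ReAct-style agent loop with one toy tool.
import re

def call_llm(prompt: str) -> str:
    """Placeholder for an LLM call (e.g., via the OpenRouter client above)."""
    raise NotImplementedError

def calculator(expression: str) -> str:
    # Toy tool; never eval untrusted input in real systems.
    return str(eval(expression, {"__builtins__": {}}))

SYSTEM = (
    "Answer the question. You may emit lines of the form\n"
    "Action: calculator[<expression>]\n"
    "and will receive an Observation. Finish with 'Final Answer: ...'."
)

def react(question: str, max_steps: int = 5) -> str:
    transcript = f"{SYSTEM}\nQuestion: {question}\n"
    for _ in range(max_steps):
        reply = call_llm(transcript)          # model reasons and optionally acts
        transcript += reply + "\n"
        if "Final Answer:" in reply:
            return reply.split("Final Answer:")[-1].strip()
        match = re.search(r"Action: calculator\[(.+?)\]", reply)
        if match:                             # run the tool and feed the result back
            transcript += f"Observation: {calculator(match.group(1))}\n"
    return "No answer within step budget."
```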
Topics: Complete pipeline from data → training → deployment
Learning Objectives:
- Build complete AI pipelines
- Scale and debug on HPC clusters
- Deploy production systems
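On the deployment end of the pipeline, a minimal, illustrative FastAPI service wrapping a placeholder text-generation model might look like the following; swap in your own fine-tuned checkpoint and run it with uvicorn (assuming the file is saved as serve.py):

```python
# Sketch only: a tiny text-generation endpoint. Run with: uvicorn serve:app --port 8000
from fastapi import FastAPI
from pydantic import BaseModel
from transformers import pipeline

app = FastAPI()
generator = pipeline("text-generation", model="gpt2")  # placeholder model

class GenerateRequest(BaseModel):
    prompt: str
    max_new_tokens: int = 64

@app.post("/generate")
def generate(req: GenerateRequest):
    out = generator(req.prompt, max_new_tokens=req.max_new_tokens)
    return {"completion": out[0]["generated_text"]}
```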
Topics: PyTorch, JAX, TensorFlow, Scaling Laws
Materials:
- MWEs/pytorch/ - Comprehensive PyTorch tutorial (autograd, custom ops, optimization)
- MWEs/Scaling_Laws/ - Scaling laws analysis (Kaplan, Chinchilla)
Tutorial Paper: overleaf/sections/torch-jax-tf.tex
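For readers new to PyTorch, the central idea the tutorial starts from is dynamic autograd: the computation graph is built as ordinary Python code runs, and one `.backward()` call backpropagates through it. A minimal example:

```python
# Sketch only: autograd on a scalar loss built from ordinary tensor ops.
import torch

x = torch.tensor([1.0, 2.0, 3.0], requires_grad=True)
w = torch.tensor([0.5, -1.0, 2.0], requires_grad=True)

loss = ((w * x).sum() - 4.0) ** 2   # scalar loss
loss.backward()                     # populates .grad on every leaf tensor

print(x.grad)  # d(loss)/dx = 2 * ((w*x).sum() - 4) * w
print(w.grad)  # d(loss)/dw = 2 * ((w*x).sum() - 4) * x
```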
- Python 3.8+ (3.10 recommended)
- Fluency in Python (required)
- Git and Conda (or virtualenv)
- CUDA-capable GPU (optional but recommended for deep learning tasks)
# Clone the repository
git clone https://github.com/sashacui/full-stack-ai.git
cd full-stack-ai
# Choose a module to start with (e.g., PyTorch basics)
cd MWEs/pytorch
# Create environment and install dependencies
conda create -n pytorch-tutorial python=3.10
conda activate pytorch-tutorial
pip install torch numpy pandas jupyter
# Run the tutorial
jupyter notebook pytorch_tutorial.ipynb
- Start with: MWEs/pytorch/ - Learn PyTorch fundamentals
- Move to: MWEs/Inference/ - Understand LLM APIs and inference
- Then try: MWEs/LoRA_tutorials/ - Learn parameter-efficient fine-tuning
- Start with: MWEs/vllm+deepspeed/ - Efficient serving
- Move to: MWEs/ray_train/ - Distributed training
- Then try: MWEs/verl/ - RL fine-tuning
- Explore all modules based on your research needs
- Experiment with combinations (e.g., LoRA + VERL + vLLM)
- Build end-to-end projects using multiple tools
full-stack-ai/
├── MWEs/                           # Minimal Working Examples
│   ├── pytorch/                    # PyTorch fundamentals
│   │   ├── pytorch_tutorial.ipynb
│   │   └── README.md
│   ├── Inference/                  # LLM inference, tools, GEPA
│   │   ├── inference.ipynb
│   │   ├── tools.py
│   │   ├── GEPA_utils.py
│   │   └── README.md
│   ├── LoRA_tutorials/             # LoRA/PEFT tutorials
│   │   ├── lora_single_cell_demo_clean.ipynb
│   │   ├── pytorch_lightning_tutorial.ipynb
│   │   └── README.md
│   ├── vllm+deepspeed/             # vLLM and DeepSpeed
│   │   ├── vllm_sections_1_4.ipynb
│   │   ├── deepspeed_tutorial_sections_1_4.ipynb
│   │   └── README.md
│   ├── ray_train/                  # Ray distributed training
│   │   ├── train_cifar.py
│   │   ├── zero_deepspeed.py
│   │   ├── model_par.py
│   │   └── README.md
│   ├── verl/                       # VERL RL training
│   │   ├── evaluate_gsm8k.py
│   │   ├── compare_results.py
│   │   └── README.md
│   ├── LLM_Evaluation_Alignment/   # Evaluation & alignment
│   │   └── llm_evaluation_presentation.ipynb
│   ├── Scaling_Laws/               # Scaling laws analysis
│   │   └── scaling_laws.ipynb
│   ├── Robotics/                   # VLA frameworks
│   │   └── frameworks.ipynb
│   └── agentic_rl_workshop.ipynb   # Agentic systems
│
├── overleaf/                       # Tutorial paper source
│   ├── tutorial.tex                # Main tutorial document
│   ├── syllabus.tex                # Course syllabus
│   └── sections/                   # Individual sections
│       ├── introduction.tex
│       ├── torch-jax-tf.tex
│       ├── ray.tex
│       ├── lora.tex
│       ├── vllm.tex
│       ├── deepspeed.tex
│       ├── sft.tex
│       └── conclusion.tex
│
├── slides/                         # Presentation materials
│   ├── ray_train.pdf
│   └── verl_tutorial.pdf
│
├── README.md                       # This file
└── CLEANUP_PLAN.md                 # Development roadmap
- OS: Linux (Ubuntu 20.04+), macOS (11+), or Windows (WSL2)
- RAM: 16GB+ (32GB recommended for large models)
- GPU: NVIDIA GPU with 8GB+ VRAM (optional but recommended)
- Storage: 50GB+ free space (for models and datasets)
We recommend using Conda for environment management:
# Install Miniconda (if not already installed)
wget https://repo.anaconda.com/miniconda/Miniconda3-latest-Linux-x86_64.sh
bash Miniconda3-latest-Linux-x86_64.sh
# Create a base environment
conda create -n fullstack-ai python=3.10
conda activate fullstack-ai
# Install common dependencies
pip install torch torchvision torchaudio
pip install transformers accelerate datasets
pip install jupyter jupyterlab ipython
pip install numpy pandas matplotlib seaborn scikit-learn
Each MWE folder contains its own README.md with specific installation instructions. For example:
# For vLLM tutorial
cd MWEs/vllm+deepspeed
pip install vllm
jupyter notebook vllm_sections_1_4.ipynb
# For LoRA tutorial
cd MWEs/LoRA_tutorials
pip install scanpy leidenalg
jupyter notebook lora_single_cell_demo_clean.ipynb
# For VERL tutorial
cd MWEs/verl
# Follow containerized setup in README.md
# Start Jupyter
jupyter notebook
# Or use JupyterLab
jupyter lab
# Access via browser at http://localhost:8888
# Example: Ray training
cd MWEs/ray_train
python train_cifar.py
# Example: VERL evaluation
cd MWEs/verl
python evaluate_gsm8k.py --model_path <path> --data_path <path>
# Example SLURM job submission
cd MWEs/verl
sbatch ppo_gsm8k.sh
- PyTorch Documentation
- Hugging Face Documentation
- vLLM Documentation
- Ray Documentation
- DeepSpeed Documentation
- LoRA: Low-Rank Adaptation
- PagedAttention (vLLM)
- ZeRO: DeepSpeed Optimization
- Scaling Laws for Neural LMs
The complete tutorial paper is being developed in the overleaf/ directory. Current sections include:
- ✅ Introduction
- ✅ PyTorch, JAX, and TensorFlow Fundamentals
- ✅ Ray: Distributed Training
- ✅ LoRA: Parameter-Efficient Fine-Tuning
- 🚧 vLLM: Efficient Inference
- 🚧 DeepSpeed: Memory-Efficient Training
- 🚧 SFT: Supervised Fine-Tuning
- 🚧 Evaluation and Benchmarking
- 🚧 Agentic Systems
- ✅ Conclusion
Legend: ✅ = Complete, 🚧 = In Progress
To compile the tutorial paper (requires LaTeX):
cd overleaf
pdflatex tutorial.tex
bibtex tutorial
pdflatex tutorial.tex
pdflatex tutorial.tex
If you use these materials in your research or teaching, please cite:
@misc{fullstackai2025,
title = {Becoming Full-Stack AI Researchers: A Comprehensive Tutorial},
author = {Cui, Sasha and Mader, Alexander and Typaldos, George and Bai, Donglu and Kazdan, Josh and Hu, Xinyang and Wei, Jeffrey and Feng, Austin and Lin, Oliver and Zhu, Chris and Vishnempet, Shivkumar and Sun, Xingzhi and Le, Quan and Luo, Ping and Lafferty, John and Sekhon, Jasjeet},
year = {2025},
institution = {Yale University},
howpublished = {\url{https://github.com/sashacui/full-stack-ai}},
note = {Fall 2025 Working Group}
}
- Wu Tsai Institute at Yale University - GPU resources and classroom space
- Yale Department of Statistics & Data Science
- Yale Department of Physics
- Yale Department of Philosophy
- Misha High Performance Computing Cluster
Session leads (by date)
23 Sept
- Inference & APIs, Tools, MCP, Prompt Engineering – Alexander Mader
- Distributed Training (Ray, PyTorch vs JAX vs TensorFlow) – George Typaldos, Sasha Cui
7 Oct
- SFT (PEFT, Lightning) – Donglu Bai
- Pretraining and Model Collapse – Josh Kazdan
21 Oct
- Serving (vLLM) & Distributed Training (DeepSpeed) – Xinyang Hu
- Scaling Laws – Alexander Mader
4 Nov
- Robotics (OpenVLA, RoboSuite, RoboVerse, LeRobot) – Jeffrey Wei, Austin Feng, Oliver Lin
- Model Evaluation, Benchmarking, RLHF, RLAIF (lm-eval) – Chris Zhu
18 Nov
- Agents (LangChain, ReAct workflows) – Shivkumar Vishnempet, Xinyang Hu
- Alignment and Interpretability – Oliver Lin
2 Dec
- RL (VERL, Q-function Monte Carlo), Containers (Docker, Apptainer) – Xingzhi Sun, Quan Le, Donglu Bai
- Jailbreaking – Josh Kazdan
We thank Ping Luo, John Lafferty, Linjun Zhang, Anurag Kashyap, Theo Saarinen, and Yuxuan Zhu for helpful comments and suggestions.
- Email: [email protected]
- Website: https://sashacui.com/full-stack.html
- Issues: GitHub Issues