
AutoPose

Synthetic Data Augmentation in Pose Space for Action Recognition (Research Use Only)

Licensed under XXXX-2: users may use this software noncommercially (for research or evaluation purposes only) on NVIDIA Processors.

This project will download and install additional third-party open source software projects. Review the license terms of these open source projects before use.

[Demo GIF: mmm-gif-autopose]

[Colab demo redacted for conference submission purposes]

[Hugging Face demo redacted for conference submission purposes]

AutoPose is an open-source framework that augments underrepresented action classes through pose-level interpolation. Instead of rendering full photorealistic frames, AutoPose generates kinematically valid, class-consistent video clips directly from interpolated pose trajectories. This approach preserves structural fidelity, avoids the motion artifacts common to heavy generative pipelines, and is efficient for enriching rare, safety-critical actions such as falls.

Inspired by interpolation-based methods like SMOTE, AutoPose adapts these ideas to human pose space. It creates synthetic sequences via two modes: Synthetic-Mix (interpolating real with synthetic poses) and Real-Mix (interpolating between real poses), with interpolation weights optimized in a gradient-guided loop.
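To make the two modes concrete, below is a minimal NumPy sketch of weighted pose interpolation. It is an illustration under assumptions, not the repository's implementation: the `(T, J, 3)` sequence shape, the `interpolate_poses` name, and the fixed weight are placeholders, and real clips would first need temporal alignment to a common length.

```python
import numpy as np

def interpolate_poses(seq_a, seq_b, lam):
    """Linearly blend two pose sequences of shape (T, J, 3).

    Synthetic-Mix: seq_a is a real trajectory, seq_b an AI-generated one.
    Real-Mix:      both seq_a and seq_b are real trajectories.
    lam is the interpolation weight in [0, 1]; lam=1.0 returns seq_a.
    """
    assert seq_a.shape == seq_b.shape, "sequences must be time-aligned first"
    return lam * seq_a + (1.0 - lam) * seq_b

# Toy Synthetic-Mix call: 64 frames, 17 joints, xyz coordinates.
real = np.random.randn(64, 17, 3)
generated = np.random.randn(64, 17, 3)
mixed = interpolate_poses(real, generated, lam=0.7)
```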

Each component of the framework can be customized with models of your choice. While the reference implementation integrates an action recognition network for optimization, users may substitute alternative models. Components can also be run independently or automated for specific use cases.
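The gradient-guided loop can be pictured as optimizing a single blend weight against whatever frozen recognition model you plug in (the reference implementation integrates an action-recognition network; SlowFast appears in the repository list below as the backbone for this loop). The PyTorch code below is a hedged sketch under assumptions: the `optimize_weight` helper, the sigmoid reparameterization, and the toy classifier are illustrative, not the repo's API.

```python
import torch

def optimize_weight(seq_a, seq_b, classifier, target_label,
                    steps=50, lr=0.05):
    """Tune one interpolation weight so the blended clip still scores
    as the target class under a frozen recognition model."""
    raw = torch.tensor(0.0, requires_grad=True)   # sigmoid(0) = 0.5
    opt = torch.optim.Adam([raw], lr=lr)
    for _ in range(steps):
        w = torch.sigmoid(raw)                    # keep weight in (0, 1)
        mixed = w * seq_a + (1.0 - w) * seq_b
        logits = classifier(mixed.unsqueeze(0))   # add batch dimension
        loss = torch.nn.functional.cross_entropy(
            logits, torch.tensor([target_label]))
        opt.zero_grad()
        loss.backward()
        opt.step()
    return torch.sigmoid(raw).item()

# Toy stand-in classifier; a real pipeline would wire in its own model.
toy = torch.nn.Sequential(torch.nn.Flatten(), torch.nn.Linear(64 * 17 * 3, 10))
best_w = optimize_weight(torch.randn(64, 17, 3), torch.randn(64, 17, 3),
                         toy, target_label=3)
```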

[Figure: pipeline_demo]


Why Pose-Level Augmentation?

| Capability | AutoPose (Pose) |
| --- | --- |
| Fine-grained motion control | ✅ Improved on prior works |
| Independence from textures | ✅ Improved on prior works |
| Maintaining semantic labels and kinematic plausibility | ✅ Improved on prior works |

Pose-level synthesis keeps joint semantics explicit, reduces visual artifacts, and allows customizable generation speed and quality on NVIDIA GPUs.
This code has been developed and tested only with NVIDIA Processors.

Publications [Redacted for Conference Review]


Repository Structure

Minimal Setup Checklist

| Step | Command / Action | Notes |
| --- | --- | --- |
| 1 | Install deps: `pip install -r requirements.txt` | CUDA-enabled PyTorch recommended |
| 2 | Clone sub-repos: `git clone` the five required repos (see list below) | Keep the folder names unchanged |
| 3 | Create `.env`: add your required keys and paths for each repo | No keys are provided in this repo |
| 4 | Download models: obtain all checkpoints from the setup instructions | Place files exactly as instructed |
| 5 | Smoke-test: run each repo's test script once, then run `components/running.py` | Fail fast before running the full pipeline |
| 6 | Automate pipeline: select your CV model for the optimization loop and automate it with the included components | Each use case is user-defined |
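Because step 1 recommends a CUDA-enabled PyTorch build and the code targets NVIDIA Processors, a quick pre-flight check (illustrative, not part of the repo) can fail fast before the step-5 smoke test:

```python
import torch

# Fail fast if PyTorch cannot see an NVIDIA GPU before running the pipeline.
assert torch.cuda.is_available(), "No CUDA device visible to PyTorch"
print(f"Using GPU: {torch.cuda.get_device_name(0)}")
```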

Required External Repositories

StridedTransformer-Pose3D/
text-to-motion/
joints2smpl/
SlowFast/
Blender-3.0.0/          (binary drop)
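Since the pipeline expects these folder names verbatim (step 2 of the checklist), a small illustrative check, assuming the repos are cloned into the project root, can catch misplaced clones early:

```python
from pathlib import Path

REQUIRED = [
    "StridedTransformer-Pose3D",
    "text-to-motion",
    "joints2smpl",
    "SlowFast",
    "Blender-3.0.0",
]

# Verify each required sub-repo exists with its folder name unchanged.
missing = [name for name in REQUIRED if not Path(name).is_dir()]
if missing:
    raise SystemExit(f"Missing required sub-repos: {missing}")
print("All five external repositories are in place.")
```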

Suggested Folder Structure

autopose/
├── StridedTransformer-Pose3D/   # Pose extraction and temporal modeling
├── text-to-motion/              # Text-to-motion generation utilities
├── joints2smpl/                 # Pose-to-SMPL conversion scripts
├── SlowFast/                    # Action recognition backbone (for optimization loop)
├── Blender-3.0.0/               # Blender binary (for rendering if required)
├── components/                  # Core AutoPose modules
│   ├── dataset/                 # Data loading & preprocessing
│   ├── optimization/            # Pose-level interpolation methods
│   ├── renders/                 # Video rendering utilities
│   ├── video_processing/        # Training and optimization loop
│   ├── utils/                   # Helper functions and shared code
│   └── ...                      # Additional modules/scripts from the AutoPose repo

About

AutoPose is a framework designed to make synthetic data generation for human actions more usable and accessible. This is a pose-level augmentation framework that generates synthetic training videos by interpolating real and AI-generated poses. It increases minority-class coverage, helping to mitigate data scarcity for rare actions.
