Angstrom: Phase-Based Motion Amplification

Angstrom is a Python library for phase-based motion amplification in videos. It uses complex steerable pyramids to decompose video frames and amplify subtle motion by manipulating phase coefficients. This technique is particularly useful for revealing imperceptible motion in videos, such as breathing, heartbeat, or structural vibrations.

🚀 Features

Phase-based motion amplification: Uses complex steerable pyramids for accurate motion detection
Temporal filtering: Apply bandpass filters to target specific motion frequencies (e.g., 0.1-2.0 Hz for human motion)
GPU acceleration: Leverages PyTorch for efficient computation (optional)
Multiple output formats: Support for various video formats
Configurable parameters: Fine-tune amplification factors and frequency ranges
Efficient sequence processing: Optimized for processing video sequences (see Performance for approximate throughput)
Command-line interface: Easy-to-use CLI for batch processing
Memory optimization: Efficient processing for large videos
Visualization tools: Built-in utilities for analyzing pyramid structures and phases

📦 Installation

From PyPI (Recommended)

pip install angstrom

From Source

git clone https://github.com/levi2234/Angstrom.git
cd Angstrom
pip install -e .

Optional Dependencies

# With development tools
pip install angstrom[dev]

# With documentation tools
pip install angstrom[docs]

# With GPU acceleration
pip install angstrom[gpu]

# With everything
pip install angstrom[all]

Dependencies

Python 3.8+
NumPy 1.21.0+
OpenCV 4.5.0+
SciPy 1.7.0+
tqdm 4.62.0+
Matplotlib 3.5.0+
PyTorch 1.9.0+ (optional, for GPU acceleration)

🎯 Quick Start

Python API

from angstrom.core.motion_amplifier import MotionAmplifier

# Initialize the motion amplifier
amplifier = MotionAmplifier()

# Process a video with motion amplification
amplifier.process_video(
    input_path="input_video.mp4",
    output_path="amplified_video.mp4",
    amplification_factor=10,
    frequency_range=(0.1, 2.0)  # Hz - typical human motion frequencies
)

Command Line Interface

# Basic motion amplification
angstrom input.mp4 output.mp4 --factor 10

# Amplify specific frequency range (breathing motion)
angstrom input.mp4 output.mp4 --factor 50 --freq-range 0.1 0.5

# Amplify heartbeat motion
angstrom input.mp4 output.mp4 --factor 100 --freq-range 0.8 2.0

# Use GPU acceleration
angstrom input.mp4 output.mp4 --device cuda --verbose
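
For batch processing, the flags shown above can be driven from a short script. The sketch below only builds one command line per `.mp4` file; the helper name `build_commands`, the `amplified_` output prefix, and the directory layout are illustrative assumptions and are not part of Angstrom.

```python
from pathlib import Path


def build_commands(input_dir, output_dir, factor=10.0, freq_range=(0.1, 2.0)):
    """Return one `angstrom` argv list per .mp4 file in input_dir.

    Hypothetical helper: it relies only on the --factor and --freq-range
    flags documented in this README.
    """
    commands = []
    for video in sorted(Path(input_dir).glob("*.mp4")):
        target = Path(output_dir) / f"amplified_{video.name}"
        commands.append([
            "angstrom", str(video), str(target),
            "--factor", str(factor),
            "--freq-range", str(freq_range[0]), str(freq_range[1]),
        ])
    return commands
```

Each returned list can be passed to `subprocess.run(argv, check=True)`, which stops the batch at the first video that fails.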

🔬 How It Works

Angstrom uses a phase-based motion amplification approach:

Video Decomposition: Each frame is decomposed using complex steerable pyramids
Phase Extraction: Phase coefficients are extracted from the complex pyramid coefficients
Motion Detection: Temporal differences in phase between frames reveal motion information
Frequency Filtering: Bandpass filters isolate motion at specific frequencies
Motion Amplification: The filtered motion is amplified by a specified factor
Reconstruction: The amplified phase change is added back to the original phase, and each frame is reconstructed from the modified pyramid
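
The core idea behind these steps can be seen in one dimension. The sketch below is a simplification under stated assumptions: it uses a single global FFT in place of a steerable pyramid, skips temporal filtering, and works on a synthetic 1D signal. The function name `amplify_displacement` is illustrative and does not come from Angstrom.

```python
import numpy as np


def amplify_displacement(reference, frame, alpha):
    """Magnify the shift of `frame` relative to `reference` by (1 + alpha).

    Illustrative only: one global FFT stands in for the multi-scale,
    multi-orientation pyramid that Angstrom uses.
    """
    ref_coeffs = np.fft.fft(reference)
    cur_coeffs = np.fft.fft(frame)
    # Phase difference per frequency; for a pure shift d it is -2*pi*k*d/N.
    delta_phase = np.angle(cur_coeffs * np.conj(ref_coeffs))
    # Add alpha times the phase difference; amplitudes stay untouched.
    amplified = cur_coeffs * np.exp(1j * alpha * delta_phase)
    return np.fft.ifft(amplified).real


x = np.arange(256)
reference = np.exp(-0.5 * ((x - 100) / 8.0) ** 2)  # smooth bump centred at 100
frame = np.roll(reference, 1)                      # same bump moved by 1 sample
magnified = amplify_displacement(reference, frame, alpha=4)
print(int(np.argmax(magnified)))  # 105: the 1-sample shift became 5 samples
```

A global transform only handles one rigid shift of the whole signal. Localized sub-bands, such as those of a steerable pyramid, are what let different image regions move independently.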

Key Components

Complex Steerable Pyramids: Multi-scale, multi-orientation decomposition
Phase Manipulation: Direct manipulation of phase coefficients for motion amplification
Temporal Filtering: Frequency-domain filtering to isolate specific motion types
PyTorch Integration: GPU-accelerated computation for efficient processing (optional)
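
As a rough illustration of the temporal filtering component, an ideal bandpass filter can be written with NumPy's real FFT. This is a sketch, not the contents of `temporal_ideal_filter.py`; the name `ideal_bandpass` and the assumption that time runs along axis 0 are mine.

```python
import numpy as np


def ideal_bandpass(series, fps, low_hz, high_hz):
    """Keep only temporal frequencies in [low_hz, high_hz] along axis 0."""
    n_frames = series.shape[0]
    freqs = np.fft.rfftfreq(n_frames, d=1.0 / fps)
    spectrum = np.fft.rfft(series, axis=0)
    keep = (freqs >= low_hz) & (freqs <= high_hz)
    spectrum[~keep] = 0  # zero every bin outside the pass band
    return np.fft.irfft(spectrum, n=n_frames, axis=0)


fps = 30
t = np.arange(300) / fps                         # 10 seconds of samples
breathing = np.sin(2 * np.pi * 0.3 * t)          # 0.3 Hz component
vibration = 0.5 * np.sin(2 * np.pi * 5.0 * t)    # 5 Hz component
filtered = ideal_bandpass(breathing + vibration, fps, 0.1, 0.5)
```

Here `filtered` retains the 0.3 Hz component and drops the 5 Hz one. An ideal filter has sharp cutoffs, so short clips with frequencies that fall between FFT bins will show some ringing.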

📁 Project Structure

Angstrom/
├── src/angstrom/
│   ├── core/
│   │   └── motion_amplifier.py      # Main motion amplification class
│   ├── processing/
│   │   ├── phase.py                 # Phase extraction and manipulation
│   │   ├── pyramid.py               # Complex steerable pyramid wrapper
│   │   ├── filters.py               # Filtering utilities
│   │   ├── temporal_ideal_filter.py # Temporal filtering implementation
│   │   └── processing.py            # General processing utilities
│   ├── pyramids/
│   │   ├── steerable_pyramid.py     # Complex steerable pyramid implementation
│   │   └── pyramid_utils.py         # Pyramid utility functions
│   ├── io/
│   │   └── video_io.py              # Video input/output utilities
│   ├── utils/
│   │   ├── memory_monitor.py        # Memory usage monitoring
│   │   ├── visualization.py         # Visualization utilities
│   │   └── helpers.py               # Helper functions
│   ├── data/
│   │   └── testvideos/              # Test video files
│   └── cli.py                       # Command-line interface
├── examples/                        # Usage examples and test scripts
├── tests/                           # Unit tests
├── docs/                            # Documentation
└── pyproject.toml                   # Project configuration

🧪 Examples

Basic Usage

from angstrom.core.motion_amplifier import MotionAmplifier
import torch

# Initialize with specific device
device = torch.device('cuda' if torch.cuda.is_available() else 'cpu')
amplifier = MotionAmplifier(device=device)

# Load and process video step by step
amplifier.load_video("input_video.mp4")
amplifier.process()  # Decompose frames into pyramid coefficients

# Amplify motion with custom parameters
amplified_video = amplifier.amplify(
    amplification_factor=20,
    frequency_range=(1.8, 2.2)  # Target specific frequency band
)

# Save the result
amplifier.save_video(amplified_video, "amplified_output.mp4")

Processing Different Motion Types

from angstrom.core.motion_amplifier import MotionAmplifier

amplifier = MotionAmplifier()

# For breathing motion (0.1-0.5 Hz)
amplifier.process_video(
    input_path="breathing_video.mp4",
    output_path="amplified_breathing.mp4",
    amplification_factor=50,
    frequency_range=(0.1, 0.5)
)

# For heartbeat motion (0.8-2.0 Hz)
amplifier.process_video(
    input_path="heartbeat_video.mp4",
    output_path="amplified_heartbeat.mp4",
    amplification_factor=100,
    frequency_range=(0.8, 2.0)
)

# For structural vibrations (5-20 Hz)
# Note: the source video must be recorded above 40 fps to capture 20 Hz (Nyquist limit)
amplifier.process_video(
    input_path="vibration_video.mp4",
    output_path="amplified_vibration.mp4",
    amplification_factor=200,
    frequency_range=(5.0, 20.0)
)

Advanced Usage

from angstrom.core.motion_amplifier import MotionAmplifier
from angstrom.processing.phase import extract_phase, extract_amplitude

# Custom phase processing
amplifier = MotionAmplifier()
amplifier.load_video("input.mp4")
amplifier.process()

# Extract phase and amplitude manually
frame_coeffs = amplifier.pyramid_coeffs[0]
phase_coeffs = extract_phase(frame_coeffs)
amplitude_coeffs = extract_amplitude(frame_coeffs)

# Custom processing...
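
For orientation, phase and amplitude are the polar form of a complex coefficient. The sketch below uses NumPy stand-ins on a small hand-made array; how `extract_phase` and `extract_amplitude` are implemented, and how Angstrom stores its coefficients, is not shown in this README, so treat this as an assumption-laden illustration.

```python
import numpy as np

# Stand-in for one complex sub-band; real pyramid coefficients may be
# stored differently (for example as torch tensors).
band = np.array([[1 + 1j, -2 + 0j], [0 - 3j, 0.5 + 0.5j]])

phase = np.angle(band)      # radians in (-pi, pi]
amplitude = np.abs(band)    # non-negative magnitude

# Polar form is lossless: amplitude * exp(i * phase) restores the band.
restored = amplitude * np.exp(1j * phase)
```

Motion amplification changes only `phase` and leaves `amplitude` alone, which is why it adds little noise compared with amplifying pixel intensities.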

Visualization Examples

from angstrom.utils.visualization import visualize_pyramid_phases

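# Visualize pyramid phases of the first frame for analysis
# (assumes `amplifier` has already run load_video() and process(), as in Advanced Usage)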
visualize_pyramid_phases(
    pyramid_coeffs=amplifier.pyramid_coeffs[0],
    output_path="pyramid_visualization.png"
)

📊 Performance

Processing Speed: Roughly 2-5 frames/second on CPU and 10-20 frames/second on GPU; actual throughput depends on resolution and hardware
Memory Usage: Scales with video resolution and number of frames
Accuracy: High-quality motion amplification with minimal artifacts
Scalability: Supports videos of various resolutions and frame rates

Memory Optimization

For large videos or limited memory systems, Angstrom provides memory-efficient processing:

from angstrom.core.motion_amplifier import MotionAmplifier
from angstrom.utils.memory_monitor import MemoryMonitor

# Monitor memory usage
monitor = MemoryMonitor()

# Automatic memory optimization for large videos
amplifier = MotionAmplifier()
amplifier.process_video(
    input_path="large_video.mp4",
    output_path="output.mp4",
    amplification_factor=10
)

# Check memory usage
monitor.print_memory_summary()

Memory Optimization Features:

Streaming Processing: Process videos in chunks to minimize memory usage
Automatic Chunk Sizing: Calculate optimal chunk size based on available memory
Memory Monitoring: Track memory usage during processing
Video Downsampling: Reduce resolution for memory-constrained systems
Pyramid Compression: Compress coefficients to save memory
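
To get a feel for automatic chunk sizing, the following back-of-the-envelope estimate computes how many frames of pyramid coefficients fit in a memory budget. It is a sketch under loudly stated assumptions (each level halves the resolution, every level holds `nbands` complex sub-bands, 8 bytes per coefficient); Angstrom's own calculation may differ, and `frames_per_chunk` is a hypothetical name.

```python
def frames_per_chunk(height, width, available_bytes,
                     levels=5, nbands=4, bytes_per_coeff=8, safety=0.5):
    """Estimate how many frames of pyramid coefficients fit in memory.

    Assumptions (not taken from Angstrom's code): each level halves the
    resolution, holds `nbands` complex sub-bands, and one complex
    coefficient takes 8 bytes (complex64).
    """
    coeffs_per_frame = 0
    h, w = height, width
    for _ in range(levels):
        coeffs_per_frame += h * w * nbands
        h, w = max(1, h // 2), max(1, w // 2)
    bytes_per_frame = coeffs_per_frame * bytes_per_coeff
    budget = int(available_bytes * safety)  # leave headroom for other buffers
    return max(1, budget // bytes_per_frame)
```

For example, `frames_per_chunk(1080, 1920, 8 * 1024**3)` gives the chunk length for a 1080p video with 8 GiB available under these assumptions.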

🔧 Configuration

Amplification Parameters

amplification_factor: How much to amplify motion (typically 10-200)
frequency_range: Target frequency band in Hz (e.g., (0.1, 2.0) for human motion)
fps: Video frame rate (automatically detected)
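
A frequency band is only meaningful if the frame rate can represent it: the upper bound must not exceed half the frame rate (the Nyquist limit). The check below is a minimal sketch; `check_frequency_range` is a hypothetical helper, and whether Angstrom performs a similar validation internally is not stated here.

```python
def check_frequency_range(frequency_range, fps):
    """Validate a (low, high) band in Hz against the video frame rate."""
    low, high = frequency_range
    if not 0 <= low < high:
        raise ValueError(f"expected 0 <= low < high, got {frequency_range}")
    nyquist = fps / 2.0
    if high > nyquist:
        raise ValueError(
            f"{high} Hz exceeds the Nyquist limit of {nyquist} Hz at {fps} fps"
        )
    return low, high
```

At 30 fps the limit is 15 Hz, so a 5-20 Hz structural-vibration band needs footage recorded above 40 fps.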

Pyramid Parameters

height: Number of pyramid levels (default: 5)
nbands: Number of orientation bands (default: 4)
scale_factor: Scaling factor between levels (default: 2)
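
To see what these defaults imply, the sketch below lists the resolution of each level for a given frame size. It assumes plain integer division by `scale_factor` at every level and ignores any residual high-pass or low-pass bands, so the actual shapes produced by Angstrom's pyramid may differ; `level_shapes` is an illustrative name.

```python
def level_shapes(frame_height, frame_width, height=5, scale_factor=2):
    """List the (rows, cols) of each pyramid level, finest first."""
    shapes = []
    rows, cols = frame_height, frame_width
    for _ in range(height):
        shapes.append((rows, cols))
        rows = max(1, rows // scale_factor)
        cols = max(1, cols // scale_factor)
    return shapes


print(level_shapes(480, 640))
# [(480, 640), (240, 320), (120, 160), (60, 80), (30, 40)]
```

Coarser levels capture larger-scale motion, so more levels help with big, slow movements at the cost of extra computation.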

CLI Options

angstrom --help

Available options:

--factor, -f: Amplification factor (default: 10.0)
--freq-range, -r: Frequency range in Hz (e.g., 0.1 2.0)
--device, -d: Device to use (cpu/cuda)
--verbose, -v: Enable verbose output

🐛 Troubleshooting

Common Issues

"No motion detected": Try increasing amplification_factor or adjusting frequency_range
"Video appears frozen": Check if motion is within the specified frequency range
"Out of memory": Reduce video resolution or process in smaller chunks
"Poor quality output": Ensure input video has sufficient motion and good lighting
"CUDA not available": Install PyTorch with CUDA support or use CPU processing

Debug Mode

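from angstrom.core.motion_amplifier import MotionAmplifier

# Load and decompose a video so the intermediate state can be inspected
amplifier = MotionAmplifier()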
amplifier.load_video("input.mp4")
amplifier.process()

# Check pyramid coefficients
print(f"Number of frames: {len(amplifier.pyramid_coeffs)}")
print(f"Pyramid structure: {type(amplifier.pyramid_coeffs[0])}")

Performance Tips

Use GPU acceleration when available (install with pip install angstrom[gpu])
Process videos in smaller chunks for large files
Adjust frequency range to match expected motion
Use appropriate amplification factors (start with 10-50)
Monitor memory usage for large videos
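
When the expected motion frequency is unknown, one way to pick a frequency range is to look at the spectrum of a brightness trace taken from the region of interest. The sketch below runs on synthetic data; `dominant_frequency` is a hypothetical helper and not an Angstrom function, and extracting the trace from a real video is left out.

```python
import numpy as np


def dominant_frequency(trace, fps):
    """Return the strongest non-DC temporal frequency (Hz) in a 1D trace."""
    trace = np.asarray(trace, dtype=float)
    spectrum = np.abs(np.fft.rfft(trace - trace.mean()))
    freqs = np.fft.rfftfreq(trace.size, d=1.0 / fps)
    return float(freqs[np.argmax(spectrum)])


fps = 30
t = np.arange(300) / fps
# Synthetic stand-in for the mean brightness of a region over time.
trace = 128 + 0.2 * np.sin(2 * np.pi * 1.2 * t)
peak_hz = dominant_frequency(trace, fps)
```

A band centred on `peak_hz`, for instance `(peak_hz - 0.2, peak_hz + 0.2)`, is then a reasonable starting value for `frequency_range`.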

📚 Documentation

Full Documentation: https://levi2234.github.io/Angstrom/
API Reference: https://levi2234.github.io/Angstrom/modules.html
Examples: See the examples/ directory

🧪 Testing

Run the test suite:

# Install development dependencies
pip install -e ".[dev]"

# Run all tests
pytest

# Run specific test categories
pytest -m "unit"           # Unit tests only
pytest -m "integration"    # Integration tests
pytest -m "gpu"           # GPU tests
pytest -m "video"         # Video processing tests

🤝 Contributing

We welcome contributions! Please see our Contributing Guide for details.

Development Setup

# Clone the repository
git clone https://github.com/levi2234/Angstrom.git
cd Angstrom

# Install development dependencies
pip install -e ".[dev]"

# Run tests
pytest

# Run linting
flake8 src/
black src/
pylint src/

Code Quality

The project uses several tools to maintain code quality:

Black: Code formatting
Flake8: Linting
Pylint: Static analysis
Pytest: Testing
Pre-commit: Git hooks

📄 License

This project is licensed under the MIT License - see the LICENSE file for details.

🙏 Acknowledgments

Complex Steerable Pyramids: Based on the implementation from PyTorchSteerablePyramid
Motion Amplification Theory: Inspired by the work of Wadhwa et al. on phase-based video motion processing, which builds on Eulerian Video Magnification (Wu et al.)
PyTorch: For efficient GPU-accelerated computation

📞 Support

Issues: GitHub Issues
Discussions: GitHub Discussions

Made with ❤️ for the computer vision community

Generating Documentation Locally

Install the package with documentation dependencies:
```
pip install -e ".[docs]"
```
Generate the documentation:
```
cd docs
make html
```
View the documentation by opening docs/_build/html/index.html in your browser.
