A transparent hooking library for libcuda and libnvidia-ml.
CFN-Cloud (in development...)
- Build the builder image:

```shell
bash ./hack/build-builder.sh
```

- Build the library:

```shell
bash ./hack/build-via-docker.sh
```
```shell
# use env
export LD_PRELOAD=/path/to/libvcuda-hook.so
export VCUDA_LOG_LEVEL=debug
export VCUDA_MEMORY_LIMIT=$((1024 * 1024 * 1024 * 10))  # limit to 10 GiB

# manual
your_application

# or use docker
docker run -it --gpus all --rm \
  -v /path/to/libvcuda-hook.so:/usr/lib64/libvcuda-hook.so \
  -e LD_PRELOAD=/usr/lib64/libvcuda-hook.so \
  vllm/vllm-openai:latest bash
```
- ✅ Minimal Performance Overhead
- ✅ Fractional GPU Usage
- ✅ Fine-grained GPU Memory Control
- ✅ Multi-Process GPU Memory Unified Control
- ✅ Container GPU Sharing
- ☐ Kubernetes Support
- ...
- ☐ Remote GPU Call Over Network
- ☐ Oversub GPU Memory Control
- ☐ GPU Task Hot Snapshot
- ...
I developed this project out of several core motivations:
- Personal Technical Interest and Professional Needs: An interest in GPU virtualization technology and CUDA programming, combined with related requirements encountered in practical work
- Open Architecture: Provide an open-source solution that allows the community to participate in improvements and feature extensions
- High Scalability: Design a flexible architecture that supports various GPU virtualization scenarios, including GPU resource sharing in containerized environments
- Dynamic Controllability: Implement runtime dynamic configuration and management capabilities, allowing GPU resource allocation adjustments based on demand
- Transparent Proxy Layer: Serve as a transparent proxy for CUDA dynamic libraries, enabling GPU virtualization functionality without modifying existing applications
This project aims to provide a simple and easy-to-use GPU virtualization solution for containerized environments, enabling safe and efficient sharing of GPU resources among multiple containers.