Thanks to visit codestin.com
Credit goes to github.com

Skip to content

huggingface/deep-learning-containers

 
 

Repository files navigation

AWS Logo

AWS Deep Learning Containers

One stop shop for running AI/ML on AWS

Docs · Available Images · Tutorials

Auto Release - vLLM EC2 Auto Release - vLLM SageMaker Auto Release - vLLM-Omni Auto Release - Ray Auto Release - SGLang EC2 Auto Release - SGLang SageMaker


About

AWS Deep Learning Containers (DLCs) are pre-built Docker images for running AI/ML workloads on AWS. Each image is tested and patched for security vulnerabilities. For more details, visit our documentation.


🔥 What's New

🚀 Release Highlights

  • [2026/05/30] vLLM v0.22.0 — EC2: 0.22.0-gpu-py312-ec2 · SageMaker: 0.22.0-gpu-py312 · MiniCPM-V 4.6, InternS2 Preview, OpenVLA, EXAONE-4.5; DeepSeek V4 maturity (NVFP4 fused MoE, MTP speculative decoding); Blackwell SM12x support.
  • [2026/05/18] SGLang v0.5.12 — EC2: 0.5.12-gpu-py312-ec2 · SageMaker: 0.5.12-gpu-py312 · DeepSeek V4, Intern-S2-Preview, MiniCPM-V 4.6, Laguna-XS.2, Ring-2.6-1T, Gemma 4 MTP.
  • [2026/05/16] vLLM v0.21.0 — EC2: 0.21.0-gpu-py312-ec2 · SageMaker: 0.21.0-gpu-py312 · MiMo-V2.5, Laguna XS.2, Moondream3, Cohere MoE/Eagle; DeepSeek V4 on AMD + pipeline parallelism.
  • [2026/05/13] vLLM-Omni v0.20.0 — EC2: omni-cuda-v1.1 · SageMaker: omni-sagemaker-cuda-v1.1 · Adds /v1/audio/generate (stable-audio-open) and /v1/videos/sync (unblocks video on SageMaker); supports CosyVoice3, ERNIE-Image-Turbo, Wan2.1-VACE-1.3B; CUDA 13.0 + PyTorch 2.11.0.
  • [2026/05/11] vLLM v0.20.2 — EC2: 0.20.2-gpu-py312-ec2 · SageMaker: 0.20.2-gpu-py312 · Bug fixes for DeepSeek V4.
  • [2026/05/06] SGLang v0.5.11 — EC2: 0.5.11-gpu-py312-ec2 · SageMaker: 0.5.11-gpu-py312 · Model support for Gemma 4, GLM-5.1, Qwen 3.4, and more
  • [2026/05/05] vLLM v0.20.1 — EC2: 0.20.1-gpu-py312-ec2 · SageMaker: 0.20.1-gpu-py312 · Bug fixes for DeepSeek V4.
  • [2026/04/30] PyTorch v2.11.0 — EC2: 2.11.0-cu130-amzn2023 · SageMaker: 2.11.0-cu130-amzn2023-sagemaker · Amazon Linux 2023 with EFA, flash-attn, and transformer-engine.

📢 Support Updates

  • [2026/04/28] We cannot guarantee security patching on Ubuntu-based vLLM and SGLang images due to the lack of Ubuntu Pro licensing. Customers may continue using these images at their own discretion and risk. We recommend migrating to our Amazon Linux-based images.
  • [2026/02/10] Extended support for PyTorch 2.6 Inference containers until June 30, 2026
    • PyTorch 2.6 Inference images will continue to receive security patches and updates through end of June 2026
    • For complete framework support timelines, see our Support Policy

📝 Blog Posts

🎓 Workshop


License

This project is licensed under the Apache-2.0 License.

About

One stop shop for running AI/ML on AWS.

Resources

License

Code of conduct

Contributing

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors

Languages

  • Python 63.0%
  • Shell 33.3%
  • Dockerfile 3.6%
  • C 0.1%