-
RRL Public
Code for the paper "Swift Machine Learning Model Serving Scheduling: A Region Based Reinforcement Learning Approach"
-
vllm Public
Forked from vllm-project/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
Python Apache License 2.0 Updated Jul 17, 2024
-
FasterTransformer-cutlass_kernels Public
Forked from NVIDIA/FasterTransformer
Cuda Apache License 2.0 Updated Jul 17, 2023
-
DeepSpeed Public
Forked from deepspeedai/DeepSpeed
DeepSpeed is a deep learning optimization library that makes distributed training easy, efficient, and effective.
Python MIT License Updated Jan 11, 2023
-
DeepSpeedExamples Public
Forked from deepspeedai/DeepSpeedExamples
Example models using DeepSpeed
Python MIT License Updated Jun 27, 2022
-
SimiGrad Public
Forked from SimiGrad/SimiGrad
Public Code for NIPS submission SimiGrad: Fine-Grained Adaptive Batching for Large Scale Training using Gradient Similarity Measurement
-
tfevent_merger Public
A helper script to merge tfevent files and correct the relative time. Useful when the training was interrupted and restarted.
Python Updated Jan 6, 2021