Popular repositories
- ParallelBench Public
[ICLR 2026] ParallelBench: Understanding the Tradeoffs of Parallel Decoding in Diffusion LLMs
- eta-inversion Public
[ECCV 2024] Official PyTorch Implementation for "Eta Inversion: Designing an Optimal Eta Function for Diffusion-based Real Image Editing"
- draft-based-approx-llm Public
[ICLR 2026] Draft-based Approximate Inference for LLMs
Repositories
- furiosa-perf Public
- furiosa-rngd-validator Public
- llm-compressor-compression-part Public Forked from vllm-project/llm-compressor
Transformers-compatible library for applying various compression algorithms to LLMs for optimized deployment with vLLM
- chrometracer Public
- EfficientRollout Public
- cocotbext-fcov Public
- furiosa-opt Public
- vllm-compression-part Public Forked from vllm-project/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
- VLMEvalKit Public Forked from open-compass/VLMEvalKit
Open-source evaluation toolkit of large multi-modality models (LMMs), support 220+ LMMs, 80+ benchmarks