License: Apache 2.0

Latest news

ArcticInference

ArcticInference is a new library from Snowflake AI Research that contains current and future LLM inference optimizations developed at Snowflake. It is integrated with vLLM v0.8.1 using vLLM’s custom plugin feature, allowing us to develop and integrate inference optimizations quickly into vLLM and make them available to the community.

Once installed, ArcticInference automatically patches vLLM to use Arctic Ulysses and other optimizations implemented in ArcticInference, and users can continue to use their familiar vLLM APIs and CLI. It’s easy to get started!
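As a rough illustration of "familiar vLLM APIs", here is a minimal sketch of vLLM's standard offline-inference interface (`LLM`, `SamplingParams`). The helper name `generate_texts`, the model name, and the sampling values are placeholders chosen for this example, not ArcticInference defaults, and no ArcticInference-specific options are shown because this README does not list any.

```python
from typing import List


def generate_texts(prompts: List[str], model: str = "facebook/opt-125m") -> List[str]:
    """Generate one completion per prompt with vLLM's offline API.

    `generate_texts` and the default model are hypothetical choices for
    illustration only; any model supported by vLLM should work.
    """
    # Imported lazily so this module can be imported on machines without vLLM.
    from vllm import LLM, SamplingParams

    llm = LLM(model=model)
    params = SamplingParams(temperature=0.8, max_tokens=64)
    outputs = llm.generate(prompts, params)
    # Each RequestOutput holds a list of completions; take the first one.
    return [out.outputs[0].text for out in outputs]
```

On a machine with vLLM and a supported GPU, calling `generate_texts(["Hello, my name is"])` returns a list containing one generated string.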

Installation

$ pip install "arctic-inference[vllm]"
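After installation, one way to check that a vLLM plugin is discoverable is to list the entry points vLLM scans at startup. The sketch below uses only the Python standard library. The group name `vllm.general_plugins` is the one described in vLLM's plugin documentation; it is an assumption that ArcticInference registers under that group, and the helper name `list_plugins` is made up for this example.

```python
from importlib.metadata import entry_points
from typing import List

# Entry-point group described in vLLM's plugin documentation.
# Assumption: ArcticInference registers its plugin under this group.
VLLM_PLUGIN_GROUP = "vllm.general_plugins"


def list_plugins(group: str = VLLM_PLUGIN_GROUP) -> List[str]:
    """Return 'name -> module:attr' strings for entry points in `group`."""
    eps = entry_points()
    # Newer Pythons expose .select(); older ones return a dict of lists.
    if hasattr(eps, "select"):
        selected = eps.select(group=group)
    else:
        selected = eps.get(group, [])
    return sorted(f"{ep.name} -> {ep.value}" for ep in selected)


# Prints nothing if no package has registered a plugin in this group.
for line in list_plugins():
    print(line)
```

An empty result means no installed package registers an entry point in that group, which may simply indicate that the assumed group name does not match what the package uses.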

Projects

To better understand which features ArcticInference supports, please refer to the following list of projects we have released under this framework:

