add SCBench
V0.1.5.post1: Support LLaMA-3-70B, multi-GPU, fix kernel / sqrt(dk)
V0.1.5: Support LLaMA 3.1
V0.1.4.post4: Hotfix vLLM >= 0.4.1
V0.1.4.post3: remove flash_attn dependency
V0.1.4.post2: support multi-GPU, remove pycuda
V0.1.4.post1: support other vLLM versions
V0.1.4: hotfix config in pip
V0.1.3: add bdist cache
V0.1.2: Hotfix pip setup