Thanks to visit codestin.com
Credit goes to github.com

Skip to content

b6868

  • b6868
  • 85a7d86
  • Verified

    This commit was created on GitHub.com and signed with GitHub’s verified signature.
  • Choose a tag to compare

  • b6868
  • 85a7d86
  • Choose a tag to compare

  • Verified

    This commit was created on GitHub.com and signed with GitHub’s verified signature.
@ggerganov ggerganov tagged this 28 Oct 18:19
* memory : remove KV cache size padding

* cont : restore padding for n_kv tensor shape

* server : use slot context size instead of training context size

* server : simplify context limit logic
Assets 2
Loading