Thanks to visit codestin.com
Credit goes to github.com

Skip to content
This repository was archived by the owner on Jun 3, 2025. It is now read-only.

[Cherry-Pick][Text Generation] Terminate the inference when kv cache is full#1447

Merged
rahul-tuli merged 9 commits into
release/1.6from
cp/kv_cache_full
Dec 1, 2023
Merged

[Cherry-Pick][Text Generation] Terminate the inference when kv cache is full#1447
rahul-tuli merged 9 commits into
release/1.6from
cp/kv_cache_full

Conversation

@dbogunowicz

Copy link
Copy Markdown
Contributor

Cherry-pick for #1446

Comment thread src/deepsparse/transformers/pipelines/text_generation.py Outdated
Comment thread src/deepsparse/transformers/pipelines/text_generation.py Outdated
dsikka
dsikka previously requested changes Dec 1, 2023

@dsikka dsikka left a comment

Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Could we add a test case to catch this case?

@tlrmchlsmth

Copy link
Copy Markdown
Member

Could we add a test case to catch this case?

+1

@rahul-tuli rahul-tuli left a comment

Copy link
Copy Markdown
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Approving to unblock release, @dbogunowicz will track this and add tests in main

@rahul-tuli rahul-tuli dismissed stale reviews from dsikka and tlrmchlsmth December 1, 2023 18:18

Damian will add tests in main, landing this to unblock QA

@rahul-tuli rahul-tuli merged commit e94dcac into release/1.6 Dec 1, 2023
@rahul-tuli rahul-tuli deleted the cp/kv_cache_full branch December 1, 2023 18:18
Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

5 participants