This repository was archived by the owner on Jun 3, 2025. It is now read-only.

[Text Generation][Enhancement] If prompt_processing_sequence_length == 1, do not initialize multitoken_engine #1214

Merged
dbogunowicz merged 1 commit into main from feature/damian/prompt_processing_one on Aug 30, 2023

Conversation

@dbogunowicz dbogunowicz commented Aug 29, 2023

Contributor

A simple improvement that streamlines the pipeline.

If prompt_processing_sequence_length == 1, we are essentially running single-token prompt prefill, so we should not initialize or run the additional multitoken_engine.
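
For illustration only, the special case might be sketched as below. This is not the code from the PR: `StubEngine` and `initialize_engines` are hypothetical names standing in for whatever the pipeline actually uses, and the sketch assumes the single-token engine is always required while the multi-token engine is optional.

```python
from dataclasses import dataclass
from typing import Optional, Tuple


@dataclass
class StubEngine:
    """Hypothetical stand-in for an inference engine (not a DeepSparse class)."""

    input_ids_length: int


def initialize_engines(
    prompt_processing_sequence_length: int,
) -> Tuple[StubEngine, Optional[StubEngine]]:
    """Return (single_token_engine, multitoken_engine or None)."""
    if prompt_processing_sequence_length < 1:
        raise ValueError("prompt_processing_sequence_length must be >= 1")

    # Assumption: the single-token engine is always built, since it is
    # needed for autoregressive decoding regardless of the prefill setting.
    engine = StubEngine(input_ids_length=1)

    # With a sequence length of 1, prompt prefill is already single-token,
    # so a second engine would only duplicate the first one.
    multitoken_engine = None
    if prompt_processing_sequence_length > 1:
        multitoken_engine = StubEngine(
            input_ids_length=prompt_processing_sequence_length
        )

    return engine, multitoken_engine
```

Callers would then need to treat `multitoken_engine` as optional and fall back to the single-token engine for prefill when it is `None`.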

@dbogunowicz dbogunowicz changed the title from "Feature/damian/prompt processing one" to "[Text Generation][Enhancement] If prompt_processing_sequence_length == 1, do not initialize multitoken_engine" on Aug 29, 2023
Comment thread src/deepsparse/transformers/pipelines/text_generation.py

@mgoin mgoin left a comment

Member

awesome! a special case i'll take advantage of :)

@dbogunowicz dbogunowicz merged commit d8b63da into main Aug 30, 2023
@dbogunowicz dbogunowicz deleted the feature/damian/prompt_processing_one branch August 30, 2023 16:03