Add min_tokens argument #240
Conversation
Signed-off-by: Antoni Baum <[email protected]>
@abetlen, could you review when you have a moment? Thank you!
Hey @Yard1, sorry to take so long to reply. The issue I see with this at the moment is that the stop token is still being generated and appended to the eval_logits and eval_tokens internally, which will probably cause some kind of issue in generation. What you really want is a min_tokens inside of generate, or something that essentially ignores / sets the EOS token probability to 0 until a certain number of tokens have been generated. It should probably also be noted that fewer than min_tokens tokens may be returned if, e.g., another stop criterion such as a stop sequence is encountered.
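The masking approach described above (forcing the EOS probability to zero until enough tokens have been generated) could be sketched roughly as follows. This is only an illustration: the function name, the `eos_token_id` parameter, and the callback signature are all hypothetical, not the library's actual API.

```python
import math

def make_min_tokens_processor(min_tokens, eos_token_id, prompt_len):
    """Build a logits-processor-style callback (hypothetical interface).

    Until at least `min_tokens` tokens have been generated past the
    prompt, the EOS token's logit is forced to -inf so it can never
    be sampled; afterwards the logits pass through unchanged.
    """
    def processor(input_ids, scores):
        generated = len(input_ids) - prompt_len
        if generated < min_tokens:
            scores = list(scores)
            scores[eos_token_id] = -math.inf  # mask EOS out entirely
        return scores
    return processor
```

With this shape, the sampler simply calls the processor on each step's logits before sampling, so no EOS token is ever produced (or appended to internal state) before the minimum length is reached.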
The proposed change would be:
Got it, thanks! Let me take a look at this.
Hmm, now that I think about it, this can be easily implemented through a …
Also start adding prompts in "./prompts"
Any updates on this?
Force-pushed from 8c93cf8 to cc0fe43
@Yard1 we've added the parameter to the server and it's available in the python API via a new …
This PR adds a `min_tokens` argument to complement `max_tokens`. When set, the EOS token will be discarded until `min_tokens` tokens have been generated.
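As a toy illustration of the behavior the description states, discarding EOS samples until the minimum length is reached, a generation loop might look like the sketch below. This is not the PR's actual code; the sampling callback and EOS id are made up for the example.

```python
# Toy sketch: an EOS sample is discarded (and sampling continues) until
# at least min_tokens tokens have been generated. All names hypothetical.
EOS = 0

def generate(sample_fn, max_tokens, min_tokens):
    tokens = []
    while len(tokens) < max_tokens:
        tok = sample_fn()
        if tok == EOS:
            if len(tokens) < min_tokens:
                continue  # discard EOS: minimum length not yet reached
            break  # EOS accepted once min_tokens is satisfied
        tokens.append(tok)
    return tokens
```

Note that, as discussed in the thread, simply discarding a sampled EOS leaves open the question of what the model's internal state (eval_logits / eval_tokens) should contain, which is why masking the EOS logit before sampling was preferred.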