
Conversation

RiddleHe

We identified a bug in parser.py: the default stop token for model generation is "###":

parser.add_argument(
    "--stop",
    default="###",
    type=str,
    help="Stop token (use `,` to separate multiple tokens)",
)

This cuts off the model response prematurely. For example, in our evaluations we found that Qwen3-8B often outputs "### Solution code", so everything from that heading onward, including the solution code itself, is discarded.
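To illustrate the effect, here is a minimal sketch. It assumes generation is truncated at the first occurrence of any stop string, which is how stop sequences usually behave. `truncate_at_stop` is a hypothetical helper and the response text is made up for illustration; neither comes from the repository.

```python
def truncate_at_stop(text, stop_tokens):
    """Cut text at the earliest occurrence of any non-empty stop string."""
    cut = len(text)
    for tok in stop_tokens:
        if not tok:
            continue  # ignore empty stop strings
        idx = text.find(tok)
        if idx != -1:
            cut = min(cut, idx)
    return text[:cut]


# Made-up response in the style described above (assumption, not real model output).
response = "Here is my approach.\n\n### Solution code\ndef solve():\n    return 42\n"

truncated = truncate_at_stop(response, ["###"])  # default stop token
untouched = truncate_at_stop(response, [])       # no stop tokens configured
```

With the "###" default, `truncated` ends before the heading, so the code that follows never reaches the evaluator. With no stop tokens, the response is returned unchanged.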

Currently, this feature is not disabled for the vLLM runner, which is the default entry point for most open-source models.

We changed the default value of the flag to None and also handled the None case in the subsequent logic.
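A rough sketch of the idea, not the exact diff: it assumes the stop string is split on commas somewhere after parsing. `build_parser` and `parse_stop` are hypothetical names used only for this example.

```python
import argparse


def build_parser():
    parser = argparse.ArgumentParser()
    parser.add_argument(
        "--stop",
        default=None,  # no stop token unless the user asks for one
        type=str,
        help="Stop token (use `,` to separate multiple tokens)",
    )
    return parser


def parse_stop(raw):
    """Turn the raw --stop value into a list of stop strings (hypothetical helper)."""
    if raw is None:
        return []
    # Drop empty pieces so that "" does not become an empty stop string.
    return [tok for tok in raw.split(",") if tok]


# No --stop flag given: the default None maps to an empty list.
stop_default = parse_stop(build_parser().parse_args([]).stop)

# Explicit comma-separated stop tokens still work as before.
stop_custom = parse_stop(build_parser().parse_args(["--stop", "###,<END>"]).stop)
```

Filtering out empty pieces also means that passing an empty string behaves the same as omitting the flag in this sketch.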

@Naman-ntc
Contributor

These are default values and not the recommended values (especially for newer models). Please pass an empty string "" for the stop argument.

@RiddleHe
Author

Thanks for the info. We have talked to a few labs, and none of them were aware of this default value. As a result, many of them fail to reproduce paper results because they simply accept the default. Since it is fairly safe to assume that most people do not know that "###" is the default, we suggest changing it to make things easier for researchers!

@acsezen

acsezen commented Sep 14, 2025

> These are default values and not the recommended values (especially for newer models). Please pass an empty string "" for the stop argument.

Will anything break if the default value is left empty?
