Thanks to visit codestin.com
Credit goes to github.com

Skip to content

Conversation

@njhill
Copy link
Member

@njhill njhill commented Nov 15, 2025

Small fix, took a while to track down πŸ˜…

@njhill njhill added the bug Something isn't working label Nov 15, 2025
@mergify mergify bot added the v1 label Nov 15, 2025
@njhill njhill added this to the v0.11.1 milestone Nov 15, 2025
Copy link
Contributor

@gemini-code-assist gemini-code-assist bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Code Review

This pull request addresses a bug related to async scheduling, chunked prefill, and preemption. The core of the fix is in vllm/v1/core/sched/scheduler.py, where previously only a slice of token IDs was being sent for requests undergoing chunked prefill. This was incorrect as the connector requires the full list of tokens for prefix matching. The change correctly sends a full copy of the token IDs. To facilitate this, a copy() method was added to the ConstantList class. The re-enabling of previously skipped tests in tests/v1/e2e/test_async_scheduling.py validates the fix. The changes are correct, well-targeted, and I have no further recommendations.

@njhill njhill mentioned this pull request Nov 15, 2025
16 tasks
@njhill njhill added the ready ONLY add when PR is ready to merge/full CI is needed label Nov 15, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

bug Something isn't working ready ONLY add when PR is ready to merge/full CI is needed v1

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant