Thanks to visit codestin.com
Credit goes to Github.com

Skip to content

bug: Thought Models (Like Qwen) stops working when generation is interrupted #7172

@Rivridis

Description

@Rivridis

Version: 0.7.5

Describe the Bug

When a thinking model's generation is interrupted (Qwen 3 8B used), the next response does not generate at times or the thought window does not show up. The response speed gets very slow and breaks the context management if the context memory is nearly full in this case.

Steps to Reproduce

  1. Ask a thinking model a few questions.
  2. Pause the generation while a question is being answered.
  3. Ask the thinking model another question. The model then breaks, or delays response.
  4. Repeat this with nearly full context memory.

Screenshots / Logs

Image

Operating System

  • Windows
  • MacOS
  • Linux

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions