fix: use max_completion_tokens for GPT-5 models in LiteLLM provider#6980
Conversation
- GPT-5 models require `max_completion_tokens` instead of the deprecated `max_tokens` parameter
- Added detection for GPT-5 model variants (`gpt-5`, `gpt5`, `GPT-5`, etc.)
- Updated both `createMessage` and `completePrompt` methods to handle GPT-5 models
- Added comprehensive tests for GPT-5 model handling

Fixes #6979
src/api/providers/lite-llm.ts
Outdated
```ts
let maxTokens: number | undefined = info.maxTokens ?? undefined

// Check if this is a GPT-5 model that requires max_completion_tokens instead of max_tokens
const isGPT5Model = modelId.toLowerCase().includes("gpt-5") || modelId.toLowerCase().includes("gpt5")
```
Consider extracting the GPT-5 model detection logic into a shared helper function. This logic (using modelId.toLowerCase().includes('gpt-5') || modelId.toLowerCase().includes('gpt5')) appears in both createMessage and completePrompt, and centralizing it would improve maintainability.
This comment was generated because it violated a code review rule: irule_tTqpIuNs8DV0QFGj.
src/api/providers/lite-llm.ts
Outdated
```ts
let maxTokens: number | undefined = info.maxTokens ?? undefined

// Check if this is a GPT-5 model that requires max_completion_tokens instead of max_tokens
const isGPT5Model = modelId.toLowerCase().includes("gpt-5") || modelId.toLowerCase().includes("gpt5")
```
The model detection logic could be more precise. Currently, modelId.toLowerCase().includes("gpt-5") would match unintended models like "not-gpt-5000". Consider using a more specific pattern:
Suggested change:

```diff
- const isGPT5Model = modelId.toLowerCase().includes("gpt-5") || modelId.toLowerCase().includes("gpt5")
+ // Check if this is a GPT-5 model that requires max_completion_tokens instead of max_tokens
+ const modelLower = modelId.toLowerCase()
+ const isGPT5Model = modelLower.startsWith("gpt-5") || modelLower.startsWith("gpt5") || modelLower === "gpt5"
```
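For illustration, a quick sketch of how the two checks differ on a few hypothetical model IDs (the IDs below are made up, not from this PR):

```ts
// Hypothetical model IDs used only to illustrate the matching behavior
const ids = ["gpt-5", "gpt-5-turbo", "not-gpt-5000", "my-gpt5-proxy"]

// Broad check: matches the substring anywhere in the ID
const broad = (id: string) => id.toLowerCase().includes("gpt-5") || id.toLowerCase().includes("gpt5")

// Narrow check: the ID must start with the GPT-5 prefix
const narrow = (id: string) => {
	const lower = id.toLowerCase()
	return lower.startsWith("gpt-5") || lower.startsWith("gpt5")
}

for (const id of ids) {
	console.log(id, { broad: broad(id), narrow: narrow(id) })
}
// "not-gpt-5000" and "my-gpt5-proxy" pass the broad check but fail the narrow one
```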
src/api/providers/lite-llm.ts
Outdated
```ts
const { id: modelId, info } = await this.fetchModel()

// Check if this is a GPT-5 model that requires max_completion_tokens instead of max_tokens
const isGPT5Model = modelId.toLowerCase().includes("gpt-5") || modelId.toLowerCase().includes("gpt5")
```
This detection logic is duplicated from line 111. Would it be cleaner to extract this into a helper method to maintain DRY principles? Something like:
```ts
private isGPT5Model(modelId: string): boolean {
	const modelLower = modelId.toLowerCase()
	return modelLower.startsWith("gpt-5") || modelLower.startsWith("gpt5") || modelLower === "gpt5"
}
```
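Both `createMessage` and `completePrompt` could then call `this.isGPT5Model(modelId)`, keeping the detection rule in one place.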
src/api/providers/lite-llm.ts
Outdated
```ts
// GPT-5 models require max_completion_tokens instead of the deprecated max_tokens parameter
if (isGPT5Model && maxTokens) {
	// @ts-ignore - max_completion_tokens is not in the OpenAI types yet but is supported
```
Is there a way to avoid using @ts-ignore here? Could we extend the OpenAI types or create a custom interface that includes max_completion_tokens to maintain type safety? For example:
```ts
interface GPT5RequestOptions extends Omit<OpenAI.Chat.Completions.ChatCompletionCreateParamsStreaming, 'max_tokens'> {
	max_completion_tokens?: number
	max_tokens?: never
}
```
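A possible usage sketch with that interface (hypothetical; `openAiMessages` and `maxTokens` stand in for the handler's local variables):

```ts
// Sketch: typing the request with the extended interface avoids @ts-ignore
const requestOptions: GPT5RequestOptions = {
	model: modelId,
	messages: openAiMessages,
	stream: true,
	max_completion_tokens: maxTokens,
}
```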
src/api/providers/__tests__/lite-llm.spec.ts

```ts
})

it("should use max_completion_tokens for various GPT-5 model variations", async () => {
	const gpt5Variations = ["gpt-5", "gpt5", "GPT-5", "gpt-5-turbo", "gpt5-preview"]
```
Great test coverage! Consider adding edge cases like mixed case variations ("GpT-5", "gPt5") or models with additional suffixes ("gpt-5-32k", "gpt-5-vision") to ensure the detection works correctly for all possible GPT-5 model names.
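For instance (a sketch, not the PR's actual test data):

```ts
const gpt5Variations = [
	"gpt-5", "gpt5", "GPT-5", "gpt-5-turbo", "gpt5-preview",
	"GpT-5", "gPt5", // mixed case
	"gpt-5-32k", "gpt-5-vision", // additional suffixes
]
```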
This can be merged after #7067 since this PR will update the OpenAI SDK, allowing us to use it without the type error suppression comments.
Pull Request Overview
This PR fixes an issue where GPT-5 models fail with LiteLLM due to using the deprecated max_tokens parameter instead of max_completion_tokens. The fix adds detection for GPT-5 model variants and conditionally uses the correct parameter based on the model type.
- Adds GPT-5 model detection logic to identify models requiring `max_completion_tokens`
- Updates both `createMessage` and `completePrompt` methods to use the appropriate parameter
- Maintains backward compatibility for non-GPT-5 models
Reviewed Changes
Copilot reviewed 2 out of 2 changed files in this pull request and generated 3 comments.
| File | Description |
|---|---|
| `src/api/providers/lite-llm.ts` | Implements GPT-5 model detection and conditional parameter usage |
| `src/api/providers/__tests__/lite-llm.spec.ts` | Adds comprehensive test coverage for GPT-5 handling and refactors mocking |
src/api/providers/lite-llm.ts
Outdated
```ts
let maxTokens: number | undefined = info.maxTokens ?? undefined

// Check if this is a GPT-5 model that requires max_completion_tokens instead of max_tokens
const isGPT5Model = modelId.toLowerCase().includes("gpt-5") || modelId.toLowerCase().includes("gpt5")
```
The GPT-5 model detection logic is duplicated between the createMessage and completePrompt methods. Consider extracting this into a private helper method to improve maintainability and ensure consistency.
src/api/providers/lite-llm.ts
Outdated
```ts
// @ts-ignore - max_completion_tokens is not in the OpenAI types yet but is supported
requestOptions.max_completion_tokens = maxTokens
```
Using @ts-ignore to suppress TypeScript errors for max_completion_tokens is not ideal. Consider using type assertion with a more specific interface or extending the OpenAI types to include this property for better type safety.
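One narrow alternative, as a sketch (the real shape of `requestOptions` depends on the handler):

```ts
// Assert only the property being added instead of suppressing the checker
type WithMaxCompletionTokens = { max_completion_tokens?: number }
;(requestOptions as WithMaxCompletionTokens).max_completion_tokens = maxTokens
```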
src/api/providers/lite-llm.ts
Outdated
```ts
// @ts-ignore - max_completion_tokens is not in the OpenAI types yet but is supported
requestOptions.max_completion_tokens = info.maxTokens
```
This is a duplicate of the previous @ts-ignore comment. The same type safety concerns apply here - consider using a consistent approach to handle the missing type definition.
This PR fixes the issue where GPT-5 models fail with LiteLLM due to using the deprecated `max_tokens` parameter instead of `max_completion_tokens`.

Problem
When using GPT-5 models with LiteLLM, users encounter the error:
Solution
- Updated `createMessage` and `completePrompt` methods to use `max_completion_tokens` for GPT-5 models (see the sketch below)
- Maintains backward compatibility with `max_tokens` for non-GPT-5 models
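In outline, the conditional selection looks like this (a sketch distilled from the diff excerpts above, not the exact patch):

```ts
// Pick the token-limit parameter the target model accepts
if (isGPT5Model && maxTokens) {
	// GPT-5 rejects max_tokens; send max_completion_tokens instead
	requestOptions.max_completion_tokens = maxTokens
} else if (maxTokens) {
	// All other models keep the existing max_tokens behavior
	requestOptions.max_tokens = maxTokens
}
```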
Testing

- Non-GPT-5 models continue to use `max_tokens` as before

Fixes #6979
Important
Fixes GPT-5 model issue in LiteLLM by using `max_completion_tokens` instead of `max_tokens`, with tests for various model names.

- Fixes GPT-5 model handling in `LiteLLMHandler` by using `max_completion_tokens` instead of deprecated `max_tokens`.
- Updates `createMessage` and `completePrompt` methods in `lite-llm.ts`; non-GPT-5 models continue to use `max_tokens`.
- Adds tests in `lite-llm.spec.ts` for GPT-5 model handling, including that non-GPT-5 models still use `max_tokens`.
- Adds `isGpt5()` function to detect GPT-5 model variants in `lite-llm.ts`.

This description was created automatically for 822886b and will update as commits are pushed.