@jgreer013 jgreer013 commented Jun 26, 2025

Description

Now that we have an AdaptiveSemaphore, we need a way to track successful and failed requests and to use that information to adjust our inference concurrency.

The adaptive concurrency controller handles this: it can be used as a semaphore, and it also exposes methods for recording successes and failures. The controller adjusts the semaphore based on a computed error rate, changing the concurrency only after a certain number of requests have been recorded and a certain amount of time has passed.

Depending on the error rate, the controller reduces or increases concurrency within set limits, then waits to see how subsequent requests fare before making further changes.
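
The behavior described above can be sketched roughly as follows. This is a minimal illustration, not the code in this PR: the class name `AdaptiveConcurrencySketch`, every parameter name and default, and the specific policy (halve on a high error rate, add one otherwise) are assumptions made for the example. The semaphore side of the controller is left out, so only the bookkeeping and the adjustment rule are shown.

```python
import time


class AdaptiveConcurrencySketch:
    """Hypothetical sketch: adjust a concurrency limit from an observed error rate."""

    def __init__(
        self,
        initial=8,
        min_limit=1,
        max_limit=32,
        error_threshold=0.1,   # assumed: error rate above this triggers a reduction
        min_requests=10,       # assumed: samples required before any adjustment
        min_interval_s=0.0,    # assumed: minimum time between adjustments
        backoff=0.5,           # assumed: multiplicative decrease factor
        step=1,                # assumed: additive increase amount
        clock=time.monotonic,
    ):
        self.limit = initial
        self.min_limit = min_limit
        self.max_limit = max_limit
        self.error_threshold = error_threshold
        self.min_requests = min_requests
        self.min_interval_s = min_interval_s
        self.backoff = backoff
        self.step = step
        self._clock = clock
        self._successes = 0
        self._failures = 0
        self._last_change = clock()

    def record_success(self):
        self._successes += 1
        self._maybe_adjust()

    def record_failure(self):
        self._failures += 1
        self._maybe_adjust()

    def _maybe_adjust(self):
        total = self._successes + self._failures
        now = self._clock()
        # Wait until enough requests have been seen and enough time has passed.
        if total < self.min_requests:
            return
        if now - self._last_change < self.min_interval_s:
            return
        error_rate = self._failures / total
        if error_rate > self.error_threshold:
            # Too many failures: back off, but never below the lower limit.
            self.limit = max(self.min_limit, int(self.limit * self.backoff))
        else:
            # Requests are healthy: probe upward, but never above the upper limit.
            self.limit = min(self.max_limit, self.limit + self.step)
        # Start a fresh observation window after every adjustment.
        self._successes = 0
        self._failures = 0
        self._last_change = now
```

In this sketch a caller would invoke `record_success()` or `record_failure()` after each request and read `limit` to size its semaphore. Resetting the counters after each change is one way to make the controller wait for fresh evidence before adjusting again.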

Related issues

Towards OPE-1307

Before submitting

  • This PR only changes documentation. (You can ignore the following checks in that case)
  • Did you read the Pull Request guidelines in the contributor guideline?
  • Did you link the issue(s) related to this PR in the section above?
  • Did you add / update tests where needed?

Reviewers

At least one review from a member of oumi-ai/oumi-staff is required.

@jgreer013 jgreer013 changed the title from "Add adaptive throughput controller in preparation for adaptive inference" to "Add adaptive concurrency controller in preparation for adaptive inference" Jun 26, 2025
@jgreer013 jgreer013 requested a review from taenin June 26, 2025 21:34
@jgreer013 jgreer013 requested a review from taenin June 30, 2025 17:47
@jgreer013 jgreer013 merged commit 3226a8b into main Jul 1, 2025
5 checks passed
@jgreer013 jgreer013 deleted the jgreer013/adaptive-inference branch July 1, 2025 17:37
2 participants