How does one setup a global timeout to all requests? #7364

PLNech · 2020-07-21T08:44:52Z

PLNech
Jul 21, 2020

First check

Ticked all checks, then first commitment choice

I added a very descriptive title to this issue.
I used the GitHub search to find a similar issue and didn't find it.
I searched the FastAPI documentation, with the integrated search.
I already searched in Google "How to X in FastAPI" and didn't find any information.
I already read and followed all the tutorial in the docs and didn't find an answer.
I already checked if it is not related to FastAPI but to Pydantic.
I already checked if it is not related to FastAPI but to Swagger UI.
I already checked if it is not related to FastAPI but to ReDoc.
After submitting this, I commit to one of:
- Read open issues with questions until I find 2 issues where I can help someone and add a comment to help there.
- I already hit the "watch" button in this repository to receive notifications and I commit to help at least 2 people that ask questions in the future.
- Implement a Pull Request for a confirmed bug.

Description

Hi there, first of all many thanks for the work on FastAPI - this is now my goto framework for building Python-based REST APIs :)

My question is about adding a global timeout to any potential request served by the server. My use-case includes occasionally long loading times when I have to load a new model for a given user request, and instead of blocking for 30-50s (which would often timeout on the user side due to default connection timeouts), I would like to return a temporary error whenever any endpoint takes more than a given delay to complete.

Example

Today the only way I found to implement a timeout on every request is to wrap every endpoint method within a context manager like this one:

@contextmanager
def timeout_after(seconds: int):
    # Register a function to raise a TimeoutError on the signal.
    signal.signal(signal.SIGALRM, raise_timeout)
    # Schedule the signal to be sent after `seconds`.
    signal.alarm(seconds)

    try:
        yield
    finally:
        # Unregister the signal so it won't be triggered if the timeout is not reached.
        signal.signal(signal.SIGALRM, signal.SIG_IGN)

def raise_timeout(_, frame):
    raise TimeoutError

# Used as such:
@timeout_after(5)
@app.get("/1/version", tags=["Meta"],
         description="Check if the server is alive, returning the version it runs.",
         response_model=Version,
         response_description="the version of the API currently running.")
async def version() -> Version:
    return current_version

This is however quite cumbersome to add on every single function decorated as an endpoint.
Besides, it feels hacky: isn't there a better way to define app-level timeouts broadly, with a common handler, maybe akin to how ValidationErrors can be managed in a single global handler?

Environment

OS: [e.g. Linux / Windows / macOS]: Linux
FastAPI Version [e.g. 0.3.0]: 0.58.0
Python version: 3.7.7

Additional context

I looked into Starlette's timeout support to see if that was handled at a lower level. but to no avail.

Answered by ZionStage

Aug 28, 2020

Hey @PLNech

I have implemented and tested the middleware and it seems to be working fine for me. Here is my code

import asyncio
import time


import pytest

from fastapi import FastAPI, Request, Response, HTTPException
from fastapi.responses import JSONResponse
from httpx import AsyncClient
from starlette.status import HTTP_504_GATEWAY_TIMEOUT

REQUEST_TIMEOUT_ERROR = 1  # Threshold

app = FastAPI() # Fake app

# Creating a test path
@app.get("/test_path")
async def route_for_test(sleep_time: float) -> None:
    await asyncio.sleep(sleep_time)

# Adding a middleware returning a 504 error if the request processing time is above a certain threshold
@app.middleware("http")
async def timeout_…

View full answer

ZionStage · 2020-08-27T08:50:02Z

ZionStage
Aug 27, 2020

Hi @PLNech

I am developing my own API using FastAPI and ran into the same "problem" as I am trying to add a global timeout to all my requests.

I am still new to fastapi but from what I understand I believe the "fastapi" way to do so would be to use a middleware as they are designed to be ran at every request by nature. As I searched on how to do so I found this
gitter community thread and thought it could maybe help you.

I am going to implement both your solution and the middleware based one and see which one I prefer and works best. Also note that there seems to be a problem with starlette 0.13.3 and higher so keep that in mind.

Also if you found a workaround by now I am more than interested.

Hope it helped you a bit

0 replies

PLNech · 2020-08-27T10:03:19Z

PLNech
Aug 27, 2020
Author

Hi @ZionStage, thanks for your message! I haven't found a workaround for now. Looking forward to continuing this conversation with you as we move forward on this topic :)

0 replies

ZionStage · 2020-08-28T13:37:24Z

ZionStage
Aug 28, 2020

Hey @PLNech

I have implemented and tested the middleware and it seems to be working fine for me. Here is my code

import asyncio
import time


import pytest

from fastapi import FastAPI, Request, Response, HTTPException
from fastapi.responses import JSONResponse
from httpx import AsyncClient
from starlette.status import HTTP_504_GATEWAY_TIMEOUT

REQUEST_TIMEOUT_ERROR = 1  # Threshold

app = FastAPI() # Fake app

# Creating a test path
@app.get("/test_path")
async def route_for_test(sleep_time: float) -> None:
    await asyncio.sleep(sleep_time)

# Adding a middleware returning a 504 error if the request processing time is above a certain threshold
@app.middleware("http")
async def timeout_middleware(request: Request, call_next):
    try:
        start_time = time.time()
        return await asyncio.wait_for(call_next(request), timeout=REQUEST_TIMEOUT_ERROR)

    except asyncio.TimeoutError:
        process_time = time.time() - start_time
        return JSONResponse({'detail': 'Request processing time excedeed limit',
                             'processing_time': process_time},
                            status_code=HTTP_504_GATEWAY_TIMEOUT)

# Testing wether or not the middleware triggers
@pytest.mark.asyncio
async def test_504_error_triggers():
    # Creating an asynchronous client to test our asynchronous function
    async with AsyncClient(app=app, base_url="http://test") as ac:
        response = await ac.get("/test_path?sleep_time=3")
    content = eval(response.content.decode())
    assert response.status_code == HTTP_504_GATEWAY_TIMEOUT
    assert content['processing_time'] < 1.1

# Testing middleware's consistency for requests having a processing time close to the threshold 
@pytest.mark.asyncio
async def test_504_error_consistency():
    async with AsyncClient(app=app, base_url="http://test") as ac:
        errors = 0
        sleep_time = REQUEST_TIMEOUT_ERROR*0.9
        for i in range(100):
            response = await ac.get("/test_path?sleep_time={}".format(sleep_time))
            if response.status_code == HTTP_504_GATEWAY_TIMEOUT:
                errors += 1
        assert errors == 0

# Testing middleware's precision
# ie : Testing if it triggers when it should not and vice versa
@pytest.mark.asyncio
async def test_504_error_precision():
    async with AsyncClient(app=app, base_url="http://test") as ac:
        should_trigger = []
        should_pass = []
        have_triggered = []
        have_passed = []
        for i in range(200):
            sleep_time = 2 * REQUEST_TIMEOUT_ERROR * random.random()
            if sleep_time < 1.1:
                should_pass.append(i)
            else:
                should_trigger.append(i)
            response = await ac.get("/test_path?sleep_time={}".format(sleep_time))
            if response.status_code == HTTP_504_GATEWAY_TIMEOUT:
                have_triggered.append(i)
            else:
                have_passed.append(i)
        assert should_trigger == have_triggered

I created three tests, the first one is designed to see wether or not the middleware actually does its job.
The second one is just there to check if there is any consistency problem with a single request.
The third one is here to check if I ran into the same issue raised in the thread I mentioned.

As far as I am concerned the first two tests passed without a problem.
However the third one failed. There are requests that have triggered when they should not :

E           AssertionError: assert [3, 7, 10, 11, 12, 14, ...] == [3, 7, 8, 10, 11, 12, ...]
E             At index 2 diff: 10 != 8
E             Right contains 11 more items, first extra item: 165

This is the issue mentioned in the thread. I'll downgrade to starlette 0.13.2 and see if the test pass.

I might have made some mistakes or overlooked some things so I you ever have the chance to do some tests on your end let me know.

Cheers !

Note :
I wrote assert content['processing_time'] < 1.1 and not assert content['processing_time'] < 1 because the time I am monitoring isn't really the time it takes for python to execute the function (time to execute asyncio.wait_for and catching the exception I guess) . I do not know the convention in this case.

0 replies

thomas-maschler · 2020-09-15T02:49:49Z

thomas-maschler
Sep 15, 2020

@PLNech have you tried changing the timeout settings for gunicorn? By default it times out after 60 sec I believe but you can overwrite the settings.

https://docs.gunicorn.org/en/latest/settings.html#timeout
#551

0 replies

PLNech · 2020-09-17T09:45:11Z

PLNech
Sep 17, 2020
Author

@ZionStage: thanks for sharing your implementation, this looks promising! I'll make some room in our backlog to give it a try in our next sprint and will let you know how it goes :)

0 replies

PLNech · 2020-09-17T09:48:04Z

PLNech
Sep 17, 2020
Author

@thomas-maschler: thanks for the advice. Unfortunately I've tried using Gunicorn's timeout, but it triggers a full restart of the app, disrupting other users of the service (e.g. by unloading their models from memory). What I'm trying to achieve is rather to enforce a timeout on individual requests, without affecting any other work handled by this worker.

0 replies

tiangolo · 2021-01-17T16:55:09Z

tiangolo
Jan 17, 2021
Maintainer

Thanks for the discussion here everyone!

Yes, indeed I think the solution would be with a middleware.

About the failing tests from @ZionStage, I understand there are no guarantees about sub-second precisions in async/await (I think Python in general). Either way, it would probably be impossible to expect absolute sub-second precision from something on the network. I would test only with integers to be sure.

But anyway, I think that's pretty much the right approach. ✔️

0 replies

2021-05-16T18:29:53Z

github-actions[bot]
bot May 16, 2021

Assuming the original need was handled, this will be automatically closed now. But feel free to add more comments or create new issues or PRs.

0 replies

MasterScrat · 2021-11-08T08:26:34Z

MasterScrat
Nov 8, 2021

This is good to return an error message to the user in case of timeout, but is there a way to actually kill the request at the same time so it doesn't keep using resources?

0 replies

lamoni · 2022-05-16T07:10:58Z

lamoni
May 16, 2022

Bumping this for @MasterScrat's question. Wondering the same thing

0 replies

lionel-ovaert · 2022-10-04T08:30:43Z

lionel-ovaert
Oct 4, 2022

Another bump for @MasterScrat's question

0 replies

BarisicLuka · 2022-10-04T09:45:21Z

BarisicLuka
Oct 4, 2022

@lionel-ovaert When raising the error once the time limit has been reached should stop any undergoing processes linked to the request, doesn't it ?

0 replies

dmelo · 2022-11-21T23:17:35Z

dmelo
Nov 21, 2022

Expanding on the middleware from @ZionStage , if the router uses non-asyncio blocking functions, it might end up missing the asyncio.TimeoutError. In the example bellow, tweaked from @ZionStage's code:

import asyncio
import time


import pytest

from fastapi import FastAPI, Request, Response, HTTPException
from fastapi.responses import JSONResponse
from httpx import AsyncClient
from starlette.status import HTTP_504_GATEWAY_TIMEOUT
import requests

REQUEST_TIMEOUT_ERROR = 1  # Threshold

app = FastAPI() # Fake app

# Creating a test path
@app.get("/test_path")
async def route_for_test(sleep_time: float) -> None:
    requests.get('https://i575rbl2mc.execute-api.us-east-1.amazonaws.com/sleep?time=3')
    return JSONResponse({}, status_code=200)

# Adding a middleware returning a 504 error if the request processing time is above a certain threshold
@app.middleware("http")
async def timeout_middleware(request: Request, call_next):
    try:
        start_time = time.time()
        return await asyncio.wait_for(call_next(request), timeout=REQUEST_TIMEOUT_ERROR)

    except asyncio.TimeoutError:
        process_time = time.time() - start_time
        return JSONResponse({'detail': 'Request processing time excedeed limit',
                             'processing_time': process_time},
                            status_code=HTTP_504_GATEWAY_TIMEOUT)

# Testing wether or not the middleware triggers
@pytest.mark.asyncio
async def test_504_error_triggers():
    # Creating an asynchronous client to test our asynchronous function
    async with AsyncClient(app=app, base_url="http://test") as ac:
        response = await ac.get("/test_path?sleep_time=3")
    content = eval(response.content.decode())
    assert response.status_code == HTTP_504_GATEWAY_TIMEOUT
    assert content['processing_time'] < 1.1

When running, we have that it lasted the whole execution of the router, way more than the timeout set on the middleware, and it returned 200, it bypassed the middleware:

❯ time pipenv run pytest c.py
================================================ test session starts ================================================
platform linux -- Python 3.10.8, pytest-7.2.0, pluggy-1.0.0
rootdir: /home/dmelo/proj3/python/b
plugins: anyio-3.6.2, asyncio-0.20.2
asyncio: mode=strict
collected 1 item                                                                                                    

c.py F                                                                                                        [100%]

===================================================== FAILURES ======================================================
______________________________________________ test_504_error_triggers ______________________________________________

    @pytest.mark.asyncio
    async def test_504_error_triggers():
        # Creating an asynchronous client to test our asynchronous function
        async with AsyncClient(app=app, base_url="http://test") as ac:
            response = await ac.get("/test_path?sleep_time=3")
        content = eval(response.content.decode())
>       assert response.status_code == HTTP_504_GATEWAY_TIMEOUT
E       assert 200 == 504
E        +  where 200 = <Response [200 OK]>.status_code

c.py:43: AssertionError
============================================== short test summary info ==============================================
FAILED c.py::test_504_error_triggers - assert 200 == 504
================================================= 1 failed in 3.93s =================================================
pipenv run pytest c.py  0.80s user 0.10s system 19% cpu 4.603 total

I'm posting here in the hope that somebody either (a) managed to have a good implementation of requests timeout feature working or (b) knows how to make this middleware works even on those situations.

1 reply

slhck Jun 27, 2023

Did you figure this out eventually?

LMalikov · 2023-01-12T19:35:03Z

LMalikov
Jan 12, 2023

Even if the router function contains async code it doesn't get interrupted/cancelled with this middleware solution.
The following example keeps printing Running... endlessly even though TimeoutException is triggered and underlying Task created by asyncio.wait_for(...) gets cancelled.

app = FastAPI()

@app.get("/long_running")
async def long_running():
    try:
        while True:
            print("Running...")
            await asyncio.sleep(1)
    except asyncio.CancelledError:  # This never happens :(
        print("Cancelled.")

@app.middleware("http")
async def timeout_middleware(request: Request, call_next):
    try:
        return await asyncio.wait_for(call_next(request), timeout=3)
    except asyncio.TimeoutError:
        return JSONResponse({'detail': 'Request processing time exceeded limit'}, 504)

@tiangolo shouldn't we hit except asyncio.CancelledError in this case? 🙏

0 replies

liyunrui · 2023-01-13T08:43:54Z

liyunrui
Jan 13, 2023

@LMalikov I got the same error. It looks like you need at least two middleware dectorators in the main.py but it's super wierd. For example,

@app.middleware("http")
async def add_process_time_header(request: Request, call_next):
    start_time = time.time()
    response = await call_next(request)
    process_time = time.time() - start_time
    # response.headers["X-Process-Time"] = str(process_time)
    print("adfadsfasdf")
    # response.headers["Test middleware"] = str(random.randint(1, 1000))
    return response


REQUEST_TIMEOUT_ERROR = 1.0 # seconds to wait for
from fastapi.responses import JSONResponse
from starlette.status import HTTP_504_GATEWAY_TIMEOUT

#Adding a middleware returning a 504 error if the request processing time is above a certain threshold
@app.middleware("http")
async def timeout_middleware(request: Request, call_next):
    try:
        start_time = time.time()
        return await asyncio.wait_for(call_next(request), timeout=REQUEST_TIMEOUT_ERROR)

    except asyncio.TimeoutError:
        process_time = time.time() - start_time
        res = timeout_fallback(process_time)
        return res

def timeout_fallback(process_time):
    response = JSONResponse({'detail': 'Request processing time excedeed limit',
                             'processing_time': process_time},
                            status_code=HTTP_504_GATEWAY_TIMEOUT)
    return response

0 replies

liyunrui · 2023-01-13T09:00:31Z

liyunrui
Jan 13, 2023

Does anyone

@LMalikov I got the same error. It looks like you need at least two middleware dectorators in the main.py but it's super wierd. For example,

@app.middleware("http")
async def add_process_time_header(request: Request, call_next):
    start_time = time.time()
    response = await call_next(request)
    process_time = time.time() - start_time
    # response.headers["X-Process-Time"] = str(process_time)
    print("adfadsfasdf")
    # response.headers["Test middleware"] = str(random.randint(1, 1000))
    return response


REQUEST_TIMEOUT_ERROR = 1.0 # seconds to wait for
from fastapi.responses import JSONResponse
from starlette.status import HTTP_504_GATEWAY_TIMEOUT

#Adding a middleware returning a 504 error if the request processing time is above a certain threshold
@app.middleware("http")
async def timeout_middleware(request: Request, call_next):
    try:
        start_time = time.time()
        return await asyncio.wait_for(call_next(request), timeout=REQUEST_TIMEOUT_ERROR)

    except asyncio.TimeoutError:
        process_time = time.time() - start_time
        res = timeout_fallback(process_time)
        return res

def timeout_fallback(process_time):
    response = JSONResponse({'detail': 'Request processing time excedeed limit',
                             'processing_time': process_time},
                            status_code=HTTP_504_GATEWAY_TIMEOUT)
    return response

Does anyone know why? It's super weird. Basically, you need to have two @app.middleware("http"). Otherwise, the timeout exception won't work.

0 replies

liyunrui · 2023-01-13T10:17:45Z

liyunrui
Jan 13, 2023

Basically, my problem is like this https://stackoverflow.com/questions/74132015/asyncio-wait-for-doesnt-time-out-as-expected

0 replies

galigutta · 2023-01-23T15:56:24Z

galigutta
Jan 23, 2023

Same problem as whats noted in the stackoverflow link above. The aysncio timeout is not respected.

0 replies

Naish21 · 2023-02-07T13:14:16Z

Naish21
Feb 7, 2023

I think this can be fixed using python 3.11 and asyncio.timeout_at instead asyncio.wait_for.
In the meantime (I'm using python 3.9) I'll try something else. I'll tell you if it works.

0 replies

Naish21 · 2023-02-07T15:08:48Z

Naish21
Feb 7, 2023

Workaround: I've created a decorator to use in the endpoints you want to raise a response 504:
(place it in a file named abort_after.py)

import functools
import signal
import sys

from fastapi.responses import JSONResponse
from starlette import status


class TimeOutException(Exception):
    """It took longer than expected"""


def abort_after(max_execution_time):
    def decorator(func):
        @functools.wraps(func)
        def wrapper(*args, **kwargs):
            def handle_timeout(signum, frame):
                raise TimeOutException(f"Function execution took longer than {max_execution_time}s and was terminated")
            if sys.platform == 'win32':
                print("Won't be stopped in windows!")
            else:
                signal.signal(signal.SIGALRM, handle_timeout)
                signal.alarm(max_execution_time)
            result = func(*args, **kwargs)
            if sys.platform != 'win32':
                signal.alarm(0)
            return result
        return wrapper
    return decorator


def timeout_response() -> JSONResponse:
    return JSONResponse(
        {
            'detail': 'Request processing time excedeed limit',
        },
        status_code=status.HTTP_504_GATEWAY_TIMEOUT,
    )

Then you can use it in your endpoint:

import time
from fastapi import APIRouter
from abort_after import abort_after, TimeOutException, timeout_response

router = APIRouter()


@router.post(f"{URL_prefix}/test",
             tags=['Test'],
             )
async def test():
    try:
        long_func(60)
    except TimeOutException:
        return timeout_response()
    return {'Test': 'ok'}


@abort_after(5)
def long_func(seconds: int) -> None:
    time.sleep(seconds)

1 reply

kgdrathan Sep 25, 2023

This is not aborting long_func if it is awaiting
Still, thank you very much, this is is useful for me.

rinzool · 2023-02-17T13:58:21Z

rinzool
Feb 17, 2023

Thanks @Naish21 I really like your solution!
Note that it only works with seconds as integer (so no timeout below 1s).
To use this solution with a floating number of second, one can replace

signal.alarm(max_execution_time)

with

signal.setitimer(signal.ITIMER_REAL, max_execution_time)

setitimer can work with floating number, so it is possible to define a timeout of 300ms for example (@abort_after(0.3))

0 replies

nicolasdespres · 2023-02-17T17:06:48Z

nicolasdespres
Feb 17, 2023

I am afraid this solution will not play well in a concurrent environment since there is only one timer per process, whereas there will be many co-routines running concurrently within the same process.

0 replies

dmelo · 2023-02-28T12:57:07Z

dmelo
Feb 28, 2023

So far, I have not seen any satisfactory solution for this problem. And the underlying problem seems to be that we might use functions on the router that are not friendly with asyncio.

0 replies

jalvespinto · 2024-05-17T14:07:02Z

jalvespinto
May 17, 2024

I am currently using something like this and it seems to work, but I not sure where it could go wrong...

import functools

from anyio import fail_after, sleep
from fastapi import FastAPI, Request, status
from fastapi.responses import JSONResponse


def timeout_after(timeout: int = 10):
    def decorator(func):
        @functools.wraps(func)
        async def wrapper(*args, **kwargs):
            with fail_after(timeout):
                return await func(*args, **kwargs)

        return wrapper

    return decorator


app = FastAPI()


@app.exception_handler(TimeoutError)
async def timeout_exception_handler(request: Request, exc: TimeoutError):
    return JSONResponse(
        status_code=status.HTTP_408_REQUEST_TIMEOUT,
        content={"detail": "Request processing time excedeed limit"},
    )


@app.get("/")
@timeout_after(1)
async def root():
    await sleep(2)
    return {"message": "Hello World"}

1 reply

Garrett-R Jun 1, 2024

Unfortunately, I think this suffers from the same problem as the above attempts: it relies on hitting an await to stop.

For example, try modifying your example endpoint to this:

@app.get("/")
@timeout_after(1)
async def root():
    expensive_function()
    return {"message": "Hello World"}


def expensive_function():
    count = 0
    while True:
        count += 1
        if count % 10000000 == 0:
            print('still working...')

Hitting this endpoint, it'll work on it forever, not ever releasing the resources.

Would be nice to have an equivalent to Gunicorn+Flask's 30s timeout that actually does release the resources (by killing the errant worker - docs).

fikri-bachtiar · 2024-08-12T03:43:57Z

fikri-bachtiar
Aug 12, 2024

Hello everyone, I created this solution to stop running process if timeout expires, can you guys please provide suggestions if this is the right solution or not

import uvicorn
import asyncio
from fastapi import FastAPI, Request, HTTPException

from fastapi import FastAPI, APIRouter, Response, Request, HTTPException
from fastapi.routing import APIRoute
from typing import Callable

REQUEST_TIMEOUT = 5
app = FastAPI()

class CustomAPIRoute(APIRoute):
    def get_route_handler(self) -> Callable:
        original_route_handler = super().get_route_handler()

        async def custom_route_handler(request: Request) -> Response:
            try:
                return await asyncio.wait_for(original_route_handler(request), timeout=REQUEST_TIMEOUT)
            except asyncio.TimeoutError:
                raise HTTPException(status_code=504, detail='timeout error !!!')
       
        return custom_route_handler

async def cpu_bound_task(text):
    while 1:
        print("hello")
        await asyncio.sleep(1)

    return text


router = APIRouter(route_class=CustomAPIRoute)
@router.get('/')
async def main():
    await cpu_bound_task(text='Hello world')
    print("this should not executed if timeout")

    return {'response': 111}

app.include_router(router)

if __name__ == "__main__":
    import uvicorn
    uvicorn.run(app, host="0.0.0.0", port=9999)

i get the reference from this

0 replies

TaigoFr · 2025-06-18T17:53:43Z

TaigoFr
Jun 18, 2025

We still have no answer on how to release the process after the timeout is sent, right? This is a MAJOR resource leak.

Even in pure async cases, no sync blocking calls, the request seems to keep running in the background forever even after the timeout response is returned to the user

@LMalikov wrote a great example:

Even if the router function contains async code it doesn't get interrupted/cancelled with this middleware solution. The following example keeps printing Running... endlessly even though TimeoutException is triggered and underlying Task created by asyncio.wait_for(...) gets cancelled.
app = FastAPI()

@app.get("/long_running")
async def long_running():
    try:
        while True:
            print("Running...")
            await asyncio.sleep(1)
    except asyncio.CancelledError:  # This never happens :(
        print("Cancelled.")

@app.middleware("http")
async def timeout_middleware(request: Request, call_next):
    try:
        return await asyncio.wait_for(call_next(request), timeout=3)
    except asyncio.TimeoutError:
        return JSONResponse({'detail': 'Request processing time exceeded limit'}, 504)
@tiangolo shouldn't we hit except asyncio.CancelledError in this case? 🙏

If you call /long_running multiple times, you end up with many infinite loops running in the background without stopping.

Is there a solution to this?

I understand that signal.alarm doesn't solve the problem if we expect to have concurrency in the process, as it kills requests of other users.

0 replies

TaigoFr · 2025-06-18T18:20:45Z

TaigoFr
Jun 18, 2025

Update: I believe this solves it - #13236 (comment)

i.e. the timeout is returned and the background process actually stops

Easy migration:

import asyncio
from typing import Callable

from fastapi import FastAPI, Request, Response
from fastapi.routing import APIRoute, APIRouter
from fastapi.responses import JSONResponse

class RouteWithTimeout(APIRoute):
    def get_route_handler(self) -> Callable:
        original_route_handler = super().get_route_handler()

        async def custom_route_handler(request: Request) -> Response:
            try:
                # for python >=3.11:
                # async with asyncio.timeout(1):
                #     return await original_route_handler(request)
                # for python <=3.10
                return await asyncio.wait_for(original_route_handler(request), timeout=1)
            except asyncio.TimeoutError:
                return JSONResponse(
                    status_code=408,
                    content={"detail": "Request timed out"},
                )
        return custom_route_handler

app = FastAPI()
app.router.route_class = RouteWithTimeout

@app.get("/test")
async def test() -> None:
    await asyncio.sleep(3)
    print("should not reach this")

Works for me

1 reply

Garrett-R Jul 13, 2025

This indeed seems to work for asynchronous endpoints.

But note it does not work for synchronous endpoints:

"""Run by doing: fastapi dev app.py"""
import asyncio
import time
from typing import Callable

from fastapi import FastAPI, Request, Response
from fastapi.routing import APIRoute, APIRouter
from fastapi.responses import JSONResponse

class RouteWithTimeout(APIRoute):
    def get_route_handler(self) -> Callable:
        original_route_handler = super().get_route_handler()

        async def custom_route_handler(request: Request) -> Response:
            try:
                # (for python >=3.11)
                async with asyncio.timeout(1):
                    return await original_route_handler(request)
            except asyncio.TimeoutError:
                return JSONResponse(
                    status_code=408,
                    content={"detail": "Request timed out"},
                )
        return custom_route_handler

app = FastAPI()
app.router.route_class = RouteWithTimeout


# Works!
@app.get("/test")
async def test() -> None:
    await asyncio.sleep(3)
    print("should not reach this")

# Doesn't work
@app.get("/test-sync")
def test_sync() -> None:
    time.sleep(3)
    print("should not reach this")  # <--- does reach this 😢

raceychan · 2025-06-19T08:17:38Z

raceychan
Jun 19, 2025

You might want to check my project premier that adds timeout, retry, rate limit and other functionalities in a single line of code to your FastAPI(actually works for any ASGI framework)

3 replies

TaigoFr Jun 19, 2025

Just checked it, and it's pretty cool @raceychan ! For now I'm good, but migration seems very easy and functionality good too.

I have a question I couldn't easily understand: can I use your rate limiter (I noticed it has a key_maker) to rate limit PER USER with some custom value for each user (but globally for all endpoints)

Also to confirm: the rate limiter works per process? So if I have multiple instances running, it can't keep track of the global rate limit, correct?

Very cool, great work

raceychan Jun 19, 2025

Just checked it, and it's pretty cool @raceychan ! For now I'm good, but migration seems very easy and functionality good too.

I have a question I couldn't easily understand: can I use your rate limiter (I noticed it has a key_maker) to rate limit PER USER with some custom value for each user (but globally for all endpoints)

Also to confirm: the rate limiter works per process? So if I have multiple instances running, it can't keep track of the global rate limit, correct?

Very cool, great work

It depends on how you track your USER, but most likely yes, you just need to do some extract logic on extracting the user id(the data you use to identify your user)
It can if you setup redis as cache backend(has built in support for redis, support others through interface), that case it works across processes & machines, but if you use memory cache then no.

TaigoFr Jun 19, 2025

incredible, thanks for the reply :)

thibautd · 2026-02-25T23:50:14Z

thibautd
Feb 25, 2026

For people looking for a proper solution using an ASGI middleware (don't use BaseHTTPMiddleware or @app.middleware("http"), they have side effects), here it is:

class TimeoutMiddleware:
    def __init__(self, app: ASGIApp, timeout: float) -> None:
        self.app = app
        self.timeout = timeout

    async def __call__(self, scope: Scope, receive: Receive, send: Send) -> None:
        if scope["type"] != "http":  # pragma: no cover
            await self.app(scope, receive, send)
            return

        response_started = False

        async def send_wrapper(message: Message) -> None:
            nonlocal response_started
            if message["type"] == "http.response.start":
                response_started = True
            await send(message)

        try:
            with anyio.fail_after(self.timeout):
                await self.app(scope, receive, send_wrapper)
        except TimeoutError as err:
            if not response_started:
                response = Response(status_code=504)
                await response(scope, receive, send)

This will work for async routes, and will correctly cancel any awaiting route.

0 replies

Uh oh!

How does one setup a global timeout to all requests? #7364

Uh oh!

First check

Description

Example

Environment

Additional context

Replies: 29 comments · 7 replies

Uh oh!

Uh oh!

PLNech Aug 27, 2020 Author

Uh oh!

Uh oh!

Uh oh!

PLNech Sep 17, 2020 Author

Uh oh!

PLNech Sep 17, 2020 Author

Uh oh!

tiangolo Jan 17, 2021 Maintainer

Uh oh!

github-actions[bot] bot May 16, 2021

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Replies: 29 comments 7 replies

PLNech
Aug 27, 2020
Author

PLNech
Sep 17, 2020
Author

PLNech
Sep 17, 2020
Author

tiangolo
Jan 17, 2021
Maintainer

github-actions[bot]
bot May 16, 2021