Several fixes to make Completion.acreate(stream=True) work #172
Conversation
The API will send chunks like

```
b'data: {"id": "cmpl-6W18L0k1kFoHUoSsJOwcPq7DKBaGX", "object": "text_completion", "created": 1673088873, "choices": [{"text": "_", "index": 0, "logprobs": null, "finish_reason": null}], "model": "ada"}\n\n'
```

The default iterator will break on each `\n` character, whereas `iter_chunks` will just output parts as they arrive.
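For illustration, a minimal sketch (not the library's actual parsing code) of the two iteration styles over an aiohttp response body; `resp` stands in for the `ClientResponse` of a streaming request:

```
import aiohttp

async def read_by_line(resp: aiohttp.ClientResponse) -> None:
    # Default iteration: the StreamReader yields *lines*, splitting on b"\n",
    # so the trailing b"\n\n" of each payload produces "empty" parts.
    async for line in resp.content:
        print(line)  # every other item is just b"\n"

async def read_by_chunk(resp: aiohttp.ClientResponse) -> None:
    # iter_chunks() yields (bytes, end_of_http_chunk) pairs as data arrives,
    # so each b'data: {...}\n\n' payload comes through in one piece.
    async for data, _end_of_http_chunk in resp.content.iter_chunks():
        print(data)
```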
Force-pushed from 5cc1809 to 2e2e20e (compare)
Manually consume the `aiohttp_session` async context manager to ensure that the session is only closed once the response stream is finished. Previously we'd exit the `with` statement before the response stream is consumed by the caller; therefore, unless we're using a global ClientSession, the session (and thus the request) is closed before it should be.
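For context, a simplified sketch of the pre-fix shape (names mirror the diff below; this is not verbatim source):

```
# BUG (pre-fix): returning from inside `async with` exits the context
# manager, closing the session -- and the in-flight request -- before the
# caller has consumed the response stream.
async def arequest(self, method, url, params, stream):
    async with aiohttp_session() as session:
        result = await self.arequest_raw(method.lower(), url, session, params=params)
        resp, got_stream = await self._interpret_async_response(result, stream)
        return resp, got_stream, self.api_key  # session closes on return
```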
Changed the title from "Completion.acreate(stream=True) work with both local and global aiohttp session" to "Completion.acreate(stream=True) work".
Thank you so much for fixing this! I left a couple of comments but otherwise this is looking great.
openai/api_requestor.py (outdated)
```
ctx = aiohttp_session()
session = await ctx.__aenter__()
result = await self.arequest_raw(
    method.lower(),
    url,
    session,
    params=params,
    supplied_headers=headers,
    files=files,
    request_id=request_id,
    request_timeout=request_timeout,
)
resp, got_stream = await self._interpret_async_response(result, stream)
if got_stream:

    async def wrap_resp():
        async for r in resp:
            yield r
        await ctx.__aexit__(None, None, None)

    return wrap_resp(), got_stream, self.api_key
else:
    await ctx.__aexit__(None, None, None)
```
At first I thought it'd be easier to just fetch/create a ClientSession here rather than getting the async generator and calling `__aenter__` and `__aexit__` manually, but since we have to deal with manually closing one while being careful not to close the other, I think it's probably fine.
Yeah, I think the context manager is still worth it for that encapsulation.
openai/api_requestor.py (outdated)
```
async def wrap_resp():
    async for r in resp:
        yield r
    await ctx.__aexit__(None, None, None)
```
I guess it's possible for this async generator to never complete (for example if the caller raises an exception before completing the iteration), in which case we'll never close this session, which I think raises an exception on the event loop? Maybe we should create a session on the `APIRequestor` instance instead 🤔
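To make the concern concrete, an illustration (assuming the `wrap_resp` from the diff above): cleanup placed after the loop only runs when iteration finishes normally.

```
async def wrap_resp():
    async for r in resp:
        yield r
    # Never reached if the consumer stops early -- the session stays open
    # until the abandoned generator is garbage-collected.
    await ctx.__aexit__(None, None, None)

async def consumer():
    async for r in wrap_resp():
        raise RuntimeError("caller bails out mid-iteration")
```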
Good catch! I think it should be good now in the latest commit.
```
@@ -63,3 +64,26 @@ async def test_timeout_does_not_error():
        model="ada",
        request_timeout=10,
    )


async def test_completions_stream_finishes_global_session():
```
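A plausible shape for the test body, which is elided above (`openai.aiosession` is the library's global-session ContextVar; the prompt, model, and assertion here are guesses, not the PR's actual code):

```
import aiohttp
import openai

async def test_completions_stream_finishes_global_session():
    async with aiohttp.ClientSession() as session:
        openai.aiosession.set(session)  # caller-owned global session
        parts = []
        async for part in await openai.Completion.acreate(
            prompt="hello", model="ada", request_timeout=10, stream=True
        ):
            parts.append(part)
        assert len(parts) > 1  # the stream is consumed past the first chunk
```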
Thanks for adding this test.
Ensure we close the session even if the caller raises an exception while consuming the stream
```
        async for r in resp:
            yield r
    finally:
        await ctx.__aexit__(None, None, None)
```
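Reconstructed in full around the fragment above (the `try:` line is implied by the diff), the fixed wrapper reads:

```
async def wrap_resp():
    try:
        async for r in resp:
            yield r
    finally:
        # Runs whether iteration finishes, the consumer raises, or the
        # generator is closed early, so the session is always released.
        await ctx.__aexit__(None, None, None)
```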
This might still not be called if the caller never actually iterates through the response and just drops it, right?
It's probably fine for now to fix this bug, but I imagine we'll want to scope the session to the requestor itself in the future so that we can always ensure that it's closed.
I think that's an issue inherent to exposing an async iterator as an API here? If the caller doesn't consume it, all sorts of bad things may happen... The only solution I can think of is to add some cleanup with a timeout, but that sounds a bit invasive. Another option would be to make this a bit fake and fully consume the iterator ourselves, buffering it all in memory, but that partly defeats the purpose of asking for a stream.
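One caller-side mitigation, sketched here as an assumption rather than anything this PR adds: explicitly `aclose()` the stream when abandoning it, which throws `GeneratorExit` into the generator and runs its `finally:` block deterministically.

```
import openai

async def consume_first_chunk() -> None:
    stream = await openai.Completion.acreate(model="ada", prompt="hi", stream=True)
    try:
        async for part in stream:
            break  # abandon the stream early
    finally:
        # Assumes the returned stream is a plain async generator.
        await stream.aclose()
```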
Yep that's fair.
Alright let's merge this, thank you so much for fixing this bug!
* Added a failing test case for async completion stream
* Consume async generator with `async for`
* Consume the stream in chunks as sent by the API, to avoid "empty" parts. The API will send chunks like `b'data: {"id": "cmpl-6W18L0k1kFoHUoSsJOwcPq7DKBaGX", "object": "text_completion", "created": 1673088873, "choices": [{"text": "_", "index": 0, "logprobs": null, "finish_reason": null}], "model": "ada"}\n\n'`; the default iterator will break on each `\n` character, whereas `iter_chunks` will just output parts as they arrive
* Add another test using global aiosession
* Manually consume the `aiohttp_session` async context manager to ensure that the session is only closed once the response stream is finished. Previously we'd exit the `with` statement before the response stream is consumed by the caller; therefore, unless we're using a global ClientSession, the session (and thus the request) is closed before it should be.
* Ensure we close the session even if the caller raises an exception while consuming the stream
#171
Note that, as per the issue above, even after the two fixes below, this test still fails: the 2nd chunk of the stream never arrives.