Add support to LLM Streaming #17

mi12-root · 2025-09-28T15:52:51Z

Includes structured output streaming via `Partial` and a tool loop.

Also add `Partial` associatedtype to `Generable` protocol with default Self. All implementations currently throw "not implemented" errors.

`@Generable` now emits a nested subtype for streaming use cases:

```
@Generable
struct T {
  // Fields here.
}
```

now expands to:

```
struct T {
  // Fields here.
}

extension T: Generable {
  Partial {
    // partial fields
  }
  // Other generated fields.
}
```

The rules for generating the partial subtypes are as follows:

- `T?` => `T.Partial`,
- otherwise: `T` => `T.Partial`
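To make the expansion concrete, here is a hand-written sketch of what a type and its nested `Partial` could look like. The `Generable` protocol below is a stand-in that models only the associated type, `Recipe` and its fields are hypothetical, and wrapping each partial field in an optional (so that not-yet-received fields are representable) is an assumption rather than something the description states.

```swift
// Stand-in for the library's protocol; only the associated type is modeled.
protocol Generable {
    associatedtype Partial
}

// Assumption: leaf types use themselves as their partial form.
extension String: Generable { typealias Partial = String }
extension Int: Generable { typealias Partial = Int }

// Hypothetical user type that would carry the @Generable attribute.
struct Recipe {
    var title: String
    var servings: Int
    var notes: String?
}

// Hand-written approximation of what the macro might emit.
extension Recipe: Generable {
    struct Partial {
        // Both `T` and `T?` fields map to `T.Partial`; the extra optional
        // marks fields that have not been streamed yet (an assumption).
        var title: String.Partial?
        var servings: Int.Partial?
        var notes: String.Partial?
    }
}
```

A consumer of a structured stream would then receive a sequence of `Recipe.Partial` values in which more fields are populated over time.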

This should help avoid unwanted automatic defaults

Currently none of the backends support streaming, so the tests are marked as disabled.

Tool use is still not supported

When a message is truncated, OpenAI returns an `incomplete` event rather than a `completed` event.
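A minimal sketch of why both terminal events need handling. The `StreamEvent` enum and `collect` function are hypothetical simplifications, not the backend's actual types; real OpenAI events carry more data than shown here.

```swift
// Hypothetical, simplified event model for a streamed response.
enum StreamEvent {
    case textDelta(String)
    case completed
    case incomplete
}

/// Accumulates text deltas until a terminal event arrives.
/// Returns nil if the events end without `completed` or `incomplete`.
func collect(_ events: [StreamEvent]) -> (text: String, truncated: Bool)? {
    var text = ""
    for event in events {
        switch event {
        case .textDelta(let chunk):
            text += chunk
        case .completed:
            return (text, false)
        case .incomplete:
            // A truncated message still ends the stream, with partial text.
            return (text, true)
        }
    }
    return nil
}
```

If only `completed` were treated as terminal, a truncated response would never be reported as finished.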

To avoid code duplication

- Unify generateResponse and generateResponseStream implementations
- Both methods now use the same underlying streaming mechanism
- Reduce code duplication and improve consistency
- Maintain same API contract and error handling
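The unification described above can be sketched as a non-streaming call that simply drains the stream. The free functions below reuse the names `generateResponse` and `generateResponseStream` for illustration only; their signatures and the word-splitting "model" are assumptions, not the backends' real implementations.

```swift
// Hypothetical streaming entry point; a real backend would yield tokens
// as they arrive from the model instead of splitting the prompt.
func generateResponseStream(_ prompt: String) -> AsyncThrowingStream<String, Error> {
    AsyncThrowingStream { continuation in
        for word in prompt.split(separator: " ") {
            continuation.yield(String(word))
        }
        continuation.finish()
    }
}

// The non-streaming call is built on the streaming one, so both share
// a single code path and the same error propagation.
func generateResponse(_ prompt: String) async throws -> String {
    var chunks: [String] = []
    for try await chunk in generateResponseStream(prompt) {
        chunks.append(chunk)
    }
    return chunks.joined(separator: " ")
}
```

Errors thrown into the stream surface from `generateResponse` through the `for try await` loop, which is one way the existing error-handling contract can be kept.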

Removed the `sending` keyword from all AsyncThrowingStream return types across the streaming API. This affects:

- LLM protocol methods (replyStream functions)
- All backend implementations (OpenAI, Apple, MLX)
- Test implementations (FakeLLM, CrashingLLM)
- Documentation examples

The streaming functionality remains unchanged; only the function signatures have been updated to remove the `sending` keyword.
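For illustration, the signature change might look like the following. The protocol body, the `to:` parameter label, and `EchoLLM` are assumptions made for this sketch; only the `replyStream` name and the `AsyncThrowingStream` return type come from the description.

```swift
// Simplified stand-in for the LLM protocol.
protocol LLM {
    // Before: func replyStream(to prompt: String) -> sending AsyncThrowingStream<String, Error>
    // After: the same signature without the `sending` annotation.
    func replyStream(to prompt: String) -> AsyncThrowingStream<String, Error>
}

// Hypothetical conformer that echoes the prompt as a single chunk.
struct EchoLLM: LLM {
    func replyStream(to prompt: String) -> AsyncThrowingStream<String, Error> {
        AsyncThrowingStream { continuation in
            continuation.yield(prompt)
            continuation.finish()
        }
    }
}
```

Call sites are unaffected, since callers iterate the returned stream the same way in both versions.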

mi12-root added 22 commits September 19, 2025 19:27

Add a proposal outlining the streaming API

a1668df

Add streaming API foundation to LLM protocol

79eea15

Also add `Partial` associatedtype to `Generable` protocol with default Self. All implementations currently throw "not implemented" errors.

Remove Self as default to Generable.Partial

647af5d

This should help avoid unwanted automatic defaults

Add basic text streaming test

99593cb

Currently none of the backends support streaming, so the tests are marked as disabled.

Implement String streaming for SystemLLM

56247fb

Implement String streaming for OpenAI LLM

c9ba2cb

Merge branch 'main' into streaming

34374b2

Add basic String streaming in MLX LLM

a412c3e

Tool use is still not supported

Consider incomplete events when streaming

fcf61ba

When a message is truncated, OpenAI returns an `incomplete` event rather than a `completed` event.

Add streaming tests cases with tool use

d7198ba

Implement tool loop when streaming in the OpenAI backend

852f0f3

Add a repair util for fixing partial JSONs during LLM streaming

da350ca
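The commit itself is not shown here, so the following is only a rough sketch of one common approach to repairing a truncated JSON prefix: track open strings, arrays, and objects, then append the missing closers. The function name `repairPartialJSON` is hypothetical, and the sketch deliberately ignores dangling keys, trailing commas, and partial literals such as `tru`.

```swift
/// Closes any string, array, or object left open in a truncated JSON prefix.
/// Assumption: the input is a prefix of otherwise valid JSON.
func repairPartialJSON(_ prefix: String) -> String {
    var closers: [Character] = []  // stack of pending "}" and "]"
    var inString = false
    var escaped = false

    for ch in prefix {
        if inString {
            if escaped {
                escaped = false
            } else if ch == "\\" {
                escaped = true
            } else if ch == "\"" {
                inString = false
            }
            continue
        }
        switch ch {
        case "\"": inString = true
        case "{": closers.append("}")
        case "[": closers.append("]")
        case "}", "]": _ = closers.popLast()
        default: break
        }
    }

    var repaired = prefix
    if escaped { repaired.removeLast() }   // drop a dangling backslash
    if inString { repaired.append("\"") }  // terminate the open string
    repaired.append(contentsOf: closers.reversed())
    return repaired
}
```

The repaired text can then be handed to an ordinary JSON decoder after each chunk to produce an intermediate partial value.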

mv Utils/JSONRepair.swift to JsonUtils/JsonRepair.swift

9f89621

Support streaming structured outputs in OpenAI integration

b54f4de

Rewrite OpenAI's response generation to reuse the streaming API

b1d0822

To avoid code duplication

Implement streaming structured output for SystemLLM

3fca711

Refactor SystemLLMSession to use streaming API internally

5989006

- Unify generateResponse and generateResponseStream implementations
- Both methods now use the same underlying streaming mechanism
- Reduce code duplication and improve consistency
- Maintain same API contract and error handling

Implement tool loop in streaming mode for MLX models

f5d756b

Re-rewrite MLX generateResponse to use the streaming API

eaa6994

Update streaming proposal to reflect current codebase

ba983c2

mi12-root merged commit df0b659 into main Sep 28, 2025
2 checks passed

mi12-root deleted the streaming branch September 28, 2025 16:00

mi12-root mentioned this pull request Sep 28, 2025

Add Support for Streaming #2

Closed
