The combination stream=True, tool_choice="auto" raises an exception right now, which means that developers are stuck with one of two unfortunate choices:
- Developing an application that streams the response but cannot use tools
- Developing an LLM application that can use tools but cannot stream the response
Relevant discussion: #1615