
O1 model doesn't support streaming completions #430

Open
sheldonhull opened this issue Jan 30, 2025 · 0 comments

sheldonhull commented Jan 30, 2025

Background

With the introduction of newer reasoning models in v1.7.0, some tweaks have been made, but streaming support for models like o1 was initially unavailable. Support now varies by provider:

  • OpenAI: Recently added streaming support.
  • Azure OpenAI: Still lacks streaming support for o1.

Problem

The current implementation assumes that all results are streamed, which is incompatible with some o1 models: they require the chat completion to finish before results can be read. This leaves a gap in both configuration and logic, especially for Azure OpenAI users.
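To make the gap concrete, here is a minimal sketch (all types and function names are hypothetical, not the project's actual code): the existing read path only knows how to accumulate streamed chunks, while a non-streaming o1 completion arrives as one complete message and needs a separate path.

```go
package main

import "fmt"

// chunkStream stands in for a streamed chat completion: content arrives
// in pieces via recv() until the stream is exhausted.
type chunkStream struct{ chunks []string }

func (s *chunkStream) recv() (string, bool) {
	if len(s.chunks) == 0 {
		return "", false
	}
	c := s.chunks[0]
	s.chunks = s.chunks[1:]
	return c, true
}

// readStreamed is the only path the current implementation has:
// loop over chunks and accumulate them.
func readStreamed(s *chunkStream) string {
	var out string
	for {
		c, ok := s.recv()
		if !ok {
			return out
		}
		out += c
	}
}

// readComplete is the missing path: the full completion is already
// available, so there is nothing to accumulate.
func readComplete(full string) string { return full }

func main() {
	fmt.Println(readStreamed(&chunkStream{chunks: []string{"Hel", "lo"}}))
	fmt.Println(readComplete("Hello"))
}
```

Both paths yield the same final content; the difference is purely in how it is consumed, which is why a configuration switch between them is enough.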

Proposal

  1. Add a new configuration value:
    • stream (boolean, defaults to true; set to false for models without streaming support).
  2. Introduce a CLI flag for manual override:
    • --stream=false or --no-stream.
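The override precedence could be sketched like this (a minimal sketch only; the flag name, config field, and `resolveStream` helper are assumptions, not the project's actual schema): the config value supplies the default, and the CLI flag wins when set.

```go
package main

import (
	"flag"
	"fmt"
)

// resolveStream applies the proposed precedence (hypothetical names):
// an explicit --no-stream flag overrides the configuration value,
// which itself defaults to true.
func resolveStream(configStream bool, noStreamFlagSet bool) bool {
	if noStreamFlagSet {
		return false
	}
	return configStream
}

func main() {
	// --no-stream disables streaming regardless of configuration,
	// e.g. for o1 on Azure OpenAI.
	noStream := flag.Bool("no-stream", false, "disable streaming completions")
	flag.Parse()

	configStream := true // proposed default
	fmt.Println("stream:", resolveStream(configStream, *noStream))
}
```

Keeping the default at `true` preserves current behavior for every existing user, so only o1/Azure users would need to opt out.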

Seem reasonable? If so, I'll see if I can knock that out.

ref: current limitations of o1 in azure openai
