changelog / post

WebSocket support for real-time model streaming

Go back

We’ve revamped the experience of using WebSockets to invoke your deployments. Here’s what’s new:

  • WebSocket-specific metrics in the UI, like average connection duration

  • Improved UX for managing and invoking WebSocket deployments

WebSockets allow your applications to have a persistent, bidirectional communication channel with your models. This is useful for:

  • Real-time applications (e.g., audio transcription)

  • Streaming responses (e.g., in chat applications)

We also support using WebSockets to invoke Chains for agents or compound AI workflows.

Read the docs to get started!