Open Vision Agents by Stream. Build Vision Agents with any model/ video provider.
github.com·1d·
Discuss: r/programming

Open Vision Agents by Stream

Build Vision Agents quickly with any model or video provider.

  • Video AI: Built for real-time video AI. Combine Yolo, Roboflow and others with gemini/openai realtime
  • Low Latency: Join quickly (500ms) and low audio/video latency (30ms)
  • Open: Built by Stream, but use any video edge network that you like
  • Native APIs: Native SDK methods from OpenAI (create response), Gemini (generate) and Claude (create message). So you can always use the latest LLM capabilities.
  • SDKs: SDKs for React, Android, iOS, Flutter, React, React Native and Unity.

Created by Stream, uses Stream’s edge network for ultra-low latency.

Examples

Sports Coaching

This example shows you how to build golf coach…

Similar Posts

Loading similar posts...