Now
In active development. Targeting the next release.
-
True LLM-token streaming for read-page
The
/v1/read-page/streamSSE wire format already exists; we ship the chunked-but-not-truly-streamed response today. Next pass: hook into the providers’ streaming APIs so the first sentence reaches the user’s ear in <500ms even on cold cache. -
Real audio capture for live captions
Today we mirror existing
<track>cues onto the overlay. Coming: Web Audio API tap → backend STT → cue stream, so any unaltered video gets captions even when the source has none.