Voice: hybrid streaming TTS with trackbar player #89
Loading…
Add table
Add a link
Reference in a new issue
No description provided.
Delete branch "%!s()"
Deleting a branch is permanent. Although the deleted branch may continue to exist for a short time before it actually gets removed, it CANNOT be undone in most cases. Continue?
Context
v0.7.1-dev has TTS working (Kokoro local + Groq fallback), pause/play/stop controls, and progress tracking infrastructure in voice.rs. Currently TTS generates audio AFTER the full AI response completes — user waits for entire response before hearing anything.
Goal
Hybrid streaming TTS: hear audio sentence-by-sentence while AI is still responding, then full trackbar with seek/replay after response completes.
Implementation
Phase 1: Sentence-level SSE audio streaming
event: audio(multiple events per response)Phase 2: Full trackbar after response
[⏸] [━━━━●━━━━] 0:12/0:35Phase 3: Polish
Technical notes
.!?\nwith min 20 charsRelated
Status Assessment
Phase 1: Sentence-level SSE audio streaming — DONE
event: audiowith chunk/total metadataPhase 2: Full trackbar after response — PARTIAL
Phase 3: Polish — TODO
Implementing remaining features now.
Signed-off-by: mik-tf
Complete
All 3 phases implemented:
Phase 1 (pre-existing)
Phase 2
Phase 3
Deployed as v0.7.6-dev.
Signed-off-by: mik-tf