switch transcription model from Whisper to parakeet (faster + better) #28

Closed
opened 2026-05-06 17:07:20 +00:00 by mik-tf · 1 comment
Owner

Overview

hero_voice currently uses Whisper for speech-to-text (STT). Per Scott's component work, parakeet models are faster and more accurate. Migrate the default STT path to parakeet and keep Whisper as a fallback option.

Why

Meeting 2026-05-06 (Scott): "hero_voice component / can use parakeet models / much better than whisper, faster".

Acceptance

  • parakeet model integrated as a provider option in hero_aibroker
  • hero_voice STT defaults to parakeet
  • Whisper remains available as fallback (for offline / non-parakeet hosts)
  • Latency improvement measured and documented
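
A minimal sketch of how the fallback and latency criteria above could fit together, assuming each STT provider can be wrapped as a plain function from audio bytes to text. The names `transcribe_with_fallback`, `parakeet_stub`, and `whisper_stub` are hypothetical and are not part of hero_voice or hero_aibroker; the stubs stand in for real model calls.

```python
import time
from typing import Callable, Optional, Sequence, Tuple

# Hypothetical provider type: takes raw audio bytes, returns transcript text.
Transcriber = Callable[[bytes], str]


def transcribe_with_fallback(
    audio: bytes,
    providers: Sequence[Tuple[str, Transcriber]],
) -> Tuple[str, str, float]:
    """Try providers in order; return (provider_name, text, elapsed_seconds)."""
    last_error: Optional[Exception] = None
    for name, transcribe in providers:
        start = time.perf_counter()
        try:
            text = transcribe(audio)
        except Exception as exc:  # provider unavailable or failed on this host
            last_error = exc
            continue
        # Elapsed time covers only the provider that succeeded.
        return name, text, time.perf_counter() - start
    raise RuntimeError("no STT provider succeeded") from last_error


# Stand-in providers for illustration only (assumptions, not real model calls).
def parakeet_stub(audio: bytes) -> str:
    raise RuntimeError("parakeet model not available on this host")


def whisper_stub(audio: bytes) -> str:
    return "hello world"


if __name__ == "__main__":
    # Parakeet is listed first (the default); Whisper is the fallback.
    name, text, seconds = transcribe_with_fallback(
        b"", [("parakeet", parakeet_stub), ("whisper", whisper_stub)]
    )
    print(f"{name}: {text!r} in {seconds:.4f}s")
```

Running both providers over the same audio samples and logging the elapsed times would give the before/after numbers for the latency criterion.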

Source: meeting notes 2026-05-06.

Related

  • Owner: scott
  • META: home#219 (https://forge.ourworld.tf/lhumina_code/home/issues/219)
  • hero_voice#23 (https://forge.ourworld.tf/lhumina_code/hero_voice/issues/23): Rustpotter wake-word (separate but related)
mik-tf added this to the ACTIVE project 2026-05-06 17:31:47 +00:00
Owner

Migration to Parakeet is complete. For now we use Parakeet v2, which is English only. In the future we could also support Parakeet v3, which adds European languages at the cost of slightly reduced English accuracy. Our backend library, Sherpa ONNX, also supports a variety of other models, including Whisper, if broader language coverage is needed.
scott closed this issue 2026-05-27 16:28:22 +00:00
Reference
lhumina_code/hero_voice#28