5 Commits

Author SHA1 Message Date
Ethanfel 26295e4db7 feat: auto-discover user presets from the presets folder
Drop any audio file (wav/flac/mp3/ogg/m4a) into the presets cache dir and
it will appear as "<name> (local)" in the Voice Preset dropdown on next
ComfyUI restart. Add a same-stem .txt file for the transcript.

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
2026-04-06 09:37:17 +02:00
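
The auto-discovery described in commit 26295e4db7 could be sketched roughly as follows. This is a minimal illustration, not the repository's actual code: the function name `discover_local_presets` and the returned dictionary layout are assumptions. Only the extension list, the "<name> (local)" label, and the same-stem .txt convention come from the commit message.

```python
from pathlib import Path

# Extensions named in the commit message.
AUDIO_EXTENSIONS = {".wav", ".flac", ".mp3", ".ogg", ".m4a"}


def discover_local_presets(preset_dir):
    """Map '<stem> (local)' to (audio_path, transcript) for each audio file.

    Hypothetical helper: the real node may name, order, or filter entries
    differently (for example, by skipping the built-in cached presets).
    """
    presets = {}
    root = Path(preset_dir)
    if not root.is_dir():
        return presets
    for path in sorted(root.iterdir()):
        if not path.is_file() or path.suffix.lower() not in AUDIO_EXTENSIONS:
            continue
        # A same-stem .txt file, if present, supplies the transcript.
        transcript_file = path.with_suffix(".txt")
        transcript = (
            transcript_file.read_text(encoding="utf-8").strip()
            if transcript_file.is_file()
            else ""
        )
        presets[f"{path.stem} (local)"] = (path, transcript)
    return presets
```

In this sketch, two audio files that share a stem (say `voice.wav` and `voice.mp3`) map to the same label, and the later one in sorted order wins. Because the scan would run when the dropdown is built, new files show up only after a ComfyUI restart, which matches the commit message.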
Ethanfel 4c42322c6f Expand voice presets to 8 voices (3 female, 5 male)
All transcribed via whisper-medium. Sources: Chatterbox demo GCS bucket
(ResembleAI) and F5-TTS repo (SWivid).

Female: Shadowheart, American actress, Podcast host
Male: Nature, Old Hollywood, Rick Sanchez, Stewie Griffin,
      Harvey Keitel, Conan O'Brien

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
2026-04-05 18:51:43 +02:00
Ethanfel c109e860a8 Add transcript for Shadowheart preset (transcribed via whisper-medium)
Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
2026-04-05 18:39:14 +02:00
Ethanfel 75e74075f5 Restore Shadowheart preset; user will transcribe via Whisper node
Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
2026-04-05 18:33:52 +02:00
Ethanfel 8de201a4c9 Add OmniVoice Voice Preset node with two female voice samples
Two built-in presets, auto-downloaded and cached to ComfyUI/models/omnivoice/presets/:
- "Nature – female, warm" (F5-TTS basic_ref_en.wav, transcript included)
- "Shadowheart – female, expressive" (Chatterbox demo, connect Whisper for transcript)

Outputs ref_audio (AUDIO) and ref_text (STRING) — wire directly into
OmniVoice Generate. Updated default workflow to use this node.

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
2026-04-05 18:19:29 +02:00
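
For readers unfamiliar with ComfyUI custom nodes, the interface described in commit 8de201a4c9 (a preset dropdown in, `ref_audio` and `ref_text` out) might look something like the skeleton below. It is a sketch under stated assumptions: the class name, the `PRESETS` table, the file stems, and the placeholder `load_audio` function are all hypothetical, and audio decoding is stubbed out to keep the example dependency-free. Only the output names and types and the cache directory come from the commit message.

```python
from pathlib import Path

# Hypothetical preset table: dropdown label -> file stem in the cache dir.
PRESETS = {
    "Nature": "nature",
    "Shadowheart": "shadowheart",
}
# Cache directory named in the commit message.
PRESET_DIR = Path("ComfyUI/models/omnivoice/presets")


def load_audio(path):
    """Placeholder loader (assumption). A real node would decode the file and
    return ComfyUI's AUDIO dict with a waveform tensor and a sample rate."""
    return {"waveform": None, "sample_rate": None, "path": str(path)}


class VoicePresetSketch:
    # Standard ComfyUI node attributes.
    CATEGORY = "audio"
    RETURN_TYPES = ("AUDIO", "STRING")
    RETURN_NAMES = ("ref_audio", "ref_text")
    FUNCTION = "load"

    @classmethod
    def INPUT_TYPES(cls):
        # A list of strings as the type renders as a dropdown.
        return {"required": {"preset": (sorted(PRESETS),)}}

    def load(self, preset, preset_dir=PRESET_DIR):
        stem = PRESETS[preset]
        base = Path(preset_dir)
        audio = load_audio(base / f"{stem}.wav")
        transcript_file = base / f"{stem}.txt"
        # An empty string stands in for a missing transcript.
        ref_text = (
            transcript_file.read_text(encoding="utf-8").strip()
            if transcript_file.is_file()
            else ""
        )
        return (audio, ref_text)
```

Returning an empty `ref_text` when no transcript file exists mirrors the original Shadowheart preset, which shipped without a transcript and expected a Whisper node to supply one.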