Files
ComfyUI-Omnivoice/workflows/voice_cloning.json
T
Ethanfel 4c42322c6f Expand voice presets to 8 voices (3 female, 5 male)
All transcribed via whisper-medium. Sources: Chatterbox demo GCS bucket
(ResembleAI) and F5-TTS repo (SWivid).

Female: Shadowheart, American actress, Podcast host
Male: Nature, Old Hollywood, Rick Sanchez, Stewie Griffin,
      Harvey Keitel, Conan O'Brien

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
2026-04-05 18:51:43 +02:00

85 lines
2.4 KiB
JSON
Raw Blame History

This file contains ambiguous Unicode characters
This file contains Unicode characters that might be confused with other characters. If you think that this is intentional, you can safely ignore this warning. Use the Escape button to reveal them.
{
"last_node_id": 4,
"last_link_id": 4,
"nodes": [
{
"id": 1,
"type": "OmniVoiceModelLoader",
"pos": [40, 80],
"size": {"0": 300, "1": 100},
"flags": {},
"order": 0,
"mode": 0,
"outputs": [
{"name": "model", "type": "OMNIVOICE_MODEL", "links": [1], "shape": 3, "slot_index": 0}
],
"properties": {"Node name for S&R": "OmniVoiceModelLoader"},
"widgets_values": ["cuda:0", "float16"]
},
{
"id": 2,
"type": "OmniVoiceVoicePreset",
"pos": [40, 240],
"size": {"0": 300, "1": 80},
"flags": {},
"order": 1,
"mode": 0,
"outputs": [
{"name": "ref_audio", "type": "AUDIO", "links": [2], "shape": 3, "slot_index": 0},
{"name": "ref_text", "type": "STRING", "links": [3], "shape": 3, "slot_index": 1}
],
"properties": {"Node name for S&R": "OmniVoiceVoicePreset"},
"widgets_values": ["Shadowheart female, expressive (Chatterbox)"]
},
{
"id": 3,
"type": "OmniVoiceGenerate",
"pos": [420, 80],
"size": {"0": 380, "1": 310},
"flags": {},
"order": 2,
"mode": 0,
"inputs": [
{"name": "model", "type": "OMNIVOICE_MODEL", "link": 1, "slot_index": 0},
{"name": "ref_audio", "type": "AUDIO", "link": 2, "slot_index": 1},
{"name": "ref_text", "type": "STRING", "link": 3, "slot_index": 2}
],
"outputs": [
{"name": "audio", "type": "AUDIO", "links": [4], "shape": 3, "slot_index": 0}
],
"properties": {"Node name for S&R": "OmniVoiceGenerate"},
"widgets_values": [
"Hello! This is a test of the OmniVoice text-to-speech system.",
"voice_cloning",
"",
1.0,
32
]
},
{
"id": 4,
"type": "SaveAudio",
"pos": [880, 80],
"size": {"0": 300, "1": 100},
"flags": {},
"order": 3,
"mode": 0,
"inputs": [
{"name": "audio", "type": "AUDIO", "link": 4, "slot_index": 0}
],
"properties": {"Node name for S&R": "SaveAudio"},
"widgets_values": ["omnivoice"]
}
],
"links": [
[1, 1, 0, 3, 0, "OMNIVOICE_MODEL"],
[2, 2, 0, 3, 1, "AUDIO"],
[3, 2, 1, 3, 2, "STRING"],
[4, 3, 0, 4, 0, "AUDIO"]
],
"groups": [],
"config": {},
"extra": {"ds": {"scale": 0.9, "offset": [0, 0]}},
"version": 0.4
}