ComfyUI-MisoTTS/misotts/__init__.py at f7a6f7790d5b3f8b5c22f9a631e14ff080200b2e

Ethanfel/ComfyUI-MisoTTS

Ethanfel f7a6f7790d Initial release: ComfyUI-MisoTTS (modernized CSM 8B)

Modernized MisoTTS integration for ComfyUI with no torchtune/moshi:
- vendored plain-torch Llama backbone (csm_llama), parity-verified Δ=0 vs torchtune
- transformers.MimiModel codec (bit-identical codes to moshi), drops moshi/bnb/sphn
- low-memory loader: streams 32GB fp32 checkpoint to GPU in bf16 (~18GB VRAM)
- nodes: Model Loader, Generate (audiobook chunking + voice anchoring), EPUB Loader
- pin-free requirements; runs on modern torch / Blackwell GPUs

Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>

2026-06-06 23:37:54 +02:00

4 lines

108 B

Python

	`from .inference import Generator, Segment, load_miso_8b`

	`__all__ = ["Generator", "Segment", "load_miso_8b"]`
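The `__all__` list above controls which names a wildcard import of the package exposes. The following is a minimal sketch of that behavior using a throwaway in-memory module. The module name `demo_pkg`, the placeholder objects, and the extra `internal_detail` name are hypothetical stand-ins for illustration, not the real `misotts.inference` implementations.

```python
import sys
import types

# Hypothetical stand-in for the package: same public names as the real
# __init__.py, but the objects are placeholders, not the actual classes.
demo = types.ModuleType("demo_pkg")
exec(
    "Generator = object()\n"
    "Segment = object()\n"
    "def load_miso_8b():\n"
    "    return 'placeholder'\n"
    "internal_detail = 42  # deliberately left out of __all__\n"
    "__all__ = ['Generator', 'Segment', 'load_miso_8b']\n",
    demo.__dict__,
)
sys.modules["demo_pkg"] = demo

# Run a wildcard import inside a fresh namespace and inspect what arrives.
namespace = {}
exec("from demo_pkg import *", namespace)

# Drop dunder entries such as __builtins__ that exec() adds on its own.
exported = sorted(name for name in namespace if not name.startswith("__"))
print(exported)  # ['Generator', 'Segment', 'load_miso_8b']
```

Only the three names listed in `__all__` reach the importing namespace; `internal_detail` stays reachable through explicit attribute access on the module but is not pulled in by `import *`.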