ComfyUI-SelVA

Ethanfel/ComfyUI-SelVA

Commit Graph

Author	SHA1	Message	Date
Ethanfel	528d33be39	fix: trim/pad latent to seq_cfg.latent_seq_len before decoding. Without this the decoder produced 7s instead of 8s due to STFT rounding. Same fix as _prepare_dataset uses for training data. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-08 19:22:09 +02:00
Ethanfel	8195c3114a	feat: add SelVA VAE Roundtrip node. Encodes audio through the VAE then decodes straight back, bypassing the diffusion model entirely. Use this to isolate whether saturation artifacts are introduced by the codec reconstruction (VAE/DAC) or by the LoRA. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-08 19:22:09 +02:00

2 Commits
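
The trim/pad step described in commit 528d33be39 can be illustrated with a minimal sketch. This is not the repository's code: the function name `fit_latent_length`, the (time, channels) list layout, and zero-padding are all assumptions made for illustration; the actual node presumably works on tensors and takes its target length from `seq_cfg.latent_seq_len`.

```python
from typing import List, Sequence

Frame = List[float]


def fit_latent_length(
    latent: Sequence[Frame],
    target_len: int,
    pad_value: float = 0.0,  # assumption: the commit does not say how padding is filled
) -> List[Frame]:
    """Trim or pad a (time, channels) latent to exactly target_len frames.

    Hypothetical helper: forcing the frame count before decoding keeps the
    decoded clip at the configured duration even if an upstream step
    rounded the sequence length down by a frame or more.
    """
    if target_len < 0:
        raise ValueError("target_len must be non-negative")

    # Trim: keep at most target_len frames (copying each frame).
    frames = [list(frame) for frame in latent[:target_len]]

    # Pad: append constant frames until the target length is reached.
    if len(frames) < target_len:
        channels = len(frames[0]) if frames else 0
        missing = target_len - len(frames)
        frames.extend([pad_value] * channels for _ in range(missing))

    return frames


if __name__ == "__main__":
    short = [[0.1, 0.2], [0.3, 0.4], [0.5, 0.6]]  # 3 frames, 2 channels
    long = [[float(i)] for i in range(6)]         # 6 frames, 1 channel

    print(len(fit_latent_length(short, 4)))  # padded up to the target
    print(len(fit_latent_length(long, 4)))   # trimmed down to the target
```

Applying the same length fix at both training-data preparation and inference time, as the commit message says `_prepare_dataset` already does, keeps the decoder input shape consistent between the two paths.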