ComfyUI-SelVA

Author	SHA1	Message	Date
Ethanfel	8e3ab999f0	fix: load VAE state dict with strict=False vae.ckpt is a full training checkpoint containing discriminator, STFT loss modules, and EMA wrappers that are absent from the inference AudioAutoencoder. strict=False ignores these training-only keys while still loading all encoder/decoder/bottleneck weights correctly. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-27 19:51:51 +01:00
Ethanfel	35d0615253	feat: auto-install pip venv for feature extraction on first use PrismAudioFeatureExtractor now creates and populates a managed venv (_extract_env/) automatically when python_env is left as the default 'python'. Also adds scripts/install_extract_env.sh for manual/Docker setup without conda. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-27 19:27:27 +01:00
Ethanfel	618e7de64b	feat: PrismAudioTextOnly node with correct T5-Gemma encoding Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-27 18:09:11 +01:00
Ethanfel	3d62688e8c	feat: PrismAudioSampler node with correct metadata format and peak normalization Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-27 18:07:33 +01:00
Ethanfel	7c54ee8482	feat: PrismAudioFeatureExtractor node with subprocess bridge and conda env Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-27 18:06:10 +01:00
Ethanfel	3f35aa39f2	feat: PrismAudioFeatureLoader node for pre-computed .npz files Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-27 18:04:32 +01:00
Ethanfel	1043f4bacb	feat: PrismAudioModelLoader node with auto-download and adaptive VRAM Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-27 18:02:47 +01:00
Ethanfel	baa80de194	feat: project scaffolding with shared utils and node registration Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-03-27 16:59:21 +01:00

1 2 3 4 5

208 Commits