Ethanfel
1e9551152e
feat: add DITTO optimizer, upgrade BigVGAN trainer, document all nodes
...
BigVGAN trainer (selva_bigvgan_trainer.py):
- Add snake_alpha_only train mode: tunes only ~27K per-channel α params
  (0.024% of 112M); physically cannot cause harmonic smearing
- Add lambda_l2sp: L2-SP anchor regularization toward pretrained weights
- Add optional discriminator_path: frozen MPD+MRD feature matching loss
  replaces mel L1 when a BigVGAN discriminator checkpoint is provided
- Inline MPD + MRD discriminator implementations (no extra dependencies)
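For reference, the lambda_l2sp term above can be read as a penalty on the squared distance between the current weights and their pretrained values. The following is a minimal sketch of that idea on flat lists of floats, not the trainer's actual code: the function name, the flat-list representation, and the plain `lambda * sum` scaling (some formulations use `lambda / 2`) are all assumptions.

```python
def l2sp_penalty(params, anchor, lambda_l2sp):
    """L2-SP style penalty: lambda * sum((w - w0)^2).

    params      -- current parameter values (hypothetical flat list)
    anchor      -- pretrained values the weights are pulled toward
    lambda_l2sp -- regularization strength; 0 disables the term
    """
    if len(params) != len(anchor):
        raise ValueError("params and anchor must have the same length")
    # Squared distance to the pretrained starting point, not to zero
    # (which is what distinguishes L2-SP from ordinary weight decay).
    return lambda_l2sp * sum((p - a) ** 2 for p, a in zip(params, anchor))


# The penalty is zero while the weights still equal the pretrained ones
# and grows as they drift away.
unchanged = l2sp_penalty([1.0, 2.0], [1.0, 2.0], 0.1)
drifted = l2sp_penalty([1.0, 3.0], [0.0, 1.0], 0.5)
```

In a training loop this value would be added to the main loss, so a larger lambda_l2sp keeps the fine-tuned vocoder closer to the pretrained checkpoint.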
DITTO optimizer (selva_ditto_optimizer.py):
- New node: inference-time noise optimization (arXiv:2401.12179)
- Optimizes x₀ via mel Gram matrix style loss against BJ reference clips
- All model weights frozen — zero quality degradation risk
- Truncated BPTT through last n_grad_steps of the ODE (configurable)
- Gradient checkpointing on each differentiated step
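As an illustration of the mel Gram matrix style loss mentioned above, here is a minimal pure-Python sketch. It is not the node's implementation: the function names, the division by frame count, and the mean-squared-error comparison are assumptions chosen for clarity. A mel spectrogram is represented as a list of rows, one row per mel band.

```python
def gram_matrix(mel):
    """Band-by-band correlation matrix of a mel spectrogram.

    mel -- list of n_mels rows, each a list of n_frames floats.
    Dividing by n_frames (an assumption) makes the result independent
    of clip length, so clips of different durations can be compared.
    """
    n_frames = len(mel[0])
    return [
        [sum(a * b for a, b in zip(row_i, row_j)) / n_frames for row_j in mel]
        for row_i in mel
    ]


def gram_style_loss(mel_gen, mel_ref):
    """Mean squared difference between the two Gram matrices."""
    g_gen = gram_matrix(mel_gen)
    g_ref = gram_matrix(mel_ref)
    n = len(g_gen)
    if n != len(g_ref):
        raise ValueError("both spectrograms need the same number of mel bands")
    total = sum(
        (g_gen[i][j] - g_ref[i][j]) ** 2 for i in range(n) for j in range(n)
    )
    return total / (n * n)


# Identical spectrograms give zero loss; a louder reference does not.
mel_a = [[1.0, 0.0], [0.0, 1.0]]
mel_b = [[2.0, 0.0], [0.0, 2.0]]
same = gram_style_loss(mel_a, mel_a)
different = gram_style_loss(mel_a, mel_b)
```

Because the Gram matrix discards time ordering, a loss of this kind compares spectral texture rather than exact content, which is why it suits matching the character of reference clips. In the optimizer described above, the gradient of such a loss would flow back to the initial noise while the model weights stay frozen.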
Docs:
- README: document all 20 nodes (was 3), add workflow diagrams
- STYLE_TRANSFER.md: new guide covering DITTO, vocoder fine-tuning tiers,
  why LoRA/TI fail, combined approach, dataset prep
Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
2026-04-09 12:04:05 +02:00
Ethanfel
b519b042e2
docs: document mask inputs and normalize toggle in README
...
Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
2026-04-05 10:43:42 +02:00
Ethanfel
d495939367
docs: rewrite README for SelVA
...
Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
2026-04-04 17:12:28 +02:00
Ethanfel
807a2e51fb
docs: fix README references — PrismAudio not ThinkSound
...
Point links to huggingface.co/FunAudioLLM/PrismAudio and use public
GitHub URL for install instructions.
Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
2026-03-28 11:16:31 +01:00
Ethanfel
62a3c5d0dc
docs: rewrite README to reflect current node design
...
Update node descriptions, inputs/outputs, workflows, and environment
setup to match current implementation (managed_env dropdown, VHS
video_info, auto-duration, fps output, synchformer auto-resolve).
Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
2026-03-28 11:10:07 +01:00
Ethanfel
807f00417f
docs: README with installation and usage instructions
...
Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
2026-03-27 18:15:17 +01:00