ComfyUI-SelVA

Author	SHA1	Message	Date
Ethanfel	28ee3db337	feat(sampler): add ti_strength blend for TI injection TI via text conditioning produces buzz because SelVA's text path is mean-pooled into a global DiT bias — not rich per-token cross-attention like SD. The optimizer learns a constant spectral artifact rather than semantic style shift. ti_strength=1.0 (default) = full injection as before. ti_strength<1.0 = lerp between original and injected text_clip, allowing the effect to be dialled back without retraining. Applies to both text_clip and neg_text_clip symmetrically. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-09 00:07:57 +02:00
Ethanfel	b89167cfae	fix(ti-trainer): clamp token norm to CLIP manifold to prevent buzz artifacts Diagnosis: learned tokens grew to norm ~3.2 while real CLIP content tokens sit at ~1.0. Model never trained on embeddings that large — activates buzz artifact instead of semantic style shift. Fix: measure mean token norm from content positions (1–20) of dataset CLIP embeddings at startup, clamp learned_tokens per-token after every optimizer step to max 1.5× that reference (50% headroom). Token norm is now logged as current/limit for easy monitoring. ti_sweep_1.json: rebuild around norm_clamp group — n4_clamped (primary diagnostic), prefix_clamped, n8_prefix_clamped, warm_clamped. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-08 23:54:23 +02:00
Ethanfel	f9d092158a	fix(ti): lower default lr/batch, add lr_batch sweep group n4_baseline showed token_norm growing linearly without plateau — classic sign of lr too high relative to parameter count. With only K×1024 params, gradient signal per param is already high-magnitude; high lr causes overshoot rather than convergence. - Default lr: 1e-3 → 2e-4 (matches LoRA working regime) - Default batch_size: 16 → 4 (more diverse gradients, helps norm saturate) - ti_sweep_1.json: add lr_batch group (lr_low_b4, lr_mid_b8, lr_low_b4_prefix, lr_2e3), restructure with clearer groups, annotate n4_baseline as completed with findings Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-08 23:42:22 +02:00
Ethanfel	92535deab2	fix(ti-scheduler): save comparison image after each completed experiment Previously the comparison PNG was only written at the very end of the sweep, so an interrupted run produced no image at all. Now _save_comparison() is called right after _write_summary() for every successful experiment, keeping loss_comparison.png current throughout the sweep. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-08 23:39:30 +02:00
Ethanfel	0b24207ca5	feat(ti-trainer): generate baseline.wav once before training starts Saves baseline.wav + baseline.png in the checkpoint dir using the same seed as the TI eval samples — direct A/B comparison at every checkpoint without re-generating the baseline each time. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-08 23:33:28 +02:00
Ethanfel	e1a2f0ed7d	feat: add inject_mode (suffix/prefix) to TI pipeline Observation: n4_baseline loss barely moved (1.025→0.965 over 3000 steps), token_norm grew linearly without plateau — generator likely ignores last-K CLIP positions (EOS/padding zone) where suffix injects. Fix: add inject_mode parameter throughout the pipeline: - "suffix": replace last K positions (original behavior, model may ignore) - "prefix": replace positions 1:1+K right after BOS — highest attention weight in CLIP, much stronger gradient signal expected Changes: - selva_textual_inversion_trainer.py: _inject_tokens() helper centralises the torch.cat construction for both modes; used in training loop and eval; inject_mode stored in checkpoint files - selva_textual_inversion_loader.py: reads inject_mode from checkpoint, includes in TEXTUAL_INVERSION bundle - selva_sampler.py: uses _inject_tokens() via bundle's inject_mode field - selva_ti_scheduler.py: inject_mode in _PARAM_DEFAULTS, config, and _train_inner call - ti_sweep_1.json: updated with prefix_inject group (n4, n8, n4+warm); n4_baseline marked completed; suffix experiments retained for comparison Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-08 23:31:52 +02:00
Ethanfel	f96265da23	feat(ti-trainer): add loss curve IMAGE output Reuses _draw_loss_curve + _smooth_losses + _pil_to_tensor from the LoRA trainer — raw loss in light blue, smoothed overlay in blue, matches the LoRA trainer's visual style. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-08 23:20:44 +02:00
Ethanfel	c0d95ce356	feat: add ti_sweep_1 experiment file First TI sweep covering the three most impactful axes: - token_count group: n_tokens 4 / 8 / 16 (capacity vs overfitting) - learning_rate group: 5e-4 / 1e-3 / 2e-3 with n_tokens=4 - warm_init group: n4 and n8 seeded from 'mechanical impact sound design' 7 experiments total, 3000 steps each, same data_dir as LoRA sweeps. n4_baseline (lr=1e-3, random init) is the primary reference point. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-08 23:14:31 +02:00
Ethanfel	e37bfe1b1c	feat: add SelVA TI Scheduler for sweep-based textual inversion experiments - SelvaTiScheduler: runs a JSON-defined sweep of TI training experiments, loading the dataset once and reusing it across runs - Collects per-experiment loss history, final/min loss, stability metric (loss_std_last_quarter), and duration — written to experiment_summary.json after each completed run so partial sweeps survive interruption - Resume-aware: skips experiments already marked completed in an existing summary file - Outputs smoothed loss comparison chart (same axes, one curve per experiment) - SelvaTextualInversionTrainer._train_inner now returns a dict {embeddings_path, loss_history} so the scheduler can read results; train() extracts just the path for ComfyUI JSON format: name, description, data_dir, output_root, base config, experiments list with id + param overrides Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-08 23:13:04 +02:00
Ethanfel	bb07bc8169	fix(ti-trainer): guard spectral metrics, drop unused imports - Wrap _spectral_metrics + _save_spectrogram in try-except so a matplotlib or STFT error doesn't abort the checkpoint save (matches LoRA trainer) - Remove unused `import math` and `_pil_to_tensor` import - Drop dead `img` variable (_save_spectrogram returns None) Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-08 23:10:19 +02:00
Ethanfel	e36cdd7947	fix(ti-trainer): fix gradient flow and spectral metric shapes - Replace in-place text_clip assignment with torch.cat so the computation graph correctly links text_input → learned_tokens; in-place assignment into a requires_grad=False leaf severs the graph and learned_tokens receives no gradients - _spectral_metrics(wav, sr): was passing wav.unsqueeze(0) [1,1,L] instead of wav [1,L]; stft mean(dim=1) would return wrong shape [1,T] not [n_freqs] - _save_spectrogram(wav, sr, ...): was passing wav.squeeze(0) [L] (1D) instead of wav [1,L] as the function expects Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-08 23:08:13 +02:00
Ethanfel	e56ece9c1c	feat: add SelVA Textual Inversion Trainer and Loader nodes Learns K CLIP token embeddings ([K, 1024]) with all model weights frozen, keeping generated latents on the decoder's natural manifold — avoids the quality degradation that affects LoRA on BJ's audio dataset. - selva_textual_inversion_trainer.py: trains learned_tokens via AdamW, injects into last K positions of 77-token CLIP embedding, checkpoints with eval audio + spectral metrics - selva_textual_inversion_loader.py: loads .pt bundle, returns TEXTUAL_INVERSION dict for sampler - selva_sampler.py: optional textual_inversion input; injects into both text_clip and neg_text_clip before preprocess_conditions - __init__.py: registers both new nodes Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-08 23:01:44 +02:00
Ethanfel	eed7eefeac	feat: add SelVA HF Smoother and Spectral Matcher preprocessing nodes Two ComfyUI nodes to reduce domain mismatch between custom training audio and the MMAudio VAE's expected spectral distribution: SelvaHfSmoother: blends a low-pass filtered copy (biquad) with the original at a configurable cutoff and blend ratio. Attenuates extreme HF content that BigVGANv2 handles poorly. RMS-preserving. SelvaSpectralMatcher: computes the log-mel energy profile of the clip, compares it per-band to the VAE's normalization means (DATA_MEAN_80D/128D), and applies a smooth STFT-domain gain correction to match the codec's training distribution. Configurable strength and max_gain_db clamp. RMS-preserving. Recommended workflow: SpectralMatcher → HfSmoother → feature extraction. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-08 20:28:16 +02:00
Ethanfel	107bb05f17	fix(vae-roundtrip): pass bigvgan path to encoder-only FeaturesUtils AutoEncoderModule unconditionally asserts vocoder_ckpt_path is not None even when need_vae_encoder=True. Pass best_netG.pt to satisfy the assert; the vocoder weights are not actually used since decode+vocode go through model["feature_utils"]. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-08 20:05:44 +02:00
Ethanfel	10e6095e31	fix(vae-roundtrip): use model feature_utils for decode, add normalize/unnormalize, normalize output - Load fresh FeaturesUtils only for encoding; use model["feature_utils"] for decode+vocode to mirror the exact path the sampler takes - Apply generator.normalize() → unnormalize() around the encoded latent so the decoder receives latents in the same space it expects from inference - Log both encoded and norm→unnorm latent stats to diagnose round-trip fidelity - Normalize output to -27 dBFS (matching training clip RMS) and clamp to [-1, 1] to prevent clipping artifacts in the output waveform Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-08 19:50:01 +02:00
Ethanfel	528d33be39	fix: trim/pad latent to seq_cfg.latent_seq_len before decoding Without this the decoder produced 7s instead of 8s due to STFT rounding. Same fix as _prepare_dataset uses for training data. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-08 19:22:09 +02:00
Ethanfel	8195c3114a	feat: add SelVA VAE Roundtrip node Encodes audio through the VAE then decodes straight back, bypassing the diffusion model entirely. Use this to isolate whether saturation artifacts are introduced by the codec reconstruction (VAE/DAC) or by the LoRA. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-08 19:15:20 +02:00
Ethanfel	c8e6b91f67	feat: add alpha_scale_sweep to fix LoRA noise contamination Previous sweep used alpha=rank (scale=1.0) which at rank 128/256 drowned base model priors — spectral flatness went from 0.013 (baseline) to 0.094. This sweep tests alpha dramatically below rank across r16/r32/r128 to find the scale where LoRA nudges rather than overwrites. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-08 17:55:05 +02:00
Ethanfel	fdce9cbbf1	feat: evaluate adapters on all dataset clips, not just clip_001 - _eval_sample gains clip_idx param (default 0, backward compatible) - Evaluator loops over all dataset clips per adapter, saves one WAV per clip - Reference metrics computed for all clips and averaged - Comparison chart and summary use avg_metrics across all clips - Eliminates bias from evaluating on an unrepresentative single clip Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-08 17:42:55 +02:00
Ethanfel	42ceb4b153	fix: preserve original audio extension when copying reference file shutil.copy2 was writing FLAC binary to reference.wav — unplayable. Now copies as reference{.flac/.wav/etc} matching the source extension. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-08 17:31:26 +02:00
Ethanfel	4505b89db1	feat: add reference audio to LoRA evaluator Loads the first clip's original audio (same clip used for inference), copies it to output_dir/reference.wav, runs spectral metrics and saves a spectrogram. Appears first in the comparison chart so generated samples can be judged against the target sound. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-08 17:30:33 +02:00
Ethanfel	dbfa7b23fe	feat: add eval_r128_candidates.json Evaluates top 5 adapters from r128_sweet_spot: baseline, lr_5e4_r128, lr_3e4_r256, lr_3e4_r128, curriculum_lr_3e4 final + step 6000 checkpoint (before regression) for spectral comparison. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-08 17:28:28 +02:00
Ethanfel	d2e1ea7b80	feat: add SelVA LoRA Evaluator node Generates audio samples from a list of adapters against a fixed reference clip, collects spectral metrics for each, and outputs a comparison bar chart + eval_summary.json. Useful for comparing sweep candidates before committing to a next round of training. JSON format: name, data_dir, output_dir, steps, seed, adapters[{id, path}]. Empty path = baseline (no LoRA). Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-08 17:26:50 +02:00
Ethanfel	9a47508d2d	fix: lower RMS normalization target from -23/-20 to -27 dBFS Training clips at -23 LUFS measure -25 to -31 dBFS RMS (avg ~-27). Normalizing output to -23 dBFS was 4-8 dB too loud, causing saturation on clips with high crest factor and peaks near 0 dBFS. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-08 17:19:20 +02:00
Ethanfel	678c050f11	fix: make normalize(x1) assignment explicit in training loop normalize() uses in-place ops so it worked, but reading the return value makes the intent clear and guards against future refactors. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-08 15:43:42 +02:00
Ethanfel	1be07a80d2	feat: add cosine LR decay schedule to trainer and scheduler - Add lr_schedule param (constant\|cosine) to SelvaLoraTrainer - Cosine decays LR from initial value to ~0 after warmup, preventing the oscillation observed at steps 6000-8000 with lr=2e-4 flat - Wire lr_schedule through scheduler _PARAM_DEFAULTS and _train_inner call - Add g5_r128_lr_2e4_cosine and g5_r128_lr_3e4_cosine to r128_sweet_spot sweep Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-08 13:25:01 +02:00
Ethanfel	58e1985af2	feat: SelVA Skip Experiment node + save partial scalars on skip - New node: SelVA Skip Experiment — writes skip_current.flag from UI, queue in a second workflow tab while scheduler is running - SkipExperiment now attaches partial loss/grad/spectral data to the exception so the scheduler saves all collected scalars in the summary Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-08 13:10:43 +02:00
Ethanfel	264dc49d42	feat: skip_current.flag to cancel experiment and move to next Create the flag file in the sweep output_root to skip the running experiment at the next log interval (every 50 steps): touch /path/to/experiment/skip_current.flag Scheduler marks it as 'skipped' in the summary and continues. Skipped experiments are NOT resumed on restart (unlike failed ones). Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-08 13:09:01 +02:00
Ethanfel	fec5c86f09	feat: add spectral_flatness and temporal_variance to eval metrics spectral_flatness (Wiener entropy) — 0=tonal, 1=white noise. Rising value across steps directly flags noise contamination. temporal_variance — RMS std/mean per frame. Low = lifeless/compressed. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-08 12:45:40 +02:00
Ethanfel	2861327016	feat: spectral metrics per eval sample in experiment summary Computes hf_energy_ratio (>4kHz), spectral_centroid_hz, spectral_rolloff_hz at each save_every checkpoint. Logged to console and stored in experiment_summary.json under results.spectral_metrics[step]. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-08 12:44:43 +02:00
Ethanfel	c4687521ef	feat: save spectrogram PNG alongside each eval sample Log-frequency dB spectrogram (inferno colormap, 100Hz–16kHz) saved as step_XXXXX.png next to step_XXXXX.wav in samples/ subfolder. Makes high-frequency rolloff (low bitrate signature) immediately visible. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-08 12:42:34 +02:00
Ethanfel	8717af2728	fix: prevent saturation from RMS normalization clipping peaks RMS normalize to target then scale back if peaks exceed 1.0, preserving dynamics instead of hard-clipping transients. Eval sample target updated to -23 dBFS to match training data. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-08 12:29:29 +02:00
Ethanfel	78e9838a83	fix: replace peak normalization with RMS normalization at -20 dBFS Peak norm was slamming output to full scale regardless of content level, making generated audio several times louder than training clips. RMS norm to -20 dBFS matches typical processed audio level. Sampler exposes target_lufs (-40 to -6, default -20) for user control. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-08 12:06:48 +02:00
Ethanfel	94610b8943	feat: r128_sweet_spot sweep — noise-free LR search + rank 256 9 experiments targeting loss 0.25-0.35 without LoRA+ noise. Tests higher base LR (2e-4/3e-4/5e-4), curriculum combos, conservative LoRA+ ratio=4, and rank 256 baseline + lr=3e-4. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-08 10:46:08 +02:00
Ethanfel	f5f7f2ae68	fix: eval sample seed 0 -> 42 Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-08 10:32:43 +02:00
Ethanfel	1663b39833	fix: bump eval sample to 25 ODE steps (was 8) Inference is fast on RTX PRO 6000 — 8 steps was washing out quality differences between experiments. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-08 10:32:27 +02:00
Ethanfel	a7923d5fb7	feat: r64_overnight sweep — focused rank-64 ablation at 8000 steps 15 experiments across rank (64/128), alpha, regularisation, LR, target layers, and combined stacks. Based on tier1_thorough early results confirming rank 64 sounds best perceptually. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-08 01:32:23 +02:00
Ethanfel	786a57c424	feat: sweep resume + 5 additional experiments (LR, target, extended) Scheduler: on re-run, reads existing experiment_summary.json and skips already-completed experiments — safe to stop and restart mid-sweep. tier1_thorough: adds g5 (lr 3e-5/3e-4), g6 (full target attn.qkv+linear1 at r16 and r64), and g4_full_r64_6k (6000-step extended run) — 17 total. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-08 00:59:16 +02:00
Ethanfel	f15e02b0b8	fix: eval samples use fixed clip/seed, save to samples/ subfolder - Always sample dataset[0] with fixed noise seed so checkpoints are directly comparable (hear the model improve step by step) - Save to output_dir/samples/step_XXXXX.wav instead of alongside checkpoints Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-08 00:54:37 +02:00
Ethanfel	0682a536cb	fix: point data_dir to features/ subdir where .npz and audio live Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-08 00:45:32 +02:00
Ethanfel	0000878e76	feat: thorough overnight sweep + dataset browser updates - Dataset browser: audio/features now resolve through features/ subdir - tier1_sweep.json: update data_dir to BJ dataset path - tier1_thorough.json: 12-experiment overnight sweep across 4 groups (rank 16/32/64, alpha scaling, LoRA+/dropout/curriculum isolation, full Tier 1 stack at r16 and r64) — output to BJ/experiment/ Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-08 00:38:19 +02:00
Ethanfel	675644189d	feat: add SelVA Dataset Browser node Companion node for inspecting dataset.json entries by integer index. Outputs video (.mp4), audio (.wav/.flac), features (.npz), frames dir, mask dir, label, and max_index for constraining the index widget range. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-07 14:55:27 +02:00
Ethanfel	82fb7a0009	docs: note AudioX shows no perceptual quality gain on V2A vs SelVA Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-07 09:12:00 +02:00
Ethanfel	af4777d2d7	docs: add AudioX vs SelVA evaluation Architecture comparison, capability matrix, integration cost estimate, LoRA training difficulty analysis, and license implications. Verdict: SelVA remains preferred for V2A + LoRA fine-tuning; AudioX adds value for music generation, inpainting, and text-to-audio tasks. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-07 09:11:09 +02:00
Ethanfel	ed8abf7a5b	docs: add video format recommendations to dataset preparation section New section 1.1 covers aspect ratio (16:9 landscape preferred), resolution (≥480p), frame rate (any, use VHS_VIDEOINFO), and portrait handling (center-crop to square). Based on CLIP 384px and Synchformer 224px internals. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-06 13:44:14 +02:00
Ethanfel	21ed93d3ee	docs: add audio dataset pipeline reference doc Full research notes on cleaning, augmentation, and quality metrics for generative model training. Covers LUFS normalization, AudioSep, waveform augmentation (pitch shift, RIR, EQ), latent mixup, DNSMOS gating, tool install commands, and key paper references. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-06 13:37:48 +02:00
Ethanfel	f1e2bbd55b	feat: add first experiment sweep file for Tier 1 ablation 6 experiments: baseline, LoRA+ (ratio=16), dropout 0.05, dropout 0.1, curriculum sampling, and all three combined. bf16 batch 16, 2000 steps, seed 42. data_dir placeholder needs to be updated before running. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-06 13:15:06 +02:00
Ethanfel	3d9221c248	fix: three bugs in scheduler and trainer - trainer: raise ValueError early when remaining steps < log_interval (50) instead of UnboundLocalError on smoothed_img/final_path at return - trainer: use None in grad_norm_history instead of silent 0.0 when grad_accum > log_interval and no optimizer step fired in the interval - trainer: include start_step in _train_inner return dict - scheduler: use start_step from result dict for min_loss_step and loss_at_steps (fixes wrong step labels on resumed experiments) Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-06 13:11:25 +02:00
Ethanfel	2d200395af	feat: add grad norm logging and richer experiment summary output trainer: - Track gradient norm before clipping at each optimizer step - Log avg grad_norm per log_interval alongside loss in console output - Include grad_norm_history in _train_inner return dict scheduler: - Add system block to summary (GPU name, VRAM, torch/CUDA version) - Include full loss_history and grad_norm_history arrays in each experiment result (50-step resolution, not just save_every checkpoints) - Add loss_std_last_quarter stability metric (std dev of raw loss over last 25% of steps — high value indicates unstable training) - Add log_interval field so consumers know the x-axis resolution Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-06 13:06:39 +02:00
Ethanfel	3ec380a27e	feat: add SelVA LoRA Scheduler node for automated experiment sweeps - Extract _prepare_dataset() from SelvaLoraTrainer.train() as a module-level function so the dataset can be encoded once and reused across experiments - Change _train_inner() return value from tuple to dict (adds loss_history, meta, completed; train() unpacks for ComfyUI — no change to node outputs) - New SelvaLoraScheduler node: reads a JSON sweep file, runs N experiments sequentially, writes experiment_summary.json (updated after each run) and loss_comparison.png with all smoothed curves overlaid on the same axes - Register SelvaLoraScheduler in nodes/__init__.py Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-06 13:03:21 +02:00

1 2 3 4 5

234 Commits