ComfyUI-SelVA

Files

Ethanfel f9d092158a fix(ti): lower default lr/batch, add lr_batch sweep group

n4_baseline showed token_norm growing linearly without plateau — classic
sign of lr too high relative to parameter count. With only K×1024 params,
gradient signal per param is already high-magnitude; high lr causes
overshoot rather than convergence.

- Default lr: 1e-3 → 2e-4 (matches LoRA working regime)
- Default batch_size: 16 → 4 (more diverse gradients, helps norm saturate)
- ti_sweep_1.json: add lr_batch group (lr_low_b4, lr_mid_b8,
  lr_low_b4_prefix, lr_2e3), restructure with clearer groups,
  annotate n4_baseline as completed with findings
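
The "growing linearly without plateau" diagnosis above can be checked mechanically. The sketch below is an illustration only, not code from this repository: `has_plateaued`, `window`, and `rel_tol` are hypothetical names, and the input is assumed to be a list of token_norm values logged at regular step intervals.

```python
def has_plateaued(norms, window=4, rel_tol=0.01):
    """Return True if the logged norm has roughly stopped changing.

    Hypothetical helper: compares the mean per-interval change over the
    last `window` intervals against `rel_tol` times the latest norm.
    """
    if len(norms) < window + 1:
        return False  # not enough history to judge
    recent = norms[-(window + 1):]
    mean_delta = (recent[-1] - recent[0]) / window
    return abs(mean_delta) <= rel_tol * abs(recent[-1])


# Synthetic series, for illustration only (not measured data).
linear_growth = [1.0 + 0.5 * i for i in range(20)]      # keeps climbing
saturating = [5.0 * (1 - 0.5 ** i) for i in range(20)]  # levels off near 5

print(has_plateaued(linear_growth))
print(has_plateaued(saturating))
```

A run whose norm still fails this check at the end of training would be a candidate for the lower-lr entries in the lr_batch group.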

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

2026-04-08 23:42:22 +02:00

alpha_scale_sweep.json

feat: add alpha_scale_sweep to fix LoRA noise contamination

2026-04-08 17:55:05 +02:00

eval_r128_candidates.json

feat: add eval_r128_candidates.json

2026-04-08 17:28:28 +02:00

r64_overnight.json

feat: r64_overnight sweep — focused rank-64 ablation at 8000 steps

2026-04-08 01:32:23 +02:00

r128_sweet_spot.json

feat: add cosine LR decay schedule to trainer and scheduler