Comfyui-STAR

Author	SHA1	Message	Date
Ethanfel	741c02b88c	Add diagnostic debug logging to pipeline stages Prints tensor stats (shape, dtype, min, max, mean, std) at each stage to help diagnose quality issues. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-15 02:25:10 +01:00
Ethanfel	e2025c6ca0	Move VAE encode outside autocast to match original STAR pipeline The original STAR code runs vae_encode() before the amp.autocast() block. Our code had it inside, which changes how the encoder processes tensors and can produce different latent representations. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-15 02:14:25 +01:00
Ethanfel	0537d9d8a5	Expose denoise parameter (0.1–1.0) in node UI Maps directly to total_noise_levels (denoise * 1000). Default 0.9 matches the original STAR inference script. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-15 02:06:57 +01:00
Ethanfel	8a440761d1	Fix noise level (900 not 1000) and prompt concatenation to match original STAR The original STAR inference uses total_noise_levels=900, preserving input structure during SDEdit. We had 1000 which starts from near-pure noise, destroying the input. Also always append the quality prompt to user text instead of using it only as a fallback. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-15 02:03:34 +01:00
Ethanfel	f7021e95f4	Add segment-based processing for long videos to reduce RAM usage Process videos in overlapping segments (25% overlap with linear crossfade blending) so peak memory is bounded by one segment rather than the full video. New segment_size parameter on the Super-Resolution node (default 0 = all at once, recommended 16-32 for long videos). Also update README clone URL to GitHub mirror. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-14 23:28:01 +01:00
Ethanfel	5f9287cfac	Initial release: ComfyUI nodes for STAR video super-resolution Two-node package wrapping the STAR (ICCV 2025) diffusion-based video upscaling pipeline: - STAR Model Loader: loads UNet+ControlNet, OpenCLIP text encoder, and temporal VAE with auto-download from HuggingFace - STAR Video Super-Resolution: runs the full diffusion pipeline with configurable upscale factor, guidance, solver mode, chunking, and color correction Includes three VRAM offload modes (disabled/model/aggressive) to support GPUs from 12GB to 40GB+. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-14 23:20:27 +01:00

6 Commits