Commit Graph

5 Commits

Author SHA1 Message Date
Ethanfel fd5922b1cd feat(video): add path-string loader variant
UniverSR Load Video Audio (Path) mirrors FoleyTuneVideoLoader: takes an
absolute video_path (for files outside input/) and outputs the same
(UNIVERSR_VIDEO, AUDIO). Shared load body factored into _load_video_audio;
registered for the inline preview (post-run) in the web extension.

Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>
2026-06-01 13:29:16 +02:00
Ethanfel 8972fed805 refactor(video): single Foley-style upload loader with inline preview
Replace the path+dropdown loader (and its non-rendering ui.gifs) with one
node mirroring FoleyTuneVideoLoaderUpload: a `video` upload widget with
drag-drop and an inline video preview, shipped via web/js/UniverSRVideo.js
(adapted from FoleyTuneVideo.js) + WEB_DIRECTORY.

The loader now outputs (UNIVERSR_VIDEO, AUDIO) so you can super-resolve the
audio and remux it. Updated the example workflow output order and README.

Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>
2026-06-01 13:25:06 +02:00
Ethanfel 5acaffab92 feat: video loader + combiner nodes
Adapted from HunyuanVideo-FoleyTune for the audio-SR workflow:

- UniverSR Load Video Audio: extract a video's audio track via ffmpeg
  (WAV pipe + soundfile, no torchcodec) and carry a UNIVERSR_VIDEO
  reference forward, with an inline video preview.
- UniverSR Video Combiner: mux the enhanced audio back onto the source
  video without re-encoding video (-c:v copy), trim-aware, with output
  auto-increment and preview.

Both registered alongside the SR nodes; ffmpeg + soundfile required only
for these. Adds README docs and an example video workflow.

Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>
2026-06-01 13:08:28 +02:00
Ethanfel 12cbc415cf docs: full node documentation in README
Comprehensive README: features, install, model auto-download, a
parameter reference for both nodes, an input_sr guide (SR vs BWE),
recommended settings, chunking, how-it-works, and troubleshooting.

Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>
2026-06-01 13:02:10 +02:00
Ethanfel 5f29b225b7 Initial release: ComfyUI-UniverSR
ComfyUI nodes for UniverSR (ICASSP 2026) — vocoder-free audio
super-resolution (8/12/16/24 kHz → 48 kHz) via flow matching.

- UniverSR Model Loader: presets auto-download to models/universr,
  plus local dir / raw .pth (from_local) loading, with caching.
- UniverSR Super-Resolution: chunked overlap-add for long audio,
  per-channel stereo, seed control with global-RNG isolation,
  wet/dry blend, and an optional before/after spectrogram.
- Vendors the universr inference package under vendor/ (prefers an
  installed copy); only extra dep beyond ComfyUI's stack is torchdiffeq.

Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>
2026-06-01 12:59:42 +02:00