ComfyUI-SelVA

Ethanfel f4a7292cde feat: add optional MASK input to SelVA Feature Extractor

Allows per-frame or static segmentation masks to be applied before CLIP
and sync encoding, zeroing background pixels. Useful when multiple objects
compete for the same sound and text prompting alone is insufficient.

- _apply_mask(): resizes mask spatially (nearest-exact), samples temporally
  to match sampled frame count, multiplies into frames
- _hash_inputs(): includes mask bytes in cache key (begin/mid/end sampling)
- INPUT_TYPES: mask added to optional inputs with tooltip
- extract_features(): mask=None parameter, applied after _resize_frames for
  both CLIP (384px) and sync (224px) paths, before normalization
- Log line notes when masking is active
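
The masking and cache-key steps listed above could look roughly like the sketch below. It is an illustration only, not the code from this commit: it uses NumPy rather than the node's actual tensor library, and the function names `nearest_indices`, `apply_mask` and `mask_cache_key`, the tensor layouts, the temporal sampling rule, and the choice of SHA-256 are all assumptions. "begin/mid/end sampling" is read here as hashing the first, middle and last mask frames.

```python
import hashlib

import numpy as np


def nearest_indices(src_len: int, dst_len: int) -> np.ndarray:
    """Map each of dst_len output positions to its nearest source index.

    Uses pixel centres (i + 0.5), which is the convention assumed here
    for 'nearest-exact' resizing.
    """
    idx = np.floor((np.arange(dst_len) + 0.5) * src_len / dst_len)
    return np.clip(idx.astype(np.int64), 0, src_len - 1)


def apply_mask(frames: np.ndarray, mask: np.ndarray) -> np.ndarray:
    """Zero background pixels of frames.

    frames: (T, H, W, C) float array (assumed layout).
    mask:   (M, h, w) per-frame mask, or (h, w) static mask, values in [0, 1].
    """
    if mask.ndim == 2:
        mask = mask[None]  # a static mask is treated as a single mask frame
    t, h, w, _ = frames.shape
    ti = nearest_indices(mask.shape[0], t)  # temporal sampling (assumed rule)
    yi = nearest_indices(mask.shape[1], h)  # spatial resize, rows
    xi = nearest_indices(mask.shape[2], w)  # spatial resize, columns
    resized = mask[ti][:, yi][:, :, xi]     # shape (T, H, W)
    return frames * resized[..., None]      # broadcast over channels


def mask_cache_key(mask: np.ndarray) -> str:
    """Hash the shape plus the first, middle and last mask frames."""
    if mask.ndim == 2:
        mask = mask[None]
    data = np.ascontiguousarray(mask, dtype=np.float32)
    n = data.shape[0]
    digest = hashlib.sha256(str(data.shape).encode())
    for i in sorted({0, n // 2, n - 1}):
        digest.update(data[i].tobytes())
    return digest.hexdigest()
```

Hashing only a few mask frames keeps the key cheap for long clips, at the cost of missing edits that touch only the unsampled frames.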

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

2026-04-05 08:34:13 +02:00

__init__.py

fix: bug sweep and improvements

2026-04-04 18:04:35 +02:00

selva_feature_extractor.py

feat: add optional MASK input to SelVA Feature Extractor

2026-04-05 08:34:13 +02:00

selva_model_loader.py

feat: comprehensive node improvements

2026-04-04 18:16:03 +02:00

selva_sampler.py

feat: comprehensive node improvements

2026-04-04 18:16:03 +02:00

utils.py

chore: remove all PrismAudio code from main branch

2026-04-04 17:58:31 +02:00