Saves frames as uint8 .npy instead of H.264 MP4, eliminating the
lossy codec roundtrip. extract_features.py loads .npy directly and
skips decord when given a numpy file. Passes --source_fps for
correct temporal sampling.
Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
Each step now prints elapsed seconds on completion.
Total time printed at the end to identify bottlenecks.
Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
T5-Gemma outputs BFloat16 which numpy does not support.
Cast all feature tensors with .float() before .numpy().
Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
Creates data_utils/v2a_utils/feature_utils_288.py with FeaturesUtils:
- T5-Gemma text encoding via transformers
- VideoPrism video encoding via JAX videoprism package
- Synchformer visual encoder loading from checkpoint
Also fixes extract_features.py to add plugin root to sys.path so
data_utils is importable in the subprocess venv.
Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>