8-cut

Author	SHA1	Message	Date
Ethanfel	edc5784ba6	feat: hard negative source_model tracking, training toggle Add source_model column to hard_negatives table with migration. New get_hard_negatives() returns full rows, delete_hard_negatives_by_ids() for bulk deletion. get_training_data() gains use_hard_negatives param. TrainDialog has "Use hard negatives" checkbox. Scan panel passes current model name when marking negatives. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-04-19 15:27:11 +02:00
Ethanfel	4fb2ae144f	feat: scan result history — keep N versions per (file, model) Add scan_timestamp column to scan_results. save_scan_results now inserts with a timestamp and prunes versions beyond max_versions (default 5). get_scan_results returns only the latest version by default, with optional scan_timestamp parameter for loading specific versions. New get_scan_versions method returns available versions for a (file, profile, model) tuple. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-04-19 15:18:28 +02:00
Ethanfel	2614a765d5	fix: get_export_folders respects scan_export filter Ghost folders (scan-export-only) no longer appear in training dropdowns. Also filters out 0-clip folders from get_training_stats. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-04-19 15:16:49 +02:00
Ethanfel	8fb8581816	feat: add EAT (Efficient Audio Transformer) embedding model Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-04-19 14:00:09 +02:00
Ethanfel	5b25e85e98	feat: add AST (Audio Spectrogram Transformer) embedding model Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-04-19 13:55:29 +02:00
Ethanfel	e3f133ef84	feat: multi-layer extraction for HuBERT/Wav2Vec2 models Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-04-19 13:53:55 +02:00
Ethanfel	e1789d4e71	fix: bug audit — broken test imports, training data overlap, cleanup - Fix test_utils.py importing build_annotation_json_path from main instead of core.annotations (all 59 tests pass now) - Fix get_training_data double-counting clips at same start_time in both positive and soft sets — subtract positive from soft - Add cancel_flag to train_classifier so training can be interrupted between videos (TrainWorker passes self as cancel_flag) - Remove orphaned core/export.py (was for deleted server API) - Remove stale Dockerfile and docker-compose.yml (referenced server) - Clean up leftover server/__pycache__ and client/ build artifacts - Add torch to requirements.txt (was only mentioned in comments) Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-04-18 12:55:58 +02:00
Ethanfel	12ed183f1b	feat: integrate training UI, BEATs model, and clean up legacy code - Remove legacy distance-mode scanning (build_profile, _similarity, etc.) and hand-crafted intensity features — pipeline is now embedding-only - Integrate Microsoft BEATs as embedding option alongside wav2vec2/HuBERT - Add TrainDialog with positive class selector, model picker, video dir fallback, and live training stats - Add TrainWorker QThread with cancel support and proper lifecycle cleanup - Add source_path column to DB for robust source video tracking - Add get_export_folders/get_training_data/get_training_stats to DB - Wire source_path in all export DB writes (_on_clip_done, _on_auto_clip_done) - Cancel scan/train workers in closeEvent to prevent use-after-free crashes - Add setup_env.sh supporting both conda and python venv (CUDA 12.8) - Update requirements.txt with all actual dependencies - Update 8cut_train.py with --positive flag for new DB-driven training Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-04-18 11:52:27 +02:00
Ethanfel	f2c38aee79	feat: rewrite audio scan with MFCC+delta+spectral contrast pipeline Root cause of poor discrimination: MFCC[0] (energy) dominated the feature vector, making cosine similarity see all audio as similar. Changes: - Skip MFCC[0], use 12 coefficients instead of 20 - Add delta MFCCs for temporal dynamics - Add 7-band spectral contrast for tonal vs noise quality - Switch from cosine similarity to euclidean-distance-based score - Pre-compute STFT once for whole file (10-20x faster) - Vectorized sliding window via cumulative sums (no Python loop) - Lower sample rate 22050→16000 Hz (faster, no quality loss) - 62-dim feature vector (was 40-dim mean+std of raw MFCCs) - Default threshold 0.05 (new similarity scale) Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-04-17 15:28:44 +02:00
Ethanfel	8ab5bdba77	fix: use mean+std MFCC vectors (40-dim) for better discrimination Mean-only vectors were too similar across different audio segments, causing everything to match even at threshold 0.99. Adding std captures temporal dynamics and makes the similarity scores much more spread out. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-04-17 09:27:11 +02:00
Ethanfel	fd42791c9f	feat: add get_all_export_paths to ProcessedDB Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-04-17 08:55:39 +02:00
Ethanfel	9cf9e3233f	feat: add scan_video with average and nearest modes Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-04-17 08:50:47 +02:00
Ethanfel	e17d8f67aa	feat: add audio_scan module with build_profile Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-04-17 08:48:18 +02:00
Ethanfel	279aee14cb	feat: add apply_keyframes_to_jobs helper Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-04-14 15:57:27 +02:00
Ethanfel	8e8c8b9774	feat: add resolve_keyframe helper to extract sorted-keyframe lookup Adds a pure function that returns the latest keyframe at or before a given time (with tolerance), replacing the inline lookup pattern that appears multiple times in main.py. Includes 6 tests covering empty list, before-first, exact match, between, after-last, and tolerance. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-04-14 15:48:00 +02:00
Ethanfel	d4357f0da4	feat: cursor lock with crop keyframing, remove fuzzy filename matching - Lock button (G key) freezes export cursor, timeline scrubs playback only - In lock mode, clicking crop bar sets a keyframe at current playback time - Orange diamonds on timeline show keyframe positions - Export resolves per-clip crop center from nearest preceding keyframe - Crop bar/overlay updates while scrubbing to preview effective crop - Unlocking clears all keyframes - Replace fuzzy filename matching with exact match to prevent marker bleed Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-04-13 16:34:49 +02:00
Ethanfel	f8b148f77d	feat: profile support for independent marker sets Each profile has its own set of timeline markers, so the same video can be cut with different settings (e.g. landscape vs portrait) without markers interfering. Profile selector in the top bar, persisted in QSettings, stored per-row in the DB. - Add `profile` column to ProcessedDB schema (migrates existing rows to 'default') - Scope find_similar, get_markers, _get_markers_for by profile - Add get_profiles() for populating the combo dropdown - Thread profile through _DBWorker, _after_load, _refresh_markers, _on_clip_done, dropEvent, _on_open_files - Editable profile QComboBox in top bar, refreshed after each export - 5 new tests for profile isolation and backward compatibility (54 total) Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-04-13 11:08:50 +02:00
Ethanfel	bcdda9c783	fix: UI audit — dark theme styling, group delete/overwrite, layout cleanup - Style QComboBox/QSpinBox/QDoubleSpinBox/QCheckBox in dark theme - Delete and overwrite now operate on the full clip group, not just one sub-clip - Add get_group/delete_group to ProcessedDB with tests - Restructure control rows: transport+actions / annotation+path / encoding - Add "Open Files" button to queue panel (replaces drag-drop-only) - Playlist right-click to remove items - Compact time display (1:23.4 / 5:00.0), window title shows filename - Short side: QLineEdit → QSpinBox with validation - Tooltips with keyboard shortcuts on all interactive widgets - Fix arrow hint direction, remove stale mask comment Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-04-13 02:56:26 +02:00
Ethanfel	e2b4f9bf8d	remove: mask generation, venv setup, and settings dialog Dead code — masking is handled externally via ComfyUI. Removes SetupWorker, MaskWorker, SettingsDialog, build_mask_output_dir, the mask UI row, Settings button, and associated test cases. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-04-12 15:53:31 +02:00
Ethanfel	f11b3e298e	feat: group clips into subfolders (clip_001/, clip_002/, etc.) Each batch export creates a subfolder named after the group (e.g. clip_001/) containing all sub-clips and their audio files. Keeps the top-level export folder clean. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-12 01:05:10 +02:00
Ethanfel	93cee40b06	feat: export 3 overlapping 8s clips per press with configurable spread Each export generates clip_NNN_0, clip_NNN_1, clip_NNN_2 offset by the spread value (2–8s, default 3s). Preview plays the full span covered by all three clips. Marker click still overwrites a single clip. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-11 23:22:41 +02:00
Ethanfel	8bc42cabd9	fix: store absolute paths in dataset.json Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-07 14:10:06 +02:00
Ethanfel	8148971aae	refactor: remove fps from annotation — path + label only Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-07 14:09:08 +02:00
Ethanfel	c3c480acc7	feat: replace dataset.tsv with dataset.json annotation file Each exported clip writes an entry to <folder>/dataset.json containing its relative path, sound label, and fps. Re-exporting to the same path updates the existing entry (upsert). Empty labels are skipped. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-07 14:01:50 +02:00
Ethanfel	191d1cf299	fix: move SELVA_CATEGORIES to module level, harden append_to_tsv, fix tests Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-06 18:48:46 +02:00
Ethanfel	c76eb9ec84	feat: add label/category annotation fields and TSV export for SELVA Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-06 18:45:29 +02:00
Ethanfel	9652b249ba	feat: extract audio alongside WebP image sequence Adds build_audio_extract_command and runs it in ExportWorker after the frame sequence completes. Audio written to <sequence_dir>.wav (lossless pcm_s16le). Extraction failure (no audio stream) is silently ignored. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-06 17:20:40 +02:00
Ethanfel	1ab18e6ab5	fix: use -c:v and explicit -an in image_sequence ffmpeg command	2026-04-06 15:58:33 +02:00
Ethanfel	93028d9ac7	feat: build_sequence_dir and image_sequence flag for build_ffmpeg_command Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-06 15:56:16 +02:00
Ethanfel	9bc65a2b25	feat: add build_mask_output_dir utility	2026-04-06 15:34:08 +02:00
Ethanfel	1a43c4725d	feat: portrait crop filter in build_ffmpeg_command Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-06 13:51:02 +02:00
Ethanfel	798c5ff4f4	feat: add short_side resize to build_ffmpeg_command	2026-04-06 13:31:58 +02:00
Ethanfel	63d48a968c	feat: DB schema v2 — store start_time and output_path, add get_markers Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-06 13:08:49 +02:00
Ethanfel	558fa23da4	feat: ProcessedDB and _normalize_filename with tests Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-06 12:42:03 +02:00
Ethanfel	8a97c3c7c2	fix: ffmpeg command type hint, -ss comment, FileNotFoundError handler	2026-04-06 12:08:42 +02:00
Ethanfel	f6832f58a6	feat: ExportWorker with ffmpeg command builder Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-06 12:06:32 +02:00
Ethanfel	68f9a01d2b	fix: format_time rollover, move sys.path to conftest Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-06 11:16:52 +02:00
Ethanfel	5819ea2970	feat: add utility functions with tests Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-06 11:14:01 +02:00
Ethanfel	7c46ec3ae7	feat: project skeleton	2026-04-06 11:11:25 +02:00

39 Commits