From 93bcb23fa74292307a106e8a8b365a22accf5e78 Mon Sep 17 00:00:00 2001
From: Ethanfel <ethan.fel@ts-pc.fr>
Date: Sat, 18 Apr 2026 21:26:14 +0200
Subject: [PATCH] docs: document embedding cache and fast rescan loop

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
---
 README.md | 4 +++-
 1 file changed, 3 insertions(+), 1 deletion(-)

diff --git a/README.md b/README.md
index 32dbc83..0ba9676 100644
--- a/README.md
+++ b/README.md
@@ -312,6 +312,8 @@ The classifier trains a `HistGradientBoostingClassifier` on audio embeddings and
 
 Select a trained model from the dropdown and click **Scan**. Adjust the threshold slider to control sensitivity. Detected regions appear as colored bands on the timeline and as rows in the results panel.
 
+Audio embeddings are computed once per video and cached to disk (`cache/w2v/`). Subsequent scans with the same embedding model skip the GPU entirely and only re-run the classifier, which takes milliseconds. This makes the retrain → rescan loop nearly free after the first pass.
+
 ### 4. Review and refine
 
 - Toggle **Review** mode for a clean timeline focused on scan results
@@ -326,7 +328,7 @@ Click **Export Scan Results** to batch export all enabled regions. The button sh
 
 ### 6. Retrain with feedback
 
-Train again — hard negatives are automatically included. Each training run saves with a timestamp. Right-click the model dropdown to restore a previous version if results degrade.
+Train again — hard negatives are automatically included. Each training run saves with a timestamp. Click the **⏲** button next to the model dropdown to restore a previous version if results degrade — restoring automatically rescans with the selected version.
 
 ## Database