Add atlas refine cue seed workflow

2026-07-01 14:10:23 +02:00
parent 83dfecc55b
commit 5f602db06b
34 changed files with 12162 additions and 18 deletions
@@ -0,0 +1,183 @@
+# Blowjob Top-View 1024 Reference Pool Cue Expansion
+
+Date: 2026-07-01
+
+Canonical atlas folder:
+
+```text
+/media/unraid/davinci/Qwen_edit_lora/POV/dataset_v2/blowjob_top_view
+```
+
+Supplemental raw cue-expansion folder:
+
+```text
+/media/unraid/davinci/Qwen_edit_lora/POV/dataset_v2/1.original/blowjob_top_view_1024
+```
+
+Folder state:
+
+- canonical folder: 17 PNG images
+- supplemental raw folder: 27 PNG images
+- all images are `1024x1024`
+- raw-pool contact sheet created for review at `/tmp/blowjob_top_view_1024_contact.jpg`
+
+Reference-pool report command:
+
+```bash
+python tools/krea2_atlas_refine_manifest.py --print-reference-pool-report --variant-key pov_blowjob_top_down_vertical_shaft --reference-pool-folder 1.original/blowjob_top_view_1024
+```
+
+Cue-review sheet command:
+
+```bash
+python tools/krea2_atlas_refine_manifest.py --print-reference-cue-review-sheet --variant-key pov_blowjob_top_down_vertical_shaft --reference-pool-folder 1.original/blowjob_top_view_1024
+```
+
+The cue-review sheet currently creates 27 blank review items: 17 canonical
+curated references, all matched to raw counterparts by image id, plus 10
+raw-only supplemental extras. Canonical rows include a
+`reference_images_template`; raw-only rows leave that template empty until a
+human review decides whether the extra frame is only cue-mining evidence or
+deserves promotion into the curated catalog.
+
+Filled cue-review sheet to sidecar-candidate draft command:
+
+```bash
+python tools/krea2_atlas_refine_manifest.py --print-reference-cue-candidate-draft --reference-cue-review-sheet-json /tmp/sxcp-reference-cue-review-filled.json
+```
+
+The candidate draft emits copyable `prompt_variant` objects only for reviewed
+canonical rows with positive cues and a filled prompt-variant id. It skips
+raw-only extras, noisy option/meta/negative cue wording, blank rows, and
+duplicate ids so the raw pool cannot silently become promoted generator truth.
+
+Candidate draft to same-stem sidecar authoring draft:
+
+```bash
+python tools/krea2_atlas_refine_manifest.py --folder /media/unraid/comfyui/output/CodexMCP-Atlas-Refine --subject-id atlas_refine_same_woman_001 --print-reference-cue-sidecar-author-draft --reference-cue-candidate-draft-json /tmp/sxcp-reference-cue-candidate-draft.json --variant-key pov_blowjob_top_down_vertical_shaft
+```
+
+Validate and apply only after confirming the target baseline deck is the one to
+test:
+
+```bash
+python tools/krea2_atlas_refine_manifest.py --validate-reference-cue-sidecar-author-draft --reference-cue-sidecar-author-draft-json /tmp/sxcp-reference-cue-sidecar-author-draft.json
+python tools/krea2_atlas_refine_manifest.py --apply-reference-cue-sidecar-author-draft --reference-cue-sidecar-author-draft-json /tmp/sxcp-reference-cue-sidecar-author-draft.json --folder /media/unraid/comfyui/output/CodexMCP-Atlas-Refine
+```
+
+Apply writes unscored sidecar prompt variants and checks the source prompt hash.
+The next required action is fixed-seed MCP rendering and scoring, not catalog
+promotion.
+
+## Why This Pool Matters
+
+The canonical folder is the preferred source for curated atlas references. The
+supplemental raw folder shows the same pose family with more images, so it can
+drive cue expansion when repeated visual axes are visible across references. It
+should not be treated as proof that our current generated prompt can preserve
+the same subject, coworking workspace, clothing ownership, or anatomy behavior.
+
+Use this pool upstream of sidecar authoring:
+
+1. Pick the nearest atlas cluster for the target variant, preferring the
+   canonical `blowjob_top_view/` path when the frame exists there.
+2. Extract positive cue axes from repeated visual evidence.
+3. Write small `append_cues` against the current generated baseline and store
+   the nearest atlas targets in `reference_images`.
+4. Test with fixed sampler seeds through the MCP batch path.
+5. Score generated results against the nearest atlas cluster and the current
+   same-subject/workspace baseline before promotion.
+6. Keep raw-only extras in the cue-review sheet until repeated generated
+   evidence proves the cue belongs in a sidecar or catalog entry.
+7. Convert only reviewed canonical rows to candidate sidecar snippets; then
+   render and score them before seed selection.
+8. Use the sidecar authoring draft to attach reviewed candidates to a baseline
+   deck with prompt-hash drift protection.
+
+## Observed Cue Axes
+
+- `camera_pitch`: straight-down vertical, high oblique top-down, and tilted
+  top-down variants all appear in the pool.
+- `support_plane`: white lounge/chair surfaces, pale floor, carpet/rug, bed or
+  blanket surfaces, wood floor, tile floor, and outdoor ground appear as visible
+  plane anchors.
+- `viewer_foreground_amount`: some refs use only a lower torso edge, while
+  others show more thighs, feet, waistband, or abdomen mass.
+- `partner_upper_body_stack`: the strongest top-view refs place face, eyes or
+  eyelids, hair crown, shoulders, upper chest or neckline, and hand contact as
+  the main partner stack.
+- `hand_placement`: one hand at the base, both hands supporting the base, hands
+  on floor/support, and hands on the viewer/body edge appear as alternate frame
+  cues.
+- `eye_direction`: direct eye contact, eyelids lowered, and open-mouth/gaze-up
+  variants appear as expression axes.
+- `clothing_anchor`: fitted tops, straps, shirts, or visible necklines can help
+  anchor upper-body geometry when the crop supports them.
+- `floor_furniture_evidence`: cropped support edges, floor seams, rugs, chair or
+  furniture edges, and outdoor objects can carry scene identity without forcing
+  deep room perspective.
+
+## Current Top-View Rule From This Pool
+
+For coworking prompt tests, translate workspace context into floor/support-plane
+evidence and keep the shaft/contact line as the first pose anchor. The
+user-highlighted atlas-22-style direction is:
+
+```text
+Straight-down male POV oral close-up. The centered shaft and mouth contact form
+the main vertical axis from the lower foreground to the woman's face. The
+woman's face, eyelids, hair crown, shoulders, upper chest, neckline, and one
+hand stack around the shaft-contact axis. Viewer thighs and feet frame the
+lower side edges. Tucked knees remain small side shapes on the floor. The
+background reads as a flat pale floor and one cropped white lounge chair
+surface, with shallow top-down room depth.
+```
+
+Manual same-seed calibration on 2026-07-01 confirms the anchor order. The
+sidecars for `/media/unraid/comfyui/output/sxcp_accumulator/bwave_2/img_00135_.png`
+through `/media/unraid/comfyui/output/sxcp_accumulator/bwave_2/img_00139_.png`
+performed best when the prompt started with straight-down POV, then the shaft,
+then the partner stack directly below the shaft, then mouth contact, and ended
+with a sparse flat-floor background. Treat abdomen and room-depth cues as
+secondary evidence; they should follow the shaft-contact axis instead of
+leading the composition.
+
+Because this finding depends on word order and removes the deep coworking-room
+tail, test it as exact replacement text instead of appended cues. The validated
+dry-run batch is:
+
+```text
+/tmp/sxcp_top_view_oral_shaft_anchor_exact_batch.json
+```
+
+It keeps the original baseline probe plus two exact-text candidates:
+
+- `atlas22_shaft_contact_upper_stack_floor_plane`
+- `atlas27_shaft_axis_between_feet_floor_anchors`
+
+For clothed variants, keep clothing subject-owned:
+
+```text
+The woman wears a fitted white ribbed tank top; the tank-top neckline and
+shoulders remain visible from above.
+```
+
+## Promotion Guard
+
+Do not copy all 27 supplemental references into the live catalog variant. Keep
+the catalog reference list curated, then use the raw pool to choose cue axes and
+nearest visual targets. A generated candidate still needs fixed-seed image
+evidence and visual scores for pose ownership, workspace continuity, clothing
+visibility, subject identity, expression/eye control, anatomy/proportion, and
+prompt noise before it can become seedable. Sidecar cue variants should carry
+nearest visual targets like:
+
+```json
+{
+  "id": "shaft_contact_upper_stack_floor_support",
+  "text": "A same-subject straight-down male POV oral close-up. The centered shaft and mouth contact form the main vertical axis from the lower foreground to the woman's face. The woman's face, eyelids, hair crown, shoulders, upper chest, neckline, and one hand stack around the shaft-contact axis. The background reads as a flat pale floor plane with shallow overhead room depth.",
+  "reference_images": [
+    "blowjob_top_view/22_blowjob_top_view.png"
+  ]
+}
+```