Add describe (first-pass) mode to the judge node

New mode on QwenVLImageJudge: 'describe' looks at the reference alone and returns
a prompt-ready caption + per-axis target spec to seed the very first prompt (the
generator has nothing to reproduce yet). 'compare' is the existing ref-vs-gen
scoring. generated_image is now optional (required only for compare); shared
generation refactored into _generate_from_messages; third output renamed
diff_analysis -> analysis (mode-agnostic). agent_bridge gains --mode (describe
needs no receptor/prompt); added workflow_describe_api.json. Docs updated with the
first-pass bootstrap step. Fixed error-return arity to 5-tuple.

Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>
This commit is contained in:
2026-06-26 23:04:09 +02:00
parent 959ec70065
commit c7ef756a71
6 changed files with 211 additions and 47 deletions
+26
View File
@@ -0,0 +1,26 @@
{
"11": {
"class_type": "LoadImage",
"inputs": { "image": "reference.png" },
"_meta": { "title": "Reference Image (put in ComfyUI/input/)" }
},
"12": {
"class_type": "QwenVLImageJudge",
"inputs": {
"reference_image": ["11", 0],
"mode": "describe",
"model_path": "/media/p5/qwen3vl_4b_abliterated_comfy_convert/hf_bf16",
"precision": "bf16",
"axes": "subject_count, gender_mix, body_type, distinctive_features, age_appearance, ethnicity_skin, hair, clothing_state, sexual_act, position, penetration, explicitness, body_contact, pose, facial_expression, gaze, framing, camera_angle, scene, lighting_color, art_style",
"max_new_tokens": 1024,
"temperature": 0.0,
"swap_eval": false,
"keep_loaded": true,
"auto_download": true,
"report_dir": "/media/p5/Comfyui/output/calibrator",
"run_tag": "seed",
"prompt_used": ""
},
"_meta": { "title": "Qwen3-VL Describe (first pass)" }
}
}