ComfyUI-Prompt-Calibrator/workflow/workflow_describe_api.json
Ethanfel 53f1f9b9b4 Switch compare to discrete verdicts + granular pose axes + per-axis definitions
The 4B model's 0-1 scores were unreliable (an identical ref/gen pair scored
~0.6), so the judge now returns a match/partial/mismatch verdict per axis;
overall_score and a new mismatch_count are computed from those verdicts on our
side (reliable, monotonic).
Expanded the action/pose cluster into position_name, body_orientation,
limb_arrangement, penetration, contact_points, genital_visibility (+ breast_size)
so explicit poses carry detail. Each axis now ships a one-line definition in the
prompt so gender_mix/subject_count stop absorbing positional text. 24 axes total.
Example workflows use the node default (axes=''). Docs realigned; stop condition
is now mismatch_count==0.

Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>
2026-06-26 23:15:51 +02:00
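
The commit message above says that overall_score and mismatch_count are derived from the per-axis verdicts rather than taken from the model. A minimal sketch of that idea follows. The weights (match=1.0, partial=0.5, mismatch=0.0), the plain mean, the rule that only "mismatch" verdicts are counted, and the function name `summarize_verdicts` are all assumptions made for illustration; the node's actual formula is not shown in this file.

```python
from typing import Dict

# Assumed weights: the commit does not state the real mapping.
VERDICT_WEIGHTS = {"match": 1.0, "partial": 0.5, "mismatch": 0.0}


def summarize_verdicts(verdicts: Dict[str, str]) -> Dict[str, float]:
    """Turn per-axis verdicts into overall_score and mismatch_count.

    Hypothetical helper: overall_score is the mean of the assumed weights,
    mismatch_count is the number of axes judged "mismatch".
    """
    if not verdicts:
        raise ValueError("at least one axis verdict is required")

    unknown = {v for v in verdicts.values() if v not in VERDICT_WEIGHTS}
    if unknown:
        raise ValueError(f"unknown verdicts: {sorted(unknown)}")

    total = sum(VERDICT_WEIGHTS[v] for v in verdicts.values())
    mismatches = sum(1 for v in verdicts.values() if v == "mismatch")
    return {
        "overall_score": total / len(verdicts),
        "mismatch_count": mismatches,
    }


example = summarize_verdicts({
    "position_name": "match",
    "body_orientation": "partial",
    "limb_arrangement": "mismatch",
})
# A calibration loop would stop once mismatch_count reaches 0.
done = example["mismatch_count"] == 0
```

Because the score is computed from a fixed table, it cannot decrease when a verdict improves from mismatch to partial or from partial to match, which is one way to read the "monotonic" claim in the commit message.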


{
  "11": {
    "class_type": "LoadImage",
    "inputs": { "image": "reference.png" },
    "_meta": { "title": "Reference Image (put in ComfyUI/input/)" }
  },
  "12": {
    "class_type": "QwenVLImageJudge",
    "inputs": {
      "reference_image": ["11", 0],
      "mode": "describe",
      "model_path": "/media/p5/qwen3vl_4b_abliterated_comfy_convert/hf_bf16",
      "precision": "bf16",
      "axes": "",
      "max_new_tokens": 1024,
      "temperature": 0.0,
      "swap_eval": false,
      "keep_loaded": true,
      "auto_download": true,
      "report_dir": "/media/p5/Comfyui/output/calibrator",
      "run_tag": "seed",
      "prompt_used": ""
    },
    "_meta": { "title": "Qwen3-VL Describe (first pass)" }
  }
}
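
In this API-format workflow, an input such as `["11", 0]` is a link meaning "output 0 of node 11". The sketch below checks that every such link points at a node that exists, then builds the JSON body that ComfyUI's `/prompt` endpoint accepts. The helper names `dangling_links` and `build_prompt_body` and the `client_id` value are hypothetical, and the link detection (a two-item list of string id and integer index) is a heuristic assumed for this example.

```python
import json
from typing import Any, Dict, List, Tuple

Workflow = Dict[str, Dict[str, Any]]


def dangling_links(workflow: Workflow) -> List[Tuple[str, str, str]]:
    """Return (node_id, input_name, target_id) for links to missing nodes."""
    problems = []
    for node_id, node in workflow.items():
        for name, value in node.get("inputs", {}).items():
            # Heuristic: a link looks like [source_node_id, output_index].
            is_link = (
                isinstance(value, list)
                and len(value) == 2
                and isinstance(value[0], str)
                and isinstance(value[1], int)
            )
            if is_link and value[0] not in workflow:
                problems.append((node_id, name, value[0]))
    return problems


def build_prompt_body(workflow: Workflow,
                      client_id: str = "calibrator-example") -> bytes:
    """Validate links, then serialize the body for a POST to /prompt."""
    problems = dangling_links(workflow)
    if problems:
        raise ValueError(f"workflow has dangling links: {problems}")
    return json.dumps({"prompt": workflow, "client_id": client_id}).encode("utf-8")


# Trimmed copy of the workflow above, kept small for the example.
workflow = {
    "11": {"class_type": "LoadImage", "inputs": {"image": "reference.png"}},
    "12": {
        "class_type": "QwenVLImageJudge",
        "inputs": {"reference_image": ["11", 0], "mode": "describe", "axes": ""},
    },
}
body = build_prompt_body(workflow)
```

The resulting bytes can be sent with any HTTP client as a POST with `Content-Type: application/json` to the server's `/prompt` route (for a default local install, typically `http://127.0.0.1:8188/prompt`). The `model_path` and `report_dir` values in the file are machine-specific and would need adjusting before the workflow runs elsewhere.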