ComfyUI-Prompt-Calibrator

Author	SHA1	Message	Date
Ethanfel	8b567cb531	chat mode: json_output toggle to return clean extracted JSON For JSON-producing system prompts (e.g. LTX prompt-relay), json_output=true pulls the JSON object out of the reply (strips reasoning/prose/code-fences via _parse_json, which handles nested schemas and reasoning-then-JSON) and returns it re-serialized; falls back to raw text if none parses. agent_bridge gains --json-output. Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-07-02 02:09:36 +02:00
Ethanfel	d389d6daff	Trim dead inputs: drop fp16 precision and prompt_used fp16 offers nothing over bf16 for these models (removed from the quant dropdown; loader still tolerant if passed). prompt_used was metadata-only — removed from the node inputs, report payload/markdown, the bridge, and the example workflows. Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-06-27 10:03:06 +02:00
Ethanfel	271aa8ae42	Add chat mode: use the node as a general VLM, not just a judge New mode='chat' with system_prompt + user_prompt inputs runs your own prompt over the image(s) and returns raw text in 'analysis' — reusing the same model dropdown, quant, auto-download and backend. Makes it a one-node abliterated VLM for captioning, tagging, Q&A, prompt-from-image, etc. agent_bridge gains --mode chat / --system-prompt / --user-prompt (no receptor needed). Writes a chat report (latest.json) for the agent. Docs updated. Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-06-27 09:55:36 +02:00
Ethanfel	34adb946a4	Add model dropdown (presets w/ VRAM) + manual override; list HauhauCS GGUF models New model_select dropdown with suggested VRAM in each label: huihui Qwen3-VL 4B(local)/8B/30B-A3B (transformers, auto-download) and HauhauCS Qwen3.5-9B / Qwen3.6-35B-A3B Uncensored Aggressive (GGUF). model_path is now the manual override (empty = use dropdown). agent_bridge gains --model-select/--model-path. The HauhauCS models are GGUF-only (no safetensors) so the transformers backend can't load them yet — selecting one returns a clear 'GGUF backend pending' message until the backend is added. Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-06-27 09:24:28 +02:00
Ethanfel	887dfc0bbb	Add analysis profiles with distance/proximity-aware axes A discrete verdict collapses magnitude and a generic axis can hide what you're calibrating (a blowjob where the head is 20cm away still reads sexual_act=oral -> MATCH). New 'profile' input selects an act-specialized axis set (general / oral / penetration / handjob / solo) whose act-critical axes capture distance explicitly (mouth_genital_distance: touching/<5cm/10-20cm/>20cm, oral_depth, insertion_depth, stroke_position, ...). axes now overrides the profile when set. agent_bridge gains --profile; workflows + docs updated. Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-06-27 00:48:46 +02:00
Ethanfel	69c1d6deb4	describe emits one canonical reference; compare can anchor on it Describe mode now produces a single coherent, internally-consistent canonical scene description (paragraph + per-axis spec, written to canonical_reference in the report). Compare gains an optional reference_description input: when set, it anchors on that fixed text and shows only the generated image (no swap) — so the reference side never drifts or self-contradicts across iterations; only the generated image is re-described each turn. agent_bridge gains --ref-desc / --ref-desc-file (reads the describe report's canonical_reference). Docs + example workflow updated. Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-06-26 23:22:57 +02:00
Ethanfel	c7ef756a71	Add describe (first-pass) mode to the judge node New mode on QwenVLImageJudge: 'describe' looks at the reference alone and returns a prompt-ready caption + per-axis target spec to seed the very first prompt (the generator has nothing to reproduce yet). 'compare' is the existing ref-vs-gen scoring. generated_image is now optional (required only for compare); shared generation refactored into _generate_from_messages; third output renamed diff_analysis -> analysis (mode-agnostic). agent_bridge gains --mode (describe needs no receptor/prompt); added workflow_describe_api.json. Docs updated with the first-pass bootstrap step. Fixed error-return arity to 5-tuple. Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-06-26 23:04:09 +02:00
Ethanfel	959ec70065	Redesign judge output for calibration: per-axis {score, ref, gen}, drop local fix suggestions The local VLM now only observes and scores; correction is left to the stronger external agent. Each axis reports the target value (ref), the current value (gen) and the closeness (score) — the target/current/distance an agent needs to calibrate. Expanded to ~20 granular axes (identity/body/wardrobe/action/affect/ camera/render) so the action cluster stays discriminative for explicit content. swap_eval now inverts ref/gen of the swapped pass; diff summary sorts worst-first; default max_new_tokens 1024. Docs aligned. Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-06-26 22:52:40 +02:00
Ethanfel	95198a15b5	Initial commit: VLM-as-judge prompt calibration loop Qwen3-VL image-similarity judge node, external-prompt receptor node, agent_bridge CLI, example SDXL workflow, and methodology/agent-loop/ calibration-policy docs. Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-06-26 22:15:56 +02:00

9 Commits