5cff883914
No GGUF needed: huihui ships Qwen3.5-9B, Qwen3.6-27B, Qwen3.6-35B-A3B as multimodal SAFETENSORS (abliterated), loadable via transformers AutoModelForMultimodalLM. Added them to the model dropdown. _resolve_vl_classes now tries AutoModelForMultimodalLM (3.5/3.6) and AutoModelForImageTextToText (Qwen3-VL) in name-based order, with load fallback across candidates. model_select is now the model NAME only; precision is the separate quant dropdown applied to it (repo_by_precision routes e.g. the local fp8 dir). Aliases 3.5-9b/3.6-27b/3.6-35b. VRAM-by-quant table in README. Needs a recent transformers for 3.5/3.6. Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>
13 lines
491 B
Plaintext
13 lines
491 B
Plaintext
# Qwen3-VL needs transformers >= 4.57. The newer natively-multimodal Qwen3.5/3.6
|
|
# abliterated models need a recent transformers exposing AutoModelForMultimodalLM
|
|
# (upgrade transformers if a Qwen3.5/3.6 model fails to load).
|
|
transformers>=4.57.0
|
|
huggingface_hub # auto-download of models by repo id / alias
|
|
torch
|
|
pillow
|
|
numpy
|
|
# for precision=nf4 (4-bit) — needed to run the 30B-A3B abliterated judge on 32 GB:
|
|
bitsandbytes
|
|
# optional, for faster attention on the RTX 5090:
|
|
# flash-attn
|