Files
Ethanfel 5cff883914 Add Qwen3.5/3.6 abliterated (safetensors) + split model/quant selectors
No GGUF needed: huihui ships Qwen3.5-9B, Qwen3.6-27B, Qwen3.6-35B-A3B as multimodal
SAFETENSORS (abliterated), loadable via transformers AutoModelForMultimodalLM. Added
them to the model dropdown. _resolve_vl_classes now tries AutoModelForMultimodalLM
(3.5/3.6) and AutoModelForImageTextToText (Qwen3-VL) in name-based order, with
load fallback across candidates. model_select is now the model NAME only; precision
is the separate quant dropdown applied to it (repo_by_precision routes e.g. the local
fp8 dir). Aliases 3.5-9b/3.6-27b/3.6-35b. VRAM-by-quant table in README. Needs a
recent transformers for 3.5/3.6.

Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>
2026-06-27 09:50:13 +02:00

13 lines
491 B
Plaintext

# Qwen3-VL needs transformers >= 4.57. The newer natively-multimodal Qwen3.5/3.6
# abliterated models need a recent transformers exposing AutoModelForMultimodalLM
# (upgrade transformers if a Qwen3.5/3.6 model fails to load).
transformers>=4.57.0
huggingface_hub # auto-download of models by repo id / alias
torch
pillow
numpy
# for precision=nf4 (4-bit) — needed to run the 30B-A3B abliterated judge on 32 GB:
bitsandbytes
# optional, for faster attention on the RTX 5090:
# flash-attn