How to use from the
Use from the
Transformers library
# Use a pipeline as a high-level helper
from transformers import pipeline

pipe = pipeline("image-text-to-text", model="haffner/Maestro1-9B-Heretic")
messages = [
    {
        "role": "user",
        "content": [
            {"type": "image", "url": "https://huggingface.co/datasets/huggingface/documentation-images/resolve/main/p-blog/candy.JPG"},
            {"type": "text", "text": "What animal is on the candy?"}
        ]
    },
]
pipe(text=messages)
# Load model directly
from transformers import AutoProcessor, AutoModelForMultimodalLM

processor = AutoProcessor.from_pretrained("haffner/Maestro1-9B-Heretic")
model = AutoModelForMultimodalLM.from_pretrained("haffner/Maestro1-9B-Heretic")
messages = [
    {
        "role": "user",
        "content": [
            {"type": "image", "url": "https://huggingface.co/datasets/huggingface/documentation-images/resolve/main/p-blog/candy.JPG"},
            {"type": "text", "text": "What animal is on the candy?"}
        ]
    },
]
inputs = processor.apply_chat_template(
	messages,
	add_generation_prompt=True,
	tokenize=True,
	return_dict=True,
	return_tensors="pt",
).to(model.device)

outputs = model.generate(**inputs, max_new_tokens=40)
print(processor.decode(outputs[0][inputs["input_ids"].shape[-1]:]))
Quick Links

About

Heretic'ed safetensors and GGUF quants of https://huggingface.co/vectionlabs/Maestro1-9B

Multimodal Projector:

Refusals: 3/100, KL divergence: 0.0741

  • Parameters:
    • direction_index = 18.77
    • attn.o_proj.max_weight = 1.37
    • attn.o_proj.max_weight_position = 27.86
    • attn.o_proj.min_weight = 1.33
    • attn.o_proj.min_weight_distance = 18.73
    • mlp.down_proj.max_weight = 1.34
    • mlp.down_proj.max_weight_position = 22.02
    • mlp.down_proj.min_weight = 1.24
    • mlp.down_proj.min_weight_distance = 14.79
Downloads last month
2,456
GGUF
Model size
8B params
Architecture
qwen3vl
Hardware compatibility
Log In to add your hardware

1-bit

2-bit

3-bit

4-bit

5-bit

6-bit

8-bit

16-bit

Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for haffner/Maestro1-9B-Heretic

Quantized
(3)
this model