Instructions to use LibraxisAI/Huihui-Qwen3.6-35B-A3B-Claude-4.7-Opus-abliterated-vmlx-mxfp8 with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- MLX
How to use LibraxisAI/Huihui-Qwen3.6-35B-A3B-Claude-4.7-Opus-abliterated-vmlx-mxfp8 with MLX:
# Make sure mlx-vlm is installed # pip install --upgrade mlx-vlm from mlx_vlm import load, generate from mlx_vlm.prompt_utils import apply_chat_template from mlx_vlm.utils import load_config # Load the model model, processor = load("LibraxisAI/Huihui-Qwen3.6-35B-A3B-Claude-4.7-Opus-abliterated-vmlx-mxfp8") config = load_config("LibraxisAI/Huihui-Qwen3.6-35B-A3B-Claude-4.7-Opus-abliterated-vmlx-mxfp8") # Prepare input image = ["http://images.cocodataset.org/val2017/000000039769.jpg"] prompt = "Describe this image." # Apply chat template formatted_prompt = apply_chat_template( processor, config, prompt, num_images=1 ) # Generate output output = generate(model, processor, formatted_prompt, image) print(output) - Notebooks
- Google Colab
- Kaggle
- Local Apps Settings
- LM Studio
- Unsloth Studio
How to use LibraxisAI/Huihui-Qwen3.6-35B-A3B-Claude-4.7-Opus-abliterated-vmlx-mxfp8 with Unsloth Studio:
Install Unsloth Studio (macOS, Linux, WSL)
curl -fsSL https://unsloth.ai/install.sh | sh # Run unsloth studio unsloth studio -H 0.0.0.0 -p 8888 # Then open http://localhost:8888 in your browser # Search for LibraxisAI/Huihui-Qwen3.6-35B-A3B-Claude-4.7-Opus-abliterated-vmlx-mxfp8 to start chatting
Install Unsloth Studio (Windows)
irm https://unsloth.ai/install.ps1 | iex # Run unsloth studio unsloth studio -H 0.0.0.0 -p 8888 # Then open http://localhost:8888 in your browser # Search for LibraxisAI/Huihui-Qwen3.6-35B-A3B-Claude-4.7-Opus-abliterated-vmlx-mxfp8 to start chatting
Using HuggingFace Spaces for Unsloth
# No setup required # Open https://huggingface.co/spaces/unsloth/studio in your browser # Search for LibraxisAI/Huihui-Qwen3.6-35B-A3B-Claude-4.7-Opus-abliterated-vmlx-mxfp8 to start chatting
Load model with FastModel
pip install unsloth from unsloth import FastModel model, tokenizer = FastModel.from_pretrained( model_name="LibraxisAI/Huihui-Qwen3.6-35B-A3B-Claude-4.7-Opus-abliterated-vmlx-mxfp8", max_seq_length=2048, ) - Pi
How to use LibraxisAI/Huihui-Qwen3.6-35B-A3B-Claude-4.7-Opus-abliterated-vmlx-mxfp8 with Pi:
Start the MLX server
# Install MLX LM: uv tool install mlx-lm # Start a local OpenAI-compatible server: mlx_lm.server --model "LibraxisAI/Huihui-Qwen3.6-35B-A3B-Claude-4.7-Opus-abliterated-vmlx-mxfp8"
Configure the model in Pi
# Install Pi: npm install -g @mariozechner/pi-coding-agent # Add to ~/.pi/agent/models.json: { "providers": { "mlx-lm": { "baseUrl": "http://localhost:8080/v1", "api": "openai-completions", "apiKey": "none", "models": [ { "id": "LibraxisAI/Huihui-Qwen3.6-35B-A3B-Claude-4.7-Opus-abliterated-vmlx-mxfp8" } ] } } }Run Pi
# Start Pi in your project directory: pi
- Hermes Agent new
How to use LibraxisAI/Huihui-Qwen3.6-35B-A3B-Claude-4.7-Opus-abliterated-vmlx-mxfp8 with Hermes Agent:
Start the MLX server
# Install MLX LM: uv tool install mlx-lm # Start a local OpenAI-compatible server: mlx_lm.server --model "LibraxisAI/Huihui-Qwen3.6-35B-A3B-Claude-4.7-Opus-abliterated-vmlx-mxfp8"
Configure Hermes
# Install Hermes: curl -fsSL https://hermes-agent.nousresearch.com/install.sh | bash hermes setup # Point Hermes at the local server: hermes config set model.provider custom hermes config set model.base_url http://127.0.0.1:8080/v1 hermes config set model.default LibraxisAI/Huihui-Qwen3.6-35B-A3B-Claude-4.7-Opus-abliterated-vmlx-mxfp8
Run Hermes
hermes
Huihui-Qwen3.6-35B-A3B-Claude-4.7-Opus-abliterated-vmlx-mxfp8
Huihui-Qwen3.6-35B-A3B-Claude-4.7-Opus-abliterated-vmlx-mxfp8 is an MLX / VMLX vision-language checkpoint derived from huihui-ai/Huihui-Qwen3.6-35B-A3B-Claude-4.7-Opus-abliterated, packaged for local multimodal experimentation on Apple Silicon.
Tested inference path
Inference for this checkpoint has been tested with
LibraxisAI/mlx-batch-server.
This is the recommended tested path for operator-controlled local multimodal mlx-lm inference on Apple Silicon.
| Aspect | Status |
|---|---|
| Tested runtime | LibraxisAI/mlx-batch-server |
| Target hardware | Apple Silicon |
| Inference mode | Local / self-hosted |
| Hugging Face Hosted Inference | Disabled for this repository (inference: false) |
This does not claim compatibility with every possible serving stack. It documents the path that has been exercised for this published checkpoint.
Intended use
- Local image-and-text reasoning on Apple Silicon
- Multimodal prompting experiments
- Screenshot, document, chart, and visual question-answering workflows
- Operator-controlled local inference where hosted inference is not desired
Out of scope
- Safety-critical decisions without domain expert review
- Claims of benchmark superiority not backed by published evaluation data
- Non-MLX / non-VMLX runtime guarantees
- High-stakes visual interpretation without human validation
Training and conversion metadata
| Parameter | Value |
|---|---|
| Repository | LibraxisAI/Huihui-Qwen3.6-35B-A3B-Claude-4.7-Opus-abliterated-vmlx-mxfp8 |
| Base model | huihui-ai/Huihui-Qwen3.6-35B-A3B-Claude-4.7-Opus-abliterated |
| Task | image-text-to-text |
| Library | mlx |
| Format | MLX / VMLX checkpoint |
| Quantization | MXFP8 |
| Target platform | Apple Silicon |
This card reports metadata present in the Hugging Face repository, existing frontmatter, or public config files. Missing benchmark, dataset, or training-run details are left explicit rather than reconstructed.
Usage
Use the library instructions above, or run this checkpoint through the tested local serving path: LibraxisAI/mlx-batch-server
Validation
End-to-end pipeline test 2026-04-22 on M3 Ultra (load → text → vision → unload), served via mlx-batch-server:
| Probe | TTFT | Output chars | Notes |
|---|---|---|---|
| Cold load | — | — | 39 s from cold to ready |
| Text — simple greeting (PL) | 0.75 s | 438 | Clean output, abliterated behaviour |
| Text — canonical (PL, literary) | 0.37 s | 690 | Concise reasoning trace |
| Vision — JPEG (Monument Valley) | 13.14 s | 1149 | Detailed scene description |
3/3 probes passed. has_reasoning=True on all probes — this model emits reasoning traces via <think> markers.
Limitations
- Validate outputs on your own domain data before relying on this checkpoint.
- Memory use and speed depend heavily on Apple Silicon generation, unified-memory size, prompt length, and runtime configuration.
- Validation data above reflects M3 Ultra; expect different timings on other hardware.
License
apache-2.0. Check the upstream/base model license as well when a base model is declared.
𝚅𝚒𝚋𝚎𝚌𝚛𝚊𝚏𝚝𝚎𝚍. with AI Agents by VetCoders (c)2024-2026 LibraxisAI
- Downloads last month
- 2,934
8-bit
Model tree for LibraxisAI/Huihui-Qwen3.6-35B-A3B-Claude-4.7-Opus-abliterated-vmlx-mxfp8
Base model
Qwen/Qwen3.6-35B-A3B