Qwable-v1-mlx-2Bit / README.md
usermma's picture
Upload folder using huggingface_hub
f77a36c verified
|
Raw
History Blame Contribute Delete
1.08 kB
---
license: agpl-3.0
language:
- en
library_name: transformers
tags:
- qwen
- qwen3
- qwen3.6
- moe
- distillation
- chain-of-thought
- agentic
- claude-fable-5
- claude-opus-4.7
- tool-use
- chained-distill
- mlx
- mlx-my-repo
pipeline_tag: text-generation
base_model: lordx64/Qwable-v1
datasets:
- lordx64/agentic-distill-fable-5-sft
---
# usermma/Qwable-v1-mlx-2Bit
The Model [usermma/Qwable-v1-mlx-2Bit](https://huggingface.co/usermma/Qwable-v1-mlx-2Bit) was converted to MLX format from [lordx64/Qwable-v1](https://huggingface.co/lordx64/Qwable-v1) using mlx-lm version **0.31.2**.
## Use with mlx
```bash
pip install mlx-lm
```
```python
from mlx_lm import load, generate
model, tokenizer = load("usermma/Qwable-v1-mlx-2Bit")
prompt="hello"
if hasattr(tokenizer, "apply_chat_template") and tokenizer.chat_template is not None:
messages = [{"role": "user", "content": prompt}]
prompt = tokenizer.apply_chat_template(
messages, tokenize=False, add_generation_prompt=True
)
response = generate(model, tokenizer, prompt=prompt, verbose=True)
```