MLX
Safetensors
English
llama
Llama-3
instruct
finetune
chatml
gpt4
synthetic data
distillation
function calling
json mode
axolotl
roleplaying
chat
Instructions to use mlx-community/Hermes-3-Llama-3.1-8B-bf16 with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- MLX
How to use mlx-community/Hermes-3-Llama-3.1-8B-bf16 with MLX:
# Download the model from the Hub pip install huggingface_hub[hf_xet] huggingface-cli download --local-dir Hermes-3-Llama-3.1-8B-bf16 mlx-community/Hermes-3-Llama-3.1-8B-bf16
- Notebooks
- Google Colab
- Kaggle
- Local Apps Settings
- LM Studio
mlx-community/Hermes-3-Llama-3.1-8B-bf16
The Model mlx-community/Hermes-3-Llama-3.1-8B-bf16 was converted to MLX format from NousResearch/Hermes-3-Llama-3.1-8B using mlx-lm version 0.16.1.
Use with mlx
pip install mlx-lm
from mlx_lm import load, generate
model, tokenizer = load("mlx-community/Hermes-3-Llama-3.1-8B-bf16")
response = generate(model, tokenizer, prompt="hello", verbose=True)
- Downloads last month
- 131
Model size
8B params
Tensor type
BF16
·
Hardware compatibility
Log In to add your hardware
Quantized
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support
Model tree for mlx-community/Hermes-3-Llama-3.1-8B-bf16
Base model
meta-llama/Llama-3.1-8B