Instructions to use unsloth/Devstral-Small-2-24B-Instruct-2512-GGUF with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- llama-cpp-python
How to use unsloth/Devstral-Small-2-24B-Instruct-2512-GGUF with llama-cpp-python:
# !pip install llama-cpp-python from llama_cpp import Llama llm = Llama.from_pretrained( repo_id="unsloth/Devstral-Small-2-24B-Instruct-2512-GGUF", filename="Devstral-Small-2-24B-Instruct-2512-BF16.gguf", )
llm.create_chat_completion( messages = "No input example has been defined for this model task." )
- Notebooks
- Google Colab
- Kaggle
- Local Apps Settings
- llama.cpp
How to use unsloth/Devstral-Small-2-24B-Instruct-2512-GGUF with llama.cpp:
Install (macOS, Linux)
curl -LsSf https://llama.app/install.sh | sh # Start a local OpenAI-compatible server with a web UI: llama serve -hf unsloth/Devstral-Small-2-24B-Instruct-2512-GGUF:UD-Q4_K_XL # Run inference directly in the terminal: llama cli -hf unsloth/Devstral-Small-2-24B-Instruct-2512-GGUF:UD-Q4_K_XL
Install from WinGet (Windows)
winget install llama.cpp # Start a local OpenAI-compatible server with a web UI: llama serve -hf unsloth/Devstral-Small-2-24B-Instruct-2512-GGUF:UD-Q4_K_XL # Run inference directly in the terminal: llama cli -hf unsloth/Devstral-Small-2-24B-Instruct-2512-GGUF:UD-Q4_K_XL
Use pre-built binary
# Download pre-built binary from: # https://github.com/ggerganov/llama.cpp/releases # Start a local OpenAI-compatible server with a web UI: ./llama-server -hf unsloth/Devstral-Small-2-24B-Instruct-2512-GGUF:UD-Q4_K_XL # Run inference directly in the terminal: ./llama-cli -hf unsloth/Devstral-Small-2-24B-Instruct-2512-GGUF:UD-Q4_K_XL
Build from source code
git clone https://github.com/ggerganov/llama.cpp.git cd llama.cpp cmake -B build cmake --build build -j --target llama-server llama-cli # Start a local OpenAI-compatible server with a web UI: ./build/bin/llama-server -hf unsloth/Devstral-Small-2-24B-Instruct-2512-GGUF:UD-Q4_K_XL # Run inference directly in the terminal: ./build/bin/llama-cli -hf unsloth/Devstral-Small-2-24B-Instruct-2512-GGUF:UD-Q4_K_XL
Use Docker
docker model run hf.co/unsloth/Devstral-Small-2-24B-Instruct-2512-GGUF:UD-Q4_K_XL
- LM Studio
- Jan
- Ollama
How to use unsloth/Devstral-Small-2-24B-Instruct-2512-GGUF with Ollama:
ollama run hf.co/unsloth/Devstral-Small-2-24B-Instruct-2512-GGUF:UD-Q4_K_XL
- Unsloth Studio
How to use unsloth/Devstral-Small-2-24B-Instruct-2512-GGUF with Unsloth Studio:
Install Unsloth Studio (macOS, Linux, WSL)
curl -fsSL https://unsloth.ai/install.sh | sh # Run unsloth studio unsloth studio -H 0.0.0.0 -p 8888 # Then open http://localhost:8888 in your browser # Search for unsloth/Devstral-Small-2-24B-Instruct-2512-GGUF to start chatting
Install Unsloth Studio (Windows)
irm https://unsloth.ai/install.ps1 | iex # Run unsloth studio unsloth studio -H 0.0.0.0 -p 8888 # Then open http://localhost:8888 in your browser # Search for unsloth/Devstral-Small-2-24B-Instruct-2512-GGUF to start chatting
Using HuggingFace Spaces for Unsloth
# No setup required # Open https://huggingface.co/spaces/unsloth/studio in your browser # Search for unsloth/Devstral-Small-2-24B-Instruct-2512-GGUF to start chatting
- Pi
How to use unsloth/Devstral-Small-2-24B-Instruct-2512-GGUF with Pi:
Start the llama.cpp server
# Install llama.cpp: brew install llama.cpp # Start a local OpenAI-compatible server: llama serve -hf unsloth/Devstral-Small-2-24B-Instruct-2512-GGUF:UD-Q4_K_XL
Configure the model in Pi
# Install Pi: npm install -g @mariozechner/pi-coding-agent # Add to ~/.pi/agent/models.json: { "providers": { "llama-cpp": { "baseUrl": "http://localhost:8080/v1", "api": "openai-completions", "apiKey": "none", "models": [ { "id": "unsloth/Devstral-Small-2-24B-Instruct-2512-GGUF:UD-Q4_K_XL" } ] } } }Run Pi
# Start Pi in your project directory: pi
- Hermes Agent new
How to use unsloth/Devstral-Small-2-24B-Instruct-2512-GGUF with Hermes Agent:
Start the llama.cpp server
# Install llama.cpp: brew install llama.cpp # Start a local OpenAI-compatible server: llama serve -hf unsloth/Devstral-Small-2-24B-Instruct-2512-GGUF:UD-Q4_K_XL
Configure Hermes
# Install Hermes: curl -fsSL https://hermes-agent.nousresearch.com/install.sh | bash hermes setup # Point Hermes at the local server: hermes config set model.provider custom hermes config set model.base_url http://127.0.0.1:8080/v1 hermes config set model.default unsloth/Devstral-Small-2-24B-Instruct-2512-GGUF:UD-Q4_K_XL
Run Hermes
hermes
- Atomic Chat new
- Docker Model Runner
How to use unsloth/Devstral-Small-2-24B-Instruct-2512-GGUF with Docker Model Runner:
docker model run hf.co/unsloth/Devstral-Small-2-24B-Instruct-2512-GGUF:UD-Q4_K_XL
- Lemonade
How to use unsloth/Devstral-Small-2-24B-Instruct-2512-GGUF with Lemonade:
Pull the model
# Download Lemonade from https://lemonade-server.ai/ lemonade pull unsloth/Devstral-Small-2-24B-Instruct-2512-GGUF:UD-Q4_K_XL
Run and chat with the model
lemonade run user.Devstral-Small-2-24B-Instruct-2512-GGUF-UD-Q4_K_XL
List all available models
lemonade list
Add files using upload-large-folder tool
Browse files- .gitattributes +1 -0
- Devstral-Small-2-24B-Instruct-2512-BF16.gguf +2 -2
- Devstral-Small-2-24B-Instruct-2512-IQ4_NL.gguf +2 -2
- Devstral-Small-2-24B-Instruct-2512-IQ4_XS.gguf +2 -2
- Devstral-Small-2-24B-Instruct-2512-Q2_K.gguf +2 -2
- Devstral-Small-2-24B-Instruct-2512-Q2_K_L.gguf +2 -2
- Devstral-Small-2-24B-Instruct-2512-Q3_K_M.gguf +2 -2
- Devstral-Small-2-24B-Instruct-2512-Q3_K_S.gguf +2 -2
- Devstral-Small-2-24B-Instruct-2512-Q4_0.gguf +2 -2
- Devstral-Small-2-24B-Instruct-2512-Q4_1.gguf +2 -2
- Devstral-Small-2-24B-Instruct-2512-Q4_K_M.gguf +2 -2
- Devstral-Small-2-24B-Instruct-2512-Q4_K_S.gguf +2 -2
- Devstral-Small-2-24B-Instruct-2512-Q5_K_M.gguf +2 -2
- Devstral-Small-2-24B-Instruct-2512-Q5_K_S.gguf +2 -2
- Devstral-Small-2-24B-Instruct-2512-Q6_K.gguf +2 -2
- Devstral-Small-2-24B-Instruct-2512-Q8_0.gguf +2 -2
- Devstral-Small-2-24B-Instruct-2512-UD-IQ1_M.gguf +2 -2
- Devstral-Small-2-24B-Instruct-2512-UD-IQ1_S.gguf +2 -2
- Devstral-Small-2-24B-Instruct-2512-UD-IQ2_M.gguf +2 -2
- Devstral-Small-2-24B-Instruct-2512-UD-IQ2_XXS.gguf +2 -2
- Devstral-Small-2-24B-Instruct-2512-UD-IQ3_XXS.gguf +2 -2
- Devstral-Small-2-24B-Instruct-2512-UD-Q2_K_XL.gguf +2 -2
- Devstral-Small-2-24B-Instruct-2512-UD-Q3_K_XL.gguf +2 -2
- Devstral-Small-2-24B-Instruct-2512-UD-Q4_K_XL.gguf +2 -2
- Devstral-Small-2-24B-Instruct-2512-UD-Q5_K_XL.gguf +2 -2
- Devstral-Small-2-24B-Instruct-2512-UD-Q6_K_XL.gguf +2 -2
- Devstral-Small-2-24B-Instruct-2512-UD-Q8_K_XL.gguf +2 -2
- config.json +74 -0
- imatrix_unsloth.gguf +3 -0
- mmproj-BF16.gguf +2 -2
- mmproj-F16.gguf +2 -2
- mmproj-F32.gguf +2 -2
.gitattributes
CHANGED
|
@@ -63,3 +63,4 @@ Devstral-Small-2-24B-Instruct-2512-Q8_0.gguf filter=lfs diff=lfs merge=lfs -text
|
|
| 63 |
Devstral-Small-2-24B-Instruct-2512-UD-Q8_K_XL.gguf filter=lfs diff=lfs merge=lfs -text
|
| 64 |
imatrix_unsloth.gguf_file filter=lfs diff=lfs merge=lfs -text
|
| 65 |
Devstral-Small-2-24B-Instruct-2512-BF16.gguf filter=lfs diff=lfs merge=lfs -text
|
|
|
|
|
|
| 63 |
Devstral-Small-2-24B-Instruct-2512-UD-Q8_K_XL.gguf filter=lfs diff=lfs merge=lfs -text
|
| 64 |
imatrix_unsloth.gguf_file filter=lfs diff=lfs merge=lfs -text
|
| 65 |
Devstral-Small-2-24B-Instruct-2512-BF16.gguf filter=lfs diff=lfs merge=lfs -text
|
| 66 |
+
imatrix_unsloth.gguf filter=lfs diff=lfs merge=lfs -text
|
Devstral-Small-2-24B-Instruct-2512-BF16.gguf
CHANGED
|
@@ -1,3 +1,3 @@
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:
|
| 3 |
-
size
|
|
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:6a86365cc26ec2e5ba1434aa85da15a7de28eb92015447623a780f1f86ab1d1b
|
| 3 |
+
size 47154056032
|
Devstral-Small-2-24B-Instruct-2512-IQ4_NL.gguf
CHANGED
|
@@ -1,3 +1,3 @@
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:
|
| 3 |
-
size
|
|
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:455cc22c4d85e93acbb1df2760e4df5626dc86ca37b0bee39e479e560f09508f
|
| 3 |
+
size 13468552352
|
Devstral-Small-2-24B-Instruct-2512-IQ4_XS.gguf
CHANGED
|
@@ -1,3 +1,3 @@
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:
|
| 3 |
-
size
|
|
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:6b8270a839e7a1263f34a799c18fb9eb0ca6f1d039cdbfa4a11f9ac9552a118a
|
| 3 |
+
size 12780424352
|
Devstral-Small-2-24B-Instruct-2512-Q2_K.gguf
CHANGED
|
@@ -1,3 +1,3 @@
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:
|
| 3 |
-
size
|
|
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:408a61e7dcc0eb7fda7d70979395f974fe17310b4f38f977c95d9b98095fe1aa
|
| 3 |
+
size 8890862752
|
Devstral-Small-2-24B-Instruct-2512-Q2_K_L.gguf
CHANGED
|
@@ -1,3 +1,3 @@
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:
|
| 3 |
-
size
|
|
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:e32066271c0bbd03ce8a7648ebabf8335fbf16cbf16023389e987d7fc70111f7
|
| 3 |
+
size 9048149152
|
Devstral-Small-2-24B-Instruct-2512-Q3_K_M.gguf
CHANGED
|
@@ -1,3 +1,3 @@
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:
|
| 3 |
-
size
|
|
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:53466aa3e486e6f0068c2593d12aeb15fb079c1dfe443ab922c468c39761bd16
|
| 3 |
+
size 11474619552
|
Devstral-Small-2-24B-Instruct-2512-Q3_K_S.gguf
CHANGED
|
@@ -1,3 +1,3 @@
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:
|
| 3 |
-
size
|
|
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:2647a4944f3783e6744ab94a85d9d604ac6ea0f1bcbb78b90afbf823543bb501
|
| 3 |
+
size 10400812192
|
Devstral-Small-2-24B-Instruct-2512-Q4_0.gguf
CHANGED
|
@@ -1,3 +1,3 @@
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:
|
| 3 |
-
size
|
|
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:af9a7e072a101d0ee2befef444de1b78cdb4dce7f4515bcba7751ce78eecded5
|
| 3 |
+
size 13494766752
|
Devstral-Small-2-24B-Instruct-2512-Q4_1.gguf
CHANGED
|
@@ -1,3 +1,3 @@
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:
|
| 3 |
-
size
|
|
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:0d075ca8b3614e9ef9597630c7d7b92816d31787f119ed81d306518d0e72d094
|
| 3 |
+
size 14873644192
|
Devstral-Small-2-24B-Instruct-2512-Q4_K_M.gguf
CHANGED
|
@@ -1,3 +1,3 @@
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:
|
| 3 |
-
size
|
|
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:d14ba9edee1bb4c4996a726deb81e49ae81800a3216f0774634238c380aee496
|
| 3 |
+
size 14334446752
|
Devstral-Small-2-24B-Instruct-2512-Q4_K_S.gguf
CHANGED
|
@@ -1,3 +1,3 @@
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:
|
| 3 |
-
size
|
|
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:3a23a8cdce2b76b5f1338e5090be2068b3a9a1b62b8ad5277975f3e345220901
|
| 3 |
+
size 13549816992
|
Devstral-Small-2-24B-Instruct-2512-Q5_K_M.gguf
CHANGED
|
@@ -1,3 +1,3 @@
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:
|
| 3 |
-
size
|
|
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:2da6ca6c4ae387aa7f3f2f4a67bb3e1ca570ce0c69c21e26b8695e75172443b0
|
| 3 |
+
size 16764521632
|
Devstral-Small-2-24B-Instruct-2512-Q5_K_S.gguf
CHANGED
|
@@ -1,3 +1,3 @@
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:
|
| 3 |
-
size
|
|
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:5acecb4247289f9b3d00307d94531936fe60a0664c8b841cf8f5b454c66ee757
|
| 3 |
+
size 16304950432
|
Devstral-Small-2-24B-Instruct-2512-Q6_K.gguf
CHANGED
|
@@ -1,3 +1,3 @@
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:
|
| 3 |
-
size
|
|
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:b04b3e19730d7a1e19530f40b947b69c028b090cb7c58c1515cf1fc2ece5f821
|
| 3 |
+
size 19346476192
|
Devstral-Small-2-24B-Instruct-2512-Q8_0.gguf
CHANGED
|
@@ -1,3 +1,3 @@
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:
|
| 3 |
-
size
|
|
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:0760502e9228234f6cfa843f8870b8fc91c46a13664cf766c639229cccc80866
|
| 3 |
+
size 25055317152
|
Devstral-Small-2-24B-Instruct-2512-UD-IQ1_M.gguf
CHANGED
|
@@ -1,3 +1,3 @@
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:
|
| 3 |
-
size
|
|
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:e1a27d27597a32037e6796d3f94df7c17ff8454341d1661c1e99b7f6e54e0fba
|
| 3 |
+
size 6017846432
|
Devstral-Small-2-24B-Instruct-2512-UD-IQ1_S.gguf
CHANGED
|
@@ -1,3 +1,3 @@
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:
|
| 3 |
-
size
|
|
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:a80ab11ca4af5af7fe42fd32c1a552e8eac4e4882be5646eac3273017a0e2a37
|
| 3 |
+
size 5559094432
|
Devstral-Small-2-24B-Instruct-2512-UD-IQ2_M.gguf
CHANGED
|
@@ -1,3 +1,3 @@
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:
|
| 3 |
-
size
|
|
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:669920de0afd6b7b5cf97da7779e7c63f8b57c9930b7669e9f23b1c46c2304a6
|
| 3 |
+
size 8243858592
|
Devstral-Small-2-24B-Instruct-2512-UD-IQ2_XXS.gguf
CHANGED
|
@@ -1,3 +1,3 @@
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:
|
| 3 |
-
size
|
|
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:40e93f2c700057860a4917552589d27119127070405cde3e7ed1c7f9ba4fc6db
|
| 3 |
+
size 6751440032
|
Devstral-Small-2-24B-Instruct-2512-UD-IQ3_XXS.gguf
CHANGED
|
@@ -1,3 +1,3 @@
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:
|
| 3 |
-
size
|
|
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:6ab6264a46f5c530cee3a999a57e828ed4609770d01172e8124c1876e80e6415
|
| 3 |
+
size 9409744032
|
Devstral-Small-2-24B-Instruct-2512-UD-Q2_K_XL.gguf
CHANGED
|
@@ -1,3 +1,3 @@
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:
|
| 3 |
-
size
|
|
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:d6e7b4d124338d00d698ce22432b43586941da5218d992952fb04bd8b8ae71f6
|
| 3 |
+
size 9292680352
|
Devstral-Small-2-24B-Instruct-2512-UD-Q3_K_XL.gguf
CHANGED
|
@@ -1,3 +1,3 @@
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:
|
| 3 |
-
size
|
|
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:613c44b12179f8380eb14f413784ad0412ee527e86d762a8d8816ac7ce0d42ca
|
| 3 |
+
size 11851410592
|
Devstral-Small-2-24B-Instruct-2512-UD-Q4_K_XL.gguf
CHANGED
|
@@ -1,3 +1,3 @@
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:
|
| 3 |
-
size
|
|
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:b44e34b78180fc3ab1abbe1edad9f1f3926fdca10eed3bfae168b065e683f6cd
|
| 3 |
+
size 14506151072
|
Devstral-Small-2-24B-Instruct-2512-UD-Q5_K_XL.gguf
CHANGED
|
@@ -1,3 +1,3 @@
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:
|
| 3 |
-
size
|
|
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:1bfce0c4ead4a46178bf42bb5b82ca7f6fb3b3126271d9f4c223c69fbc2c3ffe
|
| 3 |
+
size 16766364832
|
Devstral-Small-2-24B-Instruct-2512-UD-Q6_K_XL.gguf
CHANGED
|
@@ -1,3 +1,3 @@
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:
|
| 3 |
-
size
|
|
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:10fa6b4b4b19f4dfc3c436dd28c1098aede6598b56e04a00c46c2eac98c16804
|
| 3 |
+
size 20788923552
|
Devstral-Small-2-24B-Instruct-2512-UD-Q8_K_XL.gguf
CHANGED
|
@@ -1,3 +1,3 @@
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:
|
| 3 |
-
size
|
|
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:4a50696a851571a53603b7496c444b30b6f5efa9b85c3d9d3e716db59985a06c
|
| 3 |
+
size 28992392352
|
config.json
ADDED
|
@@ -0,0 +1,74 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
{
|
| 2 |
+
"architectures": [
|
| 3 |
+
"Mistral3ForConditionalGeneration"
|
| 4 |
+
],
|
| 5 |
+
"torch_dtype": "bfloat16",
|
| 6 |
+
"image_token_index": 10,
|
| 7 |
+
"model_type": "mistral3",
|
| 8 |
+
"multimodal_projector_bias": false,
|
| 9 |
+
"pad_token_id": 11,
|
| 10 |
+
"projector_hidden_act": "gelu",
|
| 11 |
+
"quantization_config": {
|
| 12 |
+
"activation_scheme": "static",
|
| 13 |
+
"dequantize": false,
|
| 14 |
+
"modules_to_not_convert": [
|
| 15 |
+
"model.vision_tower",
|
| 16 |
+
"model.multi_modal_projector",
|
| 17 |
+
"lm_head"
|
| 18 |
+
],
|
| 19 |
+
"quant_method": "fp8",
|
| 20 |
+
"weight_block_size": null
|
| 21 |
+
},
|
| 22 |
+
"spatial_merge_size": 2,
|
| 23 |
+
"text_config": {
|
| 24 |
+
"attention_dropout": 0.0,
|
| 25 |
+
"head_dim": 128,
|
| 26 |
+
"hidden_act": "silu",
|
| 27 |
+
"hidden_size": 5120,
|
| 28 |
+
"initializer_range": 0.02,
|
| 29 |
+
"intermediate_size": 32768,
|
| 30 |
+
"max_position_embeddings": 393216,
|
| 31 |
+
"model_type": "ministral3",
|
| 32 |
+
"num_attention_heads": 32,
|
| 33 |
+
"num_hidden_layers": 40,
|
| 34 |
+
"num_key_value_heads": 8,
|
| 35 |
+
"rms_norm_eps": 1e-05,
|
| 36 |
+
"rope_parameters": {
|
| 37 |
+
"beta_fast": 32.0,
|
| 38 |
+
"beta_slow": 1.0,
|
| 39 |
+
"factor": 48.0,
|
| 40 |
+
"llama_4_scaling_beta": 0.1,
|
| 41 |
+
"mscale": 1.0,
|
| 42 |
+
"mscale_all_dim": 1.0,
|
| 43 |
+
"original_max_position_embeddings": 8192,
|
| 44 |
+
"rope_theta": 100000000.0,
|
| 45 |
+
"rope_type": "yarn",
|
| 46 |
+
"type": "yarn"
|
| 47 |
+
},
|
| 48 |
+
"sliding_window": null,
|
| 49 |
+
"use_cache": true,
|
| 50 |
+
"vocab_size": 131072
|
| 51 |
+
},
|
| 52 |
+
"tie_word_embeddings": false,
|
| 53 |
+
"transformers_version": "5.0.0rc1",
|
| 54 |
+
"unsloth_fixed": true,
|
| 55 |
+
"vision_config": {
|
| 56 |
+
"attention_dropout": 0.0,
|
| 57 |
+
"head_dim": 64,
|
| 58 |
+
"hidden_act": "silu",
|
| 59 |
+
"hidden_size": 1024,
|
| 60 |
+
"image_size": 1540,
|
| 61 |
+
"initializer_range": 0.02,
|
| 62 |
+
"intermediate_size": 4096,
|
| 63 |
+
"model_type": "pixtral",
|
| 64 |
+
"num_attention_heads": 16,
|
| 65 |
+
"num_channels": 3,
|
| 66 |
+
"num_hidden_layers": 24,
|
| 67 |
+
"patch_size": 14,
|
| 68 |
+
"rope_parameters": {
|
| 69 |
+
"rope_theta": 10000.0,
|
| 70 |
+
"rope_type": "default"
|
| 71 |
+
}
|
| 72 |
+
},
|
| 73 |
+
"vision_feature_layer": -1
|
| 74 |
+
}
|
imatrix_unsloth.gguf
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:57a041eab0edf63c5e00b86df544e4c213300ac0828e27858e4eaa433ba7e5f1
|
| 3 |
+
size 10037344
|
mmproj-BF16.gguf
CHANGED
|
@@ -1,3 +1,3 @@
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:
|
| 3 |
-
size
|
|
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:7fbd63a14ed6642370f7a8c276fdb3c1061f3349468dc7bb3a0a9e803e70926b
|
| 3 |
+
size 879258272
|
mmproj-F16.gguf
CHANGED
|
@@ -1,3 +1,3 @@
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:
|
| 3 |
-
size
|
|
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:236bb4f0500620164917f26749cd402485b7d749034ea817c6a2d65253b126b9
|
| 3 |
+
size 878054048
|
mmproj-F32.gguf
CHANGED
|
@@ -1,3 +1,3 @@
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:
|
| 3 |
-
size
|
|
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:f140d901ebc3d3e79dac0d6549592049d9cf411ef41e049e49afaaa72b5255ef
|
| 3 |
+
size 1755867808
|