jamesdumay commited on
Commit
68f97c1
·
verified ·
1 Parent(s): b7f6dae

Upload README.md with huggingface_hub

Browse files
Files changed (1) hide show
  1. README.md +128 -0
README.md ADDED
@@ -0,0 +1,128 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ library_name: mesh-llm
3
+ base_model:
4
+ - "unsloth/Devstral-Small-2-24B-Instruct-2512-GGUF"
5
+ pipeline_tag: text-generation
6
+ tags:
7
+ - gguf
8
+ - mesh-llm
9
+ - layer-package
10
+ - skippy
11
+ - distributed-inference
12
+ - local-inference
13
+ - openai-compatible
14
+ ---
15
+
16
+ <div align="center">
17
+ <a href="https://www.meshllm.cloud">
18
+ <img src="https://github.com/Mesh-LLM/mesh-llm/raw/main/docs/mesh-llm-logo.svg" alt="Mesh LLM" width="220">
19
+ </a>
20
+
21
+ <h1>Devstral-Small-2-24B-Instruct-2512-UD-Q4_K_XL</h1>
22
+
23
+ <p>
24
+ <strong>Distributed GGUF inference package for Mesh LLM</strong>
25
+ </p>
26
+
27
+ <p>
28
+ <a href="https://www.meshllm.cloud"><img alt="Website" src="https://img.shields.io/badge/Website-meshllm.cloud-111111?style=for-the-badge"></a>
29
+ <a href="https://github.com/Mesh-LLM/mesh-llm"><img alt="GitHub" src="https://img.shields.io/badge/GitHub-Mesh--LLM-24292f?style=for-the-badge&logo=github"></a>
30
+ <a href="https://discord.gg/rs6fmc63eN"><img alt="Discord" src="https://img.shields.io/badge/Discord-Join-5865F2?style=for-the-badge&logo=discord&logoColor=white"></a>
31
+ </p>
32
+ </div>
33
+
34
+ GGUF layer package for running **Devstral-Small-2-24B-Instruct-2512-UD-Q4_K_XL** across a local Mesh LLM cluster.
35
+
36
+ This package is derived from [unsloth/Devstral-Small-2-24B-Instruct-2512-GGUF](https://huggingface.co/unsloth/Devstral-Small-2-24B-Instruct-2512-GGUF) and keeps the original GGUF distribution split into per-layer artifacts for distributed inference.
37
+
38
+ ## Highlights
39
+
40
+ | Run locally | Pool multiple machines | OpenAI-compatible | Package variant |
41
+ |---|---|---|---|
42
+ | Private inference on your hardware | Split layers across peers | Serve `/v1/chat/completions` locally | `UD-Q4_K_XL` layer package |
43
+
44
+ ## Model Overview
45
+
46
+ | Property | Value |
47
+ |---|---|
48
+ | **Source model** | [unsloth/Devstral-Small-2-24B-Instruct-2512-GGUF](https://huggingface.co/unsloth/Devstral-Small-2-24B-Instruct-2512-GGUF) |
49
+ | **Model id** | `unsloth/Devstral-Small-2-24B-Instruct-2512-GGUF:UD-Q4_K_XL` |
50
+ | **Family** | Devstral |
51
+ | **Parameter scale** | 24B |
52
+ | **Quantization** | `UD-Q4_K_XL` |
53
+ | **Layer count** | 40 |
54
+ | **Activation width** | 5120 |
55
+ | **Package size** | 13.8 GB |
56
+ | **Source file** | `Devstral-Small-2-24B-Instruct-2512-UD-Q4_K_XL.gguf` |
57
+ | **Package repo** | [meshllm/Devstral-Small-2-24B-Instruct-2512-UD-Q4_K_XL-layers](https://huggingface.co/meshllm/Devstral-Small-2-24B-Instruct-2512-UD-Q4_K_XL-layers) |
58
+
59
+ ## Recommended Use
60
+
61
+ - Local and private inference with Mesh LLM.
62
+ - Multi-machine serving when the full GGUF is too large for one host.
63
+ - OpenAI-compatible chat/completions workflows through Mesh LLM's local API.
64
+
65
+ For upstream architecture details, chat template guidance, sampling recommendations, license terms, and benchmark notes, see the source model card: [unsloth/Devstral-Small-2-24B-Instruct-2512-GGUF](https://huggingface.co/unsloth/Devstral-Small-2-24B-Instruct-2512-GGUF).
66
+
67
+ ## Quickstart
68
+
69
+ ```bash
70
+ # Run this on each machine that should contribute memory/compute.
71
+ mesh-llm serve --model "meshllm/Devstral-Small-2-24B-Instruct-2512-UD-Q4_K_XL-layers" --split
72
+ ```
73
+
74
+ ```bash
75
+ # Check the mesh and discover the OpenAI-compatible model name.
76
+ curl -s http://localhost:3131/api/status
77
+ curl -s http://localhost:3131/v1/models
78
+ ```
79
+
80
+ ```bash
81
+ # Send an OpenAI-compatible chat request.
82
+ curl -s http://localhost:3131/v1/chat/completions \
83
+ -H "Content-Type: application/json" \
84
+ -d '{
85
+ "model": "unsloth/Devstral-Small-2-24B-Instruct-2512-GGUF:UD-Q4_K_XL",
86
+ "messages": [{"role": "user", "content": "Write a tiny hello-world function in Rust."}],
87
+ "max_tokens": 128
88
+ }'
89
+ ```
90
+
91
+ ## Package Variant
92
+
93
+ | Property | Value |
94
+ |---|---|
95
+ | **Format** | `layer-package` |
96
+ | **Canonical source ref** | `unsloth/Devstral-Small-2-24B-Instruct-2512-GGUF@main/Devstral-Small-2-24B-Instruct-2512-UD-Q4_K_XL.gguf` |
97
+ | **Source revision** | `main` |
98
+ | **Source SHA-256** | `b44e34b78180fc3ab1abbe1edad9f1f3926fdca10eed3bfae168b065e683f6cd` |
99
+ | **Skippy ABI** | `0.1.25` |
100
+ | **Package manifest SHA-256** | `1527887657160605200ea0d37353c695f1fe7d74b8d310b1de2a6e371102b67f` |
101
+
102
+ ## What Is Included
103
+
104
+ | Artifact | Path | Contents | SHA-256 |
105
+ |---|---|---|---|
106
+ | Manifest | `model-package.json` | Package schema, source identity, checksums | `1527887657160605200ea0d37353c695f1fe7d74b8d310b1de2a6e371102b67f` |
107
+ | Metadata | `shared/metadata.gguf` | 0 tensors, 8.0 MB | `86fec00c5793bcf438dd2d1c5f75f782c79ea84cf6aaa4450bcc2bd146341e68` |
108
+ | Embeddings | `shared/embeddings.gguf` | 1 tensors, 368.0 MB | `5cdea2e2b47b8d751d31151a3f63196b8723fe8588621228b94c126cc9977ea7` |
109
+ | Output head | `shared/output.gguf` | 2 tensors, 533.0 MB | `3c7be3f6156a76320542e283b933580a1988415c2696a3b8b37ad58f5a68be0b` |
110
+ | Transformer layers | `layers/layer-*.gguf` | 40 layer artifacts, 360 tensors, 13.0 GB | `see model-package.json` |
111
+
112
+ ## Validation
113
+
114
+ Generated by the Mesh LLM HF Jobs splitter from `mesh-llm` ref `main`.
115
+ Each artifact is checksummed as it is written, uploaded to this repository, and removed from the job workspace before the next artifact is produced.
116
+
117
+ ```bash
118
+ skippy-model-package write-package "/source/Devstral-Small-2-24B-Instruct-2512-UD-Q4_K_XL.gguf" --out-dir "/tmp/meshllm-layer-job-meshllm_Devstral-Small-2-24B-Instruct-2512-UD-Q4_K_XL-layers-199/package"
119
+ ```
120
+
121
+ ## Links
122
+
123
+ - Source model: [unsloth/Devstral-Small-2-24B-Instruct-2512-GGUF](https://huggingface.co/unsloth/Devstral-Small-2-24B-Instruct-2512-GGUF)
124
+ - Mesh LLM website: [meshllm.cloud](https://www.meshllm.cloud)
125
+ - Mesh LLM: [github.com/Mesh-LLM/mesh-llm](https://github.com/Mesh-LLM/mesh-llm)
126
+ - Discord: [discord.gg/rs6fmc63eN](https://discord.gg/rs6fmc63eN)
127
+ - Package catalog: [meshllm/catalog](https://huggingface.co/datasets/meshllm/catalog)
128
+ - Package format: [layer-package-repos.md](https://github.com/Mesh-LLM/mesh-llm/blob/main/docs/specs/layer-package-repos.md)