Colby commited on
Commit
d796840
·
verified ·
1 Parent(s): 1c3bdc4

Upload README.md with huggingface_hub

Browse files
Files changed (1) hide show
  1. README.md +80 -0
README.md ADDED
@@ -0,0 +1,80 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ base_model: swiss-ai/Apertus-8B-2509
3
+ tags:
4
+ - gguf
5
+ - llama.cpp
6
+ - quantized
7
+ - trl
8
+ - sft
9
+ ---
10
+
11
+ # Apertus-8B-tulu-xlam-gguf
12
+
13
+ This is a GGUF conversion of [Colby/Apertus-8B-tulu-xlam-sft](https://huggingface.co/Colby/Apertus-8B-tulu-xlam-sft), which is a LoRA fine-tuned version of [swiss-ai/Apertus-8B-2509](https://huggingface.co/swiss-ai/Apertus-8B-2509).
14
+
15
+ ## Model Details
16
+
17
+ - **Base Model:** swiss-ai/Apertus-8B-2509
18
+ - **Fine-tuned Model:** Colby/Apertus-8B-tulu-xlam-sft
19
+ - **Training:** Supervised Fine-Tuning (SFT) with TRL
20
+ - **Format:** GGUF (for llama.cpp, Ollama, LM Studio, etc.)
21
+
22
+ ## Available Quantizations
23
+
24
+ | File | Quant | Size | Description | Use Case |
25
+ |------|-------|------|-------------|----------|
26
+ | Apertus-8B-tulu-xlam-sft-f16.gguf | F16 | ~1GB | Full precision | Best quality, slower |
27
+ | Apertus-8B-tulu-xlam-sft-q8_0.gguf | Q8_0 | ~500MB | 8-bit | High quality |
28
+ | Apertus-8B-tulu-xlam-sft-q5_k_m.gguf | Q5_K_M | ~350MB | 5-bit medium | Good quality, smaller |
29
+ | Apertus-8B-tulu-xlam-sft-q4_k_m.gguf | Q4_K_M | ~300MB | 4-bit medium | Recommended - good balance |
30
+
31
+ ## Usage
32
+
33
+ ### With llama.cpp
34
+
35
+ ```bash
36
+ # Download model
37
+ huggingface-cli download Colby/Apertus-8B-tulu-xlam-gguf Apertus-8B-tulu-xlam-sft-q4_k_m.gguf
38
+
39
+ # Run with llama.cpp
40
+ ./llama-cli -m Apertus-8B-tulu-xlam-sft-q4_k_m.gguf -p "Your prompt here"
41
+ ```
42
+
43
+ ### With Ollama
44
+
45
+ 1. Create a `Modelfile`:
46
+ ```
47
+ FROM ./Apertus-8B-tulu-xlam-sft-q4_k_m.gguf
48
+ ```
49
+
50
+ 2. Create the model:
51
+ ```bash
52
+ ollama create my-model -f Modelfile
53
+ ollama run my-model
54
+ ```
55
+
56
+ ### With LM Studio
57
+
58
+ 1. Download the `.gguf` file
59
+ 2. Import into LM Studio
60
+ 3. Start chatting!
61
+
62
+ ## License
63
+
64
+ Inherits the license from the base model: swiss-ai/Apertus-8B-2509
65
+
66
+ ## Citation
67
+
68
+ ```bibtex
69
+ @misc{Apertus_8B_tulu_xlam_gguf,
70
+ author = {Colby},
71
+ title = {Apertus-8B-tulu-xlam-gguf},
72
+ year = {2025},
73
+ publisher = {Hugging Face},
74
+ url = {https://huggingface.co/Colby/Apertus-8B-tulu-xlam-gguf}
75
+ }
76
+ ```
77
+
78
+ ---
79
+
80
+ *Converted to GGUF format using llama.cpp*