techwithsergiu commited on
Commit
7ff1e3b
·
verified ·
1 Parent(s): 54c9614

upd Model family in readme

Browse files
.gitattributes CHANGED
@@ -35,3 +35,4 @@ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
35
  *tfevents* filter=lfs diff=lfs merge=lfs -text
36
  tokenizer.json filter=lfs diff=lfs merge=lfs -text
37
  diagrams/diagram_02.png filter=lfs diff=lfs merge=lfs -text
 
 
35
  *tfevents* filter=lfs diff=lfs merge=lfs -text
36
  tokenizer.json filter=lfs diff=lfs merge=lfs -text
37
  diagrams/diagram_02.png filter=lfs diff=lfs merge=lfs -text
38
+ diagrams/diagram_01.png filter=lfs diff=lfs merge=lfs -text
README.md CHANGED
@@ -19,7 +19,7 @@ Retains the full visual tower — this is a **VLM-capable** model (image + text
19
  Primary use-case: Unsloth LoRA fine-tuning when you need image understanding in the
20
  fine-tuned result.
21
 
22
- > If you only need text fine-tuning, use the smaller and cleaner
23
  > [techwithsergiu/Qwen3.5-text-2B-bnb-4bit](https://huggingface.co/techwithsergiu/Qwen3.5-text-2B-bnb-4bit)
24
  > instead — same backbone, visual tower removed, lighter VRAM footprint.
25
 
@@ -33,16 +33,16 @@ fine-tuned result.
33
 
34
  ![](diagrams/diagram_01.png)
35
 
36
- | Model | Format | Visual |
37
  |---|---|---|
38
- | [Qwen/Qwen3.5-2B](https://huggingface.co/Qwen/Qwen3.5-2B) | f16 | |
39
- | **techwithsergiu/Qwen3.5-2B-bnb-4bit** | BNB NF4 | |
40
- | [techwithsergiu/Qwen3.5-text-2B](https://huggingface.co/techwithsergiu/Qwen3.5-text-2B) | bf16 | |
41
- | [techwithsergiu/Qwen3.5-text-2B-bnb-4bit](https://huggingface.co/techwithsergiu/Qwen3.5-text-2B-bnb-4bit) | BNB NF4 | |
42
-
43
- The BNB-quantized models are roughly 40% of the original f16 size. The visual tower is
44
- kept at bf16 and is a fixed ~0.65 GB overhead shared across all model sizes — it accounts
45
- for a larger fraction of the total in smaller models (0.8B) than in larger ones (9B).
46
 
47
  ## Fine-tuning
48
 
 
19
  Primary use-case: Unsloth LoRA fine-tuning when you need image understanding in the
20
  fine-tuned result.
21
 
22
+ > If you only need text fine-tuning, use
23
  > [techwithsergiu/Qwen3.5-text-2B-bnb-4bit](https://huggingface.co/techwithsergiu/Qwen3.5-text-2B-bnb-4bit)
24
  > instead — same backbone, visual tower removed, lighter VRAM footprint.
25
 
 
33
 
34
  ![](diagrams/diagram_01.png)
35
 
36
+ | Model | Type | Base model |
37
  |---|---|---|
38
+ | [Qwen/Qwen3.5-2B](https://huggingface.co/Qwen/Qwen3.5-2B) | f16 · VLM · source | |
39
+ | **[techwithsergiu/Qwen3.5-2B-bnb-4bit](https://huggingface.co/techwithsergiu/Qwen3.5-2B-bnb-4bit)** | BNB NF4 · VLM | Qwen/Qwen3.5-2B |
40
+ | [techwithsergiu/Qwen3.5-text-2B](https://huggingface.co/techwithsergiu/Qwen3.5-text-2B) | bf16 · text-only | Qwen/Qwen3.5-2B |
41
+ | [techwithsergiu/Qwen3.5-text-2B-bnb-4bit](https://huggingface.co/techwithsergiu/Qwen3.5-text-2B-bnb-4bit) | BNB NF4 · text-only | Qwen3.5-text-2B |
42
+ | [techwithsergiu/Qwen3.5-text-2B-GGUF](https://huggingface.co/techwithsergiu/Qwen3.5-text-2B-GGUF) | GGUF quants | Qwen3.5-text-2B |
43
+
44
+ The visual tower is a fixed ~0.65 GB bf16 overhead shared across all model sizes.
45
+ BNB-quantized models are roughly 40% of the original f16 size (exact ratio varies by size).
46
 
47
  ## Fine-tuning
48
 
diagrams/diagram_01.png CHANGED

Git LFS Details

  • SHA256: 5ca69d979ead26a6cbc558a7a884a2511ffe130c074e0de19c7a9c0925e56598
  • Pointer size: 131 Bytes
  • Size of remote file: 126 kB
diagrams/diagram_02.png CHANGED

Git LFS Details

  • SHA256: 6a5b3bf2522f461adeae89d9f66d065330a9e6868d0f472061e6941595f229cd
  • Pointer size: 131 Bytes
  • Size of remote file: 333 kB

Git LFS Details

  • SHA256: c83120e74d873a26508dd455b254fbdc4f0ba9a41a5b1377692cd7141c4d2f7f
  • Pointer size: 131 Bytes
  • Size of remote file: 338 kB