JupiterJil's picture
Qwen 2.5 7B LoRA adapter β€” v2 run, final loss 1.48, template bleed fixed
98388ae verified
|
Raw
History Blame Contribute Delete
1.35 kB
---
library_name: peft
base_model: Qwen/Qwen2.5-7B-Instruct
tags:
- lora
- peft
- unsloth
- qwen
- qwen-2.5
- cyberspace
- presales
---
# Cyberspace Presales Co-Pilot β€” Qwen 2.5 7B LoRA Adapter
LoRA adapter for Cyberspace Limited's Vertical AI Presales Co-Pilot.
## Training Summary
- **Base model:** `Qwen/Qwen2.5-7B-Instruct`
- **Method:** LoRA (QLoRA 4-bit) via Unsloth
- **Dataset:** 132 Cyberspace proposals (ChatML format, zero template bleed)
- **Final loss:** 1.4795 (target: 1.2–1.6 βœ…)
- **Training time:** ~4.9 min on A100 80GB
## LoRA Config
- Rank: 16
- Alpha: 16
- Dropout: 0.05
- Target modules: q_proj, k_proj, v_proj, o_proj, gate_proj, up_proj, down_proj
- Epochs: 3
- LR: 2e-4 (cosine scheduler)
- `train_on_responses_only`: True (ChatML markers)
## Usage
```python
from peft import PeftModel
from transformers import AutoModelForCausalLM, AutoTokenizer
base = AutoModelForCausalLM.from_pretrained("Qwen/Qwen2.5-7B-Instruct")
model = PeftModel.from_pretrained(base, "JupiterJil/cyberspace-qwen25-7b-lora")
```
## Recommended Inference Settings
- Temperature: 0.4
- Top-p: 0.9
- Repeat penalty: 1.1
- Use detailed category-specific system prompts
## Production Artifact
For inference, use the GGUF Q4_K_M version: [JupiterJil/cyberspace-qwen25-7b-gguf](https://huggingface.co/JupiterJil/cyberspace-qwen25-7b-gguf)