---
license: other
license_name: vetjarvis-model-license-1.0-nc
license_link: LICENSE
language:
- ko
- en
base_model:
- choonok/VetJarvis-1.1-4B-Instruct
base_model_relation: quantized
pipeline_tag: text-generation
library_name: gguf
tags:
- veterinary
- companion-animal
- canine
- feline
- medical
- domain-specific
- qwen3.5
- gguf
- llama.cpp
- not-a-medical-device
---
# VetJarvis 1.1-4B-Instruct (GGUF)
This is a GGUF conversion of [choonok/VetJarvis-1.1-4B-Instruct](https://huggingface.co/choonok/VetJarvis-1.1-4B-Instruct), suitable for local inference with llama.cpp, Ollama, LM Studio, and similar tools.
## Files
| File | Quantization | Size | Recommended use |
|------|--------------|------|-----------------|
| `VetJarvis-1.1-4B-Instruct-bf16.gguf` | BF16 | ~7.9 GB | Accuracy first; servers; GPU 16 GB+ |
| `VetJarvis-1.1-4B-Instruct-q8_0.gguf` | Q8_0 | ~4.2 GB | Nearly lossless; recommended for general use |
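
As a rough sanity check on the sizes above, file size can be approximated as parameter count times bits per weight. The sketch below assumes a nominal 4.0 billion parameters (taken from the "4B" in the model name, not measured) and ignores metadata and any tensors stored at a different precision, so it should only land in the same range as the listed sizes.

```python
# Rough GGUF size estimate: parameters x bits-per-weight / 8.
# ASSUMPTION: 4.0e9 is the nominal "4B" parameter count, not a measured value.
N_PARAMS = 4.0e9

# Bits per weight. Q8_0 stores blocks of 32 int8 weights plus one fp16 scale,
# i.e. 34 bytes per 32 weights = 8.5 bits per weight.
BITS_PER_WEIGHT = {"BF16": 16.0, "Q8_0": 34 * 8 / 32}


def estimate_size_gb(n_params: float, bits_per_weight: float) -> float:
    """Return the approximate tensor payload size in GB (10^9 bytes)."""
    return n_params * bits_per_weight / 8 / 1e9


for name, bpw in BITS_PER_WEIGHT.items():
    print(f"{name}: ~{estimate_size_gb(N_PARAMS, bpw):.2f} GB")
```

Under these assumptions the estimate is about 8.0 GB for BF16 and about 4.25 GB for Q8_0. Runtime memory will be higher than the file size, since the KV cache and compute buffers grow with the context length in use.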
## Recommended Inference Parameters
| Parameter | Value |
|-----------|-------|
| Temperature | **0.8** |
| Top-p | **0.9** |
| Max Tokens | **32,768** |
| Context Length | ≤ 262,144 |
| enable_thinking | **True** (recommended) |
## Usage
### llama.cpp
```bash
# System prompt (-sys): "You are an AI assistant that supports Korean veterinarians. Always answer in Korean."
# User prompt (-p): "What are the early symptoms of chronic kidney failure in cats?"
./build/bin/llama-cli \
  -m VetJarvis-1.1-4B-Instruct-q8_0.gguf \
  --jinja \
  -ngl 99 \
  -sys "당신은 한국 수의사를 보조하는 AI 어시스턴트입니다. 반드시 한국어로 답변하세요." \
  -p "고양이 만성 신부전의 초기 증상은?" \
  -n 32768 \
  --temp 0.8 \
  --top-p 0.9
```
### Ollama
```
FROM ./VetJarvis-1.1-4B-Instruct-q8_0.gguf
PARAMETER temperature 0.8
PARAMETER top_p 0.9
PARAMETER num_ctx 32768
PARAMETER stop "<|im_end|>"
```
```bash
ollama create vetjarvis-1.1-4b-instruct -f Modelfile
ollama run vetjarvis-1.1-4b-instruct
```
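
Once the model is registered, it can also be queried programmatically. The following is a minimal sketch that assumes an Ollama server running locally on its default address (`http://localhost:11434`) and uses Ollama's `/api/chat` endpoint. The helper names `build_payload` and `ask` are hypothetical, chosen for illustration. The `options` values repeat the Modelfile settings, so they are redundant unless you want to override them per request.

```python
import json
import urllib.request

# ASSUMPTION: a local Ollama server on its default address.
OLLAMA_URL = "http://localhost:11434/api/chat"


def build_payload(question: str, system_prompt: str) -> dict:
    """Build a non-streaming /api/chat request with the recommended sampling settings."""
    return {
        "model": "vetjarvis-1.1-4b-instruct",  # name passed to `ollama create`
        "messages": [
            {"role": "system", "content": system_prompt},
            {"role": "user", "content": question},
        ],
        "stream": False,
        "options": {"temperature": 0.8, "top_p": 0.9, "num_ctx": 32768},
    }


def ask(question: str, system_prompt: str) -> str:
    """Send the request to the local server and return the assistant's reply text."""
    body = json.dumps(build_payload(question, system_prompt)).encode("utf-8")
    request = urllib.request.Request(
        OLLAMA_URL, data=body, headers={"Content-Type": "application/json"}
    )
    with urllib.request.urlopen(request) as response:
        return json.loads(response.read())["message"]["content"]


# Example call (requires the server to be running):
# print(ask("What are the early symptoms of chronic kidney failure in cats?",
#           "You are an AI assistant that supports veterinarians."))
```

Keeping payload construction separate from the network call makes the request easy to inspect or log before anything is sent.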
### LM Studio
A detailed LM Studio usage guide will be added later.
## Conversion Details
- Conversion tool: [llama.cpp](https://github.com/ggerganov/llama.cpp) `convert_hf_to_gguf.py`
- Source precision: BF16 (Qwen3.5-4B was trained in BF16)
- The BF16 file is a direct BF16 → BF16 conversion (no precision loss)
- The Q8_0 file was quantized directly from the original weights
## Architecture Note
This model uses Qwen3.5's **hybrid Transformer + SSM architecture**. It supports a long context of 256K tokens and has been confirmed to run correctly in llama.cpp.
Low-bit quantizations such as q4_K_M may degrade the SSM layers more than they would a standard Transformer model, so **BF16 or Q8_0 is recommended**.
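
To give a feel for why fewer bits cost more precision in general, here is a simplified sketch of symmetric per-block quantization on synthetic data. It is an illustration under stated assumptions, not the llama.cpp kernels: the 8-bit case follows the same basic idea as Q8_0 (one scale per block of 32 values), while the 4-bit case is only a stand-in, since q4_K_M uses a more elaborate scheme. It also says nothing specific about SSM layers.

```python
import random


def quantize_roundtrip(block: list[float], bits: int) -> list[float]:
    """Quantize a block to signed `bits`-wide integers with one scale, then restore it."""
    qmax = 2 ** (bits - 1) - 1  # 127 for 8-bit, 7 for 4-bit
    amax = max(abs(x) for x in block)
    if amax == 0.0:
        return [0.0] * len(block)
    scale = amax / qmax
    return [round(x / scale) * scale for x in block]


def rms_error(block: list[float], bits: int) -> float:
    """Root-mean-square difference between the block and its quantized round trip."""
    restored = quantize_roundtrip(block, bits)
    return (sum((a - b) ** 2 for a, b in zip(block, restored)) / len(block)) ** 0.5


random.seed(0)
# Synthetic Gaussian values standing in for weights; NOT taken from the model.
block = [random.gauss(0.0, 1.0) for _ in range(32)]
err8 = rms_error(block, 8)
err4 = rms_error(block, 4)
print(f"8-bit RMS error: {err8:.5f}")
print(f"4-bit RMS error: {err4:.5f}")
```

The 4-bit step size is roughly 18 times coarser than the 8-bit one (127 / 7), so its round-trip error is correspondingly larger on the same block.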
## License
This GGUF version inherits the original model's `vetjarvis-model-license-1.0-nc` license. **Non-commercial use only.** See the included [LICENSE](LICENSE) file for details.
## ⚠️ Not a Medical Device
This model is a **reference tool to support clinical decision-making**. It is **not a medical device** and does not replace diagnosis or prescription. All clinical judgments must be made by a qualified veterinarian.