license: other
license_name: vetjarvis-model-license-1.0-nc
license_link: LICENSE
language:
  - ko
  - en
base_model:
  - choonok/VetJarvis-1.1-4B-Instruct
base_model_relation: quantized
pipeline_tag: text-generation
library_name: gguf
tags:
  - veterinary
  - companion-animal
  - canine
  - feline
  - medical
  - domain-specific
  - qwen3.5
  - gguf
  - llama.cpp
  - not-a-medical-device

VetJarvis 1.1-4B-Instruct (GGUF)

This is a GGUF conversion of choonok/VetJarvis-1.1-4B-Instruct, suitable for local inference with tools such as llama.cpp, Ollama, and LM Studio.

제곡 파일 / Files

파일 μ–‘μžν™” 크기 ꢌμž₯ μš©λ„
VetJarvis-1.1-4B-Instruct-bf16.gguf BF16 ~7.9 GB 정확도 μš°μ„ , μ„œλ²„, GPU 16GB+
VetJarvis-1.1-4B-Instruct-q8_0.gguf Q8_0 ~4.2 GB 거의 무손싀, 일반 μ‚¬μš© ꢌμž₯
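
The two sizes in the table are consistent with each other, which can be checked with a little arithmetic. The sketch below assumes that nearly all weights are quantized and that metadata overhead is negligible; `estimate_q8_0_gb` is a hypothetical helper name. BF16 stores 2 bytes per weight, while llama.cpp's Q8_0 format packs 32 weights into 34 bytes (32 int8 values plus one 16-bit scale), or 8.5 bits per weight.

```shell
#!/bin/sh
# Estimate the Q8_0 file size from the BF16 file size.
# Assumption: almost every tensor is quantized; metadata is ignored.
bf16_gb=7.9   # size of the BF16 file from the table, in GB

estimate_q8_0_gb() {
    # $1 = BF16 file size in GB
    # bf16 / 2      -> billions of weights (2 bytes each in BF16)
    # * 34 / 32     -> bytes per weight in Q8_0 (34 bytes per block of 32)
    awk -v bf16="$1" 'BEGIN { printf "%.1f\n", bf16 / 2 * 34 / 32 }'
}

estimate_q8_0_gb "$bf16_gb"   # prints 4.2
```

The result matches the ~4.2 GB listed for the Q8_0 file, so the Q8_0 download costs roughly half the disk space and memory of the BF16 file.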

Recommended Inference Parameters

| Parameter | Value |
|---|---|
| Temperature | 0.8 |
| Top-p | 0.9 |
| Max Tokens | 32,768 |
| Context Length | ≤ 262,144 |
| enable_thinking | True (recommended) |

Usage

llama.cpp

# System prompt: "You are an AI assistant that supports Korean veterinarians. Always answer in Korean."
# User prompt:   "What are the early symptoms of chronic kidney failure in cats?"
./build/bin/llama-cli \
    -m VetJarvis-1.1-4B-Instruct-q8_0.gguf \
    --jinja \
    -ngl 99 \
    -sys "당신은 한국 수의사를 보조하는 AI 어시스턴트입니다. 반드시 한국어로 답변하세요." \
    -p "고양이 만성 신부전의 초기 증상은?" \
    -n 32768 \
    --temp 0.8 \
    --top-p 0.9
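
The same sampling settings can also be sent to llama.cpp's `llama-server` through its OpenAI-compatible `/v1/chat/completions` endpoint. This is a minimal sketch, not part of the original instructions: it assumes a server was started separately (for example with `llama-server -m VetJarvis-1.1-4B-Instruct-q8_0.gguf --jinja -ngl 99`), and `LLAMA_SERVER_URL` and `build_request` are hypothetical names. With `LLAMA_SERVER_URL` unset, the script only prints the request body.

```shell
#!/bin/sh
# Sketch: query a running llama-server with the recommended parameters.
# LLAMA_SERVER_URL is a hypothetical variable, e.g. http://localhost:8080

build_request() {
    # $1 = user question (must not contain double quotes or backslashes)
    # The system prompt tells the model to assist Korean veterinarians
    # and to always answer in Korean.
    cat <<EOF
{
  "messages": [
    {"role": "system", "content": "당신은 한국 수의사를 보조하는 AI 어시스턴트입니다. 반드시 한국어로 답변하세요."},
    {"role": "user", "content": "$1"}
  ],
  "temperature": 0.8,
  "top_p": 0.9,
  "max_tokens": 32768
}
EOF
}

# "What are the early symptoms of chronic kidney failure in cats?"
payload=$(build_request "고양이 만성 신부전의 초기 증상은?")
echo "$payload"

# Send the request only when a server URL has been provided.
if [ -n "${LLAMA_SERVER_URL:-}" ]; then
    curl -s "$LLAMA_SERVER_URL/v1/chat/completions" \
        -H "Content-Type: application/json" \
        -d "$payload"
fi
```

Running it as `LLAMA_SERVER_URL=http://localhost:8080 sh request.sh` (the script name is arbitrary) would post the payload to a server listening on llama-server's default port.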

Ollama

Save the following as Modelfile:

FROM ./VetJarvis-1.1-4B-Instruct-q8_0.gguf
PARAMETER temperature 0.8
PARAMETER top_p 0.9
PARAMETER num_ctx 32768
PARAMETER stop "<|im_end|>"

Then create and run the model:

ollama create vetjarvis-1.1-4b-instruct -f Modelfile
ollama run vetjarvis-1.1-4b-instruct

LM Studio

A detailed LM Studio guide will be added later.

Conversion Details

  • Conversion tool: llama.cpp convert_hf_to_gguf.py
  • Source precision: BF16 (Qwen3.5-4B was trained in BF16)
  • Converted directly from BF16 to BF16 (no precision loss)
  • Q8_0 was quantized directly from the original weights
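
The steps above might be reproduced roughly as follows. This is a sketch under stated assumptions, not the author's actual command line: the paths in `LLAMA_CPP_DIR` and `HF_MODEL_DIR` are hypothetical placeholders, and using `--outtype q8_0` to produce Q8_0 straight from the original checkpoint is one reading of the last bullet (running `llama-quantize` on a GGUF file would be an alternative). The script only prints the commands unless `DRY_RUN=0` is set.

```shell
#!/bin/sh
# Sketch of the conversion steps; paths are hypothetical placeholders.
LLAMA_CPP_DIR="${LLAMA_CPP_DIR:-./llama.cpp}"                 # llama.cpp checkout
HF_MODEL_DIR="${HF_MODEL_DIR:-./VetJarvis-1.1-4B-Instruct}"   # original BF16 checkpoint
DRY_RUN="${DRY_RUN:-1}"                                       # 1 = print only

run() {
    # Print the command; execute it only when DRY_RUN=0.
    echo "+ $*"
    if [ "$DRY_RUN" = "0" ]; then "$@"; fi
}

convert() {
    # $1 = output type accepted by convert_hf_to_gguf.py (bf16 or q8_0)
    run python "$LLAMA_CPP_DIR/convert_hf_to_gguf.py" "$HF_MODEL_DIR" \
        --outtype "$1" \
        --outfile "VetJarvis-1.1-4B-Instruct-$1.gguf"
}

convert bf16   # BF16 checkpoint to BF16 GGUF, no precision loss
convert q8_0   # Q8_0 quantized directly from the original weights
```

Keeping the dry run as the default makes it easy to review the exact commands before spending time and disk space on a real conversion.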

Architecture Note

This model uses the hybrid Transformer + SSM architecture of Qwen3.5. It supports a long context of 256K tokens and has been verified to run correctly in llama.cpp.

With low-bit quantizations such as q4_K_M, the loss in the SSM layers can be larger than in standard Transformer models, so BF16 or Q8_0 is recommended.

License

This GGUF version inherits the license of the original model (vetjarvis-model-license-1.0-nc) unchanged. Non-commercial use only. See the included LICENSE file for details.

⚠️ 의료기기 μ•„λ‹˜ / Not a Medical Device

λ³Έ λͺ¨λΈμ€ μž„μƒ μ˜μ‚¬κ²°μ •μ„ λ³΄μ‘°ν•˜λŠ” μ°Έκ³  도ꡬ이며, 진단/μ²˜λ°©μ„ λŒ€μ²΄ν•˜μ§€ μ•ŠμŠ΅λ‹ˆλ‹€. λͺ¨λ“  μž„μƒ νŒλ‹¨μ€ μžκ²©μ„ κ°–μΆ˜ μˆ˜μ˜μ‚¬κ°€ μˆ˜ν–‰ν•΄μ•Ό ν•©λ‹ˆλ‹€.

This model is a reference tool to support clinical decision-making. It is not a medical device and does not replace diagnosis or prescription by a qualified veterinarian.