How to use from
Hermes Agent
Start the llama.cpp server
# Install llama.cpp:
brew install llama.cpp
# Start a local OpenAI-compatible server:
llama serve -hf UdayGattu23/qwen2.5-7b-finetuned-argus:
Configure Hermes
# Install Hermes:
curl -fsSL https://hermes-agent.nousresearch.com/install.sh | bash
hermes setup
# Point Hermes at the local server:
hermes config set model.provider custom
hermes config set model.base_url http://127.0.0.1:8080/v1
hermes config set model.default UdayGattu23/qwen2.5-7b-finetuned-argus:
Run Hermes
hermes
Quick Links

Qwen2.5-7B Finetuned on Argus Dataset

This model is a finetuned version of Qwen2.5-7B using LoRA with rank 128.

Training Details

  • Base Model: Qwen/Qwen2.5-7B
  • Training Method: LoRA (rank=128, alpha=256)
  • Dataset: 27,997 text samples
  • Epochs: 2 (best checkpoint from epoch 1)
  • Batch Size: 16 (effective)
  • Learning Rate: 5e-5
  • Hardware: A100 GPU

Training Results

  • Epoch 1: Training Loss: 1.301, Validation Loss: 1.589 (best)
  • Epoch 2: Training Loss: 1.699, Validation Loss: 1.826

Available Formats

  • PyTorch: Original model weights
  • GGUF: Multiple quantization levels available
    • Q8_0: Highest quality (7.5GB)
    • Q6_K: Very high quality (5.5GB)
    • Q5_K_M: High quality (4.8GB)
    • Q4_K_M: Good quality (3.8GB)
    • Q4_0: Acceptable quality (3.5GB)

Usage

With Transformers

from transformers import AutoModelForCausalLM, AutoTokenizer

model = AutoModelForCausalLM.from_pretrained("UdayGattu23/qwen2.5-7b-finetuned-argus")
tokenizer = AutoTokenizer.from_pretrained("UdayGattu23/qwen2.5-7b-finetuned-argus")

With llama.cpp (GGUF)

./main -m qwen2.5-7b-finetuned-Q4_K_M.gguf -p "Your prompt here"

License

Apache 2.0

Downloads last month
26
Safetensors
Model size
8B params
Tensor type
F16
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for UdayGattu23/qwen2.5-7b-finetuned-argus

Base model

Qwen/Qwen2.5-7B
Quantized
(83)
this model