nakotsuko13/distilled_dataset_v2_clean
Viewer • Updated • 1.67k • 5
How to use nakotsuko13/qwen3-4b-nako13-agentbench-lora with PEFT:
Task type is invalid.
This repository provides a LoRA adapter fine-tuned from Qwen/Qwen3-4B-Instruct-2507 using LoRA + Unsloth.
This adapter is trained to improve multi-turn agent task performance on:
from transformers import AutoModelForCausalLM, AutoTokenizer
from peft import PeftModel
import torch
base = "Qwen/Qwen3-4B-Instruct-2507"
model = AutoModelForCausalLM.from_pretrained(base, torch_dtype=torch.float16, device_map="auto")
model = PeftModel.from_pretrained(model, "your_id/your-repo")
Base model
Qwen/Qwen3-4B-Instruct-2507