Text Generation
PEFT
Safetensors
English
medical
clinical-reasoning
diagnosis
grpo
rlhf
lora
qwen
conversational
Instructions to use chrisvoncsefalvay/dx-reasoning-qwen2.5-grpo with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- PEFT
How to use chrisvoncsefalvay/dx-reasoning-qwen2.5-grpo with PEFT:
from peft import PeftModel from transformers import AutoModelForCausalLM base_model = AutoModelForCausalLM.from_pretrained("Qwen/Qwen2.5-7B-Instruct") model = PeftModel.from_pretrained(base_model, "chrisvoncsefalvay/dx-reasoning-qwen2.5-grpo") - Notebooks
- Google Colab
- Kaggle
Ctrl+K