Text Classification
PEFT
Safetensors
process-reward-model
reasoning
reward-model
lora
test-time-compute
ai-efficiency
Instructions to use vanthienha199/thinktank-prm-qwen2.5-0.5b with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- PEFT
How to use vanthienha199/thinktank-prm-qwen2.5-0.5b with PEFT:
from peft import PeftModel from transformers import AutoModelForSequenceClassification base_model = AutoModelForSequenceClassification.from_pretrained("Qwen/Qwen2.5-0.5B") model = PeftModel.from_pretrained(base_model, "vanthienha199/thinktank-prm-qwen2.5-0.5b") - Notebooks
- Google Colab
- Kaggle
File size: 133 Bytes
a949a73 | 1 2 3 4 | version https://git-lfs.github.com/spec/v1
oid sha256:3fd169731d2cbde95e10bf356d66d5997fd885dd8dbb6fb4684da3f23b2585d8
size 11421892
|