yami2333/qwen2.5-1.5b-grpo-infer

[wandb](https://wandb.ai/fin-llm/huggingface/reports/Untitled-Report--VmlldzoxMjkzNzY0Mw/edit?draftId=VmlldzoxMjkzNzY0Mw==&firstReport=&runsetFilter)

result:

Downloads last month: 2

Safetensors

Model size: 2B params

Tensor type: BF16

Model tree for yami2333/qwen2.5-1.5b-grpo-infer

Base model: Qwen/Qwen2.5-1.5B

Finetuned (358): this model

Dataset used to train yami2333/qwen2.5-1.5b-grpo-infer