Open to Collab

huaqin zhao PRO

zhaohq

AI & ML interests

None yet

Recent Activity

published a model 22 days ago

zhaohq/PureRL-7B-v7-stage1-reasoning-qa-instruct-v4

updated a model 23 days ago

zhaohq/PureRL-7B-v7-stage1-reasoning-qa-instruct-v3

published a model 23 days ago

zhaohq/PureRL-7B-v7-stage1-reasoning-qa-instruct-v3

View all activity

Organizations

None yet

published a model 22 days ago

zhaohq/PureRL-7B-v7-stage1-reasoning-qa-instruct-v4

Updated 22 days ago

updated a model 23 days ago

zhaohq/PureRL-7B-v7-stage1-reasoning-qa-instruct-v3

Text Generation • 8B • Updated 23 days ago • 23

published 2 models 23 days ago

zhaohq/PureRL-7B-v7-stage1-reasoning-qa-instruct-v3

Text Generation • 8B • Updated 23 days ago • 23

zhaohq/PureRL-7B-v7-s2-l2-maskon-qa-instruct

Updated 23 days ago

published a model 24 days ago

zhaohq/PureRL-7B-v7-step-long

Updated 24 days ago

updated a model 24 days ago

zhaohq/PureRL-7B-v7-stage1-conf-tag-instruct

Text Generation • 8B • Updated 24 days ago • 445 • 1

published 2 models 25 days ago

zhaohq/RLCR-hotpot

Updated 25 days ago

zhaohq/PureRL-7B-v7-stage1-conf-tag-instruct

Text Generation • 8B • Updated 24 days ago • 445 • 1

updated a model 25 days ago

zhaohq/PureRL-7B-v7-stage1-reasoning-qa-instruct-v2

8B • Updated 25 days ago • 71 • 1

published a model 25 days ago

zhaohq/PureRL-7B-v7-stage1-reasoning-qa-instruct-v2

8B • Updated 25 days ago • 71 • 1

updated a model 25 days ago

zhaohq/PureRL-7B-v7-stage1-reasoning-qa-instruct

Text Generation • 8B • Updated 25 days ago • 166

published a model 25 days ago

zhaohq/PureRL-7B-v7-stage1-reasoning-qa-instruct

Text Generation • 8B • Updated 25 days ago • 166

updated 4 models 25 days ago

published 4 models 26 days ago

zhaohq/PureRL-1.5B-v7-s2-l2-kl-w1-b1

Text Generation • 2B • Updated 25 days ago • 159

zhaohq/PureRL-1.5B-v7-s2-l2-kl-w0-b1

Text Generation • 2B • Updated 25 days ago • 164

zhaohq/PureRL-1.5B-v7-s2-l2-kl-w3-b1

Text Generation • 2B • Updated 25 days ago • 168

zhaohq/PureRL-1.5B-v7-s2-l2-kl-w2-b1

Text Generation • 2B • Updated 25 days ago • 172

huaqin zhao PRO

AI & ML interests

Recent Activity

Organizations

zhaohq's activity