Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing
    • Website
      • Tasks
      • HuggingChat
      • Collections
      • Languages
      • Organizations
    • Community
      • Blog
      • Posts
      • Daily Papers
      • Learn
      • Discord
      • Forum
      • GitHub
    • Solutions
      • Team & Enterprise
      • Hugging Face PRO
      • Enterprise Support
      • Inference Providers
      • Inference Endpoints
      • Storage Buckets

  • Log In
  • Sign Up
BounharAbdelaziz 's Collections
RL for LLMs
LLM Post-training
AI Agents
Moroccan Darija Datasets
SFT for LLMs
Reward models
SFT Mix
General RL
RL Vision
RL Maths
RL Code
RL Agents
SFT Math
SFT Informal Maths
SFT Vision
SFT Vision Thinking
SFT VLM
Code Agents
Web Agents
Frugal-AI
RLHF/RLVR
Moroccan Darija LLMs
Moroccan Darija Embeddings Models & Datasets
Moroccan Speech Models & Datasets
Translation Models & Datasets
Arabic (MSA) Language Models & Datasets
Arabic (MSA) Summarization Models & Datasets

RL for LLMs

updated 8 days ago
Upvote
-

  • General RL

    Collection
    4 items • Updated 8 days ago

  • RL Code

    Collection
    7 items • Updated 8 days ago

    Note A collection of datasets for Code RL training of LLMs.


  • RL Maths

    Collection
    17 items • Updated 8 days ago

    Note A collection of datasets for Maths RL training of LLMs.


  • RL Vision

    Collection
    2 items • Updated 8 days ago

    Note A collection of datasets for vision RL training of VLMs.


  • Reward models

    Collection
    1 item • Updated 8 days ago

    Note A collection of RMs for RL training of LLMs.


  • RL Agents

    Collection
    2 items • Updated 8 days ago

    Note A collection of agentic data for RL training of LLMs.

Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs