RL Maths - a BounharAbdelaziz Collection

RL Maths

updated 8 days ago

hkgc/math3to5_olympiads_aime

Viewer • Updated Jul 6, 2025 • 18.3k • 27 • 3

Note RL dataset for olympiad-level math. Contains hints that are used for difficult samples. Details in https://arxiv.org/pdf/2507.10628.
SynthLabsAI/Big-Math-RL-Verified

Viewer • Updated Mar 25, 2025 • 251k • 39.2k • 231
zwhe99/DeepMath-103K

Viewer • Updated May 29, 2025 • 103k • 7.32k • 364

Note Answers were generated by R1 and then extracted.
virtuoussy/Math-RLVR

Viewer • Updated Apr 16, 2025 • 782k • 152 • 9

Note Needs a judge to verify answers.
a-m-team/AM-Math-Difficulty-RL

Viewer • Updated Apr 2, 2025 • 235k • 261 • 16

Note A collection of 235K samples for math curriculum learning.
deepmind/aqua_rat

Viewer • Updated Jan 9, 2024 • 196k • 8.15k • 72

Note A large-scale dataset of approximately 100,000 algebraic word problems. Multiple-choice questions (MCQ) with a single correct answer. Can be verified via string matching against the final answer.
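The string-matching check mentioned in the note could look roughly like the sketch below. It is only an illustration: the function names, the assumption that the gold label is a single option letter (A to E), and the assumption that the model states its choice as "answer is X" or `\boxed{X}` are all hypothetical and not taken from the dataset card.

```python
import re
from typing import Optional

# Assumed output convention (hypothetical): the completion states its choice
# as "answer is X", "Answer: X", or "\boxed{X}", where X is an uppercase A-E.
_CHOICE = re.compile(r"(?:(?i:answer)\s*(?:is)?\s*[:=]?\s*|\\boxed\{)\(?([A-E])\b")


def extract_choice(completion: str) -> Optional[str]:
    """Return the last option letter stated in the completion, or None."""
    matches = _CHOICE.findall(completion)
    return matches[-1] if matches else None


def mcq_reward(completion: str, gold: str) -> float:
    """Binary reward: 1.0 if the extracted letter equals the gold letter."""
    pred = extract_choice(completion)
    return 1.0 if pred is not None and pred == gold.strip().upper() else 0.0
```

Taking the last match rather than the first is a design choice: a model that reconsiders mid-answer is scored on its final stated option.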
di-zhang-fdu/DeepMind_Mathematics_QA

Viewer • Updated Sep 13, 2024 • 1k • 56 • 2
Intelligent-Internet/II-Thought-RL-v0-Math-50K

Viewer • Updated Mar 24, 2025 • 53.3k • 18 • 3
PRIME-RL/Eurus-2-RL-Data

Viewer • Updated Feb 19, 2025 • 483k • 1.64k • 57

Note Contains both math and code.
GAIR/LIMO

Viewer • Updated Feb 10, 2025 • 817 • 6.7k • 177
GAIR/LIMO-v2

Viewer • Updated Jul 30, 2025 • 800 • 897 • 11
Skywork/Skywork-OR1-RL-Data

Viewer • Updated May 29, 2025 • 119k • 7.3k • 67

Note Math split contains 105k samples
a-m-team/AM-Thinking-v1-RL-Dataset

Viewer • Updated May 21, 2025 • 54.8k • 193 • 18

Note Contains 34K math samples.
m-gopichand/deepmind_math_dataset_processed

Viewer • Updated Feb 16, 2025 • 227M • 663 • 1

Note Sample a subset from the medium and hard sets (hard is 1% of the data, which is already about 2.3M samples).
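One possible way to draw a fixed-size subset from a split this large without loading it into memory is reservoir sampling over a stream of rows. The sketch below is a generic illustration, not the curator's procedure; `reservoir_sample`, `k`, and `seed` are hypothetical names, and the stream can be any iterable of rows.

```python
import random
from typing import Iterable, List, TypeVar

T = TypeVar("T")


def reservoir_sample(stream: Iterable[T], k: int, seed: int = 0) -> List[T]:
    """Uniformly sample up to k items from a stream in one pass (Algorithm R)."""
    rng = random.Random(seed)
    reservoir: List[T] = []
    for i, item in enumerate(stream):
        if i < k:
            # Fill the reservoir with the first k items.
            reservoir.append(item)
        else:
            # Keep the new item with probability k / (i + 1).
            j = rng.randint(0, i)  # inclusive on both ends
            if j < k:
                reservoir[j] = item
    return reservoir
```

For example, `reservoir_sample(rows, k=50_000)` would keep 50,000 rows from an iterable `rows` of any length, using memory proportional to `k` only.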
PrimeIntellect/INTELLECT-2-RL-Dataset

Viewer • Updated May 13, 2025 • 285k • 210 • 66

Note 270k math problems
BytedTsinghua-SIA/DAPO-Math-17k

Viewer • Updated Apr 18, 2025 • 1.79M • 9.96k • 177
nvidia/AceReason-Math

Viewer • Updated Jun 18, 2025 • 49.6k • 34.9k • 55

Note High-quality, verifiable, challenging, and diverse math dataset for training math reasoning models using reinforcement learning. This dataset contains 49K math problems and answers sourced from NuminaMath and DeepScaler-Preview, applying filtering rules to exclude unsuitable data (e.g., multiple sub-questions, multiple-choice, true/false, long and complex answers, proof, figure). This dataset was used to train AceReason-Nemotron models, which achieve strong results on math benchmarks such as AIME24 an