RL Maths
Viewer • Updated • 18.3k • 27 • 3Note RL dataset for olympiads Math. Contains hints that are used for difficult samples. Details in https://arxiv.org/pdf/2507.10628.
-
SynthLabsAI/Big-Math-RL-Verified
Viewer • Updated • 251k • 39.2k • 231
zwhe99/DeepMath-103K
Viewer • Updated • 103k • 7.32k • 364Note answers were provided by R1 then extracted.
virtuoussy/Math-RLVR
Viewer • Updated • 782k • 152 • 9Note needs a judge
a-m-team/AM-Math-Difficulty-RL
Viewer • Updated • 235k • 261 • 16Note A collection of 235K samples for math curr learning.
deepmind/aqua_rat
Viewer • Updated • 196k • 8.15k • 72Note A large-scale dataset consisting of approximately 100,000 algebraic word problems. MCQ with single correct answer. Can be verified via string matching with the final answer.
-
di-zhang-fdu/DeepMind_Mathematics_QA
Viewer • Updated • 1k • 56 • 2 -
Intelligent-Internet/II-Thought-RL-v0-Math-50K
Viewer • Updated • 53.3k • 18 • 3
PRIME-RL/Eurus-2-RL-Data
Viewer • Updated • 483k • 1.64k • 57Note contains math and code
-
GAIR/LIMO
Viewer • Updated • 817 • 6.7k • 177 -
GAIR/LIMO-v2
Viewer • Updated • 800 • 897 • 11
Skywork/Skywork-OR1-RL-Data
Viewer • Updated • 119k • 7.3k • 67Note Math split contains 105k samples
a-m-team/AM-Thinking-v1-RL-Dataset
Viewer • Updated • 54.8k • 193 • 18Note 34K samples of math
m-gopichand/deepmind_math_dataset_processed
Viewer • Updated • 227M • 663 • 1Note sample some from the medium and hard sets. (hard is 1% which is already 2.3M samples...)
PrimeIntellect/INTELLECT-2-RL-Dataset
Viewer • Updated • 285k • 210 • 66Note 270k math problems
-
BytedTsinghua-SIA/DAPO-Math-17k
Viewer • Updated • 1.79M • 9.96k • 177
nvidia/AceReason-Math
Viewer • Updated • 49.6k • 34.9k • 55Note high quality, verfiable, challenging and diverse math dataset for training math reasoning model using reinforcement leraning. This dataset contains 49K math problems and answer sourced from NuminaMath and DeepScaler-Preview applying filtering rules to exclude unsuitable data (e.g., multiple sub-questions, multiple-choice, true/false, long and complex answers, proof, figure) this dataset was used to train AceReason-Nemotron models, which achieve strong results on math benchmark such as AIME24 an