GRM-2.6-Plus / .eval_results /swe_bench_pro.yaml
DedeProGames's picture
Create .eval_results/swe_bench_pro.yaml
f945315 verified
raw
history blame contribute delete
226 Bytes
- dataset:
id: ScaleAI/SWE-bench_Pro
task_id: SWE_Bench_Pro
value: 54.0
date: '2026-04-23'
source:
url: https://huggingface.co/OrionLLM/GRM-2.6-Plus
name: Official GRM-2.6 Benchmark
user: DedeProGames