Add ResearchClawBench evaluation result

#186
by CoCoOne - opened
.eval_results/researchclawbench.yaml ADDED
@@ -0,0 +1,9 @@
+ - dataset:
+     id: InternScience/ResearchClawBench
+     task_id: overall
+   value: 17.12
+   date: "2026-05-15"
+   notes: "ResearchHarness: https://huggingface.co/spaces/InternScience/ResearchHarness; ResearchClawBench: https://huggingface.co/datasets/InternScience/ResearchClawBench; tools enabled; code execution; file-system workspace; completed 39/40 tasks"
+   source:
+     url: https://huggingface.co/deepseek-ai/DeepSeek-V4-Pro
+     name: Model Card