YAML Metadata Error: Invalid content in Eval Result file .eval_results/cerebellum_benchmarks.yaml

Dataset "ai2_arc" does not exist

Dataset "hellaswag" does not exist

Task ID "default" does not match any task in dataset "openai/openai_humaneval". Available: none
results: eval badge files (self-reported, audited locally)
88e9e86 verified
- dataset:
    id: ai2_arc
    task_id: arc_challenge
  value: 0.9548
  date: '2026-06-11'
  notes: 25-shot, llama.cpp, lm-eval-harness, RTX 3090, audited
  source:
    url: https://huggingface.co/deucebucket/Qwen3.6-35B-A3B-Heretic-Cerebellum-GGUF/tree/main/benchmark_results
    name: Cerebellum ablation-informed mixed-precision results
- dataset:
    id: hellaswag
    task_id: default
  value: 0.9178
  date: '2026-06-11'
  notes: 10-shot, llama.cpp, lm-eval-harness, RTX 3090, audited
  source:
    url: https://huggingface.co/deucebucket/Qwen3.6-35B-A3B-Heretic-Cerebellum-GGUF/tree/main/benchmark_results
    name: Cerebellum ablation-informed mixed-precision results
- dataset:
    id: openai/openai_humaneval
    task_id: default
  value: 0.6463
  date: '2026-06-11'
  notes: pass@1, evalplus HumanEval+, llama.cpp, RTX 3090, audited
  source:
    url: https://huggingface.co/deucebucket/Qwen3.6-35B-A3B-Heretic-Cerebellum-GGUF/tree/main/benchmark_results
    name: Cerebellum ablation-informed mixed-precision results