jkim96 commited on
Commit
89f96ac
·
verified ·
1 Parent(s): 8a34f7c

Update evaluation results

Browse files
Files changed (1) hide show
  1. README.md +12 -10
README.md CHANGED
@@ -50,14 +50,16 @@ model, tokenizer = load_quantized(
50
  | Metric | Value |
51
  | --- | ---: |
52
  | `wikitext2_ppl` | 7.1423 |
53
- | `zero-shot accuracy avg` | 69.9820 |
54
- | `arc_challenge` | 60.2389 |
55
- | `arc_easy` | 81.2710 |
56
- | `commonsense_qa` | 84.4390 |
57
- | `hellaswag` | 80.4720 |
58
- | `lambada_openai` | 69.3771 |
59
- | `openbookqa` | 43.2000 |
60
- | `piqa` | 82.2633 |
61
- | `truthfulqa_mc2` | 55.1749 |
62
- | `winogrande` | 73.4017 |
 
 
63
 
 
50
  | Metric | Value |
51
  | --- | ---: |
52
  | `wikitext2_ppl` | 7.1423 |
53
+ | `zero-shot accuracy avg` | 70.2603 |
54
+ | `arc_challenge` | 61.3481 |
55
+ | `arc_easy` | 81.6498 |
56
+ | `commonsense_qa` | 84.1114 |
57
+ | `gsm8k_cot` | 82.8658 |
58
+ | `hellaswag` | 80.3625 |
59
+ | `lambada_openai` | 69.8040 |
60
+ | `mmlu` | 77.9590 |
61
+ | `openbookqa` | 44.0000 |
62
+ | `piqa` | 82.2089 |
63
+ | `truthfulqa_mc2` | 55.1401 |
64
+ | `winogrande` | 73.7174 |
65