Text Generation
Transformers
Safetensors
Portuguese
qwen3
text-generation-inference
conversational
Eval Results (legacy)
nicholasKluge commited on
Commit
9b0ed55
·
verified ·
1 Parent(s): f66ea75

Upload evals.yaml with huggingface_hub

Browse files
Files changed (1) hide show
  1. evals.yaml +3 -0
evals.yaml CHANGED
@@ -117,6 +117,9 @@ evaluations:
117
  hellaswag_poly_pt_acc_norm_stderr: 0.005200030264123482
118
  hellaswag_poly_pt_acc_stderr: 0.005056141839024339
119
  hellaswag_poly_pt_alias: hellaswag_poly_pt
 
 
 
120
  ifeval_pt_alias: ifeval_pt
121
  ifeval_pt_inst_level_loose_acc: 0.4186046511627907
122
  ifeval_pt_inst_level_loose_acc_stderr: N/A
 
117
  hellaswag_poly_pt_acc_norm_stderr: 0.005200030264123482
118
  hellaswag_poly_pt_acc_stderr: 0.005056141839024339
119
  hellaswag_poly_pt_alias: hellaswag_poly_pt
120
+ humaneval_instruct_alias: humaneval_instruct
121
+ humaneval_instruct_pass@1,create_test: 0.10365853658536585
122
+ humaneval_instruct_pass@1_stderr,create_test: 0.023875115311878508
123
  ifeval_pt_alias: ifeval_pt
124
  ifeval_pt_inst_level_loose_acc: 0.4186046511627907
125
  ifeval_pt_inst_level_loose_acc_stderr: N/A