tkwiecinski/amr-fma-OLMo-2-1124-7B-Instruct-lora_sft-math-p1_sft_math_tooluse-s42

amr-fma training run.

  • Method: lora_sft
  • Base model: allenai/OLMo-2-1124-7B-Instruct
  • Dataset: DigitalLearningGmbH/MATH-lighteval (slug: math)
  • Seed: 42
  • Git commit: 8b979a30de6dfbf3b5a1052e42d8c0453b214d3f
  • Exp name: p1_sft_math_tooluse
  • WandB run: buury04a

Tags

  • phase:P1
  • domain:math

Checkpoints (branches)

  • step 1 β†’ revision step-00001
  • step 3 β†’ revision step-00003
  • step 6 β†’ revision step-00006
  • step 12 β†’ revision step-00012
  • step 24 β†’ revision step-00024
  • step 45 β†’ revision step-00045
  • step 86 β†’ revision step-00086
  • step 87 β†’ revision step-00087

Pin a specific checkpoint with revision=... in AutoModelForCausalLM.from_pretrained / PeftModel.from_pretrained.

Hyperparameter sections

checkpointing, dataset, evaluation, final_adapter_path, lora, model, optimization, prompt_style, runtime, sdpo, sequence, total_steps

Downloads last month
9
Inference Providers NEW
This model isn't deployed by any Inference Provider. πŸ™‹ Ask for provider support

Model tree for tkwiecinski/amr-fma-OLMo-2-1124-7B-Instruct-lora_sft-math-p1_sft_math_tooluse-s42