tkwiecinski/amr-fma-OLMo-2-1124-7B-Instruct-lora_sdpo-arc_challenge-p1_sdpo_multimodel_trial-s42

amr-fma training run.

  • Method: lora_sdpo
  • Base model: allenai/OLMo-2-1124-7B-Instruct
  • Dataset: allenai/ai2_arc (slug: arc_challenge)
  • Seed: 42
  • Git commit: 0a703a3b9fa4a2fe6be6ab5621e40883fd67118c
  • Exp name: p1_sdpo_multimodel_trial
  • WandB run: 5m88ybj7

Tags

  • phase:P1
  • domain:science

Checkpoints (branches)

  • step 1 β†’ revision step-00001
  • step 3 β†’ revision step-00003
  • step 5 β†’ revision step-00005
  • step 10 β†’ revision step-00010
  • step 19 β†’ revision step-00019
  • step 35 β†’ revision step-00035
  • step 63 β†’ revision step-00063
  • step 64 β†’ revision step-00064

Pin a specific checkpoint with revision=... in AutoModelForCausalLM.from_pretrained / PeftModel.from_pretrained.

Hyperparameter sections

checkpointing, dataset, evaluation, final_adapter_path, lora, model, optimization, prompt_style, runtime, sdpo, sequence, total_steps

Downloads last month
9
Inference Providers NEW
This model isn't deployed by any Inference Provider. πŸ™‹ Ask for provider support

Model tree for tkwiecinski/amr-fma-OLMo-2-1124-7B-Instruct-lora_sdpo-arc_challenge-p1_sdpo_multimodel_trial-s42