tkwiecinski/amr-fma-OLMo-2-1124-7B-Instruct-lora_sft-math-p1_sft_math_tooluse-s42

amr-fma training run.

Method: lora_sft
Base model: allenai/OLMo-2-1124-7B-Instruct
Dataset: DigitalLearningGmbH/MATH-lighteval (slug: math)
Seed: 42
Git commit: 8b979a30de6dfbf3b5a1052e42d8c0453b214d3f
Exp name: p1_sft_math_tooluse
WandB run: buury04a

Checkpoints (branches)

step 1 → revision step-00001
step 3 → revision step-00003
step 6 → revision step-00006
step 12 → revision step-00012
step 24 → revision step-00024
step 45 → revision step-00045
step 86 → revision step-00086
step 87 → revision step-00087

Pin a specific checkpoint with revision=... in AutoModelForCausalLM.from_pretrained / PeftModel.from_pretrained.

Hyperparameter sections

checkpointing, dataset, evaluation, final_adapter_path, lora, model, optimization, prompt_style, runtime, sdpo, sequence, total_steps

Downloads last month: 9

Inference Providers NEW

This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for tkwiecinski/amr-fma-OLMo-2-1124-7B-Instruct-lora_sft-math-p1_sft_math_tooluse-s42

Base model

allenai/OLMo-2-1124-7B

Finetuned

allenai/OLMo-2-1124-7B-SFT

Finetuned

allenai/OLMo-2-1124-7B-DPO

Finetuned

allenai/OLMo-2-1124-7B-Instruct

Adapter

(16)

this model

tkwiecinski
/

amr-fma-OLMo-2-1124-7B-Instruct-lora_sft-math-p1_sft_math_tooluse-s42

tkwiecinski/amr-fma-OLMo-2-1124-7B-Instruct-lora_sft-math-p1_sft_math_tooluse-s42

Tags

Checkpoints (branches)

Hyperparameter sections

Model tree for tkwiecinski/amr-fma-OLMo-2-1124-7B-Instruct-lora_sft-math-p1_sft_math_tooluse-s42