Instructions to use tkwiecinski/amr-fma-OLMo-2-1124-7B-Instruct-lora_sft-math-p1_sft_math_tooluse-s42 with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- PEFT
How to use tkwiecinski/amr-fma-OLMo-2-1124-7B-Instruct-lora_sft-math-p1_sft_math_tooluse-s42 with PEFT:
Task type is invalid.
- Notebooks
- Google Colab
- Kaggle
tkwiecinski/amr-fma-OLMo-2-1124-7B-Instruct-lora_sft-math-p1_sft_math_tooluse-s42
amr-fma training run.
- Method:
lora_sft - Base model:
allenai/OLMo-2-1124-7B-Instruct - Dataset:
DigitalLearningGmbH/MATH-lighteval(slug:math) - Seed:
42 - Git commit:
8b979a30de6dfbf3b5a1052e42d8c0453b214d3f - Exp name:
p1_sft_math_tooluse - WandB run:
buury04a
Tags
- phase:P1
- domain:math
Checkpoints (branches)
- step 1 β revision
step-00001 - step 3 β revision
step-00003 - step 6 β revision
step-00006 - step 12 β revision
step-00012 - step 24 β revision
step-00024 - step 45 β revision
step-00045 - step 86 β revision
step-00086 - step 87 β revision
step-00087
Pin a specific checkpoint with revision=... in
AutoModelForCausalLM.from_pretrained / PeftModel.from_pretrained.
Hyperparameter sections
checkpointing, dataset, evaluation, final_adapter_path, lora, model, optimization, prompt_style, runtime, sdpo, sequence, total_steps
- Downloads last month
- 9
Inference Providers NEW
This model isn't deployed by any Inference Provider. π Ask for provider support
Model tree for tkwiecinski/amr-fma-OLMo-2-1124-7B-Instruct-lora_sft-math-p1_sft_math_tooluse-s42
Base model
allenai/OLMo-2-1124-7B Finetuned
allenai/OLMo-2-1124-7B-SFT Finetuned
allenai/OLMo-2-1124-7B-DPO Finetuned
allenai/OLMo-2-1124-7B-Instruct