ASSN2 SimPO+SFT Model

This repository contains the SimPO+SFT model checkpoint for Assignment 2.

Pipeline

  • Base model: meta-llama/Llama-3.2-1B-Instruct
  • SFT merged checkpoint: /workspace_data/checkpoints/assn2-sft-merged-hf
  • SimPO initialization: SFT merged checkpoint
  • SimPO checkpoint: /workspace_data/checkpoints/assn2-simpo-sft

Evaluation

  • SimPO+SFT accuracy: 0.62

Notes

This model was produced by applying the reference-free SimPO objective starting from the SFT-merged checkpoint.

Downloads last month
20
Safetensors
Model size
1B params
Tensor type
BF16
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support