assn2-simpo-qwen2.5-1.5b-lora

SimPO LoRA adapter for Qwen/Qwen2.5-1.5B-Instruct.

This repository contains a LoRA adapter trained for CAS4133 Assignment 2.

Base model

Qwen/Qwen2.5-1.5B-Instruct

The full merged model was not uploaded due to network instability. This adapter can be loaded with the base model.

Inference Providers NEW

This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Base model

Finetuned

Adapter

this model