Flawed Fictions Qwen3-4B LiteReason SFT (positive)

Projector SFT checkpoint trained on the positive-only Flawed Fictions implicit-thought dataset. This upload preserves the full base LM weights plus LiteReason reasoning_projector weights in the safetensors shards.

Training Details

Base model: Qwen/Qwen3-4B-Instruct-2507
Dataset variant: positive (gold-label Yes subset only)
Dataset/prep label in local artifacts: twenty_five_percent_masked
W&B run: gm8xrkg2
Source checkpoint directory: /mnt/disk/litereason_anon/litereason/experiments/flawed_fictions/outputs/sft_qwen3_4b_positive

What Is Saved

Full Hugging Face model weights
LiteReason reasoning_projector.* weights embedded in the uploaded safetensors
config.json with is_litereason_model=true, litereason_num_reasoning_layers=3, and litereason_max_reasoning_steps=5
Tokenizer / chat template files needed for reload

Usage

from litereason.causal_lm_with_reasoning import AutoModelForCausalLMWithReasoning

model = AutoModelForCausalLMWithReasoning.from_pretrained(
    "agurung/flawed-fictions-qwen3-4b-litereason-sft-positive",
    device_map="auto",
    torch_dtype="auto",
)

Downloads last month: 2

Safetensors

Model size

4B params

Tensor type

BF16

Inference Providers NEW

This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for agurung/flawed-fictions-qwen3-4b-litereason-sft-positive

Base model

Qwen/Qwen3-4B-Instruct-2507

Finetuned

(1741)

this model