Introspective Diffusion Language Models (I-DLM)
Collection
Model checkpoints for I-DLM. Paper: https://arxiv.org/abs/2604.11035 • 3 items • Updated • 11
How to use yifanyu/I-DLM-8B-lora-r128 with PEFT:
from peft import PeftModel
from transformers import AutoModelForCausalLM
base_model = AutoModelForCausalLM.from_pretrained("/data/cxu/dllm-distillation/training/model/Qwen3-8B-b3-allmasked-causal")
model = PeftModel.from_pretrained(base_model, "yifanyu/I-DLM-8B-lora-r128")LoRA adapter (rank=128) for I-DLM-8B, enabling lossless Relaxed Introspective Strided Decoding (R-ISD).
This model was presented in the paper Introspective Diffusion Language Models.
This LoRA adapter is used in the gated LoRA setup described in the I-DLM paper:
Note: This LoRA adapter is used with the I-DLM-8B base model via our SGLang-based serving pipeline. Direct loading via
transformersis not currently supported for reproducing paper results. Please use the SGLang-based ISD pipeline for inference.
`