--- license: bsd-2-clause base_model: - LanguageBind/Open-Sora-Plan-v1.2.0 library_name: diffusers tags: - OutDreamer - video-outpainting - diffusion-transformer - DiT --- # OutDreamer checkpoint for video outpainting This repository provides the OutDreamer checkpoint for **OutDreamer: Video Outpainting with a Diffusion Transformer**. OutDreamer is a DiT-based video outpainting framework designed to extend video content beyond the original frame boundaries while maintaining spatial and temporal consistency. The model introduces an efficient video control branch, a conditional outpainting branch, mask-driven self-attention, latent alignment loss, and a cross-video-clip refiner for long video outpainting. The method and its results are detailed in the arXiv paper: [OutDreamer: Video Outpainting with a Diffusion Transformer](https://arxiv.org/abs/2506.22298). ## How to Use **Important:** This checkpoint is intended to be used with the OutDreamer codebase and is not a standalone Hugging Face pipeline. For project details, please refer to the OutDreamer GitHub repository: [zhongzero/OutDreamer](https://github.com/zhongzero/OutDreamer) For setup and inference scripts compatible with this checkpoint, please refer to the reproduction repository: [zhongzero/OutDreamer-unofficial](https://github.com/zhongzero/OutDreamer-unofficial) ## Citation If you find this work helpful for your research, please cite: ```BibTeX @article{zhong2026outdreamer, title={Outdreamer: Video outpainting with a diffusion transformer}, author={Zhong, Linhao and Li, Fan and Huang, Yi and Liu, Jianzhuang and Pei, Renjing and Song, Fenglong}, journal={IEEE Transactions on Image Processing}, year={2026}, publisher={IEEE} } ```