Buckets:

linoyts's picture
|
download
raw
2.88 kB
# LTX-2 Trainer
This package provides tools and scripts for training and fine-tuning
Lightricks' **LTX-2** audio-video generation model. It supports LoRA training, full
fine-tuning, and a flexible conditioning framework covering text-to-video, text-to-audio, image-to-video,
video extension, audio extension, video inpainting, audio inpainting, video outpainting, IC-LoRA for video, audio, and joint
audio-video references, audio-to-video, and video-to-audio.
---
## ๐Ÿ“– Documentation
All detailed guides and technical documentation are in the [docs](./docs/) directory:
- [โšก Quick Start Guide](docs/quick-start.md)
- [๐ŸŽฌ Dataset Preparation](docs/dataset-preparation.md)
- [๐Ÿ› ๏ธ Training Modes](docs/training-modes.md)
- [โš™๏ธ Configuration Reference](docs/configuration-reference.md)
- [๐Ÿš€ Training Guide](docs/training-guide.md)
- [๐Ÿงช Inference Guide](../ltx-pipelines/README.md)
- [๐Ÿ”ง Utility Scripts](docs/utility-scripts.md)
- [๐Ÿงฉ Custom Training Strategies](docs/custom-training-strategies.md)
- [๐Ÿ“š LTX-Core Documentation](../ltx-core/README.md)
- [๐Ÿ›ก๏ธ Troubleshooting Guide](docs/troubleshooting.md)
### ๐Ÿค– Agent-Assisted Training
Use the [`train-model`](../../.claude/skills/train-model/SKILL.md) repository skill for an end-to-end guided run:
it probes your data and hardware, chooses the matching training mode, prepares/preprocesses the dataset, launches
training, and monitors the job while using the docs above as the source of truth.
---
## ๐Ÿ”ง Requirements
- **LTX-2 Model Checkpoint** - Local `.safetensors` file
- **Gemma Text Encoder** - Local Gemma model directory (required for LTX-2)
- **Linux with CUDA** - CUDA 13+ recommended for optimal performance
- **Nvidia GPU with 80GB+ VRAM** - Recommended for the standard config. For GPUs with 32GB VRAM (e.g., RTX 5090),
use the [low VRAM config](configs/t2v_lora_low_vram.yaml) which enables INT8 quantization and other
memory optimizations
---
## ๐Ÿค Contributing
We welcome contributions from the community! Here's how you can help:
- **Share Your Work**: If you've trained interesting LoRAs or achieved cool results, please share them with the
community.
- **Report Issues**: Found a bug or have a suggestion? Open an issue on GitHub.
- **Submit PRs**: Help improve the codebase with bug fixes or general improvements.
- **Feature Requests**: Have ideas for new features? Let us know through GitHub issues.
---
## ๐Ÿ’ฌ Join the Community
Have questions, want to share your results, or need real-time help?
Join our [community Discord server](https://discord.gg/ltxplatform) to connect with other users and the development
team!
- Get troubleshooting help
- Share your training results and workflows
- Collaborate on new ideas and features
- Stay up to date with announcements and updates
We look forward to seeing you there!
---
Happy training! ๐ŸŽ‰

Xet Storage Details

Size:
2.88 kB
ยท
Xet hash:
0f1d0c68aac2311cb6eaf0956ab1db495afdca935c07be7310bb1b998eae2683

Xet efficiently stores files, intelligently splitting them into unique chunks and accelerating uploads and downloads. More info.