--- license: apache-2.0 library_name: transformers pipeline_tag: text-to-audio language: - en tags: - feature-extraction - audio - music - text2music - custom_code - ace-step - acestep - lora - loha - music-generation - xl-base - pop - electro base_model: ACE-Step/acestep-v15-xl-base --- # ACE-Step v1.5 xl-base: Pop/Electro LoHA (1.3k tracks) LoHA adapter trained on 1.3k pop/electro tracks against the [ACE-Step v1.5 xl-base](https://huggingface.co/ACE-Step/acestep-v15-xl-base) checkpoint. Trained with [Side-Step](https://github.com/koda-dernet/Side-Step) following the maintainer's [recommended config](https://github.com/koda-dernet/Side-Step/issues/57).
|  | **Final**: epoch 100, MA5 **0.7955** (best of run) **Dataset**: 1.3k pop/electro tracks (Same as [Nekochu/stable-audio-open-1.0-Music](https://huggingface.co/Nekochu/stable-audio-open-1.0-Music)) **~1/3 common caption tags**: Chill & Relax, Feel-Good Vibes, Pop Dance, Slow Down & Relax, EDM, Pop, Workout Beats, Chill Vibes, Alt Z, Electro House **Adapter**: LoHA dim=128, alpha=256, target-mlp **Trainable**: 652M params (11.6%) **Hardware**: RTX 5090, ~24-30 min/epoch, ~48 hrs total |
AceStepHandler.add_lora(): Python inference script