metadata
library_name: audio-interv
tags:
  - activation-steering
  - audio
  - austeer
  - diffusion
  - genre
  - interpretability
  - music
  - steering
  - tf6tf7

AUSteer — genre (ACE-Step)

This card provides sparse activation-momentum scores for the genre concept on ACE-Step, stored per (inference step, layer) pair. At inference, AUSteerSteeringController adds a steering offset of strength alpha along the k most concept-discriminative bins (the top-k by score).
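
To make the top-k additive idea concrete, here is a minimal pure-Python sketch. It is not the AUSteerSteeringController implementation: the helper names top_k_bins and steer_additive are hypothetical, and it assumes that each bin has a single scalar score, that a higher score means more concept-discriminative, and that steering adds the same alpha to every selected bin.

```python
from typing import List, Sequence


def top_k_bins(scores: Sequence[float], k: int) -> List[int]:
    """Return the indices of the k highest-scoring bins (hypothetical helper)."""
    # Rank bin indices by score, highest first, and keep the first k.
    order = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)
    return sorted(order[:k])


def steer_additive(
    activation: Sequence[float],
    scores: Sequence[float],
    alpha: float,
    k: int,
) -> List[float]:
    """Add alpha to the top-k bins and leave all other bins untouched.

    Assumption: one scalar score per bin for a single (step, layer) pair.
    """
    selected = set(top_k_bins(scores, k))
    return [a + alpha if i in selected else a for i, a in enumerate(activation)]


# Toy example with four bins; bins 1 and 3 have the highest scores.
activation = [0.5, -1.0, 2.0, 0.0]
scores = [0.1, 0.9, 0.0, 0.7]
steered = steer_additive(activation, scores, alpha=15.0, k=2)
```

In this toy case only bins 1 and 3 are shifted, which mirrors why the scores are described as sparse: most bins are left unchanged.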

Quickstart

from src.steering import SteerableACEModel, AUSteerSteeringController

# Load the ACE-Step pipeline on the GPU.
model = SteerableACEModel(device="cuda")
model.pipeline.load()

# Load the steering scores; alpha sets the steering strength and k the number of steered bins.
ctrl = AUSteerSteeringController.from_pretrained(
    "lukasz-staniszewski/ace-step-austeer-electronic-music-tf6tf7",
    alpha=15.0,
    k=256,
    mode="additive",
)

# Generate with the controller applied inside the steering context.
with model.steer(ctrl):
    audio = model.generate(
        prompt="instrumental music", lyrics="[inst]",
        audio_duration=10.0, infer_step=30, manual_seed=0,
    )

Generation config

{
  "method": "austeer",
  "concept": "genre",
  "lyrics": "[inst]",
  "layers": "tf6tf7",
  "layers_collected": [
    "tf6",
    "tf7"
  ],
  "num_inference_steps": 30,
  "audio_duration": 30.0,
  "seed": 10,
  "guidance_scale": 5.0,
  "guidance_scale_text": 0.0,
  "guidance_scale_lyric": 0.0,
  "guidance_interval": 1.0,
  "guidance_interval_decay": 0.0
}
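
The config above uses different key names from the Quickstart call (num_inference_steps and seed versus infer_step and manual_seed), and different values (30 s and seed 10 versus 10 s and seed 0). The sketch below shows one way to translate the config into the keyword arguments used in the Quickstart. The KEY_MAP table and to_generate_kwargs helper are hypothetical, and the guidance_* keys are deliberately left out because this card does not show whether model.generate accepts them.

```python
import json

# Subset of the generation config from this card, as a JSON string.
CONFIG_JSON = """
{
  "lyrics": "[inst]",
  "num_inference_steps": 30,
  "audio_duration": 30.0,
  "seed": 10,
  "guidance_scale": 5.0
}
"""

# Hypothetical mapping: config key -> keyword name used in the Quickstart.
KEY_MAP = {
    "lyrics": "lyrics",
    "num_inference_steps": "infer_step",
    "audio_duration": "audio_duration",
    "seed": "manual_seed",
}


def to_generate_kwargs(config: dict) -> dict:
    """Keep only the keys with a known Quickstart counterpart and rename them."""
    return {kwarg: config[key] for key, kwarg in KEY_MAP.items() if key in config}


config = json.loads(CONFIG_JSON)
generate_kwargs = to_generate_kwargs(config)
# generate_kwargs could then be passed as model.generate(prompt=..., **generate_kwargs)
```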