Fix card: AUSteerSteeringController quickstart

c0a4f34 verified 2 days ago

1.75 kB

library_name: audio-interv
tags:
  - activation-steering
  - all
  - audio
  - austeer
  - diffusion
  - guitar-electronic
  - interpretability
  - music
  - steering

AUSteer — `guitar_electronic` (ACE-Step)

Per-(step, layer) sparse activation-momentum scores for the guitar_electronic concept on ACE-Step. At inference, AUSteerSteeringController adds alpha along the top-k most concept-discriminative bins.

Paper

TADA! Tuning Audio Diffusion Models through Activation Steering — https://huggingface.co/papers/2602.11910

Quickstart

from src.steering import SteerableACEModel, AUSteerSteeringController

model = SteerableACEModel(device="cuda")
model.pipeline.load()
ctrl = AUSteerSteeringController.from_pretrained(
    "lukasz-staniszewski/ace-step-austeer-guitar-electronic-all", alpha=15.0, k=256, mode="additive",
)

with model.steer(ctrl):
    audio = model.generate(
        prompt="instrumental music", lyrics="[inst]",
        audio_duration=10.0, infer_step=30, manual_seed=0,
    )

Generation config

{
  "method": "austeer",
  "concept": "guitar_electronic",
  "lyrics": "[inst]",
  "layers": "all",
  "layers_collected": [
    "tf0",
    "tf1",
    "tf2",
    "tf3",
    "tf4",
    "tf5",
    "tf6",
    "tf7",
    "tf8",
    "tf9",
    "tf10",
    "tf11",
    "tf12",
    "tf13",
    "tf14",
    "tf15",
    "tf16",
    "tf17",
    "tf18",
    "tf19",
    "tf20",
    "tf21",
    "tf22",
    "tf23"
  ],
  "num_inference_steps": 30,
  "audio_duration": 30.0,
  "seed": 10,
  "guidance_scale": 5.0,
  "guidance_scale_text": 0.0,
  "guidance_scale_lyric": 0.0,
  "guidance_interval": 1.0,
  "guidance_interval_decay": 0.0
}

AUSteer — guitar_electronic (ACE-Step)

Paper

Quickstart

Generation config

AUSteer — `guitar_electronic` (ACE-Step)