lukasz-staniszewski/ace-step-austeer-tempo-all

activation-steering

interpretability


lukasz-staniszewski committed 10 days ago

Commit f320676 · verified · 1 Parent(s): a4c9e53

Upload CAA steering vectors

Files changed (3)

README.md +75 -0
austeer.pkl +3 -0
config.json +40 -0

README.md ADDED

	@@ -0,0 +1,75 @@

+---
+library_name: audio-interv
+tags:
+  - ace-step
+  - activation-steering
+  - audio
+  - austeer
+  - diffusion
+  - interpretability
+  - music
+  - steering
+  - tempo
+---
+# Steering Vectors — `tempo` (ACE-Step)
+Steering vectors for the **tempo** concept on ACE-Step, computed via contrastive activation addition (CAA).
+## Quickstart
+```python
+from src.steering import SteerableACEModel
+from steering.caa import CAAController
+model = SteerableACEModel(device="cuda")
+ctrl = CAAController.from_pretrained("lukasz-staniszewski/ace-step-austeer-tempo-all", alpha=20)
+with model.steer(ctrl):
+    audio = model.generate(prompt="instrumental music", seed=0)
+```
+## Generation config
+```json
+{
+  "method": "austeer",
+  "concept": "tempo",
+  "lyrics": "[inst]",
+  "layers": "all",
+  "layers_collected": [
+    "tf0",
+    "tf1",
+    "tf2",
+    "tf3",
+    "tf4",
+    "tf5",
+    "tf6",
+    "tf7",
+    "tf8",
+    "tf9",
+    "tf10",
+    "tf11",
+    "tf12",
+    "tf13",
+    "tf14",
+    "tf15",
+    "tf16",
+    "tf17",
+    "tf18",
+    "tf19",
+    "tf20",
+    "tf21",
+    "tf22",
+    "tf23"
+  ],
+  "num_inference_steps": 30,
+  "audio_duration": 30.0,
+  "seed": 10,
+  "guidance_scale": 5.0,
+  "guidance_scale_text": 0.0,
+  "guidance_scale_lyric": 0.0,
+  "guidance_interval": 1.0,
+  "guidance_interval_decay": 0.0
+}
+```
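
The README above names contrastive activation addition (CAA) without showing what it computes. As a minimal sketch of the general technique, and not this repository's implementation, a CAA vector is commonly taken as the difference between the mean activations of a "positive" set (for example, fast-tempo prompts) and a "negative" set, then added to the hidden state scaled by `alpha`. The function names `caa_vector` and `apply_steering` and the toy activations are hypothetical; only `alpha=20` comes from the Quickstart.

```python
from statistics import fmean


def caa_vector(pos_acts, neg_acts):
    """Mean-difference steering vector from two lists of activation rows."""
    dims = range(len(pos_acts[0]))
    return [
        fmean(row[i] for row in pos_acts) - fmean(row[i] for row in neg_acts)
        for i in dims
    ]


def apply_steering(hidden, vector, alpha):
    """Add the scaled steering vector to a single hidden-state row."""
    return [h + alpha * v for h, v in zip(hidden, vector)]


# Toy 2-dimensional activations (illustrative values, not real model data).
pos = [[1.0, 0.5], [3.0, 0.5]]
neg = [[0.0, 0.5], [2.0, 0.5]]

v = caa_vector(pos, neg)
steered = apply_steering([0.0, 0.0], v, alpha=20)
```

In this toy case only the first dimension differs between the two sets, so the steering vector is nonzero only there and `alpha` controls how far the hidden state is pushed along it.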

austeer.pkl ADDED

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:feb0212c469bf15eb853bad5c1234775d9902818151f64aa26dfdc281106f5c9
+size 29551625
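
The three lines added for `austeer.pkl` are a Git LFS pointer, not the pickle itself: each line is a key followed by a space and a value. A small sketch of reading those fields follows; `parse_lfs_pointer` is a hypothetical helper written for illustration, not part of any Git LFS tooling.

```python
def parse_lfs_pointer(text):
    """Split a Git LFS pointer file into its version, hash, and size fields."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.partition(" ")
        fields[key] = value
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "hash_algo": algo,
        "digest": digest,
        "size": int(fields["size"]),  # size of the real file, in bytes
    }


pointer = """version https://git-lfs.github.com/spec/v1
oid sha256:feb0212c469bf15eb853bad5c1234775d9902818151f64aa26dfdc281106f5c9
size 29551625
"""

info = parse_lfs_pointer(pointer)
size_mib = info["size"] / 2**20
```

The `size` field refers to the stored object, so the actual `austeer.pkl` is roughly 28 MiB, and the `oid` digest can be compared against a SHA-256 of the downloaded file.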

config.json ADDED

	@@ -0,0 +1,40 @@

+{
+  "method": "austeer",
+  "concept": "tempo",
+  "lyrics": "[inst]",
+  "layers": "all",
+  "layers_collected": [
+    "tf0",
+    "tf1",
+    "tf2",
+    "tf3",
+    "tf4",
+    "tf5",
+    "tf6",
+    "tf7",
+    "tf8",
+    "tf9",
+    "tf10",
+    "tf11",
+    "tf12",
+    "tf13",
+    "tf14",
+    "tf15",
+    "tf16",
+    "tf17",
+    "tf18",
+    "tf19",
+    "tf20",
+    "tf21",
+    "tf22",
+    "tf23"
+  ],
+  "num_inference_steps": 30,
+  "audio_duration": 30.0,
+  "seed": 10,
+  "guidance_scale": 5.0,
+  "guidance_scale_text": 0.0,
+  "guidance_scale_lyric": 0.0,
+  "guidance_interval": 1.0,
+  "guidance_interval_decay": 0.0
+}
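
Before using these vectors, it can help to sanity-check the config. The sketch below assumes the layer names follow the contiguous `tf0` to `tf23` pattern seen in this commit; `check_config` and its rules are hypothetical and are not part of the `audio-interv` library.

```python
def check_config(cfg):
    """Return a list of human-readable problems found in a steering config."""
    problems = []
    layers = cfg["layers_collected"]
    # Assumption: layers are named tf0, tf1, ... with no gaps.
    expected = [f"tf{i}" for i in range(len(layers))]
    if layers != expected:
        problems.append("layers_collected is not a contiguous tf0..tfN list")
    if cfg["num_inference_steps"] <= 0:
        problems.append("num_inference_steps must be positive")
    if cfg["audio_duration"] <= 0:
        problems.append("audio_duration must be positive")
    return problems


# Values copied from the config.json in this commit.
cfg = {
    "method": "austeer",
    "concept": "tempo",
    "lyrics": "[inst]",
    "layers": "all",
    "layers_collected": [f"tf{i}" for i in range(24)],
    "num_inference_steps": 30,
    "audio_duration": 30.0,
    "seed": 10,
    "guidance_scale": 5.0,
}

problems = check_config(cfg)
```

An empty `problems` list means the config passed these checks. In practice the dictionary would come from `json.load` on the downloaded `config.json` rather than being written inline.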