Spaces:

ltx-community
/

ltx-2.3-beard-removal

Running on Zero

App Files Files Community

linoyts HF Staff commited on 8 days ago

Commit

dc2750b

verified ·

1 Parent(s): 4f0b478

Use precompiled AOTI transformer blocks (ZeroGPU speedup, STG-capable)

Browse files

Fuses the LoRA + loads the precompiled AOTI transformer blocks at the **root module level** (per ZeroGPU docs), using the **STG-capable** Group C repo (`ltx-community/LTX-2.3-Transformer-GroupC-STG-sm120-cu130-rb3`): the perturbation path is compiled as always-on tensor math, and a small per-block wrapper feeds a no-op ones mask when STG is off so one graph serves both. Keeps STG on (default stg_scale). Public graph-only repo → no token. Validated end-to-end on a day-to-night duplicate (coherent output, STG passes run clean).

Files changed (1) hide show

app.py +15 -1

app.py CHANGED Viewed

@@ -42,7 +42,21 @@ pipe.to("cuda")
 pipe.vae.enable_tiling()
 _lora_path = hf_hub_download(LORA_REPO, LORA_FILE, token=HF_TOKEN)
 pipe.load_lora_weights(load_file(_lora_path), adapter_name="shave")
-pipe.set_adapters("shave", LORA_SCALE)
 def _src_fps(path, default=FPS):

 pipe.vae.enable_tiling()
 _lora_path = hf_hub_download(LORA_REPO, LORA_FILE, token=HF_TOKEN)
 pipe.load_lora_weights(load_file(_lora_path), adapter_name="shave")
+pipe.fuse_lora(lora_scale=LORA_SCALE)
+pipe.unload_lora_weights()
+# AOTI (Group C / STG): load precompiled blocks at ROOT level. The graph always runs the
+# perturbation lerp; the wrapper feeds a no-op ones mask when None (non-STG blocks / main
+# pass) and forces all_perturbed=False. STG (block 28, perturbed pass) still gets the real mask.
+spaces.aoti_load(module=pipe.transformer, repo_id="ltx-community/LTX-2.3-Transformer-GroupC-STG-sm120-cu130-rb3")
+for _blk in pipe.transformer.transformer_blocks:
+    _compiled = _blk.forward
+    def _fwd(*a, _c=_compiled, **kw):
+        if kw.get("perturbation_mask", None) is None:
+            _hs = kw["hidden_states"]
+            kw["perturbation_mask"] = torch.ones((_hs.shape[0], 1, 1), device=_hs.device, dtype=_hs.dtype)
+        kw["all_perturbed"] = False
+        return _c(*a, **kw)
+    _blk.forward = _fwd
 def _src_fps(path, default=FPS):