Text Generation
Transformers
Safetensors
PyTorch
nvidia
two-tower
diffusion
mamba

Commit History

Add cached generate_ar (ST-AR baseline) + single-step AR/mock-AR context extend (stock parity)
c739325

fitsumreda Claude Opus 4.8 commited on

faster inferneece
b348e21

fitsumreda commited on

Two-tower mask diffusion: fix denoiser (adaLN norm order, bidirectional in-block attention, block-wise chunk-scan Mamba) + fp64 router; refresh README
a203471
verified

fitsumreda commited on

Upload folder using huggingface_hub
947a10f
verified

fitsumreda commited on