Text Generation
Transformers
Safetensors
PyTorch
nvidia
two-tower
diffusion
mamba
Nemotron-Labs-TwoTower-30B-A3B-Base-BF16 / modeling_nemotron_twotower.py

Commit History

Update model card (README) and tidy inference scaffolding
0ea6f1b

fitsumreda Claude Opus 4.8 commited on

Fix NaN corruption in long-context diffusion (fp32 denoiser SSM scan) + multi-request inference
67bf233

fitsumreda Claude Opus 4.8 commited on

Add cached generate_ar (ST-AR baseline) + single-step AR/mock-AR context extend (stock parity)
c739325

fitsumreda Claude Opus 4.8 commited on

faster inferneece
b348e21

fitsumreda commited on

Two-tower mask diffusion: fix denoiser (adaLN norm order, bidirectional in-block attention, block-wise chunk-scan Mamba) + fp64 router; refresh README
a203471
verified

fitsumreda commited on

Upload folder using huggingface_hub
947a10f
verified

fitsumreda commited on