Text Generation
Transformers
Safetensors
PyTorch
nvidia
two-tower
diffusion
mamba
Nemotron-Labs-TwoTower-30B-A3B-Base-BF16 / modeling_nemotron_twotower.py

Commit History

faster inferneece
b348e21

fitsumreda commited on

Two-tower mask diffusion: fix denoiser (adaLN norm order, bidirectional in-block attention, block-wise chunk-scan Mamba) + fp64 router; refresh README
a203471
verified

fitsumreda commited on

Upload folder using huggingface_hub
947a10f
verified

fitsumreda commited on