Corpse

Dessicated

11 244

AI & ML interests

None yet

Recent Activity

upvoted a collection 4 days ago

DFlash

liked a model 6 days ago

nvidia/GLM-5.2-NVFP4

upvoted an article 7 days ago

Accelerating Transformers Fine-Tuning with NVIDIA NeMo AutoModel

Organizations

upvoted a collection 4 days ago

DFlash

Collection

Block Diffusion for Flash Speculative Decoding • 23 items • Updated 4 days ago • 142

upvoted an article 7 days ago

Article

Accelerating Transformers Fine-Tuning with NVIDIA NeMo AutoModel

nvidia • 8 days ago • 34

upvoted an article 15 days ago

Article

GLM-5.2: Built for Long-Horizon Tasks

zai-org • 15 days ago • 117

upvoted a collection 2 months ago

MiMo-V2.5

Collection

4 items • Updated Apr 27 • 90

upvoted an article 7 months ago

Article

Tokenization in Transformers v5: Simpler, Clearer, and More Modular

itazap, ariG23498, ArthurZ, sergiopaniego, merve, pcuenq • Dec 18, 2025 • 125

upvoted a collection 7 months ago

Nemotron-Cascade

Collection

Scaling Cascaded Reinforcement Learning for General-Purpose Reasoning Models • 14 items • Updated 21 days ago • 55

upvoted 2 articles 7 months ago

Article

Nemotron 3 Nano - A new Standard for Efficient, Open, and Intelligent Agentic Models

nvidia • Dec 15, 2025 • 113

Article

Transformers v5: Simple model definitions powering the AI ecosystem

lysandre, ArthurZ, cyrilvallez, reach-vb • Dec 1, 2025 • 312

upvoted an article 11 months ago

Article

Welcome GPT OSS, the new open-source model family from OpenAI!

reach-vb, pcuenq, lewtun, clem, Rocketknight1, clefourrier, celinah, Wauplin, marcsun13, pagezyhf, ahadnagy, joaogante • Aug 5, 2025 • 513

upvoted an article about 1 year ago

Article

All LLMs Will Be Sparse BitNet Hybrids

codys12 • May 14, 2025 • 16

upvoted a collection about 1 year ago

Gemma 3 QAT

Collection

Quantization Aware Trained (QAT) Gemma 3 checkpoints. The model preserves similar quality as half precision while using 3x less memory • 15 items • Updated Mar 12 • 219
