GSQ Collection GSQ: Highly-Accurate Low-Precision Scalar Quantization for LLMs via Gumbel-Softmax Sampling, https://huggingface.co/papers/2604.18556 • 9 items • Updated 28 days ago • 9
meta-llama/Llama-4-Scout-17B-16E-Instruct Image-Text-to-Text • 109B • Updated May 22, 2025 • 713k • • 1.31k
meta-llama/Llama-4-Maverick-17B-128E-Instruct-FP8 Image-Text-to-Text • 402B • Updated May 22, 2025 • 91.7k • • 171