view article Article MTEB Leaderboard: From a slow demo to feature-rich leaderboard Samoed • 1 day ago • 18
Harness-1: Reinforcement Learning for Search Agents with State-Externalizing Harnesses Paper • 2606.02373 • Published 13 days ago • 52
view article Article Designing the hf CLI as an agent-optimized way to work with the Hub celinah, Wauplin • 10 days ago • 56
Less Is More? When Dataset Context Hurts LLM-Generated Dataset Descriptions Paper • 2606.02334 • Published 13 days ago • 1
view article Article Shipping a Trillion Parameters With a Hub Bucket: Delta Weight Sync in TRL +6 aminediroHF, qgallouedec, kashif, lewtun, edbeeching, albertvillanova, lvwerra, sergiopaniego • 18 days ago • 41
view article Article Beyond LLMs: Why Scalable Enterprise AI Adoption Depends on Agent Logic ibm-research • 12 days ago • 85
view article Article MONET: Lowering the Barrier to World Class Image Generation Research jasperai • 16 days ago • 10
MONET - Massive Open Non-redundant, Enriched, Text-to-image Collection A curated, deduped & recaptioned open image–text dataset of 104.9M samples released under the Apache2.0 licence. https://huggingface.co/blog/jasperai/ • 4 items • Updated 16 days ago • 11
MiniCPM5 Collection A SOTA 1B on-device LLM, small yet powerful. • 11 items • Updated 19 days ago • 26
view article Article Why Open Models Are the Only Sustainable Way to Teach AI penelopegittos • 22 days ago • 8
Mega-ASR: Towards In-the-wild^2 Speech Recognition via Scaling up Real-world Acoustic Simulation Paper • 2605.19833 • Published 26 days ago • 134
Scaling Properties of Continuous Diffusion Spoken Language Models Paper • 2604.24416 • Published Apr 27 • 1