Beyond RAG for Agent Memory: Retrieval by Decoupling and Aggregation Paper • 2602.02007 • Published Feb 2 • 19
On the Direction of RLVR Updates for LLM Reasoning: Identification and Exploitation Paper • 2603.22117 • Published Mar 23 • 29
Gemma 3 QAT Collection Quantization Aware Trained (QAT) Gemma 3 checkpoints. The model preserves similar quality as half precision while using 3x less memory • 15 items • Updated Mar 12 • 219
Article 🦸🏻#9: Does AI Remember? The Role of Memory in Agentic Workflows Kseniase • Feb 2, 2025 • 26
Overcoming Data Scarcity in Multi-Dialectal Arabic ASR via Whisper Fine-Tuning Paper • 2506.02627 • Published Jun 3, 2025 • 3
finetune-ar-dialects Collection Models for the thesis titled: "The Effects of Fine-Tuning on the ASR Performance of Dialectal Arabic". • 17 items • Updated May 20, 2024 • 3
Article Welcome EmbeddingGemma, Google's new efficient embedding model +4 tomaarsen, Xenova, alvarobartt, ariG23498, pcuenq, sergiopaniego • Sep 4, 2025 • 275
LightRAG: Simple and Fast Retrieval-Augmented Generation Paper • 2410.05779 • Published Oct 8, 2024 • 40
SmolVLM 256M & 500M Collection Collection for models & demos for even smoller SmolVLM release • 11 items • Updated Mar 2 • 86