Grouped Query Experts: Mixture-of-Experts on GQA Self-Attention Paper • 2606.20945 • Published 6 days ago • 55
Exploring the Design Space of Reward Backpropagation for Flow Matching Paper • 2606.11075 • Published 15 days ago • 9
FrontiersMind/Nandi-Mini-V1.1-600M-Early-Checkpoint-250GT Text Generation • 0.6B • Updated 28 days ago • 514 • 11
Web Retrieval-Aware Chunking (W-RAC) for Efficient and Cost-Effective Retrieval-Augmented Generation Systems Paper • 2604.04936 • Published Jan 8 • 26
Web Retrieval-Aware Chunking (W-RAC) for Efficient and Cost-Effective Retrieval-Augmented Generation Systems Paper • 2604.04936 • Published Jan 8 • 26
Web Retrieval-Aware Chunking (W-RAC) for Efficient and Cost-Effective Retrieval-Augmented Generation Systems Paper • 2604.04936 • Published Jan 8 • 26
sentence-transformers/all-MiniLM-L6-v2 Sentence Similarity • 22.7M • Updated 23 days ago • 249M • • 4.99k
The Instruction Gap: LLMs get lost in Following Instruction Paper • 2601.03269 • Published Dec 19, 2025 • 8