Models
Datasets
Spaces
Buckets new
Docs
Enterprise
Pricing
- Website
- Community
- Solutions
Log In
Sign Up

Collections

Discover the best community collections!

Collections including paper arxiv:2601.07372

WTF GENIUS PAPERS

Papers that made me appreciate my major and my life a little more. obs=Observation, innov=Innovation. Most papers are abt improving tiny models.

Continuous Latent Diffusion Language Model

Paper • 2605.06548 • Published May 7 • 80
Scaling Latent Reasoning via Looped Language Models

Paper • 2510.25741 • Published Oct 29, 2025 • 231
Scaling up Test-Time Compute with Latent Reasoning: A Recurrent Depth Approach

Paper • 2502.05171 • Published Feb 7, 2025 • 156
Pretraining Language Models to Ponder in Continuous Space

Paper • 2505.20674 • Published May 27, 2025 • 3

May Papers -- Best of Best

The papers I've found so far in May that I consider best of the best, whether that be from the perspective of providing a new perspective on a probl..

MegaFlow: Large-Scale Distributed Orchestration System for the Agentic Era

Paper • 2601.07526 • Published Jan 12 • 23
Intelligent AI Delegation

Paper • 2602.11865 • Published Feb 12 • 16
ENGRAM: Effective, Lightweight Memory Orchestration for Conversational Agents

Paper • 2511.12960 • Published Nov 17, 2025 • 1
CityRAG: Stepping Into a City via Spatially-Grounded Video Generation

Paper • 2604.19741 • Published Apr 21 • 17

DVPS Scientific Watch

Collection of external scientific material relevant to the project

HuggingFaceFW/finetranslations

Viewer • Updated Jan 9 • 3.33B • 10.8k • 294
LLM-Inference-Bench: Inference Benchmarking of Large Language Models on AI Accelerators

Paper • 2411.00136 • Published Oct 31, 2024
The Illusion of Readiness in Health AI

Paper • 2509.18234 • Published Sep 22, 2025 • 1
The Roots of Performance Disparity in Multilingual Language Models: Intrinsic Modeling Difficulty or Design Choices?

Paper • 2601.07220 • Published Jan 12

Xolver: Multi-Agent Reasoning with Holistic Experience Learning Just Like an Olympiad Team

Paper • 2506.14234 • Published Jun 17, 2025 • 41
MoTE: Mixture of Ternary Experts for Memory-efficient Large Multimodal Models

Paper • 2506.14435 • Published Jun 17, 2025 • 7
Mem0: Building Production-Ready AI Agents with Scalable Long-Term Memory

Paper • 2504.19413 • Published Apr 28, 2025 • 60
MemOS: A Memory OS for AI System

Paper • 2507.03724 • Published Jul 4, 2025 • 167

Deepseek Papers

Deepseek papers collection

DreamCraft3D: Hierarchical 3D Generation with Bootstrapped Diffusion Prior

Paper • 2310.16818 • Published Oct 25, 2023 • 33
DeepSeek LLM: Scaling Open-Source Language Models with Longtermism

Paper • 2401.02954 • Published Jan 5, 2024 • 56
DeepSeekMoE: Towards Ultimate Expert Specialization in Mixture-of-Experts Language Models

Paper • 2401.06066 • Published Jan 11, 2024 • 62
DeepSeek-Coder: When the Large Language Model Meets Programming -- The Rise of Code Intelligence

Paper • 2401.14196 • Published Jan 25, 2024 • 73

Ex-Omni: Enabling 3D Facial Animation Generation for Omni-modal Large Language Models

Paper • 2602.07106 • Published Feb 6 • 11
Conditional Memory via Scalable Lookup: A New Axis of Sparsity for Large Language Models

Paper • 2601.07372 • Published Jan 12 • 50

Pass@k Training for Adaptively Balancing Exploration and Exploitation of Large Reasoning Models

Paper • 2508.10751 • Published Aug 14, 2025 • 29
Reinforcement Pre-Training

Paper • 2506.08007 • Published Jun 9, 2025 • 265
MCP-Universe: Benchmarking Large Language Models with Real-World Model Context Protocol Servers

Paper • 2508.14704 • Published Aug 20, 2025 • 43
AgentFly: Fine-tuning LLM Agents without Fine-tuning LLMs

Paper • 2508.16153 • Published Aug 22, 2025 • 162

Good research papers

Good research papers collection

The Leaderboard Illusion

Paper • 2504.20879 • Published Apr 29, 2025 • 72
SmolVLM: Redefining small and efficient multimodal models

Paper • 2504.05299 • Published Apr 7, 2025 • 208
Seedance 1.0: Exploring the Boundaries of Video Generation Models

Paper • 2506.09113 • Published Jun 10, 2025 • 109
Small Language Models are the Future of Agentic AI

Paper • 2506.02153 • Published Jun 2, 2025 • 25

WTF GENIUS PAPERS

Papers that made me appreciate my major and my life a little more. obs=Observation, innov=Innovation. Most papers are abt improving tiny models.

Continuous Latent Diffusion Language Model

Paper • 2605.06548 • Published May 7 • 80
Scaling Latent Reasoning via Looped Language Models

Paper • 2510.25741 • Published Oct 29, 2025 • 231
Scaling up Test-Time Compute with Latent Reasoning: A Recurrent Depth Approach

Paper • 2502.05171 • Published Feb 7, 2025 • 156
Pretraining Language Models to Ponder in Continuous Space

Paper • 2505.20674 • Published May 27, 2025 • 3

Deepseek Papers

Deepseek papers collection

DreamCraft3D: Hierarchical 3D Generation with Bootstrapped Diffusion Prior

Paper • 2310.16818 • Published Oct 25, 2023 • 33
DeepSeek LLM: Scaling Open-Source Language Models with Longtermism

Paper • 2401.02954 • Published Jan 5, 2024 • 56
DeepSeekMoE: Towards Ultimate Expert Specialization in Mixture-of-Experts Language Models

Paper • 2401.06066 • Published Jan 11, 2024 • 62
DeepSeek-Coder: When the Large Language Model Meets Programming -- The Rise of Code Intelligence

Paper • 2401.14196 • Published Jan 25, 2024 • 73

May Papers -- Best of Best

The papers I've found so far in May that I consider best of the best, whether that be from the perspective of providing a new perspective on a probl..

MegaFlow: Large-Scale Distributed Orchestration System for the Agentic Era

Paper • 2601.07526 • Published Jan 12 • 23
Intelligent AI Delegation

Paper • 2602.11865 • Published Feb 12 • 16
ENGRAM: Effective, Lightweight Memory Orchestration for Conversational Agents

Paper • 2511.12960 • Published Nov 17, 2025 • 1
CityRAG: Stepping Into a City via Spatially-Grounded Video Generation

Paper • 2604.19741 • Published Apr 21 • 17

Ex-Omni: Enabling 3D Facial Animation Generation for Omni-modal Large Language Models

Paper • 2602.07106 • Published Feb 6 • 11
Conditional Memory via Scalable Lookup: A New Axis of Sparsity for Large Language Models

Paper • 2601.07372 • Published Jan 12 • 50

DVPS Scientific Watch

Collection of external scientific material relevant to the project

HuggingFaceFW/finetranslations

Viewer • Updated Jan 9 • 3.33B • 10.8k • 294
LLM-Inference-Bench: Inference Benchmarking of Large Language Models on AI Accelerators

Paper • 2411.00136 • Published Oct 31, 2024
The Illusion of Readiness in Health AI

Paper • 2509.18234 • Published Sep 22, 2025 • 1
The Roots of Performance Disparity in Multilingual Language Models: Intrinsic Modeling Difficulty or Design Choices?

Paper • 2601.07220 • Published Jan 12

Pass@k Training for Adaptively Balancing Exploration and Exploitation of Large Reasoning Models

Paper • 2508.10751 • Published Aug 14, 2025 • 29
Reinforcement Pre-Training

Paper • 2506.08007 • Published Jun 9, 2025 • 265
MCP-Universe: Benchmarking Large Language Models with Real-World Model Context Protocol Servers

Paper • 2508.14704 • Published Aug 20, 2025 • 43
AgentFly: Fine-tuning LLM Agents without Fine-tuning LLMs

Paper • 2508.16153 • Published Aug 22, 2025 • 162

Xolver: Multi-Agent Reasoning with Holistic Experience Learning Just Like an Olympiad Team

Paper • 2506.14234 • Published Jun 17, 2025 • 41
MoTE: Mixture of Ternary Experts for Memory-efficient Large Multimodal Models

Paper • 2506.14435 • Published Jun 17, 2025 • 7
Mem0: Building Production-Ready AI Agents with Scalable Long-Term Memory

Paper • 2504.19413 • Published Apr 28, 2025 • 60
MemOS: A Memory OS for AI System

Paper • 2507.03724 • Published Jul 4, 2025 • 167

Good research papers

Good research papers collection

The Leaderboard Illusion

Paper • 2504.20879 • Published Apr 29, 2025 • 72
SmolVLM: Redefining small and efficient multimodal models

Paper • 2504.05299 • Published Apr 7, 2025 • 208
Seedance 1.0: Exploring the Boundaries of Video Generation Models

Paper • 2506.09113 • Published Jun 10, 2025 • 109
Small Language Models are the Future of Agentic AI

Paper • 2506.02153 • Published Jun 2, 2025 • 25

Company

TOS Privacy About Careers

Website

Models Datasets Spaces Pricing Docs