LatentSkill: From In-Context Textual Skills to In-Weight Latent Skills for LLM Agents Paper • 2606.06087 • Published 14 days ago • 63
PhotoBench Family Collection Current and future releases for paper PhotoBench: Beyond Visual Matching Towards Personalized Intent-Driven Photo Retrieval • 4 items • Updated Apr 23 • 1
Turing Test on Screen: A Benchmark for Mobile GUI Agent Humanization Paper • 2604.09574 • Published Feb 24 • 30
StepORLM: A Self-Evolving Framework With Generative Process Supervision For Operations Research Language Models Paper • 2509.22558 • Published Sep 26, 2025 • 4
Externalization in LLM Agents: A Unified Review of Memory, Skills, Protocols and Harness Engineering Paper • 2604.08224 • Published Apr 9 • 52
PhotoBench: Beyond Visual Matching Towards Personalized Intent-Driven Photo Retrieval Paper • 2603.01493 • Published Mar 2 • 21