CIBench: Evaluating Your LLMs with a Code Interpreter Plugin Paper β’ 2407.10499 β’ Published Jul 15, 2024
MIG: Automatic Data Selection for Instruction Tuning by Maximizing Information Gain in Semantic Space Paper β’ 2504.13835 β’ Published Apr 18, 2025 β’ 38
DataChef: Cooking Up Optimal Data Recipes for LLM Adaptation via Reinforcement Learning Paper β’ 2602.11089 β’ Published Feb 11 β’ 18
Intern-S1-Pro: Scientific Multimodal Foundation Model at Trillion Scale Paper β’ 2603.25040 β’ Published Mar 26 β’ 133
TREX: Automating LLM Fine-tuning via Agent-Driven Tree-based Exploration Paper β’ 2604.14116 β’ Published Apr 15 β’ 13
TREX: Automating LLM Fine-tuning via Agent-Driven Tree-based Exploration Paper β’ 2604.14116 β’ Published Apr 15 β’ 13
Running on CPU Upgrade Agents 1.02k Open VLM Leaderboard π 1.02k VLMEvalKit Evaluation Results Collection