Running 3.9k The Ultra-Scale Playbook π 3.9k The ultimate guide to training LLM on large GPU Clusters
Running Agents 44 Open LMM Reasoning Leaderboard π₯ 44 A Leaderboard that demonstrates LMM reasoning capabilities
Running Agents 432 Reward Bench Leaderboard π 432 Explore and compare model scores on RewardBench benchmarks
Running on CPU Upgrade 14k Open LLM Leaderboard π 14k Track, rank and evaluate open LLMs and chatbots
Running Agents Featured 135 Open VLM Video Leaderboard π 135 VLMEvalKit Eval Results in video understanding benchmark
Running on CPU Upgrade Agents 1.02k Open VLM Leaderboard π 1.02k VLMEvalKit Evaluation Results Collection