Open Agent Leaderboard

community

https://www.exgentic.ai

Activity Feed

AI & ML interests

None defined yet.

Recent Activity

Elron updated a dataset 4 days ago

open-agent-leaderboard/traces

Elron updated a Space 4 days ago

open-agent-leaderboard/README

Elron updated a Space 4 days ago

open-agent-leaderboard/blog

View all activity

Organization Card

Community About org cards

Open Agent Leaderboard

An open benchmark for comparing full AI agent systems across diverse real-world tasks. Reports both quality and cost.

Unlike model-only benchmarks, we evaluate the complete agent — the model, the tools, the planning strategy, the error recovery — as a single system. The same model can produce very different results depending on the agent wrapped around it.

Website: exgentic.ai
Results: open-agent-leaderboard/results
Leaderboard: open-agent-leaderboard/leaderboard
Blog: open-agent-leaderboard/blog
Framework: Exgentic
Paper: arXiv:2602.22953

Submit results

Run evaluations using Exgentic and open a PR on the results dataset.

Collections 1

spaces 3

The Open Agent Leaderboard

📊

Compare AI agents' performance and cost across benchmarks

Open Agent Leaderboard

🤖

Explore AI agents' performance leaderboard and efficiency chart

models 5

datasets 3

open-agent-leaderboard/traces

Traces • Updated 4 days ago • 10.3k • 95

open-agent-leaderboard/results

Benchmark • Updated 4 days ago • 150 • 216 • 5

open-agent-leaderboard/agent-cards

Updated Mar 30 • 36

AI & ML interests

Recent Activity

Team members 1

Open Agent Leaderboard

Submit results

Collections 1

Open Agent Leaderboard

The Open Agent Leaderboard

Open Agent Leaderboard

The Open Agent Leaderboard

spaces 3 Sort: Recently updated

The Open Agent Leaderboard

Open Agent Leaderboard

models 5 Sort: Recently updated

datasets 3 Sort: Recently updated

spaces 3

models 5

datasets 3