--- title: Evals & LLM-as-judge emoji: ⚖️ colorFrom: yellow colorTo: red sdk: gradio sdk_version: 4.44.0 python_version: "3.11" app_file: app.py pinned: false --- # Evals & LLM-as-judge Score model answers against references with an LLM judge. Bring your own [OpenRouter](https://openrouter.ai/keys) key. Source & write-up: https://github.com/shenmali/agentic-ai-first/tree/main/demos/04-evals-llm-as-judge