Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Website
Tasks
HuggingChat
Collections
Languages
Organizations
Community
Blog
Posts
Daily Papers
Learn
Discord
Forum
GitHub
Solutions
Team & Enterprise
Hugging Face PRO
Enterprise Support
Inference Providers
Inference Endpoints
Storage Buckets
Log In
Sign Up
Spaces:
UII-AI
/
MedVidBench-Leaderboard
like
6
Running
App
Files
Files
Community
1
Fetching metadata from the HF Docker repository...
18339c0
MedVidBench-Leaderboard
301 kB
Ctrl+K
Ctrl+K
4 contributors
History:
62 commits
MedGRPO Team
Fix evaluate_all_pai to pass --skip-llm-judge to task main() functions
18339c0
4 months ago
evaluation
Fix evaluate_all_pai to pass --skip-llm-judge to task main() functions
4 months ago
.gitattributes
1.63 kB
Upload ground_truth.json
5 months ago
.gitignore
163 Bytes
update name
4 months ago
.pre-commit-config.yaml
Safe
1.53 kB
Duplicate from gradio-templates/leaderboard
5 months ago
CODE_VERIFICATION_REPORT.md
11.4 kB
Add explicit flush debug
4 months ago
LEADERBOARD_FORMATS.md
5.95 kB
add merged format
4 months ago
README.md
6.18 kB
update name
4 months ago
app.py
47.6 kB
Add semantic similarity matching for Next Action evaluation
4 months ago
requirements.txt
313 Bytes
Add matplotlib to requirements
4 months ago
test_llm_judge.py
2.66 kB
Add --skip-llm-judge flag for faster evaluation
4 months ago