video/image - a dbest111 Collection

Models
Datasets
Spaces
Buckets new
Docs
Enterprise
Pricing
- Website
- Community
- Solutions
Log In
Sign Up

dbest111 's Collections

video/image

updated Jul 24, 2025

google/vit-base-patch16-224

Image Classification • 86.6M • Updated Sep 5, 2023 • 5.53M • • 979
OpenGVLab/internimage_g_jointto22k_384

Image Classification • 3B • Updated Mar 25, 2025 • 14 • 1
chancharikm/qwen2.5-vl-72b-cam-motion

Video-Text-to-Text • 73B • Updated Sep 19, 2025 • 103 • 1
lmms-lab/Aero-1-Audio

Text Generation • 2B • Updated Jun 7, 2025 • 680 • 91
mipal/AVATAR

Updated Nov 3, 2025 • 53 • 1
FAVOR-Bench/FAVOR

Viewer • Updated May 11 • 27.1k • 1.19k • 3
lmms-lab/VideoMMMU

Viewer • Updated May 5, 2025 • 900 • 1.71k • 14
moonshotai/Kimi-VL-A3B-Thinking-2506

Image-Text-to-Text • 16B • Updated Jan 30 • 6.64k • 362
lmms-lab/llava-critic-113k

Viewer • Updated Oct 5, 2024 • 113k • 867 • 28
lmms-lab/M4-Instruct-Data

Updated Jul 21, 2024 • 701 • 79
lmms-lab/llava-next-interleave-qwen-7b

Text Generation • 8B • Updated Jul 24, 2024 • 146 • 27
lmms-lab/LLaVA-OneVision-Data

Viewer • Updated May 24, 2025 • 3.94M • 13.4k • 237
avalab/syndicom

Viewer • Updated May 10, 2024 • 19.2k • 4
avalab/iTBLS

Viewer • Updated Jan 17, 2025 • 12.5k • 18
Multimodal Attention Merging for Improved Speech Recognition and Audio Event Classification

Paper • 2312.14378 • Published Dec 22, 2023
avalab/cTBLS_knowledge_retriever

Updated Jan 12, 2024
avalab/cTBLS_encoder

Updated Apr 27, 2023
CraftJarvis/minecraft-vla-sft

Viewer • Updated Mar 21, 2025 • 3.78M • 966 • 10

Collection guide
Browse collections

Company

TOS Privacy About Careers

Website

Models Datasets Spaces Pricing Docs