LocateAnything: Fast and High-Quality Vision-Language Grounding with Parallel Box Decoding Paper • 2605.27365 • Published 26 days ago • 143
Running on Zero MCP 1.77k Wan2.2 14B Fast Preview 🐌 1.77k generate a video from an image with a text prompt
Running on Zero MCP Featured 217 Wan2.2 14B Fast Preview 🐌 217 generate a video from an image with a text prompt
Running on CPU Upgrade Featured 402 ML Intern 🤖 402 Explore machine learning tasks via an interactive web app
Kronos: A Foundation Model for the Language of Financial Markets Paper • 2508.02739 • Published Aug 2, 2025 • 43
Gemma 4 Collection Gemma 4 is Google's new model family including including E2B, E4B, 26B-A4B, and 31B. • 36 items • Updated 5 days ago • 221
Jackrong/Qwen3.5-27B-Claude-4.6-Opus-Reasoning-Distilled Image-Text-to-Text • 28B • Updated Apr 6 • 147k • • 2.89k
Running Featured 134 Voxtral Realtime WebGPU 💬 134 Real-time speech transcription, entirely in your browser.