zhaohanlin's picture

7

zhaohanlin

ultrazhl

·

AI & ML interests

None yet

Recent Activity

upvoted an article 13 days ago

Is using a validation set useful for end-to-end learning in robotics?

upvoted a paper 26 days ago

Qwen-VLA: Unifying Vision-Language-Action Modeling across Tasks, Environments, and Robot Embodiments

upvoted a paper 10 months ago

UI-Venus Technical Report: Building High-performance UI Agents with RFT

View all activity

Organizations

None yet

upvoted an article 13 days ago

Article

Is using a validation set useful for end-to-end learning in robotics?

m1b

•

Dec 1, 2024

• 16

upvoted a paper 26 days ago

Qwen-VLA: Unifying Vision-Language-Action Modeling across Tasks, Environments, and Robot Embodiments

Paper • 2605.30280 • Published 28 days ago • 146

upvoted a paper 10 months ago

UI-Venus Technical Report: Building High-performance UI Agents with RFT

Paper • 2508.10833 • Published Aug 14, 2025 • 46

upvoted 2 collections over 1 year ago

Molmo

Artifacts for open multimodal language models. • 5 items • Updated Dec 23, 2025 • 310

UI Agent

a collection of algorithmic agents for user interfaces/interactions, program synthesis, and robotics • 497 items • Updated 2 days ago • 69

upvoted a paper over 1 year ago

MM1.5: Methods, Analysis & Insights from Multimodal LLM Fine-tuning

Paper • 2409.20566 • Published Sep 30, 2024 • 54

upvoted a paper almost 2 years ago

VisualAgentBench: Towards Large Multimodal Models as Visual Foundation Agents

Paper • 2408.06327 • Published Aug 12, 2024 • 17

authored a paper almost 2 years ago

VisualAgentBench: Towards Large Multimodal Models as Visual Foundation Agents

Paper • 2408.06327 • Published Aug 12, 2024 • 17

authored a paper about 2 years ago

ChatGLM: A Family of Large Language Models from GLM-130B to GLM-4 All Tools

Paper • 2406.12793 • Published Jun 18, 2024 • 35