Hugging Face
rayruiyang's Collections
VST
Updated 9 days ago
A comprehensive framework designed to cultivate VLMs with human-like visuospatial abilities.
Upvote
6
rayruiyang/VST-3B-RL • Image-Text-to-Text • 4B • Updated Nov 11, 2025 • 301 • 3
rayruiyang/VST-3B-SFT • Image-Text-to-Text • 4B • Updated Nov 11, 2025 • 90
rayruiyang/VST-7B-SFT • Image-Text-to-Text • 8B • Updated Nov 11, 2025 • 84
rayruiyang/VST-7B-RL • Image-Text-to-Text • 8B • Updated Nov 11, 2025 • 91
Visual Spatial Tuning • Paper • 2511.05491 • Published Nov 7, 2025 • 53
rayruiyang/vst_3d_grounding_benchmark • Preview • Updated Feb 1 • 24
rayruiyang/vst_500k • Viewer • Updated Mar 13 • 563k • 4.22k • 4