minhnguyent546/ViCLIP-OT
Feature Extraction • 0.2B • Updated • 10 • 2
ViCLIP-OT: The First Foundation Vision-Language Model for Vietnamese Image–Text Retrieval with Optimal Transport
Note Model trained with CLIP + SIGROT objective
Note Model trained with SigLIP + SIGROT objective
Note Checkpoints and other artifacts for ViCLIP-OT