zhixiangwei/VLM-1B
Updated • 487 • 4
How to use zhixiangwei/vlm1b-hqclip-xlarge-vitl14-clipa with OpenCLIP:
import open_clip
model, preprocess_train, preprocess_val = open_clip.create_model_and_transforms('hf-hub:zhixiangwei/vlm1b-hqclip-xlarge-vitl14-clipa')
tokenizer = open_clip.get_tokenizer('hf-hub:zhixiangwei/vlm1b-hqclip-xlarge-vitl14-clipa')Pretrain CLIP-L-14 on VLM-1B using CLIPA.
| Dataset | Performance |
|---|---|
| ImageNet 1k | 0.78628 |
| ImageNet V2 | 0.7132 |
| ImageNet-A | 0.662 |
| ImageNet-O | 0.4085 |
| ImageNet-R | 0.900967 |
| ImageNet Sketch | 0.673643 |
| ObjectNet | 0.716324 |
| IN-shifts | 0.679106 |
| VTAB | 0.604791 |
| MSCOCO | 0.580741 |
| Flickr30k | 0.841 |
| WinoGAViL | 0.509853 |
| Retrieval | 0.643865 |
| Avg. | 0.637904 |