Image-to-Text
Transformers
Safetensors
English
blip-2
visual-question-answering
vision
image-captioning
Instructions to use anonymoussubmission2024/vlrm-blip2-opt-2.7b with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use anonymoussubmission2024/vlrm-blip2-opt-2.7b with Transformers:
# Use a pipeline as a high-level helper # Warning: Pipeline type "image-to-text" is no longer supported in transformers v5. # You must load the model directly (see below) or downgrade to v4.x with: # 'pip install "transformers<5.0.0' from transformers import pipeline pipe = pipeline("image-to-text", model="anonymoussubmission2024/vlrm-blip2-opt-2.7b")# Load model directly from transformers import AutoProcessor, AutoModelForMultimodalLM processor = AutoProcessor.from_pretrained("anonymoussubmission2024/vlrm-blip2-opt-2.7b") model = AutoModelForMultimodalLM.from_pretrained("anonymoussubmission2024/vlrm-blip2-opt-2.7b") - Notebooks
- Google Colab
- Kaggle
Upload train_videos_ids.txt
Browse files- .gitattributes +1 -0
- train_videos_ids.txt +3 -0
.gitattributes
CHANGED
|
@@ -34,3 +34,4 @@ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
|
|
| 34 |
*.zst filter=lfs diff=lfs merge=lfs -text
|
| 35 |
*tfevents* filter=lfs diff=lfs merge=lfs -text
|
| 36 |
runs/vlrm filter=lfs diff=lfs merge=lfs -text
|
|
|
|
|
|
| 34 |
*.zst filter=lfs diff=lfs merge=lfs -text
|
| 35 |
*tfevents* filter=lfs diff=lfs merge=lfs -text
|
| 36 |
runs/vlrm filter=lfs diff=lfs merge=lfs -text
|
| 37 |
+
train_videos_ids.txt filter=lfs diff=lfs merge=lfs -text
|
train_videos_ids.txt
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:b10e1d2e05a5ef90d5a292e5a4c6c440b726ff10d1eaa49123b5fbb558b7f1df
|
| 3 |
+
size 123148792
|