blip2-lora-finetune

This model is a fine-tuned version of Salesforce/blip2-opt-2.7b on the None dataset. It achieves the following results on the evaluation set:

Model description

More information needed

More information needed

More information needed

The following hyperparameters were used during training:

learning_rate: 2e-05
train_batch_size: 2
eval_batch_size: 2
seed: 42
gradient_accumulation_steps: 8
total_train_batch_size: 16
optimizer: Use OptimizerNames.ADAMW_TORCH_FUSED with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
lr_scheduler_type: linear
lr_scheduler_warmup_steps: 0.05
num_epochs: 15

Training Loss	Epoch	Step	Validation Loss
83.2389	0.5979	50	10.0021
18.5524	1.1913	100	1.9922
9.6493	1.7892	150	1.0606
8.5570	2.3827	200	0.9584
7.3530	2.9806	250	0.9102
8.6532	3.5740	300	0.8919
6.9255	4.1674	350	0.8629
7.2563	4.7653	400	0.8558
7.6426	5.3587	450	0.8349
7.3618	5.9567	500	0.8488
7.1585	6.5501	550	0.8302
8.3336	7.1435	600	0.8298
6.9589	7.7414	650	0.8193
7.0150	8.3348	700	0.8237
7.3567	8.9327	750	0.8238
8.1129	9.5262	800	0.8183
6.8732	10.1196	850	0.8128
6.4406	10.7175	900	0.8155
6.8474	11.3109	950	0.8236
6.5659	11.9088	1000	0.8157
7.1021	12.5022	1050	0.8156
7.2930	13.0957	1100	0.8174
6.9147	13.6936	1150	0.8145
7.0703	14.2870	1200	0.8173
6.6065	14.8849	1250	0.8155

Inference Providers NEW

This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Base model

Adapter

(40)

this model