ccdv/arxiv-summarization
Viewer • Updated • 432k • 8.59k • 124
How to use farleyknight-org-username/arxiv-summarization-t5-small with Transformers:
# Load model directly
from transformers import AutoTokenizer, AutoModelForMultimodalLM
tokenizer = AutoTokenizer.from_pretrained("farleyknight-org-username/arxiv-summarization-t5-small")
model = AutoModelForMultimodalLM.from_pretrained("farleyknight-org-username/arxiv-summarization-t5-small")This model is a fine-tuned version of t5-small on the ccdv/arxiv-summarization dataset. It achieves the following results on the evaluation set:
More information needed
More information needed
More information needed
The following hyperparameters were used during training:
| Training Loss | Epoch | Step | Validation Loss | Rouge1 | Rouge2 | Rougel | Rougelsum | Gen Len |
|---|---|---|---|---|---|---|---|---|
| 2.5925 | 0.39 | 10000 | 2.4566 | 17.8432 | 6.6779 | 14.2303 | 16.1952 | 19.0 |
| 2.518 | 0.79 | 20000 | 2.3868 | 18.0354 | 6.8565 | 14.3552 | 16.3664 | 19.0 |
| 2.4587 | 1.18 | 30000 | 2.3600 | 18.2076 | 6.9618 | 14.5349 | 16.5626 | 19.0 |
| 2.4365 | 1.58 | 40000 | 2.3295 | 18.3579 | 7.0312 | 14.6145 | 16.6845 | 19.0 |
| 2.4306 | 1.97 | 50000 | 2.3190 | 18.4551 | 7.0861 | 14.6879 | 16.7627 | 19.0 |
| 2.4005 | 2.37 | 60000 | 2.3056 | 18.3521 | 7.0496 | 14.6413 | 16.6832 | 19.0 |
| 2.396 | 2.76 | 70000 | 2.3012 | 18.348 | 7.0439 | 14.6509 | 16.6994 | 19.0 |