---
datasets:
- spanish-ir/messirve
language:
- es
base_model:
- intfloat/multilingual-e5-large
pipeline_tag: feature-extraction
---
A multilingual-e5-large model fine-tuned on the full training set of [MessIRve](https://huggingface.co/datasets/spanish-ir/messirve), a Spanish information retrieval (IR) dataset. Hard negatives were retrieved with BM25, and training follows the approach of [Wang et al. (2024)](https://arxiv.org/abs/2402.05672).
Refer to https://github.com/ftvalentini/MessirveSpanishIR for more details on the training.
Paper: [MessIRve: A Large-Scale Spanish Information Retrieval Dataset](https://arxiv.org/abs/2409.05994) (EMNLP 2025)
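
A minimal usage sketch for retrieval with this model. It assumes the fine-tuned checkpoint keeps the base model's input convention, where queries are prefixed with `query: ` and documents with `passage: `. The helper names (`format_query`, `format_passage`, `rank_passages`, `load_encoder`) are hypothetical and not part of the training repository, and `model_id` is a placeholder for this model's Hub identifier, which is not stated in this card.

```python
import math
from typing import Callable, List, Sequence, Tuple

Vector = Sequence[float]
Encoder = Callable[[List[str]], List[Vector]]


def format_query(text: str) -> str:
    # Assumption: same "query: " prefix as the base multilingual-e5-large model.
    return f"query: {text}"


def format_passage(text: str) -> str:
    # Assumption: same "passage: " prefix as the base multilingual-e5-large model.
    return f"passage: {text}"


def cosine(u: Vector, v: Vector) -> float:
    # Cosine similarity; returns 0.0 if either vector has zero norm.
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def rank_passages(
    query: str, passages: List[str], encode: Encoder
) -> List[Tuple[str, float]]:
    # Encode the prefixed query and passages in one batch, then sort the
    # passages by cosine similarity to the query, best match first.
    vectors = encode([format_query(query)] + [format_passage(p) for p in passages])
    query_vec, passage_vecs = vectors[0], vectors[1:]
    scored = [(p, cosine(query_vec, v)) for p, v in zip(passages, passage_vecs)]
    return sorted(scored, key=lambda pair: pair[1], reverse=True)


def load_encoder(model_id: str) -> Encoder:
    # Hypothetical helper: wraps a sentence-transformers model as an Encoder.
    # Requires `pip install sentence-transformers`; imported lazily so the
    # ranking helpers above work without it.
    from sentence_transformers import SentenceTransformer

    model = SentenceTransformer(model_id)
    return lambda texts: model.encode(texts, normalize_embeddings=True).tolist()
```

With `sentence-transformers` installed, a call would look like `rank_passages("capital de Argentina", docs, load_encoder(model_id))`, where `docs` is a list of Spanish passages and `model_id` is this model's identifier on the Hub.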