--- datasets: - spanish-ir/messirve language: - es base_model: - intfloat/multilingual-e5-large pipeline_tag: feature-extraction --- multilingual-e5-large model fine-tuned on the [MessIRve](https://huggingface.co/datasets/spanish-ir/messirve) Spanish IR full training set, retrieving hard negatives with BM25 and following the same approach as [Wang et al. (2024)](https://arxiv.org/abs/2402.05672). Refer to https://github.com/ftvalentini/MessirveSpanishIR for more details on the training. Paper: [MessIRve: A Large-Scale Spanish Information Retrieval Dataset](https://arxiv.org/abs/2409.05994) (EMNLP 2025)