Feature Extraction
Safetensors
Spanish
xlm-roberta
File size: 612 Bytes
47813bd
 
 
 
 
 
 
791e376
47813bd
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
---
datasets:
- spanish-ir/messirve
language:
- es
base_model:
- intfloat/multilingual-e5-large
pipeline_tag: feature-extraction
---
multilingual-e5-large model fine-tuned on the [MessIRve](https://huggingface.co/datasets/spanish-ir/messirve) Spanish IR full training set, retrieving hard negatives with BM25 and following the same approach as [Wang et al. (2024)](https://arxiv.org/abs/2402.05672). 

Refer to https://github.com/ftvalentini/MessirveSpanishIR for more details on the training.

Paper: [MessIRve: A Large-Scale Spanish Information Retrieval Dataset](https://arxiv.org/abs/2409.05994) (EMNLP 2025)