Feature Extraction
Safetensors
Spanish
xlm-roberta
ftvalentini commited on
Commit
47813bd
·
verified ·
1 Parent(s): 633da9e

Create README.md

Browse files
Files changed (1) hide show
  1. README.md +13 -0
README.md ADDED
@@ -0,0 +1,13 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ datasets:
3
+ - spanish-ir/messirve
4
+ language:
5
+ - es
6
+ base_model:
7
+ - intfloat/multilingual-e5-large
8
+ ---
9
+ multilingual-e5-large model fine-tuned on the [MessIRve](https://huggingface.co/datasets/spanish-ir/messirve) Spanish IR full training set, retrieving hard negatives with BM25 and following the same approach as [Wang et al. (2024)](https://arxiv.org/abs/2402.05672).
10
+
11
+ Refer to https://github.com/ftvalentini/MessirveSpanishIR for more details on the training.
12
+
13
+ Paper: [MessIRve: A Large-Scale Spanish Information Retrieval Dataset](https://arxiv.org/abs/2409.05994) (EMNLP 2025)