Non-English Embeddings and Models
BLOOM: A 176B-Parameter Open-Access Multilingual Language Model
Paper
• 2211.05100
• Published • 39
Contrastive Language-Image Pre-training for the Italian Language
Paper
• 2108.08688
• Published • 2
IT5: Large-scale Text-to-text Pretraining for Italian Language Understanding and Generation
Paper
• 2203.03759
• Published • 5
Spanish Pre-trained BERT Model and Evaluation Data
Paper
• 2308.02976
• Published • 3
German FinBERT: A German Pre-trained Language Model
Paper
• 2311.08793
• Published • 3
German Text Embedding Clustering Benchmark
Paper
• 2401.02709
• Published • 6
AfroDigits: A Community-Driven Spoken Digit Dataset for African Languages
Paper
• 2303.12582
• Published • 23
Aya Model: An Instruction Finetuned Open-Access Multilingual Language Model
Paper
• 2402.07827
• Published • 48
CohereLabs/c4ai-command-r-v01
Text Generation
• 35B • Updated • 29.1k
• 1.11k