How to use messalti/MagharibiBERT-v1 with Transformers:
# Use a pipeline as a high-level helper
from transformers import pipeline
pipe = pipeline("fill-mask", model="messalti/MagharibiBERT-v1")

# Load model directly
from transformers import AutoTokenizer, AutoModelForMaskedLM
tokenizer = AutoTokenizer.from_pretrained("messalti/MagharibiBERT-v1")
model = AutoModelForMaskedLM.from_pretrained("messalti/MagharibiBERT-v1")
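As a rough illustration of how the fill-mask pipeline might be queried, the sketch below masks one word in a sentence and asks the model for its top candidates. The helper names `mask_word` and `top_predictions` are assumptions introduced here and are not part of the model card; the mask token is read from the tokenizer rather than hard-coded.

```python
def mask_word(sentence: str, word: str, mask_token: str) -> str:
    """Replace the first occurrence of `word` with the mask token."""
    if word not in sentence:
        raise ValueError(f"{word!r} not found in sentence")
    return sentence.replace(word, mask_token, 1)


def top_predictions(sentence: str, word: str, top_k: int = 5):
    """Return (token, score) pairs predicted for the masked position.

    Downloads the model on first use, so this needs network access.
    """
    from transformers import pipeline  # imported lazily: heavy dependency

    pipe = pipeline("fill-mask", model="messalti/MagharibiBERT-v1")
    masked = mask_word(sentence, word, pipe.tokenizer.mask_token)
    # With a single mask in a single string, the pipeline returns a list of dicts.
    results = pipe(masked, top_k=top_k)
    return [(r["token_str"], r["score"]) for r in results]
```

Calling `top_predictions(sentence, word)` on a dialectal sentence returns the model's `top_k` guesses for the hidden word, which is a quick way to sanity-check the model before fine-tuning.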
MagharibiBERT-v1 is a Transformer-based language model pre-trained for Maghrebi Arabic dialects (Algerian, Moroccan, Tunisian).
It is initialized from MARBERT and further adapted using Masked Language Modeling (MLM) on large-scale social media text.
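To make the MLM objective concrete, here is a minimal sketch of the standard BERT-style masking recipe in plain Python. The 15% selection rate and the 80/10/10 split between mask, random, and unchanged tokens are the common BERT defaults and are assumptions here; the model card does not state the exact settings used for MagharibiBERT-v1. The function name `mask_tokens` is hypothetical.

```python
import random


def mask_tokens(tokens, vocab, mask_token="[MASK]", mlm_probability=0.15, rng=None):
    """Apply BERT-style masking to a list of tokens.

    Returns (inputs, labels). `labels` holds the original token at each
    selected position and None elsewhere, so a loss would only be
    computed on the selected positions.
    """
    rng = rng or random.Random()
    inputs, labels = [], []
    for tok in tokens:
        # Leave most tokens untouched (assumed default: 85%).
        if rng.random() >= mlm_probability:
            inputs.append(tok)
            labels.append(None)
            continue
        labels.append(tok)
        roll = rng.random()
        if roll < 0.8:
            inputs.append(mask_token)         # 80%: replace with the mask token
        elif roll < 0.9:
            inputs.append(rng.choice(vocab))  # 10%: replace with a random token
        else:
            inputs.append(tok)                # 10%: keep the original token
    return inputs, labels
```

During training, the model sees `inputs` and is asked to recover the original tokens stored in `labels`. In practice this step is usually handled by a library data collator operating on token IDs rather than by hand-written code.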
The model is designed for downstream NLP tasks such as hate speech detection, sentiment analysis, and dialect classification.
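For a downstream task such as hate speech detection, one plausible starting point is to load the encoder with a sequence classification head. The sketch below assumes a hypothetical binary label set (`not_hate`, `hate`) and a hypothetical helper `load_classifier`; neither comes from the model card.

```python
LABELS = ["not_hate", "hate"]  # hypothetical label set, for illustration only
ID2LABEL = dict(enumerate(LABELS))
LABEL2ID = {label: idx for idx, label in ID2LABEL.items()}


def load_classifier(model_name: str = "messalti/MagharibiBERT-v1"):
    """Load the tokenizer and the encoder with a new classification head.

    Downloads the model on first use, so this needs network access.
    """
    from transformers import AutoModelForSequenceClassification, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(model_name)
    model = AutoModelForSequenceClassification.from_pretrained(
        model_name,
        num_labels=len(LABELS),
        id2label=ID2LABEL,
        label2id=LABEL2ID,
    )
    return tokenizer, model
```

The classification head returned here is randomly initialized, so the model must be fine-tuned on labeled data before its predictions mean anything.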
MagharibiBERT-v1 is described in the following manuscript, currently submitted for peer review:
Ahmed Zoubir Messalti and Sadik Bessou,
MagharibiBERT: A Regionally Adapted Language Model for Maghrebi Arabic Social Media,
submitted to the International Journal of Computers and Their Applications (IJCA), 2026.
Base model: UBC-NLP/MARBERT