AfriHuBERT: A self-supervised speech representation model for African languages

Model description

This is a compact multilingual self-supervised speech encoder based on facebook/hubert-large-ll60k. We performed continued pretraining through multilingual adaptive finetuning (MAFT) on over 10,000 hours of African language data aggregated from various sources.

Pretraining data

Dataset: AfriHuBERT was trained on data from 11 major sources, including BibleTTS, Kallaama, MMS Ulab v2, NaijaVoices, and NCHLT. All sources and their licenses are shown in the table below. Please refer to the paper for more information.

Language Coverage

AfriHuBERT-large covers 1,230 languages in total including 1,226 indigenous African languages

BibTeX entry and citation info.

If you use this model in your research paper kindly cite the follows papers:

@doctoralThesis{alabi2026phd,
  author = {Alabi, Jesujoba Oluwadara},
  title = {Advancing African NLP: Adaptation, Analysis, and Evaluation of Large Language Models},
  school = {Saarland University},
  year = {2026},
  note = {Unpublished doctoral dissertation}
}


@misc{alabi2024afrihubertselfsupervisedspeechrepresentation,
      title={AfriHuBERT: A self-supervised speech representation model for African languages}, 
      author={Jesujoba O. Alabi and Xuechen Liu and Dietrich Klakow and Junichi Yamagishi},
      year={2024},
      eprint={2409.20201},
      archivePrefix={arXiv},
      primaryClass={cs.CL},
      url={https://arxiv.org/abs/2409.20201}, 
}

Downloads last month: -

Inference Providers NEW

This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for ajesujoba/AfriHuBERT-large

Base model

facebook/hubert-large-ll60k

Finetuned

(22)

this model

Datasets used to train ajesujoba/AfriHuBERT-large

Paper for ajesujoba/AfriHuBERT-large

AfriHuBERT: A self-supervised speech representation model for African languages

Paper • 2409.20201 • Published Sep 30, 2024