Prabhāsa-b_s 0.2 — BabyLM-2026 Strict (100M)

Pāṇinian-Structured pretraining for Small Language Models. ELC encoder (14L/768d/12h, RoPE, GeGLU/RMSNorm, N-hot morpheme embeddings + Paribhāṣā structure-aware masking), pure-MLM, AdamW lr 5e-4, 10 epochs.

Official BabyLM-2026 scorer (summed pseudo-log-likelihood, no length-norm):

BLiMP BLiMP-supplement EWoK COMPS
72.63 65.90 53.23 54.72
  • +5.07 pp over v0.1 (qbz506/prabhasa-b_s, 67.56); −1.9 pp from the GPT-2 baseline (74.53).
  • BabyLM-compliant: ≤10 epochs over the 100M-word Strict budget. Reproduced across two independent runs.
  • Honest framing: targets sample efficiency + interpretability, not frontier parity. See findings F1–F10 in the repo.
Downloads last month
21
Safetensors
Model size
0.1B params
Tensor type
F32
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support