PY-AI-Dev commited on
Commit
6ebac9e
·
verified ·
1 Parent(s): dfe4362

Upload README.md with huggingface_hub

Browse files
Files changed (1) hide show
  1. README.md +17 -0
README.md CHANGED
@@ -9,8 +9,12 @@ tags:
9
  - gqa
10
  - swiglu
11
  - rope
 
 
 
12
  metrics:
13
  - perplexity
 
14
  pipeline_tag: text-generation
15
  ---
16
 
@@ -18,6 +22,19 @@ pipeline_tag: text-generation
18
 
19
  A 9.97M parameter causal language model trained from scratch, targeting the [Open SLM Leaderboard](https://huggingface.co/spaces/AxiomicLabs/Open_SLM_Leaderboard) `<10M` tier.
20
 
 
 
 
 
 
 
 
 
 
 
 
 
 
21
  ## Model Details
22
 
23
  | Property | Value |
 
9
  - gqa
10
  - swiglu
11
  - rope
12
+ - multiple-choice
13
+ - text-ranking
14
+ - nlp-research
15
  metrics:
16
  - perplexity
17
+ - accuracy
18
  pipeline_tag: text-generation
19
  ---
20
 
 
22
 
23
  A 9.97M parameter causal language model trained from scratch, targeting the [Open SLM Leaderboard](https://huggingface.co/spaces/AxiomicLabs/Open_SLM_Leaderboard) `<10M` tier.
24
 
25
+ ## Intended Use
26
+
27
+ This is a **research model** optimised for NLU benchmarking tasks, not open-ended generation. It is best suited for:
28
+
29
+ | Task | Examples |
30
+ |------|---------|
31
+ | **Multiple-choice QA** | ARC, HellaSwag, PIQA, ArithMark — score each candidate and pick the highest |
32
+ | **Log-likelihood ranking** | Rank candidate continuations or document relevance by perplexity |
33
+ | **SLM research** | Ablations, architecture studies, efficiency benchmarks at the <10M scale |
34
+ | **Perplexity evaluation** | Measuring language model fit on held-out text corpora |
35
+
36
+ It is **not suited** for open-ended text generation, chat, or instruction following — at 10M parameters the vocabulary (8,192 tokens) and capacity are too limited for fluent free-form output.
37
+
38
  ## Model Details
39
 
40
  | Property | Value |