PlainSpeak โ€” Dense-to-Plain-English Translator

I taught a tiny AI to speak human.

Give it Shakespeare. Give it a legal contract. Give it anything written to impress instead of communicate. It gives you back what it actually means.

Try it

โ†’ Live demo on Hugging Face Spaces

pip install mlx-lm
mlx_lm.generate \
  --model Brandi-Kinard/plainspeak-smollm2-1.7b \
  --prompt "### Original:
YOUR TEXT HERE

### Plain English:" \
  --max-tokens 200

Examples

Shakespeare โ†’ Plain English

Original: "Wherefore art thou Romeo? Deny thy father and refuse thy name."

PlainSpeak: "Why are you Romeo? Don't deny your father and refuse your name."

Adam Smith โ†’ Plain English

Original: "The invisible hand of the market, whereby individuals pursuing their own self-interest are led, as if by an invisible hand, to promote ends which were no part of their original intention."

PlainSpeak: "When people try to make money for themselves, they often end up helping society without meaning to."

KJV Bible โ†’ Plain English

Original: "The LORD is my shepherd; I shall not want."

PlainSpeak: "The LORD leads me. I don't need anything else."

Model Details

Property Value
Base model SmolLM2-1.7B-Instruct
Fine-tuning method LoRA (8 layers)
Training iterations 500
Training examples 1,200
Validation examples 150
Data source Project Gutenberg + AI-generated synthetic pairs
Hardware Apple M1, 16GB unified memory
Peak training memory 10.09 GB
Final val loss 1.771
Inference memory ~3.6 GB
Build time 1 evening

What it's good at

  • 19th century prose (Dickens, James, Eliot, Hardy)
  • Shakespeare and Elizabethan English
  • King James Bible passages
  • Economic and political theory (Smith, Burke, Locke)
  • Academic abstracts
  • Legal boilerplate

Known limitations

  • Trained on 200-word chunks โ€” short fragments may produce inconsistent results
  • Occasional errors on numerical content (dates, quantities)
  • Not optimized for highly technical scientific notation
  • May struggle with extremely abstract or experimental writing (e.g. stream-of-consciousness)

How it was built

1. Stream 1,500 prose passages from Project Gutenberg
2. Generate plain English versions using a frontier model as teacher
3. Format as (original โ†’ plain) training pairs
4. Fine-tune SmolLM2-1.7B with LoRA on Apple MLX
5. Fuse adapter into final weights

The key insight: a small model trained on 1,500 high-quality examples outperforms a large model trained on millions of noisy ones โ€” at this specific task.

Use in Python

from mlx_lm import load, generate

model, tokenizer = load("Brandi-Kinard/plainspeak-smollm2-1.7b")

prompt = """### Original:
It is a truth universally acknowledged, that a single man in possession of a good fortune, must be in want of a wife.

### Plain English:"""

response = generate(model, tokenizer, prompt=prompt, max_tokens=200)
print(response)

Links

Downloads last month
21
Safetensors
Model size
2B params
Tensor type
BF16
ยท
MLX
Hardware compatibility
Log In to add your hardware

Quantized

Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support

Model tree for Brandi-Kinard/plainspeak-smollm2-1.7b

Adapter
(36)
this model

Space using Brandi-Kinard/plainspeak-smollm2-1.7b 1