cstr commited on
Commit
aad5d37
·
verified ·
1 Parent(s): e8fd833

Upload README.md with huggingface_hub

Browse files
Files changed (1) hide show
  1. README.md +49 -0
README.md ADDED
@@ -0,0 +1,49 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: mit
3
+ language:
4
+ - en
5
+ tags:
6
+ - text-recognition
7
+ - ocr
8
+ - trocr
9
+ - handwriting
10
+ - gguf
11
+ - crispembed
12
+ base_model: microsoft/trocr-small-handwritten
13
+ pipeline_tag: image-to-text
14
+ ---
15
+
16
+ # TrOCR-small Handwritten Text — GGUF
17
+
18
+ Handwritten text recognition model for [CrispEmbed](https://github.com/CrispStrobe/CrispEmbed).
19
+ Recognizes handwritten text from cropped text-line images. Outputs **mixed-case** text
20
+ (unlike trocr-small-printed which uppercases).
21
+
22
+ **Architecture**: DeiT-small encoder (12L, 384d, 6 heads) + TrOCR decoder
23
+ (6L, 256d, 8 heads). XLM-R vocabulary (64,044 tokens). 61M parameters.
24
+
25
+ **Source**: [microsoft/trocr-small-handwritten](https://huggingface.co/microsoft/trocr-small-handwritten) (MIT).
26
+
27
+ ## Model Variants
28
+
29
+ | Variant | Size | Recognition quality |
30
+ |---------|------|-------------------|
31
+ | F32 | 235 MB | exact token match vs HuggingFace (greedy) |
32
+ | F16 | 118 MB | identical to F32 |
33
+ | **Q8_0** | **63 MB** | **identical to F32** |
34
+
35
+ Q4_K not provided — 256d decoder bottleneck is too narrow for 4-bit quantization.
36
+
37
+ ## Usage
38
+
39
+ ```bash
40
+ # Single crop recognition
41
+ crispembed -m trocr-small-handwritten-q8_0.gguf --ocr crop.png
42
+
43
+ # Full pipeline with DBNet detection
44
+ crispembed --det dbnet-ic15-q4_k.gguf -m trocr-small-handwritten-q8_0.gguf --ocr scan.png
45
+ ```
46
+
47
+ ## License
48
+
49
+ MIT (same as [microsoft/trocr-small-handwritten](https://huggingface.co/microsoft/trocr-small-handwritten)).