metythorn's picture
Upload README.md with huggingface_hub
ee55908 verified
|
Raw
History Blame Contribute Delete
576 Bytes
metadata
language:
  - km
license: apache-2.0
tags:
  - ocr
  - transformer
  - vision
pipeline_tag: image-to-text

Khmer OCR CNN + Transformer

This repository contains a ResNet + Transformer decoder checkpoint for Khmer OCR, I don’t have a public paper for this model — everything comes from thousands of experiments across different model architectures and datasets.

Installation

pip install mer

Usage

from mer import Mer

model = Mer(markdown=True, device='cuda')
result = model.predict("sample_image.png")
print("Predicted text:", result)