Skip to content
Tmob edited this page Jan 28, 2026 · 2 revisions

Welcome to Kiri OCR

Kiri OCR is a lightweight, high-performance Optical Character Recognition (OCR) library designed specifically for English and Khmer documents. It combines modern Deep Learning techniques to provide accurate text detection and recognition in a compact package.

Kiri OCR Banner

✨ Key Features

  • Hybrid Transformer Architecture: Utilizes a CNN backbone with a Transformer encoder and a dual CTC + Attention decoder for high accuracy.
  • Bi-lingual Support: Native support for Khmer and English (including mixed text), handling complex Khmer scripts correctly.
  • Full Pipeline: Includes both text detection (finding text in images) and text recognition (reading the text).
  • Lightweight & Fast: Optimized for efficient inference, making it suitable for various deployment scenarios from cloud to edge.
  • Easy-to-Use: Simple Python API and a powerful Command Line Interface (CLI).
  • Customizable: Train your own models or fine-tune existing ones on your data.
  • Hugging Face Integration: Seamlessly downloads models and datasets from Hugging Face.

🚀 Getting Started

If you are new to Kiri OCR, we recommend starting with the following pages:

  1. Installation: Learn how to install Kiri OCR on your system (Linux, Mac, Windows).
  2. Quick Start: Run your first OCR prediction in minutes.
  3. Python API: Integrate Kiri OCR into your Python applications.
  4. CLI Reference: Explore the command-line tools.

📚 Advanced Topics

For those looking to train their own models or dive deeper:

🔗 Resources

🤝 Contributing

We welcome contributions! Please see our GitHub repository for guidelines on how to contribute code, report issues, or suggest features.

Clone this wiki locally