<p align="center">
<a href="https://github.com/junhoyeo/BetterOCR">
<img src="https://github.com/junhoyeo/BetterOCR/raw/main/.github/images/logo.png" width="256px" />
</a>
</p>
<h1 align="center">BetterOCR</h1>
> 🔍 Better text detection by combining multiple OCR engines with 🧠 LLM.
OCR _still_ sucks! ... Especially when you're from the _other side_ of the world (and face a significant lack of training data in your language) — or just not thrilled with noisy results.
**BetterOCR** combines results from multiple OCR engines with an LLM to correct & reconstruct the output.
- **🔍 OCR Engines**: Currently supports [EasyOCR](https://github.com/JaidedAI/EasyOCR) and [Tesseract](https://github.com/tesseract-ocr/tesseract).
- **🧠 LLM**: Supports [Chat models](https://github.com/openai/openai-python#chat-completions) from OpenAI.
- **📒 Custom Context**: Allows users to provide an optional context to use specific keywords such as proper nouns and product names. This assists in spelling correction and noise identification, ensuring accuracy even with rare or unconventional words.
Head over to [💯 Examples](https://github.com/junhoyeo/BetterOCR#-examples) to view performace by languages (🇺🇸, 🇰🇷, 🇮🇳).
Coming Soon: improved interface, async support, box detection, and more.