Reverse Dependencies of tesserocr
The following projects have a declared dependency on tesserocr:
- axa-fr-ocr — AXA France OCR library
- cleanX — Python library for cleaning data in large datasets of Xrays
- decaptcha — A GUI automation Python module for solving Google reCAPTCHA v2
- docling — SDK and CLI for parsing PDF, DOCX, HTML, and more, to a unified document representation for powering downstream workflows such as gen AI applications.
- docling-google-ocr — SDK and CLI for parsing PDF, DOCX, HTML, and more, to a unified document representation for powering downstream workflows such as gen AI applications.
- docowling — SDK and CLI for parsing PDF, DOCX, HTML, and more, to a unified document representation for powering downstream workflows such as gen AI applications.
- extended-docling — SDK and CLI for parsing PDF, DOCX, HTML, and more, to a unified document representation for powering downstream workflows such as gen AI applications, now with Google OCR support.
- form-tools — no summary
- hvf-extraction-script — Python extraction script for HVF report images
- liteocr — Light-weight OCR engine.
- memorious — A minimalistic, recursive web crawling library for Python.
- metalex — MetaLex is tool for lexicographic and metalexicographic activities
- ocrd-tesserocr — wrap Tesseract preprocessing, segmentation and recognition
- OCyara — A Yara rule engine that scans images for matches using Optical Character Recognition (OCR). See the Github page for more information about the Cython, Tesseract, and Leptonica prerequsites.
- pgsocr — A command line utility for converting Blu-ray subs to SRT or ASS using AI Language Models.
- picslate — Picture ocr & translation
- sheatless — A python library for extracting parts from sheetmusic pdfs
- video-ocr — Package to run OCR on videos
1