Reverse Dependencies of pytesseract
The following projects have a declared dependency on pytesseract:
- a-pandas-ex-tesseract-multirow-regex-fuzz — Regex/Fuzz search across multiple rows/Tesseract to pandas.DataFrame
- abstract-images — This module, part of the `abstract_essentials` package, provides a collection of utility functions for working with images and PDFs, including loading and saving images, extracting text from images, capturing screenshots, processing PDFs, and more.
- acie — This project is an implementation of OCR (Optical Character Recognition) to extract relevant information from an Aadhaar card.
- AdyanUtils — Special package
- agent-cloud — no summary
- agent-cloud-os — no summary
- agent-context — no summary
- agent-management-system — no summary
- agent.ngo — no summary
- agent-system — no summary
- agentbox — no summary
- agentDB — no summary
- agentforge — AI-driven task automation system
- agentvm — no summary
- agl_anonymizer_pipeline — This package is made to censor sensitive data in images and extract the contents. NER is planned for the future.
- agl-base-db — Basic shared components for the Django-based database of our workgroup.
- agl-ocr-reader — OCR API: This OCR API is an application for extracting text from images and PDF files. It is built using Flask, a Python web framework. It utilizes the pytesseract OCR library, pymupdf and the PIL library for image processing.
- airbyte-cdk — A framework for writing Airbyte Connectors.
- ALbedo — A package for pre-trained image classification and context-decider for question-answering chatbots.
- algorin-cli — Access to GPT-3 and document processing from the command line.
- ams-core — no summary
- ams-python — no summary
- analysta-index — Extension of Langchain loaders, llms and retrievers for Analysta
- andreo1 — Read text or text in images inside a pdf and turn it into string
- AppiumExtended — An extension library that adds ease of use to Appium-Python-Client
- appnext — no summary
- aradf — For converting pdf documents to txt files
- arh — I stand up for this street here. The boys are everything to me, and I give everything to the boys. Whoever knows me is in the know.
- ark-api — Python API for ark automation
- arkcloud — This application accesses the Ark system.
- askdoc — Ask a personal doctor for your medical queries
- auto-ams — no summary
- auto-cmd — Cross-platform CLI tools and HTTP RPC server for desktop automation.
- autodigipick — Automatically solve Starfield Digipick puzzles with the press of a button.
- autogluon.multimodal — Fast and Accurate ML in 3 Lines of Code
- autogluon-tonyhu-test.multimodal — AutoML for Image, Text, and Tabular Data
- autoinsight — A Simplified UI automation package
- AutoMonkey — Python Automation using Mouse and Keyboard, for the masses
- autopipeline — no summary
- avutil — Provide some useful util functions and a powerful tool (tidyup) for tidying up your video folder
- awca — A toolkit for making ancient world citation analysis, text summarization, paraphrasing and OCR for PDF to CSV
- BA-Marissa-Alexis — Produce expiration tracker
- BA-produce-tracker — Produce expiration tracker
- Backups-clientAPI-NPP — Basic API for various types of backups, designed for EIO
- bangla-pdf-ocr — A package to extract Bengali text from PDFs using OCR
- BAproducetracker — Produce expiration tracker
- barcap — Extract any barcode using your web camera
- betterocr — Better text detection by combining OCR engines with LLM.
- bibliography-organizer-sebastian-achim-mueller — Organize your bibliography
- bluemist — Bluemist AI is a low-code machine learning library written in Python to develop, evaluate and deploy automated ML pipelines.
- bookvid2pdf — Convert a video of the pages of a book being flipped to a PDF.
- bpm-ai-inference — Inference and server for local AI implementations of bpm-ai-core abstractions.
- braillelib — A braille library for Python
- bretina — Bender Robotics Visual Test Support
- brudercropper — Crops stuff to 62 mm for Brother label printers
- BTKSorgu — Query whether access to a target website is blocked by BTK
- bto — [BTO]: multi-purpose cli tool (one-script-to-rule-them-all)
- bw_plex — Skip intros.
- bwscan — bwscan
- camai-utils — Python utils for the Camai CHC COVID Datasystem.
- cannlytics — 🔥 Cannlytics is a suite of tools that you can use to wrangle, standardize, and analyze cannabis data
- CatbackupAPI-NPP — Program for collecting data from CatBackup APIs
- censor-this — A command line tool to censor words in an image.
- chapicha — A semi-automated image editing tool
- civilpy — Civil Engineering Tools in Python
- clai — Command Line AI: this tool lets you call ChatGPT from a CLI
- clipcrop — Extract sections from your image by using OpenAI CLIP and Facebook's DETR implemented on HuggingFace Transformers
- clonwn-sort — Sort screenshots based on rules or through individual review.
- clown-sort — Sort screenshots based on rules or through individual review.
- cockgrabber — grab cocks
- code-context — no summary
- code-image-to-text — Convert code image to text
- codegraph-agent — no summary
- contexto — Library for text processing and analysis with Python
- datafog — Scan, redact, and manage PII in your documents before they get uploaded to a Retrieval Augmented Generation (RAG) system.
- DataOCREnhanceMain — A toolkit for performing OCR (Optical Character Recognition) tasks
- DataXtractor — DataXtractor is a versatile Python library designed to simplify the extraction of valuable data from a variety of sources, including images and PDF documents. Whether you need to extract text, tables, or structured content, DataXtractor provides powerful and intuitive tools to streamline the process.
- dedoc-utils — Utils for automatic document images processing
- deeptext — A cross-platform framework for deep learning based text detection, recognition and parsing
- detect-binod — This package detects the word "binod" in an image file
- detextify — no summary
- did-endpoint — did-endpoint python
- disclosure-extractor — A data extraction tool from judge financial disclosures.
- diseloryaHelper — Lc's python helper.
- djtesseract — A small app providing a tesseract field for django 3.1.2
- dlunch — The ultimate web app for a well organized lunch.
- doc-master — Paper - Pytorch
- doc2graph — Repo to transform Documents to Graphs, performing several tasks on them.
- DOCK-BYTE — A module to extract text from documents and chat with the content.
- docketanalyzer — no summary
- docqa — DocQA: An easy way to extract information from documents
- docquery — DocQuery: An easy way to extract information from documents
- docquery-test — DocQuery: An easy way to extract information from documents
- docrx — search in documents
- docsbot — A simple chat bot for querying information from your local private documents.
- doctext — no summary
- document-contents-extractor — A simple script to extract contents section from a PDF or DJVU document
- document-forger — A package for generating forged documents
- document-ocr — OCR for documents
- document-tools — 🔧 Tools to automate your document understanding tasks.