Reverse Dependencies of pdf2image
The following projects have a declared dependency on pdf2image:
- abstract-images — This module, part of the `abstract_essentials` package, provides a collection of utility functions for working with images and PDFs, including loading and saving images, extracting text from images, capturing screenshots, processing PDFs, and more.
- afipcaeqrdecode — Package to decode and extract invoice metadata from an AFIP CAE qr code link
- agixt — An Artificial Intelligence Automation Platform. AI Instruction management from various providers, has an adaptive memory, and a versatile plugin system with many commands including web browsing. Supports many AI providers and models and growing support every day.
- agl-ocr-reader — OCR API: This OCR API is an application for extracting text from images and PDF files. It is built using Flask, a Python web framework. It utilizes the pytesseract OCR library, pymupdf and the PIL library for image processing.
- ai-object-detection — AI Object Detection
- aideml — Autonomous AI for Data Science and Machine Learning
- aipdf — A tool to extract PDF files to markdown, or any other format using AI
- airbyte-cdk — A framework for writing Airbyte Connectors.
- alertwise — Wagtail based weather warnings composing and dissemination tool
- alita-sdk — SDK for building langchain agents using resouces from Alita
- alita-tools — Default set of tools and toolkits available within ELITEA Agents.
- amazon-textract-textractor — A package to use AWS Textract services.
- analysis-engine — Analysis for the UK Department for Transport's major projects portfolio
- analysta-index — Extension of Langchain loaders, llms and retrievers for Analysta
- ansys-geometry-core — A python wrapper for Ansys Geometry service
- ansys-sphinx-theme — A theme devised by ANSYS, Inc. for Sphinx documentation.
- ansys-units — Pythonic interface for units, unit systems, and unit conversions.
- anthropic-cli — A command-line tool for interacting with the Anthropic API
- appjsonify — An academic paper PDF to JSON conversion toolkit.
- apple-vision-utils — Fast and accurate OCR on images and PDFs using Apple Vision framework directly from command line.
- arh — Я здесь за эту улицу стою. Пацаны мне всё, и я всё пацанам. Кто меня знает, тот в курсе.
- aryn-sdk — The client library for Aryn services.
- askdoc — Ask a personal doctor for your medical queries
- astra-multivector — Multivector Tables using the DataAPI from AstraDB
- auto-coder — AutoCoder: AutoCoder
- autogluon.multimodal — Fast and Accurate ML in 3 Lines of Code
- AutoRAG — Automatically Evaluate RAG pipelines with your own data. Find optimal structure for new RAG product.
- autoscab — apply for many of the same job
- awca — A toolkit for making ancient world citation analysis, text summarization, paraphrasing and OCR for PDF to CSV
- babot — Framework para crear agentes inteligentes personalizados
- bangla-pdf-ocr — A package to extract Bengali text from PDFs using OCR
- betty — Betty helps you visualize and publish your family history by building interactive genealogy websites out of your Gramps and GEDCOM family trees
- big-pdf-into-images — A tool to convert PDF files into images, page by page.
- biochatter — Backend library for conversational AI in biomedicine
- biocwl-dash — Viewer for Mount Sinai IIDSGT Precision Oncology reports.
- bisheng-unstructured — ETLs fro LLMs
- bluedot-rest-framework — no summary
- bocr — A Python package for OCR using Vision LLMs
- bpm-ai-core — Core AI abstractions and helpers.
- bsrag-unstructured — A Python package with a built-in web application
- butler-sdk — Butler Python SDK
- bwscan — # bwscan
- Byaldi — Use late-interaction multi-modal models such as ColPali in just a few lines of code.
- camai-utils — Python utils for the Camai CHC COVID Datasystem.
- capybara-docsaid — OpenCV with ONNX Runtime Inference Toolkit.
- cli-pdf-viewer — PDF Viewer
- cliriculum — A python cli tool to rapidly create an html or PDF resume
- cloudinteractive-ai-insights — Collection of AI tools designed to assist with your assignments and projects.
- common-utility-pepe — korean pepe lover
- comod — Compartmental modelling Python package
- contexto — Librería para el procesamiento y análisis de texto con Python
- cornellGrading — Routines for interacting with Cornell installations of Canvas and Qualtrics
- cyyrus — Transform Unstructured Data into Usable Datasets
- data-science-document-ai — "Document AI repo for data science"
- data-snapshot — no summary
- DataXtractor — DataXtractor is a versatile Python library designed to simplify the extraction of valuable data from a variety of sources, including images and PDF documents. Whether you need to extract text, tables, or structured content, DataXtractor provides powerful and intuitive tools to streamline the process.
- decimer-segmentation — DECIMER Segmentation - Extraction of chemical structure depictions from scientific literature
- deepsearch-latest — Deep search tool for document analysis
- demogpt — Autonomous AI Agent for Gen-AI App Generation
- dfelf — Data File Elf
- disclosure-extractor — A data extraction tool from judge financial disclosures.
- django-doma — Simple Document Management for Django
- doc-curation — A package for curating doc file collections, with ability to sync with youtube and archive.org doc items.
- doc-loader — Given werkzeug.FileStorage, fastapi.UploadFile or str file path as input it converts any image files(.pdf, .jpg, .png, .tiff) into list of PIL or numpy objects
- doc-ocr — Text extractor from document
- doc-ocr-yakul — Text extractor from document
- docai-dev — DocAI Python module
- docai-py — Butler Doc AI
- docailite — DocAI Lite Python module
- docbarcodes — Docbarcodes extracts 1D and 2D barcodes from scanned PDF documents or images.
- docile-benchmark — Tools to work with the DocILE dataset and benchmark
- docint — Extracting information from DOCuments INTelligently.
- DOCK-BYTE — A module to extract text from documents and chat with the content.
- docqa — DocQA: An easy way to extract information from documents
- docquery — DocQuery: An easy way to extract information from documents
- docquery-test — DocQuery: An easy way to extract information from documents
- documentdataextraction — Change the yml template format in GenerateExtractTemplate class
- DocumentsReader — A package to read and process documents
- docupie — An advanced document processing tool that leverages AI to extract structured data from PDFs
- domsdatabasen — Scraper and PDF text processor for domsdatabasen.dk
- dp-PDF-Crawler — A custom Flask package with PDF processing tools
- dsparse — Multi-modal file parsing and chunking
- dsrag — State-of-the-art RAG pipeline from D-Star AI
- dynamiq — Dynamiq is an orchestration framework for agentic AI and LLM applications
- easylatex2image — another latex converter 2 pictures
- easyocr-unstructured — Parse unstructured text from PDFs
- ebs-iot-linuxnode — no summary
- ecom-data-helpers-lib — A library of reusable utilities for AWS Lambda functions in ECOM Data Projects
- eDOCr — OCR for Engineering Mechanical Drawings
- edubotics-core — Core modules for edubotics-based LLM AI chatbots
- edupsyadmin — edupsyadmin provides tools to help school psychologists with their documentation
- efficient-ocr — Efficient OCR
- EmberFactory — Software to (re)produce burning ember diagrams of the style used in IPCC reports.
- embermaker — Software library to (re)produce burning ember and related diagrams of the style used in IPCC reports
- emo-market-base — Marketlerden ürünleri kazıma işlemleri için temel pakettir
- enb — Experiment NoteBook (enb): efficient and reproducible science.
- esch — esch (v.) : to turn matricies into high quality svg (animations)
- Extractable — Extract tables from PDFs
- extralit — Open-source tool for accurate & fast scientific literature data extraction with LLM and human-in-the-loop.
- faker-file — Generate files with fake data.