Reverse Dependencies of pypdfium2
The following projects have a declared dependency on pypdfium2:
- aijson-core — Low-code config language for AI pipelines
- aijson-pdf — AI JSON PDF Actions
- amazon-textract-textractor — A package to use AWS Textract services.
- asyncflows — Declarative AI Pipelines
- axisvm — A Python package for AxisVM.
- bisheng-unstructured — ETLs for LLMs
- bpm-ai-core — Core AI abstractions and helpers.
- deepdoctection — Repository for Document AI
- delete-your-pdf — Crop, Rotate, and extract text from your PDFs so you can delete them
- docling — SDK and CLI for parsing PDF, DOCX, HTML, and more, to a unified document representation for powering downstream workflows such as gen AI applications.
- docprompt — Documents and large language models.
- edspdf — Smart text extraction from PDF documents
- expert-doc — Document parser for 'expert'
- extract-thinker — Library to extract data from files and documents agnostically using LLMs
- extralit — Open-source tool for accurate & fast scientific literature data extraction with LLM and human-in-the-loop.
- friday-agent — A self-improving embodied conversational agent seamlessly integrated into the operating system to automate our daily tasks.
- gmft — Lightweight, performant, deep table extraction
- god-ocr — OCR King
- h2ogpt — no summary
- img2speech — Integrated Python package for converting Image to speech
- img2table — img2table is a table identification and extraction Python Library for PDF and images, based on OpenCV image processing
- langchain_1111_Dev_cerebrum — Building applications with LLMs through composability
- langchain-by-johnsnowlabs — Building applications with LLMs through composability
- langchain-xfyun — Use the iFlytek Spark large language model smoothly in LangChain
- langchaincoexpert — Building applications with LLMs through composability
- langchainmsai — Building applications with LLMs through composability
- langchainn — Building applications with LLMs through composability
- langplus — Building applications with LLMs through composability
- langrila — useful tool to use API-based LLM
- llmvm-cli — Command Line LLM with client-side tools support.
- look-like-scanned — Python script to make documents look like they were scanned.
- mimir-ai — no summary
- mindee — Mindee API helper library for Python
- modm-data — Embedded Hardware Description Processor
- nanugo — Turn your handwritten pdf sheets to Anki deck.
- nougat-ocr — Nougat: Neural Optical Understanding for Academic Documents
- Ocrversion1 — no summary
- Ocrversion2 — no summary
- onnxtr — Onnx Text Recognition (OnnxTR): docTR Onnx-Wrapper for high-performance OCR on documents.
- oplangchain — langchain for OpenPlugin
- os-copilot — A self-improving embodied conversational agent seamlessly integrated into the operating system to automate our daily tasks.
- pdf-bank-statement-parser — Command-line tool for converting PDF bank statements into CSV
- pdf-gpt4-json — Use GPT4-Vision as a better-than-OCR data extractor
- pdf2index — no summary
- pdfbrain — Parsing PDF files with pdfium
- pdfplumber — Plumb a PDF for detailed information about each char, rectangle, and line.
- pdfplumber.aemc — Plumb a PDF for detailed information about each char, rectangle, and line.
- pdftext — Extract structured text from pdfs quickly
- PrivacySherlock — A Python package for PII detection and classification
- pybudgetbook — Organize and sort your receipts locally, use pandas power to analyze your spendings!
- pyrpasuite — RPA using python
- python-doctr — Document Text Recognition (docTR): deep Learning for high-performance OCR on documents.
- pythonwatermark — Easily add watermarks to PDF, JPG & PNG files with no restrictive licensing
- semantra — A semantic search CLI tool
- slowblood — Tools for ML/LLM
- sparclur — Tools for analyzing PDF files and comparing PDF parsers
- stepcutis — A document analysis program
- supersullytools — This is a Python package that brings together a suite of utilities and helpers across several domains of software development.
- surya-ocr — OCR, layout, reading order, and table recognition in 90+ languages
- surya-ocr-vlite — OCR, layout analysis, and line detection in 90+ languages
- tabled-pdf — Detect and recognize tables in PDFs and images.
- testzeus-hercules — Hercules: The World's First Open-Source AI Agent for End-to-End Testing
- texify — OCR for latex images
- unimernet — UniMERNet: A Universal Network for Real-World Mathematical Expression Recognition
- vectorcraft — A custom library extending LangChain functionality.
- wdoc — A perfect AI powered RAG for document query and summary. Supports ~all LLM and ~all filetypes (url, pdf, epub, youtube (incl playlist), audio, anki, md, docx, pptx, or any combination!)
- zebrafy — Python library for converting PDF and images to Zebra Programming Language (ZPL)