Reverse Dependencies of PyMuPDF
The following projects have a declared dependency on PyMuPDF:
- 3m — 3m
- 42towels — an essential item for any intergalactic hitchhiker
- academic-claim-analyzer — A tool for analyzing academic claims
- AdyanUtils — Special package
- aerospace-chatbot — Aerospace engineering chatbot and AI tools.
- afipcaeqrdecode — Package to decode and extract invoice metadata from an AFIP CAE qr code link
- Agentx — AgentX: Seamlessly integrate intelligent agents into your projects. Empower your applications with advanced AI capabilities.
- agl-ocr-reader — OCR API: This OCR API is an application for extracting text from images and PDF files. It is built using Flask, a Python web framework. It utilizes the pytesseract OCR library, pymupdf and the PIL library for image processing.
- ai4data — no summary
- aicastle — AI Castle Package
- aigrok — A Python package for document processing and analysis with LLM integration and OCR capabilities
- airclick — airclick 相关python包
- airless-pdf — Airless package to manipulate pdf
- alacorder — Alacorder retrieves case detail PDFs from Alacourt.com and processes them into data tables suitable for research purposes.
- algorin-cli — Acceso a GPT-3 y procesamiento de documentos desde la línea de comandos.
- alita-tools — Default set of tools and toolkits available within ELITEA Agents.
- alldata — This is a Package in which you can Extract Images,Text and Tables from 1 package
- amberpdf — Librería que procesa un PDF mixto (texto e imágenes/tablas) y extrae el contenido en orden
- anbani — Georgian alphabet and language utilities for Natural Language Processing, script conversion and more.
- api2openai — Create a Python package.
- aradf — For converting pdf documents to txt files
- aranea — Aranea is an automated architecture analysis tool for parsing a car architecture from a PDF file.
- arcan — An AI web3 tooling platform for the decentralized customization and enhancement of AI agents
- archive-hocr-tools — hOCR (streaming) parsers and writers
- archive-pdf-tools — Internet Archive PDF compression tools
- arxiv-dl — Command-line Papers Downloader. Citation extraction and PDF naming automation.
- arxiv-summarizer — A happy toolkit for arxiv paper summarization and understanding.
- ascript — airclick 相关python包
- atelier-facture — no summary
- aus-council-scrapers — no summary
- ausbildungsnachweise-utils — Utilities to generate Ausbildungsnachweise PDFs from human readable input formats.
- Auto-Research — Geberate scientific survey with just a query
- autopipeline — no summary
- AutoRAG — Automatically Evaluate RAG pipelines with your own data. Find optimal structure for new RAG product.
- avahiplatform — An avahiai library which makes your Gen-AI tasks effortless
- aws-textract-pipeline — Package short description.
- axa-fr-splitter — This package splits PDF and TIFF files into separate PNGs and extracts text from input files.
- b2cloud — ヤマト運輸株式会社が提供する送り状発行システムB2クラウドをpythonで利用するパッケージ
- basiclingua — A Python library based on various LLMs to perform basic and advanced natural language processing (NLP) tasks
- bbrc-pyxnat — XNAT in Python
- bechdelai — Automating the Bechdel test and its variants for feminine representation in movies with AI
- Bimbmaan — bimbmaan: A Shaktiman to process high quality images for research publication
- Bio-Epidemiology-NER — Recognize bio-medical entities from a text corpus
- biochatter — Backend library for conversational AI in biomedicine
- bisheng-langchain — bisheng langchain modules
- bisheng-unstructured — ETLs fro LLMs
- BLA2 — This is a Package in which you can Extract Images,Text and Tables from 1 package
- bluewave — Python script to analyze the similarity of two PDFs
- bma-client-lib — BornHack Media Archive Client Library
- bnw-tools — Tools developed in the BorgNetzWerk project for the extraction, analysis and publication of knowledge.
- BookerPdfTool — iBooker/ApacheCN 知识库抓取工具
- botrun-ask-folder — no summary
- brainannex — Full-Stack Data/Knowledge Management with Neo4j
- browsr — TUI File Browser App
- Bs-Extractor — This repository contains a Python program designed to execute Optical Character Recognition (OCR) and Facial Recognition on images.
- bsg-ide — Beamer Slide Generator IDE
- BsSalary-Extractor — This repository contains a Python program designed to execute Optical Character Recognition (OCR) and Facial Recognition on images.
- BuoyanText — Normalizing English and Chinese Text
- burdoc — Advanced PDF parsing for python
- camel-ai — Communicative Agents for AI Society Study
- CanD — Create complex layouts for scientific figures in matplotlib
- cardimpose — Impose multiple copies of a pdf onto a larger document.
- casparser — (Karvy/Kfintech/CAMS) Consolidated Account Statement (CAS) PDF parser
- cbz — CBZ simplifies creating, managing, and viewing comic book files in CBZ format, offering seamless packaging, metadata handling, and built-in viewing capabilities
- cdt.ai — Cognitive Data Transformer for text, image, audio, and video processing
- celi-framework — Controller-Embedded Language Interactions - facilitates the entire lifecycle of document processing, from pre-processing and embedding to post-monitoring and quality assessment.
- chat-research — Use ChatGPT to accelerator your research.
- chatiq — A versatile Slack bot using GPT & Weaviate-powered long-term memory to accomplish various tasks.
- ChatLLM — Create a Python package.
- ChatSQL — Create a Python package.
- chemllmhack — A SDK for computational chemistry LLM hackthon
- chemrel — A project which focuses on automating and transferring chemical data extraction using span categorization and relation extraction models.
- chichitk — Python UI library built upon Tkinter
- Chicoasen — Librería para cargar archivos csv & json y conectar a una base de datos
- chinese-pdf-divider — divide chinese pdf file into blocks within 512
- Chocolate-App — no summary
- chunkifyr — Your ultimate toolkit for text chunking.
- chunknorris — A package for chunking documents from various formats
- chutoro — no summary
- circlemind — no summary
- cista — Dropbox-like file server with modern web interface
- civilpy — Civil Engineering Tools in Python
- clearedge — no summary
- closeai — Create a Python package.
- clown-sort — Sort screenshots based on rules or through individual review.
- cnmv-data — Extracción desde PDF de la cartera de inversión reportada por Fondos de Inversión a la CNMV
- codeforge — no summary
- Codexes2Gemini — Humans and AIs making books richer, more diverse, and more surprising.
- colibrie — Colibrie is a blazing fast tool to extract tables from PDFs
- colorblind_pdf — A package to process PDFs for testing colorblind accessibility.
- colrev — CoLRev: An open-source environment for collaborative reviews
- colrev-scidb — TODO
- comicbox-pdffile — A ZipFile like API for PyMuPDF
- comicpy — Tool to create CBR or CBZ files, supports PDF, ZIP, RAR files.
- commit0 — A development and evaluation framework for using language models to generate libraries.
- compare-pdf — A simple package to visually compare PDF files
- compiloor — no summary
- concall-tools — Tools to extract information from concall transcripts
- ContextQA — Chat with your data by leveraging the power of LLMs and vector databases
- convert-all — A CLI tool to convert files between formats