Reverse Dependencies of pdfplumber
The following projects have a declared dependency on pdfplumber:
- aclpubcheck — no summary
- aeiva — aeiva is a general AI agent framework
- agent-llm — An Artificial Intelligence Automation Platform. AI Instruction management from various providers, has an adaptive memory, and a versatile plugin system with many commands including web browsing. Supports many AI providers and models and growing support every day.
- AGIpdf2json — This package can help user parse PDF files into text file and JSON file. Additionally, it can help user parse question-answer pairs into a JSONL document in prompt-completion format, that is supported by OpenAI
- agixt — An Artificial Intelligence Automation Platform. AI Instruction management from various providers, has an adaptive memory, and a versatile plugin system with many commands including web browsing. Supports many AI providers and models and growing support every day.
- agl-report-reader — AGL Report Reader app
- aideate-blade — web content aideate_scraper
- ankigengpt — no summary
- appjsonify — An academic paper PDF to JSON conversion toolkit.
- aqua-parser — An amazing aquaparser-parser.
- arfindata — Python package for the aquisition and pre-treatment of China's listed companies' annual reports
- Artemisa — Sistema de extración de información de documentos
- arxiv-astro-summarizer — Scrapes arXiv astro-ph paper, summarizes the abstract, and returns relavant papers according to a user input.
- autoanki — Automatically make Anki Decks for Chinese text
- AutoRAG — Automatically Evaluate RAG pipelines with your own data. Find optimal structure for new RAG product.
- azure-genai-utils — Utility functions for Azure GenAI
- bank-statement-reader-altara — no summary
- bankruptcy — A bankruptcy document parser.
- banner-of-light-research-assistant — This application prepares data from the 19th century newspaper Banner of Light to be analyzed by a AI research assistant powered by OpenAI
- bany — A collection of scripts for personal finance
- beancount-cmb-importer — A beancount importer for CMB.
- beancount-reds-importers — Importers for various institutions for Beancount
- bisheng-unstructured — ETLs fro LLMs
- botrun-ask-folder — no summary
- bsrag-unstructured — A Python package with a built-in web application
- cannlytics — 🔥 Cannlytics is a suite of tools that you can use to wrangle, standardize, and analyze cannabis data
- cdt.ai — Cognitive Data Transformer for text, image, audio, and video processing
- ChinesePatentParser — A Python script that can parse a Chinese patent of invention type to extract fields, sections, and subsections in it.
- cobralib — A utilities module that contains classes and functions that simplify interfaces with files and databases.
- comwares — This project provides middlewares for a startup company.
- conc_test_report — Generate a concise and brief summary of all concrete test result PDFs, to aid in fast and efficient review
- contract-analyzer — A RAG system for contract analysis
- credit-pdfextract — a pdf extract library
- crewai — Cutting-edge framework for orchestrating role-playing, autonomous AI agents. By fostering collaborative intelligence, CrewAI empowers agents to work together seamlessly, tackling complex tasks.
- cropioai — Cutting-edge framework for orchestrating role-playing, autonomous AI agents. By fostering collaborative intelligence, CropioAI empowers agents to work together seamlessly, tackling complex tasks.
- cv-xtractor — A Python package for extracting information from CVs (resumes).
- data-modori — LMOps Tool for Korean
- dbgpt-ext — Add your description here
- deepdoctection — Repository for Document AI
- deepsearcher — None
- depdf — PDF table & paragraph extractor
- desktop-env — The package provides a desktop environment for setting and evaluating desktop automation tasks.
- digital-nondigital-pdf-extraction — This module will return whether PDF is Digital, Non-Digital or Mixed.
- disclosure-extractor — A data extraction tool from judge financial disclosures.
- doc-extractor — no summary
- doc-summarizer — Text summarizer implemented via langchain.
- doc2html — Convert documents between formats. For example docx to html or pdf to html
- docint — Extracting information from DOCuments INTelligently.
- docparser-feb — Document parsing tool for LLM training and Rag
- docqa — DocQA: An easy way to extract information from documents
- docquery — DocQuery: An easy way to extract information from documents
- docquery-test — DocQuery: An easy way to extract information from documents
- documentdataextraction — Change the yml template format in GenerateExtractTemplate class
- DocumentInsightsGenerator — A package to generate comprehensive insights from documents using NLP techniques.
- docusearch — Query documents quickly and efficiently
- dost — DOST is a Python based Utility platform as an Open Source project. We strive to liberate humans from mundane, repetitive tasks, giving them more time to use their intellect and creativity to solve higher-order business challenges and perform knowledge work.
- dp-PDF-Crawler — A custom Flask package with PDF processing tools
- ebank — no summary
- ebanktool — no summary
- eegunity — An open source Python pacakge for large-scale EEG datasets processing
- efficient-ocr — Efficient OCR
- ezlocalai — ezlocalai is an easy to set up local multimodal artificial intelligence server with OpenAI Style Endpoints.
- fgts-pdf-dados — Extrai dados de PDFs do FGTS e grava tudo em arquivo CSV pronto para usar com o Inverstorzilla.
- fileseek — FileSeek – AI-Powered Local Document Archive&Search
- finance-analytics — extract and analyze Bank statements
- fincept-terminal — A Terminal for Financial Market Analysis and Fetching all kinds of Data.
- find-in-pdf — A simple tool to search for strings or list PDF files within a directory.
- find-keyword-xtvu — A package to find keywords in .pdf, .docx, .odt, and .rtf files, with support for multiple languages and the ability to run on multiple CPU cores
- flexidata — FlexiData is an open-source Python package designed for processing unstructured data.
- friday-agent — An self-improving embodied conversational agent seamlessly integrated into the operating system to automate our daily tasks.
- getpaper — getpaper - papers download made easy!
- global-parser-lib — A library for parsing various file types.
- gpt-pdf-organizer — no summary
- grag — A simple package for implementing RAG
- graphtomation — An AI utility package to build and serve Crew and LangGraph workflows as FastAPI routes, packed with reusable components for AI engineers.
- intelligence-toolkit — Interactive workflows for generating AI intelligence reports from real-world data sources using GPT models
- invoice-parser — Tools for parsing and extracting information from invoices.
- invoice2data — Python parser to extract data from pdf invoice
- irspdf — A simple information retrieval system for pdf documents
- k-parse-tool — parse and extract data from HTML
- leitor-pdf — no summary
- lemon-rag — no summary
- lexoid — no summary
- llm-agent-toolkit — LLM Agent Toolkit provides minimal, modular interfaces for core components in LLM-based applications.
- llmvm-cli — Command Line LLM with client-side tools support.
- local-deep-research — AI-powered research assistant with deep, iterative analysis using LLMs and web searches
- lpw — Using Local Packet Whisperer (LPW, Chat with PCAP/PCAPNG files locally, privately!
- LWD-utils — rename-version of PaperCrawlerUtil
- makerbean — A small educational purpose package
- mctinctools — Common tools for our organization.
- mdfileconvert — Convierte archivos a formato Markdown extrayendo texto, tablas e imágenes.
- megaparse — no summary
- metabook — rename and organize your pdf book collection
- mimir-ai — no summary
- miro-to-mermaid — Miro to Mermaid diagram converter
- ml-access-key-extractor — Biblioteca para extrair chave de acesso na nota fiscal em PDF
- mmda — MMDA - multimodal document analysis
- moonai — Cutting-edge framework for orchestrating role-playing, autonomous AI agents. By fostering collaborative intelligence, moonai empowers agents to work together seamlessly, tackling complex missions.
- MordinezNLP — Powerfull python tool for modern NLP processing
- msds-tdm — MSDS Package