parsethisio

View on PyPIReverse Dependencies (0)

0.1.1 parsethisio-0.1.1-py3-none-any.whl

Wheel Details

Project: parsethisio
Version: 0.1.1
Filename: parsethisio-0.1.1-py3-none-any.whl
Download: [link]
Size: 23888
MD5: 6216abc88496740915e939dfc7977792
SHA256: c76da99b23395cbad1fe6fc099a6ea9332c45f3bd48caa8252d409c8b86f285f
Uploaded: 2025-03-02 21:18:08 +0000

dist-info

METADATA

Metadata-Version: 2.2
Name: parsethisio
Version: 0.1.1
Summary: A Python library to extract text from various sources for LLM preprocessing.
Author-Email: Jörn Depenbrock <joern[at]jdde.de>
License: GNU Affero General Public License v3.0
Keywords: text extraction,LLM,PDF,web,preprocessing
Classifier: Programming Language :: Python :: 3
Classifier: Operating System :: OS Independent
Requires-Python: <4.0,>=3.10
Requires-Dist: requests
Requires-Dist: PyPDF2
Requires-Dist: coverage
Requires-Dist: openai
Requires-Dist: filetype
Requires-Dist: scrapegraphai
Requires-Dist: youtube_transcript_api
Requires-Dist: loguru
Requires-Dist: gitingest
Requires-Dist: aiohttp
Requires-Dist: markitdown
Requires-Dist: beautifulsoup4
Requires-Dist: coverage; extra == "dev"
Requires-Dist: coverage-badge; extra == "dev"
Requires-Dist: pytest; extra == "dev"
Provides-Extra: dev
Description-Content-Type: text/markdown
License-File: LICENSE
[Description omitted; length: 5856 characters]

WHEEL

Wheel-Version: 1.0
Generator: setuptools (75.8.2)
Root-Is-Purelib: true
Tag: py3-none-any

RECORD

Path Digest Size
content_parser/__init__.py sha256=kUR5RAFc7HCeiqdlX36dZOHkUI5wI6V_43RpEcD8b-0 22
content_parser/archive_parser.py sha256=9c6UnqFnatcwNbIgNb7mtIC7XVKMbWTos83i6SwdHdA 1109
content_parser/audio_parser.py sha256=_4r7_2PGfe_hwuNk9zv-UXporrWzStOckzRufmGJLY0 1069
content_parser/base_parser.py sha256=ayN8G-YgBkRLXr_ZaUFd8BOxITyq9LX5hpjnhqHNT-E 392
content_parser/data_parser.py sha256=yuC9UfjzHarFBVpBWtTEMszhfglPcfC1ZPDR3O4moXo 1570
content_parser/image_parser.py sha256=oIawJymR4x-0b2IAqMe_ZMV_fL9bqMu0JsLZDyLAOO4 1934
content_parser/office_parser.py sha256=Hzt4J0ZKxGQuMq8uJE011hueRM1UVXWex12AByPwltE 1889
content_parser/pdf_parser.py sha256=RRA2N0kBeSB7TzS5wB3X-CF8IQ6ThZCR4lUlvLDI-SU 1346
content_parser/text_parser.py sha256=TuQCRgw6y2Ek5FN7WRm5AHlZ4ptNigdVP89mlzBxviw 3546
content_parser/helpers/__init__.py sha256=47DEQpj8HBSa-_TImW-5JCeuQeRkm5NMpJWZG3hSuFU 0
content_parser/helpers/youtube_transcript_helper.py sha256=0VOeoy7VINREWQakUIJAGglHvC8oqSRR8bbPl6F_1rs 5059
parsethisio-0.1.1.dist-info/LICENSE sha256=z5aMVBrc-9kk0pxaOdOAkI62S5YhYMiF6ADsMtZmOZg 34353
parsethisio-0.1.1.dist-info/METADATA sha256=kDKGZ3blc5ThSJq4wfbKjMeO3Nd78ikhmrQoEH3fvuM 6812
parsethisio-0.1.1.dist-info/WHEEL sha256=jB7zZ3N9hIM9adW7qlTAyycLYW9npaWKLRzaoVcLKcM 91
parsethisio-0.1.1.dist-info/top_level.txt sha256=lec9JwxVuv25dWwyqHikAANgbiPRp7BOu3piB27XYFk 15
parsethisio-0.1.1.dist-info/RECORD

top_level.txt

content_parser