pdftokenizer

0.0.4 pdftokenizer-0.0.4-py3-none-any.whl

Wheel Details

Project: pdftokenizer
Version: 0.0.4
Filename: pdftokenizer-0.0.4-py3-none-any.whl
Size: 8542 bytes
MD5: 4b23ccd08fd6d9a8c4897afe15264a49
SHA256: 22dae8bde95c1d5d44d3b5193abb1e4b344496b18edc500362eb8a71f082e4a5
Uploaded: 2025-01-20 00:21:20 +0000
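The size and SHA256 values above can be used to check a downloaded copy of the wheel. The following is a minimal sketch using only the Python standard library; the helper name `verify_wheel` and the local file path in the usage comment are assumptions for illustration, not part of pdftokenizer.

```python
import hashlib
from pathlib import Path

# Values copied from the wheel details listed above.
EXPECTED_SIZE = 8542
EXPECTED_SHA256 = "22dae8bde95c1d5d44d3b5193abb1e4b344496b18edc500362eb8a71f082e4a5"


def verify_wheel(path, expected_size, expected_sha256):
    """Return True if the file at `path` matches the expected size and SHA256.

    Hypothetical helper: reads the whole file into memory, which is fine
    for a wheel of a few kilobytes.
    """
    data = Path(path).read_bytes()
    if len(data) != expected_size:
        return False
    return hashlib.sha256(data).hexdigest() == expected_sha256.lower()


# Usage, assuming the wheel was saved to the current directory:
# ok = verify_wheel("pdftokenizer-0.0.4-py3-none-any.whl",
#                   EXPECTED_SIZE, EXPECTED_SHA256)
```

SHA256 is the value to rely on here; the MD5 digest is listed for completeness but is not suitable as an integrity guarantee against tampering.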

dist-info

METADATA

Metadata-Version: 2.4
Name: pdftokenizer
Version: 0.0.4
Summary: Tool to extract PAWLs tokens from PDFs
Author-Email: JSv4 <scrudato[at]umich.edu>
Project-Url: Documentation, https://github.com/JSv4/pdftokenizer#readme
Project-Url: Issues, https://github.com/JSv4/pdftokenizer/issues
Project-Url: Source, https://github.com/JSv4/pdftokenizer
Classifier: Development Status :: 4 - Beta
Classifier: Programming Language :: Python
Classifier: Programming Language :: Python :: 3.10
Classifier: Programming Language :: Python :: 3.11
Classifier: Programming Language :: Python :: 3.12
Classifier: Programming Language :: Python :: Implementation :: CPython
Classifier: Programming Language :: Python :: Implementation :: PyPy
Requires-Python: >=3.10
Requires-Dist: pdf2image (>=1.17.0)
Requires-Dist: pdfplumber (>=0.11.0)
Requires-Dist: plasmapdf (==0.1.2)
Requires-Dist: pypdf
Requires-Dist: pytesseract (>=0.3.0)
Description-Content-Type: text/markdown
License-Expression: MIT
License-File: LICENSE.txt
[Description omitted; length: 1998 characters]
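Core metadata uses an email-header style layout, so the fields above can be read with the standard library's `email.parser`. The sketch below parses a shortened copy of the listing; `SAMPLE` and `read_core_metadata` are illustrative names, and the requirement strings are reproduced as this page displays them, which may differ slightly from the raw METADATA file.

```python
from email.parser import Parser

# Shortened copy of the METADATA fields shown above (illustrative only).
SAMPLE = """\
Metadata-Version: 2.4
Name: pdftokenizer
Version: 0.0.4
Requires-Python: >=3.10
Requires-Dist: pdf2image (>=1.17.0)
Requires-Dist: pdfplumber (>=0.11.0)
Requires-Dist: plasmapdf (==0.1.2)
Requires-Dist: pypdf
Requires-Dist: pytesseract (>=0.3.0)
"""


def read_core_metadata(text):
    """Parse core-metadata text into a small dict (hypothetical helper)."""
    msg = Parser().parsestr(text)
    return {
        "name": msg["Name"],
        "version": msg["Version"],
        "requires_python": msg["Requires-Python"],
        # Repeated headers must be collected with get_all().
        "requires_dist": msg.get_all("Requires-Dist") or [],
    }


info = read_core_metadata(SAMPLE)
print(info["name"], info["version"], len(info["requires_dist"]))
```

For a package that is already installed, `importlib.metadata.metadata("pdftokenizer")` returns a comparable message object without any manual parsing.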

WHEEL

Wheel-Version: 1.0
Generator: hatchling 1.27.0
Root-Is-Purelib: true
Tag: py3-none-any

RECORD

Path Digest Size
pdftokenizer/__about__.py sha256=aBGuHwpyTF5WpHVTiuWJCaHebyx3m3kqFadf6mLP9wI 124
pdftokenizer/__init__.py sha256=qRDFaJOd3XDXFu9BFWpV3Pkw0R_nbbZL4ZwOkZImDPc 1406
pdftokenizer/types.py sha256=zWaZgQcbROmtilUkUzFzNt1D0B7OEYhoCheYN6-4WUs 259
pdftokenizer/utils.py sha256=ehErXcUFlG9NHXN7hSE1lLna1rOWmwoGtjP75IXPyXc 3395
pdftokenizer/extractors/__init__.py sha256=FDLlIp_Z76RVGt9BxcP3vIkC4hXETRwYawzCyzZ1XrY 205
pdftokenizer/extractors/base.py sha256=XqM49ZMs5YdyE35GQKNIqS6Tu_h735Zt2SorJqUHAlk 415
pdftokenizer/extractors/pdfplumber.py sha256=Yh6LsppD2lOIVc28AsA5WjLPCbtaHLT3lboEweBvKf4 1752
pdftokenizer/extractors/tesseract.py sha256=8oeSAyybacgybpil03O4Sz45oZvWLWh9795TpMispPM 3126
pdftokenizer-0.0.4.dist-info/METADATA sha256=JcnAX73VA4QQw1I2e7fuUQfT2YlWi-lf1nHiLSIdBWA 2995
pdftokenizer-0.0.4.dist-info/WHEEL sha256=qtCwoSJWgHk21S1Kb4ihdzI2rlJ1ZKaIurTj_ngOhyQ 87
pdftokenizer-0.0.4.dist-info/licenses/LICENSE.txt sha256=wvIPgHGvCpGc3uXMr3bDMY0y90-tewONiM3wYNznULc 1099
pdftokenizer-0.0.4.dist-info/RECORD
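Each digest in the RECORD listing is the file's SHA-256 hash encoded as URL-safe base64 with the trailing `=` padding removed, and the RECORD file itself is stored in the wheel as CSV (path, digest, size) with no digest for its own entry. The sketch below recomputes those digests for every file in a wheel; `record_digest` and `verify_record` are hypothetical helper names, and the wheel path in the usage comment is an assumption.

```python
import base64
import csv
import hashlib
import io
import zipfile


def record_digest(data):
    """Return the RECORD-style digest string for raw file bytes."""
    raw = hashlib.sha256(data).digest()
    encoded = base64.urlsafe_b64encode(raw).rstrip(b"=").decode("ascii")
    return "sha256=" + encoded


def verify_record(wheel, record_path):
    """Return the paths whose digest or size disagrees with RECORD.

    `wheel` may be a filesystem path or a binary file-like object.
    """
    mismatches = []
    with zipfile.ZipFile(wheel) as zf:
        text = zf.read(record_path).decode("utf-8")
        for row in csv.reader(io.StringIO(text)):
            if len(row) < 3:
                continue
            path, digest, size = row[0], row[1], row[2]
            if not digest:
                # The RECORD entry for RECORD itself carries no digest.
                continue
            data = zf.read(path)
            size_ok = (not size) or len(data) == int(size)
            if record_digest(data) != digest or not size_ok:
                mismatches.append(path)
    return mismatches


# Usage, assuming the wheel was saved to the current directory:
# bad = verify_record("pdftokenizer-0.0.4-py3-none-any.whl",
#                     "pdftokenizer-0.0.4.dist-info/RECORD")
```

An empty list from `verify_record` means every listed file matched its recorded digest and size. This sketch only handles `sha256=` entries, which is the only algorithm that appears in the listing above.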