pyspark-pdf

View on PyPIReverse Dependencies (0)

0.1.0rc9 pyspark_pdf-0.1.0rc9-py3-none-any.whl

Wheel Details

Project: pyspark-pdf
Version: 0.1.0rc9
Filename: pyspark_pdf-0.1.0rc9-py3-none-any.whl
Download: [link]
Size: 189421
MD5: 28552447f9d3eb57f2b06a14fabc3b41
SHA256: 3bb3f5ae8150397fa0ecda6281ac5f6d289aadb71212ae4272504d9f7927784b
Uploaded: 2024-11-07 17:51:54 +0000

dist-info

METADATA

Metadata-Version: 2.1
Name: pyspark-pdf
Version: 0.1.0rc9
Summary: Spark-Pdf is a library for processing documents using Apache Spark
Author: Mykola Melnyk
Author-Email: mykola[at]stabrise.com
Home-Page: https://github.com/StabRise/spark-pdf
Project-Url: Repository, https://github.com/StabRise/spark-pdf
License: AGPL-3.0
Classifier: License :: OSI Approved :: GNU Affero General Public License v3
Classifier: Programming Language :: Python :: 3
Classifier: Programming Language :: Python :: 3.10
Classifier: Programming Language :: Python :: 3.11
Classifier: Programming Language :: Python :: 3.12
Classifier: Programming Language :: Python :: 3.13
Requires-Python: >=3.10,<4.0
Requires-Dist: PyMuPDF (==1.24.11)
Requires-Dist: imagesize (==1.4.1)
Requires-Dist: numpy (<2.0.0,>=1.26.4)
Requires-Dist: pandas (<3.0.0,>=2.2.2)
Requires-Dist: pillow (<11.0.0,>=10.4.0)
Requires-Dist: pyarrow (==17.0.0)
Requires-Dist: pyspark (==3.5.3)
Requires-Dist: pytesseract (==0.3.13)
Requires-Dist: pytest (<8.0.0,>=7.4.4)
Requires-Dist: torch (<3.0.0,>=2.3.0); extra == "ml"
Requires-Dist: transformers (<5.0.0,>=4.42.0); extra == "ml"
Provides-Extra: ml
Description-Content-Type: text/markdown
[Description omitted; length: 1613 characters]

WHEEL

Wheel-Version: 1.0
Generator: poetry-core 1.9.1
Root-Is-Purelib: true
Tag: py3-none-any

RECORD

Path Digest Size
sparkpdf/VERSION sha256=ea9gcXDig6JEX33BeVUg0QmjNpCvTatcbbyr2LhwjE0 8
sparkpdf/__init__.py sha256=3_EXRH8XBYkRsQ3DYAHvTAr2lwPdxX0IRb7Sm7-AvUU 4173
sparkpdf/enums.py sha256=7RHBhZ0YrB__KFiSwTzKL58uX6GVzefj3-ugWZLZVLg 722
sparkpdf/image/DataToImage.py sha256=kMro-D_V5TPcPW8Xx7_QlAEn8BfiyokPmzLwW3eqmUQ 1619
sparkpdf/image/ImageDrawBoxes.py sha256=bCymMbOZJXrIMCSys7kEPpB4dzDoUszQ8zei-tpSYjw 6686
sparkpdf/image/__init__.py sha256=47DEQpj8HBSa-_TImW-5JCeuQeRkm5NMpJWZG3hSuFU 0
sparkpdf/models/__init__.py sha256=47DEQpj8HBSa-_TImW-5JCeuQeRkm5NMpJWZG3hSuFU 0
sparkpdf/models/ner/BaseNer.py sha256=sPTimmbi9gR6h-fI3WRvgJr4ZS_EN6E2Y3wZJizw88M 3712
sparkpdf/models/ner/Ner.py sha256=uB-_LFcRNiMd4HBxzC7h4d_2yr_ce7evSrv6Mz_a_uE 6268
sparkpdf/models/ner/__init__.py sha256=47DEQpj8HBSa-_TImW-5JCeuQeRkm5NMpJWZG3hSuFU 0
sparkpdf/models/recognizers/TesseractOcr.py sha256=I8dRfIl_1v51T75Zj7cMReinPmwJLsy8JvSen20zjG8 11232
sparkpdf/models/recognizers/__init__.py sha256=47DEQpj8HBSa-_TImW-5JCeuQeRkm5NMpJWZG3hSuFU 0
sparkpdf/params.py sha256=ppLPMmT4HJ0moenqbv92JaMbcaZXGmUVXEbnOIf8oY0 10845
sparkpdf/pdf/PdfDataToImage.py sha256=iChfxqeNAJ9DLqNKeFi5alCqUEQQT7bsC8nl4q7Ig9k 3119
sparkpdf/pdf/__init__.py sha256=47DEQpj8HBSa-_TImW-5JCeuQeRkm5NMpJWZG3hSuFU 0
sparkpdf/pipeline/PandasPipeline.py sha256=FBzacOZXOIUGRN7bKPAZGLmxVIqo4saDI6fK0AKMI1A 2035
sparkpdf/pipeline/__init__.py sha256=47DEQpj8HBSa-_TImW-5JCeuQeRkm5NMpJWZG3hSuFU 0
sparkpdf/resources/images/SparkPdfLogo.png sha256=Ilp6_UmE4unGzebPE5qCWAsGwJ-Fc8j1jqq9CK_X-vc 21867
sparkpdf/resources/pdfs/SparkPdf.pdf sha256=Uc3JiCBQ-OGq0maL7rwDi8AAh64oobFC5xqsVmkzpco 84947
sparkpdf/resources/pdfs/example.pdf sha256=k9ivhDZUlDMsoQXVNqf33vX7qp-kAx-58qWt5209-Nc 48025
sparkpdf/schemas/Box.py sha256=wexNWlo4Ai9gDBYvbCKRTeLbFyrrcKGz7Lfttu-Q38c 1036
sparkpdf/schemas/Document.py sha256=QH4KOWAaNNLSxysWRbDBwfZ1vsLWYJHpV6hD-pClr48 442
sparkpdf/schemas/Entity.py sha256=ggysMq8jPb3p-GpDY6VGnuYlwBjqVaQIh0k9iNgXJM4 463
sparkpdf/schemas/Image.py sha256=lts2uCQYb6gAZEokRX6A7nsnzLQr1usJ1cgSXqxq5BY 3134
sparkpdf/schemas/NerOutput.py sha256=0i4XsmLGPKgf2R8553wOHIm-pz-fqZenEQtnchLe4SE 415
sparkpdf/schemas/__init__.py sha256=47DEQpj8HBSa-_TImW-5JCeuQeRkm5NMpJWZG3hSuFU 0
sparkpdf/text/TextToDocument.py sha256=Al26J1LuLog9HKdI-JIe_MHRVvvufnKsBykAKSFBUc0 1024
sparkpdf/text/__init__.py sha256=47DEQpj8HBSa-_TImW-5JCeuQeRkm5NMpJWZG3hSuFU 0
sparkpdf/utils/__init__.py sha256=iVx5PRvgPlDk2-K3V0aFCaDTWdSuShmyiCxyGvO7sWs 765
sparkpdf/utils/auxiliary.py sha256=NzJwqX9LuFqHsMEX3Dl8wyUb2xDTkPlncOXxs_7SEiM 1483
sparkpdf/utils/dataclass.py sha256=dVc0-VFexl8GpHzo-MUHqJVnLKQAqSmov8CRbHyKMPU 5922
sparkpdf/utils/show_utils.py sha256=_cX9j3XgfKbFw9v_55xbNqeGS7Xx9JfqER996o6AwKg 5955
sparkpdf/utils/templates/base.html sha256=49Z5UBHWUDmeHEGORKqLK_lSzZLy3e2_3nUeIBTNZZ4 721
sparkpdf/utils/templates/image.html sha256=lSlqTlK0a2L8FxIp-9KPPpEvfgIJhzArJ1CfoxWxPrk 144
sparkpdf/utils/templates/ner.html sha256=1NlxKXVyZBqDJ4HTR9PRyIGE9IEGHb1YjJXQ6qgsukg 174
sparkpdf/utils/templates/text.html sha256=gxY5VT5aBz-zwrVn8cWHT7-eNdrLw1S1jjkFc-RtpHU 224
pyspark_pdf-0.1.0rc9.dist-info/LICENSE sha256=ILBn-G3jdarm2w8oOrLmXeJNU3czuJvVhDLBASWdhM8 34522
pyspark_pdf-0.1.0rc9.dist-info/METADATA sha256=v8yyH7jjNXO5M9u7-QwXBu58D2G_ha2guAje27QH1yo 2799
pyspark_pdf-0.1.0rc9.dist-info/WHEEL sha256=Nq82e9rUAnEjt98J6MlVmMCZb-t9cYE2Ir1kpBmnWfs 88
pyspark_pdf-0.1.0rc9.dist-info/RECORD