docowling

1.0.17 docowling-1.0.17-py3-none-any.whl

Wheel Details

Project: docowling
Version: 1.0.17
Filename: docowling-1.0.17-py3-none-any.whl
Size: 116154 bytes
MD5: e60915e7f68da15ba6e5d52a428de6cd
SHA256: 4a146c051ecd2b12068fce2a163c2b606a9026ec55348be65d9416ff5117a177
Uploaded: 2025-01-11 17:29:20 +0000
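
The size and SHA256 listed above can be used to check a downloaded copy of the wheel. The following is a minimal sketch using only the standard library; the function names `file_sha256` and `matches_listing` are illustrative and not part of docowling, and the local file path is whatever you saved the download as.

```python
import hashlib
from pathlib import Path

# Values copied from the Wheel Details listing.
EXPECTED_SIZE = 116154
EXPECTED_SHA256 = "4a146c051ecd2b12068fce2a163c2b606a9026ec55348be65d9416ff5117a177"


def file_sha256(path, chunk_size=65536):
    """Return the hex SHA-256 digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_listing(path, expected_size=EXPECTED_SIZE, expected_sha256=EXPECTED_SHA256):
    """Check a local file against a listed size (bytes) and SHA-256 (hex)."""
    path = Path(path)
    if path.stat().st_size != expected_size:
        return False
    return file_sha256(path) == expected_sha256
```

Calling `matches_listing("docowling-1.0.17-py3-none-any.whl")` on a downloaded file would return `True` only if both values agree with the listing. MD5 is left out of the check on purpose, since SHA-256 is the stronger of the two listed digests.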

dist-info

METADATA

Metadata-Version: 2.1
Name: docowling
Version: 1.0.17
Summary: SDK and CLI for parsing PDF, DOCX, HTML, and more, to a unified document representation for powering downstream workflows such as gen AI applications.
Author: Christoph Auer
Author-Email: cau[at]zurich.ibm.com
Home-Page: https://github.com/mouraworks/docowling
Project-Url: Repository, https://github.com/mouraworks/docowling
License: MIT
Keywords: docowling,convert,document,pdf,docx,html,markdown,layout model,segmentation,table structure,table former
Classifier: Development Status :: 5 - Production/Stable
Classifier: Intended Audience :: Developers
Classifier: Intended Audience :: Science/Research
Classifier: License :: OSI Approved :: MIT License
Classifier: Operating System :: MacOS :: MacOS X
Classifier: Operating System :: POSIX :: Linux
Classifier: Programming Language :: Python :: 3
Classifier: Programming Language :: Python :: 3.9
Classifier: Programming Language :: Python :: 3.10
Classifier: Programming Language :: Python :: 3.11
Classifier: Programming Language :: Python :: 3.12
Classifier: Programming Language :: Python :: 3.13
Classifier: Topic :: Scientific/Engineering :: Artificial Intelligence
Requires-Python: >=3.9,<4.0
Requires-Dist: beautifulsoup4 (<5.0.0,>=4.12.3)
Requires-Dist: certifi (>=2024.7.4)
Requires-Dist: chardet (<6.0.0,>=5.2.0)
Requires-Dist: deepsearch-glm (<2.0.0,>=1.0.0)
Requires-Dist: docling-core[chunking] (<3.0.0,>=2.12.1)
Requires-Dist: docling-ibm-models (<4.0.0,>=3.1.0)
Requires-Dist: docling-parse (<4.0.0,>=3.0.0)
Requires-Dist: easyocr (<2.0,>=1.7)
Requires-Dist: filetype (<2.0.0,>=1.2.0)
Requires-Dist: huggingface_hub (<1,>=0.23)
Requires-Dist: lxml (<6.0.0,>=4.0.0)
Requires-Dist: marko (<3.0.0,>=2.1.2)
Requires-Dist: ocrmac (<2.0.0,>=1.0.0); sys_platform == "darwin" and extra == "ocrmac"
Requires-Dist: onnxruntime (<1.20.0,>=1.7.0); python_version < "3.10" and extra == "rapidocr"
Requires-Dist: onnxruntime (<2.0.0,>=1.7.0); python_version >= "3.10" and extra == "rapidocr"
Requires-Dist: openpyxl (<4.0.0,>=3.1.5)
Requires-Dist: pandas (<3.0.0,>=2.1.4)
Requires-Dist: pydantic (<3.0.0,>=2.0.0)
Requires-Dist: pydantic-settings (<3.0.0,>=2.3.0)
Requires-Dist: pypdfium2 (<5.0.0,>=4.30.0)
Requires-Dist: python-docx (<2.0.0,>=1.1.2)
Requires-Dist: python-pptx (<2.0.0,>=1.0.2)
Requires-Dist: rapidocr-onnxruntime (<2.0.0,>=1.4.0); python_version < "3.13" and extra == "rapidocr"
Requires-Dist: requests (<3.0.0,>=2.32.3)
Requires-Dist: rtree (<2.0.0,>=1.3.0)
Requires-Dist: scipy (<2.0.0,>=1.6.0)
Requires-Dist: tesserocr (<3.0.0,>=2.7.1); extra == "tesserocr"
Requires-Dist: typer (<0.13.0,>=0.12.5)
Provides-Extra: ocrmac
Provides-Extra: rapidocr
Provides-Extra: tesserocr
Description-Content-Type: text/markdown
[Description omitted; length: 3297 characters]
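
Several of the Requires-Dist lines above carry an environment marker after the semicolon, and some of those markers gate the dependency behind an extra (`ocrmac`, `rapidocr`, `tesserocr`). As a rough illustration, the sketch below splits one such line into name, version specifier, and marker, then pulls out the extra name. It is a simplified regex for the parenthesized form shown in this listing, not a full dependency-specifier parser; the helper names are hypothetical, and the `packaging` library is the more robust choice for real use.

```python
import re

# Matches "name[extras] (specifier); marker" as printed in the listing.
_REQ_PATTERN = re.compile(
    r"^(?P<name>[A-Za-z0-9_.\-]+(?:\[[^\]]*\])?)"  # project name, optional [extras]
    r"\s*(?:\((?P<spec>[^)]*)\))?"                 # optional "(<2.0,>=1.7)"
    r"\s*(?:;\s*(?P<marker>.*))?$"                 # optional "; marker"
)
_EXTRA_PATTERN = re.compile(r'extra\s*==\s*"([^"]+)"')


def split_requirement(line):
    """Split a Requires-Dist line into (name, specifier, marker)."""
    line = line.strip()
    prefix = "Requires-Dist:"
    if line.startswith(prefix):
        line = line[len(prefix):].strip()
    match = _REQ_PATTERN.match(line)
    if match is None:
        raise ValueError(f"unrecognized requirement: {line!r}")
    return match.group("name"), match.group("spec"), match.group("marker")


def required_extra(marker):
    """Return the extra named in a marker, or None if it names no extra."""
    if not marker:
        return None
    found = _EXTRA_PATTERN.search(marker)
    return found.group(1) if found else None
```

With these helpers, a line such as `Requires-Dist: tesserocr (<3.0.0,>=2.7.1); extra == "tesserocr"` yields the name `tesserocr` and the extra `tesserocr`, while an unconditional line such as the one for `requests` yields no marker at all.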

WHEEL

Wheel-Version: 1.0
Generator: poetry-core 1.9.1
Root-Is-Purelib: true
Tag: py3-none-any
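
The `py3-none-any` tag recorded here also appears at the end of the wheel filename, where it is joined to the project name and version with hyphens. A small sketch of how that filename breaks down follows; `parse_wheel_filename` and `is_pure_python` are illustrative names, and the sketch assumes the filename follows the standard wheel naming scheme with an optional build tag.

```python
def parse_wheel_filename(filename):
    """Split a wheel filename into its hyphen-separated components."""
    suffix = ".whl"
    if not filename.endswith(suffix):
        raise ValueError(f"not a wheel filename: {filename!r}")
    parts = filename[: -len(suffix)].split("-")
    if len(parts) == 5:
        name, version, python, abi, platform = parts
        build = None
    elif len(parts) == 6:
        # The optional build tag sits between the version and the Python tag.
        name, version, build, python, abi, platform = parts
    else:
        raise ValueError(f"unexpected number of components: {filename!r}")
    return {
        "name": name,
        "version": version,
        "build": build,
        "python": python,
        "abi": abi,
        "platform": platform,
    }


def is_pure_python(tags):
    """True when the tags name no specific ABI and no specific platform."""
    return tags["abi"] == "none" and tags["platform"] == "any"
```

For `docowling-1.0.17-py3-none-any.whl` this gives the Python tag `py3`, ABI tag `none`, and platform tag `any`, which is consistent with `Root-Is-Purelib: true`.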

RECORD

Path Digest Size (bytes)
docowling/__init__.py sha256=47DEQpj8HBSa-_TImW-5JCeuQeRkm5NMpJWZG3hSuFU 0
docowling/backend/__init__.py sha256=47DEQpj8HBSa-_TImW-5JCeuQeRkm5NMpJWZG3hSuFU 0
docowling/backend/abstract_backend.py sha256=Z9tLgs7bCeAtG0fElpcBt2k1joSbNF3vBsEw_GxnIDQ 1746
docowling/backend/asciidoc_backend.py sha256=S0UIaXXasmWP3k6MhOo7N7lzeLbaEGwHJRu8SkoHdPU 14488
docowling/backend/csv_backend.py sha256=wQMX4RR05INjsFSh_pMRPXZk80C2WspXMSZx2PGoGeY 7907
docowling/backend/docling_parse_backend.py sha256=hoFDUiIhW9-fpd-X9I-ViNaNpSUa4yJJktD2x0wdLt8 7864
docowling/backend/docling_parse_v2_backend.py sha256=im1TKt1RzqxVXXUya2zwhXe8OKO_oC3Rxc4vsaNeYuk 8893
docowling/backend/html_backend.py sha256=2ayy5GI2NtUKIWpZRfpJLx7OMYSbLqXIcAmpoVXVl5g 16040
docowling/backend/md_backend.py sha256=Ig1t_BSaMZ5aB90PoPB1YvzN6v_E8hUiMverxP8bW0U 14384
docowling/backend/msexcel_backend.py sha256=l7rGpG411ASkUYIHUiQgbqSIXgQgpcQ498Ltl4qrwfY 12420
docowling/backend/mspowerpoint_backend.py sha256=lEBDEQuWekMI9t1RlbyQnPWzQKvgooj9ET7ZzQlQWb4 16124
docowling/backend/msword_backend.py sha256=8wY9ltu9hP04jUtgvwwSSCTZYsvGbSL5MZQILDVX-ok 19780
docowling/backend/pdf_backend.py sha256=daCmFsmcxu6WVo-2lYX7PNrtZ7s5ibVhLJgcyT2KDFk 2134
docowling/backend/pypdfium2_backend.py sha256=bOC-AnGSxcHY08ZomtAH6iL-9VqFbPRd7s60jOW8Acs 9262
docowling/backend/xml/__init__.py sha256=47DEQpj8HBSa-_TImW-5JCeuQeRkm5NMpJWZG3hSuFU 0
docowling/backend/xml/pubmed_backend.py sha256=Igm53qyIGhZP4X4BLcg1JcVjl5NSzZBcf4bMez1363s 21045
docowling/backend/xml/uspto_backend.py sha256=rQZm6rjFIQajeHte1mwb_t4XyB9thQ1u3Cu_pXfTNF4 72868
docowling/chunking/__init__.py sha256=PdbI8zEvy6ofTjqKustJtIIiYw5UR_K6w7wgaHxTneE 358
docowling/cli/__init__.py sha256=47DEQpj8HBSa-_TImW-5JCeuQeRkm5NMpJWZG3hSuFU 0
docowling/cli/main.py sha256=roVKW7K6sPN1aqq9akPJcq0M89j4SapROFbQsnisJAE 15479
docowling/datamodel/__init__.py sha256=47DEQpj8HBSa-_TImW-5JCeuQeRkm5NMpJWZG3hSuFU 0
docowling/datamodel/base_models.py sha256=NftI98wAiy_snU9QGFVZ2H_knG4qhhqB5Cde2G_kNoA 6546
docowling/datamodel/document.py sha256=I6DH0Uul2lKsYFrO0frgf5uIIOHpK4mz95sZbpnaa6Y 13495
docowling/datamodel/pipeline_options.py sha256=4DOwzoAQKxffdhtSwNUBPwyx4hTVuH-M0zdkYExvhaQ 7983
docowling/datamodel/settings.py sha256=JjT4sP-YYLQl1IPqij4uv8PUNG-e0bGho10vsZv5FpA 1352
docowling/document_converter.py sha256=ZrCLKYSjsWTP6Pf_Yr4iNDOCjYd-0BzZc6o-aHozpVQ 12999
docowling/exceptions.py sha256=025dSOsdGHQq-tiQuhngtoGvTaBf0cApq5OIqKBkuBo 91
docowling/models/__init__.py sha256=47DEQpj8HBSa-_TImW-5JCeuQeRkm5NMpJWZG3hSuFU 0
docowling/models/base_model.py sha256=8L_uPA6UQBFb8GxVzartoZsgQbEnSIRSpHWXYDNMCis 733
docowling/models/base_ocr_model.py sha256=FZeGasZ6xQ5JWJkIF371x_mR1peZZx1DU_pIY7d1Nrc 6611
docowling/models/ds_glm_model.py sha256=A1Fb7ZmJKkCws9RpsuQm_4uJH2O-ZwS0f75vxL7WHt4 13368
docowling/models/easyocr_model.py sha256=PfeIDq7hbhy3jQyqkZr1VvRIwabQ8mWCfAgZSID0e9Q 5136
docowling/models/layout_model.py sha256=P-_dO32SdMLlxv-BXLM20xoudi_RrIDdljnQKWJoJVs 10121
docowling/models/ocr_mac_model.py sha256=_PAku060KjncNXIx1ptcbFw9aDJg60H5C00nrC1lEZU 4632
docowling/models/page_assemble_model.py sha256=HSvNrbixaoRT7JVkimT6I-0Qa08h1H3366duDskkm0w 7802
docowling/models/page_preprocessing_model.py sha256=Oqsp1-KzOwi9ihP4KxYpIkzxd7uRfQBd0MXvBGEgMR0 2852
docowling/models/rapid_ocr_model.py sha256=o84tda-tldFO0fAhHuvIsyxPxzLn9v2ZQ8ojHXeZVdA 5121
docowling/models/table_structure_model.py sha256=nkZDtzmNxOsH1PQD1rAIJVa-Vb0FPvFrtWfK-FehlUI 9128
docowling/models/tesseract_ocr_cli_model.py sha256=PosmLgTsF3jVk5tPM6MjdwiT6q-C6M00U0LM8BrNWZU 6819
docowling/models/tesseract_ocr_model.py sha256=Md6VHsEb_b7iGppKzaM-ZCGyXSJXvo4wzc8oZiWo9Nk 6257
docowling/pipeline/__init__.py sha256=47DEQpj8HBSa-_TImW-5JCeuQeRkm5NMpJWZG3hSuFU 0
docowling/pipeline/base_pipeline.py sha256=9aWm_oA1qcux1v7mn2pQYJtJ8NZvnAmIlg8l8Clpsxk 8172
docowling/pipeline/simple_pipeline.py sha256=ktPrbu120vvKKWt5ZEyCE_vT8f4bJcnnzgCaPoOW6Uc 2322
docowling/pipeline/standard_pdf_pipeline.py sha256=LcKEivQkPjWO4IGTqQso7C7g_UEK2ZmJsGsX523PGuM 9436
docowling/py.typed sha256=frcCV1k9oG9oKj3dpUqdJg1PxRT2RSN_XKdLCPjaYaY 2
docowling/utils/__init__.py sha256=47DEQpj8HBSa-_TImW-5JCeuQeRkm5NMpJWZG3hSuFU 0
docowling/utils/accelerator_utils.py sha256=oYxGtrcf7d8sLGNjqeC2dY0oi9hQ16XfiIcc2ga-oNk 1411
docowling/utils/export.py sha256=pqg3WghwxsF0ExTgnNf19LWN8blQ9J_Q2HMwk1tM0FE 4844
docowling/utils/glm_utils.py sha256=C31U-8NqPrpO8MUhP7FdOwnpCmn5Kjz1aQ4py5Qp13I 11895
docowling/utils/layout_postprocessor.py sha256=ZdoJYZ-P2EfNsExlpZtwJ4el3ZXPjdML8clnRWHqG1s 24926
docowling/utils/profiling.py sha256=Ar6P1UdF56zNz26mK6y3T811dT-e8cTNKS_REsEnw-M 1819
docowling/utils/utils.py sha256=1I6B7lMlKCiwCPQPElygak3hCkVi1btmpRpBnA3AAIE 1249
docowling-1.0.17.dist-info/entry_points.txt sha256=oueefj0seZjrYqy8xEFhKGkNvDv_FeUa6khX72X2Ghw 52
docowling-1.0.17.dist-info/LICENSE sha256=MfxOLNJ_9a6qsSbCeaH70rm5_AQOclBM8rRdCrP22No 1141
docowling-1.0.17.dist-info/METADATA sha256=RwHuvVA9AOnz-x1Naskq-zTV1wnAlxEwN0tMGou5Ibo 6087
docowling-1.0.17.dist-info/WHEEL sha256=Nq82e9rUAnEjt98J6MlVmMCZb-t9cYE2Ir1kpBmnWfs 88
docowling-1.0.17.dist-info/RECORD
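
The digests in RECORD are SHA-256 hashes encoded as URL-safe base64 with the trailing `=` padding removed, which is why they differ in form from the hex SHA256 shown for the wheel itself. The sketch below reproduces that encoding and checks one row; `record_digest` and `entry_matches` are hypothetical helper names, and the row fields are passed in separately rather than parsed from the file.

```python
import base64
import hashlib


def record_digest(data):
    """Return a RECORD-style digest string for the given bytes."""
    raw = hashlib.sha256(data).digest()
    encoded = base64.urlsafe_b64encode(raw).rstrip(b"=").decode("ascii")
    return "sha256=" + encoded


def entry_matches(data, digest, size):
    """Check file contents against the digest and size from one RECORD row."""
    return len(data) == size and record_digest(data) == digest


# The empty __init__.py files all share the digest of zero bytes of input.
EMPTY_FILE_DIGEST = record_digest(b"")
```

Here `EMPTY_FILE_DIGEST` equals `sha256=47DEQpj8HBSa-_TImW-5JCeuQeRkm5NMpJWZG3hSuFU`, the value listed for every `__init__.py` of size 0 above. The RECORD file itself has no digest or size because it cannot contain its own hash.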

entry_points.txt

docowling = docowling.cli.main:app
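
The entry point line has the form `name = module:attribute`: the command name on the left, and on the right the module to import and the object inside it to call. The sketch below parses such a line and resolves the target with the standard library. The function names are illustrative, and the listing does not show which entry point group the line belongs to, so the sketch handles only the single line and makes no assumption about the group.

```python
import importlib


def parse_entry_point(line):
    """Split 'name = module:attr' into (name, module, attr)."""
    name, _, target = line.partition("=")
    module, _, attr = target.strip().partition(":")
    return name.strip(), module.strip(), attr.strip()


def load_entry_point(line):
    """Import the module named in an entry point line and return its target."""
    _, module, attr = parse_entry_point(line)
    obj = importlib.import_module(module)
    # The attribute may be dotted (for example "Class.method"); walk each part.
    for part in filter(None, attr.split(".")):
        obj = getattr(obj, part)
    return obj
```

Applied to the line above, `parse_entry_point` returns `("docowling", "docowling.cli.main", "app")`. Calling `load_entry_point` on it requires docowling and its dependencies to be installed, since it imports `docowling.cli.main`.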