unstructured

Wheel Details

Project: unstructured
Version: 0.16.5
Filename: unstructured-0.16.5-py3-none-any.whl
Size: 1741817 bytes
MD5: 4835886f100116930f7568014d8bf54d
SHA256: d867e6d5c002c159997bb44df82c43531570c32fa87a010a0aae8a7a0e22ec49
Uploaded: 2024-11-07 20:34:04 +0000

dist-info

METADATA

Metadata-Version: 2.1
Name: unstructured
Version: 0.16.5
Summary: A library that prepares raw documents for downstream ML tasks.
Author: Unstructured Technologies
Author-Email: devops[at]unstructuredai.io
Home-Page: https://github.com/Unstructured-IO/unstructured
License: Apache-2.0
Keywords: NLP PDF HTML CV XML parsing preprocessing
Classifier: Development Status :: 4 - Beta
Classifier: Intended Audience :: Developers
Classifier: Intended Audience :: Education
Classifier: Intended Audience :: Science/Research
Classifier: License :: OSI Approved :: Apache Software License
Classifier: Operating System :: OS Independent
Classifier: Programming Language :: Python :: 3
Classifier: Programming Language :: Python :: 3.9
Classifier: Programming Language :: Python :: 3.10
Classifier: Programming Language :: Python :: 3.11
Classifier: Programming Language :: Python :: 3.12
Classifier: Topic :: Scientific/Engineering :: Artificial Intelligence
Requires-Python: >=3.9.0,<3.13
Requires-Dist: chardet
Requires-Dist: filetype
Requires-Dist: python-magic
Requires-Dist: lxml
Requires-Dist: nltk
Requires-Dist: requests
Requires-Dist: beautifulsoup4
Requires-Dist: emoji
Requires-Dist: dataclasses-json
Requires-Dist: python-iso639
Requires-Dist: langdetect
Requires-Dist: numpy (<2)
Requires-Dist: rapidfuzz
Requires-Dist: backoff
Requires-Dist: typing-extensions
Requires-Dist: unstructured-client
Requires-Dist: wrapt
Requires-Dist: tqdm
Requires-Dist: psutil
Requires-Dist: python-oxmsg
Requires-Dist: html5lib
Requires-Dist: pdfminer.six; extra == "all-docs"
Requires-Dist: markdown; extra == "all-docs"
Requires-Dist: xlrd; extra == "all-docs"
Requires-Dist: pdf2image; extra == "all-docs"
Requires-Dist: onnx; extra == "all-docs"
Requires-Dist: pi-heif; extra == "all-docs"
Requires-Dist: unstructured-inference (==0.8.1); extra == "all-docs"
Requires-Dist: networkx; extra == "all-docs"
Requires-Dist: effdet; extra == "all-docs"
Requires-Dist: openpyxl; extra == "all-docs"
Requires-Dist: python-docx (>=1.1.2); extra == "all-docs"
Requires-Dist: pikepdf; extra == "all-docs"
Requires-Dist: pypdf; extra == "all-docs"
Requires-Dist: google-cloud-vision; extra == "all-docs"
Requires-Dist: python-pptx (>=1.0.1); extra == "all-docs"
Requires-Dist: pypandoc; extra == "all-docs"
Requires-Dist: pandas; extra == "all-docs"
Requires-Dist: unstructured.pytesseract (>=0.3.12); extra == "all-docs"
Requires-Dist: pandas; extra == "csv"
Requires-Dist: python-docx (>=1.1.2); extra == "doc"
Requires-Dist: python-docx (>=1.1.2); extra == "docx"
Requires-Dist: pypandoc; extra == "epub"
Requires-Dist: langdetect; extra == "huggingface"
Requires-Dist: sacremoses; extra == "huggingface"
Requires-Dist: sentencepiece; extra == "huggingface"
Requires-Dist: torch; extra == "huggingface"
Requires-Dist: transformers; extra == "huggingface"
Requires-Dist: onnx; extra == "image"
Requires-Dist: pdf2image; extra == "image"
Requires-Dist: pdfminer.six; extra == "image"
Requires-Dist: pikepdf; extra == "image"
Requires-Dist: pi-heif; extra == "image"
Requires-Dist: pypdf; extra == "image"
Requires-Dist: google-cloud-vision; extra == "image"
Requires-Dist: effdet; extra == "image"
Requires-Dist: unstructured-inference (==0.8.1); extra == "image"
Requires-Dist: unstructured.pytesseract (>=0.3.12); extra == "image"
Requires-Dist: pdfminer.six; extra == "local-inference"
Requires-Dist: markdown; extra == "local-inference"
Requires-Dist: xlrd; extra == "local-inference"
Requires-Dist: pdf2image; extra == "local-inference"
Requires-Dist: onnx; extra == "local-inference"
Requires-Dist: pi-heif; extra == "local-inference"
Requires-Dist: unstructured-inference (==0.8.1); extra == "local-inference"
Requires-Dist: networkx; extra == "local-inference"
Requires-Dist: effdet; extra == "local-inference"
Requires-Dist: openpyxl; extra == "local-inference"
Requires-Dist: python-docx (>=1.1.2); extra == "local-inference"
Requires-Dist: pikepdf; extra == "local-inference"
Requires-Dist: pypdf; extra == "local-inference"
Requires-Dist: google-cloud-vision; extra == "local-inference"
Requires-Dist: python-pptx (>=1.0.1); extra == "local-inference"
Requires-Dist: pypandoc; extra == "local-inference"
Requires-Dist: pandas; extra == "local-inference"
Requires-Dist: unstructured.pytesseract (>=0.3.12); extra == "local-inference"
Requires-Dist: markdown; extra == "md"
Requires-Dist: python-docx (>=1.1.2); extra == "odt"
Requires-Dist: pypandoc; extra == "odt"
Requires-Dist: pypandoc; extra == "org"
Requires-Dist: paddlepaddle (==3.0.0b1); extra == "paddleocr"
Requires-Dist: unstructured.paddleocr (==2.8.1.0); extra == "paddleocr"
Requires-Dist: onnx; extra == "pdf"
Requires-Dist: pdf2image; extra == "pdf"
Requires-Dist: pdfminer.six; extra == "pdf"
Requires-Dist: pikepdf; extra == "pdf"
Requires-Dist: pi-heif; extra == "pdf"
Requires-Dist: pypdf; extra == "pdf"
Requires-Dist: google-cloud-vision; extra == "pdf"
Requires-Dist: effdet; extra == "pdf"
Requires-Dist: unstructured-inference (==0.8.1); extra == "pdf"
Requires-Dist: unstructured.pytesseract (>=0.3.12); extra == "pdf"
Requires-Dist: python-pptx (>=1.0.1); extra == "ppt"
Requires-Dist: python-pptx (>=1.0.1); extra == "pptx"
Requires-Dist: pypandoc; extra == "rst"
Requires-Dist: pypandoc; extra == "rtf"
Requires-Dist: pandas; extra == "tsv"
Requires-Dist: openpyxl; extra == "xlsx"
Requires-Dist: pandas; extra == "xlsx"
Requires-Dist: xlrd; extra == "xlsx"
Requires-Dist: networkx; extra == "xlsx"
Provides-Extra: all-docs
Provides-Extra: csv
Provides-Extra: doc
Provides-Extra: docx
Provides-Extra: epub
Provides-Extra: huggingface
Provides-Extra: image
Provides-Extra: local-inference
Provides-Extra: md
Provides-Extra: odt
Provides-Extra: org
Provides-Extra: paddleocr
Provides-Extra: pdf
Provides-Extra: ppt
Provides-Extra: pptx
Provides-Extra: rst
Provides-Extra: rtf
Provides-Extra: tsv
Provides-Extra: xlsx
Description-Content-Type: text/markdown
[Description omitted; length: 18526 characters]
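
The METADATA file uses an RFC 822-style header layout, so the standard-library email parser can read it. The sketch below groups Requires-Dist entries by the extra that enables them. The excerpt is a shortened copy of the listing above, and `requirements_by_extra` is a hypothetical helper name. It only handles the simple `extra == "name"` marker form seen here; a full environment-marker parser (for example the third-party `packaging` library) would be the safer choice for arbitrary metadata.

```python
from email import message_from_string

# Shortened excerpt of the METADATA listing above (illustration only).
METADATA_EXCERPT = """\
Metadata-Version: 2.1
Name: unstructured
Version: 0.16.5
Requires-Python: >=3.9.0,<3.13
Requires-Dist: chardet
Requires-Dist: numpy (<2)
Requires-Dist: pandas; extra == "csv"
Requires-Dist: markdown; extra == "md"
Requires-Dist: openpyxl; extra == "xlsx"
Requires-Dist: pandas; extra == "xlsx"
Provides-Extra: csv
Provides-Extra: md
Provides-Extra: xlsx
"""


def requirements_by_extra(metadata_text):
    """Group Requires-Dist values by extra name (None = always installed)."""
    msg = message_from_string(metadata_text)
    grouped = {}
    for line in msg.get_all("Requires-Dist", []):
        # Split "name (spec); marker" into the requirement and its marker.
        requirement, _, marker = line.partition(";")
        marker = marker.strip()
        extra = None
        # Assumption: markers only ever look like: extra == "name"
        if marker.startswith("extra =="):
            extra = marker.split("==", 1)[1].strip().strip("\"'")
        grouped.setdefault(extra, []).append(requirement.strip())
    return grouped


grouped = requirements_by_extra(METADATA_EXCERPT)
for extra, requirements in grouped.items():
    label = extra if extra is not None else "(core)"
    print(f"{label}: {', '.join(requirements)}")
```

In practice an extra is selected at install time with bracket syntax, for example `pip install "unstructured[pdf]"`.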

WHEEL

Wheel-Version: 1.0
Generator: bdist_wheel (0.40.0)
Root-Is-Purelib: true
Tag: py3-none-any
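
The Tag value matches the last three dash-separated fields of the wheel filename (Python tag, ABI tag, platform tag). A minimal sketch of splitting the filename into those fields follows. `WheelName` and `parse_wheel_filename` are hypothetical names introduced for illustration, and the code assumes the filename follows the standard wheel naming convention with an optional build tag.

```python
from typing import NamedTuple, Optional


class WheelName(NamedTuple):
    distribution: str
    version: str
    build: Optional[str]
    python_tag: str
    abi_tag: str
    platform_tag: str


def parse_wheel_filename(filename: str) -> WheelName:
    """Split a wheel filename into its dash-separated components."""
    if not filename.endswith(".whl"):
        raise ValueError(f"not a wheel filename: {filename!r}")
    parts = filename[: -len(".whl")].split("-")
    if len(parts) == 5:
        # No build tag: name-version-python-abi-platform
        dist, version, py, abi, plat = parts
        build = None
    elif len(parts) == 6:
        # Optional build tag sits between the version and the Python tag.
        dist, version, build, py, abi, plat = parts
    else:
        raise ValueError(f"unexpected number of fields in {filename!r}")
    return WheelName(dist, version, build, py, abi, plat)


info = parse_wheel_filename("unstructured-0.16.5-py3-none-any.whl")
print(info)
```

Here `py3-none-any` indicates a pure-Python wheel with no ABI or platform restriction, which is consistent with `Root-Is-Purelib: true`.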

RECORD

Path Digest Size (bytes)
test_unstructured/__init__.py sha256=47DEQpj8HBSa-_TImW-5JCeuQeRkm5NMpJWZG3hSuFU 0
test_unstructured/test_utils.py sha256=v7ytk5UJccQ7OvaIOq8E6ioGSStcfG4HrPQD27xAQIU 10997
test_unstructured/unit_utils.py sha256=C4P89P666inGxQl_mUJpVZRctxLE0v6-izNsPAjIt-Y 8108
test_unstructured/chunking/__init__.py sha256=47DEQpj8HBSa-_TImW-5JCeuQeRkm5NMpJWZG3hSuFU 0
test_unstructured/chunking/test_base.py sha256=zsJ3yS-8bmOsorPjc7AuQ9JH_yfHKz8Y3Y6iKXO1Qfs 76516
test_unstructured/chunking/test_basic.py sha256=t666cn-KFystsEN7VlCDUoCqy82cu89P6H-MhmnLJOg 8332
test_unstructured/chunking/test_dispatch.py sha256=xHD5BTim8aTLmi7PH65mKvXmrJslAf6xYd-sKgd1fSo 3255
test_unstructured/chunking/test_title.py sha256=CfDXRt_Ggx-lwG27GlBuFHuI4_wq4lR-VP9X12hBsP0 17989
test_unstructured/cleaners/__init__.py sha256=47DEQpj8HBSa-_TImW-5JCeuQeRkm5NMpJWZG3hSuFU 0
test_unstructured/cleaners/test_core.py sha256=3FVidZQD-qK9lCsiKftkARghqCAYcelHbBD0BnLve8k 10357
test_unstructured/cleaners/test_extract.py sha256=VJCJ0kE212qeEg4eNEhaaVLFAhApWxn8Q5IAe45qWmI 4711
test_unstructured/cleaners/test_translate.py sha256=B6GItLUlhxAQMpbvI43HA-JOUa5M1eoCmWtukXpxraI 1673
test_unstructured/common/__init__.py sha256=47DEQpj8HBSa-_TImW-5JCeuQeRkm5NMpJWZG3hSuFU 0
test_unstructured/common/test_html_table.py sha256=fUcK9yFK_ArNBeDrfxKft8GAfsU9bqp3WEEdxdKTLPM 7579
test_unstructured/documents/__init__.py sha256=47DEQpj8HBSa-_TImW-5JCeuQeRkm5NMpJWZG3hSuFU 0
test_unstructured/documents/test_coordinates.py sha256=8kIj45xu8SSf6vt3LRxoraMpY2yjiVcPsXYFPkpz2hU 2795
test_unstructured/documents/test_elements.py sha256=4fkmdTjaLrAtzuEKFQwrV53dAcfKrASJjbS_yoQYDMQ 28660
test_unstructured/documents/test_ontology_to_unstructured_parsing.py sha256=NlDIs5wHiln2DzjjBmTGIESqXXDtdwOv0CzjbYsApp8 11046
test_unstructured/embed/__init__.py sha256=47DEQpj8HBSa-_TImW-5JCeuQeRkm5NMpJWZG3hSuFU 0
test_unstructured/embed/test_mixedbreadai.py sha256=3XzkygDeKXUAWiVEXKSdBi5ue8ilXqxZs9WxSgx9kek 1357
test_unstructured/embed/test_octoai.py sha256=ok4ZO_80zuQpI16mASLTlregGCiFwpJOWnsafNn4U80 861
test_unstructured/embed/test_openai.py sha256=1vsK1DuJu1krdes1uOBpzdFlnrHc5WbQ8CzzsJFh6H0 861
test_unstructured/embed/test_vertexai.py sha256=uZ5aCGZgJjlx_SD1jKozukt7dWXm_hNXyiUuK2gPku0 876
test_unstructured/embed/test_voyageai.py sha256=DX-sDjRn7PZiwZFjOxnrT8iAzdHbszTwm6xXuHVp7Qc 914
test_unstructured/file_utils/__init__.py sha256=47DEQpj8HBSa-_TImW-5JCeuQeRkm5NMpJWZG3hSuFU 0
test_unstructured/file_utils/test_file_conversion.py sha256=dtZ0Q4uMk46rbO25ofVQrDKwJCi8N8QmMH6jJidXSL4 1942
test_unstructured/file_utils/test_filetype.py sha256=5m4uMBCxSQM20Cj-4Vrd-ndUTTholb4oInlZiEX-77M 48569
test_unstructured/file_utils/test_model.py sha256=X0hZxNXej7vhYPs_6yOUotJtftAlL8u8l8_tyCgTJlo 8293
test_unstructured/metrics/__init__.py sha256=47DEQpj8HBSa-_TImW-5JCeuQeRkm5NMpJWZG3hSuFU 0
test_unstructured/metrics/test_element_type.py sha256=wJawqBX2US7POe4iHo7SrJt5qMSAz7bJMgZKTwEZnOc 3360
test_unstructured/metrics/test_evaluate.py sha256=zNohfGaorwcJRfvoeiBfK8r5aQqJbT3Oafgc4Mts3LM 17386
test_unstructured/metrics/test_table_alignment.py sha256=li4P_NLr5OaWCDR2adGadi_iycc_uzi0U5W5SbwVCAA 554
test_unstructured/metrics/test_table_detection_metrics.py sha256=j4F9UdrRuSqp144P-Bxt03sANjQ3ovlalEIc8QYzopc 1555
test_unstructured/metrics/test_table_formats.py sha256=esS-Ri8FQ9_nRCs0HVHrR2bkYevT3H_zML0p_BbmLn8 1357
test_unstructured/metrics/test_table_structure.py sha256=9iMPU-HJ3ZwTZ2e7MljVkI0HAHqLKCFFNmkef1E4VKA 19704
test_unstructured/metrics/test_text_extraction.py sha256=LeYjTq18OxNlgROhKcpHXbydUTa3dl5FRaIdrQhd_3M 24036
test_unstructured/metrics/test_utils.py sha256=PxMmFRjoHbJ2W-8YDBjJamLQb5a9n8JtQM6nsSmWdgU 925
test_unstructured/nlp/__init__.py sha256=47DEQpj8HBSa-_TImW-5JCeuQeRkm5NMpJWZG3hSuFU 0
test_unstructured/nlp/mock_nltk.py sha256=PsoZesQcrTP4Gxkx6_1CAI8TuYgVrLF2bDPP-i_nR6A 566
test_unstructured/nlp/test_partition.py sha256=qz883Zaw3nFKv2fDVMng2TsY9FxY7ujvkYw3_RDTsUM 15
test_unstructured/nlp/test_tokenize.py sha256=3h21ilc1llyM_DnoYFa3uzmShdMxE3Otb-lNLmlRTLM 2681
test_unstructured/partition/__init__.py sha256=47DEQpj8HBSa-_TImW-5JCeuQeRkm5NMpJWZG3hSuFU 0
test_unstructured/partition/test_api.py sha256=84qSQVpamNLEyMsNpvaPVRpDLx18po6X02l7xipoems 23942
test_unstructured/partition/test_auto.py sha256=-eFsdOeyZ_yEmKGJsH_CbUTSikqEjV2dGPkHx-ZjbWs 48585
test_unstructured/partition/test_constants.py sha256=3T8UDe-gwJb2yEtyD43bVuOFpby8uqpPdXyBozOg0OU 3515
test_unstructured/partition/test_csv.py sha256=1FvujpyyJoN118olOdJNcUu7-DBoqXp-DMPS3efT8w0 12319
test_unstructured/partition/test_doc.py sha256=NHtaMtdHwEnbwI68fcxWN9LK6R81lEIx19N_zKKHz-0 9935
test_unstructured/partition/test_docx.py sha256=wlXE1h8CjTHCvQUs8MDztdmpygMp79YEkQvNIgrxkp0 48624
test_unstructured/partition/test_email.py sha256=9mPH7oJKMsLj2hMgNyMXGAgzc4fgUrs6iA2AlCis9JI 24639
test_unstructured/partition/test_epub.py sha256=oVm1n5bqoJU6xen9tLK-AWV_JtzMv-BcvjfBBdFunQE 6505
test_unstructured/partition/test_json.py sha256=GM2hyN4ZAWKr-yedL14Yk7yk5_OqgvbOno0eNTur6lQ 10995
test_unstructured/partition/test_md.py sha256=_pvZWB2B86x4WwxhGnwlI-CYH4yHNz99D9LZ2BfRV8c 8500
test_unstructured/partition/test_msg.py sha256=nSyv-kt6wa3ZMSaf583-xt5NROadqYiUO6cTwCuzsd4 16999
test_unstructured/partition/test_odt.py sha256=zPiTBSWx42NmI-lBE0JJbG59GqP6jC7sRErn4E3_Llw 7764
test_unstructured/partition/test_org.py sha256=WsFMdTbPk0lUrjTkMRcR4Q6mvp9_46b27rZEKM0c3vs 5053
test_unstructured/partition/test_ppt.py sha256=AszuiZi3HESwZaE1PTGYgEQd5DUi4VI_KbHJqvV3Ig4 6917
test_unstructured/partition/test_pptx.py sha256=TPxAcEIpq4tGgIFHMVuOCH59Iwx46fcfg_5wXsl1Z50 30163
test_unstructured/partition/test_rst.py sha256=koXSDk-RkrQZ8tiKVrp_KpRcK6DBVb6lVGBZxrbPlGY 4410
test_unstructured/partition/test_rtf.py sha256=f9Nw6Tj7lnzF0DB3H7E6l0njOgrO9uGjpCO3zYmsy9Y 4591
test_unstructured/partition/test_strategies.py sha256=uNmDKQwOWgXGqKh0kX63XI_444CuC7nowAWOiv0j7Sw 4344
test_unstructured/partition/test_text.py sha256=f_VODf0KXD_KcPzfOdzEFyLVBzMm61YpU0YqCmq4gbY 14834
test_unstructured/partition/test_text_type.py sha256=Q_cdaDlzlEgw2FZ0dm9LSGQOJohFPR7_U4DD9K3Xv5A 12603
test_unstructured/partition/test_tsv.py sha256=h_3WZ9IpqsAPMRPGKQCaPnL7q_XxdklyUNkCV6JnueY 5995
test_unstructured/partition/test_xlsx.py sha256=UtfwdypW4yD7No8KXCzfNikPvUi6S4kQmVvq7cl88Kc 23856
test_unstructured/partition/test_xml.py sha256=Udvqw3Zn_tpzTZ_B3ImHeckkhqbXPLB04IS4fw0HbB8 8745
test_unstructured/partition/common/__init__.py sha256=47DEQpj8HBSa-_TImW-5JCeuQeRkm5NMpJWZG3hSuFU 0
test_unstructured/partition/common/test_common.py sha256=6rUEBW5kafEn4_MF_Odx0-chDapv-EomTSHi6FpnH4k 13653
test_unstructured/partition/common/test_lang.py sha256=Mw_GPafrlQRo-64vkZSj7t1l7fi5dhBeV0zx43WjZxY 8981
test_unstructured/partition/common/test_metadata.py sha256=uCykd6sldq7xrEd0jtqj7tNjEu6I8s3EohvzqkbHJto 20449
test_unstructured/partition/html/__init__.py sha256=47DEQpj8HBSa-_TImW-5JCeuQeRkm5NMpJWZG3hSuFU 0
test_unstructured/partition/html/test_html_to_ontology_parsing.py sha256=FGLJ-9rq1kwhz65ww0Xr9jJ32_OdkuuL1smex2H_wew 17688
test_unstructured/partition/html/test_html_to_unstructured_and_back_parsing.py sha256=r0LtcDMvHD7DX-gFBtUmYjE7RZmPkcf_6uzHy2EI-iQ 17851
test_unstructured/partition/html/test_html_utils.py sha256=ltlllMtcDlY4QFhhttue_eZgOafFYuIayYJ8gc_0nac 1069
test_unstructured/partition/html/test_parser.py sha256=hb3DxvmWutsSTJy2M-keJFAAmLkskXuKLvV7nkjmNE0 56668
test_unstructured/partition/html/test_partition.py sha256=saVVMNMlRpEKkedaCOJuNeZHxCMb3mSd4hTxGwrpF0E 49653
test_unstructured/partition/html/test_unstructured_elements_to_ontology_parsing.py sha256=URnlE8eoXYXxu3rogl4oj6u-Np4qKaJHhvUc8Pco8zc 1103
test_unstructured/partition/pdf_image/__init__.py sha256=47DEQpj8HBSa-_TImW-5JCeuQeRkm5NMpJWZG3hSuFU 0
test_unstructured/partition/pdf_image/conftest.py sha256=ejimRq_95Bb3Hfm6Yjyzu5tudx7ieidQ1gFXSUEpSMI 2293
test_unstructured/partition/pdf_image/test_analysis.py sha256=T-NZKcfZLYM-pTZfUeWlYHfXTKjnv7b3_v5Fnslg3V0 5147
test_unstructured/partition/pdf_image/test_image.py sha256=PGFK2mKzLB8AhUiCgLxbqBKJ3bG7k9tf1KHfVoNtFeo 22979
test_unstructured/partition/pdf_image/test_inference_utils.py sha256=0yTjM2GDLLHk_6o6KCJFA_NeZSVgnjEXXZa60cMmPuo 1329
test_unstructured/partition/pdf_image/test_ocr.py sha256=8WTx8epuUVyPv2MDV7tn6Qv-lIsfj6N_NktLEZfZ8DM 15182
test_unstructured/partition/pdf_image/test_pdf.py sha256=vg6Hkk5Y635EedFnj3WebKQHYGb25EGRjQKte0_n_oA 52268
test_unstructured/partition/pdf_image/test_pdf_image_utils.py sha256=1vBaWZMQoQ8AOB9ycCOooOdQonBlsriI69T1Oqt2WJo 12674
test_unstructured/partition/pdf_image/test_pdfminer_processing.py sha256=GqnQoerAojoPybYxCh4bf3Zg8qF6E49l2m_i6-_rlhs 7518
test_unstructured/partition/pdf_image/test_pdfminer_utils.py sha256=vkedoo8pYVaEJrj_TKJHxDbyfUVkkFPfoiR4HjQJnt8 1104
test_unstructured/partition/utils/__init__.py sha256=47DEQpj8HBSa-_TImW-5JCeuQeRkm5NMpJWZG3hSuFU 0
test_unstructured/partition/utils/test_config.py sha256=-L866HOHHfWN6rF85FxkETfbfpGOEcg23DbmcAX2Mes 2001
test_unstructured/partition/utils/test_sorting.py sha256=ZcnJgusQT81Xsz8p8GxB3s2EdNQUXVeb7Zk_TTOSEns 4190
test_unstructured/partition/utils/test_xycut.py sha256=I-VaPlTnxuPp8G0FC8iqUYGm0Z9UxtQ9ddtZh7q5HYo 6211
test_unstructured/staging/__init__.py sha256=47DEQpj8HBSa-_TImW-5JCeuQeRkm5NMpJWZG3hSuFU 0
test_unstructured/staging/test_base.py sha256=i8Na8NKFFKiEMjn-sIYAOlyzbUWD9oa8saX50BaQjQY 19976
test_unstructured/staging/test_baseplate.py sha256=ACJ_OtLK_64e3xn3hS1jYZYApxaenETclphCzGyAFTc 2800
test_unstructured/staging/test_datasaur.py sha256=jxn8jopADs1J7jRL-teWWtL0fD17wma0PAaAqoeDCkU 2176
test_unstructured/staging/test_huggingface.py sha256=00MvpucyTEBFJebAVPxGlfOX4T3zl1esphH4W6JL-s8 2356
test_unstructured/staging/test_label_box.py sha256=Pjbe5cPBWX2tlFk1YX1U29ujX-8slDYehQLd_1VC7K4 4335
test_unstructured/staging/test_label_studio.py sha256=Hc52vCxpSQDrYxeIEV1EoTKRumSCir-BO4kgjJbfntc 12671
test_unstructured/staging/test_prodigy.py sha256=MTFItzYNSnLNZ-ekMoNlGsfT4v9Hh8rFNFNwhtMvU-k 4020
test_unstructured/staging/test_weaviate.py sha256=Mb0t_GrIzPjuTzYD3xdWMCu_PMCX0AfdRwn36hI6sdw 2214
unstructured/__init__.py sha256=SvwSYurR6AKi7Zp-JY0ZnR9D1QkIqtHM4FEdCgdAolM 77
unstructured/__version__.py sha256=2zDO4C_9bZ_R0QcydAySB9B8tL_9aRvNjuysuIQnStE 43
unstructured/errors.py sha256=os377OEQPhV5uyO0TpFH918R-lfGgbPlL_7T3Ubj0xM 503
unstructured/logger.py sha256=aD9qsYFQBbyPSiuTfosXphv1k5EGcRnX7dAGB6sgb-g 686
unstructured/py.typed sha256=z3PGyU9Bs9Gq1-s8CjEJ8Y4Aev2MwVgsaVDwglLkTZw 118
unstructured/utils.py sha256=nmfIQtCindvbRfpqgCWWwwHfpJAnQwjovzqrPCdoNGs 27866
unstructured/chunking/__init__.py sha256=jvlh7MH_R3-v_5-ynDXcksd68w3ZejZcBbv5iJhLpOg 590
unstructured/chunking/base.py sha256=PAv5zK0_A-RtimFRJzrkdlc_-YdY_NRItYN3DPck4ac 56186
unstructured/chunking/basic.py sha256=nIl41mrfV8pAxOEehwI20bKc-dwwNpBfRf8VWpKlw08 4249
unstructured/chunking/dispatch.py sha256=hvZyn_K1vWFB2qIe6ndelDTr6YE7ZTUQSjidwzQmaew 5189
unstructured/chunking/title.py sha256=dkI0jzGUTxOafH0LnvvTxqDuK3K8KaM-9ttQ94B4wTY 7713
unstructured/cleaners/__init__.py sha256=47DEQpj8HBSa-_TImW-5JCeuQeRkm5NMpJWZG3hSuFU 0
unstructured/cleaners/core.py sha256=eHu58csaDtS2Xuhm163Mzt29lROpRTWzHsmmLFLGrI0 14646
unstructured/cleaners/extract.py sha256=BbBYANbWz1BSYWaip2kAQ6GN86nVVr6HiyO5FRqjEHc 4339
unstructured/cleaners/translate.py sha256=omtxO1D2d4b5cr6aOhYHJH66oZ2BPFp_Gnp2lae0BdY 3255
unstructured/common/__init__.py sha256=47DEQpj8HBSa-_TImW-5JCeuQeRkm5NMpJWZG3hSuFU 0
unstructured/common/html_table.py sha256=ZfJyXiE2Q2GqhDO7KQtp3UmLwOYx8uSm1ST_6eryD3s 5652
unstructured/documents/__init__.py sha256=47DEQpj8HBSa-_TImW-5JCeuQeRkm5NMpJWZG3hSuFU 0
unstructured/documents/coordinates.py sha256=LoHrK13Py3TkQL__Vobug0KmF1MYwMAXP81AFinuPJw 3937
unstructured/documents/elements.py sha256=vXy9wBSyMYPgM18p1dMCLMzUvL1dXRzQxJb7LIYPB-s 38249
unstructured/documents/mappings.py sha256=c_wJW4DNXYXEPp9IYPfpOW57L_PpvR18VxXOsshjE38 5615
unstructured/documents/ontology.py sha256=Lc8HrgPe62hG6IYQEuLbQIqW-DlPSNzbJ7NPo7KJvGA 23316
unstructured/embed/__init__.py sha256=PH-3X-7qj6NPLXND_yFCfqGoLVfQ2tYr9umZtnlSEjw 1056
unstructured/embed/bedrock.py sha256=p8Pgm8PEYq_Z2c1f-D-LidtAyzxJ7uFzsWY52WcO-gM 2602
unstructured/embed/huggingface.py sha256=GYTuTUwaZbfu6WFc5Nu3xLDXZxDdGXeZuPEfUIxe0C8 2465
unstructured/embed/interfaces.py sha256=I5TDdbVC3x_yXXfBqSp1VOtVRBrLXwKbf4SgfTVhl2w 979
unstructured/embed/mixedbreadai.py sha256=T2vibcjwIZExe64ko_zbY1t1izwuV6XvxkybLyHxbCU 5471
unstructured/embed/octoai.py sha256=aFlgLhrTw-nrP_2gINO3AilJDDNiqCXObjtufjl0Yyw 2376
unstructured/embed/openai.py sha256=cx_wBCENyBuQVJMG43xSTSXLSXm6_YH4YljeEWNSEI0 2284
unstructured/embed/vertexai.py sha256=0PSdFuW_-5bR4UXrvzYBER7awTBqZtW8BVIBmaTXlW0 2821
unstructured/embed/voyageai.py sha256=fQnwVbB4WNN3C3MJPR6jNGlQAZYhxRslsyqpA-lEdrs 2447
unstructured/file_utils/__init__.py sha256=47DEQpj8HBSa-_TImW-5JCeuQeRkm5NMpJWZG3hSuFU 0
unstructured/file_utils/encoding.py sha256=A1hRtcp-bZR6NpkCTAeHZE7b9o9NSTPyGCiwXasarH0 4420
unstructured/file_utils/file_conversion.py sha256=CzlCWiKjeq3NYDJMcWdE8HR7ixq2nwG6zVZy__2LqV0 2649
unstructured/file_utils/filetype.py sha256=J6mS171xlLIvW1-TCmc7LS7WKYwV-54iXa2V9bkl5Fs 28602
unstructured/file_utils/google_filetype.py sha256=YVspEkiiBrRUSGVeVbsavvLvTmizdy2e6TsjigXTSRU 468
unstructured/file_utils/model.py sha256=OMBdh1-Y0EpRxKXFdnWWCUVh335yCWdbQ-vzZvr8wpc 14478
unstructured/metrics/__init__.py sha256=47DEQpj8HBSa-_TImW-5JCeuQeRkm5NMpJWZG3hSuFU 0
unstructured/metrics/element_type.py sha256=OQuuB5Z6Xej2acJFowhTKSEJN0glyx69OCkYKocy8yg 3666
unstructured/metrics/evaluate.py sha256=dTJEJIEnoy2xZxExEXEsEqaZ-X47b8u8c3Lg7_nA8Os 32034
unstructured/metrics/object_detection.py sha256=t988hG16em75Rh0OXqo-3pLoEOPjmnVVt-3yYbOtUAE 31067
unstructured/metrics/table_structure.py sha256=uyWihfHyESx7aTr2A0YYqr7e5JkqEDVfrsSyI_pHcp8 1859
unstructured/metrics/text_extraction.py sha256=5YSE1br4G4efnC8uBnaF1sMqphTIq9xXGpv4Xlz5iPE 6064
unstructured/metrics/utils.py sha256=TF_o-kZQ4NhZXn1JpC0l9a_ijE2SdzrpfTLn62UxW5A 8117
unstructured/metrics/table/__init__.py sha256=47DEQpj8HBSa-_TImW-5JCeuQeRkm5NMpJWZG3hSuFU 0
unstructured/metrics/table/table_alignment.py sha256=tyUG9XedxVppQydISoYfzlWlBsLHTB8yU38yko6nkts 7734
unstructured/metrics/table/table_eval.py sha256=VFr7rYR2rls348n99EkLRUOCtceKtdBMFI8f0v7KG3I 12252
unstructured/metrics/table/table_extraction.py sha256=gH-vSBt2NxjB17TEA78LIRo_E0JMmIODE2MfQzjgbcY 9721
unstructured/metrics/table/table_formats.py sha256=Jqrb-26zRtUVvwOyXw-CWLyV6vKrgFHVpZKD7TFzFjk 1383
unstructured/models/__init__.py sha256=47DEQpj8HBSa-_TImW-5JCeuQeRkm5NMpJWZG3hSuFU 0
unstructured/nlp/__init__.py sha256=47DEQpj8HBSa-_TImW-5JCeuQeRkm5NMpJWZG3hSuFU 0
unstructured/nlp/english-words.txt sha256=8fpk2f3iMm87qMppZMFAt1eWJcqOMALtt1YHE-fm7bY 4472047
unstructured/nlp/english_words.py sha256=Ng2ozKrwF0Pw-qblYtBxxFOW9hT0eVL5uLqEgf0BHsw 701
unstructured/nlp/partition.py sha256=8bTfn7O4Plk6FJ3-TmuTnxgjdZDnvExSGQpRYrddNJE 210
unstructured/nlp/patterns.py sha256=FCrRY2XXxXR0ijg05ZPuFuvJfBy2AjXds0TEbrU8KAg 5613
unstructured/nlp/tokenize.py sha256=356-VoJk8Hu_en2O0K7qgmghjqe__l9dIw15RoyuPcY 5809
unstructured/partition/__init__.py sha256=47DEQpj8HBSa-_TImW-5JCeuQeRkm5NMpJWZG3hSuFU 0
unstructured/partition/api.py sha256=2tQEDwCjtu51-nY_IFKnm490ApLxs7w6Nn-35VsYJqw 13767
unstructured/partition/auto.py sha256=sHMFJmvg6Mw_2AwXXLZazRsXiDYDCGUmwIy6CZ9PUng 22772
unstructured/partition/csv.py sha256=5stmawxSocVkVVTuo7hedebNO8u6JgaoW4v7cRy470k 6029
unstructured/partition/doc.py sha256=ltrI9GISr4JNxktgVRpfMpyH55r1XK801P2g52EgRBA 4584
unstructured/partition/docx.py sha256=gpjmBNpqxtvJYyAGmzrpvBsHSnSGDxHHwFahKgQkxDI 44852
unstructured/partition/email.py sha256=y7rfaM9naMBtZ3uRpfkz9PIODNfLri0uQrhfy0MpQzQ 16881
unstructured/partition/epub.py sha256=puxuPQaN7pArGDzXQsJfoi-AmvgqmGFdGNm183VNmcg 2242
unstructured/partition/image.py sha256=qZ7TDpIgAjjzdZlrf35kzS0Y0SqXG2iQPGoEJVI_Ffw 5539
unstructured/partition/json.py sha256=DUuYBrfDp5YkKtevNHunoITeC6M1of_G2La12dxuuI8 2805
unstructured/partition/md.py sha256=ZcgF_jJ5wsrH91seNoBdj-EsiUgZJ7PytsA0Qwi_e5Q 2555
unstructured/partition/model_init.py sha256=HdbUAn2jyMuJQG_s_vJT7polwh3epUx-H6SILETEhhA 586
unstructured/partition/msg.py sha256=jVzxqj0G5LPKaUpnsQIHFlLcnhd1dLT1eEb8QxTkFlM 11475
unstructured/partition/odt.py sha256=lT_w26bC0Cru7_l6Tzl_fxfzi_dm4nIN7bFA1gKffHY 4571
unstructured/partition/org.py sha256=_jLGn186c960cEMugwHD3eeBsOn3QBZNK_-o6uhZKVI 1643
unstructured/partition/pdf.py sha256=duYSmAAXyu8npWXrNWBeWUfFJmDI4p0_tyC8rQG9gBM 45580
unstructured/partition/ppt.py sha256=YFj60k9OukrUlezMFzRHzmF7q9XYPRb8z2HcLAwqg-0 2659
unstructured/partition/pptx.py sha256=pJZGMjaZf2TriA7V7yYytt4s9r8iFDfTslCA_EKlcQw 21616
unstructured/partition/rst.py sha256=vLdbmHEGKUy5fs7U9LuWIDsEEAYm7rZ9bhHZ20N1t8c 1665
unstructured/partition/rtf.py sha256=AQnHkOftBu5mj3wQSXTNJpR3tz4d_SCXBR7MF728teI 1665
unstructured/partition/strategies.py sha256=rvSaAxzJqFxnmAkQPwMQI42d4vv0Iz7tqoDD2wz9y5s 4303
unstructured/partition/text.py sha256=m1-BsxShkuy3W9BSz8v4_xzfASWUbovCMD_R5MSGqso 6867
unstructured/partition/text_type.py sha256=iXOjumjIs754bPa6b9ddbNxUQUtinp6-LhGk3PpFsoE 11584
unstructured/partition/tsv.py sha256=Ye6OYCW-IrvlEwiEtlf-q_Yf_1H7_jOlhxT4DHg6uwk 2050
unstructured/partition/xlsx.py sha256=bG_J7uaprhUX3nowf0s2R42AAKXbuCKWhImyEoiYAa8 17333
unstructured/partition/xml.py sha256=32BIGMfU3cFtFkEv8s-2MCsKSGC8JvBUUg2RrbBBDmI 4416
unstructured/partition/common/__init__.py sha256=s6_gBQedBRh0pGtrLb8qdlPdONTrq08GBWxFaGY4qpo 276
unstructured/partition/common/common.py sha256=1vcSJ0tVede3x48ZEZIkQX7TD02e5vgxBqfYW5kRnzI 15452
unstructured/partition/common/lang.py sha256=mdWHZUIddiWbqHsTpwzlHdQAYG8grNcZQMX0qXtFLMM 16687
unstructured/partition/common/metadata.py sha256=WEWbeO7B066soq6Jhai_bygLfCmmCzO8LcfoIa6Abn4 11497
unstructured/partition/html/__init__.py sha256=uFCKUengmT1m-Q_GkkZogEhv3MF7_rA9ahM0EF0S5dY 95
unstructured/partition/html/html_utils.py sha256=AZm8KaPu5DCzuXZFqy3IVeTIk0lZ0-LJL8fSaT0DFRU 1064
unstructured/partition/html/parser.py sha256=bixiPXyS9NzgPBav8hGEnbD3qYdxDsjJmsa4SSofcvw 41146
unstructured/partition/html/partition.py sha256=7g-yRem_0AK9TfQavelvAiQAZ7L5l0YRMWzXlFC8Tkk 9069
unstructured/partition/html/transformations.py sha256=QTpDvhFDxDCn8b7nrDJL90jI9ZPG-hcRRc3g8dY2IOQ 16573
unstructured/partition/pdf_image/__init__.py sha256=47DEQpj8HBSa-_TImW-5JCeuQeRkm5NMpJWZG3hSuFU 0
unstructured/partition/pdf_image/form_extraction.py sha256=8yDrbMZEZbt6AaKuZvLl3YiRzp2rgiu3QN2a55CJlIo 369
unstructured/partition/pdf_image/inference_utils.py sha256=hulZkKcXVTgf7VFWwzzIAVyzYHKowPBFz7mDqFRk-_I 3067
unstructured/partition/pdf_image/ocr.py sha256=XyN6DsLvvul2amdHn_-nxGwfpR1mDAuuzWfzTgkfqBo 18044
unstructured/partition/pdf_image/pdf_image_utils.py sha256=vx78vKRFHHFCXu4bvnZi1ltoQwaxBQR8eU80lJ_occk 15488
unstructured/partition/pdf_image/pdfminer_processing.py sha256=yxECIam6j18dPCJUqkwEsV0zJ5teR0kDF0E-EnDDBMs 25783
unstructured/partition/pdf_image/pdfminer_utils.py sha256=qEkdnhxYLFQSJx9vdNfeaCNLPuFW0a2DoBRzal-Dcqc 4461
unstructured/partition/pdf_image/pypdf_utils.py sha256=tE14XrOLRRNbhFwfdUXD9kLrD9BbSX3ipr7ggiB3AeU 409
unstructured/partition/pdf_image/analysis/__init__.py sha256=47DEQpj8HBSa-_TImW-5JCeuQeRkm5NMpJWZG3hSuFU 0
unstructured/partition/pdf_image/analysis/bbox_visualisation.py sha256=S16RfvY04tNVkRTnb4P0PZNcoE0qghUrY6LtV4i2088 24648
unstructured/partition/pdf_image/analysis/layout_dump.py sha256=ugMFTmJ_HSZXVYWG3LZpPJHd3tuowcE6OJktVCigzZ4 6778
unstructured/partition/pdf_image/analysis/processor.py sha256=iQErLaNaLMqGggzPoRQxO9w2YVOnGsCGziRr3qsCKO4 434
unstructured/partition/pdf_image/analysis/tools.py sha256=2Ue4pDdsiz7CXi1A-AQzxkpPQImqJE8Ag0YBR4WWmO4 7236
unstructured/partition/utils/__init__.py sha256=47DEQpj8HBSa-_TImW-5JCeuQeRkm5NMpJWZG3hSuFU 0
unstructured/partition/utils/config.py sha256=9NTP5edhyoedbbpXe5ikYtyD6ug5fsW4hiEuy1QwHCQ 8679
unstructured/partition/utils/constants.py sha256=L-B0ZSMLyBtaj-AI5clyr8yOybtT2fHV2EQifnp_0e8 5666
unstructured/partition/utils/sorting.py sha256=JvLF_CIpAzGlcxCCiDKBNCvMiuN2HjeoM6O7fgbXTUI 8688
unstructured/partition/utils/xycut.py sha256=K_4PaKNc7Vs3iL-PQ9FaP-r3gU2WahulOpaRYpP3rq0 10202
unstructured/partition/utils/ocr_models/__init__.py sha256=47DEQpj8HBSa-_TImW-5JCeuQeRkm5NMpJWZG3hSuFU 0
unstructured/partition/utils/ocr_models/google_vision_ocr.py sha256=rWr3zfu3xhro8beCAXIDH4QLOx58LY-7FU5JA4bkxL4 4785
unstructured/partition/utils/ocr_models/ocr_interface.py sha256=bJ3FvrO7IH811XYMQ6-SBpZ6rVgDtOLSiL9DOoqDGHk 3474
unstructured/partition/utils/ocr_models/paddle_ocr.py sha256=S-grqfUQdtqaFpd2o6A_WUBcz6TRRPxzhOBU3UyQcLM 5599
unstructured/partition/utils/ocr_models/tesseract_ocr.py sha256=9gz-2j3yNVy2qjpO4Qsq5-VmEMQnEnnTdOoM9BDajMk 7077
unstructured/patches/__init__.py sha256=47DEQpj8HBSa-_TImW-5JCeuQeRkm5NMpJWZG3hSuFU 0
unstructured/patches/pdfminer.py sha256=yHoCQnnbtoWzymnT25PgXKDsJI9EpSzpCo9sZtJpJrQ 743
unstructured/staging/__init__.py sha256=47DEQpj8HBSa-_TImW-5JCeuQeRkm5NMpJWZG3hSuFU 0
unstructured/staging/argilla.py sha256=DcG9QNYLTP8nN124OUcBB1lisFxQ-ruD4e_zJXEY97M 2292
unstructured/staging/base.py sha256=MYv89PiQQ8FMfp_W1gj1R8i-5zCAqx0EuKI95VJCrI4 19052
unstructured/staging/baseplate.py sha256=sTQ7umr6PlzjxrRwB2XDlFJGVUY1rv4VZ4z1UQ7q59g 1755
unstructured/staging/datasaur.py sha256=7kG_XjY5YA0w1aPxR-PUSAllXNXKZdMop02Uu41wHc4 1417
unstructured/staging/huggingface.py sha256=Nsej3wBydQDVvGNGPmsRZbeYOaMea8kRF949EY-kNxA 3838
unstructured/staging/label_box.py sha256=uOaPT-3FtP_TkOSIjM2pUCZBxK8YgiDEwPblc1I8sSA 3855
unstructured/staging/label_studio.py sha256=1w8wbuuuFOZ6MHydmv8gaNFRn9cODBvt0nWE_KAl5YY 4910
unstructured/staging/prodigy.py sha256=wPMwatJ2lWr2_0qvlkv3MV55mkovHPz-ItUY0WcKxqw 3130
unstructured/staging/weaviate.py sha256=hsl9OQ8Nwsx5GNrPI_-PQpNUX7lDCKBD0xImTMzKuHs 2607
unstructured-0.16.5.dist-info/LICENSE.md sha256=SxkKP_62uIAKb9mb1eH7FH4Kn2aYT09fgjKpJt5PyTk 11360
unstructured-0.16.5.dist-info/METADATA sha256=E0BXXvKkVCuJkelKPy6-yAbJtH8Bq8KN273x3UeCsig 24561
unstructured-0.16.5.dist-info/WHEEL sha256=pkctZYzUS4AYVn6dJ-7367OJZivF2e8RA9b_ZBjif18 92
unstructured-0.16.5.dist-info/top_level.txt sha256=IVbYkzQJXExO4_PhBGUf5dc7OZZ75t9XYrjKn3KvodA 31
unstructured-0.16.5.dist-info/RECORD
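
Each digest above is the SHA-256 hash of the file, encoded as URL-safe base64 with the trailing `=` padding removed. The sketch below recomputes a digest in that form and splits one row of this listing into its fields. `record_digest` and `parse_listing_row` are hypothetical helper names. Note that the row parser assumes the space-separated layout shown on this page; the RECORD file inside the wheel itself is comma-separated.

```python
import base64
import hashlib


def record_digest(data: bytes) -> str:
    """Return a RECORD-style digest: sha256=<urlsafe base64, no padding>."""
    raw = hashlib.sha256(data).digest()
    encoded = base64.urlsafe_b64encode(raw).rstrip(b"=").decode("ascii")
    return "sha256=" + encoded


def parse_listing_row(row: str):
    """Split 'path digest size' as shown in this listing.

    Rows without a digest (such as the RECORD entry itself) yield
    (path, None, None).
    """
    fields = row.rsplit(" ", 2)
    if len(fields) != 3:
        return row, None, None
    path, digest, size = fields
    return path, digest, int(size)


# Every zero-byte __init__.py in the listing shares the empty-input digest.
path, digest, size = parse_listing_row(
    "unstructured/cleaners/__init__.py "
    "sha256=47DEQpj8HBSa-_TImW-5JCeuQeRkm5NMpJWZG3hSuFU 0"
)
matches = digest == record_digest(b"")
print(path, size, matches)
```

To check an installed file, the same function can be applied to its bytes, for example `record_digest(open(path, "rb").read())`, and compared with the listed value.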

top_level.txt

test_unstructured
unstructured