document-classification

View on PyPIReverse Dependencies (0)

0.0.2a0 document_classification-0.0.2a0-py3-none-any.whl

Wheel Details

Project: document-classification
Version: 0.0.2a0
Filename: document_classification-0.0.2a0-py3-none-any.whl
Download: [link]
Size: 65888
MD5: ce2e1634e0067c13ecc0e6ba769862f8
SHA256: 1bfdac8901cd5ca0fabc2a53444d87874512a8ac79c6f9fc7375b7ba87c6554c
Uploaded: 2024-12-07 17:23:47 +0000

dist-info

METADATA

Metadata-Version: 2.1
Name: document-classification
Version: 0.0.2a0
Summary: Awesome document classifcation - Implementation of major techniques
Author: Amit Timalsina
Author-Email: amittimalsina14[at]gmail.com
License: MIT
Classifier: License :: OSI Approved :: MIT License
Classifier: Programming Language :: Python :: 3
Classifier: Programming Language :: Python :: 3.10
Classifier: Programming Language :: Python :: 3.11
Classifier: Programming Language :: Python :: 3.12
Classifier: Programming Language :: Python :: 3.13
Requires-Python: >=3.10,<4.0
Requires-Dist: albumentations (<2.0.0,>=1.4.18)
Requires-Dist: fasttext (<0.10.0,>=0.9.3)
Requires-Dist: google (<4.0.0,>=3.0.0)
Requires-Dist: google-cloud-vision (<4.0.0,>=3.7.4)
Requires-Dist: instructor (<2.0.0,>=1.6.3)
Requires-Dist: langsmith (<0.2.0,>=0.1.139)
Requires-Dist: loguru (<0.8.0,>=0.7.2)
Requires-Dist: paddleocr (<3.0.0,>=2.9.0)
Requires-Dist: paddlepaddle (<3.0.0,>=2.6.2)
Requires-Dist: pandas (<3.0.0,>=2.2.3)
Requires-Dist: pydantic (<3.0.0,>=2.7.0)
Requires-Dist: torch (<3.0.0,>=2.5.1)
Description-Content-Type: text/markdown
[Description omitted; length: 1795 characters]

WHEEL

Wheel-Version: 1.0
Generator: poetry-core 1.9.1
Root-Is-Purelib: true
Tag: py3-none-any

RECORD

Path Digest Size
document_classification/__init__.py sha256=dqbOMxVDpjBMW90PJioChN1qauHXgzJw6ZPU85nMJ28 70
document_classification/common/__init__.py sha256=GiKo-uY3kTvPHJvZO2zN-iHy7ilBqZ64TkUMtspcGPE 48
document_classification/common/exceptions/__init__.py sha256=02Hzoew45EAdwweIUOXTtLFDB10hKkUpSRtO_ry3YlI 1323
document_classification/common/exceptions/base_error.py sha256=hEBjGhTHM_G6P2nxCgF9IoRsO1mKk4HuH2jLBbHlvsQ 961
document_classification/common/exceptions/bounding_box_error.py sha256=AooA5uvck9JXMsowfGwfdrwL717koNvYQqiifPAk9VA 213
document_classification/common/parsers/__init__.py sha256=nCq4nYxxG27PnNhVV8WswLgNja7gxImZFPJbDoQ1ANc 64
document_classification/common/parsers/config.py sha256=xK7h8JDbEhqmaz4qgf6CEJUJ0x3de9KkO2g-8e4z6tQ 254
document_classification/common/parsers/default_parser.py sha256=vFr7sgT1SgL1b8Q6stqrebLJAuTEhPVH01wi_HpMi90 2169
document_classification/common/parsers/interfaces.py sha256=tLwS7PN0gSLWwiXpCL_86B2Xe5zyXPjh9Onr6Ga9ewE 2172
document_classification/common/parsers/layout_preserving_formatter.py sha256=V9jFnUMKSXh0l8kHMUlfw7RUjZj4LlUkrxk6jGeHj40 1764
document_classification/common/schemas/__init__.py sha256=NHnnaWSTS6QUTF1L3k_1qWaAYyNqtchoxNtphnMq4ic 240
document_classification/common/schemas/bounding_box.py sha256=ck-0BqUVf3t1Bc-FlqBwXvvu7RovNW_LcaQNP4bRKh4 1628
document_classification/common/schemas/document.py sha256=mzU9zhz8HBqxZY0q7MD855I1lu_-ElkGRkvHSGjl7dU 386
document_classification/common/schemas/line.py sha256=zvuaH5i3LFyT08kFWMQQ4fX_J5bj9JMwBWnPxBHoW2o 379
document_classification/common/schemas/word.py sha256=5LBBg23w5oxQN3K8uoT2HxkB68kH2_IsbtHcHRh0Sy0 627
document_classification/common/tests/__init__.py sha256=LbNQwpC6RnSuyaVpVUG3vHNi_L-pv2Xte_ydtUlbdgY 36
document_classification/common/tests/conftest.py sha256=5YdbqBgHS3eY6JQgF3jWVufMOcfc4mUzlpGnV2antQU 191
document_classification/common/tests/fixtures/__init__.py sha256=5HVHutpWsLlLM8O-d6SrnArKbEKx6Ry-caLntypmAxo 209
document_classification/common/tests/fixtures/lines.py sha256=NTGJ6WWZ-r2Lq3doII4ufdMPt_H0BO3Z1uV1nRpbsWg 363
document_classification/common/tests/fixtures/ocr_df.py sha256=Q4PjqldTBbGEgthaV7Yh6yxnchP37LRK8W8yYNYilFs 818
document_classification/common/tests/fixtures/words.py sha256=6gEYuBZWM5o-Pt5KEAcDxpkfx_c9GJdZuL8cPBIfTss 388
document_classification/common/tests/parsers/__init__.py sha256=iMKnIUZgtR2EQti18Fd5RLwTqCTFR3quTUioglCtec4 47
document_classification/common/tests/parsers/test_default_parser.py sha256=XHAe09eWka-q3QY9vYd1cAGPAhECvRw1FzZ28AFVa0c 3055
document_classification/common/tests/parsers/test_layout_preserving_formatter.py sha256=hTfuWw9Rr4_B0LtHVgO6An7rZp8XIEprpPLa5HidFsA 2252
document_classification/common/tests/schemas/__init__.py sha256=24spHxvT0SrpZPGiqmfYpkLiCMLDmqipeKPQxWahS2c 51
document_classification/common/tests/schemas/test_bounding_box.py sha256=HY4rZhqPo8ZX5vAcp29UYi3BLIVcgShgdfUpGTo3zAE 1434
document_classification/common/tests/schemas/test_document.py sha256=fhOML_1NngFme8nhhhYEIcYGWH21QRBOmmgDh1NPM9A 659
document_classification/common/tests/schemas/test_line.py sha256=qy2kAvdL5sd8MJ5jxFfRMQeTXFl2fkqUOKKsRqAnujo 620
document_classification/common/tests/schemas/test_word.py sha256=Cx_-OnOSfNktjDMLFZtd_TVMU4CnRLE0LSZk_uAYe7w 1412
document_classification/common/tests/utils/__init__.py sha256=47DEQpj8HBSa-_TImW-5JCeuQeRkm5NMpJWZG3hSuFU 0
document_classification/common/tests/utils/test_dataframe_to_text.py sha256=Q32vqokJTvHZsOxyFrYUicUt0iYoRQqd_7eVjNPrjGo 1364
document_classification/common/utils/__init__.py sha256=JNsO_j_ZzJkvQ-SAgOuswcdqOFHiKEWdfXkh7HNdX50 205
document_classification/common/utils/create_enum.py sha256=Bk2P1WI9uQx0onKRZDtszP5URgjLaEzh6jtxZJjR48k 1017
document_classification/common/utils/dataframe_to_text.py sha256=lrpllJVypVyVpcEF9_R6MrXAe7lxbQJAQdhllQII2Rk 627
document_classification/common/utils/file_utils.py sha256=K7uSmjaZ2G28x5PLIPx5vu17EgIyaLoWs_fYRvB-eac 274
document_classification/common/utils/get_device.py sha256=ND6kztEp29TF89OSRwO3mRUJO8fFWgewyfutVqlsqWo 260
document_classification/common/utils/json_loader.py sha256=Th426TBpnqVhf8hFpBTfyu1fNQ0iZlM_qCpFMfsjSJ4 1150
document_classification/common/utils/json_to_ocr_text.py sha256=8-jHdHVoXlUAi3q6yAjHzmKgVi85BECaQq4Z5CLZFo0 824
document_classification/common/utils/parse_and_format.py sha256=cEcpgndJnLAbeeIFZq2Jb-jMwgQZNVhnqE5dLYoj1vw 658
document_classification/common/utils/pdf_images.py sha256=w6iga0setbynZDFoguDixwfLnnsEUlgQ2zWoOsSyNKU 2350
document_classification/common/utils/rotate.py sha256=eUKI3cEljCqHrZpnNkL-qkqEDbRnL6z8Aciq2OZhheM 1907
document_classification/fasttext_model/__init__.py sha256=47DEQpj8HBSa-_TImW-5JCeuQeRkm5NMpJWZG3hSuFU 0
document_classification/fasttext_model/evaluate/__init__.py sha256=47DEQpj8HBSa-_TImW-5JCeuQeRkm5NMpJWZG3hSuFU 0
document_classification/fasttext_model/evaluate/evaluate.py sha256=jE8EHfrVP0W02q-Vwy8oyXDdl8QhdsslwiO8WvnxjVY 2292
document_classification/fasttext_model/predictor/__init__.py sha256=-YvQA1QBRCR2x6kC7oUbDq1lZN7HBi6VsbWdchcI78U 75
document_classification/fasttext_model/predictor/predictor.py sha256=Au9OcLpD1wDRvERZmD3FOCVrydJap82WSKC3iUisNCU 1271
document_classification/fasttext_model/tests/__init__.py sha256=VvrcE_8xzdfqBQIH4DkJizR4qsZLMljYRi7Lc7Oqz9k 38
document_classification/fasttext_model/text_preprocessor.py sha256=meg7i2E62KXo7uELIQYNJIdArgefLKvDSd7jxSj9fpM 680
document_classification/fasttext_model/train/__init__.py sha256=47DEQpj8HBSa-_TImW-5JCeuQeRkm5NMpJWZG3hSuFU 0
document_classification/fasttext_model/train/create_dataset.py sha256=zJZ1fXNhuM3R0BapYB-ZhakagvlPHnWQIEMqXsZBe2Q 5435
document_classification/fasttext_model/train/train.py sha256=OO6NmCpG5XBR1jtrIyiKm1IaUXeLd_BVrRbwfpZ-MNw 792
document_classification/language_model/__init__.py sha256=BpwG5o-sVJJbBF6LgJigoVHURDpJUKlYgdeqeD3DMLk 68
document_classification/language_model/config.py sha256=qJrxbiBKD5_SeiekT8M-G1tvYV0xPL1Pk-u-BTy6UbI 196
document_classification/language_model/dataset_preparer/__init__.py sha256=47DEQpj8HBSa-_TImW-5JCeuQeRkm5NMpJWZG3hSuFU 0
document_classification/language_model/dataset_preparer/base.py sha256=JjUdU4oF5YVSH9STSOzob5B7NOvPjJ1NJULSY32Pogc 2693
document_classification/language_model/dataset_preparer/custom_collate_function.py sha256=AdzebFubwagehMU8P_anub-fk8F0gbYgyo43DgMHvRA 1224
document_classification/language_model/dataset_preparer/text_only.py sha256=49m9ppihRoqF_La3K0FqPdo78cG1CSkllW2WpJBkpbE 1302
document_classification/language_model/dataset_preparer/text_with_bbox.py sha256=AfygHmjAtVLwpQqVmV3bJJywTlLZfdGBdud6ljKPzb0 2151
document_classification/language_model/predictor.py sha256=AvnhIiuI_KXEqtIHFNItLUJJSGENQW6S_JtRdNn7HPY 2404
document_classification/language_model/schemas/__init__.py sha256=47DEQpj8HBSa-_TImW-5JCeuQeRkm5NMpJWZG3hSuFU 0
document_classification/language_model/schemas/ocr_dataset.py sha256=Hvo63ODuKoB4kMYiIrihUPCvpeqM5ImmtHIxMr8-l4E 597
document_classification/language_model/schemas/ocr_with_bbox_dataset.py sha256=uXmLf6sCAL8GF-PShmX-LOb3hD0vpUZCax3aaB-jduE 876
document_classification/language_model/schemas/slm_model.py sha256=p0uBJPkNnmO-WBmXGaxFa4o1eWiv1to4ePNZ7iT_PDA 1322
document_classification/language_model/tests/__init__.py sha256=YJXXNJZ57TVqj77AsT86M1p3AfTvFNNPrgIto2eAikg 44
document_classification/language_model/tokenizers/__init__.py sha256=47DEQpj8HBSa-_TImW-5JCeuQeRkm5NMpJWZG3hSuFU 0
document_classification/language_model/tokenizers/base.py sha256=skXMPMXVAg4gHy7CvXQeH97HLWE23YTlHXbH7QQVNJ4 243
document_classification/language_model/tokenizers/text.py sha256=SEUAg1zXhwMyC9RWN1GDE_o7P2qjXKruvgEqwauLiy0 1027
document_classification/language_model/tokenizers/text_with_layout.py sha256=TgNOhWzLhVAgBjyyQLxfuxlShNbg7RPKdzPFqSZZBnI 3127
document_classification/language_model/trainer.py sha256=o8L99QHMHSh6j8J5jaHnBQE19xEhi_Xi4bhZpNMwpWs 2751
document_classification/llm/__init__.py sha256=iKj1RG9tdrqjmt_J_2-XWyVstd_E4lvRlN0ej4Gql8Y 78
document_classification/llm/build_classification_schema.py sha256=4Ziob7p2vSxbRGc0QE13NpSBzzHed08Qlcsghn1DiK0 1417
document_classification/llm/classifier.py sha256=fAdE3VIRRxICEw5xKUPZIJHenLwl9RQ8CfwRT8wdd1M 2604
document_classification/llm/config.py sha256=pQavFowG1ySpJdszI1tegYr0fh6rGLjGDocx-iO1L14 391
document_classification/llm/evaluation.py sha256=wntybB7et3fK4xW3mlIBMUPy-WHlA9ypB4dVyFTNssA 3333
document_classification/llm/prompt_technique.py sha256=PaVQ-SQZVLkyKmgj7M6G8r24yNGM6f8WvGG6X_Mr3K4 750
document_classification/llm/schemas/__init__.py sha256=nOeoywVeE7dMl0RKH6QibI8DrObnL7PSkO8yaukVqB0 54
document_classification/llm/schemas/classification_entity.py sha256=wH5nIJ7TXL310O2D3a_XbmpcRUDIcI-DvtY7AfnjOMk 198
document_classification/llm/schemas/document_classification.py sha256=JsRGzdb9ursYzaEHOOdnygFg4I9kQhfOynvaJ81pSvU 424
document_classification/llm/schemas/document_classification_cot.py sha256=QXMaPNf3JF7PqNPxao7wK7iDZYxloHWDdc-2gD-5pk8 383
document_classification/llm/tests/__init__.py sha256=5INpaupq8U7Rrzn-J39GXzj8F_qBtRJquBgLz5qY9ow 33
document_classification/logger.py sha256=VAbPRSt26q080edXZVRRar36EYS0XzbrR_R_Q-YBif4 1701
document_classification/ocr/__init__.py sha256=DymrXVmhA0KG5DJ4LjQUwnp8YSEHf7bwLdDJOidvfYE 19
document_classification/ocr/base.py sha256=6w2HMsHpSEYCdZqIZbi-tKeOA8IVAPbxXPNDG5bSD_M 324
document_classification/ocr/config.py sha256=j0NQcv4ngoByCgU82sELjmA8UAi21BClRL9GX0KZ0f8 497
document_classification/ocr/exceptions/__init__.py sha256=qb5qgj6SWUMT7ZtjDP_ifYGA4zKQOIUazPevV7rRqUA 38
document_classification/ocr/exceptions/ocr_processing_error.py sha256=rj8wWP6Su2Hf6vr5zHqWBwb0oVYj4CbRw-F_AZR59X8 673
document_classification/ocr/providers/__init__.py sha256=QCfvRLWpJceR437IrPwtChYmIEioVYutdnOWxulG5g0 202
document_classification/ocr/providers/closed_source/__init__.py sha256=xVZgV52AvB9-VACgv_sVZikye_DBmbw9jvQNSgZpC0E 188
document_classification/ocr/providers/closed_source/google_vision/__init__.py sha256=5bNRmOIRAc8HSKtC9r1quHmJn9J4exnJYFFloFIqRck 119
document_classification/ocr/providers/closed_source/google_vision/client.py sha256=YHVqtuJf9mLKkBQDVlxF0586qJ-cgGdFbsFWtGLdT6M 313
document_classification/ocr/providers/closed_source/google_vision/credentials.py sha256=N0ZvURr6tADCaGFZlHHTLMbRiXmjlga-DjpUzndJPIk 1339
document_classification/ocr/providers/closed_source/google_vision/image_processor.py sha256=kOlV1fIMWrflTnr5UsCM82EVJ1yLyoZlxmeiJREKi2Y 2938
document_classification/ocr/providers/closed_source/google_vision/ocr.py sha256=o-MBIvgllIcwMBxl1zxoZNOaZZwPz0hHD4ARq2ygYTU 1377
document_classification/ocr/providers/closed_source/google_vision/ocr_processor.py sha256=ILpXhdag0PnkR-jQq1jZ88j9rBojPVKJkfsvu2FcLjc 3987
document_classification/ocr/providers/factory.py sha256=DnfmHQrmKRTkQHAySL8VCPvAwE68QFVyowW2JOn9QdE 1125
document_classification/ocr/providers/mappings.py sha256=x8aNRIW4Oe6t9azX0fzHqzWS8IlWzCX-LMPIA46ldFY 216
document_classification/ocr/providers/model_based/__init__.py sha256=47DEQpj8HBSa-_TImW-5JCeuQeRkm5NMpJWZG3hSuFU 0
document_classification/ocr/providers/open_source/__init__.py sha256=Y2B7Nfn72vopxliZyCxxg7bTvwk8p_egPWXOPnDPN54 180
document_classification/ocr/providers/open_source/paddle.py sha256=O8-fVqdZshmQ1VKAVciFSilDfpiLQ9y54j-FLsxklxI 1360
document_classification/ocr/providers/open_source/tesseract.py sha256=5p5hjcHke-8ixfesGjLVr_KxpM5BAq3Z2b78UyePxWI 1403
document_classification/ocr/readers/__init__.py sha256=xorPU6UkLv5U1E9SFRQdz9uJhKjGU8DHJ8KGq-Z1qpg 177
document_classification/ocr/readers/file_reader.py sha256=LZxUEPijAn2VtG5fg5MQpIQ2bBysyMsqIDqHVUEdPQU 1140
document_classification/ocr/readers/image_reader.py sha256=KGttyLYMVC8TR3C7rq5qkkP5_Ors4gHZT7g57hsGtlo 981
document_classification/ocr/readers/pdf_reader.py sha256=LklBVXsbHSF8RZS9PD2SkK-3dRorGrp7NonXzg7esWE 745
document_classification/ocr/schemas/__init__.py sha256=wgBtfPBxfB2wZutb8eGw4xnJqKOTR8-9PTTC4ouFemE 35
document_classification/ocr/schemas/ocr_result.py sha256=aHUMAxc7xMUPjBbDxIABN9A0TnigRpqRluGLNdll4gM 941
document_classification/ocr/tests/__init__.py sha256=OjsJCqtAXD9BhzExyEFR9Bv3jcuv07swu-c-Cb5nYks 33
document_classification/ocr/tests/schemas/__init__.py sha256=GOytOEcXCnw5W-ZHv1deJrVfAveSHusTVk9fhnOZijE 48
document_classification/ocr/tests/schemas/test_ocr_result.py sha256=kYmeUrpQGxC13Q_VaJiADhkkhGYGLVzRadkIUvvvqNU 2610
document_classification-0.0.2a0.dist-info/LICENSE sha256=vXdMg_COLL0PTV2RThMe1clm3yW06b0etafA8zHwuiA 1065
document_classification-0.0.2a0.dist-info/METADATA sha256=euNAVHrh6TlGMDoP44difF5YhS_sAfGkTxMplc-WBjg 2902
document_classification-0.0.2a0.dist-info/WHEEL sha256=Nq82e9rUAnEjt98J6MlVmMCZb-t9cYE2Ir1kpBmnWfs 88
document_classification-0.0.2a0.dist-info/RECORD