docfusion

0.1.0 docfusion-0.1.0-py3-none-any.whl

Wheel Details

Project: docfusion
Version: 0.1.0
Filename: docfusion-0.1.0-py3-none-any.whl
Download: [link]
Size: 72779
MD5: 8af2a12e03548e4005c7561ec916f603
SHA256: cfa918558a558e0b3ee2b415a2094f49a9b44f7c6e880255af73cc2761d957cc
Uploaded: 2024-09-16 07:47:04 +0000
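
The size and SHA256 listed above can be used to check a downloaded copy of the wheel before installing it. The following is a minimal sketch using only the standard library; the function names `file_sha256` and `matches_listing` are illustrative and not part of docfusion, and the local path is whatever you saved the file as.

```python
import hashlib
import os

# Values copied from the Wheel Details listing above.
EXPECTED_SIZE = 72779
EXPECTED_SHA256 = "cfa918558a558e0b3ee2b415a2094f49a9b44f7c6e880255af73cc2761d957cc"


def file_sha256(path, chunk_size=65536):
    """Return the hex SHA-256 of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_listing(path):
    """True only if both the byte size and the SHA-256 match the listing."""
    return (
        os.path.getsize(path) == EXPECTED_SIZE
        and file_sha256(path) == EXPECTED_SHA256
    )
```

For example, `matches_listing("docfusion-0.1.0-py3-none-any.whl")` should return `True` for an unmodified download. The MD5 value is listed as well, but SHA256 is the stronger check.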

dist-info

METADATA

Metadata-Version: 2.1
Name: docfusion
Version: 0.1.0
Summary: Doc Fusion is a Data Sourcing framework capable of parsing various data types such as pdf, txt, md, docx, xlsx, csv and even a webpage url.
Author: Manoj Jahgirdar
Author-Email: manoj.jahgirdar[at]in.ibm.com
Home-Page: https://github.com/IBM/doc-fusion
Classifier: Programming Language :: Python :: 3
Classifier: License :: OSI Approved :: Apache Software License
Classifier: Operating System :: OS Independent
Requires-Python: >=3.9
Requires-Dist: ibm-watsonx-ai (==1.1.2)
Requires-Dist: pydantic (==2.8.2)
Requires-Dist: langchain (==0.2.12)
Requires-Dist: langchain-ibm (==0.1.11)
Requires-Dist: langchain-community (==0.2.11)
Requires-Dist: langchain-huggingface (==0.0.3)
Requires-Dist: sqlalchemy (==2.0.32)
Requires-Dist: pymupdf (==1.24.5)
Requires-Dist: fastapi (==0.110.3)
Requires-Dist: uvicorn[standard] (==0.23.2)
Requires-Dist: chromadb (==0.4.15)
Requires-Dist: langchain-core (==0.2.28)
Requires-Dist: sentence-transformers (==3.0.0)
Requires-Dist: openpyxl (==3.1.4)
Requires-Dist: mammoth (==1.8.0)
Requires-Dist: xhtml2pdf (==0.2.16)
Requires-Dist: ibm-cos-sdk (==2.13.2)
Requires-Dist: pandas (==2.1.4)
Description-Content-Type: text/markdown
License-File: LICENSE
[Description omitted; length: 347 characters]
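
METADATA uses email-style `Key: value` headers, so the pinned requirements can be read with the standard library. The sketch below parses a short excerpt copied from the listing above; `pinned_requirements` is a hypothetical helper name, and it assumes each `Requires-Dist` value has the `name (specifier)` shape shown on this page.

```python
from email.parser import HeaderParser

# A short excerpt of the METADATA headers shown above.
METADATA_EXCERPT = """\
Metadata-Version: 2.1
Name: docfusion
Version: 0.1.0
Requires-Python: >=3.9
Requires-Dist: pydantic (==2.8.2)
Requires-Dist: uvicorn[standard] (==0.23.2)
Requires-Dist: pandas (==2.1.4)
"""


def pinned_requirements(metadata_text):
    """Map each Requires-Dist name to its version specifier."""
    message = HeaderParser().parsestr(metadata_text)
    pins = {}
    for requirement in message.get_all("Requires-Dist", []):
        # Split "name (==x.y.z)" at the first space, then drop the parentheses.
        name, _, specifier = requirement.partition(" ")
        pins[name] = specifier.strip("()")
    return pins
```

Because every dependency is pinned with `==`, installing this wheel next to other packages that need different versions of the same libraries is likely to produce resolver conflicts.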

WHEEL

Wheel-Version: 1.0
Generator: setuptools (70.2.0)
Root-Is-Purelib: true
Tag: py3-none-any

RECORD

Path Digest Size
core/__init__.py sha256=47DEQpj8HBSa-_TImW-5JCeuQeRkm5NMpJWZG3hSuFU 0
core/agent.py sha256=UrEf2dQg_O96SXatXt5e_U9d5SWAwwsMbyf9yta91qk 5177
core/output_parser.py sha256=lzXop-MxlKpx_vcVSG9dVyE5fTUf6vbAzQuuqkoSe3E 425
core/splitter.py sha256=WpQHsUvBWYGBosMyylDtAOgq1z-iFedcGprO8kTVoh4 4065
core/structured_data_loader.py sha256=r534MvkXtQssOiLUYhqd0ViiNXxdekCmhh3vpIoLOcE 1848
core/unstructured_data_loader.py sha256=E5U7LYKD3vbi9g4tK4ghFNP9dUZp2VjOele5BEDfLNg 3589
core/web_crawler_loader.py sha256=RMBizAxf2VJBO6TaFBbjzwVA4faP0llUKWbTUnQB3Rc 1726
services/__init__.py sha256=47DEQpj8HBSa-_TImW-5JCeuQeRkm5NMpJWZG3hSuFU 0
tests/__init__.py sha256=47DEQpj8HBSa-_TImW-5JCeuQeRkm5NMpJWZG3hSuFU 0
tests/main_test.py sha256=tNvhOZS5dPF6urxL1JZJfM-F9z9Xob_g44hxv9NSFXo 2855
tests/structured_loader_test.py sha256=mwJPMj1lOSDapNaxm0whI6NYUnA6wCbostDVOlVrXm0 2691
tests/unstructured_loader_test.py sha256=iy7jrOYNcP9pr8lfRa3gC8kWXUpKjIwpcTZQkmKhu6g 1572
tests/test-files/insurance.pdf sha256=ytlMmZdw4CRRgnK11kd-DQxPzbxYOGavznvG3I9WgDQ 50892
tests/test-files/samplestructured1.xlsx sha256=OGCacZtxZvOxk0vNW5yYUn1FUdin19vrPceoMu6iBeY 9023
tests/test-files/samplestructured2.csv sha256=Gil5uvtrhBKPPUI74Bp3qcNZBXEbrGKhNVeMnzTSC8U 73
tools/__init__.py sha256=47DEQpj8HBSa-_TImW-5JCeuQeRkm5NMpJWZG3hSuFU 0
tools/ask_user.py sha256=VymrQ3oMUVft-Zv1OckQbzEMIA23HZkXcFeC8JamcLk 476
tools/recognize_loader.py sha256=GTo_eWCAj5sW9P0ASjV2-4i7Ahzfmms1X25v3XKFG2U 803
tools/write_config_file.py sha256=n5jfxMClouO5UI-ed16Q4wuoWcZyVN1xWCzBBFjvI74 1166
docfusion-0.1.0.dist-info/LICENSE sha256=xx0jnfkXJvxRnG63LTGOxlggYnIysveWIZ6H3PNdCrQ 11357
docfusion-0.1.0.dist-info/METADATA sha256=W_X4wYKAZ5jY4Y6hYw9GLFFMOmEAcKzCEPLxqFMNZBY 1556
docfusion-0.1.0.dist-info/WHEEL sha256=y4mX-SOX4fYIkonsAGA5N0Oy-8_gI4FXw5HNI1xqvWg 91
docfusion-0.1.0.dist-info/top_level.txt sha256=bkj9l7KCGtjArkyWD6CE6LnSh9_qUgE8CSIW9djoVgQ 26
docfusion-0.1.0.dist-info/RECORD
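
Each digest in RECORD is the SHA-256 of the file's bytes, encoded as URL-safe base64 with the trailing `=` padding removed. The sketch below reproduces that encoding with the standard library; `record_digest` and `entry_matches` are illustrative names, not part of any packaging tool.

```python
import base64
import hashlib


def record_digest(data: bytes) -> str:
    """Return a digest string in the form used by the RECORD listing."""
    raw = hashlib.sha256(data).digest()
    encoded = base64.urlsafe_b64encode(raw).rstrip(b"=").decode("ascii")
    return "sha256=" + encoded


def entry_matches(data: bytes, digest: str, size: int) -> bool:
    """Check file contents against one RECORD row (digest and size)."""
    return len(data) == size and record_digest(data) == digest
```

The four empty `__init__.py` files share the digest `sha256=47DEQpj8HBSa-_TImW-5JCeuQeRkm5NMpJWZG3hSuFU` because that is the digest of zero bytes, so `entry_matches(b"", "sha256=47DEQpj8HBSa-_TImW-5JCeuQeRkm5NMpJWZG3hSuFU", 0)` returns `True`. The RECORD file lists itself without a digest or size, since it cannot contain its own hash.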

top_level.txt

core
services
tests
tools