document-ingestor

View on PyPIReverse Dependencies (0)

0.1.1 document_ingestor-0.1.1-py3-none-any.whl

Wheel Details

Project: document-ingestor
Version: 0.1.1
Filename: document_ingestor-0.1.1-py3-none-any.whl
Download: [link]
Size: 8926
MD5: ea646fc5cf734fdd355189b3c28d7a8a
SHA256: e440985ff1e1aa1d0847716a356f79cfa5466573e6718898c7f5c685a43932de
Uploaded: 2024-04-17 19:15:50 +0000

dist-info

METADATA

Metadata-Version: 2.1
Name: document_ingestor
Version: 0.1.1
Summary: This package consumes one or more Spanish constitution PDFs and then processes them to generate embedding vectors. The vectors are generated with OpenAI service and PineCone is used to store and retrieve embedding vectors.
Author-Email: Milan Anand Raj <manandraj20[at]iitk.ac.in>
Project-Url: Homepage, https://github.com/pypa/sampleproject
Project-Url: Issues, https://github.com/pypa/sampleproject/issues
License: Copyright (c) 2018 The Python Packaging Authority Permission is hereby granted, free of charge, to any person obtaining a copy of this software and associated documentation files (the "Software"), to deal in the Software without restriction, including without limitation the rights to use, copy, modify, merge, publish, distribute, sublicense, and/or sell copies of the Software, and to permit persons to whom the Software is furnished to do so, subject to the following conditions: The above copyright notice and this permission notice shall be included in all copies or substantial portions of the Software. THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY, FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM, OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE SOFTWARE.
Classifier: Programming Language :: Python :: 3
Classifier: License :: OSI Approved :: MIT License
Classifier: Operating System :: OS Independent
Requires-Python: >=3.8
Requires-Dist: pymupdf (<=1.23.11)
Requires-Dist: openai
Requires-Dist: pinecone-client
Requires-Dist: numpy
Requires-Dist: tiktoken
Description-Content-Type: text/markdown
License-File: LICENSE.txt
[Description omitted; length: 276 characters]

WHEEL

Wheel-Version: 1.0
Generator: bdist_wheel (0.43.0)
Root-Is-Purelib: true
Tag: py3-none-any

RECORD

Path Digest Size
document_ingestor/__init__.py sha256=47DEQpj8HBSa-_TImW-5JCeuQeRkm5NMpJWZG3hSuFU 0
document_ingestor/embedding_process.py sha256=Kixicltx2B_w93w3KZuglqtdYfaeoOmOHteZ2JWLwrg 5779
document_ingestor/eu_data_parser.py sha256=VoiQYPNGOpD0wAkN-Kh3puxYuGc-DvkendSu4m8OGfE 1865
document_ingestor/genericParser.py sha256=gy7Ofkhdv4nuZkF4uoP12Z-m7ZSzHM7YfOIvJH_6oTM 2744
document_ingestor/pdf_parser.py sha256=JWwmqliqvXENzgmlTZWtEpjGzUw0W2OiTiWbwvZSnpw 6146
document_ingestor-0.1.1.dist-info/LICENSE.txt sha256=6kbiFSfobTZ7beWiKnHpN902HgBx-Jzgcme0SvKqhKY 1091
document_ingestor-0.1.1.dist-info/METADATA sha256=vlfSoXxm-Q68TcZmeJmiHxkPGxgp-rDLZW_vbVZCV54 2388
document_ingestor-0.1.1.dist-info/WHEEL sha256=GJ7t_kWBFywbagK5eo9IoUwLW6oyOeTKmQ-9iHFVNxQ 92
document_ingestor-0.1.1.dist-info/top_level.txt sha256=nGv_GDwKrPFq0sQGOxiFVOMzeMkDaQwez9W6_snRcq8 18
document_ingestor-0.1.1.dist-info/RECORD

top_level.txt

document_ingestor