RAGScraper

View on PyPIReverse Dependencies (0)

11.5.2023 ragscraper-11.5.2023-py3-none-any.whl

Wheel Details

Project: RAGScraper
Version: 11.5.2023
Filename: ragscraper-11.5.2023-py3-none-any.whl
Download: [link]
Size: 5282
MD5: 6241fa19c3740baaa6e620eaa8b20bdf
SHA256: e1c4553ea6b80634015d7674aaeeb3aff50d440abe604e10973d267f19bd5466
Uploaded: 2023-11-06 02:41:43 +0000

dist-info

METADATA

Metadata-Version: 2.1
Name: RAGScraper
Version: 11.5.2023
Summary: RAGScraper is a Python library designed for efficient and intelligent scraping of web documentation and content. Tailored for Retrieval-Augmented Generation systems, RAGScraper extracts and preprocesses text into structured, machine-learning-ready formats. It emphasizes precision, context preservation, and ease of integration with RAG models, making it an ideal tool for developers looking to enhance AI-driven applications with rich, web-sourced knowledge.
Author: kdcokenny
Author-Email: kenny[at]elapse.ai
License: MIT
Classifier: License :: OSI Approved :: MIT License
Classifier: Programming Language :: Python :: 3
Classifier: Programming Language :: Python :: 3.10
Classifier: Programming Language :: Python :: 3.11
Requires-Python: >=3.10,<4.0
Requires-Dist: beautifulsoup4 (<5.0.0,>=4.12.2)
Requires-Dist: html2text (<2021.0.0,>=2020.1.16)
Requires-Dist: requests (<3.0.0,>=2.31.0)
Description-Content-Type: text/markdown
[Description omitted; length: 955 characters]

WHEEL

Wheel-Version: 1.0
Generator: poetry-core 1.7.0
Root-Is-Purelib: true
Tag: py3-none-any

RECORD

Path Digest Size
rag_scraper/__init__.py sha256=47DEQpj8HBSa-_TImW-5JCeuQeRkm5NMpJWZG3hSuFU 0
rag_scraper/cli.py sha256=lT8lS6vg12vpwaqOpgp7xSl9a-9wtuOkZVfi3o92ki4 1808
rag_scraper/converter.py sha256=OghiINDJJNKd-BK0sj42zRX3wekeslE3RI2LEAuiaoI 1947
rag_scraper/link_extractor.py sha256=9rrEGLWmJaNcWN5vnm9McRKrSH_q-e7jbwmX74hYsBE 2561
rag_scraper/scraper.py sha256=N5FE9xZKg5ODDKWYkD6UMdn7DgeXHgYnlbF5L8tQVzg 387
rag_scraper/utils.py sha256=KxEyj447kVzufSovI1TluWsVzYSMSXJr-7oSPOHz6qc 323
ragscraper-11.5.2023.dist-info/METADATA sha256=VQWB_EWb_jC1-ejm4nZX8_lr5a6boKVN4bYMeP3PsfA 1953
ragscraper-11.5.2023.dist-info/WHEEL sha256=d2fvjOD7sXsVzChCqf0Ty0JbHKBaLYwDbGQDwQTnJ50 88
ragscraper-11.5.2023.dist-info/RECORD