par_scrape

View on PyPIReverse Dependencies (0)

0.4.8 par_scrape-0.4.8-py3-none-any.whl

Wheel Details

Project: par_scrape
Version: 0.4.8
Filename: par_scrape-0.4.8-py3-none-any.whl
Download: [link]
Size: 29070
MD5: a17e8253ea745f112e99b8377d0a0220
SHA256: 5079c4bbb8186e318b0620c57bd0bcff4aad38a80a6fb29ef32e2f0a6d7670f9
Uploaded: 2024-11-06 17:07:19 +0000

dist-info

METADATA

Metadata-Version: 2.3
Name: par_scrape
Version: 0.4.8
Summary: A versatile web scraping tool with options for Selenium or Playwright, featuring OpenAI-powered data extraction and formatting.
Author-Email: Paul Robello <probello[at]gmail.com>
Maintainer-Email: Paul Robello <probello[at]gmail.com>
Project-Url: Homepage, https://github.com/paulrobello/par_scrape
Project-Url: Documentation, https://github.com/paulrobello/par_scrape/blob/main/README.md
Project-Url: Repository, https://github.com/paulrobello/par_scrape
Project-Url: Issues, https://github.com/paulrobello/par_scrape/issues
Project-Url: Discussions, https://github.com/paulrobello/par_scrape/discussions
Project-Url: Wiki, https://github.com/paulrobello/par_scrape/wiki
License: MIT License Copyright (c) 2024 Paul Robello Permission is hereby granted, free of charge, to any person obtaining a copy of this software and associated documentation files (the "Software"), to deal in the Software without restriction, including without limitation the rights to use, copy, modify, merge, publish, distribute, sublicense, and/or sell copies of the Software, and to permit persons to whom the Software is furnished to do so, subject to the following conditions: The above copyright notice and this permission notice shall be included in all copies or substantial portions of the Software. THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY, FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM, OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE SOFTWARE.
Keywords: data extraction,openai,playwright,selenium,web scraping
Classifier: Development Status :: 4 - Beta
Classifier: Environment :: Console
Classifier: Intended Audience :: Developers
Classifier: License :: OSI Approved :: MIT License
Classifier: Programming Language :: Python :: 3
Classifier: Programming Language :: Python :: 3.11
Classifier: Programming Language :: Python :: 3.12
Classifier: Topic :: Internet :: WWW/HTTP :: Browsers
Classifier: Topic :: Software Development :: Libraries :: Python Modules
Classifier: Topic :: Text Processing :: Markup :: HTML
Classifier: Typing :: Typed
Requires-Python: >=3.11
Requires-Dist: beautifulsoup4 (>=4.12.3)
Requires-Dist: boto3 (>=1.35.37)
Requires-Dist: botocore (>=1.35.37)
Requires-Dist: html2text (>=2024.2.26)
Requires-Dist: langchain-anthropic (>=0.1.23)
Requires-Dist: langchain-aws (>=0.2.2)
Requires-Dist: langchain-community (>=0.2.16)
Requires-Dist: langchain-core (>=0.2.38)
Requires-Dist: langchain-experimental (>=0.0.65)
Requires-Dist: langchain-google-genai (>=1.0.10)
Requires-Dist: langchain-groq (>=0.1.9)
Requires-Dist: langchain-ollama (>=0.1.3)
Requires-Dist: langchain-openai (>=0.1.23)
Requires-Dist: langchain-text-splitters (>=0.2.4)
Requires-Dist: langchain (>=0.2.16)
Requires-Dist: ollama (>=0.3.2)
Requires-Dist: openai (>=1.43.0)
Requires-Dist: openpyxl (>=3.1.5)
Requires-Dist: pandas (>=2.2.2)
Requires-Dist: playwright (>=1.46.0)
Requires-Dist: pydantic (>=2.9.0)
Requires-Dist: python-dotenv (>=1.0.1)
Requires-Dist: rich (>=13.8.0)
Requires-Dist: selenium (>=4.24.0)
Requires-Dist: tabulate (>=0.9.0)
Requires-Dist: tiktoken (>=0.7.0)
Requires-Dist: typer (>=0.12.5)
Requires-Dist: webdriver-manager (>=4.0.2)
Description-Content-Type: text/markdown
License-File: LICENSE
[Description omitted; length: 9593 characters]

WHEEL

Wheel-Version: 1.0
Generator: hatchling 1.25.0
Root-Is-Purelib: true
Tag: py3-none-any

RECORD

Path Digest Size
par_scrape/__init__.py sha256=_nXGlLG-Fg0wFN14rjGlKf3tqAjRaIRAPqN5EX-uPwQ 675
par_scrape/__main__.py sha256=-ubsr4NbmB3vdgRbeZPj-hjgF0jSb7igbzJL6CTwLrM 13381
par_scrape/enums.py sha256=9ZHaISgn0tCQn6USZwo9m-YVLfBIRIkRAoZpfZs-tDU 517
par_scrape/extraction_prompt.md sha256=_Xx2bvMc6bPuxTpBi3MDJw8wNvPrBeEKk6jbCfnjOdQ 374
par_scrape/fetch_html.py sha256=uOL1BESTqEkmU7Qc828wAmRxK9-WRl1ObFQr3HGSIfQ 8987
par_scrape/py.typed sha256=47DEQpj8HBSa-_TImW-5JCeuQeRkm5NMpJWZG3hSuFU 0
par_scrape/scrape_data.py sha256=PnxaTM31LpDdy5WhE5_futAjApd-lhL9FsTuwjOxq48 7740
par_scrape/utils.py sha256=8oWPZ1zUyfOgl1dcXIMgLzodUNNGkRT6Wlhy-0_skBM 1230
par_scrape/lib/__init__.py sha256=47DEQpj8HBSa-_TImW-5JCeuQeRkm5NMpJWZG3hSuFU 0
par_scrape/lib/llm_config.py sha256=_5F5StKmWfh-yyiyh6VPdGvuAW-P4S7op0EquMlA2wo 16970
par_scrape/lib/llm_image_utils.py sha256=fmOXmCYK36ugurWOANIwYvPGbtIZW1A6bYMT6jYnthw 1357
par_scrape/lib/llm_providers.py sha256=zZ7BCg7YcRjj0NYdgZ7sD_c_5iMxopaO_mWjraAnegM 6300
par_scrape/lib/output_utils.py sha256=uPGddHdCGaSSjCDNXYUaRYYDy64_bQvwwpVUa9aEOrA 3968
par_scrape/lib/pricing_lookup.py sha256=IXW4qf6o-tg85KSDOGRJyWkEK8Nx7_Ap8A0bNBWjhyo 7769
par_scrape/lib/provider_cb_info.py sha256=1Y7G1KL1zxV863YN3ISYcOERfw4od7ZX8PWxsjcnGoM 4188
par_scrape/lib/user_agents.py sha256=mm5gdLEWy0LUjCRCBsY618k5QfqNLxmMFTDk9VeSCQo 6541
par_scrape-0.4.8.dist-info/METADATA sha256=AYx6BB_42Pvd1NtLnGNVyiBEbg5qhK7MD_Wy2-h7Wzw 13242
par_scrape-0.4.8.dist-info/WHEEL sha256=1yFddiXMmvYK7QYTqtRNtX66WJ0Mz8PYEiEUoOUUxRY 87
par_scrape-0.4.8.dist-info/entry_points.txt sha256=2tCGtiLIfK_SVJtO45aQ4OcYX3XHdYcfCAlWjKUCrRg 55
par_scrape-0.4.8.dist-info/licenses/LICENSE sha256=mM7wR5vvvL5avX6RGqhwhOGjxyojbty3pmJSEHOqeFM 1069
par_scrape-0.4.8.dist-info/RECORD

entry_points.txt

par_scrape = par_scrape.__main__:app