par_scrape

View on PyPIReverse Dependencies (0)

0.4.3 par_scrape-0.4.3-py3-none-any.whl

Wheel Details

Project: par_scrape
Version: 0.4.3
Filename: par_scrape-0.4.3-py3-none-any.whl
Download: [link]
Size: 21467
MD5: b9ffab8ed860421817a39d06cfe64956
SHA256: 44936e0d1164c63e701b38110a3afc4a5dd7daa9c278e9254210fac7ddab7209
Uploaded: 2024-09-11 02:12:35 +0000

dist-info

METADATA

Metadata-Version: 2.3
Name: par_scrape
Version: 0.4.3
Summary: A versatile web scraping tool with options for Selenium or Playwright, featuring OpenAI-powered data extraction and formatting.
Author-Email: Paul Robello <probello[at]gmail.com>
Maintainer-Email: Paul Robello <probello[at]gmail.com>
Project-Url: Homepage, https://github.com/paulrobello/par_scrape
Project-Url: Documentation, https://github.com/paulrobello/par_scrape/blob/main/README.md
Project-Url: Repository, https://github.com/paulrobello/par_scrape
Project-Url: Issues, https://github.com/paulrobello/par_scrape/issues
Project-Url: Discussions, https://github.com/paulrobello/par_scrape/discussions
Project-Url: Wiki, https://github.com/paulrobello/par_scrape/wiki
License: MIT License Copyright (c) 2024 Paul Robello Permission is hereby granted, free of charge, to any person obtaining a copy of this software and associated documentation files (the "Software"), to deal in the Software without restriction, including without limitation the rights to use, copy, modify, merge, publish, distribute, sublicense, and/or sell copies of the Software, and to permit persons to whom the Software is furnished to do so, subject to the following conditions: The above copyright notice and this permission notice shall be included in all copies or substantial portions of the Software. THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY, FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM, OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE SOFTWARE.
Keywords: data extraction,openai,playwright,selenium,web scraping
Classifier: Development Status :: 4 - Beta
Classifier: Environment :: Console
Classifier: Intended Audience :: Developers
Classifier: License :: OSI Approved :: MIT License
Classifier: Programming Language :: Python :: 3
Classifier: Programming Language :: Python :: 3.11
Classifier: Programming Language :: Python :: 3.12
Classifier: Topic :: Internet :: WWW/HTTP :: Browsers
Classifier: Topic :: Software Development :: Libraries :: Python Modules
Classifier: Topic :: Text Processing :: Markup :: HTML
Classifier: Typing :: Typed
Requires-Python: >=3.11
Requires-Dist: aiofiles (>=24.1.0)
Requires-Dist: beautifulsoup4 (>=4.12.3)
Requires-Dist: html2text (>=2024.2.26)
Requires-Dist: langchain-anthropic (>=0.1.23)
Requires-Dist: langchain-community (>=0.2.16)
Requires-Dist: langchain-core (>=0.2.38)
Requires-Dist: langchain-experimental (>=0.0.65)
Requires-Dist: langchain-google-genai (>=1.0.10)
Requires-Dist: langchain-groq (>=0.1.9)
Requires-Dist: langchain-ollama (>=0.1.3)
Requires-Dist: langchain-openai (>=0.1.23)
Requires-Dist: langchain-text-splitters (>=0.2.4)
Requires-Dist: langchain (>=0.2.16)
Requires-Dist: ollama (>=0.3.2)
Requires-Dist: openai (>=1.43.0)
Requires-Dist: openpyxl (>=3.1.5)
Requires-Dist: pandas (>=2.2.2)
Requires-Dist: playwright (>=1.46.0)
Requires-Dist: pydantic (>=2.9.0)
Requires-Dist: python-dotenv (>=1.0.1)
Requires-Dist: rich (>=13.8.0)
Requires-Dist: selenium (>=4.24.0)
Requires-Dist: sentence-transformers (>=3.0.1)
Requires-Dist: tabulate (>=0.9.0)
Requires-Dist: tiktoken (>=0.7.0)
Requires-Dist: typer (>=0.12.5)
Requires-Dist: typing-extensions (>=4.12.2)
Requires-Dist: webdriver-manager (>=4.0.2)
Description-Content-Type: text/markdown
License-File: LICENSE
[Description omitted; length: 6974 characters]

WHEEL

Wheel-Version: 1.0
Generator: hatchling 1.25.0
Root-Is-Purelib: true
Tag: py3-none-any

RECORD

Path Digest Size
par_scrape/__init__.py sha256=nOeM3vrupRbV1dsXFt-N8wD1CCR5UciPsVHI1UXrYjQ 675
par_scrape/__main__.py sha256=zdtza4gD2qSpePmv5WlkG8Xu2PRPeDVgfcbAY2Fxj_E 14363
par_scrape/extraction_prompt.md sha256=_Xx2bvMc6bPuxTpBi3MDJw8wNvPrBeEKk6jbCfnjOdQ 374
par_scrape/fetch_html.py sha256=tTPWwn1-n5gzTpMyt6nwhp3KXS_PSBgQpzuWiSPf6P4 5293
par_scrape/pricing.py sha256=S0FUTzr2lvEguCotyQTLaQKVAF6v-bLj935J6gK6_ok 5685
par_scrape/py.typed sha256=47DEQpj8HBSa-_TImW-5JCeuQeRkm5NMpJWZG3hSuFU 0
par_scrape/scrape_data.py sha256=mDT_pDLJ1X6vuaYl3PCVpigzYJq_3EtIlfK1gUldxFA 8358
par_scrape/utils.py sha256=8oWPZ1zUyfOgl1dcXIMgLzodUNNGkRT6Wlhy-0_skBM 1230
par_scrape/lib/__init__.py sha256=47DEQpj8HBSa-_TImW-5JCeuQeRkm5NMpJWZG3hSuFU 0
par_scrape/lib/llm_config.py sha256=t9jznfzWlFYdYEuDFmzJyvVUtC5NXXR1RqXmN_Q8rd8 7263
par_scrape/lib/llm_providers.py sha256=51dp7Q0Xxb4utG7PavRriaFte4icN2bPsK8ibn3LWjM 1325
par_scrape/lib/output_utils.py sha256=U7yfCVeUyerItrs63m9AEeZs15g6COmqYvpDZAr2DFg 1460
par_scrape/lib/par_ollama_embeddings.py sha256=R1WGlQKcxEw9z6AMX4zeHbfNAcARPdbfGRx3L71h2AE 2216
par_scrape-0.4.3.dist-info/METADATA sha256=dAYO5udCmrKP-wjsD7z3NHYFXsWk6oo_Z9A-dj_HKmk 10641
par_scrape-0.4.3.dist-info/WHEEL sha256=1yFddiXMmvYK7QYTqtRNtX66WJ0Mz8PYEiEUoOUUxRY 87
par_scrape-0.4.3.dist-info/entry_points.txt sha256=2tCGtiLIfK_SVJtO45aQ4OcYX3XHdYcfCAlWjKUCrRg 55
par_scrape-0.4.3.dist-info/licenses/LICENSE sha256=mM7wR5vvvL5avX6RGqhwhOGjxyojbty3pmJSEHOqeFM 1069
par_scrape-0.4.3.dist-info/RECORD

entry_points.txt

par_scrape = par_scrape.__main__:app