webarticlecurator

1.13.0 webarticlecurator-1.13.0-py3-none-any.whl

Wheel Details

Project: webarticlecurator
Version: 1.13.0
Filename: webarticlecurator-1.13.0-py3-none-any.whl
Size: 37562 bytes
MD5: e202d8e49db6f0661da88e2d6572583f
SHA256: 2b5dd645f944a80a55d14cee860a2c0ed42fd9352aa981be458911a0ad5faf1c
Uploaded: 2024-12-19 00:33:08 +0000
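
The SHA256 value listed above can be used to check a downloaded copy of the wheel. The following is a minimal sketch using only the standard library; the helper names `sha256_of_file` and `matches_listing` are illustrative and not part of webarticlecurator, and the local file path is whatever the wheel was saved as.

```python
import hashlib

# SHA256 copied from the wheel details above.
EXPECTED_SHA256 = "2b5dd645f944a80a55d14cee860a2c0ed42fd9352aa981be458911a0ad5faf1c"


def sha256_of_file(path, chunk_size=65536):
    """Return the hex SHA256 digest of a file, read in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as fh:
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_listing(path, expected=EXPECTED_SHA256):
    """True if the file's SHA256 equals the expected hex digest."""
    return sha256_of_file(path) == expected.lower()
```

Calling `matches_listing("webarticlecurator-1.13.0-py3-none-any.whl")` on a local download should return `True` only if the file is byte-for-byte the one described here.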

dist-info

METADATA

Metadata-Version: 2.1
Name: webarticlecurator
Version: 1.13.0
Summary: A crawler program to download content from portals (news, forums, blogs) and convert it to the desired output format according to the configuration.
Author: dlazesz
Home-Page: https://github.com/ELTE-DH/WebArticleCurator
Project-Url: Repository, https://github.com/ELTE-DH/WebArticleCurator
License: LGPLv3
Classifier: Development Status :: 5 - Production/Stable
Classifier: License :: Other/Proprietary License
Classifier: Operating System :: OS Independent
Classifier: Programming Language :: Python :: 3
Classifier: Programming Language :: Python :: 3.8
Classifier: Programming Language :: Python :: 3.9
Classifier: Programming Language :: Python :: 3.10
Classifier: Programming Language :: Python :: 3.11
Classifier: Programming Language :: Python :: 3.12
Classifier: Topic :: Software Development :: Libraries :: Python Modules
Requires-Python: >=3.8,<4.0
Requires-Dist: beautifulsoup4 (<5.0.0,>=4.12.3)
Requires-Dist: chardet (<6.0.0,>=5.2.0)
Requires-Dist: lxml (<6.0.0,>=5.3.0)
Requires-Dist: mplogger (<2.0.0,>=1.1.0)
Requires-Dist: newspaper3k (<0.3.0,>=0.2.8); extra == "newspaper3k" or extra == "full"
Requires-Dist: pyyaml (<7.0.0,>=6.0.2)
Requires-Dist: ratelimit (<3.0.0,>=2.2.1)
Requires-Dist: requests (<3.0.0,>=2.32.3)
Requires-Dist: urllib3 (<3.0.0,>=2.2.3)
Requires-Dist: warcio (<2.0.0,>=1.7.0)
Requires-Dist: yamale (<6.0.0,>=5.2.1)
Provides-Extra: full
Provides-Extra: newspaper3k
Description-Content-Type: text/markdown
[Description omitted; length: 13769 characters]
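
The `Requires-Dist` entries above mix unconditional dependencies with one that is only pulled in by the `newspaper3k` or `full` extra. The sketch below separates a requirement from its marker and checks whether it applies for a given extra. It is deliberately naive: it only understands markers made of `extra == "name"` clauses joined by `or`, which is all this METADATA uses, and the function names are hypothetical. A real tool would use a full marker parser.

```python
def split_requirement(value):
    """Split a Requires-Dist value into (requirement, marker or None)."""
    requirement, _, marker = value.partition(";")
    return requirement.strip(), marker.strip() or None


def applies(value, extra=None):
    """Naive check: does this Requires-Dist entry apply for the given extra?

    Assumes markers consist only of 'extra == "name"' clauses joined by 'or'.
    """
    _, marker = split_requirement(value)
    if marker is None:
        return True  # unconditional dependency
    clauses = [clause.strip() for clause in marker.split(" or ")]
    return extra is not None and f'extra == "{extra}"' in clauses


optional = 'newspaper3k (<0.3.0,>=0.2.8); extra == "newspaper3k" or extra == "full"'
print(applies(optional))                 # plain install
print(applies(optional, extra="full"))   # install with the "full" extra
```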

WHEEL

Wheel-Version: 1.0
Generator: poetry-core 1.8.1
Root-Is-Purelib: true
Tag: py3-none-any

RECORD

Path Digest Size
webarticlecurator/__init__.py sha256=ZJFmG62LfBeT7FHf4R6YE3A8oGcbbLTGOj4KoCPg6JY 1022
webarticlecurator/__main__.py sha256=mT7hRIWxxPne6tqFcpJJ1FCJ_3mRdbXiOrkj8DFB4IM 18083
webarticlecurator/crawl_schema.yaml sha256=3Y55s3zz9NyTi4TkqfjDjykCeUTMsQ4ANnhUbS8NxxY 567
webarticlecurator/enhanced_downloader.py sha256=XR-7Eo3Kgz05RYPZsA8ZXbD52MkC8RkwHiWdv1LGeEc 27144
webarticlecurator/news_crawler.py sha256=zXJ3m-RBP1kt8uINYIT3CciKaUUIFg1IudigbchxNE0 19093
webarticlecurator/other_modes.py sha256=WK1BkJ5t6KQX3Ph4IFCDyfc1e0zYCixzZCkk6gUlj1s 6854
webarticlecurator/site_schema.yaml sha256=iiFLdFh0WfCLRqkoXWWHmBIGhrNzJ4CkjNJlxl9cON0 1755
webarticlecurator/strategies.py sha256=gj44a8vf_eEXpPgR5D_WJpkPWz629FR1ynWvwtPqt2s 9265
webarticlecurator/utils.py sha256=mFzoBQ28Jfwbrl5CYzmgr-2eMpywnxA_xIJT6aHveMc 12934
webarticlecurator/version.py sha256=mfyq8J_DbCIiY0XagwgKtz7L6M2c0zrlgTaPUNH2msw 285
webarticlecurator-1.13.0.dist-info/LICENSE sha256=46mU2C5kSwOnkqkw9XQAJlhBL2JAf1_uCD8lVcXyMRg 7652
webarticlecurator-1.13.0.dist-info/METADATA sha256=vOh6hmWq7TrMkUe5bAQ7JHr_cnIhGx_wNc8A98nT0a4 15299
webarticlecurator-1.13.0.dist-info/WHEEL sha256=FMvqSimYX_P7y0a7UY-_Mc83r5zkBZsCYPm7Lr0Bsq4 88
webarticlecurator-1.13.0.dist-info/entry_points.txt sha256=iNaV8s89rKKBjOcKL8fR8Nj-tzi_CovJSqiCQRmXkLo 69
webarticlecurator-1.13.0.dist-info/RECORD
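
The digests in the RECORD listing follow the wheel convention of `sha256=` plus the URL-safe base64 encoding of the raw hash with the trailing `=` padding removed, which is why they look different from the hex SHA256 in the wheel details. The sketch below reproduces that encoding and parses one row as displayed on this page (space-separated; the RECORD file inside the wheel is CSV). The function names are illustrative, and the row parser assumes paths without spaces.

```python
import base64
import hashlib


def record_digest(data: bytes) -> str:
    """Encode a SHA256 digest the way RECORD entries are written."""
    raw = hashlib.sha256(data).digest()
    encoded = base64.urlsafe_b64encode(raw).rstrip(b"=").decode("ascii")
    return f"sha256={encoded}"


def parse_record_row(row: str):
    """Parse a 'path digest size' row as shown in the listing above.

    The RECORD file's own entry has no digest or size, so those
    fields come back as None for it.
    """
    parts = row.split()
    if len(parts) == 1:
        return parts[0], None, None
    path, digest, size = parts
    return path, digest, int(size)
```

To check an extracted file, compare `record_digest(open(path, "rb").read())` with the digest column of its row.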

entry_points.txt

webarticlecurator = webarticlecurator.__main__:main
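
An entry point line has the form `name = module:attribute`. The sketch below splits such a line into its parts and shows how the target could be resolved to a callable with `importlib`. The helper names are hypothetical, and resolving the line above would only work where webarticlecurator is installed, so the usage example resolves a standard-library target instead.

```python
import importlib


def parse_entry_point(line: str):
    """Split 'name = module:attr' into a dict of its three parts."""
    name, _, target = (part.strip() for part in line.partition("="))
    module, _, attr = target.partition(":")
    return {"name": name, "module": module, "attr": attr}


def resolve(module: str, attr: str):
    """Import the module and walk the (possibly dotted) attribute path."""
    obj = importlib.import_module(module)
    for piece in attr.split("."):
        obj = getattr(obj, piece)
    return obj


entry = parse_entry_point("webarticlecurator = webarticlecurator.__main__:main")
print(entry["name"], entry["module"], entry["attr"])

# Resolution demonstrated on a standard-library function.
dumps = resolve("json", "dumps")
```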