duplicate-url-discarder

View on PyPIReverse Dependencies (1)

0.2.0 duplicate_url_discarder-0.2.0-py3-none-any.whl

Wheel Details

Project: duplicate-url-discarder
Version: 0.2.0
Filename: duplicate_url_discarder-0.2.0-py3-none-any.whl
Download: [link]
Size: 13934
MD5: aca4064e538452d14349eb61b803ab34
SHA256: 8d31005af854ffb42b59869e404fa74fc321f4d9f37fb7f6ffbe29ac587ca780
Uploaded: 2024-07-23 12:13:01 +0000

dist-info

METADATA

Metadata-Version: 2.1
Name: duplicate-url-discarder
Version: 0.2.0
Summary: Discarding duplicate URLs based on rules.
Author-Email: Zyte Group Ltd <info[at]zyte.com>
Project-Url: Source, https://github.com/zytedata/duplicate-url-discarder
License: The MIT License (MIT) Copyright (c) 2024 Zyte Group Ltd Permission is hereby granted, free of charge, to any person obtaining a copy of this software and associated documentation files (the "Software"), to deal in the Software without restriction, including without limitation the rights to use, copy, modify, merge, publish, distribute, sublicense, and/or sell copies of the Software, and to permit persons to whom the Software is furnished to do so, subject to the following conditions: The above copyright notice and this permission notice shall be included in all copies or substantial portions of the Software. THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY, FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM, OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE SOFTWARE.
Classifier: Development Status :: 3 - Alpha
Classifier: Intended Audience :: Developers
Classifier: License :: OSI Approved :: BSD License
Classifier: Natural Language :: English
Classifier: Operating System :: OS Independent
Classifier: Programming Language :: Python :: 3
Classifier: Programming Language :: Python :: 3.8
Classifier: Programming Language :: Python :: 3.9
Classifier: Programming Language :: Python :: 3.10
Classifier: Programming Language :: Python :: 3.11
Classifier: Programming Language :: Python :: 3.12
Classifier: Programming Language :: Python :: Implementation :: CPython
Requires-Python: >=3.8
Requires-Dist: Scrapy (>=2.11.0)
Requires-Dist: url-matcher (>=0.5.0)
Requires-Dist: w3lib (>=2.0.1)
Requires-Dist: duplicate-url-discarder-rules; extra == "rules"
Provides-Extra: rules
Description-Content-Type: text/x-rst
License-File: LICENSE
[Description omitted; length: 6250 characters]

WHEEL

Wheel-Version: 1.0
Generator: setuptools (71.1.0)
Root-Is-Purelib: true
Tag: py3-none-any

RECORD

Path Digest Size
duplicate_url_discarder/__init__.py sha256=w5ggrF61su1Wu-UDpOEDxpSPleXKwEZDGNtUpu950I0 144
duplicate_url_discarder/_addon.py sha256=rBieQhOnLisJ_vq0P-cWKoLgynt5Wkw1_AGsE5HneW0 1214
duplicate_url_discarder/_fingerprinter.py sha256=OUElrwMN6EuvIkeXVMK161NHUk3MGDDAiJrCyReFzLc 2636
duplicate_url_discarder/pipelines.py sha256=SBZvpM2pvT03Qz3LTfRdaRHts6VDNS-fQMYOJ9Tp_Sc 1538
duplicate_url_discarder/py.typed sha256=47DEQpj8HBSa-_TImW-5JCeuQeRkm5NMpJWZG3hSuFU 0
duplicate_url_discarder/rule.py sha256=7G_kGwlXgDyqzBsQxewmJvURJoR_ZuwMxX_qGRwoZl8 1600
duplicate_url_discarder/url_canonicalizer.py sha256=IEZGOv4w4Rga3hrGSuEy62RvL4td9vYVQ1DkGYo6JlU 1683
duplicate_url_discarder/utils.py sha256=YPUwyuhiEVWrpWJk0T3DbtuUm2kmKlh_q1GAcuEmL4A 844
duplicate_url_discarder/processors/__init__.py sha256=Is6N3eywOISe7sHzIDHgYhjcIbXJUyKBuw77B_bl3Gg 852
duplicate_url_discarder/processors/base.py sha256=M_cAAP_fNjUmEDlpdnYgA5YZMynS26KNVLJFal-4ZFg 536
duplicate_url_discarder/processors/normalize.py sha256=irI5pLKiAz7fcLdtDGiTCD8GpHHrrXvvFZbeQ3tfxxU 723
duplicate_url_discarder/processors/query_removal.py sha256=1GxuAVmwu1IuYs-rATkC_Md7oTD6vSkkxuKxtLa5qtw 537
duplicate_url_discarder/processors/query_removal_except.py sha256=nBo4NA9NtVYDtReEX1wGbl59b6zTSzqGdJdGvgRYcUM 536
duplicate_url_discarder/processors/subpath_removal.py sha256=VqTSSwkJ_Od96Vb6ulw45nHLq5AiQsfaerwXi1ADqiw 1218
duplicate_url_discarder-0.2.0.dist-info/LICENSE sha256=kpjNvvmhbqZ-CA2Uue93PYS2wHTPhRiRgyDAWYHbMlo 1081
duplicate_url_discarder-0.2.0.dist-info/METADATA sha256=LQcNJznMcxdyF-F6VwVpkEhbpQRlxyViawaewZGe0YE 8608
duplicate_url_discarder-0.2.0.dist-info/WHEEL sha256=Wyh-_nZ0DJYolHNn1_hMa4lM7uDedD_RGVwbmTjyItk 91
duplicate_url_discarder-0.2.0.dist-info/top_level.txt sha256=jLc7gQ-PrbtD6Ym1I0k7yHLu1neSMBGXga_lWCZ4GKs 24
duplicate_url_discarder-0.2.0.dist-info/RECORD

top_level.txt

duplicate_url_discarder