extractnet

View on PyPIReverse Dependencies (0)

2.0.7 extractnet-2.0.7-cp39-cp39-manylinux_2_24_x86_64.whl
extractnet-2.0.7-cp39-cp39-macosx_12_0_x86_64.whl
extractnet-2.0.7-cp38-cp38-macosx_10_16_x86_64.whl
extractnet-2.0.7-cp38-cp38-macosx_10_15_x86_64.whl
extractnet-2.0.7-cp38-cp38-manylinux_2_24_x86_64.whl
extractnet-2.0.7-cp37-cp37m-macosx_10_16_x86_64.whl
extractnet-2.0.7-cp37-cp37m-macosx_10_15_x86_64.whl
extractnet-2.0.7-cp37-cp37m-manylinux_2_24_x86_64.whl
extractnet-2.0.7-cp36-cp36m-macosx_10_16_x86_64.whl
extractnet-2.0.7-cp36-cp36m-macosx_10_15_x86_64.whl
extractnet-2.0.7-cp36-cp36m-manylinux_2_24_x86_64.whl
extractnet-2.0.7-cp310-cp310-macosx_10_15_x86_64.whl
extractnet-2.0.7-cp310-cp310-manylinux_2_24_x86_64.whl
extractnet-2.0.7-cp310-cp310-macosx_11_0_x86_64.whl

Wheel Details

Project: extractnet
Version: 2.0.7
Filename: extractnet-2.0.7-cp38-cp38-manylinux_2_24_x86_64.whl
Download: [link]
Size: 3324174
MD5: f18a202858433dfa0a56077836924146
SHA256: 48ac8d265cf1766e6a861d66073783729809afeea0b5db0f7dbbcc907299f8ea
Uploaded: 2022-11-06 07:33:58 +0000

dist-info

METADATA

Metadata-Version: 2.1
Name: extractnet
Version: 2.0.7
Summary: Extract the main article content (and optionally comments) from a web page
Author: Peter
Author-Email: sales[at]currentsapi.services
Home-Page: https://github.com/currentsapi/extractnet
License: MIT
Keywords: automatic content extraction,web page dechroming,HTML parsing
Classifier: License :: OSI Approved :: MIT License
Classifier: Development Status :: 5 - Production/Stable
Classifier: Environment :: Web Environment
Classifier: Intended Audience :: Developers
Classifier: Intended Audience :: Science/Research
Classifier: Topic :: Internet :: WWW/HTTP
Classifier: Topic :: Scientific/Engineering
Classifier: Topic :: Scientific/Engineering :: Artificial Intelligence
Classifier: Intended Audience :: Science/Research
Classifier: Programming Language :: Python :: 3
Classifier: Programming Language :: Python :: 3.5
Classifier: Programming Language :: Python :: 3.6
Classifier: Programming Language :: Python :: 3.7
Classifier: Programming Language :: Python :: 3.8
Platform: Posix; MacOS X
Requires-Dist: cchardet (>=2.1.7)
Requires-Dist: beautifulsoup4 (==4.9.3)
Requires-Dist: ftfy (<5.0.0,>=4.1.0)
Requires-Dist: numpy (>=1.19.0)
Requires-Dist: onnxruntime (>=1.9.0)
Requires-Dist: scikit-learn (>=0.22.0)
Requires-Dist: tld (==0.12.6)
Requires-Dist: scipy (>=0.17.0)
Requires-Dist: sklearn-crfsuite (==0.3.6)
Requires-Dist: dateparser (==1.1.0)
Requires-Dist: joblib (>=1.1.0)
Requires-Dist: htmldate (==0.7.2)
Description-Content-Type: text/markdown
License-File: LICENSE
[Description omitted; length: 7417 characters]

WHEEL

Wheel-Version: 1.0
Generator: bdist_wheel (0.37.1)
Root-Is-Purelib: false
Tag: cp38-cp38-manylinux_2_24_x86_64

RECORD

Path Digest Size
extractnet-2.0.7.dist-info/LICENSE sha256=M6siLoHxvv-aroiQ3Q1fPHwsvOsOuhWXDOZXY61awqg 1096
extractnet-2.0.7.dist-info/top_level.txt sha256=HHqETiv5qJsWT5GblB-L9YgxYVIP8KfJqw3Ub9zk6mA 11
extractnet-2.0.7.dist-info/WHEEL sha256=Z_Rb1HK01l8x9Vg4167UbAebNFSDPepf_z300ONs1JA 112
extractnet-2.0.7.dist-info/RECORD
extractnet-2.0.7.dist-info/METADATA sha256=atpqlskbbzAl9GwxYUpZQc-2CJh5xJthmCoQkTWX-dM 8997
extractnet/name_crf.py sha256=KuHAZ8xWf6vDAK7xigUn4vuIfyriaAZ8kM51YcxXvJ0 1118
extractnet/blocks.pyx sha256=LEHV7OkZA0i6lGm1YfeNo48kRIXhBqJtE-BHb436Sq8 31827
extractnet/blocks.cpython-38-x86_64-linux-gnu.so sha256=DX0FXyLqOWYyRjGHTJmTmteAaqxB1r6C09eKHE-U4XU 4389720
extractnet/util.py sha256=6jcanJrOnnBozHBcn7tMKVqqp4Wqzu1biQ8KgDd8WRw 7663
extractnet/compat.py sha256=oLr0zmtpRmqkNtDvkm1ko2P55lpXTjNVwLjvinskR3c 9314
extractnet/blocks.cpp sha256=2yAxCx50BzEn--iFZAUr591JKrLdsbCAcev-m_80Vd8 750379
extractnet/nn_models.py sha256=B1nTE6GnwlrlcfmOajNxLPZ8r1Qg7EyLgZj23ACW6fU 3720
extractnet/lcs.cpython-38-x86_64-linux-gnu.so sha256=qwvh3XB67tFDGwbWPuJrmW7I1-NAzTIBqkYGsweZ0rI 552600
extractnet/lcs.pyx sha256=6XR7-efisJ9x8TH9PRJm3PTvmwLXgxM3T6LNKvFTD8c 2981
extractnet/pipeline.py sha256=whSDnoL2Pf-JjE08tc74vGX0td3y_F6DEa3IqTeCuRE 5250
extractnet/lcs.cpp sha256=-0GsmOyjrEfHPorglvSlm_mC7Ypq6FZZQj_uOPPS4Sg 337924
extractnet/__init__.py sha256=YfnCSI36NICHeGeeIhntKfKJc00YvEoa2RbA4DqXqgw 309
extractnet/sequence_tagger/__init__.py sha256=47DEQpj8HBSa-_TImW-5JCeuQeRkm5NMpJWZG3hSuFU 0
extractnet/sequence_tagger/models.py sha256=O9g7q8-moz80RHEVTZ6NqlFhjswLh83v2JJWUORmQ5g 3458
extractnet/metadata_extraction/metaxpaths.py sha256=sWVcYqKC7PhMN8JXXvJUxD-142WK6hgT13PajwmseZg 2423
extractnet/metadata_extraction/json_ld.py sha256=wVBnMSKmKmC6RaIJe2ctOOT2lV1iFs1Zi2pTmKtJ2Pc 7888
extractnet/metadata_extraction/url_utils.py sha256=Zmfz-N_wSHGcqueORlXL8sgjJiKVvXDGMrDk7E-eGgA 4573
extractnet/metadata_extraction/utils.py sha256=QaCK7Stp9AK8Y5VBrjKjKJp1OXnCeR3pdyk3gDYYZlg 10611
extractnet/metadata_extraction/metadata.py sha256=09rc-N-qhNYQQwDlL2H-qGsBN0AZk5XnBXfBNsATmRY 19030
extractnet/metadata_extraction/constant.py sha256=UmjywL-APZ2c47IOOvfUjP--j-n0xRCm_ZNlyLue4P4 6449
extractnet/metadata_extraction/video.py sha256=tYOx-hwLlDu2Kgnx6ckUmXccQIxb_i9_PvIqnc9nG4g 8357
extractnet/models/char_embedding.joblib sha256=CnysOMu_jbHVOwhqDtjhju97uF99ZCK48pL7LeY0NLk 430383
extractnet/models/crf.joblib sha256=RyNsQM457EEurWo27Uic-U2yRmaANu3ws-lw_UQFTP0 2609121
extractnet/models/news_net.onnx sha256=aDCH1Hz1k8ZpQ3Zod12QFlBtI4p9m3Rm172_fZ7YLps 27248
extractnet/features/readability.py sha256=uSFV7KLDp1N2KgjGMsFW3jj93oDj1mey9KEM3wcxocw 1098
extractnet/features/_readability.cpython-38-x86_64-linux-gnu.so sha256=yEIZll1Yxjeg8jVVKr8CeOC9bVDtgt1PmReE1RnpNsw 986752
extractnet/features/_weninger.pyx sha256=bUmlD3Xuuep5gApZjW8bbQKs7z8u5vz8Hsv3ITf8-U0 2548
extractnet/features/author.py sha256=381F9tl2vOTs771cSp24ce9HAMxxMpCoxad3ISbrxKY 4966
extractnet/features/_weninger.cpp sha256=zdZy8-XuseXNrmXh3KlVpsfFwmuA_jlyD3yg4P89A0I 304302
extractnet/features/_weninger.cpython-38-x86_64-linux-gnu.so sha256=yF0keJ1hB2cgMhHM3R33r54H6GOhC8c1jYcOag7fqbA 394816
extractnet/features/weninger.py sha256=Ef0BNZYwNIyYoi7Z6o0d3Sr0-S1zPpLlcN4CGlTLQGM 4158
extractnet/features/kohlschuetter.py sha256=17xr2B_fP9jX-uqCQtQxjDfAcUCnF4ZbGjjz-9NyIjo 1435
extractnet/features/_readability.pyx sha256=0v3kiaOvg__bb_DlC9SJuU4rbde10wAmGfCAbVRo-j8 1986
extractnet/features/css.py sha256=0mbIgwGOeJjJonQDG6O2mJSBLNbBN7bdIHMMLrKE0w4 4569
extractnet/features/_kohlschuetter.cpp sha256=2MBFC2LVAZito4N1hbbXParVLG06D8TUpUdlgeqCJOw 267790
extractnet/features/standardized.py sha256=fyVjPQ9JaTIikNRAUdPM9pcEXs1abFMbs0-VqOiE0rI 2492
extractnet/features/_readability.cc sha256=vdxhULPGqnSaheRGjEHBIkVF9ym4KHWpqJ8RBWs16sA 4492
extractnet/features/_readability.cpp sha256=coo2WIg-8G4xqpXzqgj-amlJHr-1oN2cztLTJu4w6u4 298470
extractnet/features/_kohlschuetter.cpython-38-x86_64-linux-gnu.so sha256=_shuwCWZgG1FBZE6F7MrKoNU0A_NLPMwTR-Rr8tCa0U 284800
extractnet/features/_kohlschuetter.pyx sha256=g5JvRMdChRwvAH5TOWHUnIsj6REAVzGlV6BBtMMS05Y 1162
extractnet/features/__init__.py sha256=3u61HmFmBL2xVLp2h7CC4WN0Z0wbicTIgYtHWygYUYs 840

top_level.txt

extractnet