extractnet

2.0.7

extractnet-2.0.7-cp39-cp39-manylinux_2_24_x86_64.whl
extractnet-2.0.7-cp39-cp39-macosx_12_0_x86_64.whl
extractnet-2.0.7-cp38-cp38-macosx_10_16_x86_64.whl
extractnet-2.0.7-cp38-cp38-macosx_10_15_x86_64.whl
extractnet-2.0.7-cp38-cp38-manylinux_2_24_x86_64.whl
extractnet-2.0.7-cp37-cp37m-macosx_10_16_x86_64.whl
extractnet-2.0.7-cp37-cp37m-macosx_10_15_x86_64.whl
extractnet-2.0.7-cp37-cp37m-manylinux_2_24_x86_64.whl
extractnet-2.0.7-cp36-cp36m-macosx_10_16_x86_64.whl
extractnet-2.0.7-cp36-cp36m-macosx_10_15_x86_64.whl
extractnet-2.0.7-cp36-cp36m-manylinux_2_24_x86_64.whl
extractnet-2.0.7-cp310-cp310-macosx_10_15_x86_64.whl
extractnet-2.0.7-cp310-cp310-manylinux_2_24_x86_64.whl
extractnet-2.0.7-cp310-cp310-macosx_11_0_x86_64.whl

Wheel Details

Project: extractnet
Version: 2.0.7
Filename: extractnet-2.0.7-cp39-cp39-macosx_12_0_x86_64.whl
Size: 1765452
MD5: ace35914648b367f800f530c1fa1e530
SHA256: 5d2aa539ab2a5fa8c85bdc52745b49736aafd7ac97912569aefbd3625db92789
Uploaded: 2022-11-06 07:34:02 +0000
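
The SHA256 value listed above can be used to check a downloaded copy of the wheel. The following is a rough sketch using only the standard library; `sha256_of_file` and `file_matches_sha256` are hypothetical helper names, and the local path in the usage comment is an assumption about where the file was saved.

```python
import hashlib


def sha256_of_file(path: str, chunk_size: int = 1 << 20) -> str:
    """Return the hex SHA-256 digest of a file, read in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as fh:
        # Read fixed-size chunks so large wheels are not loaded into memory at once.
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def file_matches_sha256(path: str, expected_hex: str) -> bool:
    """Compare a file's SHA-256 digest with an expected hex string."""
    return sha256_of_file(path) == expected_hex.strip().lower()


# Usage (assumes the wheel was saved in the current directory):
# file_matches_sha256(
#     "extractnet-2.0.7-cp39-cp39-macosx_12_0_x86_64.whl",
#     "5d2aa539ab2a5fa8c85bdc52745b49736aafd7ac97912569aefbd3625db92789",
# )
```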

dist-info

METADATA

Metadata-Version: 2.1
Name: extractnet
Version: 2.0.7
Summary: Extract the main article content (and optionally comments) from a web page
Author: Peter
Author-Email: sales[at]currentsapi.services
Home-Page: https://github.com/currentsapi/extractnet
License: MIT
Keywords: automatic content extraction,web page dechroming,HTML parsing
Classifier: License :: OSI Approved :: MIT License
Classifier: Development Status :: 5 - Production/Stable
Classifier: Environment :: Web Environment
Classifier: Intended Audience :: Developers
Classifier: Intended Audience :: Science/Research
Classifier: Topic :: Internet :: WWW/HTTP
Classifier: Topic :: Scientific/Engineering
Classifier: Topic :: Scientific/Engineering :: Artificial Intelligence
Classifier: Intended Audience :: Science/Research
Classifier: Programming Language :: Python :: 3
Classifier: Programming Language :: Python :: 3.5
Classifier: Programming Language :: Python :: 3.6
Classifier: Programming Language :: Python :: 3.7
Classifier: Programming Language :: Python :: 3.8
Platform: Posix; MacOS X
Requires-Dist: cchardet (>=2.1.7)
Requires-Dist: beautifulsoup4 (==4.9.3)
Requires-Dist: ftfy (<5.0.0,>=4.1.0)
Requires-Dist: numpy (>=1.19.0)
Requires-Dist: onnxruntime (>=1.9.0)
Requires-Dist: scikit-learn (>=0.22.0)
Requires-Dist: tld (==0.12.6)
Requires-Dist: scipy (>=0.17.0)
Requires-Dist: sklearn-crfsuite (==0.3.6)
Requires-Dist: dateparser (==1.1.0)
Requires-Dist: joblib (>=1.1.0)
Requires-Dist: htmldate (==0.7.2)
Description-Content-Type: text/markdown
License-File: LICENSE
[Description omitted; length: 7417 characters]
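
The METADATA file uses an email-style header format, so the fields above can be read with the standard library's `email` package. The sketch below parses a shortened copy of the headers listed above; `METADATA_SNIPPET` and `read_requirements` are illustrative names, and only three of the `Requires-Dist` lines are included to keep it brief.

```python
from email.parser import HeaderParser

# Shortened copy of the headers shown above (not the full METADATA file).
METADATA_SNIPPET = """\
Metadata-Version: 2.1
Name: extractnet
Version: 2.0.7
Requires-Dist: cchardet (>=2.1.7)
Requires-Dist: beautifulsoup4 (==4.9.3)
Requires-Dist: ftfy (<5.0.0,>=4.1.0)
"""


def read_requirements(metadata_text: str):
    """Return (name, version, list of Requires-Dist values)."""
    msg = HeaderParser().parsestr(metadata_text)
    # Repeated headers such as Requires-Dist are collected with get_all().
    return msg["Name"], msg["Version"], msg.get_all("Requires-Dist") or []


name, version, requirements = read_requirements(METADATA_SNIPPET)
print(name, version)
for requirement in requirements:
    print("  requires:", requirement)
```

For an installed distribution, `importlib.metadata` in the standard library exposes the same fields without parsing the file by hand.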

WHEEL

Wheel-Version: 1.0
Generator: bdist_wheel (0.38.2)
Root-Is-Purelib: false
Tag: cp39-cp39-macosx_12_0_x86_64

RECORD

Path Digest Size
extractnet/__init__.py sha256=YfnCSI36NICHeGeeIhntKfKJc00YvEoa2RbA4DqXqgw 309
extractnet/blocks.cpp sha256=73w6nSCyF1fK847TAMlztxPvGQTgrOC92gA4hr8fB_U 750751
extractnet/blocks.cpython-39-darwin.so sha256=ldAgzF4G7h8eOag1pkgSFVhH5l0hEExYLew0yuuE4X0 318720
extractnet/blocks.pyx sha256=LEHV7OkZA0i6lGm1YfeNo48kRIXhBqJtE-BHb436Sq8 31827
extractnet/compat.py sha256=oLr0zmtpRmqkNtDvkm1ko2P55lpXTjNVwLjvinskR3c 9314
extractnet/lcs.cpp sha256=qmenjcPNBpAyTEccRKBsb-ohqZiw_Y4LNBuK0xAwx38 336954
extractnet/lcs.cpython-39-darwin.so sha256=mSAmhSRk6ThzpDwrnhgl7WpBLarkeh9Ijf5CEfRVIhk 126512
extractnet/lcs.pyx sha256=6XR7-efisJ9x8TH9PRJm3PTvmwLXgxM3T6LNKvFTD8c 2981
extractnet/name_crf.py sha256=KuHAZ8xWf6vDAK7xigUn4vuIfyriaAZ8kM51YcxXvJ0 1118
extractnet/nn_models.py sha256=B1nTE6GnwlrlcfmOajNxLPZ8r1Qg7EyLgZj23ACW6fU 3720
extractnet/pipeline.py sha256=whSDnoL2Pf-JjE08tc74vGX0td3y_F6DEa3IqTeCuRE 5250
extractnet/util.py sha256=6jcanJrOnnBozHBcn7tMKVqqp4Wqzu1biQ8KgDd8WRw 7663
extractnet/features/__init__.py sha256=3u61HmFmBL2xVLp2h7CC4WN0Z0wbicTIgYtHWygYUYs 840
extractnet/features/_kohlschuetter.cpp sha256=Bx44wLEfcfQrlCYGp27dTpbJ3VrZaiDo4hKx8cOqUBc 266809
extractnet/features/_kohlschuetter.cpython-39-darwin.so sha256=Sop-N5HIPGC-aqx1DWKLcf0ryhCMb-_XUZduQiBNye0 84344
extractnet/features/_kohlschuetter.pyx sha256=g5JvRMdChRwvAH5TOWHUnIsj6REAVzGlV6BBtMMS05Y 1162
extractnet/features/_readability.cc sha256=vdxhULPGqnSaheRGjEHBIkVF9ym4KHWpqJ8RBWs16sA 4492
extractnet/features/_readability.cpp sha256=DXWE6-0JtDC0ySwsheUNNsEjJMQFgKr0F5YK5keCH28 297500
extractnet/features/_readability.cpython-39-darwin.so sha256=3-tAq70r48ogDOQ2BKgRh10ETvARGM50I9KXOH1z7A4 131136
extractnet/features/_readability.pyx sha256=0v3kiaOvg__bb_DlC9SJuU4rbde10wAmGfCAbVRo-j8 1986
extractnet/features/_weninger.cpp sha256=sLRgsruzMBvavunjBOQlf4OrmOtABborLrQeUpsie4M 303321
extractnet/features/_weninger.cpython-39-darwin.so sha256=vbd9EoGIZRQ2jWDtDr0sDQnqA1l572nKbXcNhM3qgs4 111160
extractnet/features/_weninger.pyx sha256=bUmlD3Xuuep5gApZjW8bbQKs7z8u5vz8Hsv3ITf8-U0 2548
extractnet/features/author.py sha256=381F9tl2vOTs771cSp24ce9HAMxxMpCoxad3ISbrxKY 4966
extractnet/features/css.py sha256=0mbIgwGOeJjJonQDG6O2mJSBLNbBN7bdIHMMLrKE0w4 4569
extractnet/features/kohlschuetter.py sha256=17xr2B_fP9jX-uqCQtQxjDfAcUCnF4ZbGjjz-9NyIjo 1435
extractnet/features/readability.py sha256=uSFV7KLDp1N2KgjGMsFW3jj93oDj1mey9KEM3wcxocw 1098
extractnet/features/standardized.py sha256=fyVjPQ9JaTIikNRAUdPM9pcEXs1abFMbs0-VqOiE0rI 2492
extractnet/features/weninger.py sha256=Ef0BNZYwNIyYoi7Z6o0d3Sr0-S1zPpLlcN4CGlTLQGM 4158
extractnet/metadata_extraction/constant.py sha256=UmjywL-APZ2c47IOOvfUjP--j-n0xRCm_ZNlyLue4P4 6449
extractnet/metadata_extraction/json_ld.py sha256=wVBnMSKmKmC6RaIJe2ctOOT2lV1iFs1Zi2pTmKtJ2Pc 7888
extractnet/metadata_extraction/metadata.py sha256=09rc-N-qhNYQQwDlL2H-qGsBN0AZk5XnBXfBNsATmRY 19030
extractnet/metadata_extraction/metaxpaths.py sha256=sWVcYqKC7PhMN8JXXvJUxD-142WK6hgT13PajwmseZg 2423
extractnet/metadata_extraction/url_utils.py sha256=Zmfz-N_wSHGcqueORlXL8sgjJiKVvXDGMrDk7E-eGgA 4573
extractnet/metadata_extraction/utils.py sha256=QaCK7Stp9AK8Y5VBrjKjKJp1OXnCeR3pdyk3gDYYZlg 10611
extractnet/metadata_extraction/video.py sha256=tYOx-hwLlDu2Kgnx6ckUmXccQIxb_i9_PvIqnc9nG4g 8357
extractnet/models/char_embedding.joblib sha256=CnysOMu_jbHVOwhqDtjhju97uF99ZCK48pL7LeY0NLk 430383
extractnet/models/crf.joblib sha256=RyNsQM457EEurWo27Uic-U2yRmaANu3ws-lw_UQFTP0 2609121
extractnet/models/news_net.onnx sha256=aDCH1Hz1k8ZpQ3Zod12QFlBtI4p9m3Rm172_fZ7YLps 27248
extractnet/sequence_tagger/__init__.py sha256=47DEQpj8HBSa-_TImW-5JCeuQeRkm5NMpJWZG3hSuFU 0
extractnet/sequence_tagger/models.py sha256=O9g7q8-moz80RHEVTZ6NqlFhjswLh83v2JJWUORmQ5g 3458
extractnet-2.0.7.dist-info/LICENSE sha256=M6siLoHxvv-aroiQ3Q1fPHwsvOsOuhWXDOZXY61awqg 1096
extractnet-2.0.7.dist-info/METADATA sha256=atpqlskbbzAl9GwxYUpZQc-2CJh5xJthmCoQkTWX-dM 8997
extractnet-2.0.7.dist-info/WHEEL sha256=d26xriT6m8vKBlJNNHRZzSQPjIZa8-qj4QMkhLauV3M 109
extractnet-2.0.7.dist-info/top_level.txt sha256=HHqETiv5qJsWT5GblB-L9YgxYVIP8KfJqw3Ub9zk6mA 11
extractnet-2.0.7.dist-info/RECORD
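
Each digest in the table above is the SHA-256 hash of the file, encoded as URL-safe base64 with the trailing `=` padding removed; the RECORD entry itself carries no digest. As a rough sketch, an entry can be checked against file contents as follows. `record_digest` and `check_entry` are hypothetical helper names, and the caller is assumed to have already read the file bytes from the unpacked wheel.

```python
import base64
import hashlib


def record_digest(data: bytes) -> str:
    """Return the RECORD-style digest (URL-safe base64, no padding) of data."""
    raw = hashlib.sha256(data).digest()
    return base64.urlsafe_b64encode(raw).rstrip(b"=").decode("ascii")


def check_entry(data: bytes, digest_field: str, size: int) -> bool:
    """Check file contents against a 'sha256=...' digest field and a size."""
    algorithm, _, expected = digest_field.partition("=")
    if algorithm != "sha256":
        raise ValueError(f"unsupported hash algorithm: {algorithm!r}")
    return len(data) == size and record_digest(data) == expected


# extractnet/sequence_tagger/__init__.py is listed with size 0, so its
# digest is simply the digest of empty content.
print(check_entry(b"", "sha256=47DEQpj8HBSa-_TImW-5JCeuQeRkm5NMpJWZG3hSuFU", 0))
```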

top_level.txt

extractnet