extractnet

View on PyPIReverse Dependencies (0)

2.0.7 extractnet-2.0.7-cp39-cp39-manylinux_2_24_x86_64.whl
extractnet-2.0.7-cp39-cp39-macosx_12_0_x86_64.whl
extractnet-2.0.7-cp38-cp38-macosx_10_16_x86_64.whl
extractnet-2.0.7-cp38-cp38-macosx_10_15_x86_64.whl
extractnet-2.0.7-cp38-cp38-manylinux_2_24_x86_64.whl
extractnet-2.0.7-cp37-cp37m-macosx_10_16_x86_64.whl
extractnet-2.0.7-cp37-cp37m-macosx_10_15_x86_64.whl
extractnet-2.0.7-cp37-cp37m-manylinux_2_24_x86_64.whl
extractnet-2.0.7-cp36-cp36m-macosx_10_16_x86_64.whl
extractnet-2.0.7-cp36-cp36m-macosx_10_15_x86_64.whl
extractnet-2.0.7-cp36-cp36m-manylinux_2_24_x86_64.whl
extractnet-2.0.7-cp310-cp310-macosx_10_15_x86_64.whl
extractnet-2.0.7-cp310-cp310-manylinux_2_24_x86_64.whl
extractnet-2.0.7-cp310-cp310-macosx_11_0_x86_64.whl

Wheel Details

Project: extractnet
Version: 2.0.7
Filename: extractnet-2.0.7-cp36-cp36m-manylinux_2_24_x86_64.whl
Download: [link]
Size: 3250646
MD5: 2f2d4356ad390915a58fd75be9a030e7
SHA256: bee36ccf1db343a068c1598380514e6f59e3c73df2b144f4502b59dcf803e209
Uploaded: 2022-11-06 07:33:36 +0000

dist-info

METADATA

Metadata-Version: 2.1
Name: extractnet
Version: 2.0.7
Summary: Extract the main article content (and optionally comments) from a web page
Author: Peter
Author-Email: sales[at]currentsapi.services
Home-Page: https://github.com/currentsapi/extractnet
License: MIT
Keywords: automatic content extraction,web page dechroming,HTML parsing
Classifier: License :: OSI Approved :: MIT License
Classifier: Development Status :: 5 - Production/Stable
Classifier: Environment :: Web Environment
Classifier: Intended Audience :: Developers
Classifier: Intended Audience :: Science/Research
Classifier: Topic :: Internet :: WWW/HTTP
Classifier: Topic :: Scientific/Engineering
Classifier: Topic :: Scientific/Engineering :: Artificial Intelligence
Classifier: Intended Audience :: Science/Research
Classifier: Programming Language :: Python :: 3
Classifier: Programming Language :: Python :: 3.5
Classifier: Programming Language :: Python :: 3.6
Classifier: Programming Language :: Python :: 3.7
Classifier: Programming Language :: Python :: 3.8
Platform: Posix; MacOS X
Requires-Dist: cchardet (>=2.1.7)
Requires-Dist: beautifulsoup4 (==4.9.3)
Requires-Dist: ftfy (<5.0.0,>=4.1.0)
Requires-Dist: numpy (>=1.19.0)
Requires-Dist: onnxruntime (>=1.9.0)
Requires-Dist: scikit-learn (>=0.22.0)
Requires-Dist: tld (==0.12.6)
Requires-Dist: scipy (>=0.17.0)
Requires-Dist: sklearn-crfsuite (==0.3.6)
Requires-Dist: dateparser (==1.1.0)
Requires-Dist: joblib (>=1.1.0)
Requires-Dist: htmldate (==0.7.2)
Description-Content-Type: text/markdown
License-File: LICENSE
[Description omitted; length: 7419 characters]

WHEEL

Wheel-Version: 1.0
Generator: bdist_wheel (0.37.1)
Root-Is-Purelib: false
Tag: cp36-cp36m-manylinux_2_24_x86_64

RECORD

Path Digest Size
extractnet-2.0.7.dist-info/LICENSE sha256=M6siLoHxvv-aroiQ3Q1fPHwsvOsOuhWXDOZXY61awqg 1096
extractnet-2.0.7.dist-info/top_level.txt sha256=HHqETiv5qJsWT5GblB-L9YgxYVIP8KfJqw3Ub9zk6mA 11
extractnet-2.0.7.dist-info/WHEEL sha256=D0CgkkX3hVRnO-RroZTlystWrS9-Ls8ePwOaRSwvNN0 113
extractnet-2.0.7.dist-info/RECORD
extractnet-2.0.7.dist-info/METADATA sha256=9pMqV2RwdCFzcWq3JW_OwTDO1j5l4Uch1PcYqMsZXXA 8999
extractnet/name_crf.py sha256=KuHAZ8xWf6vDAK7xigUn4vuIfyriaAZ8kM51YcxXvJ0 1118
extractnet/blocks.pyx sha256=LEHV7OkZA0i6lGm1YfeNo48kRIXhBqJtE-BHb436Sq8 31827
extractnet/lcs.cpython-36m-x86_64-linux-gnu.so sha256=HCoPBeCXV7zyPCLFlTxnllbP08wr-u3qvYdbYAN0ULQ 461488
extractnet/util.py sha256=6jcanJrOnnBozHBcn7tMKVqqp4Wqzu1biQ8KgDd8WRw 7663
extractnet/compat.py sha256=oLr0zmtpRmqkNtDvkm1ko2P55lpXTjNVwLjvinskR3c 9314
extractnet/blocks.cpp sha256=vBiTLPt2kvgdisHEB9zgmxV-GPAOuNUS_c5G0Vo-I2g 750402
extractnet/nn_models.py sha256=B1nTE6GnwlrlcfmOajNxLPZ8r1Qg7EyLgZj23ACW6fU 3720
extractnet/blocks.cpython-36m-x86_64-linux-gnu.so sha256=IFAQUrpxp73b0PiPhbhFmUsW4nbwSlWDPSJF-paRKjI 3778512
extractnet/lcs.pyx sha256=6XR7-efisJ9x8TH9PRJm3PTvmwLXgxM3T6LNKvFTD8c 2981
extractnet/pipeline.py sha256=whSDnoL2Pf-JjE08tc74vGX0td3y_F6DEa3IqTeCuRE 5250
extractnet/lcs.cpp sha256=qmfcILzHzugEDooPr-mNYKheK9F8B_MYjPg6rSDWXEQ 329521
extractnet/__init__.py sha256=YfnCSI36NICHeGeeIhntKfKJc00YvEoa2RbA4DqXqgw 309
extractnet/sequence_tagger/__init__.py sha256=47DEQpj8HBSa-_TImW-5JCeuQeRkm5NMpJWZG3hSuFU 0
extractnet/sequence_tagger/models.py sha256=O9g7q8-moz80RHEVTZ6NqlFhjswLh83v2JJWUORmQ5g 3458
extractnet/metadata_extraction/metaxpaths.py sha256=sWVcYqKC7PhMN8JXXvJUxD-142WK6hgT13PajwmseZg 2423
extractnet/metadata_extraction/json_ld.py sha256=wVBnMSKmKmC6RaIJe2ctOOT2lV1iFs1Zi2pTmKtJ2Pc 7888
extractnet/metadata_extraction/url_utils.py sha256=Zmfz-N_wSHGcqueORlXL8sgjJiKVvXDGMrDk7E-eGgA 4573
extractnet/metadata_extraction/utils.py sha256=QaCK7Stp9AK8Y5VBrjKjKJp1OXnCeR3pdyk3gDYYZlg 10611
extractnet/metadata_extraction/metadata.py sha256=09rc-N-qhNYQQwDlL2H-qGsBN0AZk5XnBXfBNsATmRY 19030
extractnet/metadata_extraction/constant.py sha256=UmjywL-APZ2c47IOOvfUjP--j-n0xRCm_ZNlyLue4P4 6449
extractnet/metadata_extraction/video.py sha256=tYOx-hwLlDu2Kgnx6ckUmXccQIxb_i9_PvIqnc9nG4g 8357
extractnet/models/char_embedding.joblib sha256=CnysOMu_jbHVOwhqDtjhju97uF99ZCK48pL7LeY0NLk 430383
extractnet/models/crf.joblib sha256=RyNsQM457EEurWo27Uic-U2yRmaANu3ws-lw_UQFTP0 2609121
extractnet/models/news_net.onnx sha256=aDCH1Hz1k8ZpQ3Zod12QFlBtI4p9m3Rm172_fZ7YLps 27248
extractnet/features/readability.py sha256=uSFV7KLDp1N2KgjGMsFW3jj93oDj1mey9KEM3wcxocw 1098
extractnet/features/_weninger.pyx sha256=bUmlD3Xuuep5gApZjW8bbQKs7z8u5vz8Hsv3ITf8-U0 2548
extractnet/features/_readability.cpython-36m-x86_64-linux-gnu.so sha256=xVHAh0NlYKWSwyBk8LrJ4kGmGR-uAG38plvWKNL7ap0 888048
extractnet/features/author.py sha256=381F9tl2vOTs771cSp24ce9HAMxxMpCoxad3ISbrxKY 4966
extractnet/features/_weninger.cpp sha256=_RwDUY_OEQC-_KHeqS_XbMwJ0FXZdvE9HtHoBSMYDSA 295890
extractnet/features/weninger.py sha256=Ef0BNZYwNIyYoi7Z6o0d3Sr0-S1zPpLlcN4CGlTLQGM 4158
extractnet/features/kohlschuetter.py sha256=17xr2B_fP9jX-uqCQtQxjDfAcUCnF4ZbGjjz-9NyIjo 1435
extractnet/features/_kohlschuetter.cpython-36m-x86_64-linux-gnu.so sha256=W4q90YqIHl2ht_Ors9jvy8-Ym1p6HYQQ8QhdqZ09dEs 225272
extractnet/features/_readability.pyx sha256=0v3kiaOvg__bb_DlC9SJuU4rbde10wAmGfCAbVRo-j8 1986
extractnet/features/css.py sha256=0mbIgwGOeJjJonQDG6O2mJSBLNbBN7bdIHMMLrKE0w4 4569
extractnet/features/_kohlschuetter.cpp sha256=r-HOY40PQlSFhMhJ8qzbDiKJC1JF_9sODkByAYmXSXQ 259378
extractnet/features/standardized.py sha256=fyVjPQ9JaTIikNRAUdPM9pcEXs1abFMbs0-VqOiE0rI 2492
extractnet/features/_readability.cc sha256=vdxhULPGqnSaheRGjEHBIkVF9ym4KHWpqJ8RBWs16sA 4492
extractnet/features/_readability.cpp sha256=3d2iAgtcAY_-nLXO-KVdPp3sXrcVEfI1uLPuTvkGaAc 290067
extractnet/features/_weninger.cpython-36m-x86_64-linux-gnu.so sha256=rDgOmWEqTAtb6AxO7WnSm6dxJvs3vpHKWBCvKDe4e8k 305576
extractnet/features/_kohlschuetter.pyx sha256=g5JvRMdChRwvAH5TOWHUnIsj6REAVzGlV6BBtMMS05Y 1162
extractnet/features/__init__.py sha256=3u61HmFmBL2xVLp2h7CC4WN0Z0wbicTIgYtHWygYUYs 840

top_level.txt

extractnet