webstruct

View on PyPIReverse Dependencies (1)

0.6 webstruct-0.6-py2.py3-none-any.whl

Wheel Details

Project: webstruct
Version: 0.6
Filename: webstruct-0.6-py2.py3-none-any.whl
Download: [link]
Size: 63046
MD5: 46f4f7b5da5d9129848b70d941da403c
SHA256: b482e789bb39291e62b573c9a089ce06a2510f4f967695b5824252010bf4c332
Uploaded: 2017-12-29 17:39:54 +0000

dist-info

METADATA

Metadata-Version: 2.0
Name: webstruct
Version: 0.6
Summary: A library for creating statistical NER systems that work on HTML data
Author: Mikhail Korobov, Terry Peng
Author-Email: kmike84[at]gmail.com, pengtaoo[at]gmail.com
Home-Page: https://github.com/scrapinghub/webstruct
License: MIT
Classifier: Development Status :: 3 - Alpha
Classifier: Intended Audience :: Developers
Classifier: Intended Audience :: Science/Research
Classifier: Topic :: Software Development :: Libraries :: Python Modules
Classifier: Topic :: Scientific/Engineering :: Information Analysis
Classifier: Topic :: Text Processing :: Linguistic
Classifier: License :: OSI Approved :: MIT License
Classifier: Programming Language :: Python :: 2
Classifier: Programming Language :: Python :: 2.7
Classifier: Programming Language :: Python :: 3
Classifier: Programming Language :: Python :: 3.4
Classifier: Programming Language :: Python :: 3.5
Classifier: Programming Language :: Python :: 3.6
Requires-Dist: lxml
Requires-Dist: requests
Requires-Dist: scikit-learn
Requires-Dist: six
Requires-Dist: tldextract
[Description omitted; length: 1467 characters]

WHEEL

Wheel-Version: 1.0
Generator: bdist_wheel (0.29.0)
Root-Is-Purelib: true
Tag: py2-none-any
Tag: py3-none-any

RECORD

Path Digest Size
webstruct/__init__.py sha256=ImhhYEEnMF8hQKo6MYJ7-uIS5kX5CIhGYn5pTEuO03M 463
webstruct/_fileresource.py sha256=xeEZXq_X4hppmTwq5rck06S58Hcwbu1d4e-UVcRchYQ 2119
webstruct/annotation_converter.py sha256=t4OYuC1HPkBUXuIUsZMXr4Qy6QDwjs9UxywchKOBFfc 2228
webstruct/annotation_verifier.py sha256=65-0VHP3TaASsjP-Y8nKaNSYMAq0sM3T8K6vMDT8fXU 4987
webstruct/base.py sha256=SrwWSGDLNKwavHaH6e0P46Cj-X2i2xmqtjgsEADxuyU 605
webstruct/crfsuite.py sha256=M8Nvl3HEnLJ7Y0UoPFbVSwYIZvLMdfgSvRb8YH86r2E 3185
webstruct/feature_extraction.py sha256=PO4hOir9_M_uVZTWVwhFTCRlakAbFJBEB5mP60tvHs4 7322
webstruct/grouping.py sha256=wwUutdHYWm2YRjiUcNKBH5X_r0HIV0GNdmcvrQC8KTQ 6385
webstruct/html_tokenizer.py sha256=mHEdwtOwgbFqfQHuKPickeTO0q1-oGLGd41jhwqHFAA 12518
webstruct/html_tokenizer_benchmark.py sha256=aX6rWGCpUGO0CmVeytpuSsu8IJ7Vx1Bp6kMoK1QQJyM 997
webstruct/infer_domain.py sha256=5hA6xnDYaEQ8JlBEXOm-yHI8gmL12kLaXWNDLAriVPI 2426
webstruct/loaders.py sha256=2K60eMDO7b3c5bwTEmiMp3l-dZNeYRZs1cx_oMlF_CY 6964
webstruct/metrics.py sha256=EZE4XsdVVqAm5HWa4yzWjKycpkI02to5uIsBv7D8cGo 3752
webstruct/model.py sha256=Xm00_ZKIFNzPTlGTlnTnKO_ur2g9J65eLu8eNEueKko 5612
webstruct/model_benchmark.py sha256=Xx0EkzimYl2TWKHv_D45EUeCZbsh-drgayAf00RaVts 750
webstruct/sequence_encoding.py sha256=Dlaa0IEnpbbCJpVuWeUqmLzRKpxo1VY2q5fji4uHC5s 6469
webstruct/text_tokenizers.py sha256=gFwZdkIG9-YyY75pxk8A0gh-3y_qAsBjyS4F9Pov5Mc 9381
webstruct/utils.py sha256=l-aCw_XwC3FpGIp0RtSHZ8c91hTdxtY5OSThlMIdX3Y 10047
webstruct/wapiti.py sha256=dI-X26ujR-VdmBObZLydiY6IFl7gyLgDB-4lCv7Qg_8 16322
webstruct/webannotator.py sha256=Z-nngXm-vA-Mpjvl-obCSPs4qEjnVV1gNYdf-wqeK-s 13353
webstruct/features/__init__.py sha256=hKrMUKtIiq9hqapnqznzwnxMtkrYIVZDrV4tQqb1Nuw 850
webstruct/features/block_features.py sha256=UGRQufpU3-itJPmAPlBzFFNNsnWvNY64bocTzimXwXY 1450
webstruct/features/data_features.py sha256=ZHJce_bsQJ7AkpIyXhycq9nbSqn0DExffQMW5vNagZY 2633
webstruct/features/datetime_format.py sha256=R6-C1eueKvD43eddonOhfmv_ZorVcO69Ncagu4Mb9bU 1484
webstruct/features/global_features.py sha256=qiUQ3QT6Mjyv8FxGTmj98hTpdyVG6GJFxXSTwDaEy1c 3502
webstruct/features/token_features.py sha256=r_f_nHGLrZCxnExPNSIo-1nkbN5YUBXitg4tBr3puVI 3162
webstruct/gazetteers/__init__.py sha256=K2bZsZ8s-lFjvrXSG8U6Lk9zL1o-QDjiVAbYk27f-YE 63
webstruct/gazetteers/features.py sha256=Ahiemddc7hGe-kN-1BJOik5eINZuUYC5QIu8xXsUGmk 840
webstruct/gazetteers/geonames.py sha256=zeek9paHQTi0N8N5g9tF-PbNQBN33EJKXJNtuUB-D1s 4126
webstruct/tests/__init__.py sha256=K2bZsZ8s-lFjvrXSG8U6Lk9zL1o-QDjiVAbYk27f-YE 63
webstruct/tests/test_crfsuite.py sha256=3tNbkQvFrDLEAiBsS8ElgIam_sU1U-oeLq9knP5Q8Ls 5668
webstruct/tests/test_html_tokenizer.py sha256=KK5SI7bxtC1XKP2Tcb1ZR5u1O0id__HGY4ukIe5GhRY 5837
webstruct/tests/test_html_tools.py sha256=sp_VCmlMYAJDzRLO-cQw6sBbby_XxNW6p6OMkwyTx5k 1263
webstruct/tests/test_infer_domain.py sha256=vsctkdcA7qiU4JDlNMwdFZcrC5baVL7RrsPZ2PimqWI 2073
webstruct/tests/test_loaders.py sha256=wATnhagvMlTedXBBTpNiVUpL8gMTD3YSB93Dlu11Zho 6853
webstruct/tests/test_pattern_features.py sha256=RrMyvaJQnXBNeMUPF98Yz4tJwT6nXm6Mnd7-nfktu6g 1626
webstruct/tests/test_text_tokenizer.py sha256=aGhZGiSF0HrWxjJZrx5jeCclSuj-2IXkTrWaBessF5U 1850
webstruct/tests/test_utils.py sha256=gMAk06PX65kJ9wUzCEIfX8aduBJvLomBjLlIf5zLoBo 287
webstruct/tests/test_webannotator.py sha256=xXbjmHarvP3mEJtW9Aq37i8EIdewNUm72w1winmGJT4 14003
webstruct/tests/utils.py sha256=PLLzWnSmESTGfhU34CylRe2um0kxwVD6ThRu2wjk_bc 1453
webstruct-0.6.dist-info/DESCRIPTION.rst sha256=DIltgQjcvse-YfSXawPrvfRfZABF8Mmo0od4tHom2LU 1467
webstruct-0.6.dist-info/METADATA sha256=N3L7yxN7sV0Eu4D27okxLOZt2gs-n0nHct3hNX713BE 2597
webstruct-0.6.dist-info/RECORD
webstruct-0.6.dist-info/WHEEL sha256=o2k-Qa-RMNIJmUdIc7KU6VWR_ErNRbWNlxDIpl7lm34 110
webstruct-0.6.dist-info/metadata.json sha256=qQ6DxUpjKQQslKryMsgUuukBCfGot7KN0yjEByfnqpk 1209
webstruct-0.6.dist-info/top_level.txt sha256=tqDruP8qDvp6CpmpGQpPKXumkybTkPb4hBSy7au4NJ0 10

top_level.txt

webstruct