html2tei


1.2.3 html2tei-1.2.3-py3-none-any.whl

Wheel Details

Project: html2tei
Version: 1.2.3
Filename: html2tei-1.2.3-py3-none-any.whl
Size: 58166 bytes
MD5: 14ba4a9e1d70a87dbc1fd0665aa3deca
SHA256: 033c8014884252f1cd733ab079a4cc9cd0f99067374dbe4cc25aecd2f013234e
Uploaded: 2022-01-03 16:08:43 +0000
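
The size and SHA256 values above can be used to check a downloaded copy of the wheel. The following is a minimal sketch using only the standard library; the helper name `file_matches` and the local file path are assumptions made for illustration, not part of html2tei.

```python
import hashlib
from pathlib import Path

# Values copied from the Wheel Details listing above.
EXPECTED_SIZE = 58166
EXPECTED_SHA256 = "033c8014884252f1cd733ab079a4cc9cd0f99067374dbe4cc25aecd2f013234e"


def file_matches(path, expected_size, expected_sha256):
    """Return True if the file has the given byte size and SHA256 hex digest."""
    data = Path(path).read_bytes()
    if len(data) != expected_size:
        return False
    return hashlib.sha256(data).hexdigest() == expected_sha256


if __name__ == "__main__":
    # Hypothetical download location; adjust to wherever the wheel was saved.
    wheel = Path("html2tei-1.2.3-py3-none-any.whl")
    if wheel.exists():
        print(file_matches(wheel, EXPECTED_SIZE, EXPECTED_SHA256))
```

SHA256 is checked here rather than MD5 because MD5 is not collision resistant; the MD5 value is still useful for spotting accidental corruption.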

dist-info

METADATA

Metadata-Version: 2.1
Name: html2tei
Version: 1.2.3
Summary: Map the HTML schema of portals to valid TEI XML with the tags and structures used in them using small manual portal-specific configurations.
Author: dlazesz
Home-Page: https://github.com/ELTE-DH/HTML2TEI
Project-Url: Repository, https://github.com/ELTE-DH/HTML2TEI
License: LGPLv3
Classifier: Development Status :: 5 - Production/Stable
Classifier: License :: Other/Proprietary License
Classifier: Operating System :: OS Independent
Classifier: Programming Language :: Python :: 3
Classifier: Programming Language :: Python :: 3.10
Classifier: Programming Language :: Python :: 3.8
Classifier: Programming Language :: Python :: 3.9
Classifier: Topic :: Software Development :: Libraries :: Python Modules
Requires-Python: >=3.8,<4.0
Requires-Dist: beautifulsoup4 (<5.0.0,>=4.9.0)
Requires-Dist: justext (<4.0.0,>=3.0.0); extra == "justext" or extra == "full"
Requires-Dist: lxml (<5.0.0,>=4.5.0)
Requires-Dist: newspaper3k (<0.3.0,>=0.2.8); extra == "newspaper3k" or extra == "full"
Requires-Dist: pyyaml (<7.0.0,>=6.0.0)
Requires-Dist: warcio (<2.0.0,>=1.7.0)
Requires-Dist: webarticlecurator (<2.0.0,>=1.4.0)
Provides-Extra: full
Provides-Extra: justext
Provides-Extra: newspaper3k
Description-Content-Type: text/markdown
[Description omitted; length: 10546 characters]
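
Each Requires-Dist line above has the shape `name (version specifiers); marker`, where the marker part is optional and, in this wheel, only ever names extras. The sketch below splits such a line into its parts with a regular expression. It is a deliberate simplification for the lines shown here, not a full PEP 508 parser, and the function names are hypothetical; for real tooling the `packaging` library's `Requirement` class is the safer choice.

```python
import re

# Simplified pattern: name, optional "(specifiers)", optional "; marker".
REQ_RE = re.compile(r"^\s*([A-Za-z0-9._-]+)\s*(?:\(([^)]*)\))?\s*(?:;\s*(.*))?$")


def parse_requires_dist(value):
    """Split a Requires-Dist value into (name, specifiers, marker)."""
    match = REQ_RE.match(value)
    if match is None:
        raise ValueError(f"unrecognised Requires-Dist value: {value!r}")
    name, spec, marker = match.groups()
    return name, spec or "", marker or ""


def extras_in_marker(marker):
    """Return the extra names mentioned in a marker such as 'extra == "full"'."""
    return re.findall(r'extra\s*==\s*"([^"]+)"', marker)


if __name__ == "__main__":
    line = 'justext (<4.0.0,>=3.0.0); extra == "justext" or extra == "full"'
    name, spec, marker = parse_requires_dist(line)
    print(name, spec, extras_in_marker(marker))
```

A requirement with an empty marker is installed unconditionally; one whose marker names extras is installed only when one of those extras is requested, for example `html2tei[full]`.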

WHEEL

Wheel-Version: 1.0
Generator: poetry 1.0.7
Root-Is-Purelib: true
Tag: py3-none-any

RECORD

Path Digest Size
html2tei/__init__.py sha256=b7U8fsHRQ9My4kBjMkPdqTqpQUtljBCL8PRoM73clFc 1112
html2tei/__main__.py sha256=O4dp_lqw_2PV8xs_0NL7FNI_9XHG7J9w2Ei7Ss6jt-c 7487
html2tei/article_body_converters/__init__.py sha256=47DEQpj8HBSa-_TImW-5JCeuQeRkm5NMpJWZG3hSuFU 0
html2tei/article_body_converters/eltedh_abc.py sha256=7y3OOOhfUjF6KebS-ZQ6t-brsLDnJHL3iaHtBupP43Q 33327
html2tei/article_body_converters/justext_abc.py sha256=ImARl0-XCsthITBT25r-ke9lWSxldOMOu5BvdqNZQ1U 1310
html2tei/article_body_converters/newspaper_abc.py sha256=FlAiLagdiy06kiSs2xM23jTeMLkpKt62q1acY_PKKuE 1567
html2tei/basic_tag_dicts.py sha256=3mL9FTUBZt0UnQzk6m2XDQ6dF6BBRG_CF_8gm_ePmEU 6312
html2tei/digest.py sha256=vEXtiPi5SF34Hy0ZS9Ruc3-Rng1QC_CKfREIHTrBK2A 6241
html2tei/excluded_tags_collection.py sha256=laQn0oUS-kugwbnlwp5mt89kRy-DNuGZLlGuwZx5Vk8 5473
html2tei/html_content_tree.py sha256=g4M4_2gXCcD-xSHq1s8-sog57GJW8VMvqkB7ypBgBRc 3270
html2tei/link_corrector.py sha256=gol-bCRT2U4mfSoo2n3mIjZ6yLQIMB4lKGDltpfODNE 8459
html2tei/portal_article_cleaner.py sha256=Iy45Lc5ajfedIvdzm1DUo2KcXy9NGfnD6zr9x5fOVhU 19356
html2tei/processing_utils.py sha256=nmkZYs3fbY2YUXSBnUL9m4eD64qPhfzYMo85kGvPcVo 10015
html2tei/read_config.py sha256=KpcHWelTsJT8ZKO6xyB80FsKMHbpcQGtefJjM8hjNTU 10135
html2tei/tag_bigrams_maker.py sha256=SOohp7AmL54zAWtS7m2c6T-xtm0jz-LFsM9wY7tjn4U 4305
html2tei/tag_inventory_maker.py sha256=C_LY-i2NYShlP2ZQLhWTKT1s3k5z-05RD4EIGPKwG10 5434
html2tei/tei_utils.py sha256=n7Z4abkJww1G9-IWOgPeLadyAK4T_Xaf4GNYVXIgN1s 11549
html2tei/unicode_error.py sha256=S1GWPFG6GzBQ6mMP8Astzb4xeJxmHiLtylaa88D0brI 2567
html2tei/update_and_filter_tables.py sha256=Loy0Lyz0ww_35gzd-ZOzwVd0zpf6ZM4pzHuAjgZoKXI 4318
html2tei/validate_hash_zip.py sha256=DT-qxD1dOX6c7oGep1oLVBaiQypy0ZeOEEgFRXUJPrs 8005
html2tei/version.py sha256=TMTlmMSw3EGDpLyR2rrGAM_Ie-Dfbgl2lvfca_2I4QQ 196
html2tei-1.2.3.dist-info/entry_points.txt sha256=_-yUgmzGmxVf7E6Y3k8O2PIydXIMB-rSs7xqELnWmPg 51
html2tei-1.2.3.dist-info/LICENSE sha256=46mU2C5kSwOnkqkw9XQAJlhBL2JAf1_uCD8lVcXyMRg 7652
html2tei-1.2.3.dist-info/WHEEL sha256=y3eDiaFVSNTPbgzfNn0nYn5tEn1cX6WrdetDlQM4xWw 83
html2tei-1.2.3.dist-info/METADATA sha256=e1UilswlVyyWJt5Zh23D6iH0RlFHbIkdXDwJ-Qccz7I 11844
html2tei-1.2.3.dist-info/RECORD
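
The Digest column in RECORD is not a hex digest. Per the wheel format, it is the raw SHA256 hash encoded as URL-safe base64 with the trailing `=` padding removed. The sketch below reproduces that encoding; the function name `record_digest` is chosen for illustration. The zero-byte `html2tei/article_body_converters/__init__.py` entry is a convenient check, since its digest is simply the hash of empty input.

```python
import base64
import hashlib


def record_digest(data: bytes) -> str:
    """Return the RECORD-style digest string for the given file contents."""
    raw = hashlib.sha256(data).digest()
    encoded = base64.urlsafe_b64encode(raw).rstrip(b"=").decode("ascii")
    return "sha256=" + encoded


if __name__ == "__main__":
    # The empty file listed in RECORD with size 0.
    print(record_digest(b""))  # sha256=47DEQpj8HBSa-_TImW-5JCeuQeRkm5NMpJWZG3hSuFU
```

The final RECORD line carries no digest or size because a file cannot contain its own hash.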

entry_points.txt

html2tei = html2tei.__main__:main
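
The entry point line has the form `name = module:attribute`: the command name on the left, and on the right the importable module and the callable inside it. The sketch below shows how such a line can be split and resolved by hand. The helper names are hypothetical, and installers normally do this through generated launcher scripts rather than code like this.

```python
import importlib


def parse_entry_point(line):
    """Split 'name = module:attr' into (name, module, attr)."""
    name, _, target = line.partition("=")
    module, _, attr = target.strip().partition(":")
    return name.strip(), module.strip(), attr.strip()


def load_entry_point(line):
    """Import the module and return the object the entry point refers to."""
    _, module, attr = parse_entry_point(line)
    obj = importlib.import_module(module)
    # Attributes may be dotted (e.g. "Class.method"); walk them one at a time.
    for part in filter(None, attr.split(".")):
        obj = getattr(obj, part)
    return obj


if __name__ == "__main__":
    print(parse_entry_point("html2tei = html2tei.__main__:main"))
```

With the package installed, `load_entry_point("html2tei = html2tei.__main__:main")` would return the `main` function that the `html2tei` command invokes.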