openwebmath-text-extract

View on PyPIReverse Dependencies (0)

0.1.3 openwebmath_text_extract-0.1.3-py3-none-any.whl

Wheel Details

Project: openwebmath-text-extract
Version: 0.1.3
Filename: openwebmath_text_extract-0.1.3-py3-none-any.whl
Download: [link]
Size: 38554
MD5: 1715e1f896cf3093fb92e8ee68b6fc7f
SHA256: 0ed258a287228df1b45a0add1cafb179ab4ea0a3e50a27154858f8c1d2318418
Uploaded: 2024-05-30 04:32:14 +0000

dist-info

METADATA

Metadata-Version: 2.1
Name: openwebmath-text-extract
Version: 0.1.3
Summary: Text Extractor from OpenWebMath
Author-Email: Keiran Paster <keirp[at]cs.toronto.edu>
Maintainer-Email: Luca Soldaini <luca[at]soldaini.net>
Project-Url: homepage, https://github.com/soldni/OpenWebMath
License: Apache-2.0
Requires-Python: >=3.8
Requires-Dist: resiliparse
Requires-Dist: py-asciimath
Requires-Dist: tabulate
Requires-Dist: lxml
Requires-Dist: pyyaml
Requires-Dist: numpy
Requires-Dist: urllib3
Requires-Dist: black (>=22.6.0); extra == "dev"
Requires-Dist: isort (>=5.10.1); extra == "dev"
Requires-Dist: mypy (>=0.971); extra == "dev"
Requires-Dist: pytest (>=5.2); extra == "dev"
Requires-Dist: ipython (>=8.4.0); extra == "dev"
Requires-Dist: autopep8 (>=1.7.0); extra == "dev"
Requires-Dist: flake8 (>=5.0); extra == "dev"
Requires-Dist: ipdb (>=0.13.0); extra == "dev"
Requires-Dist: flake8-pyi (>=22.8.1); extra == "dev"
Requires-Dist: Flake8-pyproject (>=1.1.0); extra == "dev"
Provides-Extra: dev
Description-Content-Type: text/markdown
[No description]

WHEEL

Wheel-Version: 1.0
Generator: bdist_wheel (0.43.0)
Root-Is-Purelib: true
Tag: py3-none-any

RECORD

Path Digest Size
text_extract/__init__.py sha256=47DEQpj8HBSa-_TImW-5JCeuQeRkm5NMpJWZG3hSuFU 0
text_extract/banned_selectors.txt sha256=ZpoULPkHxqy7qpLHBjqDfRY0UdPNhdnd8nYpDaDTAMk 1807
text_extract/boilerplate_words.txt sha256=-meAPbbaqO1sjMAID-k4S193UnHmOtsK8nA0_g11_uc 1851
text_extract/extract.py sha256=TqDv31Y9iWOIB_yJ7IEcNQ4QRZA72BKuCYYT5V5MMX0 5034
text_extract/latex_processing.py sha256=by0IU7TlEbG8z9yqWldhRUNFjcNc9D2p2NlGCevIs-8 27327
text_extract/line_processing.py sha256=n80PolC_IiZ8S32uRWsuS9-8ERYC_OB3ussHIYx-mek 3141
text_extract/py.typed sha256=47DEQpj8HBSa-_TImW-5JCeuQeRkm5NMpJWZG3hSuFU 0
text_extract/tree_processing.py sha256=z87hPINK6aNfKxDdqb9kNNsET6TZdEfa5zh9N5qQuWI 14725
text_extract/utils.py sha256=8TjXWHNZawjH5LuDAtx7A7njLeRKYGR4cVcMiXQhwjQ 3401
text_extract/configs/randomized_all.yaml sha256=AXpZsukRM1FeL1eTvwUT1F4ulzdDLRtzUhKmdMbZsuc 563
text_extract/mmltex/README sha256=lNrWsEcf7If-vWd50RusgGIVR7nW1MHO3_VOG8LIugI 3189
text_extract/mmltex/cmarkup.xsl sha256=qssTbdb5_vMPFeo6NQ7cZzQ2iwhRWkJtN59dJDtsbhc 36723
text_extract/mmltex/entities.xsl sha256=uNDq3O4p5u9On0la1bj3KlhOEoS_mQWXZJFeIzFiS2c 71528
text_extract/mmltex/glayout.xsl sha256=sgHlr9-M-FRJrJDtxDJlylpAv6O1b2xjJdaIOsZhv_g 6333
text_extract/mmltex/mmltex.xsl sha256=6KUIyFc9Np8eTbKWbrnXCJhwbmH0WLZl5QnZFVIK0Yk 1643
text_extract/mmltex/scripts.xsl sha256=-qaihwEBKOOKUVa_sqmTGo2rhcmwjvXFklZt2lnrzZ8 9901
text_extract/mmltex/tables.xsl sha256=RxtNo8qDtVAg8_6BuYsafraB_0z7YDAB9D__fT9gmWs 4327
text_extract/mmltex/tokens.xsl sha256=lQIhRsXQ3WOHchoYHIJ0TkoI8mbeHZI-xLsk8NsRMBU 10654
openwebmath_text_extract-0.1.3.dist-info/METADATA sha256=JScpdCBs-8Z2znqgSYTxpBeCEw1y6XLFWhjyHtH5ZY8 1023
openwebmath_text_extract-0.1.3.dist-info/WHEEL sha256=GJ7t_kWBFywbagK5eo9IoUwLW6oyOeTKmQ-9iHFVNxQ 92
openwebmath_text_extract-0.1.3.dist-info/top_level.txt sha256=hrJ-gm0gR-_Y4RTuB3CYV7qMtCV0jE87qWFyCMT9ros 13
openwebmath_text_extract-0.1.3.dist-info/RECORD

top_level.txt

text_extract