opusfilter

View on PyPIReverse Dependencies (1)

3.2.0 opusfilter-3.2.0-py3-none-any.whl

Wheel Details

Project: opusfilter
Version: 3.2.0
Filename: opusfilter-3.2.0-py3-none-any.whl
Download: [link]
Size: 65261
MD5: ffda0c38699a6208300e12e8fcb12262
SHA256: 8fc6addda097ea92044beb779b096c9ade83e6fe649560f248c6ba340b366ff8
Uploaded: 2024-08-14 08:06:00 +0000

dist-info

METADATA

Metadata-Version: 2.1
Name: opusfilter
Version: 3.2.0
Summary: Toolbox for filtering parallel corpora
Author: Mikko Aulamo, Sami Virpioja
Author-Email: mikko.aulamo[at]helsinki.fi
Home-Page: https://github.com/Helsinki-NLP/OpusFilter
Classifier: Programming Language :: Python :: 3
Classifier: License :: OSI Approved :: MIT License
Classifier: Operating System :: OS Independent
Requires-Python: >=3.8
Requires-Dist: setuptools
Requires-Dist: opustools (>=1.6.2)
Requires-Dist: beautifulsoup4 (>=4.8.0)
Requires-Dist: graphviz
Requires-Dist: py3langid (>=0.2.2)
Requires-Dist: matplotlib
Requires-Dist: morfessor
Requires-Dist: opus-fast-mosestokenizer (>=0.0.8.5)
Requires-Dist: pandas (>=1.0.0)
Requires-Dist: xxhash (>=3.2.0)
Requires-Dist: sentence-splitter
Requires-Dist: rapidfuzz
Requires-Dist: ruamel.yaml (>=0.15.0)
Requires-Dist: regex
Requires-Dist: requests
Requires-Dist: scikit-learn
Requires-Dist: subword-nmt
Requires-Dist: tqdm
Requires-Dist: lingua-language-detector (>=1.3.0)
Requires-Dist: pycld2; extra == "all"
Requires-Dist: py3langid (<0.3.0); extra == "all"
Requires-Dist: numpy (<2.0.0); extra == "all"
Requires-Dist: fasttext; extra == "all"
Requires-Dist: eflomal (>=2.0.0); extra == "all"
Requires-Dist: jieba (>=0.42); extra == "all"
Requires-Dist: mecab-python3 (>=1.0.8); extra == "all"
Requires-Dist: unidic-lite; extra == "all"
Requires-Dist: laserembeddings; extra == "all"
Requires-Dist: varikn; extra == "all"
Requires-Dist: pytest; extra == "all"
Requires-Dist: myst-parser; extra == "all"
Requires-Dist: sphinx; extra == "all"
Requires-Dist: sphinx-rtd-theme; extra == "all"
Requires-Dist: sphinxcontrib-bibtex; extra == "all"
Requires-Dist: myst-parser; extra == "docs"
Requires-Dist: sphinx; extra == "docs"
Requires-Dist: sphinx-rtd-theme; extra == "docs"
Requires-Dist: sphinxcontrib-bibtex; extra == "docs"
Requires-Dist: eflomal (>=2.0.0); extra == "eflomal"
Requires-Dist: py3langid (<0.3.0); extra == "fasttext"
Requires-Dist: numpy (<2.0.0); extra == "fasttext"
Requires-Dist: fasttext; extra == "fasttext"
Requires-Dist: jieba (>=0.42); extra == "jieba"
Requires-Dist: laserembeddings; extra == "laser"
Requires-Dist: mecab-python3 (>=1.0.8); extra == "mecab"
Requires-Dist: unidic-lite; extra == "mecab"
Requires-Dist: pycld2; extra == "pycld2"
Requires-Dist: pytest; extra == "test"
Requires-Dist: varikn; extra == "varikn"
Provides-Extra: all
Provides-Extra: docs
Provides-Extra: eflomal
Provides-Extra: fasttext
Provides-Extra: jieba
Provides-Extra: laser
Provides-Extra: mecab
Provides-Extra: pycld2
Provides-Extra: test
Provides-Extra: varikn
Description-Content-Type: text/markdown
License-File: LICENSE
[Description omitted; length: 2450 characters]

WHEEL

Wheel-Version: 1.0
Generator: setuptools (72.2.0)
Root-Is-Purelib: true
Tag: py3-none-any

RECORD

Path Digest Size
opusfilter/__init__.py sha256=CDV-2Kox1v8F995zZk87vN0GyO2_QZulBYmjgTGWSbs 2908
opusfilter/autogen.py sha256=icMZzN6XGVErMns-gCtQ5wNaz55xbdH6gYkVUS3_lFk 23143
opusfilter/autogen_cluster.py sha256=zeYmOet9eUBIUF8PJOQT1zj8UGzlP4H2MFDQP-GysXA 14275
opusfilter/classifier.py sha256=OAt4T-xKCiWz01JlXEJBHJEZ8DulA-_RgTWKql6Q-Rk 16384
opusfilter/embeddings.py sha256=5iGHxJ8lK0B_r6MTmWW8ZZldt7BibiYr_VMIxvOSSTw 6435
opusfilter/filters.py sha256=ZKff8YpZ9VLH41h6mPi2k3mrLH9fwX6Y6DBFR2z3_s8 24386
opusfilter/lm.py sha256=YIK2TSmYjGBSzVuscvPWsexOuYyyl5N7oSuCb28hUBE 18048
opusfilter/opusfilter.py sha256=MzAs93_km2gRzF8GVOVCwUncBiwDzZKrXdIHfJIsY94 48203
opusfilter/pipeline.py sha256=mSL1-fhgWlBdPqW-HBe5fGJbTXjnVpucYOtCQ_U3qmo 5136
opusfilter/preprocessors.py sha256=kreKjzP1ObKpRrU1T0Yc00kexUms8wT4YF3tjbPhZsk 6082
opusfilter/segment_hash.py sha256=iPXIgK4AznMa3MUTyVb9H3kIKfjCPKmOOngKksYzDHM 3064
opusfilter/subwords.py sha256=3pMweU6tiMyZkxSQFEE-jZVdF8VwIH58GN6H-am36ME 6495
opusfilter/tokenization.py sha256=7KbJkhtJH3WFk2pqR8iK5GhztRiNGwDNRUL0ibpV5AA 5857
opusfilter/util.py sha256=zQmjRP98qqyr8OHN9qf8KMDMbHLjx03HExGUPBGOorU 9181
opusfilter/word_alignment.py sha256=5tp3FcqhSu2raWl3qrquOXK3mSS957enUL6QZHwlXs8 9474
opusfilter-3.2.0.data/scripts/opusfilter sha256=XpKgnM_pDbAJjQqP8rDV2a7IqJNf953FxFCGjNkKCJY 1170
opusfilter-3.2.0.data/scripts/opusfilter-autogen sha256=RazlNj-7bLwiR--PBoxBLuuChg9USkk1jkKreITr5Lo 4891
opusfilter-3.2.0.data/scripts/opusfilter-cmd sha256=ngtTWhl9APs2LAGM-vlNEbiW6QO8QvFte7mdcPoFWlg 2616
opusfilter-3.2.0.data/scripts/opusfilter-diagram sha256=GdJVcmjwb77ygPTLkgk3ZbD-z7WEhp87tdyct5uXkfA 3663
opusfilter-3.2.0.data/scripts/opusfilter-duplicates sha256=xbVj-N-Hf9oMGcMtJ4ssXujLKaV9ioViADjaI6gTce8 2924
opusfilter-3.2.0.data/scripts/opusfilter-scores sha256=wAYIQpCVHp6CkAlyqLuoBbelI2rtNSxKyhXoI_s1ldQ 8454
opusfilter-3.2.0.data/scripts/opusfilter-test sha256=56bDh5gjg3CmDV-Ik_uRdx2IbB0nd8VCycaSJYZC2lA 2547
opusfilter-3.2.0.dist-info/LICENSE sha256=w58hY4ey5pAQuqPEQI9fMsKEgUvw4bRM8qCMQcp2WvY 1074
opusfilter-3.2.0.dist-info/METADATA sha256=U_LP7gURa3uCOSCZ6nLIVGPIp5-my5pzDfbLu1rbobw 5055
opusfilter-3.2.0.dist-info/WHEEL sha256=HiCZjzuy6Dw0hdX5R3LCFPDmFS4BWl8H-8W39XfmgX4 91
opusfilter-3.2.0.dist-info/top_level.txt sha256=c5aCKb2pgq95RH_duH6VRkuVHrjjdwLm-AdbnNJrEoc 11
opusfilter-3.2.0.dist-info/RECORD

top_level.txt

opusfilter