sentence_splitter


1.4 sentence_splitter-1.4-py2.py3-none-any.whl

Wheel Details

Project: sentence_splitter
Version: 1.4
Filename: sentence_splitter-1.4-py2.py3-none-any.whl
Size: 44998 bytes
MD5: 2997a3de186228e9d434f92bceb751ec
SHA256: 5645a3ad9c348e4287f4bc73bd573d92dccd4139042fddd51fff0591f1376763
Uploaded: 2019-01-14 17:11:23 +0000
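
The size and SHA256 listed above can be used to check a downloaded copy of the wheel. The following is a minimal sketch, assuming the size is given in bytes; the helper names `file_sha256` and `matches_listing` are illustrative and not part of any packaging tool.

```python
import hashlib
import os

# Values copied from the wheel details above.
EXPECTED_SHA256 = "5645a3ad9c348e4287f4bc73bd573d92dccd4139042fddd51fff0591f1376763"
EXPECTED_SIZE = 44998  # assumed to be bytes


def file_sha256(path, chunk_size=65536):
    """Return the hex SHA-256 of a file, read in chunks to bound memory use."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_listing(path):
    """True if the file at `path` has the listed size and SHA-256."""
    return (
        os.path.getsize(path) == EXPECTED_SIZE
        and file_sha256(path) == EXPECTED_SHA256
    )
```

Calling `matches_listing("sentence_splitter-1.4-py2.py3-none-any.whl")` on a local download would then report whether it agrees with this listing. The MD5 value is left out of the check because SHA256 is the stronger of the two digests.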

dist-info

METADATA

Metadata-Version: 2.1
Name: sentence-splitter
Version: 1.4
Summary: Text to sentence splitter using heuristic algorithm by Philipp Koehn and Josh Schroeder
Author: Philipp Koehn, Josh Schroeder, Digital Silk Road, Linas Valiukas
Author-Email: lvaliukas[at]cyber.law.harvard.edu
Home-Page: https://github.com/berkmancenter/mediacloud-sentence-splitter
License: LGPLv3
Keywords: sentence splitter tokenization tokenizer tokenize
Classifier: Development Status :: 3 - Alpha
Classifier: Intended Audience :: Developers
Classifier: Intended Audience :: Information Technology
Classifier: Intended Audience :: Science/Research
Classifier: License :: OSI Approved :: GNU Lesser General Public License v3 (LGPLv3)
Classifier: Programming Language :: Python
Classifier: Natural Language :: Catalan
Classifier: Natural Language :: Czech
Classifier: Natural Language :: Danish
Classifier: Natural Language :: Dutch
Classifier: Natural Language :: English
Classifier: Natural Language :: Finnish
Classifier: Natural Language :: French
Classifier: Natural Language :: German
Classifier: Natural Language :: Greek
Classifier: Natural Language :: Hungarian
Classifier: Natural Language :: Icelandic
Classifier: Natural Language :: Italian
Classifier: Natural Language :: Latvian
Classifier: Natural Language :: Norwegian
Classifier: Natural Language :: Polish
Classifier: Natural Language :: Portuguese
Classifier: Natural Language :: Portuguese (Brazilian)
Classifier: Natural Language :: Romanian
Classifier: Natural Language :: Russian
Classifier: Natural Language :: Slovak
Classifier: Natural Language :: Slovenian
Classifier: Natural Language :: Spanish
Classifier: Natural Language :: Swedish
Classifier: Natural Language :: Turkish
Classifier: Operating System :: OS Independent
Classifier: Programming Language :: Python :: 3.5
Classifier: Programming Language :: Python :: 3.6
Classifier: Programming Language :: Python :: 3.7
Classifier: Programming Language :: Python :: Implementation :: PyPy
Classifier: Topic :: Database
Classifier: Topic :: Internet :: WWW/HTTP :: Indexing/Search
Classifier: Topic :: Text Processing :: Indexing
Classifier: Topic :: Text Processing :: Linguistic
Platform: any
Requires-Python: >=3.5
Requires-Dist: regex (>=2017.12.12)
[Description omitted; length: 582 characters]
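
METADATA uses an email-header-like layout in which a field such as `Classifier` may repeat, so the standard library's `email.parser` can read it. The sketch below parses a shortened excerpt of the fields above and collects the natural-language classifiers; `SAMPLE` and `natural_languages` are names made up for this illustration.

```python
from email.parser import Parser

# A shortened excerpt of the METADATA fields listed above.
SAMPLE = """\
Metadata-Version: 2.1
Name: sentence-splitter
Version: 1.4
Classifier: Natural Language :: English
Classifier: Natural Language :: German
Classifier: Operating System :: OS Independent
Requires-Python: >=3.5
Requires-Dist: regex (>=2017.12.12)
"""


def natural_languages(metadata_text):
    """Return the language names from 'Natural Language ::' classifiers."""
    message = Parser().parsestr(metadata_text)
    prefix = "Natural Language :: "
    return [
        value[len(prefix):]
        # get_all returns every occurrence of a repeated field, in order.
        for value in message.get_all("Classifier", [])
        if value.startswith(prefix)
    ]


message = Parser().parsestr(SAMPLE)
print(message["Name"], message["Version"])
print(natural_languages(SAMPLE))
```

Single-valued fields are read with `message["Name"]`, while repeated fields need `get_all`, since indexing returns only one occurrence.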

WHEEL

Wheel-Version: 1.0
Generator: bdist_wheel (0.32.3)
Root-Is-Purelib: true
Tag: py2-none-any
Tag: py3-none-any
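
The two `Tag` lines correspond to the compressed tag set `py2.py3-none-any` in the wheel filename: each dot-separated alternative in the python, ABI, and platform positions expands into its own tag. A minimal sketch of that expansion follows; `expand_tags` and `tags_from_filename` are hypothetical helper names, and the filename handling assumes a wheel name without a build tag.

```python
from itertools import product


def expand_tags(compressed):
    """Expand a compressed tag set such as 'py2.py3-none-any'."""
    pythons, abis, platforms = compressed.split("-")
    return [
        "-".join(parts)
        for parts in product(
            pythons.split("."), abis.split("."), platforms.split(".")
        )
    ]


def tags_from_filename(filename):
    """Take the last three dash-separated fields of a .whl name as the tag set."""
    stem = filename[: -len(".whl")]
    return expand_tags("-".join(stem.split("-")[-3:]))


print(tags_from_filename("sentence_splitter-1.4-py2.py3-none-any.whl"))
```

Note that the wheel advertises a `py2` tag while METADATA declares `Requires-Python: >=3.5`; installers that honor `Requires-Python` would be expected to refuse the wheel on Python 2 despite the tag.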

RECORD

Path Digest Size
sentence_splitter/__init__.py sha256=eh8aYwyMwU7-dtHfNvX-cEOdA1cDwssiAK_97LTr_pU 8449
sentence_splitter/non_breaking_prefixes/ca.txt sha256=m0HCRJPvzIcBc69PFUecxv0R1GBiVf7z4o5Vp9is7B4 1481
sentence_splitter/non_breaking_prefixes/cs.txt sha256=jGrRQWDw072t3fi6AWXZErY43lU4tUtudybjWglElek 1893
sentence_splitter/non_breaking_prefixes/da.txt sha256=1qnvxld9t0bXvHEFbVnj6FUgF6e2o3n287xvo6Jwxf8 3478
sentence_splitter/non_breaking_prefixes/de.txt sha256=JSEFQzkrA_4vWmR_A_fLWRoaDiZ3N83m_sRudMctjZw 2299
sentence_splitter/non_breaking_prefixes/el.txt sha256=0IyP_CJJENN8nDoOkZzqm8bachyxIknWzupNib3Iu9g 17034
sentence_splitter/non_breaking_prefixes/en.txt sha256=Fb7165XBIgei95hBjwziRmNiS_rxiv1QNVFMU8nFDYE 1761
sentence_splitter/non_breaking_prefixes/es.txt sha256=lEqjjsJ4TvQJBjSwGyvazo4H1Y72Nzq6Wd8Lgp2SlZc 1955
sentence_splitter/non_breaking_prefixes/fi.txt sha256=Tb55Q5Z1etjewKYp5Z_RDXw6jsAfYkFir_dzjJOiJvg 1946
sentence_splitter/non_breaking_prefixes/fr.txt sha256=jtYXP4-5H-FpOvsUOuvUAfzmc7iRTn55ITkfKqAdsrA 1585
sentence_splitter/non_breaking_prefixes/hu.txt sha256=muUDzV-8uxhH_-pGouYZMO91tl5Xn6vYcZVTrWOUV_U 1665
sentence_splitter/non_breaking_prefixes/is.txt sha256=nyJWTCe9K7h71eBGTFeLjtIYPY-bn7cdrf8ddlxF024 1059
sentence_splitter/non_breaking_prefixes/it.txt sha256=eEo6UQVBkE486TWcEWMtyFNdbsTB5YBhG5zDpxLLjT8 1408
sentence_splitter/non_breaking_prefixes/lt.txt sha256=tk0hRx0YX5D3xGOV8hjqK45zam8mUFVaxg4DQNe-SU4 8535
sentence_splitter/non_breaking_prefixes/lv.txt sha256=tSYx4yjNo8JflzqK2CSMjom5G0E8L0LDGjhJuEsKVuE 1223
sentence_splitter/non_breaking_prefixes/nl.txt sha256=LyPRTKGIpaA_mHBssJzSEc6TNNcfycvB0n6p7bg2KAQ 2111
sentence_splitter/non_breaking_prefixes/no.txt sha256=M1CDgUva64p1L-7ky3CmJPLtQyjduAQprGuijoZTY2s 2161
sentence_splitter/non_breaking_prefixes/pl.txt sha256=xWv6gmDy0s14DLhEdqjVr90tXfDfUxsxPAE1pBETSjE 1311
sentence_splitter/non_breaking_prefixes/pt.txt sha256=c2LNbil9mtUxnx08OPwa_Q2qFPupS4WIsMl5SYb8J88 4221
sentence_splitter/non_breaking_prefixes/ro.txt sha256=-qDVT7rUoEdt-r7gZHQI_v5WfArJEkGTSy7TLwuUWdY 1976
sentence_splitter/non_breaking_prefixes/ru.txt sha256=8_8J_8mVQlUc2BaHM8zxRjjgKEBTjISoP342TGR5uRw 3012
sentence_splitter/non_breaking_prefixes/sk.txt sha256=ApqiCOjbyZNKP2DFlK2g7F0Ad7mSRtnx9ydfMtCFwh0 2460
sentence_splitter/non_breaking_prefixes/sl.txt sha256=lX7zRV9KpdcIw9I5NaRULDhOsnAa2DJ1YOqD2mqTh1Q 278
sentence_splitter/non_breaking_prefixes/sv.txt sha256=qwnCTnOKi0RPZDYn62g2K8xs6wdp-xJ0zHq23ACtVrw 2512
sentence_splitter/non_breaking_prefixes/tr.txt sha256=HOTYlJ6oKoxLhpHbXeyBzb-ASH0K0tgpCoP9SOhXHiQ 2130
sentence_splitter-1.4.dist-info/LICENSE sha256=e_ty-MlGehl17TykLzmk-8179sXHXlPcnSHAuvVnH8s 782
sentence_splitter-1.4.dist-info/METADATA sha256=docjw0IBw9uQq0OhbmPypuwzolEIv3Ss2FZJwSO4-xQ 2835
sentence_splitter-1.4.dist-info/WHEEL sha256=_wJFdOYk7i3xxT8ElOkUJvOdOvfNGbR9g-bf6UQT6sU 110
sentence_splitter-1.4.dist-info/top_level.txt sha256=5wzsTyGZs4u-2wa-D2Sv24RtHsuvD_W8HN72bXFn14E 18
sentence_splitter-1.4.dist-info/RECORD
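
Each digest in RECORD is the file's SHA-256 hash encoded as URL-safe base64 with the trailing `=` padding removed, and the size is the file length in bytes. RECORD itself carries no digest because it cannot contain its own hash. A sketch of how one entry could be checked against file contents follows; `record_digest` and `entry_matches` are illustrative names, not part of any installer's API.

```python
import base64
import hashlib


def record_digest(data):
    """Format bytes as a RECORD-style digest: sha256=<urlsafe base64, no padding>."""
    raw = hashlib.sha256(data).digest()
    encoded = base64.urlsafe_b64encode(raw).rstrip(b"=").decode("ascii")
    return "sha256=" + encoded


def entry_matches(data, digest, size):
    """True if the file contents agree with a RECORD entry's digest and size."""
    return len(data) == size and record_digest(data) == digest


# Usage with made-up contents; real use would read the file from the wheel.
contents = b"sentence_splitter\n"
print(record_digest(contents), len(contents))
```

For example, `top_level.txt` is listed with a size of 18 bytes, which is the length of `sentence_splitter` plus a newline; whether the listed digest matches that exact content would need to be confirmed against the real file.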

top_level.txt

sentence_splitter