modelbench

0.6.0 modelbench-0.6.0-py3-none-any.whl

Wheel Details

Project: modelbench
Version: 0.6.0
Filename: modelbench-0.6.0-py3-none-any.whl
Size: 82759 bytes
MD5: 9bd4216fc396fa082a229ae6bfaca3aa
SHA256: 1527c027e0aa3ec8b0974f7c2109ad4d65da923fb987039c705f30d06d859191
Uploaded: 2024-08-13 17:05:15 +0000
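
The SHA256 value listed above can be used to check a downloaded copy of the wheel. The following is a minimal sketch using only the Python standard library; the function names `sha256_of_file` and `matches_listing` are illustrative and not part of modelbench, and the local file path is whatever location you saved the wheel to.

```python
import hashlib

# SHA256 digest copied from the wheel details above.
EXPECTED_SHA256 = "1527c027e0aa3ec8b0974f7c2109ad4d65da923fb987039c705f30d06d859191"


def sha256_of_file(path, chunk_size=1 << 20):
    """Return the hex SHA-256 digest of the file at `path`, read in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as fh:
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_listing(path, expected=EXPECTED_SHA256):
    """True if the file's digest equals the expected hex digest."""
    return sha256_of_file(path) == expected.lower()
```

Calling `matches_listing("modelbench-0.6.0-py3-none-any.whl")` on a local download should return True only if the file is byte-for-byte the one described here.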

dist-info

METADATA

Metadata-Version: 2.1
Name: modelbench
Version: 0.6.0
Summary: Run benchmarks and generate reports measuring the behavior of many AI Systems.
Author: MLCommons AI Safety
Author-Email: ai-safety-engineering[at]mlcommons.org
Home-Page: https://github.com/mlcommons/modelbench
Project-Url: Repository, https://github.com/mlcommons/modelbench
License: Apache-2.0
Keywords: AI,GenAI,LLM,NLP,evaluate,measure,quality,testing,prompt,safety,compare,artificial,intelligence,Large,Language,Models
Classifier: Development Status :: 4 - Beta
Classifier: Intended Audience :: Developers
Classifier: Intended Audience :: Information Technology
Classifier: Intended Audience :: Science/Research
Classifier: License :: OSI Approved :: Apache Software License
Classifier: Natural Language :: English
Classifier: Operating System :: OS Independent
Classifier: Programming Language :: Python :: 3
Classifier: Programming Language :: Python :: 3.10
Classifier: Programming Language :: Python :: 3.11
Classifier: Programming Language :: Python :: 3.12
Classifier: Topic :: Scientific/Engineering
Classifier: Topic :: Scientific/Engineering :: Artificial Intelligence
Classifier: Topic :: Software Development :: Libraries :: Python Modules
Classifier: Topic :: System :: Benchmark
Classifier: Typing :: Typed
Requires-Python: >=3.10,<3.13
Requires-Dist: casefy (<0.2.0,>=0.1.7)
Requires-Dist: click (<9.0.0,>=8.1.7)
Requires-Dist: jinja2 (<4.0.0,>=3.1.3)
Requires-Dist: jq (<2.0.0,>=1.6.0)
Requires-Dist: modelgauge (>=0.6.0)
Requires-Dist: pip (<25.0,>=24.0)
Requires-Dist: retry (<0.10.0,>=0.9.2)
Requires-Dist: scipy (<2.0.0,>=1.12.0)
Requires-Dist: tabulate (<0.10.0,>=0.9.0)
Requires-Dist: termcolor (<3.0.0,>=2.4.0)
Description-Content-Type: text/markdown
[Description omitted; length: 7161 characters]
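
The METADATA file uses an email-header style layout, so the standard library `email` parser can read it. The sketch below parses a shortened excerpt of the fields above; `SAMPLE` and `read_core_metadata` are illustrative names, and the excerpt is not the full file.

```python
from email.parser import Parser

# Shortened excerpt of the METADATA fields listed above.
SAMPLE = """\
Metadata-Version: 2.1
Name: modelbench
Version: 0.6.0
Requires-Python: >=3.10,<3.13
Requires-Dist: click (<9.0.0,>=8.1.7)
Requires-Dist: modelgauge (>=0.6.0)
"""


def read_core_metadata(text):
    """Parse core metadata text into a small dictionary."""
    msg = Parser().parsestr(text)
    return {
        "name": msg["Name"],
        "version": msg["Version"],
        "requires_python": msg["Requires-Python"],
        # Requires-Dist may repeat, so collect every occurrence.
        "requires_dist": msg.get_all("Requires-Dist", []),
    }


info = read_core_metadata(SAMPLE)
```

Header lookups are case-insensitive, so `msg["requires-python"]` would return the same value.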

WHEEL

Wheel-Version: 1.0
Generator: poetry-core 1.9.0
Root-Is-Purelib: true
Tag: py3-none-any
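
The tag `py3-none-any` also appears in the wheel filename, which follows the pattern name-version[-build]-python-abi-platform.whl. A minimal sketch of splitting a filename into those parts is shown below; `parse_wheel_filename` is a hypothetical helper written for illustration, not an API of modelbench or pip.

```python
def parse_wheel_filename(filename):
    """Split a wheel filename into its dash-separated components."""
    if not filename.endswith(".whl"):
        raise ValueError(f"not a wheel filename: {filename!r}")
    parts = filename[: -len(".whl")].split("-")
    if len(parts) == 5:
        name, version, python_tag, abi_tag, platform_tag = parts
        build = None
    elif len(parts) == 6:
        # The optional build tag sits between version and python tag.
        name, version, build, python_tag, abi_tag, platform_tag = parts
    else:
        raise ValueError(f"unexpected number of components: {filename!r}")
    return {
        "name": name,
        "version": version,
        "build": build,
        "python": python_tag,
        "abi": abi_tag,
        "platform": platform_tag,
    }


tags = parse_wheel_filename("modelbench-0.6.0-py3-none-any.whl")
```

Here `py3` means any Python 3 interpreter, `none` means no ABI requirement, and `any` means no platform restriction, which is consistent with `Root-Is-Purelib: true`.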

RECORD

Path Digest Size (bytes)
modelbench/__init__.py sha256=47DEQpj8HBSa-_TImW-5JCeuQeRkm5NMpJWZG3hSuFU 0
modelbench/benchmarks.py sha256=pXCMtlOxf1paOg4OP5Sf1W-v_JTrIc64q5SUfVLTSzM 2982
modelbench/hazards.py sha256=-eD1nLOYDhBKrBjIdNPjLSm2L35hCL0P_oEIiF_nA1A 6171
modelbench/modelgauge_runner.py sha256=8ymJznBn96QgOBZFj4ebh-14frfGUAIgw7gpz3IkK7s 2910
modelbench/record.py sha256=HkhHc0OO3KgJyTKlSyvrXJCqIYq1j17YAewLN3UFsD4 4128
modelbench/run.py sha256=Pi4F3N3_fCmD50zfyoUA1eXZ48AXOdMGkJHQpEfTIV8 15146
modelbench/scoring.py sha256=PM-7Ft-YxKAwyBjX-Smrl1nj7w0nbSHwmJmdahCQXWI 2788
modelbench/standards.json sha256=yL_pGcqmwLnj8Kwy1D9CpE6tCm0140wvwxDi850eyKg 1421
modelbench/static_site_generator.py sha256=fOwg5jn94e-9OUHhcuxq_0lIt52olVCzvjqJCyDLclM 11169
modelbench/templates/_provisional.html sha256=zD5uU21MlUtGyVP7fDPc8n1kZ5OZpqu6N1u3Td1bTj4 947
modelbench/templates/_test_runs_legend.html sha256=VNwTyBX5BiEexLwa2IMkpVt83rQlEKR7Xy7puxoQ6YQ 481
modelbench/templates/base.html sha256=QIyHKMpbQaoQOx5kg4MgbqTR1wTqFJEyUn9ieMQqDEg 648
modelbench/templates/benchmark.html sha256=GOgKtshjaD4HadaEfsTCmazTOhd2lbgvh-TWrCE4lDA 2984
modelbench/templates/benchmarks.html sha256=MTYr31bmH_ExBoRD3-SAePsCGAGNEUy7lRMZmpyfu_Y 1115
modelbench/templates/content/general.toml sha256=ZERUfRkrOSGlohJ2T4XMCs7LOWtB3JHj8Kdcs_Om_zQ 1796
modelbench/templates/content/general_purpose_ai_chat_benchmark.toml sha256=xREdNA1bCKnnnbCTgsr7jgdCN_qj4zQZpw_hEwWwjFg 2754
modelbench/templates/content/general_purpose_ai_chat_benchmark_v_1.toml sha256=OzDfrU-gKW-Ok7Hgr3_mnNaivqllbvkLWY2TVwjvMNE 1753
modelbench/templates/content/grades.toml sha256=9vR0ZJp4wujiFiBUSothdRCF40Kvzsys0lX4u6fK3gk 997
modelbench/templates/content/hazards.toml sha256=SrNryRXU1zB8ELZWMo10Yp8gu2nJrjWgAAPjq5GEzbQ 1041
modelbench/templates/content/suts.toml sha256=uN36NARE2po8ZgLIlq_feblnXKEqJmU9LNzbrCyak4E 732
modelbench/templates/content/tests/bbq.toml sha256=sYqtj3jTMvtcCXlWxcPr5aI6UUriZsNLgW8GTFjx9AY 70
modelbench/templates/content/tests/real_toxicity_prompts.toml sha256=cAWIXmqS5NuF2iL3IdO7h6VRwlptQqTfgQ_giiLZ51o 77
modelbench/templates/content/tests/safe-cae-benign.toml sha256=S20y4gXRFwQMnD12YeL-52NDwIw8GQeaocklZoSaKPg 121
modelbench/templates/content/tests/safe-cae.toml sha256=9qvwKE4xVfm4Th3jrmQ3GB4LT17LFvZmgUk0uAp5njE 127
modelbench/templates/content/tests/safe-cbr.toml sha256=lKd_CDrjzl0q9AVCAHVCPIKNHCXUfrPV7wKDQqqRgKU 176
modelbench/templates/content/tests/safe-gra.toml sha256=ErYYXrEXlyVJb9XSX-Vp_MTrAikH5pR3_JxkTwjhIo0 128
modelbench/templates/content/tests/safe-ssh-benign.toml sha256=rD5KOJnmjGkL3Bg3Hp845feQwM5Z4wYjTfjsencc9-c 132
modelbench/templates/content/tests/safe-ssh.toml sha256=l4Bz-x-qVM0A1ZdGvUeSO2dEH_SBO5oG9rd4PuJFLII 138
modelbench/templates/content/tests/safe-ter-benign.toml sha256=H6cOcZ56qO5pWKj5QPRbTNRptn3Q1jJ9yxZ7mAhC07Q 102
modelbench/templates/content/tests/safe-ter.toml sha256=q71-8P7urTjt1XZEAWfLK3YV5MKLgl9bcNBgo7sloO0 108
modelbench/templates/content/tests/simple_safety_tests.toml sha256=mDtOHAEGMJZB9tzbWw7sFvonKgP6YYc2An5wx3T-PZ4 83
modelbench/templates/content/tests/xstest.toml sha256=Fr_zCOeRyelY5FBkzub-6w6hBfBX7vxEpUpac4RHEO8 67
modelbench/templates/content_mlc/general.toml sha256=exssP5Ltax7PCWz0eSPAlBF5pi7CCW_AlbDXtG5bH6Y 2348
modelbench/templates/content_mlc/general_purpose_ai_chat_benchmark.toml sha256=JLQUQW3Kp9OZslKvVZHlunZlXoqj8KLcbU5rzcl4Uxc 1073
modelbench/templates/index.html sha256=CuGfpGH8kqyuONHN1gJ-hbBNFG65ac_QDU_7ouZbbzE 330
modelbench/templates/macros/benchmark_card.html sha256=wFX4VUBJYpQ94FG5D087esiK1yBhFOwJ8ktQopYIKNQ 538
modelbench/templates/macros/breadcrumb.html sha256=R21rJ8lwTKNfKA2ZrM7XeQaL-Uk0664VkEI61aalnR0 1056
modelbench/templates/macros/interpret_safety_ratings.html sha256=qvpai4MpgTIX5JdO2nlupH7DkZxOQAiprGkRIa_VK5s 1872
modelbench/templates/macros/sut_card.html sha256=EN9G0LX6aHv3ZWw2OOhCxJLNCO10FvEZa_o0WUEkl78 1951
modelbench/templates/macros/test_runs.html sha256=jqWMV1HRsSD1OPRmxN5Z-hB66c5Pnf4_XvrjxJYlNmo 3160
modelbench/templates/macros/use_hazards_limitations.html sha256=5LFd6IP9FIpPTaZgQofpgc9RDFOsXd-D8vdEEEQW_S0 1316
modelbench/templates/static/images/ml_commons_logo.png sha256=FOoZC66i2hGE6PVKA6hSpyzC2A81qDrcJ-vcRCXo54Y 33565
modelbench/templates/static/style.css sha256=Mx36Q-yH_JOp0cV66gYKlGQYdeeV9FYMTz26t1ZDtyQ 29613
modelbench/templates/test_report.html sha256=nv-ES5PPtKxb1YPJqDMfNhlEZ2oqWHU3crO07EYGn-Q 2733
modelbench/uid.py sha256=1bIRF3m6SpPSFFUyDb7utMspZWrYvTCx02qD7vWutC0 2156
modelbench/utilities.py sha256=dKw_1bXy84WHto3MN-HNwOiMqM5xP3pG_jnyBQXQ_eY 318
modelbench-0.6.0.dist-info/LICENSE.md sha256=DVQuDIgE45qn836wDaWnYhSdxoLXgpRRKH4RuTjpRZQ 10174
modelbench-0.6.0.dist-info/METADATA sha256=jasxG0jDL_39i_JE0fewJbQmCdcL-99aNbqkEYDOo98 8900
modelbench-0.6.0.dist-info/WHEEL sha256=sP946D7jFCHeNz5Iq4fL4Lu-PrWrFsgfLXbbkciIZwg 88
modelbench-0.6.0.dist-info/entry_points.txt sha256=I4hxcFOVRR1G8A3RzM-CGfHOxMSByGJ1XRGQ5sSDnf8 49
modelbench-0.6.0.dist-info/RECORD
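
Each RECORD digest is the SHA-256 hash of the file's bytes, encoded as URL-safe base64 with the trailing `=` padding removed. The sketch below reproduces that encoding; `record_digest` is an illustrative name. The empty `modelbench/__init__.py` (size 0) gives a convenient check, since its digest is the hash of zero bytes.

```python
import base64
import hashlib


def record_digest(data: bytes) -> str:
    """Return a RECORD-style digest string for the given file contents."""
    raw = hashlib.sha256(data).digest()
    # URL-safe base64, with '=' padding stripped, as used in RECORD.
    encoded = base64.urlsafe_b64encode(raw).rstrip(b"=").decode("ascii")
    return f"sha256={encoded}"


# Digest of an empty file, matching the modelbench/__init__.py entry above.
empty_digest = record_digest(b"")
```

The RECORD file lists itself without a digest or size, because it cannot contain its own hash.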

entry_points.txt

modelbench = modelbench.run:cli
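
The entry above maps the command name `modelbench` to the object `cli` in the module `modelbench.run`. A minimal sketch of reading that specification with the standard library follows. The group name `console_scripts` is an assumption, since the section header of entry_points.txt is not shown in this listing.

```python
from importlib.metadata import EntryPoint

# Group name is assumed; the listing above does not show the section header.
ep = EntryPoint(
    name="modelbench",
    value="modelbench.run:cli",
    group="console_scripts",
)

module_name = ep.module  # module that would be imported
attr_name = ep.attr      # object inside that module that would be called
```

Calling `ep.load()` would import `modelbench.run` and return `cli`, which only works in an environment where the package and its dependencies are installed.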