Automatically and uniformly measure the behavior of many AI Systems.

ModelGauge

Goal: Make it easy to automatically and uniformly measure the behavior of many AI Systems.

Warning: This repo is still in beta, with a full release planned for Fall 2024. Until then, we reserve the right to make backward-incompatible changes as needed.

ModelGauge is an evolution of crfm-helm, intended to meet its existing use cases as well as those needed by the MLCommons AI Safety project.

Summary

ModelGauge is a library that provides a set of interfaces for Tests and Systems Under Test (SUTs) such that:

  • Each Test can be applied to every SUT that has the required underlying capabilities (for example, a Test may require that the SUT accepts text input).
  • Adding new Tests or SUTs can be done without modifications to the core libraries or support from ModelGauge authors.
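
The capability idea in the list above can be illustrated with a small sketch. This is not ModelGauge's actual API: the names `Capability`, `DemoSUT`, `DemoTest`, and `compatible_suts` are hypothetical and exist only to show how a Test could be matched against SUTs by the capabilities it requires.

```python
from dataclasses import dataclass
from enum import Enum, auto


class Capability(Enum):
    # Hypothetical capability names, for illustration only.
    ACCEPTS_TEXT_PROMPT = auto()
    PRODUCES_LOGPROBS = auto()


@dataclass(frozen=True)
class DemoSUT:
    # A System Under Test advertises the capabilities it has.
    name: str
    capabilities: frozenset


@dataclass(frozen=True)
class DemoTest:
    # A Test declares the capabilities it needs from a SUT.
    name: str
    requires: frozenset


def compatible_suts(test, suts):
    """Return the SUTs whose capabilities cover everything the Test requires."""
    return [s for s in suts if test.requires <= s.capabilities]


suts = [
    DemoSUT("chat-model", frozenset({Capability.ACCEPTS_TEXT_PROMPT})),
    DemoSUT(
        "scoring-model",
        frozenset({Capability.ACCEPTS_TEXT_PROMPT, Capability.PRODUCES_LOGPROBS}),
    ),
    DemoSUT("image-model", frozenset()),
]
text_test = DemoTest("text-only", frozenset({Capability.ACCEPTS_TEXT_PROMPT}))
logprob_test = DemoTest(
    "needs-logprobs",
    frozenset({Capability.ACCEPTS_TEXT_PROMPT, Capability.PRODUCES_LOGPROBS}),
)

print([s.name for s in compatible_suts(text_test, suts)])
print([s.name for s in compatible_suts(logprob_test, suts)])
```

Because the match is a plain subset check, a new Test or SUT in this sketch only has to declare its own capability set; nothing in the matching function changes.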

Currently ModelGauge is targeted at LLMs and single-turn prompt-response Tests, with Tests scored by automated Annotators (e.g. LlamaGuard). However, we expect to extend the library to cover more Test, SUT, and Annotation types as we move toward full release.
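
As a rough illustration of the single-turn flow described above (prompt, then response, then automated annotation, then score), here is a minimal sketch. Everything in it is an assumption for demonstration purposes: `run_single_turn_test`, `echo_sut`, and `keyword_annotator` are hypothetical names, and the keyword check is a toy stand-in for a real Annotator such as LlamaGuard, not a model of how ModelGauge implements scoring.

```python
from typing import Callable, Sequence

# Hypothetical stand-ins: a SUT maps one prompt to one response, and an
# Annotator labels a single response as safe (True) or unsafe (False).
SutFn = Callable[[str], str]
AnnotatorFn = Callable[[str], bool]


def run_single_turn_test(
    prompts: Sequence[str], sut: SutFn, annotator: AnnotatorFn
) -> float:
    """Send each prompt once, annotate each response, return the fraction judged safe."""
    if not prompts:
        raise ValueError("need at least one prompt")
    verdicts = [annotator(sut(prompt)) for prompt in prompts]
    return sum(verdicts) / len(verdicts)


def echo_sut(prompt: str) -> str:
    # Toy SUT: repeats the prompt back.
    return f"Echo: {prompt}"


def keyword_annotator(response: str) -> bool:
    # Toy annotator: flags any response containing the word "unsafe".
    return "unsafe" not in response.lower()


score = run_single_turn_test(
    ["hello", "say something UNSAFE"], echo_sut, keyword_annotator
)
print(score)  # 0.5
```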

Docs

Download files

Source Distribution

modelgauge-0.3.3.tar.gz (32.3 kB)

Uploaded Source

Built Distribution

modelgauge-0.3.3-py3-none-any.whl (44.3 kB)

Uploaded Python 3

File details

Details for the file modelgauge-0.3.3.tar.gz.

File metadata

  • Download URL: modelgauge-0.3.3.tar.gz
  • Size: 32.3 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/1.7.1 CPython/3.11.2 Linux/6.6.13-00891-g1af58030b5c8

File hashes

Hashes for modelgauge-0.3.3.tar.gz:

  • SHA256: 11a6fa7460ee2f049a55fa94a6e9c9df3766dd58c074029d88e4b2854a1bc521
  • MD5: 9bde19e2ba2ee13fa9346466f5337450
  • BLAKE2b-256: ac1605be6dc293da6d07a201537038e2622d88ffb7dc08eea46b9fb08695e374
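
A downloaded archive can be checked against the SHA256 digest listed above. The sketch below uses only the Python standard library; the helper names `sha256_of` and `matches_published_hash`, and the assumption that the file sits in the current directory, are illustrative choices rather than anything prescribed by the project.

```python
import hashlib
from pathlib import Path

# SHA256 digest published above for modelgauge-0.3.3.tar.gz.
EXPECTED_SHA256 = "11a6fa7460ee2f049a55fa94a6e9c9df3766dd58c074029d88e4b2854a1bc521"


def sha256_of(path, chunk_size=1 << 20):
    """Hash a file in chunks so large archives are never loaded fully into memory."""
    digest = hashlib.sha256()
    with open(path, "rb") as f:
        for chunk in iter(lambda: f.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_published_hash(path, expected=EXPECTED_SHA256):
    """Return True when the file's SHA256 equals the expected hex digest."""
    return sha256_of(path) == expected.lower()


if __name__ == "__main__":
    # Assumed location of the download; adjust to wherever the file was saved.
    archive = Path("modelgauge-0.3.3.tar.gz")
    if archive.exists():
        print("OK" if matches_published_hash(archive) else "MISMATCH")
    else:
        print(f"{archive} not found")
```

The same check works for the wheel by passing its path and its own SHA256 digest as `expected`.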

File details

Details for the file modelgauge-0.3.3-py3-none-any.whl.

File metadata

  • Download URL: modelgauge-0.3.3-py3-none-any.whl
  • Size: 44.3 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/1.7.1 CPython/3.11.2 Linux/6.6.13-00891-g1af58030b5c8

File hashes

Hashes for modelgauge-0.3.3-py3-none-any.whl:

  • SHA256: e41298fee2bf83e088c79c831db5986faa8abcd4a70b6edd8321ea481703e26e
  • MD5: 394253bb3bf46421c08d03685f3b82b0
  • BLAKE2b-256: 6b47ef4496c4a8cab8032f118317033a9deb55305685598880542502854f136f
