Skip to main content

Automatically and uniformly measure the behavior of many AI Systems.

Project description

ModelGauge

Goal: Make it easy to automatically and uniformly measure the behavior of many AI Systems.

[!WARNING] This repo is still in beta with a planned full release in Fall 2024. Until then we reserve the right to make backward incompatible changes as needed.

ModelGauge is an evolution of crfm-helm, intended to meet their existing use cases as well as those needed by the MLCommons AI Safety project.

Summary

ModelGauge is a library that provides a set of interfaces for Tests and Systems Under Test (SUTs) such that:

  • Each Test can be applied to all SUTs with the required underlying capabilities (e.g. does it take text input?)
  • Adding new Tests or SUTs can be done without modifications to the core libraries or support from ModelGauge authors.

Currently ModelGauge is targeted at LLMs and single turn prompt response Tests, with Tests scored by automated Annotators (e.g. LlamaGuard). However, we expect to extend the library to cover more Test, SUT, and Annotation types as we move toward full release.

Docs

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

modelgauge-0.6.1.tar.gz (53.7 kB view details)

Uploaded Source

Built Distribution

modelgauge-0.6.1-py3-none-any.whl (71.2 kB view details)

Uploaded Python 3

File details

Details for the file modelgauge-0.6.1.tar.gz.

File metadata

  • Download URL: modelgauge-0.6.1.tar.gz
  • Upload date:
  • Size: 53.7 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/1.8.2 CPython/3.10.12 Linux/6.9.3-76060903-generic

File hashes

Hashes for modelgauge-0.6.1.tar.gz
Algorithm Hash digest
SHA256 9619d863f8ec6536a3caf61a2ade6dda89790b9fcf28b78bc54a1552037c019d
MD5 ddd79bce6655a0eb94b0d8e5b81f9fb6
BLAKE2b-256 7b95941c4b93706af039e1a1c467f8482e4e50fb72921db149445d7783884bf5

See more details on using hashes here.

File details

Details for the file modelgauge-0.6.1-py3-none-any.whl.

File metadata

  • Download URL: modelgauge-0.6.1-py3-none-any.whl
  • Upload date:
  • Size: 71.2 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/1.8.2 CPython/3.10.12 Linux/6.9.3-76060903-generic

File hashes

Hashes for modelgauge-0.6.1-py3-none-any.whl
Algorithm Hash digest
SHA256 bdd149b9e4d8f04d9d8aea1dcdd85fed17e4b870f34371b8ef96c247836a7662
MD5 3acb13bf5e666b10efabd400c0ed31ab
BLAKE2b-256 27680d4de08572177f286be5da28e0c0860fdc493eef1ada3c3a1dd161d0ac1f

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page