Skip to main content

Automatically and uniformly measure the behavior of many AI Systems.

Project description

ModelGauge

Goal: Make it easy to automatically and uniformly measure the behavior of many AI Systems.

[!WARNING] This repo is still in beta with a planned full release in Fall 2024. Until then we reserve the right to make backward incompatible changes as needed.

ModelGauge is an evolution of crfm-helm, intended to meet their existing use cases as well as those needed by the MLCommons AI Safety project.

Summary

ModelGauge is a library that provides a set of interfaces for Tests and Systems Under Test (SUTs) such that:

  • Each Test can be applied to all SUTs with the required underlying capabilities (e.g. does it take text input?)
  • Adding new Tests or SUTs can be done without modifications to the core libraries or support from ModelGauge authors.

Currently ModelGauge is targeted at LLMs and single turn prompt response Tests, with Tests scored by automated Annotators (e.g. LlamaGuard). However, we expect to extend the library to cover more Test, SUT, and Annotation types as we move toward full release.

Docs

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

modelgauge-0.5.0.tar.gz (32.4 kB view details)

Uploaded Source

Built Distribution

modelgauge-0.5.0-py3-none-any.whl (44.3 kB view details)

Uploaded Python 3

File details

Details for the file modelgauge-0.5.0.tar.gz.

File metadata

  • Download URL: modelgauge-0.5.0.tar.gz
  • Upload date:
  • Size: 32.4 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/1.7.1 CPython/3.11.2 Linux/6.6.17-01102-gd3cec3c11146

File hashes

Hashes for modelgauge-0.5.0.tar.gz
Algorithm Hash digest
SHA256 42fbf621dda0ed1c4d6cc97491a6e590a2b9c23650a85b89e829ba10000d16b4
MD5 d6f22fb39f0b531ed59b9cd1d5cb8954
BLAKE2b-256 f20db72ec86d6160d2a2468201d5199ebc715ea3835d58ea2f996364c677eff6

See more details on using hashes here.

File details

Details for the file modelgauge-0.5.0-py3-none-any.whl.

File metadata

  • Download URL: modelgauge-0.5.0-py3-none-any.whl
  • Upload date:
  • Size: 44.3 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/1.7.1 CPython/3.11.2 Linux/6.6.17-01102-gd3cec3c11146

File hashes

Hashes for modelgauge-0.5.0-py3-none-any.whl
Algorithm Hash digest
SHA256 f899e7855abc57296f883c2ad68d6df84c3ace8d38b8f96f3f4bcbf2e84993e9
MD5 9475a3b41b89a981b3825e89d05c6d39
BLAKE2b-256 0802defbe0b8c16e8d69175a695cb0ef3b055bfdf35c8d1c792c49ab8360d1c9

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page