This is an unofficial, use-at-your-own-risk port of the WebArena benchmark, packaged as a standalone library.

Project description

Warning: use at your own risk!

Unofficial WebArena port for compatibility with BrowserGym. Changes below.

More flexible/recent dependencies

  • playwright>=1.32,<1.40
  • openai>=1
  • transformers
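The version bounds above can be checked programmatically. The following is a minimal sketch, not part of the package: the helper names are hypothetical, and it compares only the major and minor components, so it ignores pre-release suffixes that a full PEP 440 parser would handle.

```python
import re


def parse_major_minor(version: str) -> tuple[int, int]:
    """Extract (major, minor) from a version string such as '1.39.0'.

    A missing minor component (for example '1') is treated as 0.
    """
    match = re.match(r"(\d+)(?:\.(\d+))?", version)
    if match is None:
        raise ValueError(f"unrecognized version string: {version!r}")
    return int(match.group(1)), int(match.group(2) or 0)


def playwright_version_ok(version: str) -> bool:
    """True if the version falls in the range playwright>=1.32,<1.40."""
    return (1, 32) <= parse_major_minor(version) < (1, 40)


def openai_version_ok(version: str) -> bool:
    """True if the version satisfies openai>=1."""
    return parse_major_minor(version)[0] >= 1
```

In an environment where the dependencies are installed, the version string can be obtained with `importlib.metadata.version("playwright")` and passed to the matching helper.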

Packaging into a single Python namespace

pip install libwebarena
import webarena
import webarena.browser_env
import webarena.agent
import webarena.evaluation_harness
import webarena.llms
import webarena.llms.providers
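After installation, one quick sanity check is to confirm that each of the modules listed above can be located. This is a sketch under the assumption that only the module names shown here matter; the helper names `is_importable` and `import_report` are hypothetical.

```python
import importlib.util

# Module names taken from the import lines above.
MODULES = [
    "webarena",
    "webarena.browser_env",
    "webarena.agent",
    "webarena.evaluation_harness",
    "webarena.llms",
    "webarena.llms.providers",
]


def is_importable(name: str) -> bool:
    """Report whether a module can be located.

    For dotted names, find_spec imports the parent packages first, so a
    missing or broken parent shows up as an ImportError here.
    """
    try:
        return importlib.util.find_spec(name) is not None
    except ImportError:
        return False


def import_report(names=MODULES) -> dict[str, bool]:
    """Map each module name to whether it can be located."""
    return {name: is_importable(name) for name in names}
```

Calling `import_report()` returns a dictionary with one entry per module; any `False` value suggests the package is missing or only partially installed.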

Making HTMLContentEvaluator idempotent (validate() should not alter the browser's state)
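To illustrate what idempotence means here, the toy sketch below shows one way a validation step can inspect another page and then restore the browser to where it started. It does not reproduce the package's implementation: `FakePage` and this `validate` function are hypothetical stand-ins, not the real `HTMLContentEvaluator` or Playwright API.

```python
from dataclasses import dataclass


@dataclass
class FakePage:
    """Hypothetical stand-in for a browser page; it only tracks the current URL."""

    url: str

    def goto(self, url: str) -> None:
        self.url = url

    def content(self) -> str:
        # Toy content: the page "contains" its own URL.
        return f"<html>{self.url}</html>"


def validate(page: FakePage, check_url: str, required_text: str) -> bool:
    """Check the content at check_url, then return the page to its original URL."""
    original_url = page.url
    try:
        page.goto(check_url)
        return required_text in page.content()
    finally:
        # Restore the starting state even if the check raises.
        page.goto(original_url)
```

Because the page ends each call at the URL it started on, calling `validate` repeatedly gives the same result and leaves later steps unaffected.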

Download files

Source Distribution

libwebarena-0.0.3.tar.gz (107.1 kB)

Uploaded Source

Built Distribution

libwebarena-0.0.3-py3-none-any.whl (116.3 kB)

Uploaded Python 3

File details

Details for the file libwebarena-0.0.3.tar.gz.

File metadata

  • Download URL: libwebarena-0.0.3.tar.gz
  • Size: 107.1 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/5.0.0 CPython/3.12.3

File hashes

Hashes for libwebarena-0.0.3.tar.gz
Algorithm    Hash digest
SHA256       3d05fae6749931aaf26e6c80fd665725dfeab41ac4848f168c407dbe3de89baf
MD5          f18092eee0fd9edc55896775e4d3ef23
BLAKE2b-256  eaa194238187b6c5225be340ebcaf86dd9c1befbc4836cdc7d20439727042237
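A downloaded file can be checked against the published digest before installing it. The sketch below uses only the standard library; the SHA256 value is the one listed above for the source distribution, while the function names and the local file path in the usage note are assumptions.

```python
import hashlib

# SHA256 digest listed above for libwebarena-0.0.3.tar.gz.
EXPECTED_SHA256 = "3d05fae6749931aaf26e6c80fd665725dfeab41ac4848f168c407dbe3de89baf"


def sha256_of(path: str, chunk_size: int = 1 << 20) -> str:
    """Return the hex SHA256 digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as f:
        for chunk in iter(lambda: f.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_digest(path: str, expected_hex: str) -> bool:
    """Compare a file's SHA256 digest with an expected hex string, ignoring case."""
    return sha256_of(path) == expected_hex.strip().lower()
```

For example, `matches_digest("libwebarena-0.0.3.tar.gz", EXPECTED_SHA256)` should return `True` for an intact download saved under that name in the current directory.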

File details

Details for the file libwebarena-0.0.3-py3-none-any.whl.

File metadata

  • Download URL: libwebarena-0.0.3-py3-none-any.whl
  • Size: 116.3 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/5.0.0 CPython/3.12.3

File hashes

Hashes for libwebarena-0.0.3-py3-none-any.whl
Algorithm    Hash digest
SHA256       aa0a0879486e5c90b2b2ec1c3bf309b0c7f13ee2bf7c8945447ac15f7027d248
MD5          ac23ce753fc8df4cbef434a41a717260
BLAKE2b-256  da88add70e890f192f280d39d823a2f026c2c0aae6806f8aa2512b065d00d4d2
