Tools for single-embodiment, multiple-task Reinforcement Learning
AgentFlow: A Modular Toolkit for Scalable RL Research
Overview
AgentFlow is a library for composing Reinforcement-Learning agents. The core
features that AgentFlow provides are:
- tools for slicing, transforming, and composing specs
- tools for encapsulating and composing RL-tasks.
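To make the idea of slicing and composing specs concrete, here is a minimal sketch in plain Python. It does not use AgentFlow's actual API: `BoundedSpec`, `slice_spec`, and `compose_specs` are hypothetical names invented for illustration, and the three-dimensional arm-plus-gripper action spec is an assumed example.

```python
from dataclasses import dataclass
from typing import Sequence, Tuple


@dataclass(frozen=True)
class BoundedSpec:
    """Hypothetical stand-in for a bounded action spec (not AgentFlow's type)."""
    names: Tuple[str, ...]
    minimum: Tuple[float, ...]
    maximum: Tuple[float, ...]


def slice_spec(spec: BoundedSpec, keep: Sequence[str]) -> BoundedSpec:
    """Returns a sub-spec with only the named dimensions, in the given order."""
    idx = [spec.names.index(name) for name in keep]
    return BoundedSpec(
        names=tuple(spec.names[i] for i in idx),
        minimum=tuple(spec.minimum[i] for i in idx),
        maximum=tuple(spec.maximum[i] for i in idx),
    )


def compose_specs(*specs: BoundedSpec) -> BoundedSpec:
    """Concatenates several specs into one, preserving argument order."""
    return BoundedSpec(
        names=tuple(n for s in specs for n in s.names),
        minimum=tuple(v for s in specs for v in s.minimum),
        maximum=tuple(v for s in specs for v in s.maximum),
    )


# Assumed example: a full action spec covering an arm and a gripper.
full = BoundedSpec(
    names=("arm/j0", "arm/j1", "gripper/grasp"),
    minimum=(-1.0, -1.0, 0.0),
    maximum=(1.0, 1.0, 1.0),
)
arm = slice_spec(full, ["arm/j0", "arm/j1"])
gripper = slice_spec(full, ["gripper/grasp"])
recomposed = compose_specs(arm, gripper)
```

In this sketch, slicing the full spec into its arm and gripper parts and composing them again in the same order reproduces the original spec, which is the property that lets separately defined modules share one embodiment.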
Unlike the standard RL setup, which assumes a single environment and an agent,
AgentFlow is designed for the single-embodiment, multiple-task regime. This
was motivated by the robotics use-case, which frequently requires training RL
modules for various skills, and then composing them (possibly with non-learned
controllers too).
Instead of having to implement a separate RL environment for each skill and
combine them ad hoc, with AgentFlow you can define one or more SubTasks
which modify a timestep from a single top-level environment, e.g. adding
observations and defining rewards, or isolating a particular sub-system of the
environment, such as a robot arm.
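The following sketch illustrates what a SubTask of this kind might do. It is not AgentFlow's real SubTask interface: the `TimeStep` dataclass, the `ReachSubTask` class, and its `transform_timestep` and `embed_action` methods are all hypothetical, as are the observation key `arm/position` and the assumption that the arm occupies the leading action dimensions.

```python
from dataclasses import dataclass, replace
from typing import Dict, Sequence, Tuple


@dataclass(frozen=True)
class TimeStep:
    """Hypothetical, simplified timestep (not AgentFlow's or dm_env's type)."""
    observation: Dict[str, Tuple[float, ...]]
    reward: float = 0.0


class ReachSubTask:
    """Hypothetical subtask: reach a target position using only the arm."""

    def __init__(self, target: Tuple[float, ...], arm_dims: int):
        self._target = target
        self._arm_dims = arm_dims

    def transform_timestep(self, parent: TimeStep) -> TimeStep:
        """Adds a target observation and defines a distance-based reward."""
        pos = parent.observation["arm/position"]
        dist = sum((p - t) ** 2 for p, t in zip(pos, self._target)) ** 0.5
        obs = dict(parent.observation, target=self._target)
        return replace(parent, observation=obs, reward=-dist)

    def embed_action(self, agent_action: Sequence[float],
                     parent_dims: int) -> Tuple[float, ...]:
        """Embeds the arm-only action in the full action; other dims get 0.0."""
        padding = (0.0,) * (parent_dims - self._arm_dims)
        return tuple(agent_action) + padding


# Assumed top-level timestep from the single environment.
parent_step = TimeStep(observation={"arm/position": (0.0, 3.0)})
task = ReachSubTask(target=(4.0, 0.0), arm_dims=2)

agent_step = task.transform_timestep(parent_step)
full_action = task.embed_action((0.1, -0.2), parent_dims=3)
```

The agent sees only the transformed timestep and emits an arm-sized action, while the top-level environment still receives a full-sized action. Under these assumptions, several such subtasks can share one environment without each needing its own.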
You then compose SubTasks with regular RL-agents to form modules, and use a
set of graph-building operators to define the flow of these modules over time
(hence the name AgentFlow).
The graph-building step is entirely optional, and is intended only for use-cases that require something like a (possibly learnable, possibly stochastic) state-machine.
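As a rough illustration of a stochastic state-machine over modules, the sketch below runs one module per step and then samples which module runs next. It does not use AgentFlow's graph-building operators: `run_flow`, the module names `reach` and `grasp`, and the transition table are hypothetical. The fixed probabilities stand in for values that could in principle be learned.

```python
import random
from typing import Callable, Dict, List, Sequence, Tuple

# Each hypothetical module maps an observation to an action description.
Module = Callable[[float], str]
# Current module name -> list of (next module name, probability).
Transitions = Dict[str, List[Tuple[str, float]]]


def run_flow(modules: Dict[str, Module], transitions: Transitions,
             start: str, observations: Sequence[float],
             rng: random.Random) -> List[Tuple[str, str]]:
    """Runs one module per observation, then samples the next module."""
    current = start
    trace = []
    for obs in observations:
        trace.append((current, modules[current](obs)))
        names, probs = zip(*transitions[current])
        current = rng.choices(names, weights=probs, k=1)[0]
    return trace


modules = {
    "reach": lambda obs: f"move_toward({obs})",
    "grasp": lambda obs: "close_gripper",
}

# Stochastic flow: after reaching, either keep reaching or try a grasp.
stochastic = {
    "reach": [("reach", 0.5), ("grasp", 0.5)],
    "grasp": [("reach", 1.0)],
}
trace = run_flow(modules, stochastic, "reach", [0.1, 0.2, 0.3],
                 random.Random(0))

# Deterministic flow: a plain alternating sequence of the two modules.
alternating = {
    "reach": [("grasp", 1.0)],
    "grasp": [("reach", 1.0)],
}
fixed_trace = run_flow(modules, alternating, "reach", [0.1, 0.2, 0.3],
                       random.Random(0))
```

With all transition probabilities set to 1.0, the same function reduces to a fixed sequence of modules, which matches the point that the stochastic machinery is only needed for use-cases that call for it.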
Components
Control Flow
Examples
File details
Details for the file dm_robotics_agentflow-0.8.1-py3-none-any.whl.
File metadata
- Download URL: dm_robotics_agentflow-0.8.1-py3-none-any.whl
- Upload date:
- Size: 143.9 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.1.0 CPython/3.11.8
File hashes
Algorithm | Hash digest
---|---
SHA256 | 3693d59b2010ef0b7ac3b598e0f98c9087345b0e9c9818131b00a1d8f276f395
MD5 | f5ed120ebd948e2ce405b9ccee8104c8
BLAKE2b-256 | 5569c17bdb6fa73bc797ed68c7f437abdc1463aaf1cbd748dc8219be732e7080