Catalog

Browse the open RL environment ecosystem

A working directory of 674 open-source environments — pulled from the hubs where they actually live, and mapped to where the open community is crowded, thin, or wide open.

Sourced from the Prime Intellect Environments Hub (109), the OpenReward registry (311), AgentBeats (18 benchmarks), Harbor (200 datasets), and a curated set of 36 anchor environments. A snapshot, not a live mirror — counts and links reflect each project at review time.

Where environments live

The center of gravity is no longer any single trainer — it is the registries and standards that let an environment be published once and reused anywhere.

The directory

Search and filter the catalogue by source or capability. Every card links to the environment’s home — a Hub package, a repo, or a task suite.

674 of 674

Mapped to the eight domains

How that catalogue lands across PostTrain Arena’s under-served domains. Four are effectively greenfield, two are thin, and two are mature — that asymmetry is the opportunity.

SciencesThin

Math and reasoning are well-served; wet-lab chemistry, biology, and open-ended discovery are sparse.

Industrial & Energy OperationsGreenfield

Only abstract operations exist open-source; real grid and building control means adopting external suites.

CybersecurityThin

Strong evaluation benchmarks exist, but trainable CTF / exploit environments are scarce.

Finance & EconomicsGreenfield

Essentially one anchor (FinRL); verifiable-reward trading is still mostly research papers.

Office & Knowledge WorkThin

Browser tasks are covered; structured office apps, spreadsheets, and enterprise analytics are not.

Media & Multimodal ContentGreenfield

Almost nothing with verifiable rewards — the thinnest domain. Coverage is incidental.

AI/ML & Agentic SystemsWell-covered

The maturest under-served domain: ML engineering, GPU kernels, AI-research task standards.

Software EngineeringSaturated

20+ open-source environments. The open question is differentiation and reward quality, not coverage.

Contribute an environment Read the spec