Scheduling Model#

Use these concepts when you need to reason about where work runs and how RLinf stores trajectory data.

Concept

What you get

Placement

How workers map onto nodes and GPUs.

Execution Modes

Collocated, disaggregated, and hybrid placement trade-offs.

Replay Buffer

Trajectory replay buffer design and sampling.