Rollout Engines#

Use these guides to bring up the inference engines RLinf rollouts talk to over HTTP — server-side processes you launch alongside (or independently of) your training run, plus the client used to call them.

Guide

What you get

SGLang Server & Router

Launch an sglang HTTP server group and an sglang router, with a single OpenAI-compatible endpoint for /generate and /v1/chat/completions.

Calling SGLang with InferenceHTTPClient

Send sync and async /generate / /v1/chat/completions requests to a router (or a single server) from your own code.

SGLang Version Switching

Switch between SGLang versions for the rollout engine.