Publications#

This page lists papers and technical reports associated with RLinf. Detailed publication pages (with results and quickstart links) are linked below.

Detailed publication pages#

Publication

Focus

Preprint

RLinf-USER

Unified system for real-world online policy learning.

arXiv:2602.07837

RLinf-VLA

Unified framework for VLA+RL training.

arXiv:2510.06710

RLinf-Co

Reinforcement learning-based sim-real co-training for VLA models.

arXiv:2602.12628

RLinf

Flexible and efficient RL system.

arXiv:2509.15965

πRL

Online RL fine-tuning for flow-based VLA models.

arXiv:2510.25889

WoVR

World model-based RL fine-tuning for VLA policies.

arXiv:2602.13977

WideSeek-R1

Exploring width scaling for broad information seeking via multi-agent reinforcement learning.

arXiv:2602.04634