f1tenth-rl-controller

ROS 2 Python package for training, evaluating, exporting, and serving PPO-based autonomous racing policies for the AutoDRIVE RoboRacer / F1TENTH-style simulator workflow.

This repository contains:

a baseline PPO racing controller compatible with the saved model artifacts already included in the repo
a higher-speed training path for new experiments
ONNX export utilities
inference benchmarking utilities for SB3, ONNX Runtime, and TensorRT
a minimal Triton model repository and client

Project Scope

The repository is centered on reinforcement learning policy inference for autonomous racing. The current policy family uses Stable-Baselines3 PPO and ROS 2 topic I/O to interact with the simulator.

At a high level, the code supports:

training a PPO controller
evaluating a saved PPO model in the simulator
exporting a trained policy to ONNX
benchmarking inference backends
serving exported models through Triton Inference Server

Repository Layout

.
├── autodrive_race/
│   ├── autodrive_race/
│   │   ├── benchmark_inference.py
│   │   ├── build_tensorrt.py
│   │   ├── constants.py
│   │   ├── env.py
│   │   ├── eval.py
│   │   ├── export_onnx.py
│   │   ├── final_test_training.py
│   │   ├── inference_utils.py
│   │   ├── ppo_training.py
│   │   ├── triton_client.py
│   │   └── utils.py
│   ├── best_model/
│   ├── checkpoints/
│   ├── package.xml
│   └── setup.py
└── triton_model_repo/
    └── f110_policy/

Policy Paths

The repo currently contains two main policy paths.

1. Baseline Path

The baseline path is the one used by the saved PPO model already stored in the repository.

entry point: autodrive_race.ppo_training
evaluation entry point: autodrive_race.eval
environment factory: make_baseline_env
saved-model contract:
- observation shape: 180
- action space: Discrete(25)

This path is the compatibility-preserving path. If you want to resume from the included checkpoints or evaluate the included best model, use this one.

2. Advanced Path

The advanced path is a train-from-scratch higher-speed variant.

entry point: autodrive_race.final_test_training
environment factory: make_advanced_env
contract:
- observation shape: 242
- action space: MultiDiscrete([4, 11])
- observation contents:
  - 240 LiDAR features
  - 2 previous-action features

This path is intended for new experiments and does not load the legacy baseline weights.

Requirements

Minimum expected software stack:

Ubuntu with ROS 2 Humble
Python 3.10+
colcon
simulator / devkit environment that publishes and consumes the expected AutoDRIVE topics

For training and evaluation:

stable-baselines3
gymnasium
numpy
rclpy
ROS 2 message packages used by the simulator

For ONNX export:

torch
onnx
onnxruntime

For TensorRT benchmarking / engine build:

tensorrt
pycuda
trtexec

For Triton client usage:

tritonclient[grpc] or tritonclient[http]

ROS 2 Workspace Setup

Clone the repository into a ROS 2 workspace and build the package.

mkdir -p ~/roboracer_ws/src
cd ~/roboracer_ws/src
git clone https://github.com/1Kaustubh122/f1tenth-rl-controller.git

cd ~/roboracer_ws
source /opt/ros/humble/setup.bash
colcon build --symlink-install --packages-select autodrive_race
source install/setup.bash

Expected ROS Topics

The environments in this repository expect the simulator bridge to provide the following topics:

/autodrive/roboracer_1/lidar
/autodrive/roboracer_1/collision_count
/autodrive/roboracer_1/last_lap_time
/autodrive/roboracer_1/lap_count

The controller publishes:

/autodrive/roboracer_1/throttle_command
/autodrive/roboracer_1/steering_command

Training

Before launching training, make sure:

the simulator is running
the AutoDRIVE bridge is running
the ROS 2 graph is healthy and the LiDAR topic is active

Baseline Training

Resume from the latest compatible checkpoint, or start a fresh baseline run if none exists:

source /opt/ros/humble/setup.bash
source ~/roboracer_ws/install/setup.bash
ros2 run autodrive_race ppo_training -- --timesteps 1000000 --checkpoint-freq 10000

Outputs:

checkpoints: autodrive_race/checkpoints/
best model: autodrive_race/best_model/
eval logs: autodrive_race/logs/

Advanced Training

Train the higher-speed model:

source /opt/ros/humble/setup.bash
source ~/roboracer_ws/install/setup.bash
ros2 run autodrive_race final_test_training -- \
  --device cuda \
  --timesteps 3000000 \
  --learning-rate 1e-4 \
  --n-steps 8192 \
  --batch-size 1024 \
  --checkpoint-freq 10000

If GPU memory is tighter than expected, use:

ros2 run autodrive_race final_test_training -- \
  --device cuda \
  --timesteps 3000000 \
  --learning-rate 1e-4 \
  --n-steps 4096 \
  --batch-size 512 \
  --checkpoint-freq 10000

Outputs:

checkpoints: autodrive_race/advanced_checkpoints/
best model: autodrive_race/advanced_best_model/best_model.zip
eval logs: autodrive_race/advanced_logs/

Evaluation

Evaluate the default saved model:

source /opt/ros/humble/setup.bash
source ~/roboracer_ws/install/setup.bash
ros2 run autodrive_race eval -- --steps 5000

Evaluate a specific model:

ros2 run autodrive_race eval -- --model-path /absolute/path/to/model.zip --steps 5000

The evaluator inspects the saved-model contract and selects the appropriate environment automatically.

Export to ONNX

Export a PPO model to ONNX:

source /opt/ros/humble/setup.bash
source ~/roboracer_ws/install/setup.bash
ros2 run autodrive_race export_onnx -- \
  --model-path /absolute/path/to/model.zip \
  --output /absolute/path/to/model.onnx \
  --opset 17

Copy the exported model into the Triton repository structure at the same time:

ros2 run autodrive_race export_onnx -- \
  --model-path /absolute/path/to/model.zip \
  --output /absolute/path/to/model.onnx \
  --copy-to-triton-repo

The export path performs a deterministic-action sanity check with ONNX Runtime when the required dependencies are installed.

Benchmark Inference

Compare SB3, ONNX Runtime, and TensorRT:

source /opt/ros/humble/setup.bash
source ~/roboracer_ws/install/setup.bash
ros2 run autodrive_race benchmark_inference -- \
  --model-path /absolute/path/to/model.zip \
  --onnx-path /absolute/path/to/model.onnx \
  --engine-path /absolute/path/to/model.plan \
  --warmup 20 \
  --runs 200 \
  --device cuda

Write results to a JSON file:

ros2 run autodrive_race benchmark_inference -- \
  --model-path /absolute/path/to/model.zip \
  --onnx-path /absolute/path/to/model.onnx \
  --output /absolute/path/to/results.json

If ONNX Runtime or TensorRT is unavailable, the script reports that backend as unavailable instead of crashing the whole benchmark run.

Build a TensorRT Engine

If trtexec is installed:

source /opt/ros/humble/setup.bash
source ~/roboracer_ws/install/setup.bash
ros2 run autodrive_race build_tensorrt -- \
  --onnx-path /absolute/path/to/model.onnx \
  --engine-path /absolute/path/to/model.plan \
  --model-path /absolute/path/to/model.zip \
  --fp16

Triton Inference Server

This repository includes a minimal Triton model repository:

triton_model_repo/f110_policy/
├── config.pbtxt
└── 1/

After exporting a model into the Triton repository location, launch Triton with that repository:

tritonserver --model-repository /absolute/path/to/f1tenth-rl-controller/triton_model_repo

Triton Client

Send a single observation to Triton and decode the returned action into throttle and steering:

source /opt/ros/humble/setup.bash
source ~/roboracer_ws/install/setup.bash
ros2 run autodrive_race triton_client -- \
  --model-path /absolute/path/to/model.zip \
  --url localhost:8001 \
  --protocol grpc

Optionally provide an explicit observation:

ros2 run autodrive_race triton_client -- \
  --model-path /absolute/path/to/model.zip \
  --observation "0.2,0.3,0.4,..."

Simulator Notes

For LiDAR-only training, no-graphics / headless simulator mode is usually preferable because it avoids rendering overhead.
GUI mode is useful for visual inspection and playback of a trained model.
If your simulator stack distinguishes between true headless camera rendering and no-graphics mode, check the simulator’s own documentation for camera-specific limitations. This repository’s policies are LiDAR-based.

Public Usage Notes

This repository does not assume any private shell aliases.
All runnable entry points are exposed through ROS 2 console scripts.
Use absolute paths when passing model or artifact locations unless you are sure about your working directory.

License

See LICENSE.

Name		Name	Last commit message	Last commit date
Latest commit History 26 Commits
autodrive_race		autodrive_race
triton_model_repo/f110_policy		triton_model_repo/f110_policy
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

f1tenth-rl-controller

Project Scope

Repository Layout

Policy Paths

1. Baseline Path

2. Advanced Path

Requirements

ROS 2 Workspace Setup

Expected ROS Topics

Training

Baseline Training

Advanced Training

Evaluation

Export to ONNX

Benchmark Inference

Build a TensorRT Engine

Triton Inference Server

Triton Client

Simulator Notes

Public Usage Notes

License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

f1tenth-rl-controller

Project Scope

Repository Layout

Policy Paths

1. Baseline Path

2. Advanced Path

Requirements

ROS 2 Workspace Setup

Expected ROS Topics

Training

Baseline Training

Advanced Training

Evaluation

Export to ONNX

Benchmark Inference

Build a TensorRT Engine

Triton Inference Server

Triton Client

Simulator Notes

Public Usage Notes

License

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages