6 Known Issues

Known issues in the evaluation pipeline and models.

6.1 Multi-GPU Training Can Slightly Degrade Performance

Training on 4 GPUs sometimes yields marginally lower closed-loop performance than single-GPU training. The effect is small and doesn’t change qualitative conclusions, but appears consistently in certain runs.

6.2 Static Graph Can Degrade Performance

We avoid static_graph in the pipeline due to observed performance issues.

6.3 CARLA Waypoint PID Controller Needs Better Tuning

A well-tuned controller (e.g., MPC) can significantly improve performance. Preliminary experiments showed ~5-7 DS improvement on Bench2Drive for TFv5, though these numbers are approximate since controller tuning wasn’t the focus.

6.4 CARLA 0.9.16 Has Goal-Point Issues

CARLA 0.9.16 currently has problems with the goal-point pipeline that degrade policy behavior. We don’t recommend evaluating models on this version.