A few demonstrations of the working policy can be found at this link
The source code is in tadpo.py, which implements the TADPO algorithm using the stable_baselines3 framework. The code used to generate the gym environments is not shared because it can only be run with BeamNG, a proprietary simulator.
The requirements are:
- numpy
- torch
- gymnasium
- stable_baselines3
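
As a quick sanity check before running tadpo.py, the following minimal sketch reports which of the packages listed above cannot be imported in the current environment. The helper name missing_requirements and the script itself are illustrative assumptions and are not part of this repository.

```python
import importlib.util

# Import names taken from the requirements list above.
REQUIRED = ["numpy", "torch", "gymnasium", "stable_baselines3"]


def missing_requirements(names):
    """Return the names that cannot be located as importable modules.

    Hypothetical helper for illustration; it only checks that each
    top-level module can be found, not which version is installed.
    """
    return [name for name in names if importlib.util.find_spec(name) is None]


if __name__ == "__main__":
    missing = missing_requirements(REQUIRED)
    if missing:
        print("Missing packages:", ", ".join(missing))
    else:
        print("All required packages are importable.")
```

Any package reported as missing can typically be installed with pip. Note that stable_baselines3 is published on PyPI under the name stable-baselines3.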