Path planning with PPO

Hello to everyone!!!

After training period we have the trained Neural Network.
In my case I have an agent who navigates from one initial point to another.
If I want to export the only one path (the positions from the beginning to the goal )with the max reward, how I can do this?

Thank you in advance!!!

Hi @dani_kal ,
Do I understand correctly that you want to have the trained neural network do the pathfinding over and over and only export the path with the maximum reward? What kind of path is it? Would another algorithm suffice for this task such as A*?

Good evening !!
Thank you for your answer!!!
Specifically,I have trained an agent to go from an initial point to a final.

Lets say that we have an agent at position (-5,-5) and his goal is at (5,5) with obstacles between them.
After the training, the agent knows how to navigate from the start-point to the goal-point.
I want to extract this path (which is the path with the maximum reward).
This would be something like [(-5,-5)( -4,-4), (-4,-3) … (5,5)]. These are the positions in which the agent was found from the beginning to the end with the max reward.
My question is that I have to save each time the positions and the rewards during the whole training and find from them the max reward so the optimal path.
Or is there another way that we can have it?

Thank you again!!!

Once your agent is trained to follow the optimal path, you could then run the agent in inference mode with your own script that records his actions.

1 Like

Ok !!!Thank you very much!!!