GAIL + BC + PPO Suddenly Regresses Learning

unity_D7A3B4E547B316A9E49C · July 14, 2022, 11:31pm

I am training an ml agent model in which it has to collect food , deposit it and every once in a while take shelter i.e. go to a safe spot in the world and stand still. I used GAIL and BC on top of PPO. On training the model my agents picked up the right actions very quickly but a few hours in they started regressing from the behavior and started to perform random incorrect actions. Im struggling to understand why that happened? Any insights would be really helpful.

For reference :- Gail Training parameters :- strength - 0.1, use_actions : true, use_vail : true , learning_rate:0.0009

BC training parameters :strength:0.1
Extrinsic Reward:- strength : 1

HyperParamters : beta :- 0.005
Learning Rate :- 0.0009

Need some help badly.

unity_D7A3B4E547B316A9E49C · July 15, 2022, 4:45am

Attached my training parameters for reference!

Topic		Replies	Views
PPO+GAIL performing worse than only PPO Unity Engine ML-Agents , com_unity_ml-agents	4	1481	January 2, 2022
Question about bc+gail+extrinsic reward Unity Engine ML-Agents , Question , com_unity_ml-agents	2	778	April 16, 2023
GAIL vs Behavioral Cloning, what's the difference? Unity Engine ML-Agents , com_unity_ml-agents	11	6072	July 27, 2021
Help needed with training an agent using GAIL and RL Unity Engine ML-Agents , Advanced , Question , 6-0	0	93	April 13, 2025
Ml-Agents suddenly gives up Unity Engine ML-Agents , Question , com_unity_ml-agents	2	809	February 20, 2023

GAIL + BC + PPO Suddenly Regresses Learning

Related topics