Self-play and multi-agent reinforcement learning

Hi @unity_6gCk04bKBvTBXw ,
You already have a thread open where we are trying to help you with this issue. Please keep questions to that thread .