while game over add remaintime/15 to every step's rewards. to improve this round's training weight. fix get target from states still using onehot decoder bug. |
||
---|---|---|
.. | ||
AimBotEnv-old.py | ||
AimbotEnv.py | ||
MultiNN-PPO.py | ||
ppo.py | ||
testarea.ipynb | ||
testEnv.py |