Aimbot-PPO/Aimbot-PPO-Python/Pytorch
Koha9 895cd5c118 Add EndReward Broadcast function
When the game is over, add remaintime/15 to every step's reward to increase this round's training weight.
Fix a bug where getting the target from states still used the one-hot decoder.
2022-12-03 03:58:19 +09:00
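The EndReward broadcast described in the commit message above can be illustrated with a minimal sketch. This is not the code from MultiNN-PPO.py, which is not shown in this listing. The function name `broadcast_end_reward`, its parameters, and the sample values are hypothetical; only the `remaintime/15` bonus comes from the commit message. The sketch assumes the bonus is applied to all steps of the finished episode, regardless of how the episode ended.

```python
from typing import List


def broadcast_end_reward(
    step_rewards: List[float],
    remain_time: float,
    divisor: float = 15.0,  # the commit message uses remaintime/15
) -> List[float]:
    """Return a copy of step_rewards with remain_time / divisor added to each step.

    Hypothetical helper: the name and signature are assumptions for illustration.
    """
    bonus = remain_time / divisor
    # Every step of the finished episode receives the same bonus,
    # so an episode that ends with more time remaining is weighted more heavily.
    return [reward + bonus for reward in step_rewards]


# Example with made-up values: a 4-step episode that ended with 30 time units left.
episode_rewards = [0.0, -0.1, 0.5, 10.0]
boosted = broadcast_end_reward(episode_rewards, remain_time=30.0)
print(boosted)
```

Because the same bonus is added to every step, the relative ordering of rewards within the episode is unchanged; only the episode's overall return shifts upward.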
AimBotEnv-old.py Parallel Environment Discrete PPO finish 2022-10-30 04:13:14 +09:00
AimbotEnv.py Side Channel added 2022-11-30 06:45:07 +09:00
MultiNN-PPO.py Add EndReward Broadcast function 2022-12-03 03:58:19 +09:00
ppo.py Add Multi-NN agent 2022-12-01 19:55:51 +09:00
testarea.ipynb Add EndReward Broadcast function 2022-12-03 03:58:19 +09:00
testEnv.py Parallel Environment Discrete PPO finish 2022-10-30 04:13:14 +09:00