Aimbot-PPO

Koha9/Aimbot-PPO

Fork 0

Commit Graph

Author	SHA1	Message	Date
Koha9	895cd5c118	Add EndReward Broadcast function while game over add remaintime/15 to every step's rewards. to improve this round's training weight. fix get target from states still using onehot decoder bug.	2022-12-03 03:58:19 +09:00
Koha9	3930bcd953	Add Multi-NN agent Add Multi neural network in output layer use different nn while facing to different target.	2022-12-01 19:55:51 +09:00

Author

SHA1

Message

Date

Koha9

895cd5c118

Add EndReward Broadcast function

while game over add remaintime/15 to every step's rewards. to improve this round's training weight.
fix get target from states still using onehot decoder bug.

2022-12-03 03:58:19 +09:00

Koha9

3930bcd953

Add Multi-NN agent

Add Multi neural network in output layer
use different nn while facing to different target.

2022-12-01 19:55:51 +09:00

2 Commits