Commit Graph

6 Commits

Author SHA1 Message Date
34206b95c5 Multi Agent Each Type Action Select Style
Multi Agent Each Type Action Select Style.
waste too much time
2022-12-14 09:01:29 +09:00
1787872e82 wrong remain Time Fix
wrong remain Time Fix, what a stupid mistake...
and fix doubled WANDB writer
2022-12-04 09:20:05 +09:00
ad9817e7a4 Totally disparate NN by target
Totally disparate NN by target.
2022-12-03 21:35:33 +09:00
cbc385ca10 Change training dataset storage method
save training dataset by it target type.
while training NN use single target training set to backward NN.
this improve at least 20 times faster than last update!
2022-12-03 07:54:38 +09:00
895cd5c118 Add EndReward Broadcast function
while game over add remaintime/15 to every step's rewards. to improve this round's training weight.
fix get target from states still using onehot decoder bug.
2022-12-03 03:58:19 +09:00
3930bcd953 Add Multi-NN agent
Add Multi neural network in output layer
use different nn while facing to different target.
2022-12-01 19:55:51 +09:00