Aimbot-PPO

Author	SHA1	Message	Date
Koha9	cbcecfa9e9	Change to LeakyReLU: change the activation function to LeakyReLU and fix some bugs. graph.py is for the thesis.	2023-03-09 18:11:00 +09:00
Koha9	0e0d98d8b1	Change parameters based on a paper, and it works!	2022-12-17 09:59:44 +09:00
Koha9	3116831ae6	Change network and fix trainset bug	2022-12-17 09:59:44 +09:00
Koha9	bf77060456	Change Critic NN to Multi-NN. Fix wrong remain time (what a stupid mistake...) and fix the doubled WANDB writer. Deeper TargetNN: make the target NN deeper; it gets the target state when it receives the hidden layer's output. Change Middle input: feed everything except the raycast input to the target network. Change activation function to Tanh; it works a little better than before.	2022-12-17 09:59:44 +09:00
Koha9	cbc385ca10	Change training dataset storage method: save the training dataset by target type. During training, use a single-target training set for each backward pass. This is at least 20 times faster than the last update!	2022-12-03 07:54:38 +09:00
Koha9	895cd5c118	Add EndReward Broadcast function: when the game is over, add remaintime/15 to every step's reward to increase this round's training weight. Fix bug where getting the target from states still used the one-hot decoder.	2022-12-03 03:58:19 +09:00
Koha9	3930bcd953	Add Multi-NN agent: add multiple neural networks in the output layer, using a different NN when facing a different target.	2022-12-01 19:55:51 +09:00
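
The EndReward Broadcast described in commit 895cd5c118 can be sketched roughly as follows. This is only an illustration of the arithmetic stated in the message (add remaintime/15 to every step's reward at game over), not the repository's actual code. The function name `broadcast_end_reward`, its parameters, and the list-based reward storage are assumptions made for the example.

```python
from typing import List


def broadcast_end_reward(
    step_rewards: List[float],
    remain_time: float,
    divisor: float = 15.0,  # the commit message states remaintime/15
) -> List[float]:
    """Return a new reward list with the end-of-game bonus added to every step.

    Hypothetical helper: the bonus is remain_time / divisor, applied
    uniformly to each step of the finished round.
    """
    bonus = remain_time / divisor
    return [reward + bonus for reward in step_rewards]


# Example: a three-step round that ended with 30 time units remaining.
rewards = broadcast_end_reward([0.0, 1.0, -0.5], remain_time=30.0)
```

Broadcasting the bonus to every step, instead of adding it only to the last one, raises the return of the whole round, which matches the stated goal of giving that round more training weight. Whether the bonus applies to every game-over or only to successful rounds is not stated in the commit message.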