Aimbot-PPO

Author	SHA1	Message	Date
Koha9	32d398dbef	Change learning timing to the end of each episode.	2022-11-16 19:40:57 +09:00
Koha9	a0895c7449	Add load & save function. Add train flag to test model. Add new action-select function for test mode. Add decision period to skip steps.	2022-11-08 23:14:34 +09:00
Koha9	474032d1e8	Hybrid discrete-continuous action, save-load, convergence observed. Add discrete and continuous actions in the same NN model. Add model save and load. Reward is increasing and convergence was observed. These two models seem good: Aimbot_9331_1667423213_hybrid_train2, Aimbot_9331_1667389873_hybrid	2022-11-03 07:16:18 +09:00
Koha9	0dbe2013ae	Weight and bias sync added.	2022-11-01 19:11:45 +09:00
Koha9	7497ffcb0f	Parallel-environment discrete PPO finished. Runnable.	2022-10-30 04:13:14 +09:00