while game over add remaintime/15 to every step's rewards. to improve this round's training weight.
fix get target from states still using onehot decoder bug.
add discrete and continuous action in same NN model.
model save and load.
reward is increasing, converge was observed.
this two models are seems good:
Aimbot_9331_1667423213_hybrid_train2
Aimbot_9331_1667389873_hybrid
add GAIL GAILMem GAILConfig Class.
add HumanAction record to save expert data.
add tackState future for stack multiple states to let agent knows what happened before.
Unity:
No more detect Closest enemy info. Add different density sensor let agent get more state information on the center of view.
Adjust Start Scene UI manager. Add in game visible rayCast & information that rayCast detect.
Python:
Start use mypy black and flake8 to format Python.