save the training dataset grouped by target type.
when training the NN, use a single-target training set for each backward pass.
this is at least 20 times faster than the last update!
on game over, add remaintime/15 to every step's reward to increase this round's training weight.
fix bug where getting the target from states still used the one-hot decoder.
add discrete and continuous actions in the same NN model.
model save and load.
reward is increasing; convergence was observed.
these two models seem good:
Aimbot_9331_1667423213_hybrid_train2
Aimbot_9331_1667389873_hybrid
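
a minimal sketch of the game-over bonus described above (remaintime/15 added to every step's reward). the function and parameter names (add_time_bonus, remain_time, divisor) are hypothetical and not taken from the repo; only the remaintime/15 rule comes from this change.

```python
def add_time_bonus(rewards, remain_time, divisor=15.0):
    """Return a new list with remain_time / divisor added to each step reward.

    Assumption: rewards holds one episode's per-step rewards, and
    remain_time is the time left when the game ended.
    """
    bonus = remain_time / divisor
    return [r + bonus for r in rewards]


# example: 30 time units left at game over gives a bonus of 2.0 per step
episode_rewards = [0.0, -1.0, 2.0]
print(add_time_bonus(episode_rewards, remain_time=30.0))  # [2.0, 1.0, 4.0]
```

because the same bonus goes to every step, rounds that finish with more time left carry more weight in the update.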