分离ppoagent,AI memory,AI Recorder 优化Aimbot Env 正规化各类命名 Archive不使用的package
add side Channel to save target win ratio. Fix some Bug
add discrete and continuous action in same NN model. model save and load. reward is increasing, converge was observed. this two models are seems good: Aimbot_9331_1667423213_hybrid_train2 Aimbot_9331_1667389873_hybrid
weight and bias sync added
Parallel Environment Discrete PPO finish. Runnable.