Build-ParallelEnv-Target-OffPolicy-SingleStack-SideChannel-EndReward-Easy-V2.1 add spin penalty while agent keep spin will give a penalty reward. lower Go target in area reward. |
||
---|---|---|
Assets | ||
Packages | ||
ProjectSettings | ||
UserSettings | ||
.gitignore | ||
.vsconfig | ||
README.md |