Forceful_Dragon posted... Someone in a comment just mentioned the Surf HM in the Safari Zone.
How on earth will the AI randomly navigate through the safari zone even one time to trigger the positive feedback? I guess you put a point reward on "enter safari zone" to teach it that attempting the SZ is a good idea. But other than directly programming it to go the right way I'm not sure how it will be able to develop a habit for reaching the right spot in the zone.
It's just about 300 steps exactly to reach the right spot and moving correctly 300/500 times feels near impossible. I guess you would have to remove the step limit like they did in Twitch Plays Pokemon?
I'm not even all that confident the AI can learn how to cut a bush