A Reinforcement Learning Agent

I finally got around to putting up my code for a sarsa agent. A couple of things to note: The underlying function approximation is accomplished through a CMAC. Note that my CMAC implementation uses Python native hashing. In particular, if you know anything about CMAC, this is not memory bounded, as Python hashes are designed [...]