I want to implement a reinforcement learning algorithm in Python and PyMC3 seems to be the most elaborate package for probabilistic modeling if have seem so far.
Has this been tried or are there any hints on how I could proceed? Am I in completely the wrong place ?
I have. But the latest commit is from '15 and 225 GitHub stars vs. 3,369 is considerable
I like the approach how probabilistic language is mapped onto python and I have the feeling MCMC and VI could be useful in this context too.