Maybe the encodings got mixed up (or which category is censored and which is not), or the lecture is actually modeling the rate directly rather than the mean (that is, a and not 1/a). I'm only guessing here, though.
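To make the rate-vs-mean point concrete, here is a minimal stdlib-only sketch. The value `a = 4.0` and the sample size are made up for illustration and are not taken from the lecture or your model. It just shows how far apart the results are depending on whether `a` is read as the rate or as the mean of an exponential:

```python
import random
import statistics

random.seed(0)

a = 4.0        # hypothetical parameter value, chosen only for illustration
n = 100_000    # number of simulated draws

# Interpretation 1: a is the rate, so the mean waiting time is 1/a.
draws_a_is_rate = [random.expovariate(a) for _ in range(n)]

# Interpretation 2: a is the mean, so the rate passed to the sampler is 1/a.
draws_a_is_mean = [random.expovariate(1.0 / a) for _ in range(n)]

mean_if_a_is_rate = statistics.fmean(draws_a_is_rate)
mean_if_a_is_mean = statistics.fmean(draws_a_is_mean)

print(f"a as rate: sample mean ~ {mean_if_a_is_rate:.3f} (theory: {1 / a})")
print(f"a as mean: sample mean ~ {mean_if_a_is_mean:.3f} (theory: {a})")
```

If your estimates look roughly like the reciprocal of the lecture's, that would point to this kind of parameterization mismatch rather than a bug in the sampler.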
About the initvals: why not use a lognormal prior, or some other prior with only positive support, instead of a normal prior?
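A rough sketch of what I mean, using only the standard library. The prior parameters below (normal with mean 1 and sd 1, lognormal with mu 0 and sigma 1) are my own placeholders, not the ones from your model. A normal prior puts real mass on non-positive values, which is what can force you to hand-pick initvals, while a lognormal never does:

```python
import random

random.seed(1)

n = 10_000  # number of prior draws to simulate

# Hypothetical normal prior: some draws fall at or below zero.
normal_draws = [random.gauss(1.0, 1.0) for _ in range(n)]

# Hypothetical lognormal prior: exp of a normal draw, so always positive.
lognormal_draws = [random.lognormvariate(0.0, 1.0) for _ in range(n)]

frac_nonpositive_normal = sum(x <= 0 for x in normal_draws) / n
frac_nonpositive_lognormal = sum(x <= 0 for x in lognormal_draws) / n

print(f"normal prior, share of draws <= 0:    {frac_nonpositive_normal:.3f}")
print(f"lognormal prior, share of draws <= 0: {frac_nonpositive_lognormal:.3f}")
```

With a positive-only prior the sampler cannot start from (or propose) an invalid negative value for that parameter, so the initvals workaround may not be needed at all.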