Hi @chris,
Actually, I don’t think my suggestion of using a softmax is well thought. Looking at the data generation process, each row is not necessary sum to 1 right? If you have K > 2 you can try modeling the pi using a softmax, but I am not sure it is better than using Dirichlet (which use a stick-breaking process internally).
1 Like