Can you marginalize a mixture model where the draws from the different components are not independent?

Interesting, I didn’t know about this model. I think it could work, although it would take a little bit of math to figure out the correct kernel. But would this actually be faster? You’d get a very deep graph which seems like it would be slow to sample from?