Why does MAP vastly outperform sample in bayesian clustering?

Also note that MvNormal is not a silver bullet. if you have a lot of vertical overlap across clusters you may still have switching problems.

Depending on the problem you may want to sort y means instead or perhaps use a different space representation (like polar coordinates) that better disambiguates the mean coordinates.

1 Like