Can you marginalize a mixture model where the draws from the different components are not independent?

The transition kernel would have to depend on the fraction of previous observations that are in the 1 state, which introduces a long-range dependence.