Kernel hyper-parameters priors and their use in MAP estimates of Gaussian Processes

Oh, now I follow. You are right! find_MAP does consider the conditional p(y \mid \theta, X) too and thus optimize for the values of hyper parameters \theta.