Develop multi-output GP model with linear mean for each task and with learnable hyperparameters

After examining the above resources. I am still confused. from the paper and the lecture, it seems that the application of multi-output method is based on the assumption that the distribution mean is zero. However, I am wondering if it’s possible to extend its application so that each task has a mean with learnable parameters
Thank you