Gaussian Process regression with categorical features

bwengals · June 13, 2019, 2:28am

Yep, different kernels can use different features. Use the input_dim and active_dims parameters of each kernel. This convention was shamelessly stolen from GPy and GPflow because it’s a good idea. input_dim is the total number of columns of X, and active_dims is the list of column indices that an individual kernel is applied to. So for your case, you’d write:

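# ls1 and ls2 are placeholder names for lengthscales defined elsewhere in the model;
# each stationary kernel needs ls (or ls_inv) in addition to input_dim and active_dims.
cov = (
    nu**2
    * pm.gp.cov.Matern32(input_dim=X.shape[1], ls=ls1, active_dims=features1)
    * pm.gp.cov.ExpQuad(input_dim=X.shape[1], ls=ls2, active_dims=features2)
)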
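
To make the column selection concrete, here is a rough NumPy sketch of what a product kernel with active_dims computes. This is not PyMC's implementation, just the same formulas written out by hand. The helper names (sq_dist, expquad, matern32), the toy X, and the values of nu and the lengthscales are all made up for illustration.

```python
import numpy as np


def sq_dist(A, B):
    """Pairwise squared Euclidean distances between the rows of A and B."""
    diff = A[:, None, :] - B[None, :, :]
    return np.sum(diff ** 2, axis=-1)


def expquad(X, Y, ls, active_dims):
    """Exponentiated quadratic kernel evaluated only on the active_dims columns."""
    Xa, Ya = X[:, active_dims], Y[:, active_dims]
    return np.exp(-0.5 * sq_dist(Xa, Ya) / ls ** 2)


def matern32(X, Y, ls, active_dims):
    """Matern 3/2 kernel evaluated only on the active_dims columns."""
    Xa, Ya = X[:, active_dims], Y[:, active_dims]
    r = np.sqrt(sq_dist(Xa, Ya)) / ls
    return (1.0 + np.sqrt(3.0) * r) * np.exp(-np.sqrt(3.0) * r)


# Toy data: 5 rows, 3 columns (hypothetical values).
rng = np.random.default_rng(0)
X = rng.normal(size=(5, 3))

features1 = [0, 1]  # columns seen by the Matern32 factor
features2 = [2]     # column seen by the ExpQuad factor
nu = 1.5            # amplitude (illustrative value)

# Product kernel: each factor only looks at its own columns.
K = nu ** 2 * matern32(X, X, 1.0, features1) * expquad(X, X, 2.0, features2)
```

Each factor ignores the columns outside its active_dims, so changing column 2 of X leaves the Matern32 factor untouched, and the elementwise product of the two factors is still a valid covariance matrix.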

Regarding your first question, there aren’t any built-in kernels specifically for dealing with categorical inputs. One option is to one-hot encode your categories and use a standard ExpQuad on those columns. You can also define a custom kernel pretty easily (see here). Or you could use the Coregionalization kernel, possibly without modification.
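
For the one-hot option, a small NumPy sketch shows what an exponentiated quadratic kernel ends up doing on the encoded columns. The category labels and the lengthscale below are made up, and the kernel is written out by hand rather than taken from PyMC.

```python
import numpy as np

# Hypothetical categorical feature with three levels.
categories = np.array(["a", "b", "a", "c"])

# One-hot encode: one column per level, a single 1 per row.
levels, codes = np.unique(categories, return_inverse=True)
one_hot = np.eye(len(levels))[codes]

# Exponentiated quadratic kernel on the one-hot columns.
ls = 1.0  # lengthscale (illustrative value)
diff = one_hot[:, None, :] - one_hot[None, :, :]
K = np.exp(-0.5 * np.sum(diff ** 2, axis=-1) / ls ** 2)

# Rows with the same category have squared distance 0, so K = 1.
# Rows with different categories have squared distance 2, so K = exp(-1 / ls**2).
```

So with a single shared lengthscale, every pair of distinct categories gets the same correlation, exp(-1 / ls**2), and the lengthscale controls how much information is shared across categories. In a model, the one-hot columns would sit next to the continuous columns of X, with active_dims pointing the kernel at them.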
