Models with low observed uncertainty compared to rv uncertainty

NateAM · October 29, 2018, 4:23pm

I am trying to model the output of a complicated process using surrogate models (I’ve asked several questions about this already). My surrogate model is a Gaussian process fit to data generated by my expensive model. The GP is set up so that the parameters of my expensive model are the inputs and the measurements of interest are the outputs.

The measurements are known with a high degree of certainty compared to the uncertainties on my parameters: if the variation in model output due to my parameters is of order 1, the variation in my measured test data is of order 1e-3 to 1e-4. I tried defining my likelihood like this:

unknown_sigma = pm.HalfNormal('unk_sigma', sd=some_value)
sigma = pm.Deterministic('sigma', unknown_sigma + observed_sigma)
y = pm.Normal('y', mu=surrogate_out, sd=sigma, observed=data)

but I get a lot of divergences. Does anyone have any ideas on how to reparameterize this model so that I could avoid this problem?
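For reference, the likelihood above sums the two standard deviations directly. The sketch below puts that rule next to summing in quadrature, which is exact only if the two noise sources are independent and Gaussian. That independence is an assumption, not something stated in the post. The function names and numeric values are hypothetical.

```python
import math


def combine_linear(unknown_sigma, observed_sigma):
    """Sum the standard deviations directly, as in the likelihood above."""
    return unknown_sigma + observed_sigma


def combine_quadrature(unknown_sigma, observed_sigma):
    """Sum the variances (assumes independent Gaussian noise terms)."""
    return math.hypot(unknown_sigma, observed_sigma)


observed_sigma = 1e-3  # illustrative measurement noise, not from the post

for unknown_sigma in (0.0, 1e-3, 1e-1):
    linear = combine_linear(unknown_sigma, observed_sigma)
    quadrature = combine_quadrature(unknown_sigma, observed_sigma)
    print(f"unknown={unknown_sigma:g} linear={linear:g} quadrature={quadrature:g}")
```

Either way the total can never drop below observed_sigma, so the likelihood stays very narrow whenever unknown_sigma is small.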

junpenglao · October 30, 2018, 10:11am

Difficult to say without the data and the model, but since the variation in your measured data is so small (order 1e-3 to 1e-4), did you try scaling the measured data?

For example, putting it into a range of 0 - 1 or 0 - 10 would be easier to handle numerically.
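A minimal sketch of that kind of rescaling, in plain Python so it has no dependencies (the same arithmetic works on NumPy arrays). The data values, observed_sigma, and the helper names fit_minmax and apply_scale are made up for illustration.

```python
def fit_minmax(values, lo=0.0, hi=1.0):
    """Return (scale, offset) such that v * scale + offset maps values onto [lo, hi]."""
    vmin, vmax = min(values), max(values)
    if vmax == vmin:
        raise ValueError("cannot rescale constant data")
    scale = (hi - lo) / (vmax - vmin)
    offset = lo - vmin * scale
    return scale, offset


def apply_scale(values, scale, offset):
    """Apply the affine map to every value."""
    return [v * scale + offset for v in values]


data = [2.0013, 2.0021, 2.0008, 2.0030]  # hypothetical measurements
observed_sigma = 1e-4                    # hypothetical measurement noise

scale, offset = fit_minmax(data)
scaled_data = apply_scale(data, scale, offset)
# A standard deviation picks up the scale factor but not the offset.
scaled_sigma = observed_sigma * scale

print(scaled_data)
print(scaled_sigma)
```

The surrogate output would need the same scale and offset applied before it is compared with the scaled data. Note that the ratio of noise to data spread is unchanged by an affine map like this; only the absolute magnitudes change.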

NateAM · October 30, 2018, 12:53pm

I haven’t tried scaling the data yet, though I can give that a shot. I will still have a much larger variance in the output than the input.

Is there a way to start with a higher tolerance and, either in sequential samplings or during a single trace, decrease it once the parameters have been better estimated?

junpenglao · October 30, 2018, 12:54pm

Not sure I understand: tolerance in terms of what? scaling?

NateAM · October 30, 2018, 1:46pm

Sorry, that is what I mean. What I was wondering is whether it is possible to do something like:

with pm.Model() as model:
    # define my model here...

    y = pm.Normal('y', mu=surrogate_out, sd=factor * sigma, observed=data)

where factor can start as a large number and be gradually decreased as the sampling converges?

junpenglao · October 30, 2018, 3:48pm

I don’t think that is a valid thing to do. First, you need to know what kind of convergence you are looking for (i.e., how you define convergence); and second, I think it is more important to investigate why a model does not converge (or produces divergent samples). An automatic process doesn’t seem to be the answer here.

NateAM · October 30, 2018, 4:49pm

That makes sense.

What I think is happening is that, because my observed variance is so small, the normal distribution I’m using for y just can’t find the observed value. My thought was to artificially inflate my observed sigma using the factor to some value at which I could converge, and then use the resulting median of my trace as an initial guess as I reduced the value of the factor.
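A rough illustration of the effect described above, not a sampler: it shows how far a fixed mismatch sits from the mean, in units of sigma, as the factor shrinks toward 1. The residual, observed_sigma, the geometric schedule, and the helper names are all assumptions made for this sketch.

```python
import math


def normal_logpdf(x, mu, sigma):
    """Log density of Normal(mu, sigma) evaluated at x."""
    z = (x - mu) / sigma
    return -0.5 * z * z - math.log(sigma) - 0.5 * math.log(2.0 * math.pi)


def factor_schedule(start, n_stages):
    """Geometrically decreasing factors from `start` down to exactly 1.0."""
    if n_stages < 2 or start <= 1.0:
        raise ValueError("need n_stages >= 2 and start > 1")
    ratio = start ** (-1.0 / (n_stages - 1))
    factors = [start * ratio ** i for i in range(n_stages)]
    factors[-1] = 1.0  # pin the final stage to the uninflated sigma
    return factors


observed_sigma = 1e-3  # hypothetical measurement noise
residual = 1e-2        # hypothetical gap between surrogate output and a datum

for factor in factor_schedule(start=100.0, n_stages=3):
    sigma = factor * observed_sigma
    z = residual / sigma
    logp = normal_logpdf(residual, 0.0, sigma)
    print(f"factor={factor:g} z={z:.3g} logp={logp:.3f}")
```

With a large factor the same residual is a fraction of a sigma away, while at factor 1 it is many sigmas away and the log density drops steeply. Only the final stage (factor equal to 1) targets the actual posterior; anything sampled at a larger factor would at best serve as a starting point, and the caveat above about defining convergence still applies.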
