Adding observed to result of maximum

jlindbloom · August 1, 2021, 4:13pm

I don’t think there is currently a simple way to do this for general RVs using PyMC3’s API. However, I can show you a trick for your case with two Gaussians.

The reason your code doesn’t work is because the maximum of two random variables is another random variable, whereas you are trying to use the vector maximum function that just selects the maximal element of the array. This is why observed = data throws the error, since this only works as a kwarg for random variables.

To get this working, you will need a PyMC3 distribution that represents the distribution of the maximum of two Gaussians. So what is this distribution? In the general case of n i.i.d. random variables X_1, \ldots, X_n, we can express the pdf of the maximum \rho(x) in terms of the pdf f(x) and cdf F(x) of X_1 as

\rho(x) = n [F(x)]^{n-1} f(x).

So for a general random variable, you could write a custom PyMC3 distribution that calls the pdf/cdf of the underlying random variable in order to define the distribution over the maximum. I think this would be an interesting but simple feature request for PyMC3 that wouldn’t be too hard to implement, and it wouldn’t surprise me if someone has already done this on their own.

In the meantime, you are lucky because it just so happens that the distribution for the maximum of two Gaussians is just a skew-Normal distribution, which is already implemented as a distribution in PyMC3. From here the pdf for the max of two standardized but correlated Gaussians is

\rho(x) = 2 \phi(x) \Phi \Big( \cfrac{1 - r}{\sqrt{1 - r^2}} x \Big)

where r is the correlation coefficient. For your case, it seems like you want r = 0. Since you have non-standardized Gaussians the above becomes

\rho(x) = 2 \phi \Bigg( \frac{x - \mu}{\sigma} \Bigg) \Phi \Bigg( \frac{1 - r}{\sqrt{1 - r^2}} \cdot \frac{x - \mu}{\sigma} \Bigg)

where for your case you have \mu = 10, \sigma = 1. We are almost there. The pdf above is implemented in PyMC3’s skew-normal distribution, where you take the parameters to be \mu = \mu, \sigma = \sigma, and \alpha = \frac{1- r}{\sqrt{1 - r^2}}. If your Gaussians are uncorrelated just take \alpha = 1.0.

TL;DR Here’s a Collab notebook that should solve your problem. One thing to note is that you aren’t actually inferring any parameters in your model, so maybe you want to pretend you don’t know what \mu and \sigma are. The code I used was:

# You only had one observation so I added more
data = np.array([np.max([10 + np.random.randn(), 10 + np.random.randn()]) for j in range(100)])

with pm.Model() as model:
  # Priors on mu and sigma
  mu = pm.Normal('mu', mu=9, sigma=3)
  sigma = pm.Uniform("sigma", lower=0.2, upper=4.0)

  # Likelihood
  likelihood = pm.SkewNormal("likelihood", mu=mu, sigma=sigma, alpha=1.0, observed=data)

Topic		Replies	Views
Including Observations in the Model, a beginner question version agnostic gaussian_process , modeling , sampling	14	119	October 20, 2024
Using a random variable as observed Questions	19	4027	October 20, 2023
How to model product of multiple random variables Questions	8	4194	June 28, 2019
Newbie question - How to estimate a distribution if I only have the min, max and mean of observations Questions	4	2436	January 7, 2019
Modelling a timeseries of cumulative maximum Questions	19	1127	August 10, 2021

Adding observed to result of maximum

Related topics