Maximum likelihood estimation of a Bayesian model


#1

I’m sure this question is anathema to many of you.

Is there a way to obtain maximum likelihood estimates, or even just the likelihood given some parameter values, of a model in PyMC3?

For context, I have an idea about how to model some data which works as a Bayesian hierarchical model in PyMC3. However, I work in a field that is still very much dominated by p values, and my concern is that readers, and even some of my coauthors, will not accept this model if it is Bayesian, at least not initially. I would like the initial presentation of the model to be as accessible as possible for my audience, and unfortunately that means using MLE and p values.

I know how to code this model in PyMC3 but not outside of it (e.g., using just SciPy).
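For what it's worth, here is the general pattern for MLE with just SciPy: write the negative log-likelihood and minimize it. This is a toy non-hierarchical linear regression (all names and data are made up, just to show the shape of the approach):

```python
import numpy as np
from scipy import optimize, stats

# Toy data (illustrative only): y = 1 + 2*x + noise
rng = np.random.default_rng(0)
x = np.linspace(0, 1, 50)
y = 1.0 + 2.0 * x + rng.normal(0, 0.5, size=50)

def neg_log_likelihood(params):
    # params = (intercept, slope, log_sigma); log_sigma keeps sigma positive
    a, b, log_sigma = params
    sigma = np.exp(log_sigma)
    return -np.sum(stats.norm.logpdf(y, loc=a + b * x, scale=sigma))

# Minimizing the negative log-likelihood gives the MLE
result = optimize.minimize(neg_log_likelihood, x0=[0.0, 0.0, 0.0])
a_hat, b_hat, sigma_hat = result.x[0], result.x[1], np.exp(result.x[2])
```

The hard part for a complex hierarchical model is writing `neg_log_likelihood` by hand, which is exactly what PyMC3 does for you.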


#2

If you replace all the priors in your model with flat priors and then run pm.find_MAP, you will get the MLE, provided the optimizer converges. If your model is not too difficult it should work, but make sure you compare the result with what you get from sampling.

As for p values, that is more difficult; a permutation test might work, but again, it depends on your model.
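A minimal sketch of what I mean by a permutation test (toy two-group comparison, all data made up): shuffle the group labels many times to build a null distribution of the test statistic, then count how often the shuffled statistic is at least as extreme as the observed one.

```python
import numpy as np

rng = np.random.default_rng(0)
group_a = rng.normal(0.0, 1.0, size=40)
group_b = rng.normal(1.0, 1.0, size=40)

observed = group_b.mean() - group_a.mean()
pooled = np.concatenate([group_a, group_b])

# Permutation null: shuffle group labels and recompute the statistic
n_perm = 5000
count = 0
for _ in range(n_perm):
    perm = rng.permutation(pooled)
    diff = perm[40:].mean() - perm[:40].mean()
    if abs(diff) >= abs(observed):
        count += 1

# Add-one correction so the p value is never exactly zero
p_value = (count + 1) / (n_perm + 1)
```

How to define the statistic and what to permute depends entirely on your model, which is why I can't be more specific.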


#3

Thanks! Unfortunately the model is quite complex and pm.find_MAP doesn’t work when I replace the priors with flat priors; however, the model samples just fine. So can I just record the logp of every sample and pick the parameter values of the sample with the maximum? Or is there some reason that won’t be the true MLE?
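To make the question concrete, here is a toy illustration of the idea (fake one-parameter "trace", names are mine): evaluate logp at every draw and keep the best one. The catch, as discussed below, is that the best draw is only the sample *closest* to the mode, not the mode itself.

```python
import numpy as np
from scipy import stats

rng = np.random.default_rng(0)

# Toy "trace": draws from a posterior that is standard normal,
# whose true mode (and, with a flat prior, MLE) is exactly 0
samples = rng.normal(0.0, 1.0, size=2000)
logp = stats.norm.logpdf(samples)

# The draw with the highest logp approximates, but rarely equals, the mode
best = samples[np.argmax(logp)]
```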


#4

In general that is not considered the MLE but the MAP estimate.
A workaround is to take the maximum from the trace and then do a small local search around it.
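A sketch of that workaround (toy one-parameter model, all names and data invented): take the draw with the best logp as the starting point and hand it to an optimizer for the local search.

```python
import numpy as np
from scipy import optimize, stats

rng = np.random.default_rng(0)
data = rng.normal(3.0, 1.0, size=50)

def neg_logp(mu):
    # Negative log-likelihood of a normal with known sigma = 1
    return -np.sum(stats.norm.logpdf(data, loc=mu, scale=1.0))

# Pretend these are posterior draws of mu from the sampler
draws = rng.normal(data.mean(), 0.2, size=1000)
best_draw = draws[np.argmin([neg_logp(m) for m in draws])]

# Small local search starting from the best draw
result = optimize.minimize(neg_logp, x0=[best_draw])
mle_mu = result.x[0]  # for this model the exact MLE is the sample mean
```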


#5

This is interesting stuff. When publishing, the ability to do MAP or MLE is actually very useful for comparing with previous approaches.

Could you just expand a bit on why that wouldn’t correspond to MLE with flat priors?


#6

What I meant is that it is only considered the MLE if there is no prior information (i.e., flat priors), since then you are only evaluating the likelihood function and trying to find its maximum.


#7

Do you have any idea if there is a way to extract the likelihood/logpt function from a model? I don’t mean the values, rather the equations from Theano (presumably).


#8

Yes, you can get logpt from the model; it is a Theano tensor. To evaluate the tensor, you can compile it into a function.


#9

Sorry, I’m having difficulty expressing myself clearly. I want to find out the equation that is generating the values for a given model. In simple models, such as a non-hierarchical linear regression, the likelihood function is obvious (just the univariate normal pdf). However for my model, which is complex and hierarchical, I can build it in pymc3 easily but don’t know how to write the likelihood symbolically. Presumably there is such a symbolic expression of the likelihood/posterior within Theano somewhere.


#10

I don’t think the symbolic expression is available (at least not from Theano). If you know SymPy, that would be a good direction to try.


#11

Thanks! It doesn’t look like sympy is powerful enough yet for my purposes since its matrix support is limited. Fortunately though I was able to find the equations in an obscure journal article from a decade ago, so all is well!


#12

Hmmm, it looks like my model.logpt is routinely > 0 with flat priors. I’d assumed it must be << 0. Is this a bug or is my thinking mistaken?

Edit: Nevermind, I was being an idiot.