Theano error debugging model

rpgoldman · September 25, 2019, 10:13pm

Here’s a degenerate model that I plucked out of a model I’m trying to debug:

with pm.Model() as model:
    beta_soc_dem_edu_spend = pm.Normal('β(soc_dem, edu spend)', shape=1)
    print('β(soc_dem, edu spend) shape is: %s'%beta_soc_dem_edu_spend.shape)
    print('β(soc_dem, edu spend) shape is: %s'%beta_soc_dem_edu_spend.shape.eval())

When I run this, the first print prints out the theano variable:

β(soc_dem, edu spend) shape is: Shape.0
But then the second line raises a MissingInputError:

MissingInputError: Input 0 of the graph (indices start from 0), used to compute Shape(β(soc_dem, edu spend)), was not provided and not given a value. Use the Theano flag exception_verbosity='high', for more information on this error.

Why does this happen?
By the way, setting that Theano flag seems to do nothing at all.

Dominik · September 26, 2019, 7:10am

In the second print fails because beta_soc_dem_edu_spend has no value and therefore no shape yet. It only obtains a value once you start sampling from the distribution.

In the first print you refer to beta_soc_dem_edu_spend.shape which is a symbolic theano expression that can be used, e.g., to define a computation that only produces a value once its .eval() is called. Here is an example:

with pm.Model() as model:
    beta = pm.Normal('β', shape=1)
    a = pm.Normal('a', mu=beta.shape, testval=0)
    trace = pm.sample()

Looking at np.mean(trace['a']) shows 0.9695488955681962 so the mean of pm.Normal('a', ... is indeed the shape of beta.

If you want to use the shape of beta during the model definition you could store it in a separate variable, e.g.:

beta_shape = 1
with pm.Model() as model:
    beta_soc_dem_edu_spend = pm.Normal('β(soc_dem, edu spend)', shape=beta_shape)
    print('β(soc_dem, edu spend) shape is: %s'%beta_shape)

junpenglao · September 26, 2019, 1:34pm

In addition, you can make it work by supplying a value to the graph:

In [13]: with pm.Model() as model: 
    ...:     beta_soc_dem_edu_spend = pm.Normal('β(soc_dem, edu spend)', shape=1
    ...: ) 
    ...:     print('β(soc_dem, edu spend) shape is: %s'%beta_soc_dem_edu_spend.s
    ...: hape) 
    ...:     print('β(soc_dem, edu spend) shape is: %s'%beta_soc_dem_edu_spend.s
    ...: hape.eval({beta_soc_dem_edu_spend: np.zeros(1)}))

rpgoldman · September 26, 2019, 2:28pm

@Dominik
Thanks. I’m still a little confused because I set the shape of the RV to a constant, so I don’t understand why it is that evaluation is needed: why isn’t this a theano constant?

Similarly, I didn’t get what input was missing to the Theano shape variable, so that I get the MissingInputError (and it seems hard to winkle that information out of Theano).

If I understand what you’re saying, the shape depends on the actual value of the variable, and that’s why there’s a missing input.

As you might have guessed, this arises when I’m trying to debug a shape error in sampling (even the prior predictive fails), and I’m looking for a general way to diagnose such errors. There is discussion in the Theano docs about how one debugs theano graphs, but it’s not obvious (at least not to me!) how one gets hold of this graph when one is interacting with Theano through PyMC3.

rpgoldman · September 26, 2019, 2:30pm

Thanks. passing the dictionary is an important clue. As I said above, I stumbled on this trying to debug a shape mismatch – specifically in a model where there is missing data.

I still don’t quite get how to go from PyMC3 to debugging the underlying computation graph. I’ll see what I can figure out.

Dominik · September 26, 2019, 2:43pm

Since all RVs of model have a test value, you could use that to get its shape:

model.test_point['β(soc_dem, edu spend)'].shape

Not sure if that is what you are looking for but I also do not know how to debug the theano compute graph efficiently.

Edit: As @junpenglao points out, you can get the shape even more conveniently with the python variable through beta.tag.test_value.shape.

junpenglao · September 26, 2019, 2:47pm

Doing rv.tag.test_value is also my go-to for debugging shape issue.

Topic		Replies	Views
Test_value shape errors with theano.shared Questions	13	2555	March 27, 2018
Theano.gradient(beta_0, alpha (beta samples are from a beta_dist which takes as input alpha)) returns DisconnectedInputError Questions theano	13	1385	March 18, 2018
Shape mismatch using theano shared variable Questions	20	1573	February 1, 2019
Model fits fine on OSx but not on Ubuntu 16.04 Questions	6	618	July 31, 2018
PyMC3 Model theano tensor switch error Questions theano	1	584	February 22, 2022

Theano error debugging model

Related topics