Can't vectorise custom distribution

Helmut · May 24, 2018, 6:43am

As a first step in building my model, I am trying to write a custom Dirichlet distribution and use it as a prior to estimate the parameters of a binomial distribution. The following clunky method works OK:

from pymc3 import *
from pymc3.distributions.multivariate import Dirichlet, Multinomial
import numpy as np
import theano.tensor as tt


with Model() as model1:
    prior = np.ones(4) / 4
    
    def dirich_logpdf(value):
        v0 = value[0]
        v1 = value[1]
        v2 = value[2]
        v3 = value[3]
        return  -5.15209009879231 - 0.75 * (np.log(v0) + np.log(v1) + np.log(v2) + np.log(v3))
    
    stick = distributions.transforms.StickBreaking()
    probs = DensityDist('probs', dirich_logpdf, shape=4, testval=np.array(prior), transform=stick)
    data = np.array([5, 7, 1, 0])
    sfs_obs = Multinomial('sfs_obs', n=np.sum(data), p=probs, observed=data)
    
with model1:
    step = Metropolis()
    trace = sample(100000, tune=10000, step=step)
    
print(df_summary(trace))

However, I would prefer to write the function dirich_logpdf so that it can accommodate arrays of general length, by replacing the function dirich_logpdf with something like the following:

def dirich_logpdf(value=prior):
        theano.config.compute_test_value = 'ignore'    # See stack overflow #30236070
        v = tt.vector('v')
        out = -5.15209009879231 - 0.75 * tt.log(v).sum()
        dirich_logpdf_tt = theano.function([v], out)
        out = dirich_logpdf_tt(value)
        return  float(out)

However, this causes an error: `TypeError: Bad input argument with name “v” to theano function with name “:14” at index 0 (0-based). ’

However, when the function is tested by providing a numpy array as input it works OK.

junpenglao · May 24, 2018, 6:59am

You dont need to create a theano tensor within the logp function, instead, rewrite it to take input, for example, something like:

def dirich_logpdf(value):
    return -5.15209009879231 - 0.75 * tt.log(v).sum()

Helmut · May 24, 2018, 7:41am

Thanks. That has fixed it.

Topic		Replies	Views
Running into Theano issues when using DensityDist Questions theano	6	1183	February 10, 2021
Theano error with element wise multiplication Questions theano	5	2487	July 12, 2017
Fitting a distribution with custom functions Questions theano	5	3778	August 3, 2017
Passing pymc3 distribution variable to theano function Questions	3	1630	September 23, 2019
Integrating new distribution (wrapped c++ functions) - Problems with broadcasting and shapes Questions theano	3	701	July 8, 2019

Can't vectorise custom distribution

Related topics