Mean changing every data point

ay_al_42 · June 9, 2021, 4:21am

Hello everyone,

I have the following model.

with basic_model:
     lambda1 = pm.Gamma("lambda1", alpha=0.001, beta=0.001)
     p =  pm.Beta('p', 1, 1,)
     z = [0.0] * len(x['Length'].values)
     Y_obs = [0.0] * len(x['Length'].values)
 
     for i in range(len(x['Length'].values)):
         z[i] = pm.Bernoulli('z[i]',p)
         Y_obs[i] = pm.Poisson("Y_obs[i]", mu=lambda1*z[i]*x['Length'].values+0.001, observed=x['Count'].values[i])
     trace = pm.sample(7000, tune=2000, cores=1, return_inferencedata=True)

It produces an error about the names Y_obs[i] and z[i]. I understand that I cannot use the same name for more than one variable, but I couldn’t figure out how to change the rate of Y_obs[i] at every iteration. At a later stage, I will also be changing the rates with if-else conditions. How do I define a different mean for every data point?

OriolAbril · June 12, 2021, 8:46pm

To avoid repeated variable names you should use, for example, f-strings, so that ...Bernoulli(f"z[{i}]",... generates z[0], z[1]… instead of generating z[i] every single time.
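
A minimal sketch of the naming point, in plain Python so it runs without PyMC. The list names and the loop length `n` are made up for illustration; the only claim is how the two string forms behave inside a loop.

```python
n = 3  # hypothetical number of data points

# Plain string literal: the text "z[i]" is used as-is on every iteration.
literal_names = ["z[i]" for i in range(n)]

# f-string: the current value of i is substituted into the name.
fstring_names = [f"z[{i}]" for i in range(n)]

print(literal_names)
print(fstring_names)

# A model that requires unique variable names only accepts the second form.
print(len(set(literal_names)), len(set(fstring_names)))
```

The first list contains the same name three times, which is what triggers the duplicate-name error; the second contains three distinct names.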

That being said, this is generally not a good idea. Why don’t you use vectorized statements?
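
To illustrate what "vectorized" means here, the sketch below uses NumPy only, with made-up arrays standing in for x['Length'].values, z and lambda1. As far as I know the same arithmetic broadcasts over PyMC tensors, and pm.math.switch plays the role that np.where plays here for elementwise if-else, but treat that mapping as an assumption to check against the PyMC docs.

```python
import numpy as np

# Hypothetical stand-ins for x['Length'].values, the z indicators and lambda1.
length = np.array([1.0, 2.0, 4.0])
z = np.array([1, 0, 1])
lambda1 = 0.5

# One expression yields one mean per data point, no Python loop needed.
mu = lambda1 * z * length + 0.001
print(mu)

# Elementwise if-else: double the mean wherever length exceeds 1.5.
mu_cond = np.where(length > 1.5, 2.0 * mu, mu)
print(mu_cond)
```

Passing a vector like mu to a single observed variable gives each observation its own mean, which is the vectorized counterpart of the loop in the question.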

ay_al_42 · June 13, 2021, 6:18pm

I am new to PyMC3, and coming from JAGS, this concept was easier for me to understand. I tried to do it the other way, but I got a gazillion tensor-related errors.

OriolAbril · June 13, 2021, 7:07pm

I think something like this code below will technically work:

with basic_model:
     lambda1 = pm.Gamma("lambda1", alpha=0.001, beta=0.001)
     p =  pm.Beta('p', 1, 1,)

     pm.Poisson("Y_obs", mu=lambda1*p*x['Length'].values+0.001, observed=x['Count'])
     trace = pm.sample(7000, tune=2000, cores=1, return_inferencedata=True)

The problem is that p and lambda1 are completely degenerate: they only ever appear multiplied together, so only their product is constrained.
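
A small numerical check of that degeneracy, using only the standard library. The lengths and counts are invented for illustration, and loglik is a hypothetical helper that mirrors the mu=lambda1*p*Length+0.001 expression above; it is not PyMC code.

```python
import math

def poisson_logpmf(k, mu):
    # log of the Poisson pmf: k*log(mu) - mu - log(k!)
    return k * math.log(mu) - mu - math.lgamma(k + 1)

# Hypothetical data standing in for x['Length'] and x['Count'].
lengths = [1.0, 2.0, 4.0]
counts = [0, 3, 5]

def loglik(lambda1, p):
    # Same mean as in the model above: lambda1 * p * Length + 0.001
    return sum(
        poisson_logpmf(k, lambda1 * p * length + 0.001)
        for k, length in zip(counts, lengths)
    )

a = loglik(2.0, 0.5)    # lambda1 * p = 1.0
b = loglik(4.0, 0.25)   # lambda1 * p = 1.0 again
c = loglik(2.0, 0.25)   # lambda1 * p = 0.5

print(a, b, c)
```

The first two values coincide because the data only see the product lambda1*p, so the likelihood cannot tell those parameter pairs apart; only the priors separate them.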

ay_al_42 · June 14, 2021, 12:06am

This works but doesn’t capture the “Counts are either 0 or Poisson(lambda1) and the degenerate probability is p” structure of Y_obs. I will also have to extend this model to a point where Y_obs[i] depends on Y_obs[i-1].
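
One way to keep the "either 0 or Poisson" structure without a discrete z per data point is to sum z out of the likelihood, which gives a zero-inflated Poisson. The sketch below writes that marginal pmf in plain Python. It assumes the zero component is exactly 0 (the model in the question uses a mean of 0.001 there) and that p is the probability of the Poisson component. The function names and parameter values are hypothetical. PyMC also ships a ZeroInflatedPoisson distribution; I believe its parameter names differ between versions, so check the docs. This does not address the dependence of Y_obs[i] on Y_obs[i-1].

```python
import math

def poisson_pmf(k, mu):
    # Poisson pmf computed via logs for numerical stability.
    return math.exp(k * math.log(mu) - mu - math.lgamma(k + 1))

def zip_pmf(k, p, mu):
    """P(Y = k) when Y ~ Poisson(mu) with probability p, and Y = 0 otherwise."""
    point_mass = (1.0 - p) if k == 0 else 0.0
    return point_mass + p * poisson_pmf(k, mu)

# Hypothetical parameter values and a single Length value.
p, lambda1, length = 0.3, 0.5, 4.0
mu = lambda1 * length

prob_zero = zip_pmf(0, p, mu)                        # (1 - p) + p * exp(-mu)
total = sum(zip_pmf(k, p, mu) for k in range(60))    # should be close to 1

print(prob_zero, total)
```

Because z no longer appears, p and lambda1 enter the likelihood separately (p through the extra mass at zero, lambda1 through the shape of the non-zero counts), so they are no longer constrained only through their product.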
