How to index pm.Data with an RV's output --> Data[rv]?

I believe I might be approaching this the wrong way. I wanted to index pm.Data using the output from a random variable (RV) within the model, but the random variable appears to be stuck at a single value, indicating that it's not sampling properly. Here's an example of the code I used (there are also some warnings generated in the logs):

import numpy as np
import pymc as pm
import pytensor.tensor as pt
import pytensor
import arviz as az
import seaborn as sns


with pm.Model() as m:
    mean_sales = pm.MutableData("mean_sales", [100, 0., 50000])
    categories = pm.Categorical("categories", p=[0.33, 0.33, 0.34])

    sales = pm.Normal("sales", mu=mean_sales[categories], sigma=1)  # <<- indexing here

    idata = pm.sample(10000)

Sampling 2 chains, 0 divergences ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 100% 0:00:00 / 0:00:20
/usr/local/lib/python3.11/dist-packages/arviz/stats/diagnostics.py:596: RuntimeWarning: invalid value encountered in scalar divide
  (between_chain_variance / within_chain_variance + num_samples - 1) / (num_samples)

When I plot the posterior samples from the random variable categories, it appears to be stuck at a single value, which in turn affects its descendant nodes.

Sampling from a prior distribution using MCMC can often be harder than you expect. In this case, the sampler is faced with the following likelihood function:
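Written out (the original post showed a plot here, so this is a reconstruction from the model above), marginalizing over categories gives a three-component normal mixture:

$$
p(\text{sales}) = \sum_{k=1}^{3} \pi_k \, \mathcal{N}(\text{sales} \mid \mu_k,\, 1), \qquad \pi = (0.33,\, 0.33,\, 0.34), \quad \mu = (100,\, 0,\, 50000)
$$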

Since the standard deviation of the Normal is so small, the density essentially has three spikes at the three possible means, and a sea of (numerically) zero everywhere else. Depending on the initial value, the sampler will find the nearest mean and wander around it. It will not be able to make a proposal that crosses the large gulf between the modes, and will thus never draw samples from the others. This blog post by @colcarroll has some nice illustrations that are relevant for building intuition about this issue. This animation is also one I really like.

If you crank up the sigma to something like 10,000, you get a likelihood function that is more tractable:
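A minimal sketch of that change, reusing the model from the question (only sigma is modified; m_wide and idata_wide are just illustrative names):

import pymc as pm

with pm.Model() as m_wide:
    mean_sales = pm.MutableData("mean_sales", [100.0, 0.0, 50000.0])
    categories = pm.Categorical("categories", p=[0.33, 0.33, 0.34])

    # With sigma=10_000 the three mixture components overlap heavily,
    # so proposals for `sales` can move between the modes.
    sales = pm.Normal("sales", mu=mean_sales[categories], sigma=10_000)

    idata_wide = pm.sample(10000)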

And posterior samples from categories reflect this:
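For example, you can inspect the trace directly (assuming the idata_wide result from the sketch above):

import arviz as az

az.plot_trace(idata_wide, var_names=["categories"])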

Another way to remedy the situation is to condition the likelihood on some generated data.
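As a minimal sketch of that idea: invent a few observed sales values and pass them through observed=, so the likelihood anchors the sampler instead of leaving it to explore the prior on its own (the observations and names below are made up purely for illustration):

import numpy as np
import pymc as pm

# Hypothetical observations, each near one of the three means
observed_sales = np.array([101.2, -0.5, 49998.7, 99.1])

with pm.Model() as m_obs:
    mean_sales = pm.MutableData("mean_sales", [100.0, 0.0, 50000.0])
    # One latent category per observation
    categories = pm.Categorical("categories", p=[0.33, 0.33, 0.34], shape=len(observed_sales))

    # Conditioning on data: each observation now informs its own category
    sales = pm.Normal("sales", mu=mean_sales[categories], sigma=1, observed=observed_sales)

    idata_obs = pm.sample()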

Yeah, thanks for sharing, Jesse. Meanwhile, using pm.sample_prior_predictive instead of pm.sample also worked for me.
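Something like this minimal sketch, reusing the model m from the question (draws=10000 just mirrors the original sample count):

with m:
    prior_idata = pm.sample_prior_predictive(draws=10000)

Forward sampling draws categories directly from its prior rather than running MCMC, so the multimodal geometry never becomes a problem.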