I have only been using PyMC3 for a month or so, but I wanted to try out the new JAX backend sampling on a model I understand, and also gauge the speedup: a multinomial model using baseball data.
import pymc3 as pm

N = data_mlb.shape[0]
results = data_mlb[['single', 'double', 'triple', 'home_run', 'tw', 'strikeout', 'bo']]
results = results.to_numpy()
K = results.shape[1]

with pm.Model() as hitting:
    a = pm.Normal('a', mu=0, sigma=1.5, shape=K)
    # Exponentiate each logit into a positive Dirichlet concentration
    # (elementwise; equivalent to ev = pm.math.exp(a))
    ev0 = pm.math.exp(a[0])
    ev1 = pm.math.exp(a[1])
    ev2 = pm.math.exp(a[2])
    ev3 = pm.math.exp(a[3])
    ev4 = pm.math.exp(a[4])
    ev5 = pm.math.exp(a[5])
    ev6 = pm.math.exp(a[6])
    ev = pm.math.stack([ev0, ev1, ev2, ev3, ev4, ev5, ev6]).T
    # One probability simplex per batter, same concentrations for every row
    p_ = pm.Dirichlet('p_', a=ev, shape=(N, K))
    y = pm.Multinomial('y', n=data_mlb.pa, p=p_, shape=(N, K), observed=results)
The results are normal and as expected when doing traditional sampling. However, when I sample with sampling_jax.sample_tfp_nuts I get errors and zeros for all results. Using sampling_jax.sample_numpyro_nuts I get 0.143 for everything, i.e. 1/7, one over the number of outcomes. I'm not sure if I am missing something entirely or something specific to the JAX sampling.
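For reference, the calls look roughly like this (a sketch against the PyMC3 3.11 sampling_jax module; the draw counts are illustrative, not the exact values I used):

from pymc3 import sampling_jax

with hitting:
    # TFP-backed NUTS on JAX -- this is the call that produced errors and zeros
    trace_tfp = sampling_jax.sample_tfp_nuts(draws=1000, tune=1000, chains=2)
    # NumPyro-backed NUTS on JAX -- this one returned 0.143 everywhere
    trace_numpyro = sampling_jax.sample_numpyro_nuts(draws=1000, tune=1000, chains=2)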
I am having the same problem today. I am running on a Google Colab instance, with PyMC3 3.11.2 and JAX 0.2.11. This is my model:
import numpy as np
import pymc3 as pm

p, n = df.shape
k = 5

with pm.Model() as non_hierarchical_model:
    # Per-sample exposures over k latent signatures (each row on the simplex)
    exposures = pm.Dirichlet("W*", np.ones((n, k)), testval=np.ones((n, k)) / k)
    # Signature-by-category probabilities (each row on the simplex)
    signatures = pm.Dirichlet("H", np.ones((k, p)), testval=np.ones((k, p)) / p)
    # Expected catalogue: product of exposures and signatures
    exp_catalogue = pm.Deterministic("WH", pm.math.dot(exposures, signatures))
    pm.Multinomial("X", df.sum().values, exp_catalogue, observed=df.values.T, shape=(n, p), testval=df.values.T)
And this is how I sample from it:
with non_hierarchical_model:
    # trace = pm.sample()
    trace = pm.sampling_jax.sample_tfp_nuts()
I also get the same value (1/N) for every entry of my Dirichlet variables.
Does the model sample fine (no divergences) with the PyMC3 default NUTS sampler?
My model sampled fine using the default NUTS sampler, the results were what I anticipated, and I recall all the traceplots looked good. I have since tossed out the environment and data, sorry.
In my case, default NUTS was very slow. I did not find any divergences, but maybe that's because I took only a few samples. I switched to a logit-normal prior, which is working fine so far. I'm still looking for a solution though, since I think the Dirichlet is a better fit for my case.
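For anyone trying the same workaround, here is a minimal sketch of what such a reparameterization can look like (variable names are illustrative and reuse n and k from the model above; this is not my actual code): unconstrained normals are pushed row-wise through a softmax onto the simplex.

import pymc3 as pm
import theano.tensor as tt

with pm.Model() as logit_normal_model:
    # Unconstrained latent scores: one row per sample, one column per component
    z = pm.Normal("z", mu=0.0, sigma=1.0, shape=(n, k))
    # Row-wise softmax maps each row onto the probability simplex
    W = pm.Deterministic("W", tt.nnet.softmax(z))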
Most likely your model isn't parameterized appropriately. I would first get it to sample reasonably, with high ESS, using the PyMC3 NUTS sampler, and only if that's still too slow experiment with JAX.
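For example (a sketch; ArviZ ships alongside PyMC3 3.11, and the tuning values here are just starting points):

import arviz as az

with non_hierarchical_model:
    trace = pm.sample(tune=2000, target_accept=0.9, return_inferencedata=True)

# Low ess_bulk / ess_tail values flag poorly sampled parameters
print(az.summary(trace, var_names=["W*", "H"]))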
What's most likely going on here is that the JAX samplers currently tune worse than the PyMC3 one, so they will have an even harder time with a poorly specified model and may never get off the ground.