Difference in posterior between Metropolis and NUTS Sampler

r2keks2 · March 18, 2024, 7:05pm

Hello,
I try to sample different polymer-chains with length N in spherical-coordinates, and than calculate the center of mass in Cartesian coordinates. Theta and phi (angles between monomers) have a uniform prior distribution.
When running this with NUTS, I get a Gausian distribution for center of mass, with mean at 0. When i run this with Metropolis i get a Gausian, but for the x axis the mean is shifted. Seen in the plots below. I use 10 chains and 8000 draws. For me it looks like it converged(I am quite new to this). How can this happen, do I have a mistake in my parametrization?
(Using pymc 5.0.2)
My Model:

N=40
processors=10
with pm.Model() as chain:
    
    theta = pm.Uniform("theta", lower=-np.pi/2, upper=np.pi/2, size=N)
    phi = pm.Uniform("phi", lower=0, upper=2*np.pi, size=N)
    
    ct = theta.cos()
    st = theta.sin()
    
    cp = phi.cos()
    sp = phi.sin()
    
    _x, _ = pt.scan(lambda _ct, _st, _cp, _sp, prior: prior + [_cp*_ct, _sp*_ct, _st],
               sequences = [ct, st, cp, sp],
               outputs_info = pt.tensor.zeros(3))

    x = _x[:,0]
    y = _x[:,1]
    z = _x[:,2]
      
    
    pm.Deterministic("z_mean", z.mean())
    pm.Deterministic("x_mean", x.mean())
    pm.Deterministic("y_mean", y.mean())

    #This lets the energy-less distribution be uniform on the sphere
    pm.Potential("sphere", ct.log().sum())

    
    #step=[pm.step_methods.Metropolis([theta, phi])]
    step=[pm.NUTS([theta, phi], target_accept=0.9)]

    #datat = pm.smc.sample_smc(draws=500,cores=processors)
    datat = pm.sample(draws = 2000, cores = processors, step= step)

cluhmann · March 18, 2024, 9:35pm

Welcome!

It is not entirely clear what is going on here. If you have a runnable example (e.g., something simpler than what you are currently using) someone might be able to investigate further.

A couple of suggestions. First, I would install the most recent version of PyMC if you can mange to do so. Second, I would suggest avoiding explicit specification of the step methods and allow pm.sample() to automatically select step methods (including compound methods). There is not particular reason to use Metropolis except in relatively specific scenarios. If NUTS works for you model, you should probably use it. Third, 80,000 samples is likely excessive unless your model is sampling very very poorly (at which point you should just fix your model).

r2keks2 · March 18, 2024, 11:23pm

Thanks for the quick response.
Updating to 5.6.1 solved the problem.
I guess the problem was fixed some time ago.

Regarding why i use Metropolis instead of NUTS;
I want to use SMC(for tempering), where i had the same problem(updating also solved this problem).

r2keks2 · March 22, 2024, 1:10pm

Maybe i spoke to soon.

Updated to 5.11.0 it solved the problem for the Metropolis sampler.
Now i tried to use the SMC sample, resulting in the same problem. As seen in the pictures below, x_mean goes to 0 the more draws I use.

I don’t know if I should open a new thread for this?

Questions I have:
Why is this happening for x and not for y?
The different chains should be uncorrelated, so why are they converging to the same posterior which then converges to mean=0?(maybe I misunderstand how the SMC alg. works)

Also to note updating to 5.11.0 solved the problem for the “normal” Metropolis sampler but not for SMC.

This time I also added a exported jupyter notebook with my model. I couldn’t think of something more simpler. I excluded the z-axes.

For_forum_2d.py (1.7 KB)

Topic		Replies	Views
SMC; Random walk; Adding z dimension results in bias in x dimension v5 bug , sampling , smc	0	90	April 22, 2024
Mixture Model Metropolis vs. NUTS Questions	1	592	April 1, 2020
NUTS sampler converges to wrong value Questions	7	806	September 9, 2020
Don't Use Metropolis Sharing	9	4080	June 21, 2022
Sequential monte carlo - need help? [SOLVED] Questions	10	888	January 8, 2018

Difference in posterior between Metropolis and NUTS Sampler

Related topics