Pymc3 sampling processes

BjornHartmann · December 15, 2020, 2:49pm

Simple question for you:

In a sampling process, where no inference is being done, is pymc3 simply taking one sample from each distribution 500 times for a draws=500 process?

In other words is:

    with pm.Model() as model:
        a = pm.Normal('a')
        b = pm.Normal('b')
        c = pm.Deterministic('c', a+b)
        pm.sample(draws=500)

equivalent to

a = np.random.normal(size=500)
b = np.random.normal(size=500)
c=[]
for i in a:
    c.append(random.sample(list(a),1)+random.sample(list(a),1)

junpenglao · December 15, 2020, 5:52pm

Other than the fact that PyMC3 use MCMC to draw samples when pm.sample is called, the model code will roughly equivalent to

a = np.random.normal(size=500)
b = np.random.normal(size=500)
c = a + b  # <= deterministic

BjornHartmann · December 17, 2020, 11:41am

awesome, thank you for getting back to me.
Follow-up question:
If i do such a simple sampling process with pymc3, my complete runtime including compilation is about 10 seconds, while with the np/scipy case it is done in 0.5 seconds. Is there a way to speed up pymc3 for easy cases like these? Is pymc3 not the right tool for such easy naive monte carlo processes?

junpenglao · December 17, 2020, 12:52pm

Usually, for simple forward simulation, you should use pm.sample_prior_predictive instead.

BjornHartmann · December 17, 2020, 4:59pm

thank you once again for getting back to me.

I tried the pm.sample_prior_predictive as well, but i did not see a significant increase in computation-time.

So in order to understand the computational advantages with pymc3 over numpy, i implemented the Metropolis-sampling described in https://twiecki.io/blog/2015/11/10/mcmc-sampling/
and compared that to the same model set up in pymc3.

For small number of draws, I saw that the numpy-way was significantly faster, while as for bigger number of draws/samples pymc3 outperformed numpy, as expected.

Can i by this conclude that for my simple “forward simulation”, pymc3 is too much of a powerful tool for a job simple enough for numpy to handle, and that pymc3 is slower for such jobs?

junpenglao · December 18, 2020, 9:14am

Depending on what you want to do - the nice thing of using a full PPL is you can write the forward sampling and reverse inference just once, instead of writing the forward simulation and then later on write the model again.

Topic		Replies	Views
Performance of draw() vs. pymc3's draw_values() v5	2	192	March 5, 2024
Sampling running very slowly for all models? Questions	1	840	April 27, 2020
A simple example But run very slow~ Questions	4	1550	February 11, 2021
Draw_values() speed/scaling with transformed variables Questions	9	1967	November 7, 2019
Slow sampling in pymc3 (on "tutorial problem") Questions	8	10227	July 17, 2019

Pymc3 sampling processes

Related topics