############ the model fit code ###############
def hmm_fit(inp, nans, n_subs, last):
    """Define and fit the hierarchical ED-HMM model with ADVI.

    Parameters
    ----------
    inp : array
        Responses, outcomes, and a switch variable which turns off the
        belief update in the presence of nans.
        NOTE(review): indexing below (``inp[:last, :, 0]``) assumes shape
        (trials, subjects, features) — confirm against the caller.
    nans : bool array
        Marks locations of nan responses and outcomes.
    n_subs : int
        Total number of subjects (each subject is fitted with its own
        parameter values).
    last : int
        Negative value denoting the number of last trials to exclude from
        parameter estimation; e.g. last = -35 excludes the last 35 trials.

    Returns
    -------
    The ADVI approximation object returned by ``fit``.
    """
    d_max = 200  # maximal value for state duration
    with Model() as hmm:
        # vector of possible duration values from zero to d_max, tiled to
        # one row per subject
        d = tt.arange(d_max)
        d = tt.tile(d, (n_subs, 1))
        P = tt.ones((2, 2)) - tt.eye(2)  # permutation matrix
        # prior state probability (uniform over the two states)
        theta0 = tt.ones(n_subs) / 2
        # hierarchical prior for the delta parameter of prior beliefs p_0(d)
        dtau = HalfCauchy('dtau', beta=1)
        dloc = HalfCauchy('dloc', beta=dtau, shape=(n_subs,))
        # rate of prior beliefs
        delta = Deterministic('delta', dloc / (1 + dloc))
        # the r parameter of prior beliefs p_0(d) is fixed
        r = 1.0
        # prior beliefs over state durations for the given delta and r
        # (negative-binomial log-kernel, normalized via softmax)
        binomln = tt.gammaln(d + r) - tt.gammaln(d + 1) - tt.gammaln(r)
        pd0 = tt.nnet.softmax(binomln + d * log(1 - delta[:, None]) + r * log(delta[:, None]))
        # joint probability distribution over states and durations
        joint0 = tt.stack([theta0[:, None] * pd0,
                           (1 - theta0)[:, None] * pd0]).dimshuffle(1, 0, 2)
        # hierarchical priors for response noises
        btau = HalfCauchy('btau', beta=1)
        bloc = HalfCauchy('bloc', beta=btau, shape=(n_subs,))
        # response precision
        beta = Deterministic('beta', 1 / bloc)
        # hierarchical priors for initial beliefs about reward probability
        mtau = HalfCauchy('mtau', beta=4)
        mloc = HalfCauchy('mloc', beta=mtau, shape=(n_subs, 2))
        # initial expectations
        muA = Deterministic('muA', mloc[:, 0] / (1 + mloc[:, 0]))
        muB = Deterministic('muB', 1 / (1 + mloc[:, 1]))
        init = tt.stacklists([[10 * muA, 10 * (1 - muA)],
                              [10 * muB, 10 * (1 - muB)]]).dimshuffle(2, 0, 1)
        # compute the posterior beliefs over states, durations, and reward
        # probabilities by scanning edhmm_model over trials
        (post, _) = scan(edhmm_model,
                         sequences=[inp],
                         outputs_info=[init, joint0],
                         non_sequences=[pd0, P, range(n_subs)],
                         name='hmm')
        # get posterior reward probability and state probability; the initial
        # beliefs are prepended and the final trial's output dropped so that
        # beliefs at trial t are those held BEFORE observing trial t
        a0 = init[None, :, :, 0]
        b0 = init[None, :, :, 1]
        a = tt.concatenate([a0, post[0][:-1, :, :, 0]])
        b = tt.concatenate([b0, post[0][:-1, :, :, 1]])
        mu = Deterministic('mu', a / (a + b))
        theta = Deterministic('theta',
                              tt.concatenate([theta0[None, :],
                                              post[1][:-1].sum(axis=-1)[:, :, 0]])[:, :, None])
        # choice dependent expected reward probability
        mean = (theta * mu + (1 - theta) * mu.dot(P))
        # expected utility
        U = Deterministic('U', 2 * mean - 1)
        # priors for response biases
        ctau = HalfCauchy('ctau', beta=1)
        cloc = HalfCauchy('cloc', beta=ctau, shape=(n_subs,))
        c0 = Deterministic('c0', cloc / (1 + cloc))
        # response noise and response bias modulated expected free energy
        G = Deterministic('G', beta[None, :, None] * U + log([c0, 1 - c0]).T[None, :, :])
        # response probability for the pre-reversal and the reversal phase of
        # the experiment; trials after `last` and nan trials are excluded
        nzero = tt.nonzero(~nans[:last])
        p = Deterministic('p', tt.nnet.softmax(G[:last][nzero]))
        # observation likelihood of responses
        responses = inp[:last, :, 0][~nans[:last]]
        Categorical('obs', p=p, observed=responses)
    # fit the model with automatic differentiation variational inference
    with hmm:
        approx = fit(method='advi', n=20000, progressbar=True)
    return approx
#################### plot trace #######################
# Load the fitted variational approximation from disk.
# SECURITY NOTE: pickle.load executes arbitrary code — only unpickle files
# you created yourself / fully trust.
filename = 'approx_hmm.data'
with open(filename, 'rb') as f:
    approx_hmm = pickle.load(f)
# draw posterior samples from the fitted approximation
nsample = 1000
hmm_trace = approx_hmm.sample(nsample, include_transformed=True)
# plot the (combined-chain) trace of the expected free energy G
az.plot_trace(hmm_trace['G'], combined=True)
I think I've got your point. Your suggestion is that I shouldn't fit all subjects within a single model — is that right?