Inf Average loss ADVI in Correlated Topic Model

gaddamanil16 · July 23, 2018, 11:06pm

Hi,

While implementing Correlated Topic Model with ADVI I get inf average loss. What does it mean? Now I’m unsure whether I’m implementing it in a correct way or not.

Here is my code:

import numpy as np
import pymc3 as pm, theano.tensor as t
import matplotlib.pyplot as plt
K = 4
V = 4 # number of words
D = 10 # number of documents

data = np.random.randint(V,size=(D,4))

alpha = np.ones((1, K))
beta = np.ones((1, V))
model = pm.Model()

mu = np.ones((1,K),dtype = np.float64)
cov = np.random.random_sample(size=(K,K))

Wd = [len(doc) for doc in data]
(D, W) = data.shape

def log_lda(theta,phi):
    def ll_lda(value):  
        dixs, vixs = value.nonzero()
        vfreqs = value[dixs, vixs]
        ll =vfreqs* pm.math.logsumexp(t.log(theta[dixs]) + t.log(phi.T[vixs]), axis = 1).ravel()
        return t.sum(ll) 
    return ll_lda

with model: 
    eta = pm.MvNormal('eta',mu = mu,cov = cov, shape = (D,K))
    theta = t.nnet.softmax(eta)
    phi = pm.Dirichlet("phi", a=beta, shape=(K, V))
    doc = pm.DensityDist('doc', log_lda(theta,phi), observed=data)   
with model:    
    inference = pm.ADVI()
    approx = pm.fit(n=10000,method= inference,callbacks=[pm.callbacks.CheckParametersConvergence(diff='absolute')])
   
#inference    
tr1 = approx.sample(draws=1000)
pm.plots.traceplot(tr1);    
pm.plot_posterior(tr1, color='LightSeaGreen');

plt.plot(approx.hist)

Can someone please guide me with this?
Help much appreciated.
Thanks

gaddamanil16 · July 23, 2018, 11:07pm

Average Loss = inf: 100%|██████████| 10000/10000 [00:06<00:00, 1497.31it/s]
Finished [100%]: Average Loss = nan

junpenglao · July 24, 2018, 4:22am

What is plt.plot(approx.hist) looks like?

gaddamanil16 · July 24, 2018, 3:16pm

It gives me nothing. Its blank.

junpenglao · July 24, 2018, 4:27pm

Yeah there is definitely some problems of your model. You should double check the likelihood implementation, print the testpoint and model.check_test_point.

gaddamanil16 · July 24, 2018, 4:39pm

It gives me

eta -inf
phi_stickbreaking__ -15.01367190100603
doc -72.08730677823429

For eta :

Using this I have used
eta = pm.MvNormal('eta',mu = mu,cov = cov, shape = (D,K))

Is there anything wrong with this?

junpenglao · July 24, 2018, 7:19pm

This is not right - the reason is that your cov is not a positive definite matrix.

gaddamanil16 · July 24, 2018, 8:18pm

In the documentation provided, its not mentioned about the positive definiteness of the covariance matrix?

junpenglao · July 24, 2018, 8:48pm

I guess the document is not explicit enough on that but the logp of MvNormal (or MvStudentT) only defined on covariance matrix that are positive definiteness.

gaddamanil16 · July 25, 2018, 4:05pm

Okay! Thanks.
Also do you know any implementation of Stochastic Mixed membership Block Models in pymc3. I want to implement it using ADVI.

Topic		Replies	Views
Average Loss in optimization output Questions	20	4695	July 20, 2018
Infinite loss with ADVI Questions	1	1102	January 24, 2018
Negative "Average loss" in ADVI Questions	4	1378	April 15, 2019
Average loss for ADVI never decrease for my model	6	366	September 3, 2023
Poor Accuracy of ADVI for Linear Regression Questions	12	3373	April 18, 2018

Inf Average loss ADVI in Correlated Topic Model

Related topics