Multivariate Random Walk with missing values

gcgibson · September 9, 2019, 9:07pm

Hey all!

I am trying to use the multivariate normal random walk to forecast values in the future. The best way to do this (at least in JAGS) is to add nans to the observed data. I tried to do that using pymc3 because I want to try out some of the very nice VI tools, but I can’t quite get it to work. I have attached a minimal reproducible example where you can toggle on the nan and see it fail or leave it off and see it successfully sample.

Thanks!

import matplotlib.pyplot as plt
from scipy.linalg import cholesky
from pymc3.distributions import Continuous
import scipy as sp
import theano.tensor as T
import theano.tensor.nlinalg
import sys
import pymc3 as pm
import theano.tensor.slinalg as sla


X = np.random.normal(size=(3,3))
class mvNormalRandomWalk(Continuous):
    def __init__(self, mu=0., cov=1., *args, **kwargs):
        super(mvNormalRandomWalk, self).__init__(*args, **kwargs)
        self.cov = cov
        self.mu = mu
    
    def logp(self, x):
        mu = self.mu
        
        x_im1 = x[:-1]
        x_i = x[1:]
        
        L = sla.cholesky(self.cov)
        log_det = T.log(L.diagonal()).sum()
        delta = x_i - (x_im1+mu)
        
        solve_lower_triangular = sla.Solve(A_structure='lower_triangular', lower=True)
        Linv_delta = solve_lower_triangular(L,delta.T)
        
        k = L.shape[0]
        innov_like = -(0.5*k*T.log(2*np.pi) + log_det + 0.5*T.sum(Linv_delta*Linv_delta,axis=0))
        return T.sum(innov_like)

n_samples = 5000
Sigma = np.random.randn(3,5)
Sigma = Sigma.dot(Sigma.T)


#TURN OFF OR ON
#X[0,0] = np.nan


with pm.Model() as model:
    mu = pm.MvNormal('mu',mu=np.zeros(3), cov=np.eye(3),shape=3)
    likelihood = mvNormalRandomWalk('y',mu=mu,cov=Sigma,observed=X[0:3,0:3])
    step = pm.NUTS()
    trace = pm.sample(n_samples, step)

junpenglao · September 10, 2019, 10:00am

Did you have a look at the discussion in Multivariate Normal with missing inputs? I feel that the solution might apply.

junpenglao · September 10, 2019, 11:46am

Actually, random walk is modeling X_{t} - X_{t-1} \sim MvNormal(\theta), which makes missing value quite difficult to handle. I will need to think a bit more about it.

Topic		Replies	Views
Handling missing values in predictor when outcome is a Multivariate Normal distribution v5	7	103	October 25, 2024
randomWalk predictions Questions	0	396	February 25, 2020
PyMC v5.14.0 introduces error in MultivariateNormal with NaN v5 development , bug , modeling	3	132	May 14, 2024
Multivariate Random Walk Regression v5	3	339	December 8, 2023
Multivariate Normal Imputation Error v5 aesara	1	528	December 19, 2022

Multivariate Random Walk with missing values

Related topics