Variational inference over cartesian product of large sets of observations

mhlr · February 7, 2019, 9:01pm

I have 2 large sets of observations and I would like to do variational inference over the cartesian product of these sets. How do I use pymc3.Minibatch to get representative samples.

For example suppose the observations a are vactors and I want to model the distribution of dot product of of samples from the 2 sets.

Something like:

model = Model()
with model:
  A = pm.Minibatch(a, 100)
  B = pm.Minibatch(b, 100)
  C = pm.Deterministic('C', A.dot(B))
  N = pm.Normal('N, 0, 100, C)
  fit = pm.fit()

except I think do not think the above will sample fairly from the cartesian product of a and b.

How do I do something like the above but sampling uniformly over the cartesian product of a and b?

junpenglao · February 8, 2019, 6:07am

What do you mean by:

The minibatch sync across different input so you should be fine doing this. Also, since a and b is observed you can compute C first and do the minibatch on C

mhlr · February 8, 2019, 7:13am

Thanks! a and b are too large to precompute C particularly since the function I actually need to compute returns high dimensional vectors.

Also do I need to give the 2 Minibatches different seeds? It looks like by default menibatches all initialize to 42. Could that cause problems?

junpenglao · February 8, 2019, 8:57am

I dont think you would want to set different seed, as I understand that you would want to the minibatch to be in sync

mhlr · February 8, 2019, 10:04am

I do not think I want them in sync. I want it possible for any item in a to pair with any item in b with equal probability. If they are in sync then I think most pairings can never happen.

Or am I thinking about this wrong?

junpenglao · February 8, 2019, 10:52am

Oh I think I get what you mean - since a and b is too large to do dot product, you want to minibatch them to just get a slice… I am not sure it is the right way to do: I dont think dot product itself is batchable

Topic		Replies	Views
How to make Minibatch for multi-dimensional data? Questions	10	2477	September 17, 2020
Minibatch when latent variable size depends on data dimension Questions	2	675	February 8, 2019
Inference with multi-dimensional data and minibatches Questions	0	307	March 12, 2020
Minibatch Giving Inf Loss v5 variational_inferenc , modeling	4	33	July 24, 2024
Minibatch for MAP and/or wide models? Questions	5	852	August 31, 2017

Variational inference over cartesian product of large sets of observations

Related topics