How best to build a model on 200k normally distributed observations without a simple vector relation (rather, a piecewise vector relation, i.e. subsets of data depend on a combination of parameters)

DanWeitzenfeld · July 23, 2021, 8:23pm

I would try to use indexing. Assuming x is a data frame, create integer variables for i and j.

u = pm.Dirichlet('u', ..., shape=n_time_bins)  # or whatever prior you wish
v = pm.Dirichlet('v', ..., shape=n_days) # or whatever prior you wish
sd = pm.Deterministic('sd', (u[df.i.values] * v[df.j.values])**2)
observed = pm.Normal('observed', mu=0, sd=sd, observed=df.x)

Topic		Replies	Views
Multiple Linear Regression Questions	2	664	April 26, 2019
Help with a custom Deterministic variable Questions	0	409	March 13, 2020
How to model Normally distributed but integer data? Questions	3	666	December 30, 2019
Time series analysis tutorials? Questions	3	3788	January 3, 2018
Building a model where model parameter depends on independent data Questions	2	632	May 14, 2019

How best to build a model on 200k normally distributed observations without a simple vector relation (rather, a piecewise vector relation, i.e. subsets of data depend on a combination of parameters)

Related topics