How to run unpooled independent regressions using coords/dims

bstu · January 12, 2022, 6:58pm

I’m a newer user to pymc and having trouble understanding how to structure my inputs and model to fit unpooled independent regressions using the concepts of coords and dims.

I’ve mocked up some dummy data in a pandas dataframe that represents a simplified problem as follows. I have a product_name (string), price (float), and sales (integer).

My goal is to fit independent parameters for each product name (sales ~ price separately for each product_name). Based on initial research, I believe this is done with use of coords/dims and incorporated into priors and likelihood functions, but am having trouble conceptualizing how to incorporate into code.

I could certainly loop and fit a bunch of models on subsets of the data (ex: filter to the “bike” product name + fit a model, then filter to “car” and repeat), but I’m sure the better way is to structure in a way that one can be run and then posterior results parsed afterwards. The model (simple log log linear regression) is defined as follows, but currently only works for a single product_name:

with pm.Model() as model:
    m = pm.Normal('m', mu=-1, sd=1)
    b = pm.Normal('b', mu=0, sd=1)
    sig = pm.HalfNormal('sig', sigma=1)
    y_hat_log = m * np.log(df.price) + b
    y_observed_log = np.log(df.sales)
    lik = pm.Normal('lik', mu=y_hat_log, observed=y_observed_log, sigma=sig)
    trace = pm.sample()

cluhmann · January 12, 2022, 10:04pm

Welcome!

Have you checked out the multi-level/hierachical modeling notebook found here? If not, I would suggest doing so. It takes an unpooled, fully pooled, and hierarhical approach with the same data and uses coordinates/dimensions for each.

Topic		Replies	Views
Problem with coords/dims in hierarchical model v5	4	816	January 5, 2023
Expand Multilevel Logistic Regression Model to include Individual Covariates v5	3	279	March 6, 2024
Understanding coords, indexation, Data, ..., for multilevel models v5 modeling	1	3884	April 29, 2022
Rolling regression with multivariate stored in a pandas dataframe v5 modeling	0	621	September 28, 2022
PyMC3+ArviZ: improve your workflow with labeled coords and dims Sharing doc	20	5788	April 5, 2021

How to run unpooled independent regressions using coords/dims

Related topics