How to generate data as a variable?

Jordan_Howell · June 6, 2023, 8:28pm

Hello,

I have developed a time series sales forecasting model that works better than expected. I’d like to add a variable simulating how many buyers we will have as our buyer count is predictive of our sales. I know, how many buyers are possible in each market because our buyers have to register with us in order to buy our product.

That said, is it as simple as the following:

with pm.Model(coords=coords) as constant_model:
    simulated_buyers = pm.TruncatedNormal('simulated_buyers', mu = "registered_buyers.mean", std = "registered_buyers.std()", upper = "registered_buyers.max")
    buyers_coeff = pm.Normal("buyers_coeff", mu = 0, std =1)

   mu= simulated_buyers*buyers_coeff
   sigma = pm.HalfNormal('sigma', sigma=100)

   eaches = pm.StudentT('predicted_eaches',
                             mu=mu,
                             sigma=sigma,
                             nu=15,
                             # lower = 0,
                             observed=observed_eaches)

Where “registered_buyers.mean/std” are the mean and standard deviation of our registered buyers on a monthly basis?

Is there anything wrong with taking two RVs and multiplying them together?

jessegrabowski · June 7, 2023, 3:26am

You can definitely multiply things together, no problem. I guess it’s common in mixed media models, see here for an example where random variables are mixed together in a regression. As it stands, what you wrote is just fine, you will just have to make sure all the shapes work, as simulated_buyers will inherit the shape of registered_buyers.mean.

Another option would be to make simulated_buyers an observed node, with the number of buyers each month as data and estimate mu and sigma. As it stands you lose some uncertainty because you are computing summary statistics (registered.mean and .std) outside the model, then using them as deterministic data. It might not matter in your application, though.

Topic		Replies	Views
Proper way to model several variables v5	4	1261	August 8, 2022
Trying to understand pymc3 through a simple example Questions	10	2460	August 22, 2022
Simulator for Multivariate Observed Data	1	169	February 20, 2024
Modelling product demand when observing sales Questions	3	571	July 5, 2018
Fit many regressions simultaneously Questions	4	1147	November 3, 2017

How to generate data as a variable?

Related topics