Trying to impute missing categorical data

jessegrabowski · January 2, 2023, 9:04pm

I ran into this as well while trying to track down the first bug; I was hoping it was just something wrong with my system. I think there’s a glitch that sets the shape of the observed data as the number of classes in the categorical (rather than inferring it from the length of p).

Could you test that:

1. There’s no error if the length of the observed data (still with missing values) is exactly equal to the length of p, and;
2. If (1) works, when you sample from data with fewer observations than the number of classes, the largest class in your samples is the length of the data, not the length of p?
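In case it helps, here is a rough sketch of how the two checks could be set up. The helper names (`make_observed`, `build_model`), the Dirichlet prior on p, and the specific sizes are my own assumptions for illustration, not taken from the original model. It relies on PyMC treating a NumPy masked array passed as `observed` as data with missing values.

```python
import random


def make_observed(n_obs, n_classes, n_missing, seed=0):
    """Hypothetical helper: draw n_obs class labels in [0, n_classes)
    and replace n_missing of them with None to mark missing values."""
    rng = random.Random(seed)
    data = [rng.randrange(n_classes) for _ in range(n_obs)]
    for i in rng.sample(range(n_obs), n_missing):
        data[i] = None
    return data


def build_model(obs, n_classes):
    """Hypothetical helper: build a Categorical model whose observed
    data contains missing entries. The Dirichlet prior is an assumption."""
    import numpy as np
    import pymc as pm

    # Masked entries are what PyMC treats as missing; the fill value 0
    # under the mask is a placeholder and is not used as data.
    mask = np.array([v is None for v in obs])
    values = np.array([0 if v is None else v for v in obs])
    observed = np.ma.masked_array(values, mask=mask)

    with pm.Model() as model:
        p = pm.Dirichlet("p", a=np.ones(n_classes))
        pm.Categorical("x", p=p, observed=observed)
    return model


# Check 1: as many observations as classes (len(data) == len(p)).
obs_equal = make_observed(n_obs=4, n_classes=4, n_missing=1)

# Check 2: fewer observations than classes (len(data) < len(p)).
obs_fewer = make_observed(n_obs=3, n_classes=5, n_missing=1)
```

Calling `build_model(obs_equal, 4)` and then `build_model(obs_fewer, 5)` and sampling inside each model should be enough to see whether the error goes away in the first case, and what the largest imputed class is in the second. The name of the imputed variable in the trace depends on the PyMC version, so check the trace's variable names rather than relying on a fixed one.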

If I’m right I think I can track the bug down pretty quick.
