What would be the correct way of modelling logistic regression output p(C=1|X) uncertainty?

tomkov · March 5, 2021, 12:26pm

Hello,

As the topic says, i am interested in modelling logistic regression output p(C=1|X) uncertainty?
So given some input feature vector X=(X_1,…,X_n), i would like to know p(C=1|X), but also uncertainty as in modelling coin throw with Beta distribution? I.e. for two “similar” input, there could be same expectation E[p(C=1|X_1)] = E[p(C=1|X_2)] = 0.3, but var[p(C=1|X_1)] != var[p(C=1|X_2)]?

If we model this with “standard” Bayesian logistic regression (with Bernoulli observed RV), posterior predictive distribution will provide us with realisations of Bernoulli RV - in this case, we can only estimate p as mean of those realisations, while variance is defined as p*q, meaning that we are more certain about the classification output that is closer to 1 or 0, but we do not have hierarchical modelling of p.

Am I missing something here?

ricardoV94 · March 5, 2021, 1:30pm

You also get some uncertainty from the coefficients of your model. This can lead to a higher posterior uncertainty for models that have the same expectation (This is to say that p will not be fixed).

I am not sure about the analogy you are making with a hierarchical beta-binomial model. Can you elaborate?

Topic		Replies	Views
Sensible prior for logistic regression with one or no input variable Questions	2	456	December 5, 2021
Why does my logistic regression model summary give a mean for each observation? version agnostic	1	468	August 26, 2022
Observational error in logistic regression v5	6	374	March 2, 2023
Concepts of Parameter Estimation and Predictions, and Out of Sample Predicted Probability for Logistic Regression Questions	5	1438	May 11, 2018
Bambi logistic regression, prior and posterior distributions of probabilities Questions bambi	4	1427	October 7, 2021

What would be the correct way of modelling logistic regression output p(C=1|X) uncertainty?

Related topics