Fitting multimodal data

BlueMan66 · November 3, 2021, 1:11pm

I’m trying to fit the following distribution to which I only have access to the binned data:

I’ve tried the following mixture function:

def mixture_density(w, alpha, beta, scalling, x):
    logp = tt.log(w) + pm.Normal.dist(mu=alpha,sigma=beta).logp(x)
    return scalling * tt.sum(tt.exp(logp))

and the model I’ve been using is the following:

with pm.Model() as frequency_model:
    
    alpha_1 = pm.HalfNormal('Alpha_1', sigma= 1.)
    beta_1 = pm.HalfNormal('Beta_1', sigma= 1., )
    
    alpha_2 = pm.HalfNormal('Alpha_2', sigma= 1.)
    beta_2 = pm.HalfNormal('Beta_2', sigma= 1., )
    
    mix_1 = mixture_density(alpha_1, beta_1, x = frequency)
    mix_2 = mixture_density(alpha_2, beta_2, x = frequency)
    
    noise_1 = pm.HalfNormal('noise_1', sigma = 1)
    noise_2 = pm.HalfNormal('noise_2', sigma = 1)
        
    N_1 = pm.Normal('N_1', mu = mix_1, sigma=noise_1)
    N_2 = pm.Normal('N_2', mu = mix_2, sigma=noise_2)
    
    w = pm.Dirichlet('w', a=np.array([1.,1.]))
    
    likelihood = pm.Mixture('Likelihood',w=w, comp_dists=[N_1, N_2], observed=frequency_density)
           
    trace = pm.sample(draws=3000, chains = 4, tune=2000, target_accept=0.8)

The results from the nuts sampler have been inconclusive and the model can’t distinguish between both distributions. Is there a better way to do this? Or should I split the dataset and adjust each mode independently (I’ve had some success with that)
Thanks for your answers!!

Topic		Replies	Views
Fitting a histogram Questions	8	1606	November 27, 2020
Frequency Data Modelling Questions	0	345	October 27, 2021
Badly fitting DPMM Questions	1	417	February 14, 2019
Help with fitting mixture -- data is fit much worse than fitting the distributions separately v5 modeling	0	20	July 24, 2024
Fitting joint distributions using custom distribution as part of mixture Questions	3	482	January 13, 2020

Fitting multimodal data

Related topics