Identifiable up to rearrangement

Hamster_on_wheels · May 17, 2019, 1:20pm

Can NUTS help me to find a set of parameters with largest likelihood if my model is identifiable only up to rearrangement?

One of the parameter is a matrix \mathcal{A}. Let’s say matrix A_{1} is a possible value of \mathcal{A}.

I can form a new matrix A_{2} by the following two steps:

swap i^{\text{th}} and j^{\text{th}} row of A_{1} to form B
swap i^{\text{th}} and j^{\text{th}} column of B to form A_{2}

Likelihood of A_{1} is same as likelihood of A_{2}.

I’m using a Dirichlet prior for \mathcal{A}.

Hamster_on_wheels · May 17, 2019, 3:15pm

Something like transform.ordered might work. But I need to sort both row and columns by the ordering of row sums.

junpenglao · May 25, 2019, 6:13am

Nope, most likely NUTS will try to do sample all the space and failed, see eg Identifying Bayesian Mixture Models

One solution I can think of is to port the permutation bijector from TFP into a transformer in pymc3.

github.com

tensorflow/probability/blob/main/tensorflow_probability/python/bijectors/permute.py

# Copyright 2018 The TensorFlow Probability Authors.
#
# Licensed under the Apache License, Version 2.0 (the "License");
# you may not use this file except in compliance with the License.
# You may obtain a copy of the License at
#
#     http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing, software
# distributed under the License is distributed on an "AS IS" BASIS,
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
# See the License for the specific language governing permissions and
# limitations under the License.
# ============================================================================
"""Permutation bijectors."""

# Dependency imports
import numpy as np

import tensorflow.compat.v2 as tf

This file has been truncated. show original

One solution

Hamster_on_wheels · May 28, 2019, 9:33pm

Is the following post-processing of the trace enough to fix this problem?

For each value of matrix \mathcal{A} in the trace,

Calculate indices = np.argsort(np.sum(A, axis=0))
Reorder the matrix by A = A[indices, :][:, indices]

junpenglao · May 29, 2019, 5:00am

Well, it is certainly one way to do it, but if you have within trace mode exploring than it would be quite difficult.

Hamster_on_wheels · May 29, 2019, 3:29pm

Would the identifiability problem mess up the tuning of the mass matrix and NUTS?
What is “within trace mode exploring”?

junpenglao · May 29, 2019, 3:42pm

I would say no as the mass matrix estimation is over the whole posterior, so label switching wont effect that

I meant mode switching within chain

Hamster_on_wheels · May 29, 2019, 3:50pm

Would the permutation bijector avoid the mode switching problem?

I think mode switching within chain would affect estimation of the distribution.
But I only want to find the best parameters for the model and don’t need an estimation of the variance now. So I guess the post-processing would work for me.

Topic		Replies	Views
Divergences and label-switching in a Dirichlet process mixture model Questions	2	1330	July 18, 2021
Label switching in multivariate mixtures Questions	0	764	September 28, 2020
Constraining order in a Mixture model Questions	6	1435	August 9, 2021
Constrain one variable to be greater than another Questions	14	3508	January 30, 2018
Why does `transform=pm.distributions.transforms.ordered` lead to worse convergence? Questions	3	1571	August 27, 2021

Identifiable up to rearrangement

Related topics