I was trying to use a one-hidden-layer neural network to predict observed rainfall (the target value, the red dots), which should never be negative. The result for each day contains 1000 predictions, shown as a probability density in different shades of blue (similar to a posterior distribution, but all plotted together; the green dot is one of the inputs). I also tried a two-layer model and several different ranges of mu and sd for the hidden-layer weights, but that didn't change the results much.
Is there any method to constrain the predictions to be non-negative? Any information is appreciated. Thanks a lot.
import numpy as np
import pymc3 as pm
import theano.tensor as T

with pm.Model() as neural_network:
    # hidden-layer weights: Uniform prior on [-1, 1]
    weights_in_1 = pm.Uniform('w_in_1', -1, 1,
                              shape=(X.shape[1], n_hidden1),
                              testval=init_1)
    # output weights: Normal prior
    weights_1_out = pm.Normal('w_1_out', w_1_out_mu, sd=w_1_out_sd,
                              shape=(n_hidden1,),
                              testval=init_out)
    act_1 = pm.math.tanh(pm.math.dot(ann_input, weights_in_1))
    regression = T.dot(act_1, weights_1_out)
    out = pm.Normal('out', mu=regression, sd=np.sqrt(0.9), observed=ann_output)
but I got this error message:

ValueError: Bad initial energy: nan. The model might be misspecified.

Then I widened the Uniform bounds to [-10000, 10000], and it worked.
I am quite curious: is trial and error like this a good way to assign a prior? (The same goes for the bounds; I am not sure, since I just try values and see whether sampling works.) Many papers and articles say the prior encodes how much information you have about the data before observing it, but in this case I don't really know what that information would be.
Is it possible to assign a different mu and sd to each node in the same hidden layer with the functions provided? Right now (see above), every weight in the first hidden layer has mu = 0 and sd = 1.
If you print the logp of every node in the model and there is no inf or nan (see e.g. Getting 'Bad initial energy: inf' when trying to sample simple model), but sampling with the default trace = pm.sample(1000) throws an error before the first sample, it is quite likely that the jitter in the default initialization jitter+adapt_diag makes some of the initial values invalid. For now you can set either init='adapt_diag' or init=None. We are in the process of making this more robust.