Choosing the right likelihood function

I have a question regarding the choise of the likelihood distribution. Lets assume the simple case of a normal distributed prior used in inference with some dataset evidence with an uknown distribution. If we fabricate this data using a normal distribution, choosing the right likelihood distribution is trivial, but if the datapoints are to few to be certain. what does the choise of likelihood distribution matter in pymc3?

 dataset = data of length x with unknown distribution
 prior = pm.Normal("prior", mu=5,sd=1)
 likelihood = pm.Normal("likelihood", mu= prior, observed = dataset)

Here the likelihood function has a normal distribution, but what if i were to choose a beta distribution here?

Thank you

You can start narrowing down the choice of reasonable likelihoods with any constraints. If you know the data is going to be purely positive, then you can rule out distributions that assign data to the entire real line. For discrete counts you can use the Poisson or negative binomial.

One line of reasoning for likelihood selection is that you should use the maximum entropy distribution that satisfies any constraints you have. For a random variable with finite mean and variance, the max entropy distribution is the normal distribution. Can you supply any constraints on the possible values of the observed data?