Do we need a testing set?

More references and resources in @twiecki’s post.