Since it seems like you cannot specify the axis as with the softmax function in Tensorflow, reshaping is the work-around, but what I would like to confirm, is the softmax applied to the last axis (-1) then?
Since it seems like you cannot specify the axis as with the softmax function in Tensorflow, reshaping is the work-around, but what I would like to confirm, is the softmax applied to the last axis (-1) then?