Not quite, you may also have to change the order of the multiplication in your network. Right now it actually looks like
A1 = f(XW0)
A2 = f(A1W1)
A3 = f(A2W2)
which is all good – and convention (i). So it just looks like the weight sizes are a little off. They should be
w_1_2 ~ (53, 128) so that (161533, 53) x (53, 128) -> (161533, 128)
w_2_3 ~ (128, 64) so that (161533, 128) x (128, 64) -> (161533, 64)
and so on.