It might just come down to running a boatload of models and computing MAPE/RMSE/whatever for the different specifications, especially if you are forecasting lots of products that might each respond differently to the number of lags. But I think shrinkage is a good, principled way to handle the problem.
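To make both options concrete, here is a rough sketch rather than a recipe. It assumes a plain autoregressive model fit by least squares, a simulated AR(2) series standing in for one product, a holdout of the last 50 points, and ridge regression as the form of shrinkage. The helper names (`lag_matrix`, `fit_ridge`, `evaluate`) and the penalty value are made up for illustration.

```python
import numpy as np

def lag_matrix(y, p):
    """Return (X, target): row i of X holds the p values preceding target[i]."""
    n = len(y)
    X = np.column_stack([y[p - k : n - k] for k in range(1, p + 1)])
    return X, y[p:]

def fit_ridge(X, t, lam):
    """Ridge coefficients via the normal equations; lam=0 gives plain OLS."""
    p = X.shape[1]
    return np.linalg.solve(X.T @ X + lam * np.eye(p), X.T @ t)

def evaluate(y, p, lam=0.0, n_test=50):
    """Fit an AR(p) on all but the last n_test points, score one-step forecasts."""
    mu = y[:-n_test].mean()            # center with the training mean only
    X, t = lag_matrix(y - mu, p)
    beta = fit_ridge(X[:-n_test], t[:-n_test], lam)
    pred = X[-n_test:] @ beta + mu
    actual = t[-n_test:] + mu
    err = actual - pred
    rmse = float(np.sqrt(np.mean(err ** 2)))
    mape = float(np.mean(np.abs(err / actual)) * 100)
    return rmse, mape, beta

# Simulated stand-in for one product's history (assumption: AR(2) around 100).
rng = np.random.default_rng(0)
n = 300
z = np.zeros(n)
for i in range(2, n):
    z[i] = 0.6 * z[i - 1] - 0.3 * z[i - 2] + rng.normal()
y = 100.0 + z

# Option 1: brute force over lag counts, compare holdout error.
scores = {p: evaluate(y, p)[:2] for p in range(1, 7)}
best_p = min(scores, key=lambda p: scores[p][0])
for p, (rmse, mape) in scores.items():
    print(f"lags={p}  RMSE={rmse:.3f}  MAPE={mape:.3f}%")
print("lowest holdout RMSE at lags =", best_p)

# Option 2: keep a generous number of lags and shrink the coefficients.
_, _, beta_ols = evaluate(y, 6, lam=0.0)
_, _, beta_ridge = evaluate(y, 6, lam=50.0)   # lam=50 is an arbitrary choice
print("OLS coefficients:  ", np.round(beta_ols, 3))
print("ridge coefficients:", np.round(beta_ridge, 3))
```

With many products you would loop the first option over each series, which is where it gets expensive. The second option fits one model per series with more lags than you expect to need and lets the penalty pull the unhelpful ones toward zero; in practice the penalty itself would be chosen by cross-validation rather than fixed by hand.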