Hello, I have this answer for the Exercise 4.6 but I'm not sure if it's right?
Because

for any

, very small weights are still as powerful as large weights (all that matters is the accuracy of the calculations that computer being able to perform): That also means a hyperplane can be represented by many hypotheses, constraining the weights can reduce the number of hypotheses represents the same hyperplane. Hence soft-order constraint will be able to reduce the

component while likely not compromising the

component.
----------------------------------------
Edit: I have just remembered that the growth function has already taken care of the issue many hypotheses representing the same hyperplane (and this issue does not affect the

component anyway (?)). So in this case the answer should be the hard-order constraint...? I'm really confused right now.