1. If g has multiple parameters, do you average the curves, or do you average the individual parameters (or does it matter)?

You average the value of the hypothesis as a function of $x$, so 'you average the curves.'
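On the "does it matter" part: when the model is linear in its parameters, averaging the parameters gives the same curve as averaging the curves; in general it does not. A minimal sketch of this, assuming a sin(pi x) target, two-point data sets, and a made-up nonlinear model exp(w x) chosen only for illustration:

```python
import numpy as np

rng = np.random.default_rng(0)
K = 1000                              # number of simulated data sets (assumed)
x_grid = np.linspace(-1, 1, 50)       # points at which the curves are compared

# Model linear in its parameters: h(x) = a*x + b, fit exactly through
# two points drawn from the assumed target sin(pi*x).
X = rng.uniform(-1, 1, size=(K, 2))
Y = np.sin(np.pi * X)
a = (Y[:, 1] - Y[:, 0]) / (X[:, 1] - X[:, 0])   # slope for each data set
b = Y[:, 0] - a * X[:, 0]                        # intercept for each data set

avg_of_curves = (a[:, None] * x_grid + b[:, None]).mean(axis=0)
curve_of_avg_params = a.mean() * x_grid + b.mean()
linear_same = bool(np.allclose(avg_of_curves, curve_of_avg_params))

# Hypothetical model that is nonlinear in its parameter: h(x) = exp(w*x).
# The w values are drawn at random here instead of being fit to data.
w = rng.normal(size=K)
avg_of_curves_nl = np.exp(w[:, None] * x_grid).mean(axis=0)
curve_of_avg_params_nl = np.exp(w.mean() * x_grid)
nonlinear_same = bool(np.allclose(avg_of_curves_nl, curve_of_avg_params_nl))

print("linear model, same curve:   ", linear_same)
print("nonlinear model, same curve:", nonlinear_same)
```

Averaging the curves is the safe reading in both cases, since it is what the definition asks for.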
2. When the book says we can think of it this way, does it mean this is not the exact definition?

This would be a finite-sample estimate of $\bar{g}(x)$ that gets closer and closer to the expected value (which is the formal definition) as you use more and more data sets.
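A small simulation can illustrate the convergence. This is only a sketch under assumed settings: the target is sin(pi x) on [-1, 1], each data set has two points, and the model is the constant h(x) = b, for which the expected value is exactly 0 because sin(pi x) averages to zero over a uniform input. The function name `g_bar_estimate` is made up for this example.

```python
import numpy as np

def g_bar_estimate(K, rng):
    """Average of K fitted constant hypotheses h(x) = b (finite-sample estimate)."""
    X = rng.uniform(-1, 1, size=(K, 2))   # K data sets of two points each
    Y = np.sin(np.pi * X)                 # assumed target function
    b = Y.mean(axis=1)                    # least-squares constant for each data set
    return b.mean()                       # average over the K data sets

rng = np.random.default_rng(0)
estimates = {K: g_bar_estimate(K, rng) for K in (10, 1000, 100000)}
for K, est in estimates.items():
    print(f"K = {K:6d}   estimate = {est:+.4f}   (expected value: 0)")
```

The estimate fluctuates for small K and settles near the expected value as K grows, which is the sense in which the averaging picture approximates the formal definition.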