Quote:
Originally Posted by pyguy
I'm trying to compute the stochastic gradient descent for linear regression, and the formula I'm using is:
I'm running the experiment with what I believe to be the correct inputs, but I'm not getting the what I expect to be the output, so I'm trying to trace my steps and see where I went wrong. I was looking at the formula, and one part that I was uncertain about was the part. Aren't and both 1x3 row vectors? If I transpose , and multiply, I'd get a 3x3 matrix which didn't make sense to me in the calculation, so I'm essentially multiplying them right now as if they were just 1x3 row vectors.

The convention is that vectors are column vectors, so
is an inner product, not an outer product.