![]() |
movie ratings
What is the intuition behind the form of the hypothesis function for movie ratings?
I'm trying to understand why it makes sense to multiply user factor by movie factor, to get that factor's contribution to the rating. E.g. if user doesn't like horror movies and the movie has a low "horror movie" rating, multiplying these together gives a low number. Shouldn't the rating be based on the distance/difference between a user's value for a factor and a movie's value for that factor? I understand that in a learning situation the factors do not have specific interpretations -- there is just a list of factors. Still, the motivation was clearly that there are factors (horror-ness, comedy-ness etc). So what is the motivation behind taking the product of factors instead of some form of their difference? |
Re: movie ratings
Quote:
|
Re: movie ratings
Ah, if each vector is normalized to unit length then this makes sense. But, there is no way to constrain the vector component values during gradient descent so that the vectors stay at unit length. Or is it that each vector is normalized every time we compute the dot product? I understand wanting the vectors to point in the same direction, but the vector magnitude seems like a distraction.
I know that model parameters needn't have a human-understandable interpretation (cf. hidden layers of neural networks), but if they do, it helps to see that the intuition makes sense :) |
Re: movie ratings
Quote:
|
Re: movie ratings
Got it. I guess the learning algorithm cares most about the number of parameters, and forcing normalization would only reduce that by two.
Point of learning is to not have to guess the target function or even its form, but it's hard to resist micromanaging the process :) Thanks! |
Re: movie ratings
Quote:
|
All times are GMT -7. The time now is 03:18 PM. |
Powered by vBulletin® Version 3.8.3
Copyright ©2000 - 2021, Jelsoft Enterprises Ltd.
The contents of this forum are to be used ONLY by readers of the Learning From Data book by Yaser S. Abu-Mostafa, Malik Magdon-Ismail, and Hsuan-Tien Lin, and participants in the Learning From Data MOOC by Yaser S. Abu-Mostafa. No part of these contents is to be communicated or made accessible to ANY other person or entity.