View Single Post
Old 07-27-2012, 02:35 AM
htlin's Avatar
htlin htlin is offline
Join Date: Aug 2009
Location: Taipei, Taiwan
Posts: 601
Default Re: Machine learning with vector images

Originally Posted by Ubermensch View Post
In the digit recognition example, the pixels of the image are split into two features, density and symmetry, to identify the digits. In case of raster graphics, the pixels are the primary unit of which ML could be done and they are uniform.

But in case of vector graphics like SVG, or co-ordinate systems, the co-ordinates and paths are explicit or denoted by a mathematical function. How could the image recognition analogy apply to this problem?

Taking the example of a SVG file, I can parse the co-ordinates and path transformations but how could I put them into matrix form since each SVG file would have different number of co-ordinates and path transformations (and of course attributes for color,strokes, etc).
Interesting question and I believe it is an ongoing research issue. Of course, we can try to extract fixed-length feature vectors using ideas similar to those used in pixel-based image learning. Or, there are some learning algorithms that do not necessarily require using fixed-length vectors for representing the data, and can thus be possibly applied to the SVG learning case you are describing. For instance,

* Nearest Neighbor models, which generally only require a suitable similarity function.
* Kernelized Support Vector Machines, which requires the instances to map to a "computable" inner-product space

Some related discussions can be found in the following workshop:

Hope this helps.
When one teaches, two learn.
Reply With Quote