Paper Summary: WSABIE: Scaling Up To Large Vocabulary Image Annotation

Last updated: 05 Oct 2017

Please note: this post is mainly intended for my personal use. It is not peer-reviewed work and should not be taken as such.

WHAT

The authors propose a ranking loss function, WARP (Weighted Approximate-Rank Pairwise) loss, and use it to learn a) a model that ranks tags for a given image and b) embeddings for both the images and the tags, in a single shared vector space.
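
A minimal sketch of the shared-space scoring idea, in plain Python. The linear image map `V`, the tag-embedding table `W`, the toy sizes, and every function name here are assumptions made for illustration, not code from the paper: an image feature vector is projected into the embedding space, and each tag is scored by the dot product of its embedding with that projection.

```python
# Hypothetical toy setup: 3-dim image features, 2-dim shared space, 3 tags.

def matvec(matrix, vector):
    """Multiply a matrix (given as a list of rows) by a vector."""
    return [sum(m * v for m, v in zip(row, vector)) for row in matrix]

def dot(a, b):
    """Dot product of two equal-length vectors."""
    return sum(x * y for x, y in zip(a, b))

def embed_image(V, x):
    """Project image features x into the shared space (assumed linear map V)."""
    return matvec(V, x)

def score_tags(V, W, x):
    """Score every tag: dot product of its embedding with the image embedding."""
    z = embed_image(V, x)
    return [dot(w, z) for w in W]

def rank_tags(V, W, x):
    """Return tag indices sorted from highest to lowest score."""
    scores = score_tags(V, W, x)
    return sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)

V = [[1.0, 0.0, 0.0],
     [0.0, 1.0, 0.0]]          # 2 x 3 image projection (made-up values)
W = [[1.0, 0.0],
     [0.0, 1.0],
     [-1.0, -1.0]]             # one 2-dim embedding per tag (made-up values)
x = [2.0, 1.0, 5.0]            # made-up image features

print(score_tags(V, W, x))     # [2.0, 1.0, -3.0]
print(rank_tags(V, W, x))      # [0, 1, 2]
```

Because images and tags live in the same space, ranking tags for an image reduces to one projection plus one dot product per tag.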

CLAIMS

Outperforms the baselines on precision@1 and precision@10 in image annotation tasks.
Much faster, and consumes much less memory, than the alternatives it is compared against.
Ensemble models using different types of image features perform even better.
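
For reference, precision@k is the fraction of the top k predicted tags that are correct. A small sketch, with a hypothetical function name and made-up tags:

```python
def precision_at_k(ranked_tags, true_tags, k):
    """Fraction of the k highest-ranked tags that appear in true_tags."""
    if k <= 0:
        raise ValueError("k must be positive")
    top_k = ranked_tags[:k]
    hits = sum(1 for tag in top_k if tag in true_tags)
    return hits / k

ranked = ["dog", "cat", "grass", "car", "sky"]   # model output, best first
truth = {"dog", "grass"}                          # correct tags for the image

print(precision_at_k(ranked, truth, 1))   # 1.0
print(precision_at_k(ranked, truth, 5))   # 0.4
```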

NOTES

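Aims to optimize precision@k directly, i.e. the quality of the top of the ranked tag list.
Computes a score for each (instance, label) pair: the dot product of the image embedding and the label embedding in the shared space.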
Supervised approach to learning embeddings.
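
A rough sketch of how a sampled WARP-style loss can be computed for a single (image, correct tag) pair, as I understand the paper's description. The function names, the margin of 1, and the 1/j rank weighting are assumptions chosen for illustration. The gradient update that would follow is left out. Negative tags are sampled until one violates the margin, and the number of samples needed gives a cheap estimate of the correct tag's rank.

```python
import random

def rank_weight(rank):
    """L(rank) = sum of 1/j for j = 1..rank; the 1/j weighting is an assumed choice."""
    return sum(1.0 / j for j in range(1, rank + 1))

def warp_loss_one_example(scores, positive, rng, margin=1.0):
    """Sampled WARP-style loss for one (image, positive tag) pair.

    scores: model scores for a single image, one per tag.
    positive: index of the correct tag.
    Returns (loss, index of the violating negative tag, or None).
    """
    num_tags = len(scores)
    negatives = [i for i in range(num_tags) if i != positive]
    max_trials = num_tags - 1
    for trials in range(1, max_trials + 1):
        negative = rng.choice(negatives)              # sample with replacement
        violation = margin - scores[positive] + scores[negative]
        if violation > 0:
            # Fewer trials needed means more violators, so a worse estimated rank.
            estimated_rank = (num_tags - 1) // trials
            return rank_weight(estimated_rank) * violation, negative
    return 0.0, None                                  # no violator found: no update

rng = random.Random(0)
scores = [2.0, 1.5, -3.0, 0.2]    # made-up scores for 4 tags
loss, negative = warp_loss_one_example(scores, positive=0, rng=rng)
print(loss, negative)
```

In this toy example only tag 1 is within the margin of the correct tag, so any non-zero loss comes from that pair. Sampling avoids scoring every tag on every update, which is presumably where much of the speed advantage on large vocabularies comes from.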

REFERENCES

http://www.thespermwhale.com/jaseweston/papers/wsabie-ijcai.pdf

Felipe 05 Oct 2017 paper-summary embeddings tags