SZTAKI HLT | SZTAKI @ ImageCLEF 2010

SZTAKI @ ImageCLEF 2010

Bálint Daróczy, István Petrás, András Benczúr, Dávid Márk Nemeskey, Róbert Pethes

In CLEF (Notebook Papers/Labs/Workshop), 2010

PDF

Our approach to the ImageCLEF 2010 tasks is based on Histogram of Oriented Gradients descriptors (HOG) and Okapi BM25 based text retrieval. We extracted feature vectors to describe the visual content of an image region or the entire image. We trained a Gaussian Mixture Model (GMM) to cluster the feature vectors extracted from the image regions. To represent each image with only one vector we computed a Bag-of-Words (BOW) model from GMM probabilities of HOG descriptors. We trained linear regression classificators for the Photo Annotation task. To improve our textual ranking in the Wikipedia MM task we successfully expanded the textual query based on the visual ranking.