Classifying the Hungarian Web

András Kornai, Marc Krellenstein, Michael Mulligan, David Twomey, Fruzsina Veress, Alec Wysoker
In Proceedings of the EACL, 2003


In this paper we present some lessons learned from building VIZSLA, the keyword search and topic classification system used on the largest Hungarian portal, []. Based on a simple statistical language model and large-scale supporting evidence from VIZSLA, we argue that in topic classification only positive evidence matters.

