SZTAKI HLT | A two-level classifier for discriminating similar languages

A two-level classifier for discriminating similar languages

Judit Ács, László Grad-Gyenge, Thiago Bruno Rodrigues de Rezende Oliveira
In Proceedings of the Workshop on Language Technology for Closely Related Languages, Varieties and Dialects, LT4VarDial, 2015

ACLWEB

The BRUniBP team's submission is presented for the Discriminating between Similar Languages Shared Task 2015. Our method is a two phase classifier that utilizes both character and word-level features. The evaluation shows 100% accuracy on language group identification and 93.66% accuracy on language identification. The main contribution of the paper is a memory-efficient correlation based feature selection method.

Citation
@article{Acs:2015,
  title={A two-level classifier for discriminating similar languages},
  author={Acs, Judit and Grad-Gyenge, L{\'a}szl{\'o} and de Rezende Oliveira, Thiago Bruno Rodrigues},
  journal={Proceedings of the Workshop on Language Technology for Closely Related Languages, Varieties and Dialects, LT4VarDial},
  volume={15},
  pages={139--145},
  year={2015}
}