SZTAKI HLT | BME-UW at SRST-2019: Surface realization with Interpreted Regular Tree Grammars

BME-UW at SRST-2019: Surface realization with Interpreted Regular Tree Grammars

Ádám Kovács
Oct. 21, 2019, 9:30
SZTAKI, Kende Street, Great Council Hall

Ádám Kovács, Evelin Ács, Judit Ács, András Kornai, Gábor Recski

The Surface Realization Shared Task involves mapping Universal Dependency graphs to raw text, i.e.~restoring word order and inflection from a graph of typed, directed dependencies between lemmas. Interpreted Regular Tree Grammars (IRTGs) encode the correspondence between generations in multiple algebras, and have previously been used for semantic parsing from raw text. Our system induces an IRTG for simultaneously building pairs of surface forms and UD graphs in the SRST training data, then prunes this grammar for each UD graph in the test data for efficient parsing and generation of the surface ordering of lemmas. For the inflection step we use a standard sequence-to-sequence model with a biLSTM encoder and an LSTM decoder with attention. Both components of our system are available on GitHub under an MIT license.