SZTAKI HLT | Publications

Publications

2024
2023
2022
2021
G. Szolnok, B. Barta, D. Lakatos, J. Ács: BME Submission for SIGMORPHON 2021 Shared Task 0. A Three Step Training Approach with Data Augmentation for Morphological Inflection. In Proceedings of the 18th SIGMORPHON Workshop on Computational Research in Phonetics, Phonology, and Morphology. J. Bogensperger, S. Schlarb, A. Hanbury, G. Recski: DreamDrug - A crowdsourced NER dataset for detecting drugs in darknet markets. In Proceedings of the Seventh Workshop on Noisy User-generated Text (W-NUT 2021). J. Ács, D. Lévai, D. Nemeskey, A. Kornai: Evaluating Contextualized Language Models for Hungarian. In XVII. Magyar SzámítógépesNyelvészeti Konferencia. J. Ács, D. Lévai, A. Kornai: Evaluating Transferability of BERT Models on Uralic Languages. In Seventh International Workshop for Computational Linguistics of Uralic Languages (IWCLUL 2021). G. Recski, B. Lellmann, Á. Kovács, A. Hanbury: Explainable Rule Extraction via Semantic Graphs. In Proceedings of the Fifth Workshop on Automated Semantic Analysis of Information in Legal Text (ASAIL 2021). D. Nemeskey: Introducing huBERT. In XVII. Magyar Számítógépes Nyelvészeti Konferencia. K. Gémes, Á. Kovács, M. Reichel, G. Recski: Offensive text detection on English Twitter with deep learning models and rule-based systems. In FIRE 2021: Forum for Information Retrieval Evaluation. T. Pimentel, B. Barta, D. Lakatos, G. Szolnok, J. Ács, et al.: SIGMORPHON 2021 Shared Task on Morphological Reinflection: Generalization Across Languages. In Proceedings of the 18th SIGMORPHON Workshop on Computational Research in Phonetics, Phonology, and Morphology. J. Ács, Á. Kádár, A. Kornai: Subword Pooling Makes a Difference. In 16th Conference of the European Chapter of the Association for Computational Linguistics (EACL21). K. Gémes, G. Recski: TUW-Inf at GermEval2021: Rule-based and Hybrid Methods for Detecting Toxic, Engaging, and Fact-Claiming Comments. In Proceedings of the GermEval 2021 Workshop on the Identification of Toxic, Engaging, and Fact-Claiming Comments. R. Csáky, G. Recski: The Gutenberg Dialogue Dataset. In Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: Volume 1, Long Papers. A. Kornai: Vocabulary: common or basic?. D. Lévai, V. Kulcsár: emPhon: Morphologically sensitive open-source phonetic transcriber. In XVII. Magyar Számítógépes Nyelvészeti Konferencia.
2020
2019
P. Ihász: A supplementary feature set for sentiment analysis in Japanese dialogues. In Transactions on Asian and Low-Resource Language Information Processing. Á. Kovács, E. Ács, J. Ács, A. Kornai, G. Recski: BME-UW at SR'19: Surface realization with Interpreted Regular Tree Grammars. In Proceedings of the 2nd Workshop on Multilingual Surface Rea lisation (MSR), 2019 Conference on Empirical Methods in Natural Language Processing (EMNLP). J. Ács, D. Nemeskey, G. Recski: Building word embeddings from dictionary definitions. In K + K = 120 Papers dedicated to László Kálmán and András Kornaion the occasion of their 60th birthdays. K. Gémes: Deep learning of graph transformations. In MSc Thesis. P. Ihász: Emotion Recognition through Intentional Context. In International Journal of Affective Engineering. E. Ács, G. Recski: Generating IRTG grammars from parallel data. In Proceedings of the Automation and Applied Computer Science Workshop 2019 : AACS'19. R. Csáky, P. Purgai, G. Recski: Improving Neural Conversational Models with Entropy-Based Data Filtering. In Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics (ACL). B. Döbrössy, M. Makrai, B. Tarján, G. Szaszák: Investigating sub-word embedding strategies for the morphologically rich and free phrase-order Hungarian. In Proc Repl4NLP. K. Gémes, Á. Kovács, G. Recski: Machine comprehension using semantic graphs. In Proceedings of the Automation and Applied Computer Science Workshop 2019 : AACS'19. B. Indig, B. Sass, E. Simon, I. Mittelholcz, N. Vadász, M. Makrai: One format to rule them all – The emtsv pipeline for Hungarian. In Proc The 13th Linguistic Annotation Workshop. E. Ács, Á. Holló-Szabó, G. Recski: Parsing noun phrases with Interpreted Regular Tree Grammars. In XV. Magyar Számítógépes Nyelvészeti Konferencia (MSZNY 2019). R. Csáky: Proposal Towards a Personalized Knowledge-powered Self-play Based Ensemble Dialog System. In arXiv. A. Kornai: Semantics. G. Borbély, A. Kornai: Sentence Length. In Proceedings of the 16th Meeting on the Mathematics of Language. D. Lévai, A. Kornai: The impact of inflection on word vectors. In MSZNY 2019. M. Castro-Bleda, E. Iklódi, G. Recski, G. Borbély: Towards a Universal Semantic Dictionary. In Applied Sciences 9(19). A. Kornai: Truth or dare. In Tokens of Meaning: Papers in Honor of Lauri Karttunen.
2018
G. Berend, M. Makrai, P. Földiák: 300-sparsans at SemEval-2018 Task 9: Hypernymy as interaction of sparse attributes. In SemEval. J. Ács: BME-HAS System for CoNLL--SIGMORPHON 2018 Shared Task: Universal Morphological Reinflection. In Proceedings of the SIGNLL Conference on Computational Natural Language Learning. G. Recski: Building concept definitions from explanatory dictionaries. In International Journal of Lexicography 31/3. D. Nemeskey, A. Kornai: Emergency Vocabulary. In Information Systems Frontiers (20/5). P. Ihász: Emotions and intentions mediated with dialogue acts. In Proceedings of 2018 5th International Conference on Business and Industrial Research (ICBIR). E. Ács, G. Recski: Evaluation of Universal Dependency parsers for Hungarian. In XIV. Magyar Számítógépes Nyelvészeti Konferencia (MSZNY 2018). G. Németh, J. Ács: Hyphenation using deep neural networks. In XIV. Magyar Számítógépes Nyelvészeti Konferencia. Á. Kovács, G. Recski: Knowledge base population using natural language inference. In Proceedings of Automation and Applied Computer Science Workshop. E. Ács, G. Recski: Semantic parsing with Interpreted Regular Tree Grammars. In Proceedings of the Automation and Applied Computer Science Workshop 2018 : AACS'18. K. Gémes, Á. Kovács: Semantic parsing with graph transformations. In Scientific Student's Assosiactions Report. Á. Kovács: Semantic parsing with graph transformations MSc Thesis. In Thesis. D. Lévai: The impact of inflection on word vectors, thesis. E. Ács: Universal Dependency parsing of English and Hungarian – an application for semantic parsing, MSc thesis.
2017
2016
G. Recski: Building concept graphs from monolingual dictionary entries. In Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC 2016). G. Recski, A. Bolevácz, G. Borbély: Building definition graphs using monolingual dictionaries of Hungarian. In XII. Magyar Számítógépes Nyelvészeti Konferencia (MSZNY 2016). G. Recski: Computational methods in semantics. G. Borbély, A. Kornai, M. Kracht, D. Nemeskey: Denoising composition in distributional semantics. In 28th European Summer School in Logic, Language and Information. A. Kornai, D. Nemeskey, G. Recski: Detecting Optional Arguments of Verbs. In Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC 2016). J. Ács, A. Kornai: Evaluating embeddings on dictionary-based similarity. In RepEval @ ACL 2016. G. Borbély, A. Kornai, M. Makrai, D. Nemeskey: Evaluating multi-sense embeddings for semantic resolution monolingually and in word translation. In repeval. M. Makrai: Filtering Wiktionary triangles by linear mapping between distributed word models. In Proceedings of 10th Edition of the Language Resources and Evaluation Conference. J. Ács, J. Halmi: Hunaccent: Small Footprint Diacritic Restoration for Social Media. In Normalisation and Analysis of Social Media Texts (NormSoMe) Workshop, LREC16. G. Recski, E. Iklódi, K. Pajkossy, A. Kornai: Measuring semantic similarity of words using concept networks. In Proceedings of the 1st Workshop on Representation Learning for NLP. K. Pajkossy, A. Zséder: The hunvec framework for NN-CRF-based sequential tagging. In Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC 2016).
2015
2014
2013
2012
2011
2010
2009
2008
2007
2006
2005
2004
2003
2024
2023
2022
2021
2020
2019
2018
2017
2016
2015
2014
2013
2012
2011
2010
2009
2008
2007
2006
2005
2004
2003
2024
2023
2022
2021
BME Submission for SIGMORPHON 2021 Shared Task 0. A Three Step Training Approach with Data Augmentation for Morphological Inflection. In Proceedings of the 18th SIGMORPHON Workshop on Computational Research in Phonetics, Phonology, and Morphology. DreamDrug - A crowdsourced NER dataset for detecting drugs in darknet markets. In Proceedings of the Seventh Workshop on Noisy User-generated Text (W-NUT 2021). Evaluating Contextualized Language Models for Hungarian. In XVII. Magyar SzámítógépesNyelvészeti Konferencia. Evaluating Transferability of BERT Models on Uralic Languages. In Seventh International Workshop for Computational Linguistics of Uralic Languages (IWCLUL 2021). Explainable Rule Extraction via Semantic Graphs. In Proceedings of the Fifth Workshop on Automated Semantic Analysis of Information in Legal Text (ASAIL 2021). Introducing huBERT. In XVII. Magyar Számítógépes Nyelvészeti Konferencia. Offensive text detection on English Twitter with deep learning models and rule-based systems. In FIRE 2021: Forum for Information Retrieval Evaluation. SIGMORPHON 2021 Shared Task on Morphological Reinflection: Generalization Across Languages. In Proceedings of the 18th SIGMORPHON Workshop on Computational Research in Phonetics, Phonology, and Morphology. Subword Pooling Makes a Difference. In 16th Conference of the European Chapter of the Association for Computational Linguistics (EACL21). TUW-Inf at GermEval2021: Rule-based and Hybrid Methods for Detecting Toxic, Engaging, and Fact-Claiming Comments. In Proceedings of the GermEval 2021 Workshop on the Identification of Toxic, Engaging, and Fact-Claiming Comments. The Gutenberg Dialogue Dataset. In Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: Volume 1, Long Papers. Vocabulary: common or basic?. emPhon: Morphologically sensitive open-source phonetic transcriber. In XVII. Magyar Számítógépes Nyelvészeti Konferencia.
2020
2019
A supplementary feature set for sentiment analysis in Japanese dialogues. In Transactions on Asian and Low-Resource Language Information Processing. Az ellenforradalmár. In Nyelv, biológia, szabadság. A 90 éves Chomsky jelentősége a tudományban és azon túl. BME-UW at SR'19: Surface realization with Interpreted Regular Tree Grammars. In Proceedings of the 2nd Workshop on Multilingual Surface Rea lisation (MSR), 2019 Conference on Empirical Methods in Natural Language Processing (EMNLP). Building word embeddings from dictionary definitions. In K + K = 120 Papers dedicated to László Kálmán and András Kornaion the occasion of their 60th birthdays. Deep learning of graph transformations. In MSc Thesis. Emotion Recognition through Intentional Context. In International Journal of Affective Engineering. Generating IRTG grammars from parallel data. In Proceedings of the Automation and Applied Computer Science Workshop 2019 : AACS'19. Improving Neural Conversational Models with Entropy-Based Data Filtering. In Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics (ACL). Investigating sub-word embedding strategies for the morphologically rich and free phrase-order Hungarian. In Proc Repl4NLP. Machine comprehension using semantic graphs. In Proceedings of the Automation and Applied Computer Science Workshop 2019 : AACS'19. One format to rule them all – The emtsv pipeline for Hungarian. In Proc The 13th Linguistic Annotation Workshop. Parsing noun phrases with Interpreted Regular Tree Grammars. In XV. Magyar Számítógépes Nyelvészeti Konferencia (MSZNY 2019). Proposal Towards a Personalized Knowledge-powered Self-play Based Ensemble Dialog System. In arXiv. Semantics. Sentence Length. In Proceedings of the 16th Meeting on the Mathematics of Language. The impact of inflection on word vectors. In MSZNY 2019. Towards a Universal Semantic Dictionary. In Applied Sciences 9(19). Truth or dare. In Tokens of Meaning: Papers in Honor of Lauri Karttunen.
2018
300-sparsans at SemEval-2018 Task 9: Hypernymy as interaction of sparse attributes. In SemEval. A szöveg mint skálafüggetlen hálózat. In XIV. Magyar Számítógépes Nyelvészeti Konferencia. BME-HAS System for CoNLL--SIGMORPHON 2018 Shared Task: Universal Morphological Reinflection. In Proceedings of the SIGNLL Conference on Computational Natural Language Learning. Building concept definitions from explanatory dictionaries. In International Journal of Lexicography 31/3. Emergency Vocabulary. In Information Systems Frontiers (20/5). Emotions and intentions mediated with dialogue acts. In Proceedings of 2018 5th International Conference on Business and Industrial Research (ICBIR). Evaluation of Universal Dependency parsers for Hungarian. In XIV. Magyar Számítógépes Nyelvészeti Konferencia (MSZNY 2018). Hibrid nyelvtechnológiák. In Magyar Tudomány 2018/6. Hyphenation using deep neural networks. In XIV. Magyar Számítógépes Nyelvészeti Konferencia. Knowledge base population using natural language inference. In Proceedings of Automation and Applied Computer Science Workshop. Semantic parsing with Interpreted Regular Tree Grammars. In Proceedings of the Automation and Applied Computer Science Workshop 2018 : AACS'18. Semantic parsing with graph transformations. In Scientific Student's Assosiactions Report. Semantic parsing with graph transformations MSc Thesis. In Thesis. Szemantika. Természetes nyelvi interfész menetrend- és utazástervező szolgáltatásokhoz. In XIV. Magyar Számítógépes Nyelvészeti Konferencia (MSZNY 2018). The impact of inflection on word vectors, thesis. Universal Dependency parsing of English and Hungarian – an application for semantic parsing, MSc thesis.
2017
2016
2015
2014
2013
2012
2011
2010
2009
2008
2007
2006
2005
2004
2003