SZTAKI HLT | Publikációk

Publikációk

2024
2023
2022
2021
G. Szolnok, B. Barta, D. Lakatos, J. Ács: BME Submission for SIGMORPHON 2021 Shared Task 0. A Three Step Training Approach with Data Augmentation for Morphological Inflection. In Proceedings of the 18th SIGMORPHON Workshop on Computational Research in Phonetics, Phonology, and Morphology. J. Bogensperger, S. Schlarb, A. Hanbury, G. Recski: DreamDrug - A crowdsourced NER dataset for detecting drugs in darknet markets. In Proceedings of the Seventh Workshop on Noisy User-generated Text (W-NUT 2021). J. Ács, D. Lévai, D. Nemeskey, A. Kornai: Evaluating Contextualized Language Models for Hungarian. In XVII. Magyar SzámítógépesNyelvészeti Konferencia. J. Ács, D. Lévai, A. Kornai: Evaluating Transferability of BERT Models on Uralic Languages. In Seventh International Workshop for Computational Linguistics of Uralic Languages (IWCLUL 2021). G. Recski, B. Lellmann, Á. Kovács, A. Hanbury: Explainable Rule Extraction via Semantic Graphs. In Proceedings of the Fifth Workshop on Automated Semantic Analysis of Information in Legal Text (ASAIL 2021). D. Nemeskey: Introducing huBERT. In XVII. Magyar Számítógépes Nyelvészeti Konferencia. K. Gémes, Á. Kovács, M. Reichel, G. Recski: Offensive text detection on English Twitter with deep learning models and rule-based systems. In FIRE 2021: Forum for Information Retrieval Evaluation. T. Pimentel, B. Barta, D. Lakatos, G. Szolnok, J. Ács, et al.: SIGMORPHON 2021 Shared Task on Morphological Reinflection: Generalization Across Languages. In Proceedings of the 18th SIGMORPHON Workshop on Computational Research in Phonetics, Phonology, and Morphology. J. Ács, Á. Kádár, A. Kornai: Subword Pooling Makes a Difference. In 16th Conference of the European Chapter of the Association for Computational Linguistics (EACL21). K. Gémes, G. Recski: TUW-Inf at GermEval2021: Rule-based and Hybrid Methods for Detecting Toxic, Engaging, and Fact-Claiming Comments. In Proceedings of the GermEval 2021 Workshop on the Identification of Toxic, Engaging, and Fact-Claiming Comments. R. Csáky, G. Recski: The Gutenberg Dialogue Dataset. In Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: Volume 1, Long Papers. A. Kornai: Vocabulary: common or basic?. D. Lévai, V. Kulcsár: emPhon: Morphologically sensitive open-source phonetic transcriber. In XVII. Magyar Számítógépes Nyelvészeti Konferencia.
2020
2019
P. Ihász: A supplementary feature set for sentiment analysis in Japanese dialogues. In Transactions on Asian and Low-Resource Language Information Processing. A. Kornai: Az ellenforradalmár. In Nyelv, biológia, szabadság. A 90 éves Chomsky jelentősége a tudományban és azon túl. Á. Kovács, E. Ács, J. Ács, A. Kornai, G. Recski: BME-UW at SR'19: Surface realization with Interpreted Regular Tree Grammars. In Proceedings of the 2nd Workshop on Multilingual Surface Rea lisation (MSR), 2019 Conference on Empirical Methods in Natural Language Processing (EMNLP). J. Ács, D. Nemeskey, G. Recski: Building word embeddings from dictionary definitions. In K + K = 120 Papers dedicated to László Kálmán and András Kornaion the occasion of their 60th birthdays. K. Gémes: Deep learning of graph transformations. In MSc Thesis. P. Ihász: Emotion Recognition through Intentional Context. In International Journal of Affective Engineering. E. Ács, G. Recski: Generating IRTG grammars from parallel data. In Proceedings of the Automation and Applied Computer Science Workshop 2019 : AACS'19. R. Csáky, P. Purgai, G. Recski: Improving Neural Conversational Models with Entropy-Based Data Filtering. In Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics (ACL). B. Döbrössy, M. Makrai, B. Tarján, G. Szaszák: Investigating sub-word embedding strategies for the morphologically rich and free phrase-order Hungarian. In Proc Repl4NLP. K. Gémes, Á. Kovács, G. Recski: Machine comprehension using semantic graphs. In Proceedings of the Automation and Applied Computer Science Workshop 2019 : AACS'19. B. Indig, B. Sass, E. Simon, I. Mittelholcz, N. Vadász, M. Makrai: One format to rule them all – The emtsv pipeline for Hungarian. In Proc The 13th Linguistic Annotation Workshop. E. Ács, Á. Holló-Szabó, G. Recski: Parsing noun phrases with Interpreted Regular Tree Grammars. In XV. Magyar Számítógépes Nyelvészeti Konferencia (MSZNY 2019). R. Csáky: Proposal Towards a Personalized Knowledge-powered Self-play Based Ensemble Dialog System. In arXiv. A. Kornai: Semantics. G. Borbély, A. Kornai: Sentence Length. In Proceedings of the 16th Meeting on the Mathematics of Language. D. Lévai, A. Kornai: The impact of inflection on word vectors. In MSZNY 2019. M. Castro-Bleda, E. Iklódi, G. Recski, G. Borbély: Towards a Universal Semantic Dictionary. In Applied Sciences 9(19). A. Kornai: Truth or dare. In Tokens of Meaning: Papers in Honor of Lauri Karttunen.
2018
G. Berend, M. Makrai, P. Földiák: 300-sparsans at SemEval-2018 Task 9: Hypernymy as interaction of sparse attributes. In SemEval. M. Makrai, B. Sass: A szöveg mint skálafüggetlen hálózat. In XIV. Magyar Számítógépes Nyelvészeti Konferencia. J. Ács: BME-HAS System for CoNLL--SIGMORPHON 2018 Shared Task: Universal Morphological Reinflection. In Proceedings of the SIGNLL Conference on Computational Natural Language Learning. G. Recski: Building concept definitions from explanatory dictionaries. In International Journal of Lexicography 31/3. D. Nemeskey, A. Kornai: Emergency Vocabulary. In Information Systems Frontiers (20/5). P. Ihász: Emotions and intentions mediated with dialogue acts. In Proceedings of 2018 5th International Conference on Business and Industrial Research (ICBIR). E. Ács, G. Recski: Evaluation of Universal Dependency parsers for Hungarian. In XIV. Magyar Számítógépes Nyelvészeti Konferencia (MSZNY 2018). J. Ács, G. Borbély, M. Makrai, D. Nemeskey, G. Recski, A. Kornai: Hibrid nyelvtechnológiák. In Magyar Tudomány 2018/6. G. Németh, J. Ács: Hyphenation using deep neural networks. In XIV. Magyar Számítógépes Nyelvészeti Konferencia. Á. Kovács, G. Recski: Knowledge base population using natural language inference. In Proceedings of Automation and Applied Computer Science Workshop. E. Ács, G. Recski: Semantic parsing with Interpreted Regular Tree Grammars. In Proceedings of the Automation and Applied Computer Science Workshop 2018 : AACS'18. K. Gémes, Á. Kovács: Semantic parsing with graph transformations. In Scientific Student's Assosiactions Report. Á. Kovács: Semantic parsing with graph transformations MSc Thesis. In Thesis. A. Kornai: Szemantika. B. Kemény, G. Recski: Természetes nyelvi interfész menetrend- és utazástervező szolgáltatásokhoz. In XIV. Magyar Számítógépes Nyelvészeti Konferencia (MSZNY 2018). D. Lévai: The impact of inflection on word vectors, thesis. E. Ács: Universal Dependency parsing of English and Hungarian – an application for semantic parsing, MSc thesis.
2017
A. Kornai, I. Szekrényes: Az e-magyar beszédarchívum. In XIII. Magyar Számítógépes Nyelvészeti Konferencia. J. Ács, D. Nemeskey, G. Recski: Building word embeddings from dictionary definitions. In Papers dedicated to László Kálmán and András Kornai on the occasion of their 60th birthdays. J. Ács, G. Velkey: Comparing word segmentation algorithms. In Proceedings of the Automation and Applied Computer Science Workshop 2017 (AACS'17).. R. Csáky, G. Recski: Deep Learning Based Chatbot Models. In National Scientific Students' Associations Conference. J. Ács, K. Pajkossy, A. Kornai: Digital vitality of Uralic languages. In Acta Linguistica 64/3. M. Makrai, V. Lipp: Do multi-sense word embeddings learn more senses?. In K + K = 120. D. Huszti, J. Ács: Entitásorientált véleménykinyerés magyar nyelven. In XIII. Magyar Számítógépes Nyelvészeti Konferencia. J. Ács, D. Nemeskey, A. Kornai: Identification of Disaster-implicated Named Entities. In Proceedings of the First International Workshop on Exploitation of Social Media for Emergency Relief and Preparedness. B. Gyuris, K. Mády, G. Recski: K + K = 120. Papers dedicated to László Kálmán and András Kornai on the occasion of their 60th birthdays. G. Borbély: Language modeling with matrix embeddings. In K + K = 120. Papers dedicated to László Kálmán and András Kornai on the occasion of their 60th birthdays. D. Nemeskey: emLam – a Hungarian Language Modeling baseline. In XIII. Magyar Számítógépes Nyelvészeti Konferencia (MSZNY 2017).
2016
G. Recski: Building concept graphs from monolingual dictionary entries. In Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC 2016). G. Recski, A. Bolevácz, G. Borbély: Building definition graphs using monolingual dictionaries of Hungarian. In XII. Magyar Számítógépes Nyelvészeti Konferencia (MSZNY 2016). G. Recski: Computational methods in semantics. G. Borbély, A. Kornai, M. Kracht, D. Nemeskey: Denoising composition in distributional semantics. In 28th European Summer School in Logic, Language and Information. A. Kornai, D. Nemeskey, G. Recski: Detecting Optional Arguments of Verbs. In Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC 2016). J. Ács, A. Kornai: Evaluating embeddings on dictionary-based similarity. In RepEval @ ACL 2016. G. Borbély, A. Kornai, M. Makrai, D. Nemeskey: Evaluating multi-sense embeddings for semantic resolution monolingually and in word translation. In repeval. M. Makrai: Filtering Wiktionary triangles by linear mapping between distributed word models. In Proceedings of 10th Edition of the Language Resources and Evaluation Conference. J. Ács, J. Halmi: Hunaccent: Small Footprint Diacritic Restoration for Social Media. In Normalisation and Analysis of Social Media Texts (NormSoMe) Workshop, LREC16. G. Recski, E. Iklódi, K. Pajkossy, A. Kornai: Measuring semantic similarity of words using concept networks. In Proceedings of the 1st Workshop on Representation Learning for NLP. K. Pajkossy, A. Zséder: The hunvec framework for NN-CRF-based sequential tagging. In Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC 2016).
2015
2014
A. Kornai: Bounding the impact of AGI. In Journal of Experimental and Theoretical Artificial Intelligence. M. Makrai: Causality in vectors space language models. In Spring Wind. A. Kornai: Corpus-based Population of a Mid-level Business Ontology. In Proc MSZNY X. M. Makrai: Deep cases in the 4lang concept lexicon [Mélyesetek a 4lang fogalmi szótárban]. In X. Magyar Számítógépes Nyelvészeti Konferencia (MSZNY 2014). A. Kornai: Euclidean Automata. In Implementing Selves with Safe Motivational Systems and Self-Improvement. A. Kornai: Finite automata with continuous input. In Short Papers from the Sixth Workshop on Non-Classical Models of Automata and Applications. G. Recski: Hungarian Noun Phrase Extraction Using Rule-based and Hybrid Methods. In Acta Cybernetica. A. Kornai, P. Bhattacharyya: Indian Subcontinent Language Vitalization. In Proc. 2014 LREC Workshop on Indian Language Data: Resources and Evaluation (WILDRE2). M. Makrai: Mélyesetek a 4lang fogalmi szótárban. In X. Magyar Számítógépes Nyelvészeti Konferencia (MSZNY 2014). J. Ács: Pivot-based multilingual dictionary building using Wiktionary. In The 9th edition of the Language Resources and Evaluation Conference. A. Kornai: Resolving the infinitude controversy. In Journal of Logic Language and Information. M. Makrai: Vector space language models for psycholinguistic analysis. In Corpus resources for quantitative and psycholinguistic analysis. D. Nemeskey: Why Implementation Matters: Evaluation of an Open-source Constraint Grammar Parser. In COLING 2014.
2013
A. Kornai, M. Makrai: A 4lang fogalmi szótár. In IX. Magyar Számitógépes Nyelvészeti Konferencia. M. Makrai, D. Nemeskey, A. Kornai: Applicative structure in vector space models. In Proceedings of the Workshop on Continuous Vector Space Models and their Compositionality. E. Simon: Approaches to Hungarian Named Entity Recognition. J. Ács, K. Pajkossy, A. Kornai: Building basic vocabulary across 40 languages. In Proceedings of the Sixth Workshop on Building and Using Comparable Corpora. A. Kornai: Digital language death. In PloS one. G. Recski: Egy általános célú morfológiai annotáció kiterjesztése [Extending a general-purpose morphological annotation system]. In VII. Alkalmazott Nyelvészeti Doktoranduszkonferencia. M. Makrai: Fogalmak fontossága a definíciós gráf vizsgálatával [Importance of concepts based on the analysis of the definition graph]. In VII. Alkalmazott Nyelvészeti Doktoranduszkonferencia. G. Recski, A. Rung: Identifying Epenthetic Nouns using Maximum Entropy Classification. In VLLXX: Papers Presented to László Varga on his 70th Birthday. J. Ács: Intelligent multilingual dictionary building. In MSc Thesis, Budapest University of Technology and Economics. D. Nemeskey, G. Recski, M. Makrai, A. Zséder, A. Kornai: Spreading activation in language understanding. In Proc. CSIT 2013. A. Kornai, A. Zséder, G. Recski: Structure Learning in Weighted Languages. In Proceedings of the 13th Meeting on the Mathematics of Language (MoL 13). K. Pajkossy: Studying feature selection methods applied to classification tasks in natural language processing. A. Kornai, G. Penn, J. Rogers, A. Yli-Jyrä: The mathematics of language learning. In Proceedings of the Conference 51st Annual Meeting of the Association for Computational Linguistics: Companion Volume.
2012
2011
2010
2009
2008
2007
2006
2005
2004
2003
2024
2023
2022
2021
BME Submission for SIGMORPHON 2021 Shared Task 0. A Three Step Training Approach with Data Augmentation for Morphological Inflection. In Proceedings of the 18th SIGMORPHON Workshop on Computational Research in Phonetics, Phonology, and Morphology. DreamDrug - A crowdsourced NER dataset for detecting drugs in darknet markets. In Proceedings of the Seventh Workshop on Noisy User-generated Text (W-NUT 2021). Evaluating Contextualized Language Models for Hungarian. In XVII. Magyar SzámítógépesNyelvészeti Konferencia. Evaluating Transferability of BERT Models on Uralic Languages. In Seventh International Workshop for Computational Linguistics of Uralic Languages (IWCLUL 2021). Explainable Rule Extraction via Semantic Graphs. In Proceedings of the Fifth Workshop on Automated Semantic Analysis of Information in Legal Text (ASAIL 2021). Introducing huBERT. In XVII. Magyar Számítógépes Nyelvészeti Konferencia. Offensive text detection on English Twitter with deep learning models and rule-based systems. In FIRE 2021: Forum for Information Retrieval Evaluation. SIGMORPHON 2021 Shared Task on Morphological Reinflection: Generalization Across Languages. In Proceedings of the 18th SIGMORPHON Workshop on Computational Research in Phonetics, Phonology, and Morphology. Subword Pooling Makes a Difference. In 16th Conference of the European Chapter of the Association for Computational Linguistics (EACL21). TUW-Inf at GermEval2021: Rule-based and Hybrid Methods for Detecting Toxic, Engaging, and Fact-Claiming Comments. In Proceedings of the GermEval 2021 Workshop on the Identification of Toxic, Engaging, and Fact-Claiming Comments. The Gutenberg Dialogue Dataset. In Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: Volume 1, Long Papers. Vocabulary: common or basic?. emPhon: Morphologically sensitive open-source phonetic transcriber. In XVII. Magyar Számítógépes Nyelvészeti Konferencia.
2020
2019
A supplementary feature set for sentiment analysis in Japanese dialogues. In Transactions on Asian and Low-Resource Language Information Processing. Az ellenforradalmár. In Nyelv, biológia, szabadság. A 90 éves Chomsky jelentősége a tudományban és azon túl. BME-UW at SR'19: Surface realization with Interpreted Regular Tree Grammars. In Proceedings of the 2nd Workshop on Multilingual Surface Rea lisation (MSR), 2019 Conference on Empirical Methods in Natural Language Processing (EMNLP). Building word embeddings from dictionary definitions. In K + K = 120 Papers dedicated to László Kálmán and András Kornaion the occasion of their 60th birthdays. Deep learning of graph transformations. In MSc Thesis. Emotion Recognition through Intentional Context. In International Journal of Affective Engineering. Generating IRTG grammars from parallel data. In Proceedings of the Automation and Applied Computer Science Workshop 2019 : AACS'19. Improving Neural Conversational Models with Entropy-Based Data Filtering. In Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics (ACL). Investigating sub-word embedding strategies for the morphologically rich and free phrase-order Hungarian. In Proc Repl4NLP. Machine comprehension using semantic graphs. In Proceedings of the Automation and Applied Computer Science Workshop 2019 : AACS'19. One format to rule them all – The emtsv pipeline for Hungarian. In Proc The 13th Linguistic Annotation Workshop. Parsing noun phrases with Interpreted Regular Tree Grammars. In XV. Magyar Számítógépes Nyelvészeti Konferencia (MSZNY 2019). Proposal Towards a Personalized Knowledge-powered Self-play Based Ensemble Dialog System. In arXiv. Semantics. Sentence Length. In Proceedings of the 16th Meeting on the Mathematics of Language. The impact of inflection on word vectors. In MSZNY 2019. Towards a Universal Semantic Dictionary. In Applied Sciences 9(19). Truth or dare. In Tokens of Meaning: Papers in Honor of Lauri Karttunen.
2018
300-sparsans at SemEval-2018 Task 9: Hypernymy as interaction of sparse attributes. In SemEval. A szöveg mint skálafüggetlen hálózat. In XIV. Magyar Számítógépes Nyelvészeti Konferencia. BME-HAS System for CoNLL--SIGMORPHON 2018 Shared Task: Universal Morphological Reinflection. In Proceedings of the SIGNLL Conference on Computational Natural Language Learning. Building concept definitions from explanatory dictionaries. In International Journal of Lexicography 31/3. Emergency Vocabulary. In Information Systems Frontiers (20/5). Emotions and intentions mediated with dialogue acts. In Proceedings of 2018 5th International Conference on Business and Industrial Research (ICBIR). Evaluation of Universal Dependency parsers for Hungarian. In XIV. Magyar Számítógépes Nyelvészeti Konferencia (MSZNY 2018). Hibrid nyelvtechnológiák. In Magyar Tudomány 2018/6. Hyphenation using deep neural networks. In XIV. Magyar Számítógépes Nyelvészeti Konferencia. Knowledge base population using natural language inference. In Proceedings of Automation and Applied Computer Science Workshop. Semantic parsing with Interpreted Regular Tree Grammars. In Proceedings of the Automation and Applied Computer Science Workshop 2018 : AACS'18. Semantic parsing with graph transformations. In Scientific Student's Assosiactions Report. Semantic parsing with graph transformations MSc Thesis. In Thesis. Szemantika. Természetes nyelvi interfész menetrend- és utazástervező szolgáltatásokhoz. In XIV. Magyar Számítógépes Nyelvészeti Konferencia (MSZNY 2018). The impact of inflection on word vectors, thesis. Universal Dependency parsing of English and Hungarian – an application for semantic parsing, MSc thesis.
2017
2016
2015
2014
2013
2012
2011
2010
2009
2008
2007
2006
2005
2004
2003