SZTAKI HLT | POTATO: exPlainable infOrmation exTrAcTion framewOrk

POTATO: exPlainable infOrmation exTrAcTion framewOrk

Ádám Kovács, Kinga Gémes, Eszter Iklódi, Gábor Recski
In CIKM '22: Proceedings of the 31st ACM International Conference on Information & Knowledge Management, 2022

Poster (PDF)

Abstract: We present POTATO, a task- and language-independent framework for human-in-the-loop (HITL) learning of rule-based text classifiers using graph-based features. POTATO handles any type of directed graph and supports parsing text into Abstract Meaning Representations (AMR), Universal Dependencies (UD), and 4lang semantic graphs. A web-based user interface allows users to build rule systems from graph patterns, provides real-time evaluation based on ground truth data, and suggests rules by ranking graph features using interpretable machine learning models. Users can also provide patterns over graphs using regular expressions, and POTATO can recommend refinements of such rules. POTATO is applied in projects across domains and languages, including classification tasks on German legal text and English social media data. All components of our system are written in Python, can be installed via pip, and are released under an MIT License on GitHub.

author = {Kov\'{a}cs, \'{A}d\'{a}m and G\'{e}mes, Kinga and Ikl\'{o}di, Eszter and Recski, G\'{a}bor},
title = {{POTATO: ExPlainable InfOrmation ExTrAcTion FramewOrk}},
year = {2022},
isbn = {9781450392365},
publisher = {Association for Computing Machinery},
address = {New York, NY, USA},
url = {},
doi = {10.1145/3511808.3557196},
booktitle = {Proceedings of the 31st ACM International Conference on Information and Knowledge Management},
pages = {4897–4901},
numpages = {5},
location = {Atlanta, GA, USA},
series = {CIKM '22}