At the exhibition of the Artificial Intelligence Coalition, human language understanding was represented by researchers from the Institute of Linguistics of the MTA. Visitors could learn about two technologies: the e-magyar Hungarian text analysis chain and Hungarian word embeddings that represent words in systems based on machine learning (i.e. artificial intelligence). There were three demos for the latter:
- search for similar words, answer analogical questions, and select the odd-man-out with a Hungarian word embedding
- the neighborhood of words, i.e. similar words in the syntactic-semantic space, visualized by Dániel Varga (MTA Rényi)
- a galactic journey in English by Andrei Kashcha.
Machine translation is perhaps the main application of natural language processing, another is the mining of structured information from texts where it is needed in large quantities, eg. measuring consumer/voter satisfaction/opinion. If we want to learn machine learning on texts, then we need features first: vectors provide these. Most of the methods of language technology work for the 100--200 languages whith enough text, but there are differences, for example, in Hungarian, it is worth using a morphological analyzer (stemmer) based on linguistic knowledge because of the many word forms (inflectional and derivational). The most important module of e-Hungarian is a mophological analyser. Vectors model the frequency (probability, "naturalness") of the words in each narrow context, and the same vectors (and more recently the early layers of deep nets) can be used for quite different tasks. The word artificial intelligence has become fashionable around 2012 because the models of connectionism (based on the activations of nodes thought to be neurons and the associations between them, researched since 1974) met the solid methodology of machine learning devloped in the 1990s.
Less than five percent of current world languages are in use online, according to a recent study by prominent linguist András Kornai -- and the Internet may be helping the other 95 percent to their graves.
Hungarian linguists discuss the digital survival of the language. The conference was hosted by the Research Institute for Linguistics of the Hungarian Academy of Sciences.
András Kornai's talk at the Budapest New Tech Meetup
Is Hungarian lost in the digital see and what do we have to do to survive?
Interview in the Orient Radio
The Hungarian online newspaper Origo published an article about the robotic ticket clerk presented by our group at the public event "Researchers' Night"
The institutes of the Hungarian Academy of Sciences are present on the "Night of the Researchers" event
The Hungarian online newspaper Nyest's report on the robotic ticket clerk
Interview with the mathematical linguist András Kornai.
András Kornai's speech in the German Research Center for Artificial Intelligence (DFKI)
An updated version of Terry Winograd's famous SHRDLU robot.
Teaser for the robotic ticket clerk
A short introductory film produced by Mindentudás
In 2011, András Kornai gave a lecture popularizing human language technology on national television.