My name is Jonáš Vidra and I am a postgraduate student at the Institute of Formal and Applied Linguistics, Faculty of Mathematics and Physics, Charles University. If you want to contact me, please e-mail me at my-surname@ufal.mff.cuni.cz
I'm working on a system for automatic discovery of derivational relations in many languages at once, using both monolingual data and cross-lingual information transfer.
My master’s thesis, advised by Zdeněk Žabokrtský, dealt with supervised morphological segmentation of Czech using data from the DeriNet project. The preliminary code is published as a Git repository.
I help develop DeriNet, a lexical network of derivational relations between Czech words. My focus is mostly on the technical side, although I do some linguistic work as well. I develop and maintain the Perl API for building the network (deprecated but still in use) and help develop the new Python API that will replace it. I also develop DeriSearch, a search engine for querying derivational relations. The development version of DeriSearch is available on this very server. A new version capable of visualizing non-tree graphs (i.e. word-formational families with compounding) is hosted at the ÚFAL Quest server.