This page is outdated and kept for documentation purposes only! It reflects the status of the translation activity mid-2019, before the launch of Puhti and Saga.
This page describes resources previously installed on the Abel cluster. For unlinked resources, see pages for currently available software/data on the main parsing page.
Additionally, a variety of tools for sentence splitting, tokenization, lemmatization, et al. are available through the NLPL installations of the Natural Language Processing Toolkit (NLTK) and the spaCy: Natural Language Processing in Python tools.
- Stanford Graph-Based Parser by Tim Dozat
- Universal Dependencies treebanks
- Semantic Dependency parsing