Difference between revisions of "Parsing/home"
Line 6:
  = Preprocessing Tools =
− * [http://wiki.nlpl.eu/index.php/Parsing/
+ * [http://wiki.nlpl.eu/index.php/Parsing/udpipe UDPipe]
  Additionally, a variety of tools for sentence splitting, tokenization, lemmatization, et al.
Line 17:
  * [http://wiki.nlpl.eu/index.php/Parsing/uuparser The Uppsala Parser]
  * [http://wiki.nlpl.eu/index.php/Parsing/udpipe UDPipe]
− * [http://wiki.nlpl.eu/index.php/Parsing/
+ * [http://wiki.nlpl.eu/index.php/Parsing/turboparser TurboParser]
  = Training and Evaluation Data =
Revision as of 10:18, 14 January 2020
Background
An experimentation environment for data-driven dependency parsing is maintained for NLPL under the coordination of Uppsala University (UU). As a first step, the software and data are being commissioned on the Norwegian Abel supercluster.
Preprocessing Tools
Additionally, a variety of tools for sentence splitting, tokenization, lemmatization, etc. are available through the NLPL installations of the Natural Language Toolkit (NLTK) and spaCy.
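As a rough illustration of the kind of preprocessing these toolkits provide, the following minimal sketch performs sentence splitting and tokenization with NLTK. It assumes only that the NLTK package can be imported in the active Python environment; how to activate the NLPL installation on the cluster is not covered here. The helper name `split_and_tokenize` is hypothetical and chosen for illustration. Lemmatization is left out because NLTK's lemmatizer needs additional data (WordNet), which may or may not be present in a given installation.

```python
# Sketch only: assumes the NLTK package is importable in the active environment.
from nltk.tokenize import PunktSentenceTokenizer, TreebankWordTokenizer


def split_and_tokenize(text):
    """Return one list of tokens per sentence in `text` (hypothetical helper)."""
    # An untrained Punkt splitter uses default parameters and needs no data
    # download; it knows no abbreviations, so it may over-split "Dr. Smith".
    sentence_splitter = PunktSentenceTokenizer()
    # Rule-based Penn Treebank tokenizer; it also works without extra data.
    word_tokenizer = TreebankWordTokenizer()
    return [word_tokenizer.tokenize(s) for s in sentence_splitter.tokenize(text)]


if __name__ == "__main__":
    for tokens in split_and_tokenize("NLPL hosts parsers. It also hosts data."):
        print(tokens)
```

For real corpora, a pretrained sentence model (or a spaCy pipeline) would normally be a better choice than the untrained splitter used here, which was picked only so that the sketch runs without downloading any model data.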