Difference between revisions of "Parsing/home"

From Nordic Language Processing Laboratory
Jump to: navigation, search
Line 6: Line 6:
 
= Preprocessing Tools =
 
= Preprocessing Tools =
  
* [http://wiki.nlpl.eu/index.php/Parsing/repp REPP Tokenizer (English and Norwegian)]
+
* [http://wiki.nlpl.eu/index.php/Parsing/udpipe UDPipe]
  
 
Additionally, a variety of tools for sentence splitting, tokenization, lemmatization, et al.
 
Additionally, a variety of tools for sentence splitting, tokenization, lemmatization, et al.
Line 17: Line 17:
 
* [http://wiki.nlpl.eu/index.php/Parsing/uuparser The Uppsala Parser]
 
* [http://wiki.nlpl.eu/index.php/Parsing/uuparser The Uppsala Parser]
 
* [http://wiki.nlpl.eu/index.php/Parsing/udpipe UDPipe]
 
* [http://wiki.nlpl.eu/index.php/Parsing/udpipe UDPipe]
* [http://wiki.nlpl.eu/index.php/Parsing/dozat Stanford Graph-Based Parser by Tim Dozat]
+
* [http://wiki.nlpl.eu/index.php/Parsing/turboparser TurboParser]
  
 
= Training and Evaluation Data =  
 
= Training and Evaluation Data =  

Revision as of 10:18, 14 January 2020

Background

An experimentation environment for data-driven dependency parsing is maintained for NLPL under the coordination of Uppsala University (UU). Initially, the software and data are commissioned on the Norwegian Abel supercluster.

Preprocessing Tools

Additionally, a variety of tools for sentence splitting, tokenization, lemmatization, et al. are available through the NLPL installations of the Natural Language Processing Toolkit (NLTK) and the spaCy: Natural Language Processing in Python tools.

Parsing Systems

Training and Evaluation Data