Difference between revisions of "Parsing/home"

From Nordic Language Processing Laboratory
Jump to: navigation, search
(Available Data Sets)
(Parsing Systems)
Line 3: Line 3:
 
An experimentation environment for data-driven dependency parsing is maintained for NLPL under the coordination of Uppsala University (UU).
 
An experimentation environment for data-driven dependency parsing is maintained for NLPL under the coordination of Uppsala University (UU).
 
Initially, the software and data are commissioned on the Norwegian Abel supercluster.
 
Initially, the software and data are commissioned on the Norwegian Abel supercluster.
 +
 +
= Preprocessing Tools =
 +
 +
* [http://wiki.nlpl.eu/index.php/Parsing/repp REPP Tokenizer (English and Norwegian)]
 +
 +
Additionally, a variety of tools for sentence splitting, tokenization, lemmatization, et al.
 +
are available through the NLPL installations of the
 +
[http://nltk.org Natural Language Processing Toolkit (NLTK)]
 +
[http spaCy: Natural Language Processing in Python] tools.
  
 
= Parsing Systems =
 
= Parsing Systems =
Line 8: Line 17:
 
* [http://wiki.nlpl.eu/index.php/Parsing/uuparser The Uppsala Parser]
 
* [http://wiki.nlpl.eu/index.php/Parsing/uuparser The Uppsala Parser]
 
* [http://wiki.nlpl.eu/index.php/Parsing/udpipe UDPipe]
 
* [http://wiki.nlpl.eu/index.php/Parsing/udpipe UDPipe]
 +
* [http://wiki.nlpl.eu/index.php/Parsing/dozat Stanford Graph-Based Parser by Tim Dozat]
  
 
= Training and Evaluation Data =  
 
= Training and Evaluation Data =  

Revision as of 22:34, 7 January 2019

Background

An experimentation environment for data-driven dependency parsing is maintained for NLPL under the coordination of Uppsala University (UU). Initially, the software and data are commissioned on the Norwegian Abel supercluster.

Preprocessing Tools

Additionally, a variety of tools for sentence splitting, tokenization, lemmatization, et al. are available through the NLPL installations of the Natural Language Processing Toolkit (NLTK) [http spaCy: Natural Language Processing in Python] tools.

Parsing Systems

Training and Evaluation Data