Difference between revisions of "Parsing/home"

From Nordic Language Processing Laboratory
Jump to: navigation, search
(Preprocessing Tools)
(Parsing Systems)
 
(7 intermediate revisions by 2 users not shown)
Line 2: Line 2:
  
 
An experimentation environment for data-driven dependency parsing is maintained for NLPL under the coordination of Uppsala University (UU).
 
An experimentation environment for data-driven dependency parsing is maintained for NLPL under the coordination of Uppsala University (UU).
Initially, the software and data are commissioned on the Norwegian Abel supercluster.
+
The data is available on the Norwegian Saga cluster and on the Finnish Puhti cluster.
 +
The software is available on the Norwegian Saga cluster
 +
 
 +
Initially, software and data were commissioned on the Norwegian Abel supercluster, see [http://wiki.nlpl.eu/index.php/Parsing/abel The Abel page] for legacy information.
  
 
= Preprocessing Tools =
 
= Preprocessing Tools =
  
* [http://wiki.nlpl.eu/index.php/Parsing/repp REPP Tokenizer (English and Norwegian)]
+
* [http://wiki.nlpl.eu/index.php/Parsing/udpipe UDPipe]
  
 
Additionally, a variety of tools for sentence splitting, tokenization, lemmatization, et al.
 
Additionally, a variety of tools for sentence splitting, tokenization, lemmatization, et al.
 
are available through the NLPL installations of the
 
are available through the NLPL installations of the
[http://nltk.org Natural Language Processing Toolkit (NLTK)]
+
[http://nltk.org Natural Language Processing Toolkit (NLTK)] and the
[https://en.wikipedia.org/wiki/SpaCy spaCy: Natural Language Processing in Python] tools.
+
[https://spacy.io spaCy: Natural Language Processing in Python] tools.
  
 
= Parsing Systems =
 
= Parsing Systems =
  
 
* [http://wiki.nlpl.eu/index.php/Parsing/uuparser The Uppsala Parser]
 
* [http://wiki.nlpl.eu/index.php/Parsing/uuparser The Uppsala Parser]
* [http://wiki.nlpl.eu/index.php/Parsing/udpipe UDPipe]
+
* [http://wiki.nlpl.eu/index.php/Parsing/udpipe UDPipe]  
* [http://wiki.nlpl.eu/index.php/Parsing/dozat Stanford Graph-Based Parser by Tim Dozat]
+
* [http://wiki.nlpl.eu/index.php/Parsing/turboparser TurboParser]
 +
 
 +
 
 +
Additionallly, parsers are available in several toolkits installed by nlpl: [http://wiki.nlpl.eu/index.php/Parsing/stanfordnlp StanfordNLP], [https://www.nltk.org/ NLTK], [https://spacy.io/ spaCy].
  
 
= Training and Evaluation Data =  
 
= Training and Evaluation Data =  
  
* [http://wiki.nlpl.eu/index.php/Parsing/ud Universal Dependencies v2.0–2.3]
+
* [http://wiki.nlpl.eu/index.php/Parsing/ud Universal Dependencies v2.0–2.5]
 
* [http://wiki.nlpl.eu/index.php/Parsing/sdp Semantic Dependency Parsing]
 
* [http://wiki.nlpl.eu/index.php/Parsing/sdp Semantic Dependency Parsing]

Latest revision as of 07:18, 15 January 2020

Background

An experimentation environment for data-driven dependency parsing is maintained for NLPL under the coordination of Uppsala University (UU). The data is available on the Norwegian Saga cluster and on the Finnish Puhti cluster. The software is available on the Norwegian Saga cluster

Initially, software and data were commissioned on the Norwegian Abel supercluster, see The Abel page for legacy information.

Preprocessing Tools

Additionally, a variety of tools for sentence splitting, tokenization, lemmatization, et al. are available through the NLPL installations of the Natural Language Processing Toolkit (NLTK) and the spaCy: Natural Language Processing in Python tools.

Parsing Systems


Additionallly, parsers are available in several toolkits installed by nlpl: StanfordNLP, NLTK, spaCy.

Training and Evaluation Data