Difference between revisions of "Parsing/home"
(→Parsing Systems) |
(→Parsing Systems) |
||
(8 intermediate revisions by 2 users not shown) | |||
Line 2: | Line 2: | ||
An experimentation environment for data-driven dependency parsing is maintained for NLPL under the coordination of Uppsala University (UU). | An experimentation environment for data-driven dependency parsing is maintained for NLPL under the coordination of Uppsala University (UU). | ||
− | Initially, | + | The data is available on the Norwegian Saga cluster and on the Finnish Puhti cluster. |
+ | The software is available on the Norwegian Saga cluster | ||
+ | |||
+ | Initially, software and data were commissioned on the Norwegian Abel supercluster, see [http://wiki.nlpl.eu/index.php/Parsing/abel The Abel page] for legacy information. | ||
= Preprocessing Tools = | = Preprocessing Tools = | ||
− | * [http://wiki.nlpl.eu/index.php/Parsing/ | + | * [http://wiki.nlpl.eu/index.php/Parsing/udpipe UDPipe] |
Additionally, a variety of tools for sentence splitting, tokenization, lemmatization, et al. | Additionally, a variety of tools for sentence splitting, tokenization, lemmatization, et al. | ||
are available through the NLPL installations of the | are available through the NLPL installations of the | ||
− | [http://nltk.org Natural Language Processing Toolkit (NLTK)] | + | [http://nltk.org Natural Language Processing Toolkit (NLTK)] and the |
− | [ | + | [https://spacy.io spaCy: Natural Language Processing in Python] tools. |
= Parsing Systems = | = Parsing Systems = | ||
* [http://wiki.nlpl.eu/index.php/Parsing/uuparser The Uppsala Parser] | * [http://wiki.nlpl.eu/index.php/Parsing/uuparser The Uppsala Parser] | ||
− | * [http://wiki.nlpl.eu/index.php/Parsing/udpipe UDPipe] | + | * [http://wiki.nlpl.eu/index.php/Parsing/udpipe UDPipe] |
− | * [http://wiki.nlpl.eu/index.php/Parsing/ | + | * [http://wiki.nlpl.eu/index.php/Parsing/turboparser TurboParser] |
+ | |||
+ | |||
+ | Additionallly, parsers are available in several toolkits installed by nlpl: [http://wiki.nlpl.eu/index.php/Parsing/stanfordnlp StanfordNLP], [https://www.nltk.org/ NLTK], [https://spacy.io/ spaCy]. | ||
= Training and Evaluation Data = | = Training and Evaluation Data = | ||
− | * [http://wiki.nlpl.eu/index.php/Parsing/ud Universal Dependencies v2.0–2. | + | * [http://wiki.nlpl.eu/index.php/Parsing/ud Universal Dependencies v2.0–2.5] |
* [http://wiki.nlpl.eu/index.php/Parsing/sdp Semantic Dependency Parsing] | * [http://wiki.nlpl.eu/index.php/Parsing/sdp Semantic Dependency Parsing] |
Latest revision as of 07:18, 15 January 2020
Background
An experimentation environment for data-driven dependency parsing is maintained for NLPL under the coordination of Uppsala University (UU). The data is available on the Norwegian Saga cluster and on the Finnish Puhti cluster. The software is available on the Norwegian Saga cluster
Initially, software and data were commissioned on the Norwegian Abel supercluster, see The Abel page for legacy information.
Preprocessing Tools
Additionally, a variety of tools for sentence splitting, tokenization, lemmatization, et al. are available through the NLPL installations of the Natural Language Processing Toolkit (NLTK) and the spaCy: Natural Language Processing in Python tools.
Parsing Systems
Additionallly, parsers are available in several toolkits installed by nlpl: StanfordNLP, NLTK, spaCy.