Difference between revisions of "Parsing/sdp"
(Created page with "= Background = The ''Semantic Dependency Parsing'' (SDP) initiative seeks to aid the development of data-driven parsing systems whose outputs are general graphs (i.e. transce...") |
(→Available Data) |
||
Line 7: | Line 7: | ||
= Available Data = | = Available Data = | ||
+ | |||
+ | As of late 2018, there have been three public releases of the | ||
+ | SDP graphs. | ||
+ | (1) The full SDP collection includes annotations that are | ||
+ | derivative of the venerable Penn Treebank (PTB) and the Penn | ||
+ | Chinese Treebank (CTB) and, thus, is distributed via the | ||
+ | Linguistic Data Consortium as | ||
+ | [https://catalog.ldc.upenn.edu/LDC2016T10 LDC2016T10]. | ||
+ | This version is installed as | ||
+ | <code>.../data/parsing/sdp/2016T10/</code> and must only | ||
+ | be used by NLPL users whose home institution holds LDC | ||
+ | licenses to the PTB and CTB (which likely is true for | ||
+ | most NLPL partner sites). | ||
+ | |||
+ | Subsets of the SDP graphs that are free of historic | ||
+ | licensing constraints have been released as | ||
+ | ''Open SDP'' versions | ||
+ | (2) [http://hdl.handle.net/11234/1-1742 1.1] and | ||
+ | (3) [http://hdl.handle.net/11234/1-1956 1.2]. | ||
+ | These fully open-source releases include the | ||
+ | English SDP graphs in the DELPH-IN MRS Dependencies (DM) | ||
+ | format and the Czech graphs in the Prague Semantic | ||
+ | Dependencies (PSD) format. | ||
+ | They are installed as | ||
+ | <code>.../data/parsing/11/</code> and | ||
+ | <code>.../data/parsing/12/</code>, respectively. |
Revision as of 20:22, 2 January 2019
Background
The Semantic Dependency Parsing (SDP) initiative seeks to aid the development of data-driven parsing systems whose outputs are general graphs (i.e. transcend rooted trees).
Available Data
As of late 2018, there have been three public releases of the
SDP graphs.
(1) The full SDP collection includes annotations that are
derivative of the venerable Penn Treebank (PTB) and the Penn
Chinese Treebank (CTB) and, thus, is distributed via the
Linguistic Data Consortium as
LDC2016T10.
This version is installed as
.../data/parsing/sdp/2016T10/
and must only
be used by NLPL users whose home institution holds LDC
licenses to the PTB and CTB (which likely is true for
most NLPL partner sites).
Subsets of the SDP graphs that are free of historic
licensing constraints have been released as
Open SDP versions
(2) 1.1 and
(3) 1.2.
These fully open-source releases include the
English SDP graphs in the DELPH-IN MRS Dependencies (DM)
format and the Czech graphs in the Prague Semantic
Dependencies (PSD) format.
They are installed as
.../data/parsing/11/
and
.../data/parsing/12/
, respectively.