Difference between revisions of "Parsing/sdp"

From Nordic Language Processing Laboratory
Jump to: navigation, search
(Created page with "= Background = The ''Semantic Dependency Parsing'' (SDP) initiative seeks to aid the development of data-driven parsing systems whose outputs are general graphs (i.e. transce...")
 
(Available Data)
Line 7: Line 7:
  
 
= Available Data =
 
= Available Data =
 +
 +
As of late 2018, there have been three public releases of the
 +
SDP graphs.
 +
(1) The full SDP collection includes annotations that are
 +
derivative of the venerable Penn Treebank (PTB) and the Penn
 +
Chinese Treebank (CTB) and, thus, is distributed via the
 +
Linguistic Data Consortium as
 +
[https://catalog.ldc.upenn.edu/LDC2016T10 LDC2016T10].
 +
This version is installed as
 +
<code>.../data/parsing/sdp/2016T10/</code> and must only
 +
be used by NLPL users whose home institution holds LDC
 +
licenses to the PTB and CTB (which likely is true for
 +
most NLPL partner sites).
 +
 +
Subsets of the SDP graphs that are free of historic
 +
licensing constraints have been released as
 +
''Open SDP'' versions
 +
(2) [http://hdl.handle.net/11234/1-1742 1.1] and
 +
(3) [http://hdl.handle.net/11234/1-1956 1.2].
 +
These fully open-source releases include the
 +
English SDP graphs in the DELPH-IN MRS Dependencies (DM)
 +
format and the Czech graphs in the Prague Semantic
 +
Dependencies (PSD) format.
 +
They are installed as
 +
<code>.../data/parsing/11/</code> and
 +
<code>.../data/parsing/12/</code>, respectively.

Revision as of 20:22, 2 January 2019

Background

The Semantic Dependency Parsing (SDP) initiative seeks to aid the development of data-driven parsing systems whose outputs are general graphs (i.e. transcend rooted trees).


Available Data

As of late 2018, there have been three public releases of the SDP graphs. (1) The full SDP collection includes annotations that are derivative of the venerable Penn Treebank (PTB) and the Penn Chinese Treebank (CTB) and, thus, is distributed via the Linguistic Data Consortium as LDC2016T10. This version is installed as .../data/parsing/sdp/2016T10/ and must only be used by NLPL users whose home institution holds LDC licenses to the PTB and CTB (which likely is true for most NLPL partner sites).

Subsets of the SDP graphs that are free of historic licensing constraints have been released as Open SDP versions (2) 1.1 and (3) 1.2. These fully open-source releases include the English SDP graphs in the DELPH-IN MRS Dependencies (DM) format and the Czech graphs in the Prague Semantic Dependencies (PSD) format. They are installed as .../data/parsing/11/ and .../data/parsing/12/, respectively.