Difference between revisions of "Parsing/sdp"

From Nordic Language Processing Laboratory
Jump to: navigation, search
(Created page with "= Background = The ''Semantic Dependency Parsing'' (SDP) initiative seeks to aid the development of data-driven parsing systems whose outputs are general graphs (i.e. transce...")
 
(Available Data)
 
(2 intermediate revisions by the same user not shown)
Line 4: Line 4:
 
aid the development of data-driven parsing systems whose outputs
 
aid the development of data-driven parsing systems whose outputs
 
are general graphs (i.e. transcend rooted trees).
 
are general graphs (i.e. transcend rooted trees).
 +
These target representations have become (somewhat) popular
 +
through two semantic dependency parsing competitions as part
 +
of the 2014 and 2015 Semantic Evaluation Exercises (SemEval).
 +
For additional background, please see the
 +
[http://sdp.delph-in.net SDP web site] and the
 +
[http://moin.delph-in.net/EdsTop companion site] for
 +
one of the frameworks represented under the SDP umbrella.
  
 +
= Available Data =
  
= Available Data =
+
As of late 2018, there have been three public releases of the
 +
SDP graphs.
 +
(1) The full SDP collection includes annotations that are
 +
derivative of the venerable Penn Treebank (PTB) and the Penn
 +
Chinese Treebank (CTB) and, thus, is distributed via the
 +
Linguistic Data Consortium as
 +
[https://catalog.ldc.upenn.edu/LDC2016T10 LDC2016T10].
 +
This version is installed as
 +
<code>.../data/parsing/sdp/2016T10/</code> and must only
 +
be used by NLPL users whose home institution holds LDC
 +
licenses to the PTB and CTB (which likely is true for
 +
most NLPL partner sites).
 +
 
 +
Subsets of the SDP graphs that are free of historic
 +
licensing constraints have been released as
 +
''Open SDP'' versions
 +
(2) [http://hdl.handle.net/11234/1-1742 1.1] and
 +
(3) [http://hdl.handle.net/11234/1-1956 1.2].
 +
These fully open-source releases include the
 +
English SDP graphs in the DELPH-IN MRS Dependencies (DM)
 +
format and the Czech graphs in the Prague Semantic
 +
Dependencies (PSD) format.
 +
They are installed as
 +
<code>.../data/parsing/sdp/11/</code> and
 +
<code>.../data/parsing/sdp/12/</code>, respectively.

Latest revision as of 14:50, 8 January 2019

Background

The Semantic Dependency Parsing (SDP) initiative seeks to aid the development of data-driven parsing systems whose outputs are general graphs (i.e. transcend rooted trees). These target representations have become (somewhat) popular through two semantic dependency parsing competitions as part of the 2014 and 2015 Semantic Evaluation Exercises (SemEval). For additional background, please see the SDP web site and the companion site for one of the frameworks represented under the SDP umbrella.

Available Data

As of late 2018, there have been three public releases of the SDP graphs. (1) The full SDP collection includes annotations that are derivative of the venerable Penn Treebank (PTB) and the Penn Chinese Treebank (CTB) and, thus, is distributed via the Linguistic Data Consortium as LDC2016T10. This version is installed as .../data/parsing/sdp/2016T10/ and must only be used by NLPL users whose home institution holds LDC licenses to the PTB and CTB (which likely is true for most NLPL partner sites).

Subsets of the SDP graphs that are free of historic licensing constraints have been released as Open SDP versions (2) 1.1 and (3) 1.2. These fully open-source releases include the English SDP graphs in the DELPH-IN MRS Dependencies (DM) format and the Czech graphs in the Prague Semantic Dependencies (PSD) format. They are installed as .../data/parsing/sdp/11/ and .../data/parsing/sdp/12/, respectively.