Penn Discourse Treebank Version 2.0 - German Translation

Saved in:
Bibliographic Details
Imprint:[Philadelphia, Pa.] : Linguistic Data Consortium, ©2021.
Description:1 CD-ROM ; 4 3/4 in.
Language:English
Subject:
Format: Unknown
URL for this record:http://pi.lib.uchicago.edu/1001/cat/bib/12739739
Hidden Bibliographic Details
Other authors / contributors:Sluyter-Gaethje, Henny
Bourgonje, Peter
Stede, Manfred
Linguistic Data Consortium.
ISBN:1585639559
9781585639557
Notes:Title from disc label.
"LDC2021T05."
Author(s): Henny Sluyter-Gaethje, Peter Bourgonje, Manfred Stede
February 15, 2021
Data is in CoNLL format. Text was automatically translated into German with deepL, and projections of the annotations using word alignments were produced with GIZA++. See the included documentation for more information on the relation annotations. Source text and CoNLL format annotations are each presented in their own tab separated plain text file, encoded in UTF-8
Summary:Penn Discourse Treebank Version 2.0 - German Translation was developed at the University of Potsdam's Applied Computational Linguistics group and consists of approximately one million tokens derived from Penn Discourse Treebank Version 2.0 (LDC2008T05). This data was translated into German and annotated for shallow discourse relations in the financial news domain. The aim of the Penn Discourse Treebank (PDTB) project is to annotate the Wall Street Journal text in Treebank-2 with discourse relations. PDTB2-German is based on a subset of PDTB2.0 used in the 2016 CoNLL Shared Task on Multilingual Shallow Discourse Parsing.

MARC

LEADER 00000cmm a2200000 a 4500
001 12739739
008 220317s2021 pau q m eng d
005 20220714150006.3
040 |a AZU  |b eng  |c AZU  |d OCLCQ  |d OCLCO  |d CGU 
020 |a 1585639559 
020 |a 9781585639557 
035 |a (OCoLC)1304031561 
050 4 |a PE1422  |b .P452 2021 
245 0 0 |a Penn Discourse Treebank Version 2.0 - German Translation 
260 |a [Philadelphia, Pa.] :  |b Linguistic Data Consortium,  |c ©2021. 
300 |a 1 CD-ROM ;  |c 4 3/4 in. 
500 |a Title from disc label. 
500 |a "LDC2021T05." 
500 |a Author(s): Henny Sluyter-Gaethje, Peter Bourgonje, Manfred Stede 
500 |a February 15, 2021 
520 |a Penn Discourse Treebank Version 2.0 - German Translation was developed at the University of Potsdam's Applied Computational Linguistics group and consists of approximately one million tokens derived from Penn Discourse Treebank Version 2.0 (LDC2008T05). This data was translated into German and annotated for shallow discourse relations in the financial news domain. The aim of the Penn Discourse Treebank (PDTB) project is to annotate the Wall Street Journal text in Treebank-2 with discourse relations. PDTB2-German is based on a subset of PDTB2.0 used in the 2016 CoNLL Shared Task on Multilingual Shallow Discourse Parsing. 
500 |a Data is in CoNLL format. Text was automatically translated into German with deepL, and projections of the annotations using word alignments were produced with GIZA++. See the included documentation for more information on the relation annotations. Source text and CoNLL format annotations are each presented in their own tab separated plain text file, encoded in UTF-8 
650 0 |a Discourse analysis  |v Databases. 
650 0 |a Computational linguistics  |v Databases. 
650 0 |a English language  |x Data processing  |v Databases. 
650 7 |a Computational linguistics.  |2 fast  |0 (OCoLC)fst00871998 
650 7 |a Discourse analysis.  |2 fast  |0 (OCoLC)fst00894932 
650 7 |a English language  |x Data processing.  |2 fast  |0 (OCoLC)fst00911073 
655 7 |a Databases.  |2 fast  |0 (OCoLC)fst01411643 
655 7 |a Databases.  |2 lcgft 
700 1 |a Sluyter-Gaethje, Henny 
700 1 |a Bourgonje, Peter 
700 1 |a Stede, Manfred 
710 2 |a Linguistic Data Consortium. 
929 |a cat 
999 f f |s 19a4f8e8-8f6e-4794-92af-3011a7a0f1cc  |i 861d627b-337d-4ea3-b2f4-7b21daf46f98 
928 |t Library of Congress classification  |a PE1422.P452 2021  |p CDRom  |l ASR  |c ASR-JRLASR  |i 12876626 
927 |t Library of Congress classification  |a PE1422.P452 2021  |p CDRom  |l ASR  |c ASR-JRLASR  |b 115529468  |i 10401619