LEADER |
00000cmm a2200000 a 4500 |
001 |
12739739 |
008 |
220317s2021 pau q m eng d |
005 |
20220714150006.3 |
040 |
|
|
|a AZU
|b eng
|c AZU
|d OCLCQ
|d OCLCO
|d CGU
|
020 |
|
|
|a 1585639559
|
020 |
|
|
|a 9781585639557
|
035 |
|
|
|a (OCoLC)1304031561
|
050 |
|
4 |
|a PE1422
|b .P452 2021
|
245 |
0 |
0 |
|a Penn Discourse Treebank Version 2.0 - German Translation
|
260 |
|
|
|a [Philadelphia, Pa.] :
|b Linguistic Data Consortium,
|c ©2021.
|
300 |
|
|
|a 1 CD-ROM ;
|c 4 3/4 in.
|
500 |
|
|
|a Title from disc label.
|
500 |
|
|
|a "LDC2021T05."
|
500 |
|
|
|a Author(s): Henny Sluyter-Gaethje, Peter Bourgonje, Manfred Stede
|
500 |
|
|
|a February 15, 2021
|
520 |
|
|
|a Penn Discourse Treebank Version 2.0 - German Translation was developed at the University of Potsdam's Applied Computational Linguistics group and consists of approximately one million tokens derived from Penn Discourse Treebank Version 2.0 (LDC2008T05). This data was translated into German and annotated for shallow discourse relations in the financial news domain. The aim of the Penn Discourse Treebank (PDTB) project is to annotate the Wall Street Journal text in Treebank-2 with discourse relations. PDTB2-German is based on a subset of PDTB2.0 used in the 2016 CoNLL Shared Task on Multilingual Shallow Discourse Parsing.
|
500 |
|
|
|a Data is in CoNLL format. Text was automatically translated into German with DeepL, and projections of the annotations using word alignments were produced with GIZA++. See the included documentation for more information on the relation annotations. Source text and CoNLL format annotations are each presented in their own tab-separated plain text file, encoded in UTF-8.
|
650 |
|
0 |
|a Discourse analysis
|v Databases.
|
650 |
|
0 |
|a Computational linguistics
|v Databases.
|
650 |
|
0 |
|a English language
|x Data processing
|v Databases.
|
650 |
|
7 |
|a Computational linguistics.
|2 fast
|0 (OCoLC)fst00871998
|
650 |
|
7 |
|a Discourse analysis.
|2 fast
|0 (OCoLC)fst00894932
|
650 |
|
7 |
|a English language
|x Data processing.
|2 fast
|0 (OCoLC)fst00911073
|
655 |
|
7 |
|a Databases.
|2 fast
|0 (OCoLC)fst01411643
|
655 |
|
7 |
|a Databases.
|2 lcgft
|
700 |
1 |
|
|a Sluyter-Gaethje, Henny
|
700 |
1 |
|
|a Bourgonje, Peter
|
700 |
1 |
|
|a Stede, Manfred
|
710 |
2 |
|
|a Linguistic Data Consortium.
|
929 |
|
|
|a cat
|
999 |
f |
f |
|s 19a4f8e8-8f6e-4794-92af-3011a7a0f1cc
|i 861d627b-337d-4ea3-b2f4-7b21daf46f98
|
928 |
|
|
|t Library of Congress classification
|a PE1422.P452 2021
|p CDRom
|l ASR
|c ASR-JRLASR
|i 12876626
|
927 |
|
|
|t Library of Congress classification
|a PE1422.P452 2021
|p CDRom
|l ASR
|c ASR-JRLASR
|b 115529468
|i 10401619
|