|
|
|
|
LEADER |
00000cam a2200000Mu 4500 |
001 |
12873857 |
006 |
m o d |
007 |
cr |n|---||||| |
008 |
210116s2020 xr o 000 0 eng d |
005 |
20240708192011.0 |
019 |
|
|
|a 1232480437
|
020 |
|
|
|a 9788024647654
|
020 |
|
|
|a 8024647656
|
020 |
|
|
|z 9788024647593
|
035 |
|
|
|a (OCoLC)1231608795
|z (OCoLC)1232480437
|
035 |
|
9 |
|a (OCLCCM-CC)1231608795
|
040 |
|
|
|a EBLCP
|b eng
|e pn
|c EBLCP
|d N$T
|d OCLCO
|d EBLCP
|d OCLCF
|
049 |
|
|
|a MAIN
|
050 |
|
4 |
|a P128.C68
|
100 |
1 |
|
|a Rosen, Alexandr.
|
245 |
1 |
0 |
|a Compiling and Annotating a Learner Corpus for a Morphologically Rich Language
|
260 |
|
|
|a Prague :
|b Karolinum Press,
|c 2020.
|
300 |
|
|
|a 1 online resource (281 pages)
|
336 |
|
|
|a text
|b txt
|2 rdacontent
|
337 |
|
|
|a computer
|b c
|2 rdamedia
|
338 |
|
|
|a online resource
|b cr
|2 rdacarrier
|
588 |
0 |
|
|a Print version record.
|
505 |
0 |
|
|a Cover -- Contents -- List of abbreviations -- Introduction -- About this book -- Reasons to study non-native Czech -- Some properties of non-native Czech -- Morphology -- Syntax -- Word segmentation -- Learner corpus -- Roadmap -- Learner corpora -- Terminology -- Various types of learner corpora -- The choice of texts -- Annotation -- Textual annotation -- Linguistic annotation -- Error annotation -- correction -- Error annotation -- categorization -- Annotation scheme -- Data access -- Some learner corpora -- ASK -- CLC -- COPLE2 -- CroLTeC -- Falko -- ICLE -- MERLIN -- RLC -- SweLL
|
505 |
8 |
|
|a Relationships of CzeSL with other learner corpora -- Introducing the CzeSL project -- Specifications of CzeSL -- Intended usage -- AKCES -- the umbrella project -- Procurement of texts -- Text collection -- Transcription -- Anonymization -- Metadata -- Error annotation -- Errors and learner language -- More than one way to annotate errors in CzeSL -- A wishlist for error annotation -- Interference and other types of explanation -- Interpretation in terms of TH -- Word order -- Style -- Communication goal -- The two-tier annotation scheme -- Annotation scheme as a compromise -- Why multiple tiers
|
505 |
8 |
|
|a How many tiers -- Multiple tiers in a tabular format -- Content of the tiers -- A sample text with T1 vs. T2 corrections -- Links between tiers -- Error tags -- Morphosyntactic references -- Follow-up corrections -- Alternative target hypotheses -- Error tagset -- Based on linguistic categories -- Grammar-based vs. formal errors -- Extent of the annotated unit -- Grammar-based tags -- Errors at T1 -- Errors at T2 -- Coarse-grained -- An example of complex annotation -- Evaluation of the manual tiered error annotation -- Inter-annotator agreement (IAA) -- A pilot annotation
|
505 |
8 |
|
|a IAA on all doubly-annotated texts -- Error tags depend on target hypothesis -- Possible causes of the annotators' disagreements -- Formal tags -- Automatic extension and modification of error annotation -- Automatic detection of formal errors on T1 -- Formal orthographic errors -- Formal errors sometimes influencing pronunciation -- Formal errors influencing pronunciation -- Other types of errors -- Automatic classification of word-boundary errors -- Implicit error annotation -- Multi-dimensional error annotation (MD) -- Focus on morphology -- All annotation applied to the source text
|
505 |
8 |
|
|a Extent of the annotated unit -- Alternative error domains -- Source text, target hypothesis, annotated strings -- Domains and features -- Linguistic annotation -- Annotation with tools for Standard Czech -- Annotation of target hypothesis -- Annotation of T1 -- Annotation of source texts -- Annotation of interlanguage in UD -- Tokenization -- Part-of-speech and morphology -- Lemmata -- Syntactic Structure -- Evaluation -- Annotation process -- Overview of the annotation process -- Transcription and anonymization of manuscripts -- Tiered error annotation -- Manual error annotation
|
500 |
|
|
|a Automatic annotation checking.
|
650 |
|
0 |
|a Corpora (Linguistics)
|0 http://id.loc.gov/authorities/subjects/sh2006006393
|
650 |
|
0 |
|a Czech language.
|0 http://id.loc.gov/authorities/subjects/sh85035271
|
650 |
|
7 |
|a Corpora (Linguistics)
|2 fast
|0 (OCoLC)fst01740921
|
650 |
|
7 |
|a Czech language.
|2 fast
|0 (OCoLC)fst00886348
|
655 |
|
0 |
|a Electronic books.
|
655 |
|
4 |
|a Electronic books.
|
700 |
1 |
|
|a Hana, Jiří.
|
700 |
1 |
|
|a Vidová Hladká, Barbora.
|
776 |
0 |
8 |
|i Print version:
|a Rosen, Alexandr.
|t Compiling and Annotating a Learner Corpus for a Morphologically Rich Language.
|d Prague : Karolinum Press, ©2020
|z 9788024647593
|
929 |
|
|
|a oclccm
|
999 |
f |
f |
|s 6d780453-f714-48ad-aeb3-7a464144f747
|i fe6588aa-da57-48ac-b949-1cc14426606c
|
928 |
|
|
|t Library of Congress classification
|a P128.C68
|l Online
|c UC-FullText
|u https://search.ebscohost.com/login.aspx?direct=true&scope=site&db=e000xna&AN=2729804
|z eBooks on EBSCOhost
|g ebooks
|i 13011562
|