Compiling and Annotating a Learner Corpus for a Morphologically Rich Language

Bibliographic Details
Author / Creator: Rosen, Alexandr.
Imprint: Prague : Karolinum Press, 2020.
Description: 1 online resource (281 pages)
Language: English
Subject: Corpora (Linguistics)
Czech language.
Format: E-Resource Book
URL for this record: http://pi.lib.uchicago.edu/1001/cat/bib/12873857
Other authors / contributors: Hana, Jiří.
Vidová Hladká, Barbora.
ISBN: 9788024647654
8024647656
9788024647593
Notes: Automatic annotation checking.
Print version record.
Other form: Print version: Rosen, Alexandr. Compiling and Annotating a Learner Corpus for a Morphologically Rich Language. Prague : Karolinum Press, ©2020 9788024647593

MARC

LEADER 00000cam a2200000Mu 4500
001 12873857
006 m o d
007 cr |n|---|||||
008 210116s2020 xr o 000 0 eng d
005 20240708192011.0
019 |a 1232480437 
020 |a 9788024647654 
020 |a 8024647656 
020 |z 9788024647593 
035 |a (OCoLC)1231608795  |z (OCoLC)1232480437 
035 9 |a (OCLCCM-CC)1231608795 
040 |a EBLCP  |b eng  |e pn  |c EBLCP  |d N$T  |d OCLCO  |d EBLCP  |d OCLCF 
049 |a MAIN 
050 4 |a P128.C68 
100 1 |a Rosen, Alexandr. 
245 1 0 |a Compiling and Annotating a Learner Corpus for a Morphologically Rich Language 
260 |a Prague :  |b Karolinum Press,  |c 2020. 
300 |a 1 online resource (281 pages) 
336 |a text  |b txt  |2 rdacontent 
337 |a computer  |b c  |2 rdamedia 
338 |a online resource  |b cr  |2 rdacarrier 
588 0 |a Print version record. 
505 0 |a Cover -- Contents -- List of abbreviations -- Introduction -- About this book -- Reasons to study non-native Czech -- Some properties of non-native Czech -- Morphology -- Syntax -- Word segmentation -- Learner corpus -- Roadmap -- Learner corpora -- Terminology -- Various types of learner corpora -- The choice of texts -- Annotation -- Textual annotation -- Linguistic annotation -- Error annotation -- correction -- Error annotation -- categorization -- Annotation scheme -- Data access -- Some learner corpora -- ASK -- CLC -- COPLE2 -- CroLTeC -- Falko -- ICLE -- MERLIN -- RLC -- SweLL 
505 8 |a Relationships of CzeSL with other learner corpora -- Introducing the CzeSL project -- Specifications of CzeSL -- Intended usage -- AKCES -- the umbrella project -- Procurement of texts -- Text collection -- Transcription -- Anonymization -- Metadata -- Error annotation -- Errors and learner language -- More than one way to annotate errors in CzeSL -- A wishlist for error annotation -- Interference and other types of explanation -- Interpretation in terms of TH -- Word order -- Style -- Communication goal -- The two-tier annotation scheme -- Annotation scheme as a compromise -- Why multiple tiers 
505 8 |a How many tiers -- Multiple tiers in a tabular format -- Content of the tiers -- A sample text with T1 vs. T2 corrections -- Links between tiers -- Error tags -- Morphosyntactic references -- Follow-up corrections -- Alternative target hypotheses -- Error tagset -- Based on linguistic categories -- Grammar-based vs. formal errors -- Extent of the annotated unit -- Grammar-based tags -- Errors at T1 -- Errors at T2 -- Coarse-grained -- An example of complex annotation -- Evaluation of the manual tiered error annotation -- Inter-annotator agreement (IAA) -- A pilot annotation 
505 8 |a IAA on all doubly-annotated texts -- Error tags depend on target hypothesis -- Possible causes of the annotators' disagreements -- Formal tags -- Automatic extension and modification of error annotation -- Automatic detection of formal errors on T1 -- Formal orthographic errors -- Formal errors sometimes influencing pronunciation -- Formal errors influencing pronunciation -- Other types of errors -- Automatic classification of word-boundary errors -- Implicit error annotation -- Multi-dimensional error annotation (MD) -- Focus on morphology -- All annotation applied to the source text 
505 8 |a Extent of the annotated unit -- Alternative error domains -- Source text, target hypothesis, annotated strings -- Domains and features -- Linguistic annotation -- Annotation with tools for Standard Czech -- Annotation of target hypothesis -- Annotation of T1 -- Annotation of source texts -- Annotation of interlanguage in UD -- Tokenization -- Part-of-speech and morphology -- Lemmata -- Syntactic Structure -- Evaluation -- Annotation process -- Overview of the annotation process -- Transcription and anonymization of manuscripts -- Tiered error annotation -- Manual error annotation 
500 |a Automatic annotation checking. 
650 0 |a Corpora (Linguistics)  |0 http://id.loc.gov/authorities/subjects/sh2006006393 
650 0 |a Czech language.  |0 http://id.loc.gov/authorities/subjects/sh85035271 
650 7 |a Corpora (Linguistics)  |2 fast  |0 (OCoLC)fst01740921 
650 7 |a Czech language.  |2 fast  |0 (OCoLC)fst00886348 
655 0 |a Electronic books. 
655 4 |a Electronic books. 
700 1 |a Hana, Jiří. 
700 1 |a Vidová Hladká, Barbora. 
776 0 8 |i Print version:  |a Rosen, Alexandr.  |t Compiling and Annotating a Learner Corpus for a Morphologically Rich Language.  |d Prague : Karolinum Press, ©2020  |z 9788024647593 
929 |a oclccm 
999 f f |s 6d780453-f714-48ad-aeb3-7a464144f747  |i fe6588aa-da57-48ac-b949-1cc14426606c 
928 |t Library of Congress classification  |a P128.C68  |l Online  |c UC-FullText  |u https://search.ebscohost.com/login.aspx?direct=true&scope=site&db=e000xna&AN=2729804  |z eBooks on EBSCOhost  |g ebooks  |i 13011562