Compiling and Annotating a Learner Corpus for a Morphologically Rich Language

Saved in:

Bibliographic Details
Author / Creator:	Rosen, Alexandr.
Imprint:	Prague : Karolinum Press, 2020.
Description:	1 online resource (281 pages)
Language:	English
Subject:	Corpora (Linguistics) Czech language. Corpora (Linguistics) Czech language. Electronic books.
Format:	E-Resource Book
URL for this record:	http://pi.lib.uchicago.edu/1001/cat/bib/12873857

Hidden Bibliographic Details
Other authors / contributors:	Hana, Jiří. Vidová Hladká, Barbora.
ISBN:	9788024647654 8024647656 9788024647593
Notes:	Automatic annotation checking. Print version record.
Other form:	Print version: Rosen, Alexandr. Compiling and Annotating a Learner Corpus for a Morphologically Rich Language. Prague : Karolinum Press, ©2020 9788024647593

MARC


LEADER	00000cam a2200000Mu 4500
001	12873857
006	m o d
007	cr \|n\|---\|\|\|\|\|
008	210116s2020 xr o 000 0 eng d
005	20240708192011.0
019			\|a 1232480437
020			\|a 9788024647654
020			\|a 8024647656
020			\|z 9788024647593
035			\|a (OCoLC)1231608795 \|z (OCoLC)1232480437
035		9	\|a (OCLCCM-CC)1231608795
040			\|a EBLCP \|b eng \|e pn \|c EBLCP \|d N$T \|d OCLCO \|d EBLCP \|d OCLCF
049			\|a MAIN
050		4	\|a P128.C68
100	1		\|a Rosen, Alexandr.
245	1	0	\|a Compiling and Annotating a Learner Corpus for a Morphologically Rich Language
260			\|a Prague : \|b Karolinum Press, \|c 2020.
300			\|a 1 online resource (281 pages)
336			\|a text \|b txt \|2 rdacontent
337			\|a computer \|b c \|2 rdamedia
338			\|a online resource \|b cr \|2 rdacarrier
588	0		\|a Print version record.
505	0		\|a Cover -- Contents -- List of abbreviations -- Introduction -- About this book -- Reasons to study non-native Czech -- Some properties of non-native Czech -- Morphology -- Syntax -- Word segmentation -- Learner corpus -- Roadmap -- Learner corpora -- Terminology -- Various types of learner corpora -- The choice of texts -- Annotation -- Textual annotation -- Linguistic annotation -- Error annotation -- correction -- Error annotation -- categorization -- Annotation scheme -- Data access -- Some learner corpora -- ASK -- CLC -- COPLE2 -- CroLTeC -- Falko -- ICLE -- MERLIN -- RLC -- SweLL
505	8		\|a Relationships of CzeSL with other learner corpora -- Introducing the CzeSL project -- Specifications of CzeSL -- Intended usage -- AKCES -- the umbrella project -- Procurement of texts -- Text collection -- Transcription -- Anonymization -- Metadata -- Error annotation -- Errors and learner language -- More than one way to annotate errors in CzeSL -- A wishlist for error annotation -- Interference and other types of explanation -- Interpretation in terms of TH -- Word order -- Style -- Communication goal -- The two-tier annotation scheme -- Annotation scheme as a compromise -- Why multiple tiers
505	8		\|a How many tiers -- Multiple tiers in a tabular format -- Content of the tiers -- A sample text with T1 vs. T2 corrections -- Links between tiers -- Error tags -- Morphosyntactic references -- Follow-up corrections -- Alternative target hypotheses -- Error tagset -- Based on linguistic categories -- Grammar-based vs. formal errors -- Extent of the annotated unit -- Grammar-based tags -- Errors at T1 -- Errors at T2 -- Coarse-grained -- An example of complex annotation -- Evaluation of the manual tiered error annotation -- Inter-annotator agreement (IAA) -- A pilot annotation
505	8		\|a IAA on all doubly-annotated texts -- Error tags depend on target hypothesis -- Possible causes of the annotators' disagreements -- Formal tags -- Automatic extension and modification of error annotation -- Automatic detection of formal errors on T1 -- Formal orthographic errors -- Formal errors sometimes influencing pronunciation -- Formal errors influencing pronunciation -- Other types of errors -- Automatic classification of word-boundary errors -- Implicit error annotation -- Multi-dimensional error annotation (MD) -- Focus on morphology -- All annotation applied to the source text
505	8		\|a Extent of the annotated unit -- Alternative error domains -- Source text, target hypothesis, annotated strings -- Domains and features -- Linguistic annotation -- Annotation with tools for Standard Czech -- Annotation of target hypothesis -- Annotation of T1 -- Annotation of source texts -- Annotation of interlanguage in UD -- Tokenization -- Part-of-speech and morphology -- Lemmata -- Syntactic Structure -- Evaluation -- Annotation process -- Overview of the annotation process -- Transcription and anonymization of manuscripts -- Tiered error annotation -- Manual error annotation
500			\|a Automatic annotation checking.
650		0	\|a Corpora (Linguistics) \|0 http://id.loc.gov/authorities/subjects/sh2006006393
650		0	\|a Czech language. \|0 http://id.loc.gov/authorities/subjects/sh85035271
650		7	\|a Corpora (Linguistics) \|2 fast \|0 (OCoLC)fst01740921
650		7	\|a Czech language. \|2 fast \|0 (OCoLC)fst00886348
655		0	\|a Electronic books.
655		4	\|a Electronic books.
700	1		\|a Hana, Jiří.
700	1		\|a Vidová Hladká, Barbora.
776	0	8	\|i Print version: \|a Rosen, Alexandr. \|t Compiling and Annotating a Learner Corpus for a Morphologically Rich Language. \|d Prague : Karolinum Press, ©2020 \|z 9788024647593
929			\|a oclccm
999	f	f	\|s 6d780453-f714-48ad-aeb3-7a464144f747 \|i fe6588aa-da57-48ac-b949-1cc14426606c
928			\|t Library of Congress classification \|a P128.C68 \|l Online \|c UC-FullText \|u https://search.ebscohost.com/login.aspx?direct=true&scope=site&db=e000xna&AN=2729804 \|z eBooks on EBSCOhost \|g ebooks \|i 13011562

Compiling and Annotating a Learner Corpus for a Morphologically Rich Language

MARC

Similar Items