CSLU, 22 languages corpus.
Imprint: | [Philadelphia, PA] : Linguistic Data Consortium, c2005. |
---|---|
Description: | 2 DVD-ROMs ; 4 3/4 in. |
Language: | English |
Subject: | |
Format: | DVD Video E-Resource |
URL for this record: | http://pi.lib.uchicago.edu/1001/cat/bib/7729075 |
Varying Form of Title: | 22 languages corpus; Twenty-two languages corpus; Center for Spoken Language Understanding twenty-two languages corpus |
Other authors / contributors: | Oregon Health Sciences University. Center for Spoken Language Understanding; Linguistic Data Consortium. |
ISBN: | 1585633569; 9781585633562 |
Notes: | Title from disc label. "LDC2005S26." |
Summary: | "Produce[d] by Center for Spoken Language Understanding and distributed by the Linguistic Data Consortium, the 22 Language corpus consists of telephone speech from 21 languages: Eastern Arabic, Cantonese, Czech, Farsi, German, Hindi, Hungarian, Japanese, Korean, Malay, Mandarin, Italian, Polish, Portuguese, Russian, Spanish, Swedish, Swahili, Tamil, Vietnamese, and English. The corpus contains fixed vocabulary utterances (e.g. days of the week) as well as fluent continuous speech. Each of the 50191 utterances is verified by a native speaker to determine if the caller followed instructions when answering the prompts. For this release, approximately 19758 utterances have corresponding orthographic transcriptions."--Introd. |
Similar Items
- CSLU Yes/No : version 1.2 /
  Published: (2007)
- Multilanguage telephone speech version 1.2 /
  Published: (2006)
- Switchboard cellular part 1. Speech files for speaker identification (SID) /
  Published: (2001)
- CSLU Apple words and phrases /
  Published: (2007)
- 2007 NIST language recognition evaluation supplemental training set.
  Published: (2009)