CSLU, 22 languages corpus.

Bibliographic Details
Imprint: [Philadelphia, PA] : Linguistic Data Consortium, c2005.
Description: 2 DVD-ROMs ; 4 3/4 in.
Language: English
Subject:
Format: DVD Video E-Resource
URL for this record: http://pi.lib.uchicago.edu/1001/cat/bib/7729075
Hidden Bibliographic Details
Varying Form of Title: 22 languages corpus
Twenty-two languages corpus
Center for Spoken Language Understanding twenty-two languages corpus
Other authors / contributors: Oregon Health Sciences University. Center for Spoken Language Understanding.
Linguistic Data Consortium.
ISBN: 1585633569
9781585633562
Notes: Title from disc label.
"LDC2005S26."
Summary:"Produce[d] by Center for Spoken Language Understanding and distributed by the Linguistic Data Consortium, the 22 Language corpus consists of telephone speech from 21 languages: Eastern Arabic, Cantonese, Czech, Farsi, German, Hindi, Hungarian, Japanese, Korean, Malay, Mandarin, Italian, Polish, Portuguese, Russian, Spanish, Swedish, Swahili, Tamil, Vietnamese, and English. The corpus contains fixed vocabulary utterances (e.g. days of the week) as well as fluent continuous speech. Each of the 50191 utterances is verified by a native speaker to determine if the caller followed instructions when answering the prompts. For this release, approximately 19758 utterances have corresponding orthographic transcriptions."--Introd.