CSLU, 22 languages corpus.

Bibliographic Details
Imprint:	[Philadelphia, PA] : Linguistic Data Consortium, c2005.
Description:	2 DVD-ROMs ; 4 3/4 in.
Language:	English
Subject:	Telephone calls -- Databases. Computational linguistics -- Databases. Computational linguistics. Telephone calls. Databases.
Format:	DVD Video E-Resource
URL for this record:	http://pi.lib.uchicago.edu/1001/cat/bib/7729075

Hidden Bibliographic Details
Varying Form of Title:	22 languages corpus; Twenty-two languages corpus; Center for Spoken Language Understanding twenty-two languages corpus
Other authors / contributors:	Oregon Health Sciences University. Center for Spoken Language Understanding; Linguistic Data Consortium.
ISBN:	1585633569; 9781585633562
Notes:	Title from disc label. "LDC2005S26."
Summary:	"Produce[d] by Center for Spoken Language Understanding and distributed by the Linguistic Data Consortium, the 22 Language corpus consists of telephone speech from 21 languages: Eastern Arabic, Cantonese, Czech, Farsi, German, Hindi, Hungarian, Japanese, Korean, Malay, Mandarin, Italian, Polish, Portuguese, Russian, Spanish, Swedish, Swahili, Tamil, Vietnamese, and English. The corpus contains fixed vocabulary utterances (e.g. days of the week) as well as fluent continuous speech. Each of the 50191 utterances is verified by a native speaker to determine if the caller followed instructions when answering the prompts. For this release, approximately 19758 utterances have corresponding orthographic transcriptions."--Introd.

Similar Items

CSLU Yes/No : version 1.2 /
Published: (2007)

Multilanguage telephone speech version 1.2 /
Published: (2006)

Switchboard cellular part 1. Speech files for speaker identification (SID) /
Published: (2001)

CSLU Apple words and phrases /
Published: (2007)

2007 NIST language recognition evaluation supplemental training set.
Published: (2009)

Switchboard cellular part 1. Speech files for conversational speech recognition (HUB-5/LVCSR) /
Published: (2001)

CSLU : speaker recognition version 1.1 /
Published: (2006)

IARPA Babel Guarani language pack IARPA-babel305b-v1.0c
Published: (2019)

CSLU national cellular telephone speech, release 2.3.
Published: (2008)

CSLU Portland cellular telephone speech, version 1.3.
Published: (2008)

CSLU alphadigit. Version 1.3.
Published: (2008)

IARPA Babel Amharic language pack IARPA-babel307b-v1.0b
Published: (2015)

IARPA Babel Cantonese language pack IARPA-babel101b-v0.4c
Published: (2016)

2017 NIST OpenSAT Pilot - SSSF ;
Published: (2022)

IARPA Babel Tagalog Language Pack IARPA-babel106-v0.2g.
Published: (2015)

IARPA Babel Lithuanian Language Pack IARPA-babel304b-v1.0b.
Published: (2015)

IARPA Babel Pashto Language Pack IARPA-babel104b-v0.4bY.
Published: (2016)

IARPA Babel Swahili Language Pack IARPA-babel202b-v1.0d.
Published: (2015)

IARPA Babel Lao Language Pack IARPA-babel203b-v3.1a.
Published: (2015)

IARPA Babel Tamil Language Pack IARPA-babel204b-v1.1b.
Published: (2015)

CSLU kid's speech, version 1.1.
Published: (2007)

Voicemail corpus. Part II.
Published: (2002)

Gulf Arabic conversational telephone speech transcripts /
Published: (2006)

1998 HUB5 English Transcripts.
Published: (2003)

Iraqi Arabic conversational telephone speech, transcripts.
Published: (2006)

The walking around corpus.
Published: (2015)

Korean telephone conversations speech
Published: (2003)

2003 NIST rich transcription evaluation data.
Published: (2007)

CALLFRIEND Farsi second edition transcripts /
Published: (2014)

RATS language identification.
Published: (2017)

RATS keyword spotting.
Published: (2017)

British national corpus XML edition /
Published: (2007)

Gulf Arabic conversational telephone speech /
Published: (2006)

2010 NIST speaker recognition evaluation test set.
Published: (2017)

Iraqi Arabic conversational telephone speech.
Published: (2006)

NXT switchboard annotations.
Published: (2003)

CALLFRIEND Farsi second edition speech /
Published: (2014)

HKUST Mandarin telephone speech. part 1 /
Published: (2005)

MDE RT-04 training data speech /
Published: (2005)

MDE RT-04 training data text/annotations /
Published: (2005)