Multilingual ATIS /

Saved in:

Bibliographic Details
Imprint:	Philadelphia, Pa. : Linguistic Data Consortium, 2019.
Description:	1 CD-ROM ; 4 3/4 in.
Language:	English
Subject:	Automatic speech recognition -- Databases. Natural language processing (Computer science) -- Databases. Automatic speech recognition. Natural language processing (Computer science) Databases.
Format:	Unknown
URL for this record:	http://pi.lib.uchicago.edu/1001/cat/bib/12739914

Hidden Bibliographic Details
Varying Form of Title:	Air Travel Information System
Other authors / contributors:	Upadhyay, Shyam Hakkani-Tur, Dilek Tur, Gokhan Rastogi, Abhinav Linguistic Data Consortium. National Institute of Standards and Technology (U.S.) United States. Advanced Research Projects Agency.
ISBN:	1585638749 9781585638741
Notes:	Title from disc surface. Author(s) Shyam Upadhyay, Dilek Hakkani-Tur, Gokhan Tur, Abhinav Rastogi Release Date: February 15, 2019 LDC2019T04. Introduction: Multilingual ATIS was developed by Google Inc. and consists of 5,871 utterances from ATIS2 (LDC93S5), ATIS3 Training Data (LDC94S19), and ATIS3 Test Data (LDC95S26) annotated and translated into Hindi and Turkish. The ATIS (Air Travel Information Services) collection was developed to support the research and development of speech understanding systems. Participants were presented with various hypothetical travel planning scenarios and asked to solve them by interacting with partially or completely automated ATIS systems. The resulting utterances were recorded and transcribed. Data was collected in the early 1990s at five US sites: Raytheon BBN, Carnegie Mellon University, MIT Laboratory for Computer Science, National Institute for Standards and Technology and SRI International. Data: The data in this release is separated into training and test sets following the original ATIS division. The training set contains 4978 utterances selected from the Class A (context independent) training data in the ATIS2 and ATIS3 corpora. The test set contains 893 utterances from the November 1993 and December 1994 data sets in ATIS3. The original English utterances were manually translated into Hindi and Turkish. This release also includes the original English utterance and the machine translation back into English of the manual target language utterance translation. Each utterance is annotated with named entities via table lookup; markers include city, airline, airport names, and dates. All data is stored in UTF-8 encoded tab separated value files.

MARC


LEADER	00000cmm a2200000Ia 4500
001	12739914
008	220325s2019 pau q u eng d
005	20220714181027.8
040			\|a AZU \|b eng \|c AZU \|d OCLCO \|d CGU
020			\|a 1585638749
020			\|a 9781585638741
035			\|a (OCoLC)1305399218
050		4	\|a TK7895.S65 \|b M95 2019
245	0	0	\|a Multilingual ATIS / \|c Linguistic Data Consortium
246	3		\|a Air Travel Information System
260			\|a Philadelphia, Pa. : \|b Linguistic Data Consortium, \|c 2019.
300			\|a 1 CD-ROM ; \|c 4 3/4 in.
500			\|a Title from disc surface.
500			\|a Author(s) Shyam Upadhyay, Dilek Hakkani-Tur, Gokhan Tur, Abhinav Rastogi
500			\|a Release Date: February 15, 2019
500			\|a LDC2019T04.
500			\|a Introduction: Multilingual ATIS was developed by Google Inc. and consists of 5,871 utterances from ATIS2 (LDC93S5), ATIS3 Training Data (LDC94S19), and ATIS3 Test Data (LDC95S26) annotated and translated into Hindi and Turkish. The ATIS (Air Travel Information Services) collection was developed to support the research and development of speech understanding systems. Participants were presented with various hypothetical travel planning scenarios and asked to solve them by interacting with partially or completely automated ATIS systems. The resulting utterances were recorded and transcribed. Data was collected in the early 1990s at five US sites: Raytheon BBN, Carnegie Mellon University, MIT Laboratory for Computer Science, National Institute for Standards and Technology and SRI International.
500			\|a Data: The data in this release is separated into training and test sets following the original ATIS division. The training set contains 4978 utterances selected from the Class A (context independent) training data in the ATIS2 and ATIS3 corpora. The test set contains 893 utterances from the November 1993 and December 1994 data sets in ATIS3. The original English utterances were manually translated into Hindi and Turkish. This release also includes the original English utterance and the machine translation back into English of the manual target language utterance translation. Each utterance is annotated with named entities via table lookup; markers include city, airline, airport names, and dates. All data is stored in UTF-8 encoded tab separated value files.
650		0	\|a Automatic speech recognition \|v Databases.
650		0	\|a Natural language processing (Computer science) \|v Databases.
650		7	\|a Automatic speech recognition. \|2 fast \|0 (OCoLC)fst00822769
650		7	\|a Natural language processing (Computer science) \|2 fast \|0 (OCoLC)fst01034365
655		7	\|a Databases. \|2 fast \|0 (OCoLC)fst01411643
700	1		\|a Upadhyay, Shyam
700	1		\|a Hakkani-Tur, Dilek
700	1		\|a Tur, Gokhan
700	1		\|a Rastogi, Abhinav
710	2		\|a Linguistic Data Consortium.
710	2		\|a National Institute of Standards and Technology (U.S.)
710	1		\|a United States. \|b Advanced Research Projects Agency.
929			\|a cat
999	f	f	\|s cba903f3-1d3c-4f6b-847f-5c6ae0fdb60a \|i cba34ca2-15f8-4016-ab05-e67800683bb0
928			\|t Library of Congress classification \|a TK7895.S65M95 2019 \|p CDRom \|l ASR \|c ASR-JRLASR \|i 12876799
927			\|t Library of Congress classification \|a TK7895.S65M95 2019 \|p CDRom \|l ASR \|c ASR-JRLASR \|b 115529696 \|i 10401798

Multilingual ATIS /

MARC

Similar Items