NIST 2008-2012 open machine translation (OpenMT) progress test sets.

Saved in:

Bibliographic Details
Imprint:	[Philadelphia, PA] : Linguistic Data Consortium, c2013.
Description:	1 CD-ROM ; 4 3/4 in.
Language:	Arabic Chinese English
Subject:	Machine translating. Arabic language -- Translating into English -- Databases. Chinese language -- Translating into English -- Databases. Linguistics -- Research. Arabic language -- Translating into English. Chinese language -- Translating into English. Machine translating. Databases.
Format:	E-Resource
URL for this record:	http://pi.lib.uchicago.edu/1001/cat/bib/9108736

Hidden Bibliographic Details
Varying Form of Title:	NIST 2008 2012 open machine translation (OpenMT) progress test sets NIST 2008-2012 open machine translation progress test sets NIST 2008 2012 OpenMT progress test sets
Other authors / contributors:	NIST Multimodal Information Group. National Institute of Standards and Technology (U.S.) Linguistic Data Consortium.
ISBN:	1585636401 9781585636402
Notes:	Title from disc label. Author: NIST Multimodal Information Group. Data type: Text. Data sources: Newswire, Web collection. Application: Machine translation. "LDC2013T07." Also available on the Internet. Arabic, Chinese, English.
Summary:	NIST 2008-2012 Open Machine Translation (OpenMT) Progress Test Sets was developed by NIST Multimodal Information Group. This release contains the evaluation sets (source data and human reference translations), DTD, scoring software, and evaluation plans for the Arabic-to-English and Chinese-to-English progress test sets for the NIST OpenMT 2008, 2009, and 2012 evaluations. The test data remained unseen between evaluations and was reused unchanged each time. The package was compiled, and scoring software was developed, at NIST, making use of Chinese and Arabic newswire and web data and reference translations collected and developed by the Linguistic Data Consortium (LDC). This evaluation kit includes a single Perl script (mteval-v13a.pl) that may be used to produce a translation quality score for one (or more) MT systems. The script works by comparing the system output translation with a set of (expert) reference translations of the same source text. Comparison is based on finding sequences of words in the reference translations that match word sequences in the system output translation. The objective of the OpenMT evaluation series is to support research in, and help advance the state of the art of, machine translation (MT) technologies -- technologies that translate text between human languages. Input may include all forms of text. The goal is for the output to be an adequate and fluent translation of the original. . For more general information about the NIST OpenMT evaluations, please refer to the NIST OpenMT website. The MT evaluation series started in 2001 as part of the DARPA TIDES (Translingual Information Detection, Extraction) program. Beginning with the 2006 evaluation, the evaluations have been driven and coordinated by NIST as NIST OpenMT. These evaluations provide an important contribution to the direction of research efforts and the calibration of technical capabilities in MT. The OpenMT evaluations are intended to be of interest to all researchers working on the general problem of automatic translation between human languages. To this end, they are designed to be simple, to focus on core technology issues and to be fully supported