Multilingual automatic document classification analysis and translation (MADCAT) phase 1 training.

Saved in:
Bibliographic Details
Imprint:[Philadelphia, PA] : Linguistic Data Consortium, c2012.
Description:2 DVD ; 4 3/4 in.
Language:Arabic
Subject:
Format: E-Resource
URL for this record:http://pi.lib.uchicago.edu/1001/cat/bib/8926012
Hidden Bibliographic Details
Varying Form of Title:Title in LDC online catalogue: MADCAT phase 1 training set
Other authors / contributors:Lee, David.
Linguistic Data Consortium.
ISBN:1585636231
9781585636235
Notes:Title from disc label.
Data type: Text.
Data sources: Newsgroups, newswire, weblogs.
Applications: Handwriting recognition, machine translation.
"LDC2012T15".
Authors: David Lee, Safa Ismael, Stephen Grimes, Dave Doermann, Stephanie Strassel, Zhiyi Song.
Arabic
Summary:"MADCAT (Multilingual Automatic Document Classification Analysis and Translation) phase 1 training set contains all training data created by the Linguistic Data Consortium (LDC) to support Phase 1 of the DARPA MADCAT Program. The material in this release consists of handwritten Arabic documents, scanned at high resolution and annotated for the physical coordinates of each line and token. Digital transcripts and English translations of each document are also provided, with the various content and annotation layers integrated in a single MADCAT XML output. The goal of the MADCAT program is to automatically convert foreign text images into English transcripts." -- LDC online catalogue.

Mansueto

Loading map link
Holdings details from Mansueto
Call Number: CDRom PJ6123.M851 2012
c.1 Available Loan period: standard loan  Request from Mansueto Scan and Deliver Need help? - Ask a Librarian

Mansueto

Loading map link
Holdings details from Mansueto
Call Number: CDRom PJ6123.M851 2012
c.2 Available Loan period: standard loan  Request from Mansueto Scan and Deliver Need help? - Ask a Librarian