Summary: | "The source broadcast recordings feature news broadcasts focusing principally on current events from the following sources: Anhui TV, a regional television station in Mainland China, Anhui Province, China Central TV (CCTV), a national and international broadcaster in Mainland China and Phoenix TV, a Hong Kong-based satellite television station .... The transcript files are in plain-text, tab-delimited format (TDF) with UTF-8 encoding, and the transcribed data totals 1,593,049 tokens." -- LDC online catalogue. GALE Phase 2 Chinese Broadcast News Transcripts was developed by the Linguistic Data Consortium (LDC) and contains transcriptions of approximately 110 hours of Chinese broadcast news speech collected in 2006 and 2007 by LDC and Hong University of Science and Technology (HKUST), Hong Kong, during Phase 2 of the DARPA GALE (Global Autonomous Language Exploitation) Program. The source broadcast recordings feature news broadcasts focusing principally on current events from the following sources: Anhui TV, a regional television station in Mainland China, Anhui Province, China Central TV (CCTV), a national and international broadcaster in Mainland China and Phoenix TV, a Hong Kong-based satellite television station.
|