Portable Language-Independent Adaptive Translation From OCR
Quarterly status rept. no. 6, 1 Jan-31 Mar 2009
BBN TECHNOLOGIES CAMBRIDGE MA
Pagination or Media Count:
This is the sixth RD quarterly progress report QPR of the BBN-led team under DARPAs MADCAT program. This report is organized by technical task area. The following tasks were performed this quarter 1.1. Pre-Processing and Page Segmentation - Text Segmentation and Verification Shape-DNA based Handwritten Text Line Detection Text line detection and separation. 1.2. Text Recognition - Error Analysis Training with Phase 2 Data Unsupervised Scribe Adaptation Named Entity Detection using Lattices. 1.4. Integration with GALE MT - Recognition Lattices for MT. 1.5. Metadata Extraction - Logo Recognition.