Accession Number : ADA616866


Title :   Impact of Machine-Translated Text on Entity and Relationship Extraction


Descriptive Note : Final report, Oct 2013-Sep 2014


Corporate Author : ARMY RESEARCH LAB ABERDEEN PROVING GROUND MD COMPUTATIONAL AND INFORMATION SCIENCES DIRECTORATE


Personal Author(s) : Mittrick, Mark R ; Richardson, John T


Full Text : https://apps.dtic.mil/dtic/tr/fulltext/u2/a616866.pdf


Report Date : Dec 2014


Pagination or Media Count : 28


Abstract : We performed an experiment to study the effects of machine translation (performed by software) versus manual translation (performed by a human) on the performance of a Small Business Innovation Research text analytics tool. The tool used in the experiment is Contour, developed by Decisive Analytics Corporation, which automatically builds high-fidelity social networks from text data sets too large to be scrutinized in detail through manual effort. Specifically, we analyzed its ability to extract text entities with the roles of person, location, or organization. The data consist of translations of many news stories collected from Arabic-language websites. Each story has 5 translations to examine (4 human and 1 machine). The Contour results for the machine translation are analyzed against the Contour results for the manual translations.


Descriptors :   *DATA MINING , *ONLINE COMMUNITIES , *TEXT PROCESSING , *TRANSLATIONS , ARABIC LANGUAGE , AUTOMATION , EXPERIMENTAL DATA , INFORMATION EXCHANGE , NATURAL LANGUAGE , PATTERN RECOGNITION , PERFORMANCE(ENGINEERING) , PERFORMANCE(HUMAN) , SEMANTICS , SOFTWARE TOOLS , WORDS(LANGUAGE)


Subject Categories : Information Science
      Linguistics
      Computer Programming and Software
      Computer Systems


Distribution Statement : APPROVED FOR PUBLIC RELEASE