Accession Number:

ADA460607

Title:

Accelerating Corporate Research in the Development, Application and Deployment of Human Language Technologies

Descriptive Note:

Corporate Author:

IBM THOMAS J WATSON RESEARCH CENTER YORKTOWN HEIGHTS NY

Personal Author(s):

Report Date:

2003-01-01

Pagination or Media Count:

9.0

Abstract:

IBM Research has over 200 people working on Unstructured Information Management UIM technologies with a strong focus on HLT. Spread out over the globe they are engaged in activities ranging from natural language dialog to machine translation to bioinformatics to open-domain question answering. An analysis of these activities strongly suggested that improving the organizations ability to quickly discover each others results and rapidly combine different technologies and approaches would accelerate scientific advance. Furthermore, the ability to reuse and combine results through a common architecture and a robust software framework would accelerate the transfer of research results in HLT into IBMs product platforms. Market analyses indicating a growing need to process unstructured information, specifically multi-lingual, natural language text, coupled with IBM Researchs investment in HLT, led to the development of middleware architecture for processing unstructured information dubbed UIMA. At the heart of UIMA are powerful search capabilities and a data-driven framework for the development, composition and distributed deployment of analysis engines. In this paper we give a general introduction to UIMA focusing on the design points of its analysis engine architecture and we discuss how UIMA is helping to accelerate research and technology transfer.

Subject Categories:

  • Information Science
  • Linguistics
  • Computer Programming and Software

Distribution Statement:

APPROVED FOR PUBLIC RELEASE