BBN: Description of the PLUM System as Used for MUC-4

Ayuso, Damaris; Boisen, Sean; Fox, Heidi; Gish, Herb; Ingria, Robert; Weischedel, Ralph

BBN: Description of the PLUM System as Used for MUC-4

Active / Technical Report | Accession Number: ADA460888 |

Open PDF

Abstract:

Traditional approaches to the problem of extracting data from texts have emphasized hand-rafted linguistic knowledge. In contrast, BBNs PLUM system Probabilistic Language Understanding Model was developed as part of a DARPA-funded research effort on integrating probabilistic language models with more traditional linguistic techniques. Our research and development goals are more rapid development of new applications, the ability to train and re-train systems based on user markings of correct and incorrect output, more accurate selection among interpretations when more than one is found, and more robust partial interpretation when no complete interpretation can be found. A central assumption of our approach is that in processing unrestricted text for data extraction, a non-trivial amount of the text will not be understood. As a result, all components of PLUM are designed to operate on partially understood input, taking advantage of information when available, and not failing when information is unavailable. We had previously performed experiments on components of the system with texts from the Wall Street Journal, however, the MUC-3 task was the first end-to-end application of PLUM. Very little hand-tuning of knowledge bases was done for MUC-4 since MUC-3, the system architecture as depicted in figure 1 has remained essentially the same. In addition to participating in MUC-4, since MUC-3 we focused on porting to new domains and a new language, and on performing various experiments designed to control recallprecision tradeoffs. To support these goals, the preprocessing component and the fragment combiner were made declarative the semantics component was generalized to use probabilities on word senses we expanded our treatment of reference we enlarged the set of system parameters at all levels and we created a new probabilistic classifier for text relevance which filters discourse events.

Author(s):

Ayuso, Damaris ; Boisen, Sean ; Fox, Heidi ; Gish, Herb ; Ingria, Robert ; Weischedel, Ralph

Author Organization(s):

BBN SYSTEMS AND TECHNOLOGIES CORP CAMBRIDGE MA

Descriptive Note:

Conference paper

Supplementary Note:

Presented at the Message Understanding Conference (4th) (MUC-4), held in McLean, VA on 16-18 June 1992. Pub. in the Proceedings of the Message Understanding Confeence (4th) (MUC-4), 1992. Paper M92-1024.

Pagination:

0009

Security Markings

DOCUMENT & CONTEXTUAL SUMMARY

Distribution:

Approved For Public Release

Distribution Statement:

Approved For Public Release; Distribution Is Unlimited.

RECORD

Collection: TR

Identifying Numbers

Contract/Grant Number(s):

F30602-91-C-0051

Monitor Series:

DARPA

Subject Terms

Joint Capability Areas:

JCA_3.2_Engagement; JCA_3_Force Application; JCA_3.2.1_Kinetic Means; JCA_1_Force Support; JCA_5_Command and Control; JCA_1.2_Force Preparation; JCA_1.2.1_Training; JCA_5.3_Planning; JCA_8_Building Partnerships

Communities of Interest:

Weapons Technologies

Descriptor(s):

*MATHEMATICAL MODELS, *INFORMATION RETRIEVAL, *LANGUAGE TRANSLATION, *KNOWLEDGE BASED SYSTEMS, *TEXT PROCESSING, SYMPOSIA, PROBABILITY, SEMANTICS, MARKOV PROCESSES, RECALL, PARSERS, PREPROCESSING, COMPUTATIONAL LINGUISTICS, MARKERS, TEMPLATES, ACCURACY, COMPUTER ARCHITECTURE, PRECISION

Field(s)/Group(s):

Information Science, Linguistics, Cybernetics

Keyword(s):

*PROBABILISTIC LANGUAGE UNDERSTANDING MODELS, *MESSAGE UNDERSTANDING, PLUM(PROBABILISTIC LANGUAGE UNDERSTANDING MODEL), PARTIAL UNDERSTANDING, DISCOURSE PROCESSING, FPP(FAST PARTIAL PARSER), WORD TAGGING, TEXT RELEVANCE, TEXT CLASSIFIERS, SEMANTIC INTERPRETER, MARKOV MODELS

Report Date:

1992 Jan 01

Creation Date:

2007 Feb 07