View The Document

Accession Number:

AD1195012

Title:

Multimodal Decompositional Semantics

Author(s):

Author Organization(s):

Report Date:

2023-03-08

Abstract:

Johns Hopkins University, partnering with the University of Rochester, pursued research and development of analytics in support of a larger framework for knowledge-driven hypothesis testing. We leveraged our expertise in dataset creation for decompositional semantics, to develop new datasets specifically geared towards the extraction problems of the DARPAs Active Interpretation of Disparate Alternatives (AIDA) program (specifically in event extraction and coreference resolution). Notable examples of results from our team include: the construction of RAMs, the first publicly available multi-sentence event extraction dataset; the development of state of the art multilingual coreference models, including an online variant that handled long documents with a fixed amount of memory, as well as a new multilingual dataset that focused on multi-person dialogues; a new supervised approaches to cross-lingual alignment, supporting the automatic creation of training data through projecting from English to less-resourced languages; a framework for sentence-level paraphrasing and data augmentation; collaborations on the emerging science of probing neural language models; and the development of new decompositional resources and analysis across a number of new linguistic dimensions. In the initial phase of the program, we provide analytic outputs as part of the program-wide evaluation run by NIST (focus on multilingual text and speech). In the second phase we provided fewer components, focusing exclusively on text. In the third phase our focus was on data annotation under a newly proposed claim frame task, which exercised our background in crowd-sourcing rich linguistic annotations.

Pages:

38

File Size:

2.36MB

Descriptors:

Identifiers:

SubjectCategory:

Communities of Interest:

Distribution Statement:

Approved For Public Release

View The Document