Accession Number:

AD1157983

Title:

Anaphoric Annotation in the ARRAU Corpus

Descriptive Note:

[Technical Report, Research Paper]

Corporate Author:

UNIVERSITY OF SOUTHERN CALIFORNIA LOS ANGELES

Personal Author(s):

Report Date:

2008-01-01

Pagination or Media Count:

5

Abstract:

Arrau is a new corpus annotated for anaphoric relations, with information about agreement and explicit representation of multiple antecedents for ambiguous anaphoric expressions and discourse antecedents for expressions which refer to abstract entities such as events, actions and plans. The corpus contains texts from different genres task-oriented dialogues from the Trains-91 and Trains-93 corpus, narratives from the English Pear Stories corpus, newspaper articles from the Wall Street Journal portion of the Penn Treebank, and mixed text from the Gnome corpus.

Subject Categories:

  • Linguistics
  • Cybernetics

Distribution Statement:

[A, Approved For Public Release]