Accession Number:

ADA512967

Title:

TREC 2008 at the University at Buffalo: Legal and Blog Track

Descriptive Note:

Conference paper

Corporate Author:

STATE UNIV OF NEW YORK AT BUFFALO DEPT OF LIBRARY AND INFORMATION STUDIES

Report Date:

2008-11-01

Pagination or Media Count:

13.0

Abstract:

In the TREC 2008, the team from the State University of New York at Buffalo participated in the Legal track and the Blog track. For the Legal track, we worked on the interactive search task using the Web-based Legacy Tobacco Document Library Boolean search system. Our experiment achieved reasonable precision but suffered significantly from low recall. These results, together with the appealing and adjudication results, suggest that the concept of document relevance in legal e-discovery deserve further investigation. For the Blog distillation task, our official runs were based on a reduced document model in which only text from several most content-bearing fields were indexed. This approach indeed yielded encouraging retrieval effectiveness while significantly decreasing the index size. We also studied query independencedependence and link-based features for finding relevant feeds. For the Blog opinion and polarity tasks, we mainly investigated the usefulness of opinionated words contained in the SentiGI lexicon. Our experiment results showed that the effectiveness of the technique is quite limited, indicating other more sophisticated techniques are needed.

Subject Categories:

  • Linguistics
  • Computer Systems
  • Information Science

Distribution Statement:

APPROVED FOR PUBLIC RELEASE