View The Document

Accession Number:

AD1049685

Title:

Improving LSTM-based Video Description with Linguistic Knowledge Mined from Text

Author(s):

Author Organization(s):

Report Date:

2016-11-29

Abstract:

This paper investigates how linguistic knowledge mined from large text corpora can aid the generation of natural language descriptions of videos. Specifically, we integrate both a neural language model and distributional semantic strained on large text corpora into a recent LSTM-based architecture for video description. We evaluate our approach on a collection of Youtube videos as well as two large movie description datasets showing significant improvements in grammaticality while modestly improving descriptive quality.

Pages:

6

File Size:

1.73MB

Descriptors:

SubjectCategory:

Distribution Statement:

Approved For Public Release

View The Document