Accession Number:

AD1047509

Title:

Foundations of Sequential Learning

Descriptive Note:

Technical Report,01 Apr 2016,31 Aug 2017

Corporate Author:

Duke University Durham United States

Report Date:

2018-02-01

Pagination or Media Count:

95.0

Abstract:

This report summarizes the research done under FA8750-16-2-0173. This research advanced understanding of bandit algorithms and exploration in Markov Decision Processes MDPs. New algorithms and theory were proposed for bandits with periodic payoff multipliers and arms with costs. Exploration and transfer learning algorithms were evaluated for MDPs.

Subject Categories:

  • Cybernetics
  • Statistics and Probability

Distribution Statement:

APPROVED FOR PUBLIC RELEASE