Accession Number:

AD1090741

Title:

A Practitioner's Guide to Maximum Causal Entropy Inverse Reinforcement Learning, Starting from Markov Decision Processes

Descriptive Note:

Technical Report

Corporate Author:

CARNEGIE-MELLON UNIV PITTSBURGH PA PITTSBURGH United States

Personal Author(s):

Report Date:

2019-02-01

Pagination or Media Count:

12.0

Abstract:

This guide is meant to describe both the semantics and mechanics of the Maximum Causal Entropy MaxCausalEnt Inverse Reinforcement Learning IRL algorithm 4. Throughout the remainder of this document, we provide a measure of formal definition of the algorithm, starting from the basics, adding some intuition as we go. We intentionally skip a large amount of prior, related, and theoretic work that motivates and contextualizes the algorithm. For further reading on these subjects, see the works referenced throughout. Finally, the gray break out boxes are notes meant to provide broader context, and can be skipped without breaking the ow of the guide.

Subject Categories:

  • Statistics and Probability

Distribution Statement:

APPROVED FOR PUBLIC RELEASE