Autonomous Robot Skill Acquisition
Abstract:
Among the most impressive aspects of human intelligence is skill acquisition--the ability to identify important behavioral components, retain them as skills, refine them through practice, and apply them in new task contexts. Skill acquisition underlies both our ability to choose to spend time and effort to specialize at particular tasks, and our ability to collect and exploit previous experience to become able to solve harder and harder problems over time with less and less cognitive effort. Hierarchical reinforcement learning provides a theoretical basis for skill acquisition, including principled methods for learning new skills and deploying them during problem solving. However, existing work focuses largely on small, discrete problems. This dissertation addresses the question of how to scale such methods up to high-dimensional, continuous domains, in order to design robots that are able to acquire skills autonomously. This presents three major challenges; we introduce novel methods addressing each of them.

First, how does an agent operating in a continuous environment discover skills? Although the literature contains several methods for skill discovery in discrete environments, it offers none for the general continuous case. We introduce skill chaining, a general skill discovery method for continuous domains. Skill chaining incrementally builds a skill tree that allows an agent to reach a solution state from any of its start states by executing a sequence (or chain) of acquired skills. We empirically demonstrate that skill chaining can improve performance over monolithic policy learning in the Pinball domain, a challenging dynamic and continuous reinforcement learning problem.

Second, how do we scale up to high-dimensional state spaces? While learning in relatively small domains is generally feasible, it becomes exponentially harder as the number of state variables grows.
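The chaining idea described above (acquire a skill whose target is the goal, then a skill whose target is the first skill's initiation set, and so on back toward the start states) can be illustrated with a toy one-dimensional sketch. All names here, and the fixed `reach` parameter, are illustrative assumptions for exposition only; the actual method learns each skill's policy and initiation set from experience rather than assuming them:

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Skill:
    """A toy skill (option) on a 1-D state space [0, 1].

    Its initiation set is [init_lo, init_hi); executing it is idealized
    here as transporting the agent to init_hi, the start of its target
    region (the previous skill's initiation set, or the goal region).
    """
    init_lo: float
    init_hi: float


def build_skill_chain(start: float, goal_lo: float,
                      reach: float = 0.25) -> List[Skill]:
    """Chain skills backward from the goal until the start state is covered.

    `reach` stands in for how far a single skill can extend; assumes
    start < goal_lo.
    """
    chain: List[Skill] = []
    target_lo = goal_lo
    while True:
        init_lo = max(0.0, target_lo - reach)
        chain.append(Skill(init_lo, target_lo))
        if init_lo <= start < target_lo:
            break  # newest skill can be initiated from the start state
        target_lo = init_lo  # next skill targets this initiation set
    return list(reversed(chain))  # execution order: start -> goal


def execute(chain: List[Skill], s: float) -> float:
    """Run the chain from state s, assuming each skill reaches its target."""
    for skill in chain:
        assert skill.init_lo <= s < skill.init_hi, "outside initiation set"
        s = skill.init_hi
    return s


if __name__ == "__main__":
    chain = build_skill_chain(start=0.0, goal_lo=0.75, reach=0.25)
    print(len(chain), execute(chain, 0.0))  # three skills reach the goal
```

With a start state of 0.0, a goal region beginning at 0.75, and a reach of 0.25, the sketch produces a chain of three skills whose initiation sets tile the interval from start to goal; executing them in order carries the agent to the goal boundary.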