ICRA 2012 Paper Abstract


Paper TuB110.5

Malone, Nicholas (University of New Mexico), Rohrer, Brandon R. (Sandia National Laboratories), Tapia, Lydia (University of New Mexico), Lumia, Ron (University of New Mexico), Wood, John (University of New Mexico)

Implementation of an Embodied General Reinforcement Learner on a Serial Link Manipulator

Scheduled for presentation during the Interactive Session "Interactive Session TuB-1" (TuB110), Tuesday, May 15, 2012, 10:30−11:00, Ballroom D

2012 IEEE International Conference on Robotics and Automation, May 14-18, 2012, RiverCentre, Saint Paul, Minnesota, USA

This information is tentative and subject to change. Compiled on December 11, 2017

Keywords Learning and Adaptive Systems, Motion and Path Planning, Biologically-Inspired Robots


BECCA (a Brain-Emulating Cognition and Control Architecture software package) was developed in order to perform general reinforcement learning, that is, to enable unmodeled embodied systems operating in unstructured environments to perform unfamiliar tasks. It accomplishes this through automatic paired feature creation and reinforcement learning algorithms. This paper describes an implementation of BECCA on a seven Degree of Freedom (DoF) Barrett Whole Arm Manipulator (WAM) undergoing a series of experiments designed to test the reinforcement learner's ability to adapt to the WAM hardware. In the experiments, the following is demonstrated, 1) learning to transition the WAM between states, 2) learning to perform at near optimal levels on one, two and three dimensional navigation tasks, 3) applying learning in simulation to hardware performance, 4) learning under inconsistent, human-generated reward, and 5) combining the reinforcement learner with Probabilistic Roadmap Methods (PRM) to improve scalability. The goal of the paper is to demonstrate both the scalability of the BECCA reinforcement learning approach using different formulations of the state space and to show the approach in this paper operating on complex physical hardware.



Technical Content © IEEE Robotics & Automation Society

This site is protected by copyright and trademark laws under US and International law.
All rights reserved. © 2002-2017 PaperCept, Inc.
Page generated 2017-12-11  16:36:26 PST  Terms of use