IROS 2015 Paper Abstract

Close

Paper ThCT10.3

Ure, Nazim Kemal (Massachusetts Institute of Technology), Omidshafiei, Shayegan (Massachusetts Institute of Technology), Lopez, Brett Thomas (Massachusetts Institute of Technology), Agha-mohammadi, Ali-akbar (Qualcomm Research), How, Jonathan Patrick (Massachusetts Institute of Technology), Vian, John (The Boeing Company)

Online Heterogeneous Multiagent Learning under Limited Communication with Applications to Forest Fire Management

Scheduled for presentation during the Regular session "Autonomous Agents" (ThCT10), Thursday, October 1, 2015, 11:50−12:05, Saal B4

2015 IEEE/RSJ International Conference on Intelligent Robots and Systems, Sept 28 - Oct 03, 2015, Congress Center Hamburg, Hamburg, Germany

This information is tentative and subject to change. Compiled on July 19, 2019

Keywords Autonomous Agents, Model Learning

Abstract

Many robotic missions require online estimation of the unknown state transition models associated with uncertainty that stems from mission dynamics. The learning problem is usually distributed among agents in multiagent scenarios, either due to the absence of a centralized processing unit or because of the large size of the joint learning problem. This paper addresses the problem of multiagent learning in the likely scenario that agents estimate different models from their measured data, but they can share information by communicating model parameters. Previous approaches either consider homogeneous scenarios or perform model transfer in an open-loop manner, which hinders the convergence rate. We develop a closed-loop multiagent learning algorithm, Collaborative Filtering-Decentralized Incremental Feature Dependency Discovery (CF-Dec-iFDD), which enables agents to learn and share models in heterogeneous scenarios. Each agent learns a linear function approximation of the actual model, and the number of features is increased incrementally to adjust model complexity based on the observed data. The agents obtain feedback from other agents on the model error reduction associated with the communicated features. Although this increases the communication cost of exchanging features, it improves the quality/utility of what is being exchanged, leading to improved convergence rate. The approach is demonstrated in indoor hardware flight tests on a forest fire management scenario for which agents must learn the transition model of the fire spread depending on external factors such as wind and vegetation. It is shown that CF-Dec-iFDD has superior convergence rate compared to the alternative approaches.

 

 

Technical Content © IEEE Robotics & Automation Society


This site is protected by copyright and trademark laws under US and International law.
All rights reserved. © 2002-2019 PaperCept, Inc.
Page generated 2019-07-19  14:24:12 PST  Terms of use