<?xml version="1.0" encoding="utf-8"?><rss version="2.0"><channel><title>RSS for UAV cooperative control with stochastic risk models</title><link>http://academic.research.microsoft.com/Rss.aspx?cata=9&amp;id=51105453</link><description>Search RSS feed for Microsoft Academic Search</description><generator>MSRA Libra RSS Burner</generator><copyright>(c)2008 Microsoft Corpration, All right reserved.</copyright><pubDate>Thu, 23 May 2013 07:58:19 GMT</pubDate><lastBuildDate>Thu, 23 May 2013 07:58:19 GMT</lastBuildDate><category /><item><title>UAV cooperative control with stochastic risk models</title><link>http://academic.research.microsoft.com/Publication/51105453</link><pubDate>Thu, 23 May 2013 00:58:19 GMT</pubDate><guid isPermaLink="false">511054530</guid><description><![CDATA[<div><a href="http://academic.research.microsoft.com/Author/3545466">Alborz Geramifard</a>, <a href="http://academic.research.microsoft.com/Author/53491824">Joshua Redding</a>, <a href="http://academic.research.microsoft.com/Author/1371206">Nicholas Roy</a>, <a href="http://academic.research.microsoft.com/Author/18017204">Jonathan P. How</a>:
            
            <span style="margin-left:20px" /><span style="margin-left:20px"><a href="http://ieeexplore.ieee.org/xpls/abs_all.jsp?arnumber=5991309">view publication</a></span></div><div>Risk and reward are fundamental concepts in the <a href='http://academic.research.microsoft.com/Keyword/8015/cooperative-control'>cooperative control</a>  of unmanned systems. This paper focuses on a constructive relationship between a cooperative planner and a learner in order to mitigate the learning risk while boosting the asymptotic performance and safety of agent behavior. Our framework is an instance of the intelligent <a href='http://academic.research.microsoft.com/Keyword/8015/cooperative-control'>cooperative control</a>  architecture (iCCA) where a learner (Natural actor-critic, Sarsa) initially follows a "safe" policy generated by a cooper- ative planner (consensus-based bundle algorithm). The learner incrementally improves this baseline policy through interaction, while avoiding behaviors believed to be "risky". This paper extends previous work toward the coupling of learning and <a href='http://academic.research.microsoft.com/Keyword/8015/cooperative-control'>cooperative control</a>  strategies in real-time stochastic domains in two ways: (1) the <a href='http://academic.research.microsoft.com/Keyword/35756/risk-analysis'>risk analysis</a>  module supports stochastic risk models, and (2) learning schemes that do not store the policy as a separate entity are integrated with the cooperative planner extending the applicability of iCCA framework. The performance of the resulting approaches are demonstrated through simulation of limited fuel UAVs in a stochastic <a href='http://academic.research.microsoft.com/Keyword/41391/task-assignment'>task assignment</a>  problem. Results show an 8% reduction in risk, while improving the performance up to 30%. I. INTRODUCTION</div><div></div><div></div><div>Published in 2011</div>]]></description></item></channel></rss>