摘要
Thispaperpresentsamodel-basedapproximateλ-policyiterationapproachusingtemporaldifferencesforoptimizingpathsonlineforapursuit-evasionproblem,whereanagentmustvisitseveraltargetpositionswithinaregionofinterestwhilesimultaneouslyavoidingoneormoreactivelypursuingadversaries.Thismethodisrelevanttoapplications,suchasroboticpathplanning,mobile-sensorapplications,andpathexposure.Themethodologydescribedutilizescelldecompositiontoconstructadecisiontreeand...
出版日期
2011年03月13日(中国期刊网平台首次上网日期,不代表论文的发表时间)