LE QUY DON
Technical University
VietnameseClear Cookie - decide language by browser settings

Supervised deep actor network for imitation learning in a ground-air UAV-UGVs coordination task

Nguyen, H.T. and Garratt, M. and Bui, L.T. and Abbass, H. (2018) Supervised deep actor network for imitation learning in a ground-air UAV-UGVs coordination task. In: 2017 IEEE Symposium Series on Computational Intelligence, SSCI 2017, 27 November 2017 through 1 December 2017.

Text
Supervised deep actor network for imitation learning in a ground-air UAV-UGVs coordination task..pdf

Download (957kB) | Preview

Abstract

Ground-Air coordination is a very complex environment for a machine learning algorithm. We focus on the case where an Unmanned Aerial Vehicle (UAV) needs to support a group of Unmanned Ground Vehicles (UGVs). The UAV is required to broadcast an image that contains all UGVs, thus, offering a bird-eye-view on the group as a whole. The source of complexity in this task is twofold. First, coordination needs to occur without communication between the UAV and UGVs. Second, the ability of the UAV to sense the UGVs is coupled with the ability of the UAV to learn how to track laterally the UGVs and adapt its vertical position so that the images of the UGVs are appropriately spaced within the camera field of view. In this paper, we propose using the Deep Actor Network component of an Actor-Critic Deep Reinforcement Learning architecture as a supervised learner. The advantage of this approach is that it offers a step towards autonomous learning whereby the full Actor-Critic model can be utilized in the future. Human demonstrations are collected for the deep Actor network to learn from. The system is built using the Gazebo Simulator, Robot Operating System, and the OpenAI Gym. We show that the proposed setup is able to train the UAV to follow the UGVs while maintaining all UGVs within camera range in situations where UGVs are performing complex maneuvers. © 2017 IEEE.

Item Type: Conference or Workshop Item (Paper)
Divisions: Faculties > Faculty of Information Technology
Identification Number: 10.1109/SSCI.2017.8285387
Uncontrolled Keywords: Antennas; Artificial intelligence; Cameras; Complex networks; Coordination reactions; Deep neural networks; Ground vehicles; Intelligent vehicle highway systems; Learning algorithms; Reinforcement learning; Gazebo; Ground-Air Interaction; Human demonstrations; Learning by imitation; OpenAI Gym; Unmanned aerial vehicles (UAV)
Additional Information: Conference code: 134337. Language of original document: English.
URI: http://eprints.lqdtu.edu.vn/id/eprint/9591

Actions (login required)

View Item
View Item