Recognizing and Tracking Person of Interest: A Real-Time Efficient Deep Learning based Method for Quadcopters

This work presents a distributed system architecture that leverages the asynchronous threading and communication property of ROS2 to develop and implement a real-time efficient Deep Learning (DL) based method for recognizing and tracking a person of interest. The DL model receives snapshots from the quadcopter’s camera and sends back an information vector, which includes all recognized persons and their corresponding position information within the camera frame of the quadcopter. The person of interest tracking control system receives face set information about the person of interest and generates reference velocity signals to be tracked by low-level controllers embedded within the drone. Experiments conducted in a cluttered and complex environment demonstrate the efficiency of the DL-based architecture for quadcopters. Below is a video demonstrating our work:

Images