Exploring Deep Learning Based Robot Perception Techniques for Navigating Outdoor Terrains
Abstract:
When autonomously navigating to its objectives, a ground robot faces formidable challenges in detecting and recognizing its surroundings and the objects within them. From its sensory input, the robot's AI must semantically segment the scene into classes such as terrain, vegetation, human-made structures, debris, and water streams. The on-board perception must then intelligently assess which parts of the scene the robot can traverse safely on the way to its objective. The goal of this project is to develop a novel method of vision-based perception for assessing the navigability of terrains that an autonomous ground vehicle may encounter while traversing natural or structured environments. Thanks to advances in deep learning, computer vision now sometimes exceeds human-level performance on object recognition tasks. These algorithms, however, require large numbers of labeled examples per class to perform accurately. Although visual data are abundant, images relevant to ground navigation, especially labeled ones, are scarce. There is therefore a need for a computer vision algorithm that performs well with small training sets and can recognize novel objects. We propose to investigate GAN-based data augmentation and efficient scene-understanding approaches to tackle the data-scarcity issue in perception for autonomous robotic maneuvers in previously unseen environments. The environments and scenes in which such robots are expected to maneuver are typically unusual, and training data for the current paradigms of deep learning is either scarce or non-existent. Hence, GAN-based data augmentation is expected to provide a path toward terrestrial robotic vehicles capable of perceiving and understanding novel environments.
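
The traversability assessment described above can be illustrated with a minimal sketch: a semantic-segmentation model produces per-pixel class scores, and a downstream step marks pixels belonging to traversable classes. The class names, indices, and the choice of which classes count as traversable are illustrative assumptions, not part of the project itself.

```python
import numpy as np

# Hypothetical class indices for a segmentation model's output
# (names and indices are assumptions for illustration only).
CLASSES = {0: "terrain", 1: "vegetation", 2: "structure", 3: "debris", 4: "water"}
TRAVERSABLE = {0}  # assume only bare terrain is considered safe here

def traversability_mask(scores: np.ndarray) -> np.ndarray:
    """Map per-pixel class scores (H, W, C) to a boolean traversability mask."""
    labels = scores.argmax(axis=-1)            # per-pixel semantic label
    return np.isin(labels, list(TRAVERSABLE))  # True where the robot may drive

# Toy example: a 2x2 "image" with 5 class scores per pixel
scores = np.zeros((2, 2, 5))
scores[0, 0, 0] = 1.0  # terrain    -> traversable
scores[0, 1, 4] = 1.0  # water      -> not traversable
scores[1, 0, 1] = 1.0  # vegetation -> not traversable
scores[1, 1, 0] = 1.0  # terrain    -> traversable
print(traversability_mask(scores))
```

In a full system, the scores would come from a deep segmentation network trained (or augmented) with the GAN-generated data discussed above, and the mask would feed the path planner.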