Enhanced Annotation for Semantic Segmentation in Unstructured Video Sequences for Robotic Navigation
Abstract:
Visual perception methods identify landmarks and terrain types that can improve a robot's intelligence, allowing it to plan efficient paths that avoid obstacles and rough terrain. Semantic segmentation is a perception task that assigns a label to every pixel of an image; the models that perform it must be rigorously trained on large, accurately annotated datasets. Because annotation relies on human effort, some images inevitably contain mislabeled or unlabeled pixels, which can distort learning and degrade a robot's visual perception. To detect and correct these errors, we propose automated relabeling algorithms. Because object locations change only minutely between consecutive frames of a video sequence, we compare the pixels at corresponding locations in adjacent frames: if their label values match, we infer that label for the unlabeled pixel at the same location in the frame of interest. To collect more evidence, we extend this approach to include peripheral pixels within a radius threshold in the neighboring frames. These pixel-wise labeling solutions, together with analyses of the resulting annotated images, will enable faster annotation and error correction by eliminating human labeling effort. We present initial results of our automatic annotation inference and discuss the implications for machine learning models that provide perception information to autonomous robots.
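To make the inference rule concrete, the sketch below shows one minimal way the pixel-wise relabeling described above could be implemented; it is an illustrative assumption, not the exact algorithm evaluated in this work. It assumes NumPy label maps for the previous, current, and next frames, a hypothetical UNLABELED sentinel value, and a hypothetical relabel_from_neighbors function that fills an unlabeled pixel only when the candidate labels drawn from the neighboring frames (optionally including peripheral pixels within a given radius) agree.

```python
import numpy as np

UNLABELED = 255  # hypothetical sentinel for "no label"; dataset-specific in practice


def relabel_from_neighbors(label_prev, label_curr, label_next, radius=0):
    """Fill unlabeled pixels in label_curr using annotations from the adjacent
    frames. A pixel is relabeled only when every candidate label drawn from the
    previous and next frames agrees. With radius > 0, peripheral pixels within
    that radius in the neighboring frames are also consulted."""
    out = label_curr.copy()
    h, w = label_curr.shape

    for y, x in np.argwhere(label_curr == UNLABELED):
        # Window of corresponding (and, optionally, peripheral) pixels
        # in the two neighboring frames.
        y0, y1 = max(0, y - radius), min(h, y + radius + 1)
        x0, x1 = max(0, x - radius), min(w, x + radius + 1)
        candidates = np.concatenate([
            label_prev[y0:y1, x0:x1].ravel(),
            label_next[y0:y1, x0:x1].ravel(),
        ])
        candidates = candidates[candidates != UNLABELED]

        # Relabel only when the neighboring evidence is unanimous.
        values = np.unique(candidates)
        if values.size == 1:
            out[y, x] = values[0]
    return out
```

In this sketch, requiring unanimity among the candidate labels is a conservative design choice: it favors precision over recall, leaving genuinely ambiguous pixels unlabeled rather than risking the introduction of new annotation errors.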