Convolutional neural networks of the YOLO class in computer vision systems for mobile robotic complexes

Bibliographic details
Parent link:Control and Communications (SIBCON-2019).— 2019.— [18739806, 5 p.]
Main author: Zoev I. V. Ivan Vladimirovich
Corporate author: National Research Tomsk Polytechnic University, Engineering School of Information Technology and Robotics, Department of Information Technology
Other authors: Beresnev A. P. Aleksey Pavlovich, Markov N. G. Nikolai Grigorevich
Summary: Title screen
An important research direction is the development and study of computer vision systems (CVS) for mobile robotic complexes. Today, CVS developers most often use convolutional neural networks (CNNs). To speed up object detection in images, there is a trend toward CNNs implemented in hardware on field-programmable gate arrays (FPGAs). This article shows that the tiny-YOLO CNN from the YOLO class is a promising candidate for hardware implementation on an FPGA. To reduce the FPGA computing resources this CNN requires, the use of Inception-ResNet modules was proposed. We found that the tiny-YOLO-Inception-ResNet2 network architecture provides high object detection accuracy with minimal resource requirements. It is obtained by replacing the fifth convolutional layer of the tiny-YOLO CNN with two sequential Inception-ResNet modules. The article also reports object detection accuracy for a CNN of this architecture when the resource-intensive operations of batch normalization and bias addition are removed from the calculations. These studies were performed for different number representation formats in the FPGA.
Access mode: under an agreement with the organization that holds the resource
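The summary mentions comparing detection accuracy across different number representation formats on the FPGA, but this record does not say which formats were studied. As a purely illustrative sketch, assuming a signed fixed-point format with a configurable number of fractional bits (the function names and the sample weights below are hypothetical and not taken from the article), the rounding error introduced by such a format can be estimated as follows:

```python
def to_fixed_point(x: float, frac_bits: int, total_bits: int = 16) -> int:
    """Quantize x to a signed fixed-point integer, saturating on overflow."""
    scale = 1 << frac_bits
    lowest = -(1 << (total_bits - 1))
    highest = (1 << (total_bits - 1)) - 1
    return max(lowest, min(highest, round(x * scale)))


def from_fixed_point(q: int, frac_bits: int) -> float:
    """Convert a fixed-point integer back to a float."""
    return q / (1 << frac_bits)


def quantization_error(values, frac_bits: int, total_bits: int = 16) -> float:
    """Largest absolute round-trip error over a list of values."""
    return max(
        abs(v - from_fixed_point(to_fixed_point(v, frac_bits, total_bits), frac_bits))
        for v in values
    )


if __name__ == "__main__":
    # Hypothetical weights, used only to show how the error shrinks
    # as more fractional bits are allocated.
    weights = [0.7312, -0.0541, 0.25, -1.5]
    for frac_bits in (4, 8, 12):
        print(frac_bits, quantization_error(weights, frac_bits))
```

For values that stay inside the representable range, the round-trip error is at most half of the smallest step, that is 1 / 2^(frac_bits + 1). This is one reason the choice of format can trade FPGA resources against detection accuracy.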
Language: English
Published: 2019
Online access: https://doi.org/10.1109/SIBCON.2019.8729605
Format: Electronic Book Chapter
KOHA link:https://koha.lib.tpu.ru/cgi-bin/koha/opac-detail.pl?biblionumber=661724
Digital object identifier (DOI): 10.1109/SIBCON.2019.8729605