Development of the video stream object detection algorithm (VSODA) with tracking; EAI Endorsed Transactions on Energy Web; Vol. 19, iss. 22

Detalhes bibliográficos
Parent link:EAI Endorsed Transactions on Energy Web
Vol. 19, iss. 22.— 2019.— [e1, 5 p.]
Autor Corporativo: Национальный исследовательский Томский политехнический университет Инженерная школа информационных технологий и робототехники Отделение автоматизации и робототехники
Outros Autores: Zarnitsyn A. Yu. Aleksander Yurievich, Volkov A. S. Artem Sergeevich, Voytsekhovskiy A. A. Aleksey Alekseevich, Pyakullya B. I. Boris Ivanovich
Resumo:Title screen
The object tracking is one of the most important task in video analysis. Many methods have been proposed such as TLD (Tracking, Learning, Detection), Meanshift and MIL but they show good accuracy in laboratory cases, not in real ones, where the accuracy is defined as a numerical difference between computed object coordinates and the real ones. One of the reasons is lack of information about tracked object and environment changes. If a method has the prior information about tracked object, then it will be able to perform with higher accuracy. Some of the newest object tracking methods such as GOTURN use trained CNN (convolutional neural network) and have better accuracy because of knowledge about how the tracked object looks like in different situations such as light intensity changes and tracked object’s rotations. If we use only a classification algorithm (classifier) then it can find an object that was in training set with high probability. But if its appearance is changing it will be lost when deviation will be higher than trust limit. Then it is important to have parts of prior and posterior information about tracked object. The prior information is given by detector (CNN) and posterior information – by tracking algorithm (TLD). One of the biggest detector problems is high computational complexity in terms of operations’ number and one of the solutions is to use the classifier in parallel with the tracker. In future work we are going to use different sensors, not only RGB camera, but RGBD camera, which may improve accuracy due to higher amount of information.
Idioma:inglês
Publicado em: 2019
Assuntos:
Acesso em linha:http://dx.doi.org/10.4108/eai.22-1-2019.156385
Formato: xMaterials Recurso Eletrônico Capítulo de Livro
KOHA link:https://koha.lib.tpu.ru/cgi-bin/koha/opac-detail.pl?biblionumber=661768

MARC

LEADER 00000naa0a2200000 4500
001 661768
005 20250417152252.0
035 |a (RuTPU)RU\TPU\network\32640 
035 |a RU\TPU\network\29971 
090 |a 661768 
100 |a 20200214d2019 k||y0rusy50 ba 
101 0 |a eng 
135 |a drcn ---uucaa 
181 0 |a i  
182 0 |a b 
200 1 |a Development of the video stream object detection algorithm (VSODA) with tracking  |f A. Yu. Zarnitsyn, A. S. Volkov, A. A. Voytsekhovskiy, B. I. Pyakullya 
203 |a Text  |c electronic 
300 |a Title screen 
320 |a [References: 9 tit.] 
330 |a The object tracking is one of the most important task in video analysis. Many methods have been proposed such as TLD (Tracking, Learning, Detection), Meanshift and MIL but they show good accuracy in laboratory cases, not in real ones, where the accuracy is defined as a numerical difference between computed object coordinates and the real ones. One of the reasons is lack of information about tracked object and environment changes. If a method has the prior information about tracked object, then it will be able to perform with higher accuracy. Some of the newest object tracking methods such as GOTURN use trained CNN (convolutional neural network) and have better accuracy because of knowledge about how the tracked object looks like in different situations such as light intensity changes and tracked object’s rotations. If we use only a classification algorithm (classifier) then it can find an object that was in training set with high probability. But if its appearance is changing it will be lost when deviation will be higher than trust limit. Then it is important to have parts of prior and posterior information about tracked object. The prior information is given by detector (CNN) and posterior information – by tracking algorithm (TLD). One of the biggest detector problems is high computational complexity in terms of operations’ number and one of the solutions is to use the classifier in parallel with the tracker. In future work we are going to use different sensors, not only RGB camera, but RGBD camera, which may improve accuracy due to higher amount of information. 
461 |t EAI Endorsed Transactions on Energy Web 
463 |t Vol. 19, iss. 22  |v [e1, 5 p.]  |d 2019 
610 1 |a электронный ресурс 
610 1 |a труды учёных ТПУ 
610 1 |a computer vision 
610 1 |a deep learning 
610 1 |a machine learning 
610 1 |a pattern recognition 
610 1 |a mobile robotics 
610 1 |a object tracking 
610 1 |a video analysis 
610 1 |a компьютерное зрение 
610 1 |a машинное обучение 
610 1 |a распознавание 
610 1 |a распознавание образов 
610 1 |a мобильная робототехника 
610 1 |a отслеживание 
610 1 |a видеоанализ 
701 1 |a Zarnitsyn  |b A. Yu.  |c specialist in the field of informatics and computer technology  |c Assistant of the Department of Tomsk Polytechnic University  |f 1990-  |g Aleksander Yurievich  |3 (RuTPU)RU\TPU\pers\46039 
701 1 |a Volkov  |b A. S.  |g Artem Sergeevich 
701 1 |a Voytsekhovskiy  |b A. A.  |g Aleksey Alekseevich 
701 1 |a Pyakullya  |b B. I.  |c specialist in the field of informatics and computer technology  |c design engineer of Tomsk Polytechnic University  |f 1990-  |g Boris Ivanovich  |3 (RuTPU)RU\TPU\pers\34170  |9 17704 
712 0 2 |a Национальный исследовательский Томский политехнический университет  |b Инженерная школа информационных технологий и робототехники  |b Отделение автоматизации и робототехники  |3 (RuTPU)RU\TPU\col\23553 
801 2 |a RU  |b 63413507  |c 20200214  |g RCR 
850 |a 63413507 
856 4 |u http://dx.doi.org/10.4108/eai.22-1-2019.156385 
942 |c CF