CN110782479B - Visual target tracking method based on Gaussian center alignment - Google Patents

Visual target tracking method based on Gaussian center alignment Download PDF

Info

Publication number
CN110782479B
CN110782479B CN201910947357.3A CN201910947357A CN110782479B CN 110782479 B CN110782479 B CN 110782479B CN 201910947357 A CN201910947357 A CN 201910947357A CN 110782479 B CN110782479 B CN 110782479B
Authority
CN
China
Prior art keywords
target
center
gaussian
pooling layer
center alignment
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201910947357.3A
Other languages
Chinese (zh)
Other versions
CN110782479A (en
Inventor
胡钦涛
毛耀
周丽君
周国忠
何秋农
周翕
李志俊
张超
乔琦
聂康
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Institute of Optics and Electronics of CAS
Original Assignee
Institute of Optics and Electronics of CAS
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Institute of Optics and Electronics of CAS filed Critical Institute of Optics and Electronics of CAS
Priority to CN201910947357.3A priority Critical patent/CN110782479B/en
Publication of CN110782479A publication Critical patent/CN110782479A/en
Application granted granted Critical
Publication of CN110782479B publication Critical patent/CN110782479B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T7/00Image analysis
    • G06T7/20Analysis of motion
    • G06T7/246Analysis of motion using feature-based methods, e.g. the tracking of corners or segments
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00Indexing scheme for image analysis or image enhancement
    • G06T2207/10Image acquisition modality
    • G06T2207/10016Video; Image sequence
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00Indexing scheme for image analysis or image enhancement
    • G06T2207/20Special algorithmic details
    • G06T2207/20081Training; Learning
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00Indexing scheme for image analysis or image enhancement
    • G06T2207/20Special algorithmic details
    • G06T2207/20084Artificial neural networks [ANN]

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • Image Analysis (AREA)

Abstract

The invention provides a visual target tracking method based on Gaussian center alignment, and aims to solve the problem that the target characteristics of the existing target tracking technology cannot be aligned with the modeled Gaussian center. The invention comprises the following steps: step 1, selecting a target initial frame; step 2, extracting target characteristics by using a network; step 3, the target features pass through a center alignment pooling layer; step 4, aligning the result passing through the center alignment pooling layer with the modeled Gaussian center to obtain a minimum loss error; and 5, obtaining a target tracking result. The invention has the beneficial technical effects that: the tracking precision of the Gaussian model can be greatly improved.

Description

Visual target tracking method based on Gaussian center alignment
Technical Field
The invention relates to the technical field of image processing, in particular to a visual target tracking method based on Gaussian center alignment.
Background
The target tracking is widely applied to the fields of automatic driving, aerial photography, video monitoring and the like, most of the traditional image algorithms are characteristic engineering, such as methods of color histograms, HOG characteristics and the like, or methods of particle filtering, Kalman filters and the like, but with the development of the technology, the performance can not meet the current use requirements due to the practical and complex application environment, background similar interference, the change of illumination conditions, shielding and other external factors, target posture change, appearance deformation, scale change, out-of-plane rotation, in-plane rotation, out-of-view, rapid movement, movement blur and the like; with the development of correlation filtering and deep learning, a batch of correlation filtering and deep learning tracking algorithms also appear in succession, from correlation filtering methods proposed by MOSSE algorithm, KCF algorithm, DCF and the like to the deep learning target tracking algorithm such as SiamFC and the like which are developed at a high speed at present, most of the correlation filtering and deep learning tracking algorithms model a target into a gaussian target, however, the target features extracted by the deep learning network or the correlation filter may have the problem of misalignment with the tracked gaussian center (for example, the geometric center point of a human body is near the belly, but the features may be at the head), and meanwhile, it is necessary to develop a tracking algorithm with higher precision and without affecting real-time property.
Disclosure of Invention
The invention aims to provide a visual target tracking method based on Gaussian center alignment. In order to improve the tracking accuracy of the existing tracking algorithm without influencing the real-time performance, the problem that the tracking target feature in the background is not aligned with the Gaussian center of the target to be learned is solved.
In order to solve the technical problems, the invention adopts the following technical scheme:
the visual target tracking method based on Gaussian center alignment comprises the following steps:
step 1, selecting a target initial frame;
step 2, extracting target characteristics by using a network or a related filter;
step 3, passing the target feature through a center alignment pooling layer; the role of the center-aligned pooling layer is to align the network or filter learned features with the modeled gaussian center;
step 4, aligning the result passing through the center alignment pooling layer with the modeled Gaussian center to obtain a minimum loss error; the loss function that is taken is as follows,
Figure RE-GDA0002282158960000021
wherein, f (x)j(ii) a w) is the target feature, yjIs the generated Gaussian model, PRP is the center-aligned pooling layer, γjIs the corresponding feature map weight, defined by, for the PRP center pooling layer:
Figure RE-GDA0002282158960000022
for the target feature
Figure RE-GDA0002282158960000023
Each element is equal to the sum of its row maximum and column maximum, xpnIs a row value, xnqIs the column value.
And 5, obtaining a target tracking result. And obtaining the position of the tracking target according to the obtained response of the target function.
Compared with the prior art, the invention has the following advantages:
1. the invention improves the tracking performance of the original method.
2. The invention is applicable to both the neural network and the related filtering method, and has wide applicability.
3. The invention corrects the Gaussian center by introducing the center alignment pooling layer, has small calculation amount and basically does not change the running speed of the original algorithm in use.
Drawings
FIG. 1 is an overall frame diagram of the present invention;
FIG. 2 is an example of a center-aligned pooling layer of the present invention;
fig. 3 is a comparative example of response gaussian functions with and without the present invention, wherein fig. 3(a) is the gaussian response function of the present invention and fig. 3(b) is the gaussian response function without the present invention.
Detailed Description
The following detailed description of the embodiments of the invention refers to the accompanying drawings. However, the following examples are only for illustrating the present invention in detail and do not limit the scope of the present invention in any way. The programs referred to or relied on in the following embodiments are all conventional programs or simple programs in the art, and those skilled in the art can make routine selection or adaptation according to specific application scenarios.
As shown in fig. 1, the present invention relates to a visual target tracking method based on gaussian center alignment, which comprises the following steps:
step 1, selecting a target initial frame: selecting a target to be tracked, and extracting the characteristics of the target to be tracked;
step 2, extracting target characteristics by using a network or a related filter, such as a ResNet network in the figure 1;
step 3, passing the target feature through a center alignment pooling layer; the role of the center-aligned pooling layer, such as the FIG. 2, is to align the network or filter learned features with the modeled Gaussian center;
step 4, aligning the result passing through the center alignment pooling layer with the modeled Gaussian center to obtain a minimum loss error; the loss function to be taken is as follows,
Figure RE-GDA0002282158960000031
wherein, f (x)j(ii) a w) is the target feature, yjIs the generated Gaussian model, the last Gaussian in FIG. 1, that is, the Gaussian in FIG. 3(b), PRP is the center-aligned pooling layer, γjIs the corresponding feature map weight, defined by, for the PRP center pooling layer:
Figure RE-GDA0002282158960000032
for the target feature
Figure RE-GDA0002282158960000033
Each element is equal to the sum of its row maximum and column maximum, xpnIs a row value, xnqIs the column value.
Fig. 3 is a comparison of gaussian plots generated using and without the present invention, where fig. 3(a) is a gaussian response plot of the present invention, which can be found to be smoother than fig. 3(b), and to have only one gaussian response.
And 5, obtaining a target tracking result. And obtaining the position of the tracking target according to the obtained response of the target function.
In order to verify the effectiveness of the invention, ATOM and Simmask algorithms are selected as comparative examples, a standard VOT2018 data set is used for comparing results, and the precision A and robustness R of the tracking method and the expected average coverage EAO are compared.
Table one: comparison of results on data sets for examples and comparative examples
Method A R EAO
ATOM 0.590 0.204 0.401
Siammask 0.609 0.276 0.380
The invention 0.612 0.169 0.434
As can be seen from the table I, compared with the prior advanced ATOM and Simmask, the method of the invention improves the precision A (the higher the better), the robustness R (the lower the better) and the expected average coverage EAO (the higher the better).

Claims (1)

1. A visual target tracking method based on Gaussian center alignment is characterized by comprising the following steps:
step 1, selecting a target initial frame;
step 2, extracting target characteristics by using a network;
step 3, the target features pass through a center alignment pooling layer;
step 4, aligning the result passing through the center alignment pooling layer with the modeled Gaussian center to obtain a minimum loss error;
step 5, obtaining a target tracking result;
the step 3 of aligning the center of the pooling layer is to re-correct the target feature from the eccentricity of the object to the center of the object;
the loss error function adopted in step 4 is:
Figure FDA0003627001480000011
wherein, f (x)j(ii) a w) is the target feature, yjIs a generated Gaussian model, PRP is a center pairQuasi-pooling layer, γjIs the corresponding feature map weight;
for PRP central pooling layer, by the following definition:
Figure FDA0003627001480000012
for the target feature
Figure FDA0003627001480000013
Each element is equal to the sum of its row maximum and column maximum, xpnIs a row value, xnqIs the column value.
CN201910947357.3A 2019-10-08 2019-10-08 Visual target tracking method based on Gaussian center alignment Active CN110782479B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201910947357.3A CN110782479B (en) 2019-10-08 2019-10-08 Visual target tracking method based on Gaussian center alignment

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201910947357.3A CN110782479B (en) 2019-10-08 2019-10-08 Visual target tracking method based on Gaussian center alignment

Publications (2)

Publication Number Publication Date
CN110782479A CN110782479A (en) 2020-02-11
CN110782479B true CN110782479B (en) 2022-07-19

Family

ID=69384815

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201910947357.3A Active CN110782479B (en) 2019-10-08 2019-10-08 Visual target tracking method based on Gaussian center alignment

Country Status (1)

Country Link
CN (1) CN110782479B (en)

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR100753844B1 (en) * 2006-09-29 2007-08-31 한국전자통신연구원 Radar tracking device and method thereof
CN103268480A (en) * 2013-05-30 2013-08-28 重庆大学 System and method for visual tracking
CN107292918A (en) * 2016-10-31 2017-10-24 清华大学深圳研究生院 Tracking and device based on video on-line study
CN107424177A (en) * 2017-08-11 2017-12-01 哈尔滨工业大学(威海) Positioning amendment long-range track algorithm based on serial correlation wave filter
CN108665481A (en) * 2018-03-27 2018-10-16 西安电子科技大学 Multilayer depth characteristic fusion it is adaptive resist block infrared object tracking method
CN110276782A (en) * 2018-07-09 2019-09-24 西北工业大学 A kind of EO-1 hyperion method for tracking target of combination sky spectrum signature and correlation filtering

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8989442B2 (en) * 2013-04-12 2015-03-24 Toyota Motor Engineering & Manufacturing North America, Inc. Robust feature fusion for multi-view object tracking

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR100753844B1 (en) * 2006-09-29 2007-08-31 한국전자통신연구원 Radar tracking device and method thereof
CN103268480A (en) * 2013-05-30 2013-08-28 重庆大学 System and method for visual tracking
CN107292918A (en) * 2016-10-31 2017-10-24 清华大学深圳研究生院 Tracking and device based on video on-line study
CN107424177A (en) * 2017-08-11 2017-12-01 哈尔滨工业大学(威海) Positioning amendment long-range track algorithm based on serial correlation wave filter
CN108665481A (en) * 2018-03-27 2018-10-16 西安电子科技大学 Multilayer depth characteristic fusion it is adaptive resist block infrared object tracking method
CN110276782A (en) * 2018-07-09 2019-09-24 西北工业大学 A kind of EO-1 hyperion method for tracking target of combination sky spectrum signature and correlation filtering

Non-Patent Citations (5)

* Cited by examiner, † Cited by third party
Title
CenterNet: Keypoint Triplets for Object Detection;Kaiwen Duan etc.;《arXiv:1904.08189》;20190419;全文 *
Design and Implementation of Real-Time Object Tracking System Using the Gaussian Motion Model and the Otsu Algorithm;Meei-Ling Hung etc.;《2009 Second International Symposium on Knowledge Acquisition and Modeling》;20091228;全文 *
SPSTracker Sub-Peak Suppression of Response Map for Robust Object Tracking;Qintao Hu etc.;《https://doi.org/10.48550/arXiv.1912.00597》;20200303;全文 *
智能视频监控中运动目标检测与跟踪技术研究及实现;贾茜;《中国优秀硕士论文全文数据库》;20111215;全文 *
目标跟踪算法研究及实现;郑鹏程;《中国优秀硕士论文全文数据库》;20190115;全文 *

Also Published As

Publication number Publication date
CN110782479A (en) 2020-02-11

Similar Documents

Publication Publication Date Title
CN109816689B (en) Moving target tracking method based on adaptive fusion of multilayer convolution characteristics
CN110232389B (en) Stereoscopic vision navigation method based on invariance of green crop feature extraction
CN109035163B (en) Self-adaptive image denoising method based on deep learning
WO2021017361A1 (en) Template matching algorithm based on edge and gradient feature
WO2020134617A1 (en) Positioning method for matching buildings of repetitive structures on the basis of street view image
CN109285179A (en) A kind of motion target tracking method based on multi-feature fusion
CN110782477A (en) Moving target rapid detection method based on sequence image and computer vision system
CN110246154B (en) Visual target tracking method based on ICA-R multi-feature fusion and self-adaptive updating
CN102169581A (en) Feature vector-based fast and high-precision robustness matching method
CN108257155B (en) Extended target stable tracking point extraction method based on local and global coupling
CN110992263A (en) Image splicing method and system
CN102938147A (en) Low-altitude unmanned aerial vehicle vision positioning method based on rapid robust feature
CN110930411A (en) Human body segmentation method and system based on depth camera
CN110766723A (en) Unmanned aerial vehicle target tracking method and system based on color histogram similarity
CN105760879A (en) Fourier-Mellin transform-based image geometric matching method
CN109064402B (en) Single image super-resolution reconstruction method based on enhanced non-local total variation model prior
CN103955950A (en) Image tracking method utilizing key point feature matching
CN108710883B (en) Complete salient object detection method adopting contour detection
CN112419369A (en) Anti-occlusion real-time target tracking method
CN117764967A (en) Fitment quality detection method and system based on artificial intelligence
CN110782479B (en) Visual target tracking method based on Gaussian center alignment
CN107564013B (en) Scene segmentation correction method and system fusing local information
CN112132855B (en) Target tracking method of self-adaptive Gaussian function based on foreground segmentation guide
CN116958264A (en) Bolt hole positioning and pose estimation method based on three-dimensional vision
CN113538509B (en) Visual tracking method and device based on adaptive correlation filtering feature fusion learning

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant