CN110633632A - Weak supervision combined target detection and semantic segmentation method based on loop guidance - Google Patents

Weak supervision combined target detection and semantic segmentation method based on loop guidance Download PDF

Info

Publication number
CN110633632A
CN110633632A CN201910723018.7A CN201910723018A CN110633632A CN 110633632 A CN110633632 A CN 110633632A CN 201910723018 A CN201910723018 A CN 201910723018A CN 110633632 A CN110633632 A CN 110633632A
Authority
CN
China
Prior art keywords
semantic segmentation
target detection
segmentation
neural network
image
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201910723018.7A
Other languages
Chinese (zh)
Inventor
纪荣嵘
沈云航
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Xiamen University
Original Assignee
Xiamen University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Xiamen University filed Critical Xiamen University
Priority to CN201910723018.7A priority Critical patent/CN110633632A/en
Publication of CN110633632A publication Critical patent/CN110633632A/en
Pending legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/21Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
    • G06F18/214Generating training patterns; Bootstrap methods, e.g. bagging or boosting
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/24Classification techniques
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/045Combinations of networks
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods
    • G06N3/084Backpropagation, e.g. using gradient descent
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00Arrangements for image or video recognition or understanding
    • G06V10/20Image preprocessing
    • G06V10/26Segmentation of patterns in the image field; Cutting or merging of image elements to establish the pattern region, e.g. clustering-based techniques; Detection of occlusion
    • G06V10/267Segmentation of patterns in the image field; Cutting or merging of image elements to establish the pattern region, e.g. clustering-based techniques; Detection of occlusion by performing operations on regions, e.g. growing, shrinking or watersheds
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V20/00Scenes; Scene-specific elements
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V2201/00Indexing scheme relating to image or video recognition or understanding
    • G06V2201/07Target detection

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Data Mining & Analysis (AREA)
  • Evolutionary Computation (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Artificial Intelligence (AREA)
  • General Engineering & Computer Science (AREA)
  • Computing Systems (AREA)
  • Software Systems (AREA)
  • Molecular Biology (AREA)
  • Computational Linguistics (AREA)
  • Biophysics (AREA)
  • Biomedical Technology (AREA)
  • Mathematical Physics (AREA)
  • General Health & Medical Sciences (AREA)
  • Health & Medical Sciences (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Evolutionary Biology (AREA)
  • Multimedia (AREA)
  • Image Analysis (AREA)

Abstract

A weak supervision joint target detection and semantic segmentation method based on loop guidance belongs to the technical field of computer vision. Initializing a convolutional neural network; the neural network forwards propagates to obtain a characteristic map of the image; the target detection branch is transmitted forward to obtain a target positioning diagram; forward propagation of semantic segmentation branches to obtain segmentation masks; obtaining a pseudo-true semantic segmentation label through a target positioning diagram; obtaining the weight value of the image candidate region by dividing the mask; calculating the loss of the semantic segmentation branch; calculating the loss of the target detection branch; updating parameters by using a random gradient descent algorithm; repeating the steps until convergence; inputting an image into a neural network to obtain a target detection and semantic segmentation result; initializing a convolutional neural network; the neural network is propagated forward to obtain an image characteristic map; the target detection branch is transmitted forward to obtain a target detection result; forward propagating the semantic segmentation branch to obtain a semantic segmentation mask; and obtaining an example segmentation mask through the target detection result and the semantic segmentation mask.

Description

Weak supervision combined target detection and semantic segmentation method based on loop guidance
Technical Field
The invention belongs to the technical field of computer vision, and particularly relates to a weak supervision joint target detection and semantic segmentation method based on loop guidance.
Background
Target detection and semantic segmentation are basic problems of machine vision, and are widely applied to scenes such as video monitoring, unmanned driving and the like, for example, the remote sensing field, and after a remote sensing image is input, the position of a building or a person in the remote sensing image can be automatically detected, so that the place is determined; can also be applied in the medical field, and various lesions can be analyzed according to medical X-ray images or microscopic images; in the military field, object detection may be used to locate the position of an enemy. Machine learning has enjoyed great success in tasks such as target detection and semantic segmentation, and is particularly based on strong supervised learning tasks such as classification and regression. Predictive models are learned from a training data set containing a large number of training samples, each corresponding to an event or object. The training sample consists of two parts: a feature vector (or example) describing the event/object, and a label representing the true value output. In the classification task, the labels represent the classes to which the training samples belong; in the regression task, the label is a real number value corresponding to the sample.
With the rise of deep learning, a large number of excellent object detection and semantic segmentation models have emerged in recent years. With the continuous development of data-driven methods in image recognition, people have a greater and greater interest in enlarging the scale of target detection and semantic segmentation systems. However, the current target detection and semantic segmentation have two disadvantages: first, most successful techniques require a large training data set containing truth labels. However, in many scenarios, it is difficult to obtain strong surveillance information due to the extremely high cost of the data annotation process. Therefore, training a high-accuracy detection and segmentation model requires a large amount of finely labeled picture data in the form of bounding boxes and pixels as a model supervision condition, and a large amount of manpower and material resources are required. Second, unlike the classification task, the method of fully labeling an object instance with categories, bounding boxes, and pixels is almost inextensible. Therefore, people increase the exploration strength of unsupervised and weakly supervised target detection and semantic segmentation methods, but the performance of the existing completely unsupervised and unmarked methods in the tasks of target detection and semantic segmentation is poor, and the conventional weakly supervised method cannot be well generalized to the image processing of complex scenes.
The weak supervision problem is that in order to realize a certain computer vision task, a manual label weaker than the task is adopted as supervision information. In general, weakly supervised annotations are easier to obtain than the original annotations. For example, for an object detection task, a label of an image-level (image-level) is a kind of weakly supervised label compared with a bounding box (bounding box) of an object; for semantic segmentation task, image-level (image-level) labels and bounding boxes (bounding boxes) of objects are a kind of weakly supervised labels compared with pixel-level (pixel-level) labels.
For target detection and semantic segmentation, related research work has been a research hotspot of computer vision. The current weak supervision target detection and semantic segmentation still have challenges, and overall, the challenges are mainly reflected in the following two aspects: robustness and computational complexity.
Robustness of target detection and semantic segmentation is mainly affected by intra-class apparent difference and inter-class apparent difference, and large intra-class apparent difference and small inter-class apparent difference generally cause the robustness of the target detection method to be reduced. Intra-class apparent differences refer to variations between different individuals of the same class, e.g., different individuals of a horse differ in color, texture, shape, pose, etc. Due to the influence of illumination, background, posture, change of viewpoint and shielding, even the same horse looks very different in different images, so that the construction of an appearance model with generalization capability is extremely difficult.
The computational complexity of target detection and semantic segmentation mainly derives from the number of target classes to be detected, the dimensionality of class appearance descriptors and the acquisition of a large amount of labeled data. The number of object categories in the real world is hundreds and thousands, the apparent descriptors are high-dimensional, and the acquisition of a large amount of sufficient labeled data is extremely time-consuming and labor-consuming, so that the computer complexity of target detection and semantic segmentation is high, and the design of an efficient target detection and semantic segmentation algorithm is very important. The current part of the work proposes a new feature matching method and a positioning strategy. Another category of computational complexity research is directed towards how to reduce the Search space in object detection and semantic segmentation, and such methods are collectively referred to as Selective Search strategies (Selective Search) or object Estimation (Estimation). The core idea of the method is that not every sub-region in an image contains objects which are independent of the category, and only a few candidate windows are meaningful candidate regions in target detection and semantic segmentation.
Disclosure of Invention
The invention aims to provide a weak supervision combined target detection and semantic segmentation method based on loop guidance.
The invention comprises the following steps:
the model training process comprises the following steps:
1) initializing a convolutional neural network;
2) the neural network forwards propagates to obtain a characteristic map of the image;
3) forward propagation of the target detection branch and obtaining a target positioning diagram;
4) forward propagating the semantic segmentation branches and obtaining segmentation masks;
5) obtaining a false-true semantic segmentation label through a target positioning graph, and taking the false-true semantic segmentation label as supervision information to train semantic segmentation;
6) obtaining the weight of the image candidate region through the segmentation mask, and correcting the candidate region as the prior of positioning;
7) calculating a loss of semantic segmentation branches based on the false-true semantic segmentation labels;
8) calculating the loss of the target detection branch by combining the weight of the candidate region;
9) updating parameters by using a random gradient descent algorithm;
10) repeating the steps 2) to 9) until convergence;
11) inputting an image into a neural network to obtain a target detection and semantic segmentation result;
in step 5) and step 6), the present invention proposes to use a loop-guided mechanism to assist the learning of both branches with each other. And obtaining a false-true semantic segmentation label by using a target positioning diagram detected by a weak supervision target, training semantic segmentation as supervision information, and obtaining a weight of an image candidate region by using a segmentation mask predicted by weak supervision semantic segmentation as a priori for positioning to correct the candidate region.
In step 7), the loss function of the semantic segmentation branch is:
Figure BDA0002157903850000031
in step 8), the loss function of the target detection branch is:
Figure BDA0002157903850000032
(II) model reasoning process:
12) initializing a convolutional neural network;
13) the neural network forwards propagates to obtain a characteristic map of the image;
14) the target detection branch is transmitted forward and a target detection result is obtained;
15) forward propagating the semantic segmentation branches and obtaining a semantic segmentation mask;
16) and obtaining an example segmentation mask through the target detection result and the semantic segmentation mask.
From the weak supervision angle, the method learns the target detection and semantic segmentation by using the image-level weakly labeled picture (only knowing whether the picture contains the target object). The invention relates to a novel cycle guidance-based weak supervision combined target detection and semantic segmentation method. The current weak supervision target detection and weak supervision semantic segmentation algorithms are usually separated and have poor performance. The invention provides a mechanism of multi-task learning to combine weak supervision target detection and semantic segmentation, and provides a learning mechanism of cycle guidance to mutually assist the learning of two tasks. The invention uses a deep convolutional neural network to train three modules simultaneously: the system comprises a backbone neural network, a target detection branch and a semantic segmentation branch. The backbone neural network is used for extracting the characteristics of the whole image. And the target detection branch carries out classified prediction on each candidate region. The semantic division branch classifies each position to form a division mask.
The invention provides a weak supervision target detection and semantic segmentation method using multi-task learning joint training, and respective tasks are enhanced by utilizing complementary information of target detection and semantic segmentation. The target positioning map of the weak supervised target detection can provide false and real semantic segmentation labels for weak supervised semantic segmentation, and the prediction mask of the weak supervised semantic segmentation can evaluate the weight value for the candidate region of the weak supervised detection. The invention introduces a cyclic learning guiding strategy on the existing weak supervision model, and simultaneously learns two models of weak supervision target detection and weak supervision semantic segmentation. The invention improves the weakly supervised target detector and the weakly supervised semantic segmentation model, and is more accurate than the original model. A large number of experimental results show that the method provided by the invention achieves excellent weak supervision target detection and weak supervision semantic segmentation performances.
Drawings
FIG. 1 is a method of cycle directed learning of the present invention.
Fig. 2 is an object location diagram for weakly supervised target detection.
Fig. 3 is a structural frame of the present invention.
FIG. 4 is complementary information of weakly supervised target detection and weakly supervised semantic segmentation.
Detailed Description
The invention provides a weak supervision combined target detection and semantic segmentation method based on loop guidance, and the following embodiments are combined with the accompanying drawings to explain the invention in detail:
the symbols primarily used in the present invention are first defined. Here, I e R is usedH×W×3Representing an input image in RGB format, t ∈ {0, 1}cTags representing corresponding image planes, { p }1 … pRDenotes candidate regions (propofol) of the image, R denotes the number of candidate regions, c denotes the number of categories as a whole, and H and W denote the height and width of the input image, respectively.
As shown in FIG. 1, the present invention uses a loop-guided strategy to train both weakly supervised target detection and weakly supervised semantic segmentation models. Firstly, the target detector predicts the category and position of the object; then the result of target detection can be converted into a target positioning diagram; training a semantic separator by taking the target positioning graph as a false-true semantic separation label; then a semantic separator predicts the segmentation mask of the image; and finally, calculating the weight of the candidate region by dividing the mask to correct the training of the target detector. As shown in fig. 2, the first column represents an input image, the second column represents a segmentation map based on CAM (b.zhou, a.khosla, a.lapedria, a.oliva, and a.torralba, "Learning Deep Features for segmentation," in CVPR, 2016 "), the third column represents an object localization map of the present invention, and the fourth column represents a corrected object localization map. First, it can be seen that the object localization graph can provide higher quality pseudo-real semantic segmentation labels than the CAM-based segmentation graph. Second, it can be seen that weakly supervised semantic segmentation often fails to predict consistent object contours. This is also the case for many semantic segmentation methods that require modification of the segmentation mask by means of CRF (p. krahenbuhl and v. koltun, "Efficient induction in full Connected CRFs with Gaussian Edge Potentials," in neuroips, 2011.). Finally, it can be seen that while weakly supervised target detection can usually predict the correct object contour, weakly supervised target detection often cannot distinguish the number of objects, and the predicted result is often only a part of the objects. The failure modes of the weakly supervised target detection and the weakly supervised semantic segmentation are found to be complementary through experiments. In one aspect, a predictive segmentation mask of weakly supervised semantic segmentation may help weakly supervised target detection escape from local minima. On the other hand, the target positioning diagram of the weak supervision target detection can provide high-quality pseudo-real semantic segmentation labels.
As shown in fig. 3, the present invention uses a network such as VGGNet (simony, Karen, and Andrew zisserman. "Very Deep conditional Networks for Large-scale Image Recognition," arxiv.2014.) as a basic model backend structure. Generally, the deeper the depth of the model's back end, the more expressive the model is. The model of the present invention has two branches. The first branch is a weakly supervised target detection branch and the second branch is a weakly supervised semantic segmentation branch.
Weakly supervised target detection branching. The Weakly Supervised target Detection branch uses the WSDDN (h.bilen and a.vedaldi, "weak Supervised Deep Detection Networks," in CVPR, 2016.) model as the basic model. Firstly, inputting an image, obtaining a feature map of the image, and then obtaining R candidate regions { p } through an SPP layer (K.He, X.Zhang, S.Ren, and J.Sun, "Spatial Pyramid position in Deep conditional Networks for visual recognition," in ECCV, 2014.)1 … pRCharacteristics of. The features of the candidate region are then passed through two tributaries: classifying the branch flow and detecting the branch flow. The two branches respectively use the full connection layer to output two scoring matrixes
Figure BDA0002157903850000051
The two scoring matrices are normalized in category and candidate region dimensions with the sofimax layer σ (-) respectively.
Figure BDA0002157903850000052
Figure BDA0002157903850000053
And performing dot product on the normalized score matrix:
xs=σ(xc)·σ(xd) (3)
to obtain a prediction at the image level, an accumulation pooling is used:
and finally, obtaining cross entropy loss:
Figure BDA0002157903850000055
wherein, tkRepresenting the true label of the kth category.
And (5) weakly supervising semantic segmentation branches. The weakly supervised Semantic segmentation branch is based on the DeepLab-ASPP (L. -C.Chen, G.Papandrou, I.Kokkinos, K.Murphy, and A.L.Yuille, "DeepLab: Semantic image segmentation with Deep computational networks, atom fusion, and Fullyconnected CRFs," TPAMI.2017.) model. The invention uses the target location map generated by the weak supervision target detection as the supervision information of the weak supervision semantic segmentation branch. Most weakly supervised semantic partitions use a full convolution network, a softmax normalization layer, and a polynomial cross entropy loss function. The invention uses a sigmoid normalization layer and a binary cross entropy loss function:
Figure BDA0002157903850000061
m and S are respectively an object positioning diagram detected by the weak supervision target and a segmentation mask of weak supervision semantic segmentation prediction.
Figure BDA0002157903850000062
Andrepresenting the height and width of the dividing mask, typically H and W, respectively
Figure BDA0002157903850000064
And (5) circularly guiding learning. Theoretically, the error patterns of weakly supervised target detection and weakly supervised semantic segmentation are complementary. In one aspect, weakly supervised target detection is typically formulated as a multi-instance classification. It can explicitly raise the background image to penalize the FalsePositive, so the weakly supervised target detection has a lower FalsePositive rate. However, to prevent self-emphasis from falling into local minima, weakly supervised target detection usually penalizes only high confidence false alarm. Therefore, weakly supervised target detection usually has ambiguous feature patterns in non-salient regions. In another aspect, the loss of weakly supervised semantic segmentation is at the pixel level. The lack of explicit penalty for FalsePositive results in noisy background prediction. However fine-grained prediction of ambiguous areas at weakly supervised target detection can be used to aid target localization. The present invention therefore proposes a loop-directed learning strategy to assist the respective task with complementary information.
And guiding semantic segmentation by target detection. Object localization maps for weakly supervised target detection are used to help train weakly supervised semantic segmentation. Weak surveillance targets are used to detect built-in foreground and background cues. It is specified that no additional parameters are required. In particular, the gradient of the classification score is propagated back to the first layer of the network, resulting in a coarse object localization map, as shown in the second row of FIG. 4. On the coarse object localization map, it is first normalized to between (0, 1). Then, a position with a value higher than 0.1 is set as a foreground region, and a position with a value lower than 0.005 is set as a background region. Finally, the remaining region is set as an uncertain region. And finally, taking the obtained corrected object positioning diagram as a false-true semantic segmentation label, as shown in the third line of fig. 4.
And (4) guidance of target detection by semantic segmentation. The candidate region is corrected using the segmentation mask of the weakly supervised semantic segmentation prediction as a priori of the localization. By dividing the mask map SkThe position and shape of the object can be roughly estimated. The density of each candidate region is then calculated:
Figure BDA0002157903850000065
wherein the content of the first and second substances,denotes SkThe i-th row and the j-th column of the element, gamma is 0.1, max MkRepresents MkMaximum value of (2). The density of the context region of the obtained candidate region is also calculated
Figure BDA0002157903850000067
Calculating a response value for each candidate region:
Figure BDA0002157903850000071
finally obtaining weighted candidate region scores
Xr=σ(Xc)·σ(Xd)·Wr (9)
Wherein, XrAnd representing the score matrix after the candidate area is corrected. The prediction score of the modified image plane may be calculated:
Figure BDA0002157903850000072
and finally, obtaining a corrected cross entropy loss function:
Figure BDA0002157903850000073
in the inference process, the detection result of the candidate area is calculated by using the target detection branch, and then the detection result is suppressed and filtered by using the non-maximum value. Meanwhile, the semantic segmentation branch outputs a semantic segmentation mask of the whole image. Finally, extracting the mask surrounding the box can obtain the result of example segmentation.
The invention relates to a novel weak supervision combined target detection and semantic segmentation method based on loop guidance. It is well known that the current weakly supervised target detection and weakly supervised semantic segmentation algorithms are usually separate and have poor performance. The invention provides a mechanism of multi-task learning to combine weak supervision target detection and semantic segmentation, and provides a learning mechanism of cycle guidance to mutually assist the learning of two tasks. The invention uses a deep convolutional neural network to train three modules simultaneously: the system comprises a backbone neural network, a target detection branch and a semantic segmentation branch. The backbone neural network is used for extracting the characteristics of the whole image. And the target detection branch carries out classified prediction on each candidate region. The semantic division branch classifies each position to form a division mask.
The invention utilizes the information complementary with the target detection and the semantic segmentation to enhance the respective tasks. The target positioning map of the weak supervised target detection can provide false and real semantic segmentation labels for weak supervised semantic segmentation, and the prediction mask of the weak supervised semantic segmentation can evaluate the weight value for the candidate region of the weak supervised detection. In conclusion, the invention introduces a cyclic learning guiding strategy on the existing weak supervision model, and simultaneously learns two models of weak supervision target detection and weak supervision semantic segmentation. The final effect is: the invention improves the weak supervision target detector and the model of weak supervision semantic segmentation, and is more accurate than the original model. A large number of experimental results show that the method provided by the invention achieves excellent weak supervision target detection and weak supervision semantic segmentation performances.

Claims (3)

1. A weak supervision combined target detection and semantic segmentation method based on loop guidance is characterized by comprising the following steps:
the model training process comprises the following steps:
1) initializing a convolutional neural network;
2) the neural network forwards propagates to obtain a characteristic map of the image;
3) forward propagation of the target detection branch and obtaining a target positioning diagram;
4) forward propagating the semantic segmentation branches and obtaining segmentation masks;
5) obtaining a false-true semantic segmentation label through a target positioning graph, and taking the false-true semantic segmentation label as supervision information to train semantic segmentation;
6) obtaining the weight of the image candidate region through the segmentation mask, and correcting the candidate region as the prior of positioning;
7) calculating a loss of semantic segmentation branches based on the false-true semantic segmentation labels;
8) calculating the loss of the target detection branch by combining the weight of the candidate region;
9) updating parameters by using a random gradient descent algorithm;
10) repeating the steps 2) to 9) until convergence;
11) inputting an image into a neural network to obtain a target detection and semantic segmentation result;
(II) model reasoning process:
12) initializing a convolutional neural network;
13) the neural network forwards propagates to obtain a characteristic map of the image;
14) the target detection branch is transmitted forward and a target detection result is obtained;
15) forward propagating the semantic segmentation branches and obtaining a semantic segmentation mask;
16) and obtaining an example segmentation mask through the target detection result and the semantic segmentation mask.
2. The method for weakly supervised joint object detection and semantic segmentation based on loop guidance as claimed in claim 1, wherein in step 7), the loss function of the semantic segmentation branch is:
Figure FDA0002157903840000011
m and S respectively represent an object positioning diagram detected by a weak supervision target and a segmentation mask predicted by weak supervision semantic segmentation;
Figure FDA0002157903840000014
and
Figure FDA0002157903840000015
representing the height and width of the dividing mask, typically H and W, respectively
Figure FDA0002157903840000012
3. The method for weakly supervised joint target detection and semantic segmentation based on loop guidance as claimed in claim 1, wherein in step 8), the loss function of the target detection branch is:
Figure FDA0002157903840000013
CN201910723018.7A 2019-08-06 2019-08-06 Weak supervision combined target detection and semantic segmentation method based on loop guidance Pending CN110633632A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201910723018.7A CN110633632A (en) 2019-08-06 2019-08-06 Weak supervision combined target detection and semantic segmentation method based on loop guidance

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201910723018.7A CN110633632A (en) 2019-08-06 2019-08-06 Weak supervision combined target detection and semantic segmentation method based on loop guidance

Publications (1)

Publication Number Publication Date
CN110633632A true CN110633632A (en) 2019-12-31

Family

ID=68969289

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201910723018.7A Pending CN110633632A (en) 2019-08-06 2019-08-06 Weak supervision combined target detection and semantic segmentation method based on loop guidance

Country Status (1)

Country Link
CN (1) CN110633632A (en)

Cited By (14)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111242059A (en) * 2020-01-16 2020-06-05 合肥工业大学 Method for generating unsupervised image description model based on recursive memory network
CN111415373A (en) * 2020-03-20 2020-07-14 北京以萨技术股份有限公司 Target tracking and segmenting method, system and medium based on twin convolutional network
CN111523585A (en) * 2020-04-16 2020-08-11 厦门大学 Weak supervision target detection method based on improved depth residual error network
CN111680739A (en) * 2020-06-04 2020-09-18 通号通信信息集团有限公司 Multi-task parallel method and system for target detection and semantic segmentation
CN111898439A (en) * 2020-06-29 2020-11-06 西安交通大学 Deep learning-based traffic scene joint target detection and semantic segmentation method
CN112085739A (en) * 2020-08-20 2020-12-15 深圳力维智联技术有限公司 Semantic segmentation model training method, device and equipment based on weak supervision
CN112465801A (en) * 2020-12-09 2021-03-09 北京航空航天大学 Instance segmentation method for extracting mask features in scale division mode
CN112633086A (en) * 2020-12-09 2021-04-09 西安电子科技大学 Near-infrared pedestrian monitoring method, system, medium and equipment based on multitask EfficientDet
CN113139471A (en) * 2021-04-25 2021-07-20 上海商汤智能科技有限公司 Target detection method and device, electronic equipment and storage medium
CN113177947A (en) * 2021-04-06 2021-07-27 广东省科学院智能制造研究所 Complex environment target segmentation method and device based on multi-module convolutional neural network
CN113283438A (en) * 2021-03-25 2021-08-20 北京工业大学 Weak surveillance video target segmentation method based on multi-source significance and space-time sample adaptation
CN113408499A (en) * 2021-08-19 2021-09-17 天津所托瑞安汽车科技有限公司 Joint evaluation method and device of dual-network model and storage medium
CN113505781A (en) * 2021-06-01 2021-10-15 北京旷视科技有限公司 Target detection method and device, electronic equipment and readable storage medium
US20220405506A1 (en) * 2021-06-22 2022-12-22 Intrinsic Innovation Llc Systems and methods for a vision guided end effector

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109118519A (en) * 2018-07-26 2019-01-01 北京纵目安驰智能科技有限公司 Target Re-ID method, system, terminal and the storage medium of Case-based Reasoning segmentation
CN109255790A (en) * 2018-07-27 2019-01-22 北京工业大学 A kind of automatic image marking method of Weakly supervised semantic segmentation

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109118519A (en) * 2018-07-26 2019-01-01 北京纵目安驰智能科技有限公司 Target Re-ID method, system, terminal and the storage medium of Case-based Reasoning segmentation
CN109255790A (en) * 2018-07-27 2019-01-22 北京工业大学 A kind of automatic image marking method of Weakly supervised semantic segmentation

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
Y. SHEN 等: "Weakly Supervised Object Detection via Object-Specific Pixel Gradient", 《 IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS》 *
YUNHANG SHEN 等: "Cyclic Guidance for Weakly Supervised Joint Detection and Segmentation", 《2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR)》 *

Cited By (22)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111242059B (en) * 2020-01-16 2022-03-15 合肥工业大学 Method for generating unsupervised image description model based on recursive memory network
CN111242059A (en) * 2020-01-16 2020-06-05 合肥工业大学 Method for generating unsupervised image description model based on recursive memory network
CN111415373A (en) * 2020-03-20 2020-07-14 北京以萨技术股份有限公司 Target tracking and segmenting method, system and medium based on twin convolutional network
CN111523585A (en) * 2020-04-16 2020-08-11 厦门大学 Weak supervision target detection method based on improved depth residual error network
CN111523585B (en) * 2020-04-16 2022-05-31 厦门大学 Weak supervision target detection method based on improved depth residual error network
CN111680739A (en) * 2020-06-04 2020-09-18 通号通信信息集团有限公司 Multi-task parallel method and system for target detection and semantic segmentation
CN111680739B (en) * 2020-06-04 2024-03-22 通号通信信息集团有限公司 Multi-task parallel method and system for target detection and semantic segmentation
CN111898439A (en) * 2020-06-29 2020-11-06 西安交通大学 Deep learning-based traffic scene joint target detection and semantic segmentation method
CN112085739A (en) * 2020-08-20 2020-12-15 深圳力维智联技术有限公司 Semantic segmentation model training method, device and equipment based on weak supervision
CN112085739B (en) * 2020-08-20 2024-05-24 深圳力维智联技术有限公司 Training method, device and equipment of semantic segmentation model based on weak supervision
CN112465801B (en) * 2020-12-09 2022-11-29 北京航空航天大学 Instance segmentation method for extracting mask features in scale division mode
CN112633086B (en) * 2020-12-09 2024-01-26 西安电子科技大学 Near-infrared pedestrian monitoring method, system, medium and equipment based on multitasking EfficientDet
CN112633086A (en) * 2020-12-09 2021-04-09 西安电子科技大学 Near-infrared pedestrian monitoring method, system, medium and equipment based on multitask EfficientDet
CN112465801A (en) * 2020-12-09 2021-03-09 北京航空航天大学 Instance segmentation method for extracting mask features in scale division mode
CN113283438A (en) * 2021-03-25 2021-08-20 北京工业大学 Weak surveillance video target segmentation method based on multi-source significance and space-time sample adaptation
CN113283438B (en) * 2021-03-25 2024-03-29 北京工业大学 Weak supervision video target segmentation method based on multisource saliency and space-time list adaptation
CN113177947A (en) * 2021-04-06 2021-07-27 广东省科学院智能制造研究所 Complex environment target segmentation method and device based on multi-module convolutional neural network
CN113177947B (en) * 2021-04-06 2024-04-26 广东省科学院智能制造研究所 Multi-module convolutional neural network-based complex environment target segmentation method and device
CN113139471A (en) * 2021-04-25 2021-07-20 上海商汤智能科技有限公司 Target detection method and device, electronic equipment and storage medium
CN113505781A (en) * 2021-06-01 2021-10-15 北京旷视科技有限公司 Target detection method and device, electronic equipment and readable storage medium
US20220405506A1 (en) * 2021-06-22 2022-12-22 Intrinsic Innovation Llc Systems and methods for a vision guided end effector
CN113408499A (en) * 2021-08-19 2021-09-17 天津所托瑞安汽车科技有限公司 Joint evaluation method and device of dual-network model and storage medium

Similar Documents

Publication Publication Date Title
CN110633632A (en) Weak supervision combined target detection and semantic segmentation method based on loop guidance
CN110097568B (en) Video object detection and segmentation method based on space-time dual-branch network
CN108960184B (en) Pedestrian re-identification method based on heterogeneous component deep neural network
CN110929593B (en) Real-time significance pedestrian detection method based on detail discrimination
CN110033007B (en) Pedestrian clothing attribute identification method based on depth attitude estimation and multi-feature fusion
Long et al. Object detection in aerial images using feature fusion deep networks
CN110765906A (en) Pedestrian detection algorithm based on key points
Waheed et al. Deep learning algorithms-based object detection and localization revisited
CN109447082B (en) Scene moving object segmentation method, system, storage medium and equipment
CN108875754B (en) Vehicle re-identification method based on multi-depth feature fusion network
CN112861917A (en) Weak supervision target detection method based on image attribute learning
CN114821014A (en) Multi-mode and counterstudy-based multi-task target detection and identification method and device
Valappil et al. CNN-SVM based vehicle detection for UAV platform
Khatri et al. Detection of animals in thermal imagery for surveillance using GAN and object detection framework
Aledhari et al. Multimodal machine learning for pedestrian detection
CN114913409A (en) Camouflage target identification method for marine organisms
Karunakaran Deep learning based object detection using mask RCNN
CN113129336A (en) End-to-end multi-vehicle tracking method, system and computer readable medium
CN114359493B (en) Method and system for generating three-dimensional semantic map for unmanned ship
CN113128441B (en) System and method for identifying vehicle weight by embedding structure of attribute and state guidance
CN115546668A (en) Marine organism detection method and device and unmanned aerial vehicle
Filipovych et al. Learning human motion models from unsegmented videos
CN114170625A (en) Context-aware and noise-robust pedestrian searching method
Nguyen et al. Real-time human detection under omni-dir ectional camera based on cnn with unified detection and agmm for visual surveillance
Kamaleswari et al. An Assessment of Object Detection in Thermal (Infrared) Image Processing

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication
RJ01 Rejection of invention patent application after publication

Application publication date: 20191231