CN113095229A - Unsupervised domain self-adaptive pedestrian re-identification system and method - Google Patents

Unsupervised domain self-adaptive pedestrian re-identification system and method Download PDF

Info

Publication number
CN113095229A
CN113095229A CN202110399589.7A CN202110399589A CN113095229A CN 113095229 A CN113095229 A CN 113095229A CN 202110399589 A CN202110399589 A CN 202110399589A CN 113095229 A CN113095229 A CN 113095229A
Authority
CN
China
Prior art keywords
pedestrian
pictures
picture
representing
loss function
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN202110399589.7A
Other languages
Chinese (zh)
Other versions
CN113095229B (en
Inventor
李佳函
李云龙
程德强
寇旗旗
张皓翔
韩成功
徐进洋
江曼
刘瑞航
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
China University of Mining and Technology CUMT
Original Assignee
Jiangsu Huatu Mining Technology Co ltd
China University of Mining and Technology CUMT
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Jiangsu Huatu Mining Technology Co ltd, China University of Mining and Technology CUMT filed Critical Jiangsu Huatu Mining Technology Co ltd
Priority to CN202110399589.7A priority Critical patent/CN113095229B/en
Publication of CN113095229A publication Critical patent/CN113095229A/en
Application granted granted Critical
Publication of CN113095229B publication Critical patent/CN113095229B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V40/00Recognition of biometric, human-related or animal-related patterns in image or video data
    • G06V40/10Human or animal bodies, e.g. vehicle occupants or pedestrians; Body parts, e.g. hands
    • G06V40/103Static body considered as a whole, e.g. static pedestrian or occupant recognition
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/21Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
    • G06F18/214Generating training patterns; Bootstrap methods, e.g. bagging or boosting
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/22Matching criteria, e.g. proximity measures
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/24Classification techniques
    • G06F18/241Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V20/00Scenes; Scene-specific elements
    • G06V20/40Scenes; Scene-specific elements in video content
    • G06V20/46Extracting features or characteristics from the video content, e.g. video fingerprints, representative shots or key frames

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Evolutionary Biology (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Artificial Intelligence (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Evolutionary Computation (AREA)
  • General Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Human Computer Interaction (AREA)
  • Image Analysis (AREA)

Abstract

The invention relates to an unsupervised domain self-adaptive pedestrian re-identification system and method, belongs to the technical field of pedestrian re-identification, and solves the problems of high difficulty and low identification accuracy rate of the existing unsupervised domain self-adaptive pedestrian re-identification. The system comprises a data acquisition module, a data acquisition module and a data processing module, wherein the data acquisition module is used for acquiring a plurality of source domain sample subsets and a plurality of target domain sample subsets; the network model training module is used for obtaining a classification loss function and a sample invariance loss function of the pedestrian re-recognition network model, sequencing and layering pedestrian pictures in the target domain sample subset according to the similarity between each pedestrian picture in the source domain sample subset and each pedestrian picture in the target domain sample subset to obtain a layering loss function, and further performing iterative optimization on the pedestrian re-recognition network model; and the re-identification module is used for re-identifying the pedestrians by utilizing the optimized pedestrian re-identification network model to obtain the images which are the same as or similar to the images of the pedestrians to be identified. The system can reduce the migration loss of the network and improve the accuracy of pedestrian re-identification.

Description

Unsupervised domain self-adaptive pedestrian re-identification system and method
Technical Field
The invention relates to the technical field of pedestrian re-identification, in particular to an unsupervised domain self-adaptive pedestrian re-identification system and method.
Background
In conventional unsupervised domain adaptive learning, most methods are used in a closed set scenario where sample classes are consistent in the source domain as well as in the target domain. However, the conventional unsupervised domain adaptive algorithm cannot be used for unsupervised domain adaptive pedestrian re-recognition, because due to image acquisition, the pedestrian categories of the source domain and the target domain are almost different, and a large migration loss is generated when a network trained by using source domain images migrates to the target domain for recognition. The large migration loss caused by network migration makes unsupervised domain adaptive pedestrian re-identification more challenging than most unsupervised learning.
In the prior art, unsupervised domain self-adaptive pedestrian re-identification mainly achieves the purpose of distinguishing label-free data sets through self-clustering of target domains. In the prior art, three invariance losses based on a target domain are proposed to reduce the migration loss of the network, but marked data are not fully utilized, and deep semantic information of a non-tag picture is not fully mined, so that the migration loss of the network cannot be efficiently reduced, and the accuracy of pedestrian re-identification needs to be improved.
In the prior art, at least the following defects exist, the source domain picture information and the target domain picture information cannot be fully utilized, and the network distinguishing learning cannot be carried out on negative samples with different similarities according to the similarities between the source domain picture and the target domain picture, so that the migration loss of a pedestrian re-identification network is large, and the accuracy of the result of the pedestrian re-identification is low.
Disclosure of Invention
In view of the foregoing analysis, embodiments of the present invention provide an unsupervised domain adaptive pedestrian re-identification system and method, so as to solve the problems of large migration loss of a pedestrian re-identification network and low accuracy of a result of pedestrian re-identification in the prior art.
In one aspect, the present invention provides an unsupervised domain adaptive pedestrian re-identification system, including:
a data acquisition module for acquiring a source domain sample set comprising a plurality of source domain sample subsets and a target domain sample set comprising a plurality of target domain sample subsets;
the network model training module is used for obtaining a classification loss function and a sample invariance loss function of the pedestrian re-recognition network model, sequencing and layering pedestrian pictures in the target domain sample subset according to the similarity between each pedestrian picture in the source domain sample subset and each pedestrian picture in the target domain sample subset, and obtaining a layering loss function based on layered pedestrian picture characteristics and corresponding layering weights; traversing each source domain sample subset and each target domain sample subset based on the classification loss function, the sample invariance loss function and the hierarchical loss function to perform iterative optimization on the pedestrian re-identification network model;
and the re-identification module is used for identifying the pedestrian image to be identified by utilizing the optimized pedestrian re-identification network model to obtain the image which is the same as or similar to the pedestrian image to be identified.
Further, the pedestrian re-identification network model comprises a residual error network structure, a full connection layer and a Softmax normalization layer which are connected in sequence and correspond to the classification loss function, a target domain memory and an L, wherein the target domain memory and the L are connected in sequence and correspond to the sample invariance loss function2The system comprises a normalization layer, a source domain memory and a similarity measurement axis network structure which are sequentially connected and correspond to a hierarchical loss function, wherein the residual error network structure is respectively connected with the full connection layer, the target domain memory, the source domain memory and the similarity measurement axis network structure.
Further, the network model training module obtains the hierarchical loss function specifically by:
respectively inputting the source domain sample subset and the target domain sample subset into a residual error network structure of a pedestrian re-identification network model to extract image features so as to obtain and respectively store the features of each pedestrian picture in the source domain sample subset and the features of each pedestrian picture in the target domain sample subset, and multiplying the features of each pedestrian picture in the source domain sample subset and the features of each pedestrian picture in the target domain sample subset to obtain corresponding similarity;
based on the similarity, sequencing the pedestrian pictures in the target domain sample subset in a descending order, sequentially selecting a first preset number of pictures as first-layer pictures, selecting a second preset number of pictures as second-layer pictures, taking the rest pictures as third-layer pictures, and respectively setting layer weights of the three layers of pictures;
obtaining a hierarchical loss function based on the characteristics of each pedestrian picture in the target domain sample subset and the level weight corresponding to the level to which the pedestrian picture belongs:
Figure BDA0003019905840000031
Figure BDA0003019905840000032
wherein L isSLRepresenting the hierarchical loss function, ntRepresenting the number of pedestrian pictures, w, in the target field sample subsett,rRepresenting the level weight, r representing the sequence number of the pedestrian pictures in the target domain sample subset according to the similarity, xt,iRepresenting target domain sample subset input pedestrian re-recognitionInputting a pedestrian picture with the sequence of i, f (x) in the network modelt,i) Picture x representing pedestriant,iP (m | x)t,i) Picture x representing pedestriant,iProbability of similarity to all classes of pedestrian pictures in all stored subsets of source domain samples, R1[m]Features, R, representing all classes of pedestrian pictures in all stored subsets of active domain samples1[j]Features representing class j pedestrian pictures in all stored subsets of active domain samples, NsAnd the stored category number of the pedestrian pictures in all the active domain sample subsets is represented, and beta is a temperature coefficient.
Further, the layer weights of the three layers of pictures are set as follows:
Figure BDA0003019905840000041
Figure BDA0003019905840000042
where b denotes the number of first layer pictures, k1Represents the sum of the numbers of the first layer picture and the second layer picture, and p (l | i) represents the pedestrian picture xt,iProbability of class i pedestrian pictures in all target domain sample subsets, R1[l]Features representing class i pedestrian pictures in all stored subsets of target domain samples, NtRepresenting the number of categories of pedestrian pictures in all the stored subsets of the target domain samples.
Further, the network model training module obtains a classification loss function specifically by the following means:
inputting the source domain sample subset into a residual error network structure of a pedestrian re-identification network model to extract image features so as to obtain the features of each pedestrian picture in the source domain sample subset;
sequentially inputting the characteristics of each pedestrian picture into a full connection layer and a softmax regression layer of a pedestrian re-identification network model, and performing characteristic dimension conversion and characteristic normalization;
classifying a loss function by adopting the following formula based on the characteristics of each pedestrian picture after dimension conversion and normalization;
Figure BDA0003019905840000043
Figure BDA0003019905840000044
wherein L issrcRepresenting the classification loss function, nsRepresenting the number of pedestrian pictures, x, in a subset of source domain sampless,mRepresents the m-th pedestrian picture in the source domain sample subset, f (x)s,m) Picture x representing pedestrians,mCharacteristic of (a), ys,mRepresenting a pedestrian picture x in a subset of source domain sampless,mClass label of p (y)s,m|xs,m) Picture x representing pedestrians,mBelong to the category ys,mProbability of (a), f (x)s,j) Representing features of class j pedestrian pictures in the source domain sample subset, NsRepresenting the number of categories of pedestrian pictures in all the stored subsets of the active domain samples.
Further, the network model training module obtains a sample invariance loss function specifically by the following means:
inputting the target domain sample subset into a residual error network structure of a pedestrian re-identification network model to extract image features so as to obtain the features of each pedestrian picture in the target domain sample subset, and performing L on the features of each pedestrian picture2Carrying out standardization treatment;
obtaining a sample invariance loss function according to the characteristics of each pedestrian picture after the normalization processing through the following formula:
Figure BDA0003019905840000051
Figure BDA0003019905840000052
wherein L isTRepresenting the sample invariance loss function, ntRepresenting the number, x, of pictures of a person in a sample subset of the target fieldt,iRepresenting the pedestrian picture with the input sequence i when the target domain sample subset is input into the pedestrian re-identification network model, f (x)t,i) Picture x representing pedestriant,iIs characterized by wi,lPicture x representing pedestriant,iWeight of pedestrian pictures belonging to class I, p (l | x)t,i) Picture x representing pedestriant,iProbability of a pedestrian belonging to class I picture, NtAnd the stored category number of the pedestrian pictures in all the target domain sample subsets is represented, and beta is a temperature coefficient.
Further, the network model training module performs iterative optimization on the pedestrian re-recognition network model specifically by the following means:
obtaining a total loss function according to the classification loss function, the sample invariance loss function and the layering loss function:
L=λ1Lsrc2LT3LSL
wherein L represents the total loss function, LsrcRepresenting the classification loss function, λ1Weight representing the classification loss function, LTRepresenting the sample invariance loss function, λ2Weight, L, representing the sample invariance loss functionSLRepresenting the hierarchical loss function, λ3A weight representing a hierarchical loss function;
and traversing each source domain sample subset and each target domain sample subset, and iteratively updating the total loss function until the variable quantity of the total loss function value is smaller than a preset value so as to complete the optimization of the pedestrian re-identification network model.
On the other hand, the invention provides an unsupervised domain self-adaptive pedestrian re-identification method, which comprises the following steps:
obtaining a source domain sample set comprising a plurality of source domain sample subsets and a target domain sample set comprising a plurality of target domain sample subsets;
respectively obtaining a classification loss function and a sample invariance loss function of a pedestrian re-identification network model based on the source domain sample subset and the target domain sample subset, sequencing and layering pedestrian pictures in the target domain sample subset according to the similarity of each pedestrian picture in the source domain sample subset and each pedestrian picture in the target domain sample subset, and obtaining a layering loss function based on layered pedestrian picture characteristics and corresponding layering weights; traversing each source domain sample subset and each target domain sample subset based on the classification loss function, the sample invariance loss function and the hierarchical loss function to perform iterative optimization on the pedestrian re-identification network model;
and identifying the image of the pedestrian to be identified by utilizing the optimized pedestrian re-identification network model to obtain the image which is the same as or similar to the image of the pedestrian to be identified.
Further, the hierarchical loss function is obtained by specifically:
respectively inputting the source domain sample subset and the target domain sample subset into a residual error network structure of a pedestrian re-identification network model to extract image features so as to obtain and respectively store the features of each pedestrian picture in the source domain sample subset and the features of each pedestrian picture in the target domain sample subset, and multiplying the features of each pedestrian picture in the source domain sample subset and the features of each pedestrian picture in the target domain sample subset to obtain corresponding similarity;
based on the similarity, sequencing the pedestrian pictures in the target domain sample subset in a descending order, sequentially selecting a first preset number of pictures as first-layer pictures, selecting a second preset number of pictures as second-layer pictures, taking the rest pictures as third-layer pictures, and respectively setting layer weights of the three layers of pictures;
obtaining a hierarchical loss function based on the characteristics of each pedestrian picture in the target domain sample subset and the level weight corresponding to the level to which the pedestrian picture belongs:
Figure BDA0003019905840000071
Figure BDA0003019905840000072
wherein L isSLRepresenting the hierarchical loss function, ntRepresenting the number of pedestrian pictures, w, in the target field sample subsett,rRepresenting the level weight, r representing the sequence number of the pedestrian pictures in the target domain sample subset according to the similarity, xt,iRepresenting the pedestrian picture with the input sequence i when the target domain sample subset is input into the pedestrian re-identification network model, f (x)t,i) Picture x representing pedestriant,iP (m | x)t,i) Picture x representing pedestriant,iProbability of similarity to all classes of pedestrian pictures in all stored subsets of source domain samples, R1[m]Features, R, representing all classes of pedestrian pictures in all stored subsets of active domain samples1[j]Features representing class j pedestrian pictures in all stored subsets of active domain samples, NsAnd the stored category number of the pedestrian pictures in all the active domain sample subsets is represented, and beta is a temperature coefficient.
Further, the layer weights of the three layers of pictures are set as follows:
Figure BDA0003019905840000073
Figure BDA0003019905840000074
where b denotes the number of first layer pictures, k1Represents the sum of the numbers of the first layer picture and the second layer picture, and p (l | i) represents the pedestrian picture xt,iProbability of class i pedestrian pictures in all target domain sample subsets, R1[l]Features representing class i pedestrian pictures in all stored subsets of target domain samples, NtRepresenting the number of categories of pedestrian pictures in all the stored subsets of the target domain samples.
Compared with the prior art, the invention can realize at least one of the following beneficial effects:
1. the unsupervised domain self-adaptive pedestrian re-identification system and the unsupervised domain self-adaptive pedestrian re-identification method effectively combine the supervised pedestrian re-identification with the unsupervised pedestrian re-identification through the combination of the source domain sample with the label and the target domain sample without the label, and firstly propose to layer the target domain sample through the similarity between the source domain sample and the target domain sample, so that the discrimination of the pedestrian re-identification network on the target domain sample is improved, the migration loss of the network is reduced, and the accuracy of the pedestrian re-identification result of the pedestrian re-identification network is further improved.
2. The unsupervised domain self-adaptive pedestrian re-recognition system and the unsupervised domain self-adaptive pedestrian re-recognition method, provided by the invention, utilize the similarity of the characteristics of the source domain sample and the target domain sample to carry out layering and weighting on the negative sample to obtain a layered loss function, the loss function enables the network to be self-adaptive in the training of a pedestrian re-recognition network model so as to reduce the learning of the characteristics of the negative sample, thereby reducing the migration loss of the network, and on the basis, the classification loss function and the sample invariance loss function are combined to carry out iterative optimization on the pedestrian re-recognition network, so that the accuracy of the pedestrian re-recognition result of the pedestrian re-recognition network.
In the invention, the technical schemes can be combined with each other to realize more preferable combination schemes. Additional features and advantages of the invention will be set forth in the description which follows, and in part will be obvious from the description, or may be learned by practice of the invention. The objectives and other advantages of the invention will be realized and attained by the structure particularly pointed out in the written description and drawings.
Drawings
The drawings are only for purposes of illustrating particular embodiments and are not to be construed as limiting the invention, wherein like reference numerals are used to designate like parts throughout.
FIG. 1 is a schematic diagram of an unsupervised domain adaptive pedestrian re-identification system according to an embodiment of the present invention;
FIG. 2 is a diagram illustrating a pedestrian re-identification network model according to an embodiment of the present invention;
fig. 3 is a flowchart of an unsupervised domain adaptive pedestrian re-identification method according to an embodiment of the present invention.
Reference numerals:
110-a data acquisition module; 120-a network model training module; 130-re-identification module.
Detailed Description
The accompanying drawings, which are incorporated in and constitute a part of this application, illustrate preferred embodiments of the invention and together with the description, serve to explain the principles of the invention and not to limit the scope of the invention.
System embodiment
The invention discloses an unsupervised domain self-adaptive pedestrian re-identification system. As shown in fig. 1, the system includes:
a data acquisition module 110 for acquiring a source domain sample set comprising a plurality of source domain sample subsets and a target domain sample set comprising a plurality of target domain sample subsets. Specifically, a large number of pictures of pedestrians are randomly acquired from different angles, each picture contains a pedestrian, the acquired pictures are divided into a training set and a testing set, the training set is used for training the pedestrian re-recognition network model, and the testing set is used for testing the trained pedestrian re-recognition network model so as to ensure the recognition accuracy of the pedestrian re-recognition network model. The method comprises the steps of randomly selecting partial pictures in a training set, adding a label to each picture, specifically, adding the same label to a plurality of pictures of the same pedestrian, wherein different labels represent different pedestrians, dividing the pictures with the labels added into a plurality of source domain sample subsets, taking other pictures without the labels in the training set as a target domain sample set, adding a number to each picture, and randomly dividing the pictures with the numbers added into a plurality of target domain sample subsets.
The network model training module 120 is configured to obtain a classification loss function and a sample invariance loss function of the pedestrian re-recognition network model, rank and layer pedestrian pictures in the target domain sample subset according to a similarity between each pedestrian picture in the source domain sample subset and each pedestrian picture in the target domain sample subset, and obtain a layer loss function based on layered pedestrian picture features and corresponding layer weights. And traversing each source domain sample subset and each target domain sample subset based on a classification loss function, a sample invariance loss function and a hierarchical loss function to perform iterative optimization on the pedestrian re-identification network model.
And the re-identification module is used for identifying the pedestrian image to be identified by utilizing the optimized pedestrian re-identification network model to obtain the image which is the same as or similar to the pedestrian image to be identified. Specifically, a to-be-identified pedestrian picture is input into a trained pedestrian re-identification network model, the model can output a picture most similar to the to-be-identified pedestrian picture, exemplarily, the model is set to output the number of the first three pictures with the highest similarity to the to-be-identified pedestrian picture, the first three pictures are pictures in a target domain sample set, and then the to-be-identified pedestrian picture and the first three pictures are manually compared to determine the category of the to-be-identified pedestrian picture.
Preferably, as shown in fig. 2, the pedestrian re-identification network model includes a residual network structure, a fully-connected layer and a Softmax normalization layer connected in sequence corresponding to the classification loss function, a target domain memory connected in sequence corresponding to the sample invariance loss function, and an L2The system comprises a normalization layer, a source domain memory and a similarity measurement axis network structure which are connected in sequence and correspond to a hierarchical loss function, wherein the residual error network structure is connected with a full connection layer, a target domain memory, the source domain memory and the similarity measurement axis network structure respectively. Specifically, the target domain memory and the source domain memory are both in a key value storage structure, the key is used for storing the characteristics of the pedestrian picture, the value is used for storing a number or a label corresponding to the pedestrian picture, and the residual network structure is ResNet 50.
Preferably, the network model training module obtains the classification loss function, the sample invariance loss function and the hierarchical loss function in the following way:
step 1, inputting the source domain sample subset and the target domain sample subset into a residual error network structure of a pedestrian re-identification network model respectively to extract image features so as to obtain the features of each pedestrian picture in the source domain sample subset and the features of each pedestrian picture in the target domain sample subset.
And storing the characteristics of each pedestrian picture in the target domain sample subset in a target domain memory, and storing the characteristics of each pedestrian picture in the source domain sample subset in a source domain memory.
Step 2, specifically, obtaining a classification loss function in the following way:
and sequentially inputting the characteristics of each pedestrian picture in the source domain sample subset into a full connection layer and a softmax regression layer of the pedestrian re-identification network model, and performing characteristic dimension conversion and characteristic normalization to strengthen the nonlinearity of the pedestrian re-identification network model in learning.
Classifying a loss function by adopting the following formula based on the characteristics of each pedestrian picture after dimension conversion and normalization;
Figure BDA0003019905840000111
Figure BDA0003019905840000112
wherein L issrcRepresenting the classification loss function, nsRepresenting the number of pedestrian pictures, x, in a subset of source domain sampless,mRepresents the m-th pedestrian picture in the source domain sample subset, f (x)s,m) Picture x representing pedestrians,mCharacteristic of (a), ys,mRepresenting a pedestrian picture x in a subset of source domain sampless,mClass label of p (y)s,m|xs,m) Picture x representing pedestrians,mBelong to the category ys,mProbability of (a), f (x)s,j) Representing features of class j pedestrian pictures in the source domain sample subset, NsRepresenting the number of categories of pedestrian pictures in all the stored subsets of the active domain samples.
Step 3, specifically, obtaining a sample invariance loss function through the following method:
l is carried out on the characteristics of each pedestrian picture in the current target domain sample subset stored in the target domain memory2And (6) carrying out standardization processing.
Obtaining a sample invariance loss function according to the characteristics of each pedestrian picture after the normalization processing through the following formula:
Figure BDA0003019905840000121
Figure BDA0003019905840000122
wherein L isTRepresenting the sample invariance loss function, ntRepresenting the number, x, of pictures of a person in the current target field sample subsett,iRepresenting the pedestrian picture with the input sequence i when the current target domain sample subset is input into the pedestrian re-identification network model, f (x)t,i) Picture x representing pedestriant,iIs characterized by wi,lPicture x representing pedestriant,iWeight of pedestrian pictures belonging to class I, p (l | x)t,i) Picture x representing pedestriant,iProbability of a pedestrian belonging to class I picture, NtAnd the stored category number of the pedestrian pictures in all the target domain sample subsets is represented, and beta is a temperature coefficient.
Step 4, obtaining a layering loss function by the following method:
multiplying the characteristics of each pedestrian picture in the current source domain sample subset and the characteristics of each pedestrian picture in the current target domain sample subset based on the similarity measurement axis network structure to obtain corresponding similarity;
and sequencing the pedestrian pictures in the target domain sample subset in a descending order based on the similarity, sequentially selecting a first preset number of pictures as first-layer pictures, selecting a second preset number of pictures as second-layer pictures, taking the rest pictures as third-layer pictures, and respectively setting layer weights of the three layers of pictures. Illustratively, the number of first layer pictures is 3, and the number of second layer pictures is 147.
Obtaining a hierarchical loss function based on the characteristics of each pedestrian picture in the target domain sample subset and the level weight corresponding to the level to which the pedestrian picture belongs:
Figure BDA0003019905840000123
Figure BDA0003019905840000131
wherein L isSLRepresenting the hierarchical loss function, ntRepresenting the number of pedestrian pictures, w, in the target field sample subsett,rRepresenting the level weight, r representing the sequence number of the pedestrian pictures in the target domain sample subset according to the similarity, xt,iRepresenting the pedestrian picture with the input sequence i when the target domain sample subset is input into the pedestrian re-identification network model, f (x)t,i) Picture x representing pedestriant,iP (m | x)t,i) Picture x representing pedestriant,iProbability of similarity to all classes of pedestrian pictures in all stored subsets of source domain samples, R1[m]Features, R, representing all classes of pedestrian pictures in all stored subsets of active domain samples1[j]Features representing class j pedestrian pictures in all stored subsets of active domain samples, NsAnd the stored category number of the pedestrian pictures in all the active domain sample subsets is represented, and beta is a temperature coefficient.
Preferably, the layer weights of the three layers of pictures are set as follows:
Figure BDA0003019905840000132
Figure BDA0003019905840000133
where b denotes the number of first layer pictures, k1Represents the sum of the numbers of the first layer picture and the second layer picture, and p (l | i) represents the pedestrian picture xt,iProbability of class i pedestrian pictures in all target domain sample subsets, R1[l]Features representing class i pedestrian pictures in all stored subsets of target domain samples, NtRepresenting pedestrians in all stored subsets of target domain samplesNumber of categories of pictures.
Specifically, the steps 2 to 4 are not sequentially performed, and can be performed simultaneously.
Step 5, specifically, the pedestrian re-identification network model is subjected to iterative optimization in the following mode:
obtaining a total loss function according to the classification loss function, the sample invariance loss function and the layering loss function:
L=λ1Lsrc2LT3LSL
wherein L represents the total loss function, LsrcRepresenting the classification loss function, λ1Weight representing the classification loss function, LTRepresenting the sample invariance loss function, λ2Weight, L, representing the sample invariance loss functionSLRepresenting the hierarchical loss function, λ3Weights representing a hierarchical loss function, exemplary, λ1A value of 0.7, λ2A value of 0.3, λ3Is 0.2.
And traversing each source domain sample subset and each target domain sample subset, and repeating the steps 1 to 5 to update the total loss function in an iterative manner until the variable quantity of the total loss function value is smaller than a preset value so as to complete the optimization of the pedestrian re-identification network model.
Specifically, in the iterative update process, the pedestrian image features of the target domain sample subset stored in the target domain memory are updated in real time in the following manner:
Figure BDA0003019905840000141
xt,irepresenting the pedestrian picture with the input sequence i when the target domain sample subset is input into the pedestrian re-identification network model, f (x)t,i) Picture x representing pedestriant,iWarp L2Normalized characteristic, RiImage x representing pedestrian in target domain stored in target domain memoryt,iIs characterized in that it is a mixture of two or more of the above-mentioned components,
Figure BDA0003019905840000142
is a hyper-parameter that controls the speed of feature updates.
Method embodiment
The embodiment of the method is based on the same principle as the system embodiment, so that the method is not repeated herein, and repeated parts can refer to the system embodiment on the market.
Specifically, as shown in fig. 3, the method includes:
s110, obtaining a source domain sample set comprising a plurality of source domain sample subsets and a target domain sample set comprising a plurality of target domain sample subsets.
S120, respectively obtaining a classification loss function and a sample invariance loss function of the pedestrian re-identification network model based on the source domain sample subset and the target domain sample subset, sequencing and layering pedestrian pictures in the target domain sample subset according to the similarity of each pedestrian picture in the source domain sample subset and each pedestrian picture in the target domain sample subset, and obtaining a layering loss function based on layered pedestrian picture characteristics and corresponding layering weights; and traversing each source domain sample subset and each target domain sample subset based on the classification loss function, the sample invariance loss function and the hierarchical loss function to perform iterative optimization on the pedestrian re-identification network model.
S130, identifying the pedestrian image to be identified by utilizing the optimized pedestrian re-identification network model, and obtaining the image which is the same as or similar to the pedestrian image to be identified.
Preferably, the stratification loss function is obtained in particular by:
respectively inputting the source domain sample subset and the target domain sample subset into a residual error network structure of a pedestrian re-identification network model to extract image features so as to obtain and respectively store the features of each pedestrian picture in the source domain sample subset and the features of each pedestrian picture in the target domain sample subset, and multiplying the features of each pedestrian picture in the source domain sample subset and the features of each pedestrian picture in the target domain sample subset to obtain corresponding similarity;
based on the similarity, sequencing the pedestrian pictures in the target domain sample subset in a descending order, sequentially selecting a first preset number of pictures as first-layer pictures, selecting a second preset number of pictures as second-layer pictures, taking the rest pictures as third-layer pictures, and respectively setting layer weights of the three layers of pictures;
obtaining a hierarchical loss function based on the characteristics of each pedestrian picture in the target domain sample subset and the level weight corresponding to the level to which the pedestrian picture belongs:
Figure BDA0003019905840000151
Figure BDA0003019905840000152
wherein L isSLRepresenting the hierarchical loss function, ntRepresenting the number of pedestrian pictures, w, in the target field sample subsett,rRepresenting the level weight, r representing the sequence number of the pedestrian pictures in the target domain sample subset according to the similarity, xt,iRepresenting the pedestrian picture with the input sequence i when the target domain sample subset is input into the pedestrian re-identification network model, f (x)t,i) Picture x representing pedestriant,iP (m | x)t,i) Picture x representing pedestriant,iProbability of similarity to all classes of pedestrian pictures in all stored subsets of source domain samples, R1[m]Features, R, representing all classes of pedestrian pictures in all stored subsets of active domain samples1[j]Features representing class j pedestrian pictures in all stored subsets of active domain samples, NsAnd the stored category number of the pedestrian pictures in all the active domain sample subsets is represented, and beta is a temperature coefficient.
Preferably, the layer weights of the three layers of pictures are set as follows:
Figure BDA0003019905840000161
Figure BDA0003019905840000162
where b denotes the number of first layer pictures, k1Represents the sum of the numbers of the first layer picture and the second layer picture, and p (l | i) represents the pedestrian picture xt,iProbability of class i pedestrian pictures in all target domain sample subsets, R1[l]Features representing class i pedestrian pictures in all stored subsets of target domain samples, NtRepresenting the number of categories of pedestrian pictures in all the stored subsets of the target domain samples.
Compared with the prior art, the unsupervised domain self-adaptive pedestrian re-identification system and the unsupervised domain self-adaptive pedestrian re-identification method provided by the embodiment have the advantages that on one hand, the combination of the source domain sample with the label and the target domain sample without the label effectively combines the supervised pedestrian re-identification with the unsupervised pedestrian re-identification, namely, the target domain sample is layered through the similarity between the source domain sample and the target domain sample for the first time, so that the discrimination of the pedestrian re-identification network on the target domain sample is improved, the migration loss of the network is reduced, and the accuracy of the pedestrian re-identification result by the pedestrian re-identification network is improved; on the other hand, the negative samples are layered and weighted by utilizing the similarity of the characteristics of the source domain samples and the target domain samples to obtain a layered loss function, the loss function enables the network to adaptively reduce the learning of the characteristics of the negative samples in the training of the pedestrian re-recognition network model, so that the migration loss of the network is reduced, on the basis, the classification loss function and the sample invariance loss function are combined to perform iterative optimization on the pedestrian re-recognition network, and the accuracy of the pedestrian re-recognition network on the pedestrian re-recognition result can be greatly improved.
Those skilled in the art will appreciate that all or part of the flow of the method implementing the above embodiments may be implemented by a computer program, which is stored in a computer readable storage medium, to instruct related hardware. The computer readable storage medium is a magnetic disk, an optical disk, a read-only memory or a random access memory.
The above description is only for the preferred embodiment of the present invention, but the scope of the present invention is not limited thereto, and any changes or substitutions that can be easily conceived by those skilled in the art within the technical scope of the present invention are included in the scope of the present invention.

Claims (10)

1. An unsupervised domain adaptive pedestrian re-identification system, comprising:
a data acquisition module for acquiring a source domain sample set comprising a plurality of source domain sample subsets and a target domain sample set comprising a plurality of target domain sample subsets;
the network model training module is used for obtaining a classification loss function and a sample invariance loss function of the pedestrian re-recognition network model, sequencing and layering pedestrian pictures in the target domain sample subset according to the similarity between each pedestrian picture in the source domain sample subset and each pedestrian picture in the target domain sample subset, and obtaining a layering loss function based on layered pedestrian picture characteristics and corresponding layering weights; traversing each source domain sample subset and each target domain sample subset based on the classification loss function, the sample invariance loss function and the hierarchical loss function to perform iterative optimization on the pedestrian re-identification network model;
and the re-identification module is used for identifying the pedestrian image to be identified by utilizing the optimized pedestrian re-identification network model to obtain the image which is the same as or similar to the pedestrian image to be identified.
2. The unsupervised domain adaptive pedestrian re-identification method according to claim 1, wherein the pedestrian re-identification network model comprises a residual network structure, a fully-connected layer and a Softmax normalization layer which are connected in sequence and correspond to a classification loss function, a target domain memory which is connected in sequence and correspond to a sample invariance loss function, and an L2A normalization layer, a source domain memory and a similarity measurement axis network structure which are connected in sequence and correspond to the hierarchical loss function, wherein the residual error network structure is respectively connected with the full connection layer, the target domain memory and the source domain memoryAnd similarity measure axis network structure connections.
3. The unsupervised domain adaptive pedestrian re-recognition method according to claim 1 or 2, wherein the network model training module obtains the hierarchical loss function by:
respectively inputting the source domain sample subset and the target domain sample subset into a residual error network structure of a pedestrian re-identification network model to extract image features so as to obtain and respectively store the features of each pedestrian picture in the source domain sample subset and the features of each pedestrian picture in the target domain sample subset, and multiplying the features of each pedestrian picture in the source domain sample subset and the features of each pedestrian picture in the target domain sample subset to obtain corresponding similarity;
based on the similarity, sequencing the pedestrian pictures in the target domain sample subset in a descending order, sequentially selecting a first preset number of pictures as first-layer pictures, selecting a second preset number of pictures as second-layer pictures, taking the rest pictures as third-layer pictures, and respectively setting layer weights of the three layers of pictures;
obtaining a hierarchical loss function based on the characteristics of each pedestrian picture in the target domain sample subset and the level weight corresponding to the level to which the pedestrian picture belongs:
Figure FDA0003019905830000021
Figure FDA0003019905830000022
wherein L isSLRepresenting the hierarchical loss function, ntRepresenting the number of pedestrian pictures, w, in the target field sample subsett,rRepresenting the level weight, r representing the sequence number of the pedestrian pictures in the target domain sample subset according to the similarity, xt,iRepresenting the pedestrian picture with the input sequence i when the target domain sample subset is input into the pedestrian re-identification network model, f (x)t,i) Picture x representing pedestriant,iP (m | x)t,i) Picture x representing pedestriant,iProbability of similarity to all classes of pedestrian pictures in all stored subsets of source domain samples, R1[m]Features, R, representing all classes of pedestrian pictures in all stored subsets of active domain samples1[j]Features representing class j pedestrian pictures in all stored subsets of active domain samples, NsAnd the stored category number of the pedestrian pictures in all the active domain sample subsets is represented, and beta is a temperature coefficient.
4. The unsupervised domain adaptive pedestrian re-identification method according to claim 3, wherein the set layer weights of the three layers of pictures are:
Figure FDA0003019905830000023
Figure FDA0003019905830000031
where b denotes the number of first layer pictures, k1Represents the sum of the numbers of the first layer picture and the second layer picture, and p (l | i) represents the pedestrian picture xt,iProbability of class i pedestrian pictures in all target domain sample subsets, R1[l]Features representing class i pedestrian pictures in all stored subsets of target domain samples, NtRepresenting the number of categories of pedestrian pictures in all the stored subsets of the target domain samples.
5. The unsupervised domain adaptive pedestrian re-recognition method of claim 3, wherein the network model training module obtains the classification loss function by:
inputting the source domain sample subset into a residual error network structure of a pedestrian re-identification network model to extract image features so as to obtain the features of each pedestrian picture in the source domain sample subset;
sequentially inputting the characteristics of each pedestrian picture into a full connection layer and a softmax regression layer of a pedestrian re-identification network model, and performing characteristic dimension conversion and characteristic normalization;
classifying a loss function by adopting the following formula based on the characteristics of each pedestrian picture after dimension conversion and normalization;
Figure FDA0003019905830000032
Figure FDA0003019905830000033
wherein L issrcRepresenting the classification loss function, nsRepresenting the number of pedestrian pictures, x, in a subset of source domain sampless,mRepresents the m-th pedestrian picture in the source domain sample subset, f (x)s,m) Picture x representing pedestrians,mCharacteristic of (a), ys,mRepresenting a pedestrian picture x in a subset of source domain sampless,mClass label of p (y)s,m|xs,m) Picture x representing pedestrians,mBelong to the category ys,mProbability of (a), f (x)s,j) Representing features of class j pedestrian pictures in the source domain sample subset, NsRepresenting the number of categories of pedestrian pictures in all the stored subsets of the active domain samples.
6. The unsupervised domain adaptive pedestrian re-recognition method according to claim 3, wherein the network model training module obtains the sample invariance loss function by:
inputting the target domain sample subset into a residual error network structure of a pedestrian re-identification network model to extract image features so as to obtain the features of each pedestrian picture in the target domain sample subset, and performing L on the features of each pedestrian picture2Carrying out standardization treatment;
obtaining a sample invariance loss function according to the characteristics of each pedestrian picture after the normalization processing through the following formula:
Figure FDA0003019905830000041
Figure FDA0003019905830000042
wherein L isTRepresenting the sample invariance loss function, ntRepresenting the number, x, of pictures of a person in a sample subset of the target fieldt,iRepresenting the pedestrian picture with the input sequence i when the target domain sample subset is input into the pedestrian re-identification network model, f (x)t,i) Picture x representing pedestriant,iIs characterized by wi,lPicture x representing pedestriant,iWeight of pedestrian pictures belonging to class I, p (l | x)t,i) Picture x representing pedestriant,iProbability of a pedestrian belonging to class I picture, NtAnd the stored category number of the pedestrian pictures in all the target domain sample subsets is represented, and beta is a temperature coefficient.
7. The unsupervised domain adaptive pedestrian re-recognition method according to claim 2, wherein the network model training module iteratively optimizes the pedestrian re-recognition network model by:
obtaining a total loss function according to the classification loss function, the sample invariance loss function and the layering loss function:
L=λ1Lsrc2LT3LSL
wherein L represents the total loss function, LsrcRepresenting the classification loss function, λ1Weight representing the classification loss function, LTRepresenting the sample invariance loss function, λ2Weight, L, representing the sample invariance loss functionSLRepresenting the hierarchical loss function, λ3A weight representing a hierarchical loss function;
and traversing each source domain sample subset and each target domain sample subset, and iteratively updating the total loss function until the variable quantity of the total loss function value is smaller than a preset value so as to complete the optimization of the pedestrian re-identification network model.
8. An unsupervised domain adaptive pedestrian re-identification method is characterized by comprising the following steps:
obtaining a source domain sample set comprising a plurality of source domain sample subsets and a target domain sample set comprising a plurality of target domain sample subsets;
respectively obtaining a classification loss function and a sample invariance loss function of a pedestrian re-identification network model based on the source domain sample subset and the target domain sample subset, sequencing and layering pedestrian pictures in the target domain sample subset according to the similarity of each pedestrian picture in the source domain sample subset and each pedestrian picture in the target domain sample subset, and obtaining a layering loss function based on layered pedestrian picture characteristics and corresponding layering weights; traversing each source domain sample subset and each target domain sample subset based on the classification loss function, the sample invariance loss function and the hierarchical loss function to perform iterative optimization on the pedestrian re-identification network model;
and identifying the image of the pedestrian to be identified by utilizing the optimized pedestrian re-identification network model to obtain the image which is the same as or similar to the image of the pedestrian to be identified.
9. The unsupervised domain adaptive pedestrian re-identification method according to claim 8, characterized in that the hierarchical loss function is obtained by:
respectively inputting the source domain sample subset and the target domain sample subset into a residual error network structure of a pedestrian re-identification network model to extract image features so as to obtain and respectively store the features of each pedestrian picture in the source domain sample subset and the features of each pedestrian picture in the target domain sample subset, and multiplying the features of each pedestrian picture in the source domain sample subset and the features of each pedestrian picture in the target domain sample subset to obtain corresponding similarity;
based on the similarity, sequencing the pedestrian pictures in the target domain sample subset in a descending order, sequentially selecting a first preset number of pictures as first-layer pictures, selecting a second preset number of pictures as second-layer pictures, taking the rest pictures as third-layer pictures, and respectively setting layer weights of the three layers of pictures;
obtaining a hierarchical loss function based on the characteristics of each pedestrian picture in the target domain sample subset and the level weight corresponding to the level to which the pedestrian picture belongs:
Figure FDA0003019905830000061
Figure FDA0003019905830000062
wherein L isSLRepresenting the hierarchical loss function, ntRepresenting the number of pedestrian pictures, w, in the target field sample subsett,rRepresenting the level weight, r representing the sequence number of the pedestrian pictures in the target domain sample subset according to the similarity, xt,iRepresenting the pedestrian picture with the input sequence i when the target domain sample subset is input into the pedestrian re-identification network model, f (x)t,i) Picture x representing pedestriant,iP (m | x)t,i) Picture x representing pedestriant,iProbability of similarity to all classes of pedestrian pictures in all stored subsets of source domain samples, R1[m]Features, R, representing all classes of pedestrian pictures in all stored subsets of active domain samples1[j]Features representing class j pedestrian pictures in all stored subsets of active domain samples, NsAnd the stored category number of the pedestrian pictures in all the active domain sample subsets is represented, and beta is a temperature coefficient.
10. The unsupervised domain adaptive pedestrian re-identification method according to claim 9, wherein the set layer weights of the three layers of pictures are:
Figure FDA0003019905830000063
Figure FDA0003019905830000064
where b denotes the number of first layer pictures, k1Represents the sum of the numbers of the first layer picture and the second layer picture, and p (l | i) represents the pedestrian picture xt,iProbability of class i pedestrian pictures in all target domain sample subsets, R1[l]Features representing class i pedestrian pictures in all stored subsets of target domain samples, NtRepresenting the number of categories of pedestrian pictures in all the stored subsets of the target domain samples.
CN202110399589.7A 2021-04-14 2021-04-14 Self-adaptive pedestrian re-identification system and method for unsupervised domain Active CN113095229B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202110399589.7A CN113095229B (en) 2021-04-14 2021-04-14 Self-adaptive pedestrian re-identification system and method for unsupervised domain

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202110399589.7A CN113095229B (en) 2021-04-14 2021-04-14 Self-adaptive pedestrian re-identification system and method for unsupervised domain

Publications (2)

Publication Number Publication Date
CN113095229A true CN113095229A (en) 2021-07-09
CN113095229B CN113095229B (en) 2024-04-12

Family

ID=76677387

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202110399589.7A Active CN113095229B (en) 2021-04-14 2021-04-14 Self-adaptive pedestrian re-identification system and method for unsupervised domain

Country Status (1)

Country Link
CN (1) CN113095229B (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN115510926A (en) * 2022-11-23 2022-12-23 武汉理工大学 Cross-machine type diesel engine combustion chamber fault diagnosis method and system
CN116405651A (en) * 2023-06-09 2023-07-07 中国矿业大学(北京) Multi-camera cross-view pedestrian data generation method and system

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20160307072A1 (en) * 2015-04-17 2016-10-20 Nec Laboratories America, Inc. Fine-grained Image Classification by Exploring Bipartite-Graph Labels
CN111126360A (en) * 2019-11-15 2020-05-08 西安电子科技大学 Cross-domain pedestrian re-identification method based on unsupervised combined multi-loss model
CN111401281A (en) * 2020-03-23 2020-07-10 山东师范大学 Unsupervised pedestrian re-identification method and system based on deep clustering and sample learning
US20200226421A1 (en) * 2019-01-15 2020-07-16 Naver Corporation Training and using a convolutional neural network for person re-identification

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20160307072A1 (en) * 2015-04-17 2016-10-20 Nec Laboratories America, Inc. Fine-grained Image Classification by Exploring Bipartite-Graph Labels
US20200226421A1 (en) * 2019-01-15 2020-07-16 Naver Corporation Training and using a convolutional neural network for person re-identification
CN111126360A (en) * 2019-11-15 2020-05-08 西安电子科技大学 Cross-domain pedestrian re-identification method based on unsupervised combined multi-loss model
CN111401281A (en) * 2020-03-23 2020-07-10 山东师范大学 Unsupervised pedestrian re-identification method and system based on deep clustering and sample learning

Non-Patent Citations (3)

* Cited by examiner, † Cited by third party
Title
JIAHAN LI等: "Unsupervised Person Re-Identification Based on Measurement Axis", IEEE SIGNAL PROCESSING LETTERS, vol. 28, pages 379 - 383, XP011839499, DOI: 10.1109/LSP.2021.3055116 *
SUNCHENG XIANG等: "Unsupervised person re-identification by hierarchical cluster and domain transfer", MULTIMEDIA TOOLS AND APPLICATIONS, vol. 79, pages 19769 - 19786, XP037214928, DOI: 10.1007/s11042-020-08723-x *
ZHUN ZHONG等: "Invariance Matters: Exemplar Memory for Domain Adaptive Person Re-identification", 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), pages 598 - 607 *

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN115510926A (en) * 2022-11-23 2022-12-23 武汉理工大学 Cross-machine type diesel engine combustion chamber fault diagnosis method and system
CN116405651A (en) * 2023-06-09 2023-07-07 中国矿业大学(北京) Multi-camera cross-view pedestrian data generation method and system
CN116405651B (en) * 2023-06-09 2023-08-01 中国矿业大学(北京) Multi-camera cross-view pedestrian data generation method and system

Also Published As

Publication number Publication date
CN113095229B (en) 2024-04-12

Similar Documents

Publication Publication Date Title
CN111967294B (en) Unsupervised domain self-adaptive pedestrian re-identification method
Liao et al. Learning deep parsimonious representations
CN110516095B (en) Semantic migration-based weak supervision deep hash social image retrieval method and system
CN109993236B (en) One-shot Simese convolutional neural network-based small-sample Manchu matching method
CN112990280B (en) Class increment classification method, system, device and medium for image big data
CN110909820A (en) Image classification method and system based on self-supervision learning
CN110942091B (en) Semi-supervised few-sample image classification method for searching reliable abnormal data center
CN110941734B (en) Depth unsupervised image retrieval method based on sparse graph structure
CN113887580B (en) Contrast type open set image recognition method and device considering multi-granularity correlation
CN112541458A (en) Domain-adaptive face recognition method, system and device based on meta-learning
CN113065516B (en) Sample separation-based unsupervised pedestrian re-identification system and method
CN113326390B (en) Image retrieval method based on depth feature consistent Hash algorithm
CN114241273A (en) Multi-modal image processing method and system based on Transformer network and hypersphere space learning
CN113095229B (en) Self-adaptive pedestrian re-identification system and method for unsupervised domain
CN112766496B (en) Deep learning model safety guarantee compression method and device based on reinforcement learning
CN113139664A (en) Cross-modal transfer learning method
CN114842343A (en) ViT-based aerial image identification method
CN115439685A (en) Small sample image data set dividing method and computer readable storage medium
CN114692750A (en) Fine-grained image classification method and device, electronic equipment and storage medium
CN114357221A (en) Self-supervision active learning method based on image classification
CN116910571B (en) Open-domain adaptation method and system based on prototype comparison learning
CN116524243A (en) Classification method and device for fossil images of penstones
CN111401519B (en) Deep neural network unsupervised learning method based on similarity distance in object and between objects
CN110941994B (en) Pedestrian re-identification integration method based on meta-class-based learner
CN108304546B (en) Medical image retrieval method based on content similarity and Softmax classifier

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
CB03 Change of inventor or designer information

Inventor after: Cheng Deqiang

Inventor after: Kou Qiqi

Inventor after: Li Jiahan

Inventor after: Li Yunlong

Inventor after: Zhang Haoxiang

Inventor after: Han Chenggong

Inventor after: Xu Jinyang

Inventor after: Jiang Man

Inventor after: Liu Ruihang

Inventor before: Li Jiahan

Inventor before: Li Yunlong

Inventor before: Cheng Deqiang

Inventor before: Kou Qiqi

Inventor before: Zhang Haoxiang

Inventor before: Han Chenggong

Inventor before: Xu Jinyang

Inventor before: Jiang Man

Inventor before: Liu Ruihang

CB03 Change of inventor or designer information
TA01 Transfer of patent application right

Effective date of registration: 20240315

Address after: 221116 No. 1 Tongshan University Road, Xuzhou City, Jiangsu Province

Applicant after: CHINA University OF MINING AND TECHNOLOGY

Country or region after: China

Address before: 221116 No. 1 Tongshan University Road, Xuzhou City, Jiangsu Province

Applicant before: CHINA University OF MINING AND TECHNOLOGY

Country or region before: China

Applicant before: Jiangsu Huatu Mining Technology Co.,Ltd.

TA01 Transfer of patent application right
GR01 Patent grant
GR01 Patent grant