CN115294600A - Method, system, electronic device and storage medium for pedestrian clothing color identification - Google Patents

Method, system, electronic device and storage medium for pedestrian clothing color identification

Info

Publication number
CN115294600A
CN115294600A (application CN202210768732.XA)
Authority
CN
China
Prior art keywords
image
label
trousers
color
pedestrian
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202210768732.XA
Other languages
Chinese (zh)
Inventor
查杭
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Wuhan Zhongzhi Digital Technology Co ltd
Original Assignee
Wuhan Zhongzhi Digital Technology Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Wuhan Zhongzhi Digital Technology Co ltd filed Critical Wuhan Zhongzhi Digital Technology Co ltd
Priority to CN202210768732.XA priority Critical patent/CN115294600A/en
Publication of CN115294600A publication Critical patent/CN115294600A/en
Pending legal-status Critical Current

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V40/00Recognition of biometric, human-related or animal-related patterns in image or video data
    • G06V40/10Human or animal bodies, e.g. vehicle occupants or pedestrians; Body parts, e.g. hands
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods
    • G06N3/084Backpropagation, e.g. using gradient descent
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00Arrangements for image or video recognition or understanding
    • G06V10/40Extraction of image or video features
    • G06V10/56Extraction of image or video features relating to colour
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00Arrangements for image or video recognition or understanding
    • G06V10/70Arrangements for image or video recognition or understanding using pattern recognition or machine learning
    • G06V10/764Arrangements for image or video recognition or understanding using pattern recognition or machine learning using classification, e.g. of video objects
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00Arrangements for image or video recognition or understanding
    • G06V10/70Arrangements for image or video recognition or understanding using pattern recognition or machine learning
    • G06V10/82Arrangements for image or video recognition or understanding using pattern recognition or machine learning using neural networks

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Evolutionary Computation (AREA)
  • Multimedia (AREA)
  • Health & Medical Sciences (AREA)
  • Artificial Intelligence (AREA)
  • Computing Systems (AREA)
  • General Health & Medical Sciences (AREA)
  • Software Systems (AREA)
  • Databases & Information Systems (AREA)
  • Medical Informatics (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Biomedical Technology (AREA)
  • Biophysics (AREA)
  • Computational Linguistics (AREA)
  • Data Mining & Analysis (AREA)
  • Molecular Biology (AREA)
  • General Engineering & Computer Science (AREA)
  • Mathematical Physics (AREA)
  • Human Computer Interaction (AREA)
  • Image Analysis (AREA)

Abstract

The application provides a method, a system, an electronic device and a storage medium for identifying the clothing colors of pedestrians, comprising the following steps: acquiring a pedestrian image from a training image through a preset training algorithm; dividing the pedestrian image in the height direction into a jacket image and a trousers image, the jacket image comprising an occluded or falsely detected jacket image and a normal jacket image, and the trousers image comprising an occluded or falsely detected trousers image and a normal trousers image; performing color label labeling on the jacket image and the trousers image to obtain a labeled jacket image and a labeled trousers image; inputting the labeled jacket image and the labeled trousers image into a preset convolutional neural network to obtain color features; and inputting an image to be recognized into the convolutional neural network trained on the color features to obtain a color recognition result. The usability of pedestrian clothing color identification is improved; identification of pedestrian occlusion and false detection is added, improving the reliability of pedestrian clothing color identification.

Description

Method, system, electronic device and storage medium for pedestrian clothing color identification
Technical Field
The present application relates to the field of color recognition, and in particular, to a method, a system, an electronic device, and a storage medium for recognizing a color of a pedestrian garment.
Background
Because video monitoring is widely used for security in all kinds of places, fine-grained identification of pedestrians is particularly important. Pedestrian clothing color is an important attribute in monitoring systems and the most prominent appearance feature of a pedestrian. Pedestrian clothing color identification aims at identifying the colors of a pedestrian's jacket and trousers in a video or image; accurate color identification can greatly improve the efficiency of finding a pedestrian.
Existing deep-learning-based pedestrian clothing color identification methods can generally achieve higher accuracy than traditional identification methods. However, the existing methods are all multi-class classification methods whose identification result is a single specific color; this works well for solid-color upper and lower garments, but is not applicable when the upper or lower body contains mixed colors. In addition, existing deep-learning pedestrian color identification methods cannot handle the case where the pedestrian in the image is occluded or falsely detected: when the pedestrian is occluded or falsely detected, the identified color is that of the occluding object or the falsely detected object, which greatly reduces the reliability of clothing color identification and retrieval.
Disclosure of Invention
The embodiments of the invention mainly aim to provide a method, a system, an electronic device and a storage medium for identifying the clothing colors of pedestrians. A color multi-label model is adopted to overcome the problem that a common color multi-class classification model classifies solid colors well but cannot handle mixed colors, improving the usability of pedestrian clothing color identification; identification of pedestrian occlusion and false detection is added, improving the reliability of pedestrian clothing color identification.
In a first aspect, there is provided a method of pedestrian clothing color identification, the method comprising:
acquiring a pedestrian image from a training image through a preset training algorithm;
dividing the pedestrian image in a height direction into a jacket image and a trousers image, the jacket image comprising: an occluded or falsely detected jacket image and a normal jacket image, and the trousers image comprising: an occluded or falsely detected trousers image and a normal trousers image;
performing color label labeling on the coat image and the trousers image to obtain a label coat image and a label trousers image;
inputting the label coat image and the label trousers image into a preset convolution neural network to obtain color characteristics;
and inputting the image to be recognized into the convolutional neural network trained by the color characteristics to obtain a color recognition result.
In one possible implementation, the inputting the labeled jacket image and the labeled trousers image into a preset convolutional neural network to obtain the color characteristics includes:
adjusting the label coat image and the label trousers image to preset standard sizes;
constructing a convolutional neural network resnet18 model through the PyTorch framework to perform feature extraction on the adjusted labeled jacket image and labeled trousers image to obtain color features;
the resnet18 model is trained by a back-propagation algorithm with the average two-class cross entropy loss of the labels as the target loss function, and the training is done when the loss is no longer reduced.
In another possible implementation manner, the inputting the image to be recognized into the convolutional neural network trained by the color feature to obtain the result of color recognition includes:
adjusting the image to be recognized to a preset standard size;
inputting the adjusted image to be recognized into the trained convolutional neural network, and adopting a logistic regression binary classification algorithm with the sigmoid activation function
h(x) = 1 / (1 + e^(-x))
to obtain probability values for the different labels, the labels comprising: a color, an occlusion state, or a false detection state;
determining a label according to the probability values, comprising: if a probability value is greater than a preset probability value threshold, the identification result includes the label corresponding to that probability value.
In another possible implementation, when the convolutional neural network is trained with the two-class cross-entropy loss, the loss function of a single sample is:
loss_i = -y_i log(h(x)) - (1 - y_i) log(1 - h(x)),
where h(x) is the sigmoid activation function and y_i is the label, taking the value 0 or 1; for a plurality of samples, the loss function is the average loss over the samples:
Loss = (1/m) Σ_{i=1}^{m} loss_i,
where m is the number of samples.
In a second aspect, there is provided a system for pedestrian clothing color identification, the system comprising:
the pedestrian image acquisition module is used for acquiring a pedestrian image from the training image through a preset training algorithm;
an image dividing module for dividing the pedestrian image in a height direction into a jacket image and a trousers image, the jacket image comprising: an occluded or falsely detected jacket image and a normal jacket image, and the trousers image comprising: an occluded or falsely detected trousers image and a normal trousers image;
the label labeling module is used for performing color label labeling on the coat image and the trousers image to obtain a labeled coat image and a labeled trousers image;
the color feature acquisition module is used for inputting the label jacket image and the label trousers image into a preset convolutional neural network to acquire color features;
and the color recognition result acquisition module is used for inputting the image to be recognized into the convolutional neural network trained by the color characteristics to acquire a color recognition result.
In one possible implementation, the inputting the labeled jacket image and the labeled trousers image into a preset convolutional neural network to obtain the color characteristics includes:
adjusting the label coat image and the label trousers image to preset standard sizes;
constructing a convolutional neural network resnet18 model through the PyTorch framework to perform feature extraction on the adjusted labeled jacket image and labeled trousers image to obtain color features;
training the resnet18 model with the average two-class cross entropy loss of the labels as the target loss function through a back propagation algorithm, and completing the training when the loss is not reduced any more.
In another possible implementation manner, the inputting the image to be recognized into the convolutional neural network trained by the color feature to obtain the result of color recognition includes:
adjusting the image to be recognized to a preset standard size;
inputting the adjusted image to be recognized into the trained convolutional neural network, and adopting a logistic regression binary classification algorithm with the sigmoid activation function
h(x) = 1 / (1 + e^(-x))
to obtain probability values for the different labels, the labels comprising: a color, an occlusion state, or a false detection state;
determining a label according to the probability values, comprising: if a probability value is greater than a preset probability value threshold, the identification result includes the label corresponding to that probability value.
In another possible implementation, when the convolutional neural network is trained with the two-class cross-entropy loss, the loss function of a single sample is:
loss_i = -y_i log(h(x)) - (1 - y_i) log(1 - h(x)),
where h(x) is the sigmoid activation function and y_i is the label, taking the value 0 or 1; for a plurality of samples, the loss function is the average loss over the samples:
Loss = (1/m) Σ_{i=1}^{m} loss_i,
where m is the number of samples.
In a third aspect, an electronic device is provided, which includes a memory, a processor and a computer program stored on the memory and executable on the processor, and when the processor executes the program, the method for identifying the color of the pedestrian clothing provided in the first aspect is implemented.
In a fourth aspect, a non-transitory computer readable storage medium is provided, on which a computer program is stored, which computer program, when executed by a processor, implements the method of pedestrian clothing color identification as provided in the first aspect.
Drawings
In order to more clearly illustrate the technical solutions in the embodiments of the present application, the drawings used in the description of the embodiments of the present application will be briefly described below.
FIG. 1 is a flow chart of a method for pedestrian clothing color identification according to one embodiment of the present invention;
FIG. 2 is a block diagram of a system for pedestrian clothing color identification according to one embodiment of the present invention;
fig. 3 is a schematic physical structure diagram of an electronic device according to the present invention.
Detailed description of the invention
Reference will now be made in detail to embodiments of the present application, examples of which are illustrated in the accompanying drawings, wherein like or similar reference numerals refer to the same or similar modules or modules having the same or similar functionality throughout. The embodiments described below with reference to the drawings are exemplary only for the purpose of explaining the present application and are not to be construed as limiting the present invention.
As used herein, the singular forms "a", "an" and "the" are intended to include the plural forms as well, unless the context clearly indicates otherwise. It will be further understood that the terms "comprises" and/or "comprising", when used in this specification, specify the presence of stated features, integers, steps, operations, modules, and/or components, but do not preclude the presence or addition of one or more other features, integers, steps, operations, modules, components, and/or groups thereof. It will be understood that when a module is referred to as being "connected" or "coupled" to another module, it can be directly connected or coupled to the other module, or intervening modules may also be present. Further, "connected" or "coupled" as used herein may include wirelessly connected or wirelessly coupled. As used herein, the term "and/or" includes all or any combination of one or more of the associated listed items.
To make the objectives, technical solutions and advantages of the present application more clear, the following detailed description of the implementations of the present application will be made with reference to the accompanying drawings.
The technical solutions of the present application, and how they solve the above technical problems, will be described in detail with specific examples below. The following specific embodiments may be combined with each other, and the same or similar concepts or processes may not be repeated in some embodiments. Embodiments of the present application will be described below with reference to the accompanying drawings.
Fig. 1 is a flowchart illustrating a method for identifying colors of clothing of a pedestrian according to an embodiment of the present invention, the method including:
Step 101, acquiring a pedestrian image from a training image through a preset training algorithm;
Step 102, dividing the pedestrian image in the height direction into a jacket image and a trousers image, wherein the jacket image comprises: an occluded or falsely detected jacket image and a normal jacket image, and the trousers image comprises: an occluded or falsely detected trousers image and a normal trousers image;
Step 103, performing color label labeling on the jacket image and the trousers image to obtain a labeled jacket image and a labeled trousers image;
Step 104, inputting the labeled jacket image and the labeled trousers image into a preset convolutional neural network to obtain color features;
Step 105, inputting an image to be recognized into the convolutional neural network trained on the color features to obtain a color recognition result.
In the embodiment of the invention, each frame of the monitoring videos captured in different scenes and at different times is obtained as a training image; the YOLOv5 algorithm is adopted to detect pedestrians in the training images, the DeepSORT tracking algorithm is used to track and snapshot pedestrians in combination with the detection results, and the snapshotted pedestrian detection results are collected and stored. The collected pedestrian detection images are ensured to include unoccluded pedestrian images, occluded pedestrian images and falsely detected pedestrian images.
For the pedestrian image, the image is divided in the height direction into a jacket image region and a trousers image region: the jacket region extends from the upper edge of the image down to 0.4 of the total image height, and the trousers region extends from the lower edge of the image up to 0.45 of the total image height. The occluded and falsely detected jacket and trousers images are cropped in the same way.
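As a minimal sketch of this split (the function name and the rounding behavior are assumptions of mine; the patent only specifies the 0.4 and 0.45 height ratios), the crop row ranges can be computed as:

```python
def split_pedestrian_image(height):
    """Return (start, end) pixel-row ranges for the jacket and trousers
    crops of a pedestrian image of the given height, per the embodiment:
    jacket = top 0.4 of the image, trousers = bottom 0.45."""
    jacket = (0, int(round(height * 0.4)))                    # top 40%
    trousers = (height - int(round(height * 0.45)), height)   # bottom 45%
    return jacket, trousers

# For a 200-pixel-tall pedestrian crop:
print(split_pedestrian_image(200))  # -> ((0, 80), (110, 200))
```

Note that the two regions together cover only 85% of the height, leaving a gap around the waist; this matches the ratios as stated.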
The cropped jacket and trousers images are annotated with 11 kinds of labels: the 10 colors black, gray, white, red, orange, yellow, green, cyan, blue and purple, plus occlusion or false detection; all cropped jacket and trousers images are multi-label annotated in turn. If a color, an occlusion or a false detection is present in a cropped image, the corresponding label is marked as 1; otherwise it is marked as 0.
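The multi-label annotation above is a multi-hot vector over the 11 labels; a hypothetical encoding helper (the label names and their order are my assumption) might look like:

```python
# The 11 labels: 10 colors plus one occlusion/false-detection label.
LABELS = ["black", "gray", "white", "red", "orange", "yellow",
          "green", "cyan", "blue", "purple", "occluded_or_false"]

def make_label_vector(present):
    """Multi-hot encoding: each label present in the crop is marked 1,
    all others 0, so mixed-color clothing can carry several 1s."""
    present = set(present)
    return [1 if name in present else 0 for name in LABELS]

# A jacket with mixed red and white coloring:
print(make_label_vector({"red", "white"}))
# -> [0, 0, 1, 1, 0, 0, 0, 0, 0, 0, 0]
```

Because several entries may be 1 at once, this is what distinguishes the multi-label model from the multi-class models criticized in the background section.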
Wherein, the inputting the label jacket image and the label trousers image into a preset convolution neural network to obtain the color characteristics comprises:
adjusting the label coat image and the label trousers image to preset standard sizes;
constructing a convolutional neural network resnet18 model through the PyTorch framework to perform feature extraction on the adjusted labeled jacket image and labeled trousers image to obtain color features;
training the resnet18 model with the average two-class cross entropy loss of the labels as the target loss function through a back propagation algorithm, and completing the training when the loss is not reduced any more.
The inputting the image to be recognized into the convolutional neural network trained by the color features to obtain the result of color recognition, including:
adjusting the image to be recognized to a preset standard size;
inputting the adjusted image to be recognized into the trained convolutional neural network, and adopting a logistic regression binary classification algorithm with the sigmoid activation function
h(x) = 1 / (1 + e^(-x))
to obtain probability values for the different labels, the labels comprising: a color, an occlusion state, or a false detection state;
determining a label according to the probability values, comprising: if a probability value is greater than a preset probability value threshold, the identification result includes the label corresponding to that probability value.
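The thresholding step above can be sketched as follows (the 0.5 default and the label names are my assumptions; the patent only says the threshold is preset):

```python
LABELS = ["black", "gray", "white", "red", "orange", "yellow",
          "green", "cyan", "blue", "purple", "occluded_or_false"]

def decode_labels(probs, labels, threshold=0.5):
    """Keep every label whose sigmoid probability exceeds the preset
    threshold; several labels may fire at once for mixed-color
    clothing, or the occlusion/false-detection label may fire alone."""
    return [name for p, name in zip(probs, labels) if p > threshold]

# Sigmoid outputs for a white-and-red jacket:
probs = [0.10, 0.20, 0.70, 0.90, 0.00, 0.10, 0.00, 0.00, 0.05, 0.00, 0.02]
print(decode_labels(probs, LABELS))  # -> ['white', 'red']
```

When the occlusion/false-detection label fires, the color outputs can be treated as unreliable, which is how the method improves retrieval reliability.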
When the convolutional neural network is trained with the two-class cross-entropy loss, the loss function of a single sample is:
loss_i = -y_i log(h(x)) - (1 - y_i) log(1 - h(x)),
where h(x) is the sigmoid activation function and y_i is the label, taking the value 0 or 1; for a plurality of samples, the loss function is the average loss over the samples:
Loss = (1/m) Σ_{i=1}^{m} loss_i,
where m is the number of samples.
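A direct transcription of the two formulas above (in practice PyTorch's built-in BCE loss would be used; this plain-Python version only illustrates the math):

```python
import math

def sigmoid(x):
    # h(x) = 1 / (1 + e^(-x))
    return 1.0 / (1.0 + math.exp(-x))

def bce_loss(logits, targets):
    """Average two-class cross-entropy over m samples:
    loss_i = -y_i*log(h(x)) - (1 - y_i)*log(1 - h(x)),
    Loss   = (1/m) * sum_i loss_i."""
    m = len(logits)
    total = 0.0
    for x, y in zip(logits, targets):
        h = sigmoid(x)
        total += -y * math.log(h) - (1 - y) * math.log(1 - h)
    return total / m

# Confident correct predictions give a near-zero loss:
print(bce_loss([10.0, -10.0], [1, 0]))
```

With a multi-label target vector, this loss is simply averaged over all 11 labels as well as over the samples.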
According to the embodiment of the invention, a pedestrian image is obtained from a training image through a preset training algorithm; the pedestrian image is divided in the height direction into a jacket image and a trousers image; the jacket image and the trousers image are labeled with color labels to obtain a labeled jacket image and a labeled trousers image; the labeled jacket image and the labeled trousers image are input into a preset convolutional neural network to obtain color features; and an image to be recognized is input into the convolutional neural network trained on the color features to obtain a color recognition result. The color multi-label model overcomes the problem that a common color multi-class classification model classifies solid colors well but cannot handle mixed colors, improving the usability of pedestrian clothing color identification; identification of pedestrian occlusion and false detection is added, improving the reliability of pedestrian clothing color identification.
As shown in fig. 2, an embodiment of the present invention provides a system for pedestrian clothing color identification, the system comprising:
a pedestrian image obtaining module 201, configured to obtain a pedestrian image from a training image through a preset training algorithm;
an image dividing module 202, configured to divide the pedestrian image in a height direction into a jacket image and a trousers image, where the jacket image comprises: an occluded or falsely detected jacket image and a normal jacket image, and the trousers image comprises: an occluded or falsely detected trousers image and a normal trousers image;
the label labeling module 203 is used for performing color label labeling on the jacket image and the trousers image to obtain a labeled jacket image and a labeled trousers image;
a color feature obtaining module 204, configured to input the labeled jacket image and the labeled trousers image into a preset convolutional neural network, so as to obtain a color feature;
the color recognition result obtaining module 205 is configured to input the image to be recognized into the convolutional neural network trained by the color feature, and obtain a color recognition result.
In the embodiment of the invention, each frame of the monitoring videos captured in different scenes and at different times is obtained as a training image; the YOLOv5 algorithm is adopted to detect pedestrians in the training images, the DeepSORT tracking algorithm is used to track and snapshot pedestrians in combination with the detection results, and the snapshotted pedestrian detection results are collected and stored. The collected pedestrian detection images are ensured to include unoccluded pedestrian images, occluded pedestrian images and falsely detected pedestrian images.
For the pedestrian image, the image is divided in the height direction into a jacket image region and a trousers image region: the jacket region extends from the upper edge of the image down to 0.4 of the total image height, and the trousers region extends from the lower edge of the image up to 0.45 of the total image height. The occluded and falsely detected jacket and trousers images are cropped in the same way.
The cropped jacket and trousers images are annotated with 11 kinds of labels: the 10 colors black, gray, white, red, orange, yellow, green, cyan, blue and purple, plus occlusion or false detection; all cropped jacket and trousers images are multi-label annotated in turn. If a color, an occlusion or a false detection is present in a cropped image, the corresponding label is marked as 1; otherwise it is marked as 0.
Wherein, the inputting the label jacket image and the label trousers image into a preset convolution neural network to obtain the color characteristics comprises:
adjusting the label coat image and the label trousers image to preset standard sizes;
constructing a convolutional neural network resnet18 model through the PyTorch framework to perform feature extraction on the adjusted labeled jacket image and labeled trousers image to obtain color features;
training the resnet18 model with the average two-class cross entropy loss of the labels as the target loss function through a back propagation algorithm, and completing the training when the loss is not reduced any more.
Wherein, the inputting the image to be recognized into the convolutional neural network trained by the color features to obtain the result of color recognition, which comprises:
adjusting the image to be recognized to a preset standard size;
inputting the adjusted image to be recognized into the trained convolutional neural network, and adopting a logistic regression binary classification algorithm with the sigmoid activation function
h(x) = 1 / (1 + e^(-x))
to obtain probability values for the different labels, the labels comprising: a color, an occlusion state, or a false detection state;
determining a label according to the probability values, comprising: if a probability value is greater than a preset probability value threshold, the identification result includes the label corresponding to that probability value.
When the convolutional neural network is trained with the two-class cross-entropy loss, the loss function of a single sample is:
loss_i = -y_i log(h(x)) - (1 - y_i) log(1 - h(x)),
where h(x) is the sigmoid activation function and y_i is the label, taking the value 0 or 1; for a plurality of samples, the loss function is the average loss over the samples:
Loss = (1/m) Σ_{i=1}^{m} loss_i,
where m is the number of samples.
According to the embodiment of the invention, a pedestrian image is obtained from a training image through a preset training algorithm; the pedestrian image is divided in the height direction into a jacket image and a trousers image; the jacket image and the trousers image are labeled with color labels to obtain a labeled jacket image and a labeled trousers image; the labeled jacket image and the labeled trousers image are input into a preset convolutional neural network to obtain color features; and an image to be recognized is input into the convolutional neural network trained on the color features to obtain a color recognition result. The color multi-label model overcomes the problem that a common color multi-class classification model classifies solid colors well but cannot handle mixed colors, improving the usability of pedestrian clothing color identification; identification of pedestrian occlusion and false detection is added, improving the reliability of pedestrian clothing color identification.
Fig. 3 illustrates a schematic physical structure diagram of an electronic device, which, as shown in fig. 3, may include: a processor (processor) 301, a communication interface (Communications Interface) 302, a memory (memory) 303 and a communication bus 304, wherein the processor, the communication interface and the memory communicate with one another through the communication bus. The processor may invoke logic instructions in the memory to perform a method of pedestrian clothing color identification, the method comprising: acquiring a pedestrian image from a training image through a preset training algorithm; dividing the pedestrian image in a height direction into a jacket image and a trousers image, the jacket image comprising: an occluded or falsely detected jacket image and a normal jacket image, and the trousers image comprising: an occluded or falsely detected trousers image and a normal trousers image; performing color label labeling on the jacket image and the trousers image to obtain a labeled jacket image and a labeled trousers image; inputting the labeled jacket image and the labeled trousers image into a preset convolutional neural network to obtain color features; and inputting an image to be recognized into the convolutional neural network trained on the color features to obtain a color recognition result.
In addition, the logic instructions in the memory may be implemented in the form of software functional units and may be stored in a computer readable storage medium when sold or used as a stand-alone product. Based on such understanding, the technical solution of the present invention or a part thereof which substantially contributes to the prior art may be embodied in the form of a software product, which is stored in a storage medium and includes several instructions for causing a computer device (which may be a personal computer, a server, or a network device) to execute all or part of the steps of the method according to the embodiments of the present invention. And the aforementioned storage medium includes: a U-disk, a removable hard disk, a Read-Only Memory (ROM), a Random Access Memory (RAM), a magnetic disk or an optical disk, and other various media capable of storing program codes.
In another aspect, embodiments of the present invention also provide a computer program product, the computer program product comprising a computer program stored on a non-transitory computer-readable storage medium, the computer program comprising program instructions which, when executed by a computer, cause the computer to execute the method of pedestrian clothing color identification provided by the above method embodiments, the method comprising: acquiring a pedestrian image from a training image through a preset training algorithm; dividing the pedestrian image in a height direction into a jacket image and a trousers image, the jacket image comprising: an occluded or falsely detected jacket image and a normal jacket image, and the trousers image comprising: an occluded or falsely detected trousers image and a normal trousers image; performing color label labeling on the jacket image and the trousers image to obtain a labeled jacket image and a labeled trousers image; inputting the labeled jacket image and the labeled trousers image into a preset convolutional neural network to obtain color features; and inputting an image to be recognized into the convolutional neural network trained on the color features to obtain a color recognition result.
In yet another aspect, an embodiment of the present invention further provides a non-transitory computer-readable storage medium on which a computer program is stored, the computer program, when executed by a processor, implementing the method for identifying colors of pedestrian clothing provided by the above embodiments, the method including: acquiring a pedestrian image from a training image through a preset training algorithm; dividing the pedestrian image in the height direction into a jacket image and a trousers image, the jacket image including an occluded or falsely detected jacket image and a normal jacket image, and the trousers image including an occluded or falsely detected trousers image and a normal trousers image; performing color label annotation on the jacket image and the trousers image to obtain a labeled jacket image and a labeled trousers image; inputting the labeled jacket image and the labeled trousers image into a preset convolutional neural network to obtain color features; and inputting an image to be recognized into the convolutional neural network trained with the color features to obtain a color recognition result.
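For illustration only (not part of the claimed text), the height-direction division of a pedestrian crop into a jacket part and a trousers part described in the embodiments above can be sketched as follows. The 50/50 split ratio and the row-list image representation are assumptions, since the embodiments do not specify them:

```python
# Illustrative sketch of the height-direction split described above.
# The pedestrian image is represented as a list of pixel rows; the
# 50/50 split ratio is an assumption, not specified by the embodiments.

def split_pedestrian_image(rows, split_ratio=0.5):
    """Divide a pedestrian crop into a jacket part (top rows) and a
    trousers part (bottom rows) along the height direction."""
    if not rows:
        raise ValueError("empty image")
    cut = int(len(rows) * split_ratio)  # row index where the split occurs
    jacket_rows = rows[:cut]      # upper body -> jacket image
    trousers_rows = rows[cut:]    # lower body -> trousers image
    return jacket_rows, trousers_rows

# Example: a 6-row "image" of single-pixel rows.
image = [[0], [1], [2], [3], [4], [5]]
jacket, trousers = split_pedestrian_image(image)
```

In a real pipeline the occluded, falsely detected, and normal crops would all pass through the same split before color labels are attached.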
It should be understood that, although the steps in the flowcharts of the figures are shown in an order indicated by the arrows, they are not necessarily performed in that order. Unless explicitly stated herein, the steps are not bound to a strict order and may be performed in other orders. Moreover, at least some of the steps in the flowcharts may include multiple sub-steps or stages, which are not necessarily completed at the same moment but may be performed at different moments, and whose execution order is not necessarily sequential; they may be performed in turn or alternately with other steps, or with at least some of the sub-steps or stages of other steps.
The above description covers only some implementations of the present invention. It should be noted that those skilled in the art can make various modifications and improvements without departing from the principle of the present invention, and such modifications and improvements shall also fall within the protection scope of the present invention.

Claims (10)

1. A method of pedestrian clothing color identification, the method comprising:
acquiring a pedestrian image from a training image through a preset training algorithm;
dividing the pedestrian image in the height direction into a jacket image and a trousers image, the jacket image including: an occluded or falsely detected jacket image and a normal jacket image, and the trousers image including: an occluded or falsely detected trousers image and a normal trousers image;
performing color label annotation on the jacket image and the trousers image to obtain a labeled jacket image and a labeled trousers image;
inputting the labeled jacket image and the labeled trousers image into a preset convolutional neural network to obtain color features;
and inputting an image to be recognized into the convolutional neural network trained with the color features to obtain a color recognition result.
2. The method of claim 1, wherein said inputting the labeled jacket image and the labeled trousers image into a preset convolutional neural network to obtain color features comprises:
adjusting the labeled jacket image and the labeled trousers image to a preset standard size;
constructing a convolutional neural network resnet18 model through the PyTorch framework to perform feature extraction on the adjusted labeled jacket image and labeled trousers image to obtain color features;
and training the resnet18 model through a back-propagation algorithm with the average two-class cross entropy loss over the labels as the target loss function, the training being complete when the loss no longer decreases.
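Claim 2's stopping criterion, that training completes when the loss is no longer reduced, can be sketched independently of any framework. The loss values below are synthetic, and checking against the best loss so far with no patience window is an assumption about how "no longer reduced" is tested:

```python
# Minimal early-stopping sketch: stop the first time the loss fails to
# decrease. A real run would compute each epoch's loss as the average
# two-class cross entropy over the labeled jacket/trousers images.

def train_until_plateau(losses_per_epoch):
    """Return the number of epochs actually run: training stops the
    first time the loss fails to decrease relative to the best so far."""
    best = float("inf")
    epochs_run = 0
    for loss in losses_per_epoch:
        epochs_run += 1
        if loss >= best:      # loss is no longer reduced -> stop
            break
        best = loss
    return epochs_run

# Synthetic loss curve: decreases for 4 epochs, then plateaus at epoch 5.
epochs = train_until_plateau([0.9, 0.6, 0.45, 0.40, 0.40, 0.38])
```

In practice the same check would sit inside the back-propagation loop, evaluated once per epoch.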
3. The method of claim 1, wherein said inputting the image to be recognized into the convolutional neural network trained with the color features to obtain the color recognition result comprises:
adjusting the image to be recognized to a preset standard size;
inputting the adjusted image to be recognized into the trained convolutional neural network, and applying, through a logistic regression binary classification algorithm, the sigmoid activation function h(x) = 1 / (1 + e^(-x)) to obtain probability values for different labels, the labels including: a color, an occlusion status, or a false detection status;
and determining a label according to the probability values: if a probability value is greater than a preset probability threshold, the recognized result is the label corresponding to that probability value.
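A minimal sketch of the sigmoid activation and the threshold rule in claim 3. The label names, the raw logit values, and the 0.5 threshold are illustrative assumptions, not values taken from the patent:

```python
import math

def sigmoid(x):
    """Logistic function h(x) = 1 / (1 + e^(-x))."""
    return 1.0 / (1.0 + math.exp(-x))

def predict_labels(logits, threshold=0.5):
    """Return the labels whose sigmoid probability exceeds the preset
    probability threshold (independent binary decision per label)."""
    return [name for name, z in logits.items() if sigmoid(z) > threshold]

# Hypothetical per-label logits for one jacket crop: the "red" color
# label and the "occluded" status label exceed the threshold.
logits = {"red": 2.0, "blue": -3.0, "occluded": 1.0, "false_detection": -4.0}
predicted = predict_labels(logits)
```

Because each label is thresholded independently, a crop can simultaneously carry a color label and an occlusion-status label, matching the multi-label setup of the claims.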
4. The method of claim 3, wherein, when the convolutional neural network is trained using the two-class cross entropy loss, the loss function for a single sample is Loss_i = -y_i·log(h(x)) - (1 - y_i)·log(1 - h(x)), where h(x) is the sigmoid activation function and y_i is the label, taking the value 0 or 1; for multiple samples, the loss function is the average loss over the samples:
L = -(1/m) · Σ_{i=1}^{m} [ y_i·log(h(x_i)) + (1 - y_i)·log(1 - h(x_i)) ],
where m is the number of samples.
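The averaged two-class cross entropy of claim 4 can be checked numerically; the sample labels and predictions below are synthetic:

```python
import math

def binary_cross_entropy(y_true, h_pred):
    """Average two-class cross entropy over m samples:
    L = -(1/m) * sum_i [ y_i*log(h_i) + (1 - y_i)*log(1 - h_i) ]."""
    m = len(y_true)
    total = 0.0
    for y, h in zip(y_true, h_pred):
        # y is 0 or 1; h is the sigmoid output for that sample.
        total += -y * math.log(h) - (1 - y) * math.log(1 - h)
    return total / m

# Two synthetic samples: a positive label and a negative label, each
# predicted at probability 0.5, each contributing -log(0.5) = log(2).
loss = binary_cross_entropy([1, 0], [0.5, 0.5])
```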
5. A system for pedestrian clothing color identification, the system comprising:
a pedestrian image acquisition module, configured to acquire a pedestrian image from a training image through a preset training algorithm;
an image dividing module, configured to divide the pedestrian image in the height direction into a jacket image and a trousers image, the jacket image including: an occluded or falsely detected jacket image and a normal jacket image, and the trousers image including: an occluded or falsely detected trousers image and a normal trousers image;
a label annotation module, configured to perform color label annotation on the jacket image and the trousers image to obtain a labeled jacket image and a labeled trousers image;
a color feature acquisition module, configured to input the labeled jacket image and the labeled trousers image into a preset convolutional neural network to obtain color features;
and a color recognition result acquisition module, configured to input an image to be recognized into the convolutional neural network trained with the color features to obtain a color recognition result.
6. The system of claim 5, wherein said inputting the labeled jacket image and the labeled trousers image into a preset convolutional neural network to obtain color features comprises:
adjusting the labeled jacket image and the labeled trousers image to a preset standard size;
constructing a convolutional neural network resnet18 model through the PyTorch framework to perform feature extraction on the adjusted labeled jacket image and labeled trousers image to obtain color features;
and training the resnet18 model through a back-propagation algorithm with the average two-class cross entropy loss over the labels as the target loss function, the training being complete when the loss no longer decreases.
7. The system of claim 5, wherein said inputting the image to be recognized into the convolutional neural network trained with the color features to obtain the color recognition result comprises:
adjusting the image to be recognized to a preset standard size;
inputting the adjusted image to be recognized into the trained convolutional neural network, and applying, through a logistic regression binary classification algorithm, the sigmoid activation function h(x) = 1 / (1 + e^(-x)) to obtain probability values for different labels, the labels including: a color, an occlusion status, or a false detection status;
and determining a label according to the probability values: if a probability value is greater than a preset probability threshold, the recognized result is the label corresponding to that probability value.
8. The system of claim 7, wherein, when the convolutional neural network is trained using the two-class cross entropy loss, the loss function for a single sample is Loss_i = -y_i·log(h(x)) - (1 - y_i)·log(1 - h(x)), where h(x) is the sigmoid activation function and y_i is the label, taking the value 0 or 1; for multiple samples, the loss function is the average loss over the samples:
L = -(1/m) · Σ_{i=1}^{m} [ y_i·log(h(x_i)) + (1 - y_i)·log(1 - h(x_i)) ],
where m is the number of samples.
9. An electronic device comprising a memory, a processor and a computer program stored on the memory and executable on the processor, wherein the processor when executing the program implements the method of pedestrian clothing color identification as claimed in any one of claims 1 to 4.
10. A non-transitory computer-readable storage medium, on which a computer program is stored, which, when being executed by a processor, implements the method of pedestrian clothing color identification according to any one of claims 1 to 4.
CN202210768732.XA 2022-06-30 2022-06-30 Method, system, electronic device and storage medium for pedestrian clothing color identification Pending CN115294600A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202210768732.XA CN115294600A (en) 2022-06-30 2022-06-30 Method, system, electronic device and storage medium for pedestrian clothing color identification

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202210768732.XA CN115294600A (en) 2022-06-30 2022-06-30 Method, system, electronic device and storage medium for pedestrian clothing color identification

Publications (1)

Publication Number Publication Date
CN115294600A true CN115294600A (en) 2022-11-04

Family

ID=83822431

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202210768732.XA Pending CN115294600A (en) 2022-06-30 2022-06-30 Method, system, electronic device and storage medium for pedestrian clothing color identification

Country Status (1)

Country Link
CN (1) CN115294600A (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN115984680A (en) * 2023-02-15 2023-04-18 博奥生物集团有限公司 Identification method and device for can printing colors, storage medium and equipment


Similar Documents

Publication Publication Date Title
Zhou et al. Multi-label learning of part detectors for heavily occluded pedestrian detection
Bedagkar-Gala et al. A survey of approaches and trends in person re-identification
Sheikh et al. Traffic sign detection and classification using colour feature and neural network
US9098775B2 (en) Multi-class identifier, method, and computer-readable recording medium
CN105184818B (en) A kind of video monitoring anomaly detection method and its detecting system
KR20180049298A (en) System and method for recognizing information from vehicle license plate
CN112381775A (en) Image tampering detection method, terminal device and storage medium
Jain et al. Pixel objectness
KR20020093092A (en) Classification of objects through model ensembles
Kontschieder et al. Context-sensitive decision forests for object detection
Chuang et al. Recognizing live fish species by hierarchical partial classification based on the exponential benefit
CN111814690B (en) Target re-identification method, device and computer readable storage medium
CN111401278A (en) Helmet identification method and device, electronic equipment and storage medium
CN103049749A (en) Method for re-recognizing human body under grid shielding
Mannan et al. Classification of degraded traffic signs using flexible mixture model and transfer learning
CN115294600A (en) Method, system, electronic device and storage medium for pedestrian clothing color identification
Ghosh et al. Reliable pedestrian detection using a deep neural network trained on pedestrian counts
Cholakkal et al. A classifier-guided approach for top-down salient object detection
Julca-Aguilar et al. Text/non-text classification of connected components in document images
AU2017279658A1 (en) Pose-aligned descriptor for person re-id with geometric and orientation information
KR101419837B1 (en) Method and apparatus for adaboost-based object detection using partitioned image cells
CN103077376B (en) Method for distinguishing is known again based on the human body image in video image
CN116109864A (en) Garment detection and identification method, device, terminal and computer readable storage medium
Patravali et al. Skin segmentation using YCBCR and RGB color models
CN112395990B (en) Method, device, equipment and storage medium for detecting weak and small targets of multi-frame infrared images

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination