CN110316630B

CN110316630B - Deviation early warning method and system for installation angle of elevator camera

Info

Publication number: CN110316630B
Application number: CN201910475330.9A
Authority: CN
Inventors: 陈清梁; 陈国特; 王伟; 王超; 蔡巍伟
Original assignee: Zhejiang Xinzailing Technology Co ltd
Current assignee: Zhejiang Xinzailing Technology Co ltd
Priority date: 2019-06-03
Filing date: 2019-06-03
Publication date: 2020-12-29
Anticipated expiration: 2039-06-03
Also published as: CN110316630A

Abstract

The invention discloses a deviation early warning method and a system for an installation angle of an elevator camera, wherein the method comprises the following steps: detecting the state of the door; a picture is grabbed, and a camera at the top of the car is used for grabbing an elevator image; detecting whether people exist or not, detecting the captured picture by using a yolov3 detection model, if the head in the picture and a target frame for shielding a human body can be detected, indicating that people exist, otherwise, detecting that no people exist, when an unmanned signal is output, performing an image analysis process, and when an unmanned signal is output, waiting for a next elevator door closing signal; performing semantic segmentation on the image; estimating the vertexes of the rectangle, namely fitting an external rectangle aiming at the ground area of the elevator car obtained by segmentation to obtain the positions of four vertexes of the external rectangle; tilt and coverage metrics. The invention solves the problem of the inclination of the camera at the top of the elevator car caused by some reasons, can predict which cameras incline in an image mode, and estimates the inclination degree and the car ground area coverage rate.

Description

Deviation early warning method and system for installation angle of elevator camera

Technical Field

The invention belongs to the technical field of data analysis, and particularly relates to a deviation early warning method and system for an installation angle of an elevator camera.

Background

The camera in the elevator car is an important medium for carrying out elevator car picture acquisition and elevator car environment monitoring, and the installation angle of the camera can greatly influence the picture acquisition effect.

Therefore, in practical application, the early warning of the deviation of the installation angle of the elevator camera in the car is necessary.

Disclosure of Invention

In view of the above technical problems, the present invention is directed to providing a method and a system for warning deviation of an installation angle of an elevator camera.

In order to solve the technical problems, the invention adopts the following technical scheme:

in one aspect of the invention, a deviation early warning method for an installation angle of an elevator camera is provided, which comprises the following steps:

detecting a door state, acquiring an image sequence by a camera at the top of a car, detecting the position of an alarm prompt tag from the whole image when the car runs for the first time, defining a subsequent detected local area according to the position of the alarm prompt tag, analyzing the subsequent image sequence to define the local area, and judging the opening and closing state of the door according to the distance between the tags, wherein the opening state, the closing state and the closing state are four states, the state is judged once every 200ms, the door state change refers to the state from closing to opening, otherwise, a door closing signal is output when the door state is changed from opening to closing, and other state changes do not output signals any more;

a picture is grabbed, and a camera at the top of the car is used for grabbing an elevator image;

detecting whether people exist or not, detecting the captured picture by using a yolov3 detection model, if the head in the picture and a target frame for shielding a human body can be detected, indicating that people exist, otherwise, detecting that no people exist, when an unmanned signal is output, performing an image analysis process, and when an unmanned signal is output, waiting for a next elevator door closing signal;

performing image semantic segmentation, namely performing image semantic segmentation on the captured picture to obtain the ground area of the elevator car; the semantic segmentation network output is defined herein as two classes, car floor and background, respectively. The pixel area occupied by the car ground is calibrated manually by collecting data when different elevator cars have sundries. Training the semantic segmentation network by using the calibrated data, and obtaining a semantic segmentation network model after the training is finished; because the target outputs two types, the sigmod cross entropy loss function is used for guiding training, the original picture can be input by installing and deploying the semantic segmentation model, and the corresponding mask area is output;

estimating the vertexes of the rectangle, namely fitting an external rectangle aiming at the ground area of the elevator car obtained by segmentation to obtain the positions of four vertexes of the external rectangle;

the inclination and coverage rate measurement is carried out, wherein the inclination measurement is to determine which vertex is used for measurement with the corresponding image boundary according to the installation position of the camera and the position relation between the circumscribed rectangle and the picture, and if the camera is installed at a corner, the intersection angle between the circumscribed rectangle and the image boundary is large; if the camera is arranged in the middle, the intersection angle of the circumscribed rectangle and the image boundary is small, if the vertex of the circumscribed rectangle is outside the image boundary, the distance between the circumscribed rectangle and the corresponding image boundary is defined as negative, otherwise, the distance is defined as positive; for one corner of the camera installed inside the top of the car, whether the elevator camera is installed properly is judged according to the distance from the left point and the right point of the bottom of the car to the image boundary and the distance from the top point of the lower part of the bottom of the car to the image lower boundary; for the camera to be installed in the middle of one side inside the top of the car, the distance from the two top points at the bottom to the bottom edge of the image and the distance from the two top points to the two sides of the image are required to be used for judging whether the camera is installed properly; and the coverage rate measurement definition defines the ratio of the bottom area of the divided car to the area of the external rectangle, and the coverage condition of the camera on the bottom visual angle of the car is obtained.

Preferably, the steps of door state detection, image capture, human detection, image semantic segmentation, rectangular vertex estimation and inclination and coverage rate measurement are performed once every other time, and whether the angle of the elevator camera is changed or not is judged according to the calculation result of each time.

Preferably, in the process of judging whether a person exists in the captured picture in the car, the target is defined as a head and a sheltered human body, and whether the head and the human body exist in the picture is identified through target detection; the image semantic segmentation area is the ground at the bottom of the car, and the picture is segmented semantically to obtain the ground area of the car, so that an external rectangle of the picture is obtained, and four vertexes are obtained; the door state identification also utilizes a target detection mode, corresponding alarm prompt tags are pasted at the higher position of the elevator door, and the door state detection is realized by detecting the positions among the alarm prompt tags.

Preferably, the semantic segmentation is classification on a pixel level, pixels belonging to the same class are classified into one class, a semantic segmentation network is formed by adopting an Encoder-Decoder architecture of U-net and simultaneously borrowing a spatial pyramid formatting module in PSPnet, a single-model single-scale method is adopted, an encoding part Enencoder input picture passes through 4 convolution blocks to be subjected to 4 times of downsampling, then spatial pyramid formatting operation is carried out on a 16 x 16 feature map, a newly generated feature map and a previous 16 x 16 feature map are spliced, and a decoding part Decode performs channel compression by utilizing 1 x 1 channel convolution. The method comprises the steps of utilizing bilinear interpolation to conduct upsampling, conducting adding operation on a feature graph obtained through upsampling and a previous 32 x 32 feature graph, adjusting to be consistent through 1 x 1 channel convolution if the number of channels of the previous 32 x 32 feature graph is inconsistent, conducting repeated operation twice subsequently to obtain a 128 x 128 feature graph, conducting upsampling for the last time to obtain a 256 x 256 feature graph, finally obtaining a 2 x 256 prediction template through a plurality of convolution layers, and comparing with a calibration template value, wherein 2 represents two categories, namely car ground and background.

In another aspect of the present invention, there is provided a deviation warning system for an installation angle of an elevator camera, including:

the door state detection unit is used for acquiring an image sequence by utilizing a camera at the top of the car, detecting the position of an alarm prompt tag from the whole image when the car is in operation for the first time, defining a subsequent detected local area according to the position of the alarm prompt tag, analyzing the subsequent image sequence to define the local area, and judging the opening and closing state of the door according to the distance between the tags, wherein the opening and closing state comprises four states of opening the door, closing the door and closing the door, the state change is judged once every 200ms, the door state change refers to the state from closing the door to opening the door, otherwise, a door closing signal is output when the door state is changed from opening the door to closing the door, and;

the image grabbing unit is used for grabbing an elevator image by using a camera on the top of the car;

the device comprises a presence and absence detection unit, a door closing detection unit and a door closing detection unit, wherein the presence and absence detection unit is used for detecting a captured picture by using a yolov3 detection model, if the head and a target frame for shielding a human body in the picture can be detected, the presence of the person is indicated, otherwise, the absence of the person is indicated, when an absence signal is output, an image analysis process is carried out, and when an absence signal is output, a door closing signal of the elevator needs to wait for the next time;

the image semantic segmentation unit is used for performing image semantic segmentation on the captured picture to obtain an elevator car ground area; the semantic segmentation network output is defined herein as two classes, car floor and background, respectively. The pixel area occupied by the car ground is calibrated manually by collecting data when different elevator cars have sundries. Training the semantic segmentation network by using the calibrated data, and obtaining a semantic segmentation network model after the training is finished; because the target outputs two types, the sigmod cross entropy loss function is used for guiding training, the original picture can be input by installing and deploying the semantic segmentation model, and the corresponding mask area is output;

the rectangular vertex estimation unit is used for fitting an external rectangle aiming at the ground area of the elevator car obtained by segmentation to obtain four vertex positions of the external rectangle;

the inclination and coverage rate measuring unit is used for judging which vertex is used and the corresponding image boundary to measure according to the installation position of the camera and the position relation between the circumscribed rectangle and the picture, and if the camera is installed at a corner, the intersection angle between the circumscribed rectangle and the image boundary is large; if the camera is arranged in the middle, the intersection angle of the circumscribed rectangle and the image boundary is small, if the vertex of the circumscribed rectangle is outside the image boundary, the distance between the circumscribed rectangle and the corresponding image boundary is defined as negative, otherwise, the distance is defined as positive; for one corner of the camera installed inside the top of the car, whether the elevator camera is installed properly is judged according to the distance from the left point and the right point of the bottom of the car to the image boundary and the distance from the top point of the lower part of the bottom of the car to the image lower boundary; for the camera to be installed in the middle of one side inside the top of the car, the distance from the two top points at the bottom to the bottom edge of the image and the distance from the two top points to the two sides of the image are required to be used for judging whether the camera is installed properly; and the coverage rate measurement definition defines the ratio of the bottom area of the divided car to the area of the external rectangle, and the coverage condition of the camera on the bottom visual angle of the car is obtained.

The invention has the following beneficial effects: the problem of the camera slope at elevator car top because some reasons cause is solved, can predict which camera heads take place the slope through the mode of image, estimate inclination degree and car visual angle coverage to and provide real-time correction evaluation for the maintainer, guide maintainer angle adjustment.

Drawings

Fig. 1 is a flowchart illustrating steps of a method for correcting an installation angle of an elevator camera according to an embodiment of the present invention;

FIG. 2 is a schematic diagram of a camera mounted at a corner with a large intersection angle between a circumscribed rectangle and an image boundary according to an embodiment of the present invention;

FIG. 3 is a schematic diagram of a case where the ground area of the car is blocked when the angle deviation of the camera is large, and the area of the case is small compared with that of the circumscribed rectangle according to the embodiment of the present invention;

FIG. 4 is a flow chart illustrating semantic segmentation according to an embodiment of the present invention;

fig. 5 is a flowchart of steps of a system for correcting an installation angle of an elevator camera according to an embodiment of the present invention.

Detailed Description

The technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the drawings in the embodiments of the present invention, and it is obvious that the described embodiments are some, not all, embodiments of the present invention. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present invention.

Referring to fig. 1, a flow chart of steps of a method for correcting an installation angle of an elevator camera according to an embodiment of the present invention is shown, which includes the following steps:

detecting a door state, acquiring an image sequence by a camera at the top of a car, detecting the position of an alarm prompt tag from the whole image when the car runs for the first time, defining a subsequent detected local area according to the position of the alarm prompt tag, analyzing the subsequent image sequence to define the local area, and judging the opening and closing state of the door according to the distance between the tags, wherein the opening state, the closing state and the closing state are four states, the state is judged once every 200ms, the door state change refers to the state from closing to opening, otherwise, a door closing signal is output when the door state is changed from opening to closing, and other state changes do not output signals any more; the door state analysis needs to analyze the door state in real time through videos, in order to reduce the calculated amount, firstly, the position of an alarm prompt label is obtained through the global visual field, then a local detection area is set according to the position of the alarm prompt label, and then only the local detection area is detected, and finally, the local detection area can be automatically adjusted according to the central position of the label, so that the situation that the camera head deviates due to some reasons is prevented.

and a tilt and coverage measure, wherein the tilt measure is used for determining which vertices are used to measure with corresponding image boundaries according to the camera mounting position and the position relation of the circumscribed rectangle and the picture, namely h1, h2 and h 3. If the camera is installed at a corner, the intersection angle of the circumscribed rectangle and the image boundary is large, as shown in FIG. 2; if the camera is arranged in the middle, the intersection angle of the circumscribed rectangle and the image boundary is small. If the vertex of the circumscribed rectangle is outside the image boundary, the distance from the vertex to the corresponding image boundary is defined as negative, otherwise, the distance is defined as positive. For one corner of the camera installed inside the top of the car, whether the elevator camera is installed properly is judged through the distances from the left point and the right point of the bottom of the car to the image boundary, namely h1 and h2 (the left distance and the right distance are required to be equal in sign in principle) and the distance from the top point of the lower part of the bottom of the car to the image lower boundary, namely h3 (the distance is required to be smaller and close to zero in principle, and the sign is positive). For the camera to be installed in the middle of one side inside the top of the car, the distances from the two top points at the bottom to the bottom edge of the image and the distances from the two top points to the two sides of the image (the distance is required to be smaller and close to zero and the sign is positive in principle) are respectively utilized to judge whether the camera is installed properly; and the coverage rate measurement defines the ratio of the bottom area of the divided car to the area of the external rectangle to obtain the coverage condition of the camera on the view angle of the car. For example, when the angular deviation of the camera is large, the floor area of the car is blocked, which is small compared with the area of the circumscribed rectangle, as shown in fig. 3. Whether the installation angle of the elevator camera is suitable is analyzed, the elevator video monitoring is facilitated to shoot the concerned area to the maximum extent, and the elevator video monitoring system is also facilitated to inform a cloud management platform of arrangement personnel to maintain in time due to vibration or angle change caused by people in real time.

In an actual application scene, whether the angle installation of the elevator camera is proper or not can be analyzed by detecting four vertexes at the bottom of the car. It should be noted that most cameras are installed at one corner of the top of the car, so the circumscribed rectangle obtained by the method is an erected rhombus; of course, part of the camera is arranged in the middle of one side of the inner part of the top of the car, and the obtained external rectangle is a rectangle with one side parallel to the bottom edge of the image. For both cases, the camera installation location may be identified by the installer input system, or may be programmatically analyzed. In the embodiment of the invention, the installation position of the camera is identified for an installer.

In a specific application example, the steps of door state detection, image capture, human existence detection, image semantic segmentation, rectangular vertex estimation and inclination and coverage rate measurement are detected once every other period of time, and whether the angle of the elevator camera is changed or not is judged according to the calculation result of each time.

In a specific application example, in the process of judging whether a person exists in a captured picture in a car, a target is defined as a head and a sheltered human body, and whether the head and the human body exist in the picture is identified through target detection; the image semantic segmentation area is the ground at the bottom of the car, and the picture is segmented semantically to obtain the ground area of the car, so that an external rectangle of the picture is obtained, and four vertexes are obtained; the door state identification also utilizes a target detection mode, corresponding alarm prompt tags are pasted at the higher position of the elevator door, and the door state detection is realized by detecting the positions among the alarm prompt tags.

Further, semantic segmentation is classification at a pixel level, pixels belonging to the same class are classified into one class, referring to fig. 4, a semantic segmentation network is formed by adopting an Encoder-Decoder architecture of U-net and simultaneously using a spatial pyramid clustering module in PSPnet, a single-model single-scale method is adopted, an encoding part Encoder input picture passes through 4 convolution blocks to perform 4 times of downsampling, then, a spatial pyramid clustering operation is performed on a 16 × 16 feature map, a newly generated feature map is spliced with the previous 16 × 16 feature map, and a decoding part Decoder performs channel compression by using 1 × 1 channel convolution. The method comprises the steps of utilizing bilinear interpolation to conduct upsampling, conducting adding operation on a feature graph obtained through upsampling and a previous 32 x 32 feature graph, adjusting to be consistent through 1 x 1 channel convolution if the number of channels of the previous 32 x 32 feature graph is inconsistent, conducting repeated operation twice subsequently to obtain a 128 x 128 feature graph, conducting upsampling for the last time to obtain a 256 x 256 feature graph, finally obtaining a 2 x 256 prediction template through a plurality of convolution layers, and comparing with a calibration template value, wherein 2 represents two categories, namely car ground and background.

Corresponding to the embodiment of the method of the invention, referring to fig. 5, the invention provides a deviation early warning system for the installation angle of an elevator camera, which comprises:

the image semantic segmentation unit is used for performing image semantic segmentation on the captured picture to obtain an elevator car ground area; the semantic segmentation network output is defined herein as two classes, car floor and background, respectively. The pixel area occupied by the car ground is calibrated manually by collecting data when different elevator cars have sundries. Training the semantic segmentation network by using the calibrated data, and obtaining a semantic segmentation network model after the training is finished; because the target outputs two types, the sigmod cross entropy loss function is used for guiding training, the original picture can be input by installing and deploying the semantic segmentation model, and the corresponding mask area is output.

It is to be understood that the exemplary embodiments described herein are illustrative and not restrictive. Although one or more embodiments of the present invention have been described with reference to the accompanying drawings, it will be understood by those of ordinary skill in the art that various changes in form and details may be made therein without departing from the spirit and scope of the present invention as defined by the following claims.

Claims

1. A deviation early warning method for an installation angle of an elevator camera is characterized by comprising the following steps:

performing image semantic segmentation, namely performing image semantic segmentation on the captured picture to obtain the ground area of the elevator car; the semantic segmentation network output is defined as two types, namely car ground and background, the pixel area occupied by the car ground is calibrated manually by collecting data when different elevator cars are provided with sundries, the semantic segmentation network is trained by using the calibrated data, and a semantic segmentation network model is obtained after the training is finished; because the target outputs two types, the sigmod cross entropy loss function is used for guiding training, the original picture can be input by installing and deploying the semantic segmentation model, and the corresponding mask area is output;

2. The method for warning the deviation of the installation angle of the elevator camera according to claim 1, wherein the steps of detecting the state of the upper door, grabbing a picture, detecting the existence of people, segmenting the semantic meaning of the image, estimating the vertex of the rectangle and measuring the inclination and the coverage rate are detected once every period of time, and whether the angle of the elevator camera is changed or not is judged according to the calculation result of each time.

3. The deviation early warning method for the installation angle of the elevator camera according to claim 1 or 2, characterized in that in the process of judging whether a person exists in the captured picture in the car, the target is defined as the head and the sheltered human body, and whether the head and the human body exist in the picture is identified through target detection; the image semantic segmentation area is the ground at the bottom of the car, and the picture is segmented semantically to obtain the ground area of the car, so that an external rectangle of the picture is obtained, and four vertexes are obtained; the door state identification also utilizes a target detection mode, corresponding alarm prompt tags are pasted at the higher position of the elevator door, and the door state detection is realized by detecting the positions among the alarm prompt tags.

4. The deviation warning method of the installation angle of the elevator camera as claimed in claim 3, characterized in that the semantic segmentation is classification at the pixel level, the pixels belonging to the same class are classified into one class, the Encoder-Decoder architecture of U-net is adopted, and the spatial pyramid boosting module in PSPnet is simultaneously used to form the semantic segmentation network, the method of single model and single scale is adopted, the Encoder input picture passes through 4 convolution blocks to perform 4 times of downsampling, then the spatial pyramid boosting operation is performed on the 16 x 16 feature map, the newly generated feature map is spliced with the previous 16 x 16 feature map, the Decoder performs channel compression by using 1 x 1 channel convolution, the upsampling is performed by using bilinear interpolation, the upsampling obtained feature map is summed with the previous 32 x 32 feature map, if the number of the previous 32 x 32 feature map channels is not consistent, adjusting to be consistent through 1 × 1 channel convolution, repeating the operation twice subsequently to obtain a 128 × 128 feature map, then performing last up-sampling to obtain a 256 × 256 feature map, finally obtaining a 2 × 256 × 256 prediction template through a plurality of convolution layers, and comparing the prediction template with a calibration template value, wherein 2 represents two categories, namely car ground and background.

5. The utility model provides an elevator camera installation angle's skew early warning system which characterized in that includes:

the image semantic segmentation unit is used for performing image semantic segmentation on the captured picture to obtain an elevator car ground area; the semantic segmentation network output is defined as two types, namely car ground and background; the method comprises the steps that data of different elevator cars with or without sundries are collected, and pixel areas occupied by the car ground are calibrated manually; training the semantic segmentation network by using the calibrated data, and obtaining a semantic segmentation network model after the training is finished; because the target outputs two types, the sigmod cross entropy loss function is used for guiding training, the original picture can be input by installing and deploying the semantic segmentation model, and the corresponding mask area is output;

6. The system of claim 5, wherein the steps of detecting the state of the door, capturing the image, detecting the existence of people, segmenting the semantic of the image, estimating the vertex of the rectangle, and measuring the inclination and the coverage rate are detected once every a period of time, and whether the angle of the elevator camera is changed or not is judged according to the calculation result of each time.

7. The deviation early warning system of the installation angle of the elevator camera according to claim 5 or 6, characterized in that in the process of judging whether a person exists in the captured picture in the car, the target is defined as a head and a sheltered human body, and whether the head and the human body exist in the picture is identified through target detection; the image semantic segmentation area is the ground at the bottom of the car, and the picture is segmented semantically to obtain the ground area of the car, so that an external rectangle of the picture is obtained, and four vertexes are obtained; the door state identification also utilizes a target detection mode, corresponding alarm prompt tags are pasted at the higher position of the elevator door, and the door state detection is realized by detecting the positions among the alarm prompt tags.

8. The system of claim 7, wherein the semantic segmentation is a classification at a pixel level, pixels belonging to the same class are classified into one class, an Encoder-Decoder architecture of U-net is adopted, and a spatial pyramid boosting module in PSPnet is used to form a semantic segmentation network, a single-model single-scale method is adopted, an encoding portion Enencoder input picture is passed through 4 convolution blocks to perform 4 times of downsampling, then a spatial pyramid boosting operation is performed on a 16 x 16 feature map, a newly generated feature map is spliced with a previous 16 x 16 feature map, a decoding portion Decode is convolved by a 1 x 1 channel to perform channel compression, bilinear interpolation is used to perform upsampling, an upsampling operation is performed on the upsampled feature map and a previous 32 x 32 feature map, and if the number of the previous 32 x 32 feature maps is inconsistent, adjusting to be consistent through 1 × 1 channel convolution, repeating the operation twice subsequently to obtain a 128 × 128 feature map, then performing last up-sampling to obtain a 256 × 256 feature map, finally obtaining a 2 × 256 × 256 prediction template through a plurality of convolution layers, and comparing the prediction template with a calibration template value, wherein 2 represents two categories, namely car ground and background.