CN113095301A - Road occupation operation monitoring method, system and server - Google Patents

Road occupation operation monitoring method, system and server Download PDF

Info

Publication number
CN113095301A
CN113095301A CN202110554463.2A CN202110554463A CN113095301A CN 113095301 A CN113095301 A CN 113095301A CN 202110554463 A CN202110554463 A CN 202110554463A CN 113095301 A CN113095301 A CN 113095301A
Authority
CN
China
Prior art keywords
booth
target
iou
anchor
frame
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN202110554463.2A
Other languages
Chinese (zh)
Other versions
CN113095301B (en
Inventor
杨帆
张凯翔
胡建国
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Xiaoshi Technology Jiangsu Co ltd
Original Assignee
Nanjing Zhenshi Intelligent Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Nanjing Zhenshi Intelligent Technology Co Ltd filed Critical Nanjing Zhenshi Intelligent Technology Co Ltd
Priority to CN202110554463.2A priority Critical patent/CN113095301B/en
Publication of CN113095301A publication Critical patent/CN113095301A/en
Application granted granted Critical
Publication of CN113095301B publication Critical patent/CN113095301B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V20/00Scenes; Scene-specific elements
    • G06V20/40Scenes; Scene-specific elements in video content
    • G06V20/41Higher-level, semantic clustering, classification or understanding of video scenes, e.g. detection, labelling or Markovian modelling of sport events or news items
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/21Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
    • G06F18/214Generating training patterns; Bootstrap methods, e.g. bagging or boosting
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/24Classification techniques
    • G06F18/241Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q50/00Information and communication technology [ICT] specially adapted for implementation of business processes of specific business sectors, e.g. utilities or tourism
    • G06Q50/10Services
    • G06Q50/26Government or public services

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Data Mining & Analysis (AREA)
  • Business, Economics & Management (AREA)
  • General Physics & Mathematics (AREA)
  • Artificial Intelligence (AREA)
  • Evolutionary Biology (AREA)
  • Evolutionary Computation (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • General Engineering & Computer Science (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Tourism & Hospitality (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Software Systems (AREA)
  • Multimedia (AREA)
  • Development Economics (AREA)
  • Educational Administration (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Economics (AREA)
  • General Health & Medical Sciences (AREA)
  • Human Resources & Organizations (AREA)
  • Marketing (AREA)
  • Primary Health Care (AREA)
  • Strategic Management (AREA)
  • General Business, Economics & Management (AREA)
  • Image Analysis (AREA)

Abstract

The invention provides a method, a system and a server for monitoring road occupation operation, which comprises the steps of firstly defining an illegal road occupation operation area, taking 1 frame of picture at intervals of N frames aiming at a monitoring video, detecting a booth in a related area through a target detection algorithm, and extracting bbox coordinate information of a related booth target; then calculating the intersection ratio of all target rectangular frames in the current frame and the previous frame of picture and sequencing; then, acquiring a current detection target, and adding the target picture into a bottom library when the intersection ratio of the detection target is smaller than all thresholds (the first occurrence); and finally, when the intersection ratio of the detection target is greater than the threshold value and is continuously processed for M times, judging that the illegal lane occupation operation is carried out. By the monitoring of the invention, the problems of difficult management, time consumption and labor consumption of the occupied road operation can be solved. Compared with the prior art, the method for monitoring the track occupation operation has the advantages of higher accuracy, lower false detection, strong monitoring real-time performance, convenience in implementation and low deployment cost.

Description

Road occupation operation monitoring method, system and server
Technical Field
The invention relates to the technical field of image processing, in particular to a method, a system and a server for monitoring occupied road operation based on target detection and position constraint.
Background
The lane occupation operation refers to the behavior that an operator occupies public places such as urban roads, bridges and urban squares to buy and sell goods or services in a profitable manner. The behavior affects urban traffic safety, and causes urban environmental pollution, which brings inconvenience to the traveling and daily life of surrounding citizens. Due to the limited manpower resources of city management, the illegal road occupation operation is difficult to be radically controlled only by a manual patrol mode and the efficiency is low. The manual patrol is carried out, at least 3-5 workers are put into each street in the district daily to carry out catering open-air barbecue and patrol work under the road occupying operation condition, and the management mode of staring at and guarding and patrol depending on manpower is low in efficiency and quality and extremely easy to repeat the renovation effect.
In the prior art, an end-to-end-based deep learning method is tried to be adopted to detect the illegal occupation booths in public fields such as a specific road in real time, and the detection result is analyzed and fed back in real time, so that the working intensity of workers can be reduced to a great extent, and the working efficiency is improved.
In the prior art, recognition of the occupied road operation based on a clustering algorithm is proposed, for example, an optimized DBSCAN clustering algorithm is adopted to process collected urban street occupied road operation data so as to finally determine an urban street occupied road operation high-rise area. However, in the scheme, an optimized DBSCAN clustering algorithm is adopted to determine the occupied road operation area of the urban street, but the occupied road operation behavior cannot be found in time, and a city manager is informed to process the occupied road operation event in time.
In the prior art, recognition of the occupied road operation is realized through a target detection technology, but the conditions of high system false alarm rate, low accuracy rate and the like can be caused through simple target recognition, so that frequent alarm is caused, and higher false alarm rate is generated.
Prior art documents:
patent document 1: the patent application with the publication number of CN108304798A and the publication date of 2018, 7 and 20 discloses a street order event video detection method based on deep learning and motion consistency, wherein a detection system for an outdoor business event and a lane occupation management event is designed by jointly judging the event through multiple conditions in a mode of combining a target detection technology in a static video frame and a target behavior analysis technology in a dynamic video in the field of video intelligent analysis, and the detection method adopts multi-model fusion judgment combining target identification and behavior identification, so that the models are relatively complex;
patent document 2: the patent application with the publication number of CN108831158A and the publication date of 2018, 11, 16 discloses an illegal parking monitoring method, device and electronic terminal, which are used for extracting a feature vector in an image of a road-occupied business district based on an SIFT algorithm and determining whether illegal parking exists or not through similarity calculation and judgment. In the scheme, the characteristics of the occupied road operation area are determined only by using a characteristic vector extraction and matching mode, and when the scheme is applied to booth identification, compared with the illegal parking identification of an automobile on a main road, the frequent alarm and false alarm are more easily caused;
patent document 3: patent application publication No. CN111931864A, published as 11/13/2020, discloses a method and system for multiple optimization of target detectors based on vertex distance and intersection ratio, comprising: acquiring an intersection ratio IOU of an anchor and a marking frame; based on the normalized distance coefficient distance between the anchor and the four vertexes corresponding to the labeling frame, correcting the cross-over ratio IOU to obtain a corrected cross-over ratio P-IOU; redefining positive and negative samples based on the modified cross-over ratio P-IOU; and training a detector based on the redefined positive and negative sample classifications. By the method of multiple optimization of the target detector based on the vertex distance and the intersection ratio, the classification performance of the detector is optimized by optimizing a matching mechanism, and false detection is reduced.
Disclosure of Invention
The invention aims to provide a method for monitoring the operation of occupying a road based on target detection and position constraint, which comprises the following steps:
extracting the continuous images of the input video stream according to a preset period to obtain images P according to a time sequencei,PiRepresenting the ith frame image, wherein the value of i is a natural number which is more than or equal to 1;
detecting the ith frame image P by adopting a target detector based on central point position constraintiBooth Q ini,QiRepresents the set of all jth booths in the ith frame image, and extracts the detected booths QiBbox coordinate information of (a); the target detector based on the central point position constraint is a target detector trained based on correction of anchor coordinates, vertex coordinates and central point coordinates of a labeling frame;
from the ith frame picture PiInitially, the detected booth Q is judgediWhether the bbox coordinate information is in the range of a pre-defined illegal lane occupying operation area Z or not, and if the bbox coordinate information is in the illegal lane occupying operation area Z, entering the next step for identification processing; the identification processing procedure comprises the following steps:
in the i +1 th frame picture Pi+1Detected booth Qi+1In the middle, based on bbox coordinate information, calculating the stall Qi+1Each booth in (1) and booth QiThe cross-over ratio of each booth in (a); and judging that:
1) if the booth Qi+1One of the booths Qi+1,kAs an object of identification, with booth QiThe intersection ratio of each booth in the image processing system is less than a set first threshold value, the (i + 1) th frame image P is processedi+1The booth Q ini+1,kStoring the data into a target base; each booth stored in the target base library is a target;
2) if the booth Qi+1One of the booths Qi+1,jAs an object of identification, with booth QiOne of the booths Qi,lThe intersection ratio of the identification object and the booth identified in the previous frame image exceeds a set second threshold value, in the continuous m frames of images from the i +1 th frame image, the intersection ratio of the identification object and the booth identified in the previous frame image exceeds the second threshold value, the road occupation management is judged to send early warning information, otherwise, the target is abandoned, and no early warning is carried out.
Preferably, the target detector based on the central point position constraint is a pre-trained target detector, and the training process includes:
firstly, calculating the intersection ratio IOU of an anchor and a marking frame;
then, calculating a center point distance D and a diagonal distance c based on the vertex coordinates and the center point coordinates of the anchor and the labeling frame, and correcting the cross parallel comparison IOU according to the center point distance D and the diagonal distance c to obtain a cross parallel comparison D-IOU;
redefining positive and negative samples based on the modified orthogonal sum-of-sums ratio D-IOU; and
training the detector based on the redefined positive and negative sample classification to obtain a target detector based on central point position constraint.
Preferably, the redefining positive and negative samples based on the modified orthogonal union ratio D-IOU includes:
the maximum value of the modified orthogonal sum-of-difference D-IOU corresponding to each anchor and the category of the corresponding labeling frame gt are reserved, if the maximum value of the modified orthogonal sum-of-difference D-IOU is 0, the anchor category is judged to be background, and the anchor is judged to be a negative sample;
comparing the maximum value of the modified cross-comparison D-IOU corresponding to all the anchors with a preset training threshold, keeping the category of a labeling frame gt matched with the anchors when the maximum value of the modified cross-comparison D-IOU corresponding to the anchors is larger than the preset training threshold, and judging the anchors as positive samples; and for the anchor corresponding to the modification sum smaller than or equal to the maximum value of the D-IOU, modifying the class of the labeling frame gt matched with the anchor into the background, and judging that the anchor is a negative sample.
Preferably, the image P in the (i + 1) th framei+1Detected booth Qi+1In the middle, based on bbox coordinate information, calculating the stall Qi+1Each booth in (1) and booth QiThe cross-over ratio for each booth in (a), comprising:
for detected booth Qi+1In any booth Qi+1,xRespectively calculate the booth Qi+1,xAnd the ith frame picture PiDetected booth Q in (1)iThe cross-over ratio of each booth in the system is obtained, a plurality of cross-over ratio values are obtained and sorted, and the maximum value of the cross-over ratio values is taken as the Q value of the boothi+1,xAnd comparing with the booth identified by the previous frame image.
Preferably, when determining that the road occupation management sends the early warning information, the method further comprises the step of identifying the target base library of the identified object, and the method specifically comprises the following steps:
matching the identification object determined in the step 2) with a target base library, if the matching is successful, judging that the identification object is an illegal lane occupation management, and sending early warning information; otherwise, judging that the illegal occupied road operation is newly migrated.
Preferably, the matching the identified object determined in the step 2) with the target base library includes:
extracting the characteristics of the identified object determined in the step 2), respectively matching with the characteristics of the target stored in the target base, calculating the Euclidean distance between the characteristics of the identified object determined in the step 2) and each target characteristic in the target base, and if the distance is smaller than a set threshold value, judging that the two are the same target.
The second aspect of the present invention further provides a lane occupation operation monitoring system based on target detection and location constraint, including:
one or more processors;
a memory storing instructions that are operable, when executed by the one or more processors, to cause the one or more processors to perform operations comprising processes of the aforementioned lane keep monitoring method based on target detection and location constraints.
The third aspect of the present invention further provides a server, including:
one or more processors;
a memory storing instructions that are operable, when executed by the one or more processors, to cause the one or more processors to perform operations comprising the aforementioned processes of the lane operation monitoring method based on target detection and location constraints.
Compared with the prior art, the monitoring method can solve the problems of difficult management, time consumption and labor consumption of the occupied road operation. Compared with the prior art, the method for monitoring the track occupation operation has the advantages of higher accuracy, lower false detection, strong monitoring real-time performance, convenience in implementation and low deployment cost.
It should be understood that all combinations of the foregoing concepts, as well as additional concepts described in greater detail below, may be considered as part of the presently disclosed subject matter, provided such concepts are not mutually inconsistent. In addition, all combinations of claimed subject matter are considered a part of the presently disclosed subject matter.
Drawings
Fig. 1 is a schematic flow diagram of a lane occupancy monitoring method according to an exemplary embodiment of the present invention.
FIG. 2 is a schematic diagram of modifying a cross-correlation based on a center position constraint according to a preferred embodiment of the invention.
Fig. 3-5 are exemplary illustrations of the preemption identification scenario at various time periods.
Detailed Description
In this disclosure, aspects of the present invention are described with reference to the accompanying drawings, in which a number of illustrative embodiments are shown. Embodiments of the present disclosure are not necessarily intended to include all aspects of the invention. It should be appreciated that the various concepts and embodiments described above, as well as those described in greater detail below, may be implemented in any of numerous ways, as the disclosed concepts and embodiments are not limited to any one implementation. In addition, some aspects of the present disclosure may be used alone, or in any suitable combination with other aspects of the present disclosure.
In combination with the method for monitoring the lane occupation operation based on the target detection and the position constraint shown in fig. 1, the method based on the deep learning is adopted to detect the lane occupation operation, whether a booth is located in a designated lane occupation operation area, whether the same booth is continuously occupied, and whether the target of the lane occupation operation is the same booth are analyzed and early warned through the continuous detection of the booth target in the previous frame image and the next frame image, and an accumulation mechanism is used for processing, so that the problems of false detection and frequent warning are avoided and reduced.
In an optional embodiment, the lane occupation operation monitoring method based on target detection and position constraint can pre-define an illegal lane occupation operation area, in the monitoring process, 1 frame of picture is taken for every N frames of a monitoring video, a booth in a related area is detected through a target detection algorithm, and bbox coordinate information of a related booth target is extracted; then calculating the intersection ratio of all target rectangular frames in the current frame and the previous frame of picture and sequencing; then, acquiring a current detection target, and adding the target picture into a bottom library when the intersection ratio of the detection target is smaller than all thresholds (the first occurrence); and when the intersection ratio of the detection target is greater than the threshold value and is continuously processed for M times, judging that the illegal lane occupation operation is carried out. By the monitoring of the invention, the problems of difficult management, time consumption and labor consumption of the occupied road operation can be solved. Compared with the prior art, the method for monitoring the track occupation operation has the advantages of higher accuracy, lower false detection, strong monitoring real-time performance, convenience in implementation and low deployment cost.
The method for monitoring the operation of the occupied road based on the target detection and the position constraint, which is combined with the example shown in FIG. 1, comprises the following steps:
extracting the continuous images of the input video stream according to a preset period to obtain images P according to a time sequencei,PiRepresenting the ith frame image, wherein the value of i is a natural number which is more than or equal to 1;
detecting the ith frame image P by adopting a target detector based on central point position constraintiBooth Q ini,QiRepresents the set of all jth booths in the ith frame image, and extracts the detected booths QiBbox coordinate information of (a); the target detector based on the central point position constraint is a target detector trained based on correction of anchor coordinates, vertex coordinates and central point coordinates of a labeling frame;
from the ith frame picture PiInitially, the detected booth Q is judgediWhether the bbox coordinate information is in the range of a pre-defined illegal lane occupying operation area Z or not, and if the bbox coordinate information is in the illegal lane occupying operation area Z, entering the next step for identification processing; the identification processing procedure comprises the following steps:
in the i +1 th frame picture Pi+1Detected booth Qi+1In the middle, based on bbox coordinate information, calculating the stall Qi+1Each booth in (1) and booth QiThe cross-over ratio of each booth in (a); and judging that:
1) if the booth Qi+1One of the booths Qi+1,kAs an object of identification, with booth QiThe intersection ratio of each booth in the image processing system is less than a set first threshold value, the (i + 1) th frame image P is processedi+1The booth Q ini+1,kStoring the data into a target base; each booth stored in the target base library is a target;
2) if the booth Qi+1One of the booths Qi+1,jAs an object of identification, with booth QiOne of the booths Qi,lThe intersection ratio of the identification object and the booth identified in the previous frame image exceeds a set second threshold value, in the continuous m frames of images from the i +1 th frame image, the intersection ratio of the identification object and the booth identified in the previous frame image exceeds the second threshold value, the road occupation management is judged to send early warning information, otherwise, the target is abandoned, and no early warning is carried out.
The various steps and aspects of the embodiments described above are described in more detail below with reference to the drawings.
Frame image extraction
In an embodiment of the invention, the images of the input video stream are extracted continuously according to a preset period, for example, an extraction period of 25ms, and the images P in time series are obtainedi,PiAnd (3) representing the ith frame image, wherein the value of i is a natural number which is more than or equal to 1, so that an image in the subsequent processing process is obtained.
It should be understood that the video stream is continuously input, and the process of image extraction according to the preset period is also continuously performed.
Booth target detection
In the embodiment of the invention, the ith frame image P is detected by adopting a target detector based on central point position constraintiBooth Q ini,QiRepresents the set of all jth booths in the ith frame image, and extracts the detected booths QiBbox coordinate information of.
By way of example, the target detector based on the central point position constraint is a target detector trained based on correction of anchor coordinates, vertex coordinates and central point coordinates of a labeling frame, compared with a traditional target detector, the target detector is subjected to central position constraint in the training process of the detector, so that a prediction frame and the labeling frame are overlapped as much as possible, the prediction frame with the highest degree of overlap is selected as a final prediction result through the central position constraint correction, and the accuracy of model learning and training and the accuracy of prediction are improved.
In an alternative embodiment, the target detector constrained based on the position of the center point is a pre-trained target detector, and the training process includes:
firstly, calculating the intersection ratio IOU of an anchor (anchor) and a marking box (gt);
then, calculating a central point distance D and a diagonal distance c based on the vertex coordinates and the central point coordinates of the anchor (anchor) and the labeling frame (gt), and correcting the cross-correlation IOU according to the central point distance D and the diagonal distance c to obtain a cross-correlation D-IOU;
redefining positive and negative samples based on the modified orthogonal sum-of-sums ratio D-IOU; and
training the detector based on the redefined positive and negative sample classification to obtain a target detector based on central point position constraint.
As shown in fig. 2, the cross-over comparison IOU is corrected according to the center point distance d and the diagonal line distance c, and the specific implementation includes the following steps:
and correcting according to the following formula to obtain a corrected parallel ratio D-IOU:
Figure DEST_PATH_IMAGE002
the distance d between the center points represents the distance between the center points of the anchor frame and the marking frame, and the diagonal distance c represents the diagonal distance of the minimum closure area which can simultaneously contain the anchor frame and the marking frame;
Figure DEST_PATH_IMAGE004
representing a hyperparameter for controlling the weight of the latter half ratio, which is equivalent to the original IOU when its value is large;
the original IOU is represented as follows:
Figure DEST_PATH_IMAGE006
wherein, the super parameter defaults to 1, and is used for controlling the weight of the latter half ratio, and is equivalent to the original IOU when the value is large.
In fig. 2, the rectangular boxes formed by two solid line portions represent the anchor and the label box gt, respectively. The purpose is to make two frames coincide, but there is usually a little deviation and dislocation in the actual prediction, and the purpose of the center distance corrected D-IOU proposed is to select the prediction frame with the highest coincidence degree as the final prediction result.
On the basis, redefining positive and negative samples for target detector training based on the modified orthogonal union ratio D-IOU, and the redefining comprises the following steps:
the maximum value of the modified orthogonal sum-of-difference D-IOU corresponding to each anchor and the category of the corresponding labeling frame gt are reserved, if the maximum value of the modified orthogonal sum-of-difference D-IOU is 0, the anchor category is judged to be background, and the anchor is judged to be a negative sample;
comparing the maximum value of the modified cross-comparison D-IOU corresponding to all the anchors with a preset training threshold, keeping the category of a labeling frame gt matched with the anchors when the maximum value of the modified cross-comparison D-IOU corresponding to the anchors is larger than the preset training threshold, and judging the anchors as positive samples; and for the anchor corresponding to the modification sum smaller than or equal to the maximum value of the D-IOU, modifying the class of the labeling frame gt matched with the anchor into the background, and judging that the anchor is a negative sample.
Therefore, the redefined positive sample and the redefined negative sample are used as training sets, the detectors are classified and trained on the basis of the existing target detection algorithm, and the precision and the accuracy of the classified detectors are improved, so that the recognition precision is improved under the complex scene of the track occupation management of the invention, and as shown in fig. 3-5, typical representations of recognition scenes in different time periods are shown.
Booth target location determination
In this embodiment, the bbox coordinate information of the booth object identified from the image is obtained, and it can be determined whether the booth object is in the predefined illegal lane-occupying business area Z according to the bbox coordinate information of the booth object.
In an alternative embodiment, we use the cross-over ratio calculation to realize from the ith frame image PiInitially, the detected booth is judgedQiWhether the bbox coordinate information is in the range of a pre-defined illegal lane occupation operating area Z or not specifically comprises the following steps:
calculating the detected booth QiAnd comparing the bbox coordinate information with the preset violation lane occupying operation area Z, if the value of the cross-over ratio exceeds a preset value, judging that the bbox coordinate information is positioned in the violation lane occupying operation area Z, and otherwise, judging that the bbox coordinate information is not positioned in the violation lane occupying operation area Z.
Booth target road occupation operation identification processing
In this embodiment, if it is determined that the booth target extracted from the image is in the pre-defined illegal lane-occupation management area Z, the lane-occupation management recognition process is further performed, and the i +1 th frame image P is processedi+1Detected booth Qi+1And the image P of the i-th frame which is the previous frameiDetected booth QiThe booth in the system is compared to determine whether the booth belongs to road occupation operation or temporary parking, and the method specifically comprises the following steps:
in the i +1 th frame picture Pi+1Detected booth Qi+1In the middle, based on bbox coordinate information, calculating the stall Qi+1Each booth in (1) and the i-th frame image PiDetected booth QiThe cross-over ratio of each booth in (a); and judging that:
1) if the booth Qi+1One of the booths Qi+1,kAs an object of identification, with booth QiThe intersection ratio of each booth in the image processing system is less than a set first threshold value, the (i + 1) th frame image P is processedi+1The booth Q ini+1,kStoring the data into a target base; each booth stored in the target base library is a target;
2) if the booth Qi+1One of the booths Qi+1,jAs an object of identification, with booth QiOne of the booths Qi,lThe intersection ratio of the identification object and the booth identified in the previous frame image exceeds a set second threshold value, in the continuous m frames of images from the i +1 th frame image, the intersection ratio of the identification object and the booth identified in the previous frame image exceeds the second threshold value, the road occupation management is judged to send early warning information, otherwise, the target is abandoned, and no early warning is carried out.
Wherein, the booth Qi+1,jAnd booth Qi+1,kThe subscripts j and k are used for distinguishing purposes only and have no special meaning, i.e., the booth Qi+1To a booth.
In the scene shown in fig. 3-5, there is an overlapping phenomenon of booth detection frames, so it is preferable that we calculate the intersection ratio of booths in two adjacent frames of images, and calculate one by one and take the maximum value, that is:
in the i +1 th frame picture Pi+1Detected booth Qi+1In the middle, based on bbox coordinate information, calculating the stall Qi+1Each booth in (1) and booth QiThe cross-over ratio for each booth in (a), comprising:
for detected booth Qi+1In any booth Qi+1,xRespectively calculate the booth Qi+1,xAnd the ith frame picture PiDetected booth Q in (1)iThe cross-over ratio of each booth in the system is obtained, a plurality of cross-over ratio values are obtained and sorted, and the maximum value of the cross-over ratio values is taken as the Q value of the boothi+1,xAnd comparing with the booth identified by the previous frame image.
Wherein, the booth Qi+1,jThe subscript x, used for distinction only, has no special meaning, i.e. booth Qi+1To a booth.
For example, for the i +1 th frame image P of the current framei+1Detected booth Qi+1A booth object A in the system, and the previous frame image PiDetected booth Q in (1)iThe intersection ratio of each booth is calculated, and the vertex coordinates and bbox coordinates of the detection frame of the detected booth are determined, so that the intersection ratio Q of each booth can be calculatediThe value of the cross-over ratio of each booth is sorted, and the maximum value of the cross-over ratio is taken as the cross-over ratio of the booth target A and the booth identified by the previous frame image.
Wherein if calculated with booth QiThe value of the cross-over ratio of each booth in the system is less than a preset first threshold value, the booth target is determined as a new booth targetAnd adding the target, namely the target does not appear in the previous frame of image, into the target base library.
If, with the booth Qi+1,xAnd comparing the intersection ratio of the booth identified by the previous frame image with a preset second threshold value, exceeding the set second threshold value, and in m frames of continuous images from the i +1 frame image, indicating that the booth identified by the identification object is an occupied booth target and giving an alarm, wherein the intersection ratio of the identification object and the booth identified by the previous frame image exceeds the second threshold value.
Same target recognition decision
When the occupied road management is judged to send the early warning information, the method also comprises the step of carrying out target base identification on the identification object, and specifically comprises the following steps:
matching the identification object determined in the step 2) with a target base library, if the matching is successful, judging that the identification object is an illegal lane occupation management, and sending early warning information; otherwise, judging that the illegal occupied road operation is newly migrated.
In an optional example, matching the identified object determined in step 2) with the target base library includes:
extracting the characteristics of the identified object determined in the step 2), respectively matching with the characteristics of the target stored in the target base, calculating the Euclidean distance between the characteristics of the identified object determined in the step 2) and each target characteristic in the target base, and if the distance is smaller than a set threshold value, judging that the two are the same target.
Track occupation operation monitoring system based on target detection and position constraint
According to the disclosed embodiment of the present invention, there is also provided a lane occupation operation monitoring system based on target detection and location constraint, comprising:
one or more processors;
a memory storing instructions that are operable, when executed by the one or more processors, to cause the one or more processors to perform operations comprising the processes of the lane management monitoring method based on target detection and location constraints of any of the preceding embodiments.
In an alternative embodiment, the track-occupied operation monitoring system based on target detection and location constraint according to the embodiment of the present invention may be implemented based on a computer system, where the computer system has a system architecture design including a processor, a memory, and a data bus, so as to implement logic and functions such as data transceiving and data processing on this basis.
Server
According to a disclosed embodiment of the invention, a server comprises:
one or more processors;
a memory storing instructions that are operable, when executed by the one or more processors, to cause the one or more processors to perform operations comprising the processes of the lane management monitoring method based on target detection and location constraints of any of the preceding embodiments.
In an alternative embodiment, the server provided in the present invention may be implemented based on a local server or a cloud server, and whether the server is deployed locally or in a cloud, the server has a system architecture formed by components such as a processor, a memory, a network transmission unit, and the like, so as to implement logic and functions such as data transceiving and data processing on this basis.
Although the present invention has been described with reference to the preferred embodiments, it is not intended to be limited thereto. Those skilled in the art can make various changes and modifications without departing from the spirit and scope of the invention. Therefore, the protection scope of the present invention should be determined by the appended claims.

Claims (10)

1. A method for monitoring the occupied road operation is characterized by comprising the following steps:
extracting the continuous images of the input video stream according to a preset period to obtain images P according to a time sequencei,PiRepresenting the ith frame image, wherein the value of i is a natural number which is more than or equal to 1;
detecting the ith frame image P by adopting a target detector based on central point position constraintiBooth Q ini,QiRepresents the set of all jth booths in the ith frame image, and extracts the detected booths QiBbox coordinate information of (a); the target detector based on the central point position constraint is a target detector trained based on correction of anchor coordinates, vertex coordinates and central point coordinates of a labeling frame;
from the ith frame picture PiInitially, the detected booth Q is judgediWhether the bbox coordinate information is in the range of a pre-defined illegal lane occupying operation area Z or not, and if the bbox coordinate information is in the illegal lane occupying operation area Z, entering the next step for identification processing; the identification processing procedure comprises the following steps:
in the i +1 th frame picture Pi+1Detected booth Qi+1In the middle, based on bbox coordinate information, calculating the stall Qi+1Each booth in (1) and booth QiThe cross-over ratio of each booth in (a); and judging that:
1) if the booth Qi+1One of the booths Qi+1,kAs an object of identification, with booth QiThe intersection ratio of each booth in the image processing system is less than a set first threshold value, the (i + 1) th frame image P is processedi+1The booth Q ini+1,kStoring the data into a target base; each booth stored in the target base library is a target;
2) if the booth Qi+1One of the booths Qi+1,jAs an object of identification, with booth QiOne of the booths Qi,lThe intersection ratio of the identification object and the booth identified in the previous frame image exceeds a set second threshold value, in the continuous m frames of images from the i +1 th frame image, the intersection ratio of the identification object and the booth identified in the previous frame image exceeds the second threshold value, the road occupation management is judged to send early warning information, otherwise, the target is abandoned, and no early warning is carried out.
2. The method of claim 1, wherein the target detector based on the center point position constraint is a pre-trained target detector, and the training process comprises:
firstly, calculating the intersection ratio IOU of an anchor and a marking frame;
then, calculating a center point distance D and a diagonal distance c based on the vertex coordinates and the center point coordinates of the anchor and the labeling frame, and correcting the cross parallel comparison IOU according to the center point distance D and the diagonal distance c to obtain a cross parallel comparison D-IOU;
redefining positive and negative samples based on the modified orthogonal sum-of-sums ratio D-IOU; and
training the detector based on the redefined positive and negative sample classification to obtain a target detector based on central point position constraint.
3. The method of claim 2, wherein the modifying the IOU based on the intersection of the center point distance d and the diagonal distance c comprises:
and correcting according to the following formula to obtain a corrected parallel ratio D-IOU:
Figure DEST_PATH_IMAGE001
the distance d between the center points represents the distance between the center points of the anchor frame and the marking frame, and the diagonal distance c represents the diagonal distance of the minimum closure area which can simultaneously contain the anchor frame and the marking frame;
Figure 887554DEST_PATH_IMAGE002
representing a hyper-parameter;
the IOU is represented as follows:
Figure DEST_PATH_IMAGE003
where anchor represents an anchor and gt represents a callout box.
4. The method of claim 1, wherein the redefining positive and negative samples based on modified union ratio D-IOU comprises:
the maximum value of the modified orthogonal sum-of-difference D-IOU corresponding to each anchor and the category of the corresponding labeling frame gt are reserved, if the maximum value of the modified orthogonal sum-of-difference D-IOU is 0, the anchor category is judged to be background, and the anchor is judged to be a negative sample;
comparing the maximum value of the modified cross-comparison D-IOU corresponding to all the anchors with a preset training threshold, keeping the category of a labeling frame gt matched with the anchors when the maximum value of the modified cross-comparison D-IOU corresponding to the anchors is larger than the preset training threshold, and judging the anchors as positive samples; and for the anchor corresponding to the modification sum smaller than or equal to the maximum value of the D-IOU, modifying the class of the labeling frame gt matched with the anchor into the background, and judging that the anchor is a negative sample.
5. The method for monitoring busy management according to claim 1, wherein said determining of detected booth QiWhether the bbox coordinate information is in the range of a pre-defined illegal lane occupation operating area Z or not comprises the following steps:
calculating the detected booth QiAnd comparing the bbox coordinate information with the preset violation lane occupying operation area Z, if the value of the cross-over ratio exceeds a preset value, judging that the bbox coordinate information is positioned in the violation lane occupying operation area Z, and otherwise, judging that the bbox coordinate information is not positioned in the violation lane occupying operation area Z.
6. The method for monitoring the busy management according to claim 1, wherein the image P in the i +1 th frame is a picture Pi+1Detected booth Qi+1In the middle, based on bbox coordinate information, calculating the stall Qi+1Each booth in (1) and booth QiThe cross-over ratio for each booth in (a), comprising:
for detected booth Qi+1In any booth Qi+1,xRespectively calculate the booth Qi+1,xAnd the ith frame picture PiDetected booth Q in (1)iThe cross-over ratio of each booth in the system is obtained, a plurality of cross-over ratio values are obtained and sorted, and the maximum value of the cross-over ratio values is taken as the Q value of the boothi+1,xAnd comparing with the booth identified by the previous frame image.
7. The method for monitoring the busy management according to claim 1, wherein when determining that the busy management sends the warning information, the method further comprises a step of performing target base identification on the identified object, and specifically comprises the following steps:
matching the identification object determined in the step 2) with a target base library, if the matching is successful, judging that the identification object is an illegal lane occupation management, and sending early warning information; otherwise, judging that the illegal occupied road operation is newly migrated.
8. The method for monitoring busy management according to claim 7, wherein said matching the identified objects determined in said step 2) with the target base includes:
extracting the characteristics of the identified object determined in the step 2), respectively matching with the characteristics of the target stored in the target base, calculating the Euclidean distance between the characteristics of the identified object determined in the step 2) and each target characteristic in the target base, and if the distance is smaller than a set threshold value, judging that the two are the same target.
9. A system for monitoring the operation of a road occupancy comprising:
one or more processors;
a memory storing instructions that are operable, when executed by the one or more processors, to cause the one or more processors to perform operations comprising the process of the duty management monitoring method of any of claims 1-8.
10. A server, comprising:
one or more processors;
a memory storing instructions that are operable, when executed by the one or more processors, to cause the one or more processors to perform operations comprising the process of the duty management monitoring method of any of claims 1-8.
CN202110554463.2A 2021-05-21 2021-05-21 Road occupation operation monitoring method, system and server Active CN113095301B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202110554463.2A CN113095301B (en) 2021-05-21 2021-05-21 Road occupation operation monitoring method, system and server

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202110554463.2A CN113095301B (en) 2021-05-21 2021-05-21 Road occupation operation monitoring method, system and server

Publications (2)

Publication Number Publication Date
CN113095301A true CN113095301A (en) 2021-07-09
CN113095301B CN113095301B (en) 2021-08-31

Family

ID=76664613

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202110554463.2A Active CN113095301B (en) 2021-05-21 2021-05-21 Road occupation operation monitoring method, system and server

Country Status (1)

Country Link
CN (1) CN113095301B (en)

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113505767A (en) * 2021-09-10 2021-10-15 南通辑兴紧固件科技有限公司 Cargo vehicle lane occupying operation behavior detection method and system based on cloud computing
CN113947620A (en) * 2021-10-22 2022-01-18 哈尔滨航天恒星数据***科技有限公司 Design method of overflow management system for stall in market stall based on lineation detection
CN115272984A (en) * 2022-09-29 2022-11-01 江西电信信息产业有限公司 Method, system, computer and readable storage medium for detecting lane occupation operation

Citations (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108304798A (en) * 2018-01-30 2018-07-20 北京同方软件股份有限公司 The event video detecting method of order in the street based on deep learning and Movement consistency
CN109214389A (en) * 2018-09-21 2019-01-15 上海小萌科技有限公司 A kind of target identification method, computer installation and readable storage medium storing program for executing
CN110400332A (en) * 2018-04-25 2019-11-01 杭州海康威视数字技术股份有限公司 A kind of target detection tracking method, device and computer equipment
CN110503095A (en) * 2019-08-27 2019-11-26 中国人民公安大学 Alignment quality evaluation method, localization method and the equipment of target detection model
CN111126252A (en) * 2019-12-20 2020-05-08 浙江大华技术股份有限公司 Stall behavior detection method and related device
CN111931864A (en) * 2020-09-17 2020-11-13 南京甄视智能科技有限公司 Method and system for multiple optimization of target detector based on vertex distance and cross-over ratio
CN112465799A (en) * 2020-12-09 2021-03-09 南京甄视智能科技有限公司 Optimization of object detector and object detection
CN112507813A (en) * 2020-11-23 2021-03-16 北京旷视科技有限公司 Event detection method and device, electronic equipment and storage medium
CN112651293A (en) * 2020-10-30 2021-04-13 华设设计集团股份有限公司 Video detection method for road illegal stall setting event

Patent Citations (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108304798A (en) * 2018-01-30 2018-07-20 北京同方软件股份有限公司 The event video detecting method of order in the street based on deep learning and Movement consistency
CN110400332A (en) * 2018-04-25 2019-11-01 杭州海康威视数字技术股份有限公司 A kind of target detection tracking method, device and computer equipment
CN109214389A (en) * 2018-09-21 2019-01-15 上海小萌科技有限公司 A kind of target identification method, computer installation and readable storage medium storing program for executing
CN110503095A (en) * 2019-08-27 2019-11-26 中国人民公安大学 Alignment quality evaluation method, localization method and the equipment of target detection model
CN111126252A (en) * 2019-12-20 2020-05-08 浙江大华技术股份有限公司 Stall behavior detection method and related device
CN111931864A (en) * 2020-09-17 2020-11-13 南京甄视智能科技有限公司 Method and system for multiple optimization of target detector based on vertex distance and cross-over ratio
CN112651293A (en) * 2020-10-30 2021-04-13 华设设计集团股份有限公司 Video detection method for road illegal stall setting event
CN112507813A (en) * 2020-11-23 2021-03-16 北京旷视科技有限公司 Event detection method and device, electronic equipment and storage medium
CN112465799A (en) * 2020-12-09 2021-03-09 南京甄视智能科技有限公司 Optimization of object detector and object detection

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113505767A (en) * 2021-09-10 2021-10-15 南通辑兴紧固件科技有限公司 Cargo vehicle lane occupying operation behavior detection method and system based on cloud computing
CN113505767B (en) * 2021-09-10 2021-11-16 南通辑兴紧固件科技有限公司 Cargo vehicle lane occupying operation behavior detection method and system based on cloud computing
CN113947620A (en) * 2021-10-22 2022-01-18 哈尔滨航天恒星数据***科技有限公司 Design method of overflow management system for stall in market stall based on lineation detection
CN115272984A (en) * 2022-09-29 2022-11-01 江西电信信息产业有限公司 Method, system, computer and readable storage medium for detecting lane occupation operation

Also Published As

Publication number Publication date
CN113095301B (en) 2021-08-31

Similar Documents

Publication Publication Date Title
CN113095301B (en) Road occupation operation monitoring method, system and server
CN108053427B (en) Improved multi-target tracking method, system and device based on KCF and Kalman
CN108062349B (en) Video monitoring method and system based on video structured data and deep learning
CN106878674B (en) A kind of parking detection method and device based on monitor video
CN108052859B (en) Abnormal behavior detection method, system and device based on clustering optical flow characteristics
CN109977782B (en) Cross-store operation behavior detection method based on target position information reasoning
CN109190444B (en) Method for realizing video-based toll lane vehicle feature recognition system
Song et al. Vehicle behavior analysis using target motion trajectories
KR102122850B1 (en) Solution for analysis road and recognition vehicle license plate employing deep-learning
KR102089298B1 (en) System and method for recognizing multinational license plate through generalized character sequence detection
CN112560807B (en) Crowd gathering detection method based on human head detection
CN113822285A (en) Vehicle illegal parking identification method for complex application scene
CN111259796A (en) Lane line detection method based on image geometric features
CN104463232A (en) Density crowd counting method based on HOG characteristic and color histogram characteristic
CN107578048A (en) A kind of long sight scene vehicle checking method based on vehicle rough sort
CN113191273A (en) Oil field well site video target detection and identification method and system based on neural network
Wu et al. Block-based hough transform for recognition of zebra crossing in natural scene images
Chughtai et al. Traffic Surveillance System: Robust Multiclass Vehicle Detection and Classification
CN115311458B (en) Real-time expressway pedestrian intrusion event detection method based on multi-task learning
Kurniawan et al. Image processing technique for traffic density estimation
CN115187886A (en) Vehicle violation detection method and device and electronic equipment
CN116129343A (en) Fire-fighting channel occupation detection method and device and electronic equipment
Feng et al. Non-motor vehicle illegal behavior discrimination and license plate detection based on real-time video
CN113989749A (en) Station passenger flow monitoring method, electronic device and storage medium
Liu et al. Metro passenger flow statistics based on YOLOv3

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant
CP01 Change in the name or title of a patent holder
CP01 Change in the name or title of a patent holder

Address after: 210000 Longmian Avenue 568, High-tech Park, Jiangning District, Nanjing City, Jiangsu Province

Patentee after: Xiaoshi Technology (Jiangsu) Co.,Ltd.

Address before: 210000 Longmian Avenue 568, High-tech Park, Jiangning District, Nanjing City, Jiangsu Province

Patentee before: NANJING ZHENSHI INTELLIGENT TECHNOLOGY Co.,Ltd.