CN107273873B

CN107273873B - Pedestrian based on irregular video sequence recognition methods and system again

Info

Publication number: CN107273873B
Application number: CN201710572155.6A
Authority: CN
Inventors: 胡瑞敏; 黄文军; 梁超; 阮威健; 王正
Original assignee: SUZHOU INSTITUTE OF WUHAN UNIVERSITY; Wuhan University WHU
Current assignee: SUZHOU INSTITUTE OF WUHAN UNIVERSITY; Wuhan University WHU
Priority date: 2017-07-13
Filing date: 2017-07-13
Publication date: 2019-09-17
Anticipated expiration: 2037-07-13
Also published as: CN107273873A

Abstract

The present invention provides the pedestrian based on irregular video sequence recognition methods and system again, including extracting multiple continuous subsequences from video sequence, obtaining candidate sequence by the stable point in detecting state curve；The reconstructed error that each subsequence is sought using rarefaction representation obtains the noise measurement result of each subsequence；According to the noise measurement of each subsequence as a result, cancelling noise is greater than the subsequence of respective threshold from candidate sequence, candidate pool is constituted；The pedestrian's character representation for carrying out adaptive weighting, obtains the search result based on video sequence.The present invention improves the performance that pedestrian identifies again under irregular sequence, can be widely used for monitoring field, video analysis and other multimedia application, and precision is high, and effect is good, has important market value.

Description

Pedestrian based on irregular video sequence recognition methods and system again

Technical field

The present invention relates to a kind of pedestrians again recognition methods, and in particular to a kind of pedestrian based on irregular video sequence knows again Other method and system.

Background technique

Pedestrian identifies again, i.e., pedestrian interested is identified under different cameras.In recent years, due to its monitoring field, Video analysis and key effect in other multimedia application and be widely noticed.It is usually due to inquiry picture and in library picture It is shot by different camera, therefore, illumination and pedestrian's appearance can have greatly changed, in addition other external rings such as blocking The interference in border, so that pedestrian, which identifies, becomes a quite challenging job again.Currently, the work identified again based on pedestrian can To be roughly divided into two kinds: character representation or the effective distance metric model of study with judgement index.Although being based in recent years The pedestrian of image identifies again has been achieved for significant progress, however, the Limited information having due to single picture, it is difficult to obtain The external appearance characteristic and space time information of pedestrian's robust, so that this method is difficult to obtain under complex scene identification effect well Fruit.In actual application, pedestrian is recorded by video, this means that pedestrian has more under each camera Open continuous image information.Therefore, the performance that pedestrian identifies again is promoted using this sequence image be one naturally select It selects, this has also pushed directly on the research of the recognition methods again of the pedestrian based on video sequence.

It is identified again compared to the pedestrian based on single image, video sequence has information more abundant.Firstly, video sequence Column have space time information abundant；Secondly, pedestrian's visual signature in video sequence is more sufficient, so as to construct more Shandong Pedestrian's appearance of stick indicates；Finally, in the video sequence, it can effectively be mitigated by certain method and be blocked and complex background Bring influences, and is difficult to overcome background in single picture and blocks the influence to identification.Recently, part work causes Power identifies work in pedestrian of the research based on video again, these work mainly construct more robust pedestrian using video sequence Feature or the distance metric method more using the study of pedestrian's video sequence with judgement index.It is existing based on video sequence Recognition methods ignores noise present in video sequence mostly again, and every image in video sequence is utilized on an equal basis.However, In practical applications, the parts of images in video sequence has very strong noise jamming mostly, especially blocks and carries on the back The interference of scape.The present invention by it is this have compared with the video sequence that very noisy interferes be known as irregular video sequence, this sequence pair It is had a very big impact in the character representation of pedestrian, how effectively to handle irregular video sequence is a skill urgently to be resolved Art problem.

Summary of the invention

It is directed to conventional method and is difficult to solve the problems, such as noise jamming in video sequence, the invention proposes a kind of feasible Technical solution, to retain the lesser video sequence of noise, is constructed for detecting and eliminating the irregular subsequence in video More robust pedestrian's character representation improves retrieval effect.

In order to achieve the above objectives, the technical solution adopted by the present invention provides a kind of pedestrian based on irregular video sequence Recognition methods again includes the following steps,

Step 1, the segmentation of video sequence, including being extracted from video sequence M more by the stable point in detecting state curve A continuous subsequence, obtains candidate sequence S；

Step 2, the irregular Sequence Detection based on rarefaction representation, the weight of each subsequence is sought including the use of rarefaction representation Structure error obtains the noise measurement result of each subsequence；

Step 3, the removal of irregular subsequence, including according to the noise measurement of each subsequence of gained in step 2 as a result, from Cancelling noise is greater than the subsequence of respective threshold in candidate sequence S, if the subsequence retained is s₁,...,s_T, constitute candidate pool Q；

Step 4, the pedestrian's character representation for carrying out adaptive weighting, obtains the search result based on video sequence M, including with Lower sub-step,

Step 4.1, subsequence foundation characteristic indicates, including using corresponding base respectively for each subsequence in candidate pool Q Plinth character representation, is denoted as f₁,...,f_N；

Step 4.2, any subsequence s in candidate pool Q is calculated_tWeights omega_tIt is as follows,

Wherein, ω^*For normalization factor, σ_tFor subsequence s_tNoise, t=1 ..., T；

Step 4.3, pedestrian's character representation, including the use of in candidate pool Q subsequence and corresponding weight, weighted calculation row The final character representation of people, then the character representation of video sequence M is as follows,

Search result is obtained according to the character representation of video sequence M.

Moreover, step 1 includes following sub-step,

Step 1.1, to video sequence M={ I₁,I₂,...,I_N, if I_iIt indicates the i-th frame of video, calculates consecutive frame I_i-1 Block information o_i, wherein N is frame number, i=1,2,3 ..., N；

Step 1.2, variation of the state of each frame in video sequence M relative to former frame is calculated using stability metric； Video frame I_iStability φ_iIt is defined as follows,

Wherein, c is constant, and exp is exponential function；

Step 1.3, the sequence stability metric SSM of video sequence M is defined as ε=(φ₂,...,φ_N), if the office in SSM Portion's maximum value is stationary point；

Step 1.4, if the local maximum in detection SSM curve obtains m stationary point, m sub- sequences are extracted from video sequence M Column, obtain candidate sequence S={ s_j, j=1 ..., m；If j-th of stationary point respective image is I_t, according to the sub- sequence of stationary point extraction Arrange s_j=(I_t-L,...,I_t,...,I_t+L), wherein I_tIndicate that t frame image in video sequence M, L are preset segmentation spacing.

Moreover, step 2 includes following sub-step,

Step 2.1, the building of dictionary, including from subsequence s_jIn every frame image extract K topography's block respectively, it is right In subsequence s_jIn k-th of image block in certain image IK=1 ..., K, corresponding dictionary is by except when preceding subsequence s_jIt The image block composition of all video frame same positions in other outer subsequences；

Step 2.2, sparse reconstruct, including by subsequence s_jReconstructed error e_jIt is defined as follows,

Wherein, Z^kFor subsequence s_jIn each image k-th of image block formed matrix, X^kFor corresponding sparse coefficient；

Step 2.3, by subsequence s_jNoise σ_jIt is defined as σ_j=exp ((e_j)²)。

Moreover, erased noise is more than or equal to the corresponding subsequence of specified threshold θ from candidate sequence S, wherein θ in step 3 It is defined as the intermediate value of all subsequence noises.

Pedestrian's weight identifying system based on irregular video sequence that the present invention also provides a kind of, comprises the following modules,

First module, for the segmentation of video sequence, including passing through the stable point in detecting state curve, from video sequence M extracts multiple continuous subsequences, obtains candidate sequence S；

Second module seeks every height including the use of rarefaction representation for the irregular Sequence Detection based on rarefaction representation The reconstructed error of sequence obtains the noise measurement result of each subsequence；

Third module, for the removal of irregular subsequence, including the noise according to each subsequence of gained in the first module Measurement results, cancelling noise is greater than the subsequence of respective threshold from candidate sequence S, if the subsequence retained is s₁,...,s_T, Constitute candidate pool Q；

4th module obtains the retrieval knot based on video sequence M for carrying out pedestrian's character representation of adaptive weighting Fruit, including with lower unit,

First unit is indicated for subsequence foundation characteristic, including using phase respectively for each subsequence in candidate pool Q It answers foundation characteristic to indicate, is denoted as f₁,...,f_N；

Second unit, for calculating any subsequence s in candidate pool Q_tWeights omega_tIt is as follows,

Third unit, be used for pedestrian's character representation, including the use of in candidate pool Q subsequence and corresponding weight, weighting Calculating the final character representation of pedestrian, then the character representation of video sequence M is as follows,

Moreover, the first module include with lower unit,

First unit, for video sequence M={ I₁,I₂,...,I_N, if I_iIt indicates the i-th frame of video, calculates adjacent Frame I_i-1Block information o_i, wherein N is frame number, i=1,2,3 ..., N；

Second unit, for calculating the state of each frame in video sequence M using stability metric relative to former frame Variation；Video frame I_iStability φ_iIt is defined as follows,

Wherein, c is constant, and exp is exponential function；

Third unit, the sequence stability metric SSM for video sequence M are defined as ε=(φ₂,...,φ_N), if SSM In local maximum be stationary point；

Unit the 4th obtains m stationary point for setting the local maximum in detection SSM curve, extracts m from video sequence M A subsequence obtains candidate sequence S={ s_j, j=1 ..., m；If j-th of stationary point respective image is I_t, extracted according to the stationary point Subsequence s_j=(I_t-L,...,I_t,...,I_t+L), wherein I_tIndicate that t frame image in video sequence M, L are preset segmentation Spacing.

Moreover, the second module include with lower unit,

First unit, for the building of dictionary, including from subsequence s_jIn every frame image extract K topography respectively Block, for subsequence s_jIn k-th of image block in certain image IK=1 ..., K, corresponding dictionary is by except when preceding sub- sequence Arrange s_jExcept other subsequences in all video frame same positions image block composition；

Second unit is used for sparse reconstruct, including by subsequence s_jReconstructed error e_jIt is defined as follows,

Third unit is used for subsequence s_jNoise σ_jIt is defined as σ_j=exp ((e_j)²)。

Moreover, erased noise is more than or equal to the corresponding subsequence of specified threshold θ from candidate sequence S in third module, Middle θ is defined as the intermediate value of all subsequence noises.

The present invention has the positive effect that and advantage:

1, the present invention is multiple with subsequence by extracting in the stability metric method of sequence from video sequence.

2, the present invention is detected and is removed the biggish subsequence of noise based on the method for sparse reconstruct using a kind of, thus structure Build the lesser candidate pool of noise.

3, the present invention comprehensively utilizes subsequence all in candidate pool and carrys out structure using a kind of adaptive weight calculation scheme Clarification of objective expression is built, so that target signature is more robust and has judgement index.

4, the present invention improves the performance that pedestrian identifies again under irregular sequence, can be widely used for monitoring field, video point Analysis and other multimedia application, precision is high, and effect is good, has important market value.

Detailed description of the invention

Fig. 1 is the flow chart of the embodiment of the present invention.

Specific embodiment

Firstly, the pedestrian of traditional video based on image again recognition methods due to its limited information, it is difficult to overcome row People blocks, background interference etc. pedestrian under complex scenes identify problem again.And the identification again based on video sequence can use Its visual information abundant and space-time characteristic, effective solution is blocked and the influence pedestrians such as background interference weigh asking for recognition effect Topic.

It is done secondly, the existing recognition methods again based on video sequence ignores noise present in video sequence mostly It disturbs, same utilizes all information in video.And this noise is generally existing and identifies have very again for pedestrian Big negative influence.

It is different from the consistency that conventional method assumes sequence of pictures, the present invention pays close attention to the noise in video sequence for knowing again Other performance bring adverse effect.Observation for image state differentiation in video sequence, the present invention are found through experiments that A universal phenomenon present in sequence: the variation of sequence image state has certain rule, the change of dbjective state in sequence Change the stability for reflecting target to a certain extent.

A kind of recognition methods again of the pedestrian based on irregular video sequence proposed by the present invention, including having for irregular sequence Effect detection and removal.Firstly, video sequence has time continuity, therefore the removal and detection of noise image should be based on continuous The measurement of video image, i.e. video noise should be based on the subsequence of video.It is worth noting that, video that the present invention herein refers to Sequence refers to that the image in the same subsequence should have similar state (its noise size of state instruction).

Further, to the valid metric of dbjective state in sequence, the present invention utilizes blocking between consecutive frame in sequence Information measures the stability of its sequence, effectively realizes that the subsequence of video sequence is extracted using its stability change, thus So that the image state having the same in same subsequence.

It further, is the detection and removal for realizing irregular subsequence, the present invention is using a kind of based on rarefaction representation The noise of method sub-sequences is measured, and the size of its noise is characterized using the reconstructed error of subsequence.It makes an uproar finally, rejecting The excessive subsequence of sound constructs the lesser target candidate pond of noise.

It further, is the effective building for realizing pedestrian's character representation, the present invention is calculated often using the size of its noise The weight of a subsequence, it is final to comprehensively utilize all subsequences in candidate pool to weight pedestrian's character representation of building robust, It is greatly improved the effect that pedestrian identifies again.The adaptive weighing computation method of one kind proposed by the present invention, can be effective Comprehensively utilize the lesser all subsequences of noise so that pedestrian's feature that the present invention constructs describe it is more robust and more with sentencing Other power.

The present invention is capable of the dry of effective less noise by detecting and eliminating the biggish subsequence of noise in video sequence It disturbs, to be greatly improved the effect that pedestrian identifies again.

Referring to Fig. 1, a kind of recognition methods again of the pedestrian based on irregular video sequence that embodiment provides is specifically included:

Step 1, the segmentation of video sequence.The state change of image has the continuity in time domain, therefore this hair in video It is bright to propose a kind of sequence stability metric method, to effectively indicate the state change of pedestrian in video sequence.Pass through detection Stable point in condition curve, the present invention extract multiple continuous subsequences from video sequence, the row in the same subsequence People's image has similar state (noiseless or having noise).

For most of video sequences, noise is larger in the subsequence of part, and remaining subsequence relative noise compared with Small, therefore, video sequence is divided into multiple subsequences according to the variation of its state by the present invention, thus the figure that will have unified state As sequences segmentation is unified subsequence.The step 1 of embodiment is specific as follows:

Step 1.1: to video sequence M={ I₁,I₂,...,I_N, if I_iThe i-th frame for indicating video, calculates its consecutive frame I_i-1Block information o_i, N is frame number, i=1,2,3 ..., N.

Preferably, embodiment optimizes optical flow computation and occlusion detection as a united optimization problem, from And a reliable occlusion detection is obtained, specific implementation can be found in Alper Ayvaci, Michalis Raptis, and Stefano Soatto.2010.Occlusion detection and motion estimation with convex optimization.In Advances in Neural Information Processing Systems.100–108

Step 1.2: the block information o being calculated in step 1.1_iReflection is that interframe is blocked, i.e. o_iShow present frame I_i Compared to former frame I_i-1Circumstance of occlusion.From the perspective of certain, o_iIllustrate the shape in video sequence M between successive frame State variation.Therefore, the present invention proposes a kind of stability metric method to calculate the state of each frame in video sequence M relative to it The variation of former frame.Video frame I_iStability φ_iIt can be defined as:

Wherein, c is a constant, and exp is exponential function.When it is implemented, empirical value can be used in c, embodiment takes 2.

Step 1.3: at this point, the sequence stability metric (SSM) of video sequence M is defined as: ε=(φ₂,...,φ_N), It can be observed that the local maximum in SSM, the present invention are called stationary point, i.e., the video frame around stationary point is all having the same State；

Step 1.4: setting and several stationary points are obtained by the local maximum in detection SSM curve, the present invention is each by extracting Video frame around stationary point extracts multiple subsequences from video sequence M, obtains collection of candidate sequences S={ s_j, j=1 ..., M, m are the number (i.e. the number in stationary point) of subsequence.If j-th of stationary point respective image is I_t, according to the sub- sequence of stationary point extraction Arrange s_j=(I_t-L,...,I_t,...,I_t+L), wherein I_tIndicate that t frame image, L are preset segmentation spacing, the son in each stationary point Sequence includes the stationary point and front and back L adjacent video frames.When it is implemented, L can be arranged according to the distance before stationary point, Or empirical value, such as L is used to take 10.

Step 2, the irregular Sequence Detection based on rarefaction representation.The present invention is using rarefaction representation to each of video Subsequence carries out its noise measurement, and the size of its noise is indicated with the reconstructed error of each subsequence.

Subsequence is extracted from video sequence by the front present invention, the picture in same subsequence has similar state (noiseless or having noise).On this basis, the step 2 of embodiment detects irregular subsequence using rarefaction representation, tool It is divided into following steps for body:

Step 2.1: the building of dictionary, for subsequence s_j, the present invention is every wherein with certain spatial mesh structure A video frame picture I (including I_t-L,...,I_t,...,I_t+L) in extract K topography's block respectively, it is such as that every frame picture is uniform The grid for being divided into 10 × 10, obtain K=100 topography's block.These image blocks are used to construct corresponding dictionary.Tool Body is said, for subsequence s_jAny image I in k-th of image blockK=1 ..., K, corresponding dictionary are as follows:That is,Dictionary be by except when preceding subsequence s_jExcept other subsequences in The image block of all video frame same positions forms.It must be noted that the state of image has height in the same subsequence Similitude, therefore the dictionary of building corresponding image block cannot be used to, the present invention uses each video frame in other subsequences The image block of same position, dictionary are stablized, and not will receive the interference of noise.

Step 2.2: sparse reconstruct, for subsequence s_jAny image I in arbitrary image blockSimplification is denoted as image Block P, sparse representation model can indicate are as follows:

Wherein, β indicates sparse coefficient, and λ is regularization factors, and D is corresponding dictionary, then the reconstructed error of image block P at this time It is defined as:

Correspondingly, the present invention is by subsequence s_jReconstructed error e_jIt is defined as follows:

Wherein, Z^kFor subsequence s_jIn each image the matrix that is formed of k-th of image block, including 2L+1 image block, each column It is all pixels of an image block by the result being from left to right unfolded from top to bottom；X^kIt is 2L+1 for corresponding sparse coefficient The vector that the rarefaction representation calculated result of image block is constituted.

Step 2.3: 2.2 reconstructed error for having obtained each subsequence through the above steps, the present invention is by subsequence s_j's Noise σ_jIs defined as:

σ_j=exp ((e_j)²)；

Step 3, the removal of irregular subsequence.According to the noise size of subsequence each in step 2, the present invention rejects those Noise is greater than the subsequence of specified threshold, retains the lesser subsequence of noise.

In above-mentioned steps 2.3, it can be observed that s_jNoise reconstructed error e_jIt is bigger, then its noise σ_jIt is bigger, then sub- sequence Arrange s_jThe poorer therefore deleted probability of availability should be bigger, step 3 specific steps of embodiment are as follows:

Step 3.1: erased noise is more than or equal to the corresponding subsequence of specified threshold θ from collection of candidate sequences S, wherein θ Is defined as:

θ=median (σ_j), j=1 ..., m

That is the threshold θ intermediate value that is defined as all subsequence noises；

Step 3.2: setting the lesser subsequence of noise retained after step 3.1 processing and (meet σ_jThe subsequence of < θ) be s₁,...,s_T, construct candidate pool Q={ s₁,...,s_T, T indicates the lesser subsequence number of noise retained.This step basis It is retaining after step 3.1 processing as a result, the lesser subsequence of noise is constructed candidate pool.

Step 4, the pedestrian's character representation for carrying out adaptive weighting, obtains the search result based on video sequence M.The present invention To the lesser subsequence of the noise of reservation, its feature weight of adaptive calculating is finally added the feature of each subsequence Power fusion, to construct more robust pedestrian's character representation.

In above-mentioned steps, the present invention deletes the biggish subsequence of noise, is waited using the lesser subsequence building of noise Scavenger, in order to obtain more sufficient pedestrian's external appearance characteristic, all subsequences of comprehensive candidate pool are constructed pedestrian by the present invention Character representation.Step 4 specific steps of embodiment are as follows:

Step 4.1: subsequence foundation characteristic indicates that, for each subsequence in candidate pool Q, the present invention uses accordingly respectively Foundation characteristic f is indicated, is denoted as f respectively₁,...,f_N.I.e. the feature of Q can indicate are as follows:

F_Q={ f₁,...,f_T}；

When constructing foundation characteristic f, in order to obtain more robust character representation, preferably by the visual signature of sequence The space-time characteristic (HOG3D) of (color characteristic, textural characteristics etc.) and sequence indicates to integrate building pedestrian's foundation characteristic.Such as When constructing the visual signature of sequence, color characteristic (such as hsv color histogram) is extracted to every frame picture in sequence first, so It is indicated afterwards using the visual signature of maxpooling building sequence

Step 4.2: calculating any subsequence s in candidate pool_tWeight, weights omega_tIt is defined as follows:

Wherein, ω^*For normalization factor, for ensuring the weight of each subsequence and being 1；σ_tFor subsequence s_tNoise, t =1 ..., T.

Step 4.3: pedestrian's character representation, the present invention using in candidate pool subsequence and its corresponding weight come weight meter The final character representation of pedestrian is calculated, then the character representation of video sequence M are as follows:

After obtaining the character representation of sequence, it can use existing characteristic measure method and be trained and test.It is surveying When examination, for each search sequence, the search sequence is calculated using existing measure (such as XQDA, KISSME) and in library The characteristic distance of pedestrian's sequence, finally according to the search result that the search sequence can be obtained apart from ascending sort.

When it is implemented, method provided by the present invention can realize automatic running process based on software technology, mould can also be used Block mode realizes corresponding system.

Each module specific implementation can be found in corresponding steps, and it will not go into details by the present invention.

Specific embodiment described herein only illustrates that spirit of the invention.Technology belonging to the present invention The technical staff in field can do various modifications or additions to described specific embodiment or use similar side Formula substitution, however, it does not deviate from the spirit of the invention or beyond the scope of the appended claims.

Claims

1. a kind of recognition methods again of the pedestrian based on irregular video sequence, it is characterised in that: include the following steps,

Step 1, the segmentation of video sequence, including extracting multiple companies from video sequence M by the stable point in detecting state curve Continuous subsequence obtains candidate sequence S；

Step 2, the irregular Sequence Detection based on rarefaction representation is missed including the use of the reconstruct that rarefaction representation seeks each subsequence Difference obtains the noise measurement result of each subsequence；

Step 3, the removal of irregular subsequence, including according to the noise measurement of each subsequence of gained in step 2 as a result, from candidate Cancelling noise is greater than the subsequence of respective threshold in sequence S, if the subsequence retained is s₁,...,s_T, constitute candidate pool Q；

Step 4, the pedestrian's character representation for carrying out adaptive weighting obtains the search result based on video sequence M, including following son Step,

Step 4.1, subsequence foundation characteristic indicates, including using corresponding basis special respectively each subsequence in candidate pool Q Sign indicates, is denoted as f₁,...,f_N；

Step 4.3, pedestrian's character representation, including the use of in candidate pool Q subsequence and corresponding weight, weighted calculation pedestrian is most Whole character representation, then the character representation of video sequence M is as follows,

Search result is obtained according to the character representation of video sequence M；

Wherein, step 1 includes following sub-step,

Step 1.1, to video sequence M={ I₁,I₂,...,I_N, if I_iIt indicates the i-th frame of video, calculates consecutive frame I_i-1Block Information ο_i, wherein N is frame number, i=1,2,3 ..., N；

Step 1.2, variation of the state of each frame in video sequence M relative to former frame is calculated using stability metric；Video Frame I_iStability φ_iIt is defined as follows,

Wherein, c is constant, and exp is exponential function；

Step 1.3, the sequence stability metric SSM of video sequence M is defined as ε=(φ₂,...,φ_N), if the part in SSM is most Big value is stationary point；

Step 1.4, if the local maximum in detection SSM curve obtains m stationary point, m subsequence is extracted from video sequence M, Obtain candidate sequence S={ s_j, j=1 ..., m；If j-th of stationary point respective image is I_t, according to the subsequence of stationary point extraction s_j=(I_t-L,...,I_t,...,I_t+L), wherein I_tIndicate that t frame image in video sequence M, L are preset segmentation spacing.

2. pedestrian's recognition methods again according to claim 1 based on irregular video sequence, it is characterised in that: step 2 packet Following sub-step is included,

Step 2.1, the building of dictionary, including from subsequence s_jIn every frame image extract K topography's block respectively, for son Sequence s_jIn k-th of image block in certain image ICorresponding dictionary is by except when preceding subsequence s_jExcept The image block composition of all video frame same positions in other subsequences；

Step 2.3, by subsequence s_jNoise σ_jIt is defined as σ_j=exp ((e_j)²)。

3. the recognition methods again of the pedestrian based on irregular video sequence according to claim 1 or claim 2, it is characterised in that: step 3 In, erased noise is more than or equal to the corresponding subsequence of specified threshold θ from candidate sequence S, and wherein θ is defined as all subsequences and makes an uproar The intermediate value of sound.

4. a kind of pedestrian's weight identifying system based on irregular video sequence, it is characterised in that: it comprises the following modules,

First module, for the segmentation of video sequence, including being mentioned from video sequence M by the stable point in detecting state curve Multiple continuous subsequences are taken, candidate sequence S is obtained；

Second module seeks each subsequence including the use of rarefaction representation for the irregular Sequence Detection based on rarefaction representation Reconstructed error, obtain the noise measurement result of each subsequence；

Third module, for the removal of irregular subsequence, including the noise measurement according to each subsequence of gained in the first module As a result, cancelling noise is greater than the subsequence of respective threshold from candidate sequence S, if the subsequence retained is s₁,...,s_T, constitute Candidate pool Q；

4th module obtains the search result based on video sequence M, wraps for carrying out pedestrian's character representation of adaptive weighting It includes with lower unit,

First unit is indicated for subsequence foundation characteristic, including using corresponding base respectively for each subsequence in candidate pool Q Plinth character representation, is denoted as f₁,...,f_N；

Third unit, be used for pedestrian's character representation, including the use of in candidate pool Q subsequence and corresponding weight, weighted calculation The final character representation of pedestrian, then the character representation of video sequence M is as follows,

Wherein, the first module include with lower unit,

First unit, for video sequence M={ I₁,I₂,...,I_N, if I_iIt indicates the i-th frame of video, calculates consecutive frame I_i-1 Block information ο_i, wherein N is frame number, i=1,2,3 ..., N；

Second unit, for calculating variation of the state of each frame in video sequence M relative to former frame using stability metric； Video frame I_iStability φ_iIt is defined as follows,

Wherein, c is constant, and exp is exponential function；

Third unit, the sequence stability metric SSM for video sequence M are defined as ε=(φ₂,...,φ_N), if in SSM Local maximum is stationary point；

Unit the 4th obtains m stationary point for setting the local maximum in detection SSM curve, extracts m son from video sequence M Sequence obtains candidate sequence S={ s_j, j=1 ..., m；If j-th of stationary point respective image is I_t, according to the son of stationary point extraction Sequence s_j=(I_t-L,...,I_t,...,I_t+L), wherein I_tIndicate t frame image in video sequence M, L is between preset segmentation Away from.

5. pedestrian's weight identifying system based on irregular video sequence according to claim 4, it is characterised in that: the second module Including with lower unit,

First unit, for the building of dictionary, including from subsequence s_jIn every frame image extract K topography's block respectively, it is right In subsequence s_jIn k-th of image block in certain image ICorresponding dictionary is by except when preceding subsequence s_jIt The image block composition of all video frame same positions in other outer subsequences；

6. pedestrian's weight identifying system according to claim 4 or 5 based on irregular video sequence, it is characterised in that: third In module, erased noise is more than or equal to the corresponding subsequence of specified threshold θ from candidate sequence S, and wherein θ is defined as all sub- sequences The intermediate value of column noise.