CN111666881B - Giant panda pacing, bamboo eating and estrus behavior tracking analysis method - Google Patents
- Publication number: CN111666881B
- Application number: CN202010510090.4A
- Authority: CN (China)
- Prior art keywords: background, pixel, current pixel, frame, foreground
- Prior art date
- Legal status: Active (the legal status is an assumption and is not a legal conclusion; Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed)
Classifications
- G06V40/20 — Movements or behaviour, e.g. gesture recognition
- G06F18/23 — Clustering techniques
- G06F18/2411 — Classification techniques based on the proximity to a decision surface, e.g. support vector machines
- G06V10/267 — Segmentation of patterns in the image field by performing operations on regions, e.g. growing, shrinking or watersheds
- G06V10/464 — Salient features, e.g. scale invariant feature transforms [SIFT], using a plurality of salient features, e.g. bag-of-words [BoW] representations
- G06V20/41 — Higher-level, semantic clustering, classification or understanding of video scenes
- G06V40/25 — Recognition of walking or running movements, e.g. gait recognition
Abstract
The invention relates to the field of information technology and provides a tracking analysis method for giant panda pacing, bamboo-eating and estrus behavior. The method addresses the problem that the panda's motion background is complex, so traditional background-extraction algorithms struggle to achieve an ideal foreground-extraction effect. The method comprises the following steps: step 1, inputting a panda video and extracting the foreground target from each video frame with an improved ViBe method; step 2, applying morphological erosion and dilation to the extracted foreground template; step 3, taking the minimum bounding rectangle of the contour of the largest connected region as the target area and the centroid of the target area as the target position; and step 4, performing the same operations as steps 1-3 on each frame, outputting the panda's motion trajectory and speed, and analyzing its behavior.
Description
Technical Field
The invention relates to the field of information technology and provides a tracking analysis method for giant panda pacing, bamboo-eating and estrus behavior.
Background
The giant panda is a rare and endangered wild animal unique to China. For many years, panda populations have been under pressure from habitat loss and fragmentation caused by human activities such as large-scale deforestation, land reclamation, hunting, and the construction of major infrastructure such as highways and railways. At present only about 1,864 wild giant pandas remain, distributed across the Minshan, Qionglai and Qinling mountain ranges. Ex-situ conservation, i.e. captive breeding and reproduction, is one of the basic approaches to endangered-species protection and an important supplement to in-situ (habitat) protection; it plays an important role in increasing population numbers, maintaining the existing breeding population of captive pandas, and ensuring the continuation of the species. However, the current captive panda population faces problems such as high morbidity, low birth rate, poor health, and behavioral degradation.
The protection of endangered wild animals includes two important means, in-situ protection and ex-situ protection, and ex-situ protection has made important breakthroughs in recent years as a supplement to the in-situ protection of pandas. Captive rearing of giant pandas began in 1936, when Ruth Harkness obtained a two-month-old male panda cub in Caopo, Wenchuan, Sichuan, and named it "Su Lin" (international panda studbook number 1). "Su Lin" was the first living giant panda taken out of China and was exhibited at a Chicago zoo in the United States in February 1937. After the founding of the People's Republic of China, Chengdu Zoo began raising giant pandas in 1953, opening the history of captive pandas in China. Although pandas have now been kept in captivity for more than 70 years, the history of raising and breeding them has been tortuous. Progress was slow from 1936 to the 1990s, and breeding captive pandas was very difficult. Since the 1990s, and especially since 2000, great progress has been made in the husbandry, breeding, and disease prevention and control of captive pandas, but some directions still require further research.
Most video-based behavior recognition techniques target human behavior; behavior recognition for animals is rare, with only a few researchers studying animals such as pigs and chickens. Research on giant pandas has mainly been in biological fields such as genetics and heredity, with a small amount of work on panda detection in static images, while panda behavior recognition in video is still a blank area. Research and analysis of panda behavior recognition and tracking in video helps monitor the physiological health, mental health and reproductive state of pandas, and contributes to improving the health and population size of the species.
In order to grasp the physiological health, mental health and estrus state of giant pandas, their bamboo-eating, pacing and estrus behaviors are detected, recorded and then analyzed; if an abnormality is found, reasonable measures are taken in time to keep the pandas in a healthy state.
Disclosure of Invention
The invention aims to solve the problem that, because the panda's motion background is complex, traditional background-extraction algorithms struggle to achieve an ideal foreground-extraction effect.
The invention adopts the following technical scheme to solve the technical problems:
a giant panda pacing behavior tracking analysis method is characterized by comprising the following steps:
step 1, inputting a panda video and extracting the foreground target of each video frame with an improved ViBe method;
step 2, performing morphological erosion and dilation on the extracted foreground template;
step 3, taking the minimum bounding rectangle of the contour of the largest connected region as the target area and the centroid of the target area as the target position;
and step 4, performing the same operations as steps 1-3 on each frame, outputting the panda's motion trajectory and speed, and analyzing its behavior.
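A minimal illustrative sketch of steps 3-4, assuming a binary foreground mask per frame is already available (it would come from the improved ViBe extractor of steps 1-2). The function names and the BFS-based connected-component search are my own, not the patent's:

```python
# Illustrative sketch (not the patented implementation): given a binary
# foreground mask, find the largest 8-connected region, take the centre of
# its minimum bounding rectangle as the target position, and derive
# per-frame speed from consecutive positions.
from collections import deque
import numpy as np

def largest_region_centroid(mask):
    """Centre of the bounding rectangle of the largest 8-connected region, or None."""
    seen = np.zeros_like(mask, dtype=bool)
    best = None
    h, w = mask.shape
    for sy, sx in zip(*np.nonzero(mask)):
        if seen[sy, sx]:
            continue
        queue, pixels = deque([(sy, sx)]), []
        seen[sy, sx] = True
        while queue:                          # BFS over one connected region
            y, x = queue.popleft()
            pixels.append((y, x))
            for dy in (-1, 0, 1):
                for dx in (-1, 0, 1):
                    ny, nx = y + dy, x + dx
                    if 0 <= ny < h and 0 <= nx < w and mask[ny, nx] and not seen[ny, nx]:
                        seen[ny, nx] = True
                        queue.append((ny, nx))
        if best is None or len(pixels) > len(best):
            best = pixels                     # keep the region with the largest area
    if best is None:
        return None
    ys, xs = zip(*best)
    return ((min(xs) + max(xs)) / 2.0, (min(ys) + max(ys)) / 2.0)

def track_speed(centroids, fps):
    """Step 4: speed in pixels/second between consecutive target positions."""
    return [float(np.hypot(b[0] - a[0], b[1] - a[1])) * fps
            for a, b in zip(centroids, centroids[1:])]
```

Running `largest_region_centroid` on each frame's mask yields the trajectory whose repeatability is later analyzed for pacing.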
In the above technical solution, the improved ViBe method comprises the following steps:
step 1.1, background initialization: construct an initial background B0 from the first n frames of the video using a multi-frame averaging method;
step 1.2, for each pixel (x, y) of the initial background B0, create a sample set M(x, y) = {v_1, v_2, …, v_N}, where v_i is a random sample value from the 8-neighborhood of (x, y), i = 1, 2, …, N;
step 1.3, for the i-th frame image f_i (i = 2, 3, …, n), calculate:
TB_i = F_OTSU(|f_i − B_rd|)
TF_i = F_OTSU(|f_i − f_{i−1}|)
R_i = α·TF_i + (1 − α)·TB_i
where B_rd denotes the background formed by selecting the rd-th sample from each sample set, rd being a value randomly chosen from {1, 2, …, N}; F_OTSU(·) denotes the foreground/background segmentation threshold computed with the OTSU method; TB_i is the OTSU segmentation threshold of the background-difference result; Inf_i(x, y) denotes the binarization result of the i-th frame at (x, y); TF_i is the OTSU segmentation threshold of the frame-difference result; R_i is the value of the radius threshold R for the i-th frame; and α is a weighting coefficient between 0 and 1;
If Inf_i(x, y) = 1, the following process is performed:
step 1.3.1, judge whether the current pixel (x, y) is background by computing the similarity between (x, y) and its corresponding sample set, specifically:
cnt_j = 1 if dis(f_i(x, y), v_j) < R_i, and cnt_j = 0 otherwise,
where cnt_j denotes the comparison result between the current pixel (x, y) and the j-th background sample pixel in the sample set; f_i denotes the i-th video frame, i.e. the frame containing the current pixel; dis denotes the Euclidean distance between two pixels; and v_j denotes the j-th pixel in the background sample set. If the sum of the comparison results between the current pixel and all background pixels in the sample set is greater than or equal to a threshold T, the current pixel is judged to be a background pixel; otherwise it is a foreground pixel;
DB_i(x, y) denotes the judgment of whether pixel (x, y) of the i-th frame is a foreground or background pixel; the current pixel being a foreground pixel means DB_i(x, y) = 1;
when the current pixel (x, y) is a background pixel, i.e. DB_i(x, y) = 0, the background is updated with probability 1/θ; the update consists of two parts, updating the current sample set and updating a neighborhood; θ is a time-sampling factor, generally taken as 16. It is not necessary to update the background model at every new video frame: when a pixel is classified as background, it updates the background model with probability 1/θ;
First, the sample set is updated: the pixel value f_i(x, y) of the current pixel replaces one randomly selected sample v_d in its corresponding background sample set M(x, y), i.e. v_d = f_i(x, y);
Second, a neighborhood update is performed: a pixel (x_1, y_1) is randomly selected in the 8-neighborhood of the current pixel (x, y), one sample v_l is then randomly selected from its background sample set M(x_1, y_1), and it is replaced with the current pixel value, i.e. v_l = f_i(x, y).
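The per-pixel decision and update rules above can be sketched for a single grayscale pixel as follows. This is a hedged illustration: the sample values, R, T and θ below are made up for the example, and the function names are mine, not the patent's:

```python
# Single-pixel sketch of the ViBe rules: a pixel is background when at
# least T of its N background samples lie within radius R of its value;
# a background pixel then refreshes one random sample with probability
# 1/theta (the same rule drives the neighborhood update).
import random

def classify_pixel(value, samples, R, T):
    """Return 0 (background) if >= T samples are within distance R, else 1 (foreground)."""
    cnt = sum(1 for v in samples if abs(value - v) < R)  # sum of cnt_j over the sample set
    return 0 if cnt >= T else 1

def update_samples(value, samples, theta, rng=random):
    """With probability 1/theta, overwrite one randomly chosen background sample."""
    if rng.random() < 1.0 / theta:
        samples[rng.randrange(len(samples))] = value
    return samples
```

In a full implementation these rules run per pixel per frame, with `R` set to the adaptive radius R_i computed in step 1.3.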
The invention also provides a giant panda bamboo-eating and estrus behavior identification method, comprising the following steps:
step 1, inputting a panda video and extracting the foreground target of each video frame with an improved ViBe method to obtain a foreground target image;
step 2, constructing a multi-scale spatial pyramid on the foreground target image, obtaining candidate points for dense trajectories by dense sampling, and extracting dense trajectories at different spatial scales;
step 3, letting u_t denote the horizontal component and v_t the vertical component of the optical flow field, so that ω_t = (u_t, v_t) is the dense optical flow field between frame t and frame t+1. A feature point P_t = (x_t, y_t) on frame t is smoothed in the flow field ω_t with a median filter M, and its position on frame t+1 is defined as:
P_{t+1} = (x_{t+1}, y_{t+1}) = (x_t, y_t) + (M ∗ ω_t)|_{(x̄_t, ȳ_t)}
where (x̄_t, ȳ_t) is the rounded position of (x_t, y_t), ω_t is the optical flow field, and M is the median filter kernel; the feature points tracked in subsequent frames are concatenated to form the motion trajectory (P_t, P_{t+1}, …);
step 4, tracking the feature points in the optical flow field to form motion trajectories, constraining the tracking length L to avoid the drift caused by long-term tracking; feature descriptors are built along the dense trajectories, with HOG and the trajectory shape as shape descriptors and HOF and MBH as motion descriptors;
step 5, reducing the dimensionality of the obtained feature descriptors with principal component analysis (PCA), mapping the data from a high-dimensional space to a low-dimensional space while preserving as much of the principal information as possible, yielding feature descriptors of dimension d after reduction;
step 6, feature encoding and classification based on the Fisher Vector: the local features are modeled with a Gaussian mixture model (GMM) with K Gaussian clusters, and the GMM is solved by training on the local feature set with the EM algorithm; the reduced descriptors are then encoded with the Fisher Vector, giving an encoded feature dimension of 2 × d × K;
and step 7, finally, the obtained encoded feature descriptors are fed into an SVM classifier for classification.
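The dimensions in steps 5-6 can be checked with a small sketch: a NumPy-SVD stand-in for PCA reduces descriptors to d dimensions, and a Fisher Vector over a K-component GMM (mean and variance gradients) has length 2·d·K. The values of d and K below are illustrative; the patent leaves them as parameters:

```python
# Dimension sanity check for the PCA + Fisher Vector stages (illustrative).
import numpy as np

def pca_reduce(X, d):
    """Project rows of X onto the top-d principal components via SVD."""
    Xc = X - X.mean(axis=0)                      # center the descriptors
    _, _, Vt = np.linalg.svd(Xc, full_matrices=False)
    return Xc @ Vt[:d].T                         # keep the top-d components

def fisher_vector_dim(d, K):
    """Mean + variance gradients per Gaussian give 2*d*K encoded dimensions."""
    return 2 * d * K

rng = np.random.default_rng(0)
X = rng.normal(size=(50, 96))   # 50 descriptors of raw dimension 96 (toy data)
Z = pca_reduce(X, 24)           # d = 24 after PCA
assert Z.shape == (50, 24)
print(fisher_vector_dim(24, 8)) # K = 8 Gaussians -> encoded dimension 384
```

A production pipeline would fit the GMM with EM on the reduced descriptors and compute the actual Fisher Vector gradients; only the bookkeeping is shown here.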
In the above technical solution, the improved ViBe method is the same as that described in steps 1.1 to 1.3 of the pacing behavior tracking analysis method above.
Because the invention adopts the above technical solution, it has the following beneficial effects:
Because captive pandas live in an artificially constructed environment with limited activity space and limited environmental enrichment compared with the wild, some pandas may become mentally bored after a period of time and develop psychological problems, performing repetitive actions such as repeatedly walking the same closed route; such actions are called stereotyped pacing behavior. To identify panda pacing behavior, the following method is adopted:
1) Because the panda's motion background is complex and traditional background-extraction algorithms struggle to extract the foreground target well, an improved ViBe algorithm is used to extract the panda foreground target. By constructing an initial background with a multi-frame averaging method and then modeling the background, the problem that the traditional ViBe algorithm cannot reflect scene changes in time, which lowers the quality of the extracted foreground target, is solved, and the accuracy of foreground extraction is effectively improved.
2) Morphological erosion and dilation and connected-component analysis are applied to the extracted image; the contour of the largest connected region is taken as the panda's active region, its minimum bounding rectangle is computed, and the centroid of that region is taken as the panda's position. The same tracking operation is applied to each frame to obtain the panda's motion trajectory. Compared with traditional methods, the proposed method maintains high tracking accuracy while reducing computational complexity.
3) By analyzing the repeatability of the motion trajectory, it is judged whether the motion is stereotyped pacing behavior and whether the panda's movement habits are abnormal; if so, this indicates a problem with its mental state, and corresponding measures should be taken in time to treat the panda.
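One simple way to operationalize the repeatability check in point 3) is to quantize the centroid track to grid cells and count how often the same cell path is re-walked; per the description, three or more repeats of one route suggest stereotyped pacing. This is a hedged sketch, and the grid size, route length and repeat threshold are illustrative choices, not values from the patent:

```python
# Hedged sketch of trajectory-repeatability analysis for pacing detection.
from collections import Counter

def quantize(track, cell=50):
    """Map (x, y) centroids to grid cells, dropping consecutive duplicates."""
    cells = []
    for x, y in track:
        c = (int(x // cell), int(y // cell))
        if not cells or cells[-1] != c:
            cells.append(c)
    return cells

def is_pacing(track, cell=50, route_len=3, min_repeats=3):
    """True if any route_len-cell sub-path occurs at least min_repeats times."""
    cells = quantize(track, cell)
    windows = Counter(tuple(cells[i:i + route_len])
                      for i in range(len(cells) - route_len + 1))
    return any(n >= min_repeats for n in windows.values())
```

A back-and-forth walk along one corridor repeats the same sub-path and is flagged, while a one-way stroll is not.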
Drawings
FIG. 1 is a schematic diagram of the steps in the flow of the present invention.
Detailed Description
The present invention will be described in further detail with reference to the drawings and examples, in order to make the objects, technical solutions and advantages of the present invention more apparent. It should be understood that the particular embodiments described herein are illustrative only and are not intended to limit the invention, i.e., the embodiments described are merely some, but not all, of the embodiments of the invention. The components of the embodiments of the present invention generally described and illustrated in the figures herein may be arranged and designed in a wide variety of different configurations.
Thus, the following detailed description of the embodiments of the invention, as presented in the figures, is not intended to limit the scope of the invention, as claimed, but is merely representative of selected embodiments of the invention. All other embodiments, which can be made by a person skilled in the art without making any inventive effort, are intended to be within the scope of the present invention.
It is noted that relational terms such as "first" and "second", and the like, are used solely to distinguish one entity or action from another entity or action without necessarily requiring or implying any actual such relationship or order between such entities or actions. Moreover, the terms "comprises," "comprising," or any other variation thereof, are intended to cover a non-exclusive inclusion, such that a process, method, article, or apparatus that comprises a list of elements does not include only those elements but may include other elements not expressly listed or inherent to such process, method, article, or apparatus. Without further limitation, an element defined by the phrase "comprising one … …" does not exclude the presence of other like elements in a process, method, article, or apparatus that comprises the element.
In the panda behavior recognition process, the following key problems mainly exist:
the first key problem is how to accurately track the pandas and record the movement track and movement duration of the pandas.
The second key problem is how to accurately identify panda behavior. Pandas differ greatly from humans in shape and behavior, and most existing video-based behavior recognition targets the human body; for the irregular behavior and changeable shape of pandas, a reasonable algorithm must be designed to achieve more accurate identification and recording.
Tracking and analysis of panda pacing behavior: a panda repeatedly walking the same route on the same section more than three times is called stereotyped pacing behavior. Its occurrence may mean that the panda's mental condition is abnormal to some degree and unconscious repetitive behavior has developed; measures need to be taken in time to intervene and improve the panda's mental condition.
Panda feeding behavior recognition and analysis: pandas feed mainly on bamboo. Through recognition and analysis, the feeding time is compared with that at normal times; if the time is abnormal, it is analyzed whether this is related to an abnormal health condition of the panda, such as a dental or digestive problem, in which case treatment needs to be given in time.
Panda estrus behavior recognition: the frequency of special behaviors of pandas increases sharply during estrus, such as handstand genital rubbing, side leg-lifting, and tail raising. By identifying and counting the occurrences of these special behaviors, panda estrus can be effectively monitored, preparing for breeding work.
The invention provides a giant panda pacing behavior tracking analysis method, characterized by comprising the following steps:
step 1, inputting a panda video and extracting the foreground target of each video frame with an improved ViBe method;
step 2, performing morphological erosion and dilation on the extracted foreground template;
step 3, taking the minimum bounding rectangle of the contour of the largest connected region as the target area and the centroid of the target area as the target position;
and step 4, performing the same operations as steps 1-3 on each frame, outputting the panda's motion trajectory and speed, and analyzing its behavior.
In the above technical solution, the improved ViBe method is the same as that described in steps 1.1 to 1.3 earlier in this disclosure.
Recognition and analysis of panda bamboo-eating behavior can often reveal the panda's health state: if the feeding time suddenly increases or decreases greatly and this occurs several times in a row, the panda should be checked for dental or digestive problems and treated in time. Recognizing and recording bamboo-eating behavior requires two steps: first, identifying the bamboo-eating behavior; second, recording the duration of bamboo eating, comparing it with historical data, and analyzing whether an abnormality exists.
Panda estrus behavior recognition and analysis: reproduction is an important factor affecting panda population size, and the panda's estrus period is short. Monitoring estrus behavior in advance and accurately grasping the estrus period can greatly promote panda reproduction and expand the population, which is of great significance.
Panda estrus behavior includes scent marking, genital rubbing, rapid movement, tail raising, butting, and so on. Estrus behavior recognition and analysis mainly involves two steps: first, classifying and identifying the estrus behaviors; second, counting them and recording the number of occurrences of each behavior, to facilitate subsequent analysis of the panda's estrus period.
Panda behavior recognition based on foreground target extraction and dense trajectories: pandas move in an artificially constructed activity area with a complex background. If dense trajectories are extracted directly from the original image, the feature dimensionality is too high, the computation is heavy, and a large amount of redundant background information is included. To solve this, a behavior recognition method based on foreground target extraction and dense trajectories is proposed. First, the target region of each video frame is extracted; then dense trajectories are extracted within the target region and feature descriptors are built along them; PCA reduces the feature dimensionality of the descriptors and the computation; a Gaussian mixture model models the local features; the features are encoded with a Fisher vector; and finally an SVM is trained for classification.
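The tracking rule this pipeline relies on advances each tracked point by the median-filtered optical flow at its rounded position, P_{t+1} = P_t + (M ∗ ω_t). A minimal sketch of that one step, with a toy flow array standing in for a real dense optical-flow field and the function name being my own:

```python
# Hedged sketch of the median-filtered point-advance step of dense
# trajectory tracking; `flow` is an (H, W, 2) array of (u, v) components.
import numpy as np

def advance_point(pt, flow, radius=1):
    """Move (x, y) by the median flow inside a (2*radius+1)^2 window around it."""
    x, y = pt
    h, w = flow.shape[:2]
    xi, yi = int(round(x)), int(round(y))          # rounded position (x̄_t, ȳ_t)
    y0, y1 = max(0, yi - radius), min(h, yi + radius + 1)
    x0, x1 = max(0, xi - radius), min(w, xi + radius + 1)
    patch = flow[y0:y1, x0:x1].reshape(-1, 2)
    u = float(np.median(patch[:, 0]))              # median-filtered u_t
    v = float(np.median(patch[:, 1]))              # median-filtered v_t
    return (x + u, y + v)
```

Applying this per frame and concatenating the results yields the trajectory (P_t, P_{t+1}, …) from which the descriptors are computed.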
The invention provides a panda bamboo eating and oestrus behavior identification method, which comprises the following steps:
step 1, inputting panda video images, and extracting foreground targets of the video frames by using an improved ViBe method to obtain foreground target images;
step 2, constructing a multi-scale spatial pyramid in the foreground target image, acquiring candidate points of dense trajectories through dense sampling, and extracting dense trajectories at different spatial scales;
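Step 2 can be sketched as follows. The sampling step (5 px), the scale factor (1/√2 per level), and the number of levels (4) are assumptions borrowed from common dense-trajectory settings; the description does not fix them.

```python
import numpy as np

def build_pyramid(frame, num_scales=4, scale_step=1 / np.sqrt(2)):
    """Return the frame resampled at num_scales spatial scales (nearest-neighbour)."""
    scales = []
    for s in range(num_scales):
        factor = scale_step ** s
        h = max(1, int(round(frame.shape[0] * factor)))
        w = max(1, int(round(frame.shape[1] * factor)))
        rows = (np.arange(h) / factor).astype(int).clip(0, frame.shape[0] - 1)
        cols = (np.arange(w) / factor).astype(int).clip(0, frame.shape[1] - 1)
        scales.append(frame[np.ix_(rows, cols)])
    return scales

def dense_sample(level, mask, step=5):
    """Sample candidate points every `step` pixels, keeping only foreground ones."""
    return [(x, y)
            for y in range(0, level.shape[0], step)
            for x in range(0, level.shape[1], step)
            if mask[y, x]]
```

Restricting the sampling to the foreground mask is what removes the background redundancy mentioned above.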
step 3, using u_t to represent the horizontal component and v_t the vertical component of the optical flow field, so that ω_t = (u_t, v_t) represents the dense optical flow field between the t-th frame and the (t+1)-th frame image; for a feature point P_t = (x_t, y_t) on the t-th frame image, the optical flow field ω_t is smoothed by a median filter M, and the position on the (t+1)-th frame corresponding to the smoothed point is defined as:

P_{t+1} = (x_{t+1}, y_{t+1}) = (x_t, y_t) + (M ∗ ω_t)|_(x̄_t, ȳ_t)

wherein (x̄_t, ȳ_t) represents the center of the circular filter region around (x_t, y_t), ω_t is the optical flow field, and M is the median filter; the feature points tracked in the subsequent frames are concatenated to form the motion trajectory (P_t, P_{t+1}, ......);
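A minimal sketch of this tracking rule, assuming a 3×3 median window (the description does not fix the filter size): the point is advanced by the median-filtered flow sampled at its rounded position.

```python
import numpy as np

def track_point(pt, flow, k=1):
    """Advance one feature point by the median-filtered optical flow.

    pt   : (x, y) position on frame t
    flow : H x W x 2 dense flow field (u_t, v_t) between frames t and t+1
    k    : half-width of the median window (k=1 -> 3x3, an assumed size)
    """
    h, w = flow.shape[:2]
    x, y = int(round(pt[0])), int(round(pt[1]))
    x, y = min(max(x, 0), w - 1), min(max(y, 0), h - 1)
    patch = flow[max(y - k, 0):y + k + 1, max(x - k, 0):x + k + 1]
    u = np.median(patch[..., 0])   # median-smoothed horizontal component
    v = np.median(patch[..., 1])   # median-smoothed vertical component
    return (pt[0] + u, pt[1] + v)
```

Calling this frame after frame yields the motion trajectory (P_t, P_{t+1}, ......).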
step 4, tracking the feature points in the optical flow field to form motion trajectories, constraining the tracking length to L frames to avoid the tracking drift caused by long-term tracking; constructing feature descriptors along the dense trajectories, using HOG and the trajectory shape as shape descriptors and HOF and MBH as motion descriptors;
step 5, reducing the dimension of the obtained feature descriptors by principal component analysis (PCA), mapping the data from a high-dimensional space to a low-dimensional space while preserving as much of the principal information as possible, to obtain feature descriptors of feature dimension d after reduction;
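Step 5's PCA projection can be sketched with a plain SVD; the target dimension d is whatever the experiment chooses:

```python
import numpy as np

def pca_reduce(X, d):
    """Project the rows of X (N x D descriptors) onto the d principal components."""
    Xc = X - X.mean(axis=0)                      # centre the data
    # SVD of the centred data: rows of Vt are the principal axes,
    # ordered by decreasing explained variance
    _, _, Vt = np.linalg.svd(Xc, full_matrices=False)
    return Xc @ Vt[:d].T                         # N x d representation
```

Because the singular values are sorted, the retained d components carry the largest share of the variance, which is the "main information" preserved above.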
step 6, feature encoding based on the Fisher vector: modeling the local features with a Gaussian mixture model (GMM) with K Gaussian clusters, and solving the GMM by training on the local feature set with the EM algorithm; then encoding the dimension-reduced feature descriptors with the Fisher vector, the feature dimension obtained after encoding being 2 × d × K;
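Step 6 can be illustrated with a simplified Fisher-vector encoder for a diagonal GMM whose parameters (weights, means, variances) are assumed to have already been fitted by EM; the power and L2 normalisations of the improved Fisher vector are omitted. The output length is 2 × d × K, as stated above.

```python
import numpy as np

def fisher_vector(X, weights, means, covs):
    """Fisher-vector encoding of local descriptors X (N x d) under a diagonal GMM.

    weights : (K,) mixture weights, means : (K, d), covs : (K, d) diagonal variances.
    Returns concatenated first- and second-order statistics of length 2*d*K.
    """
    N, d = X.shape
    K = len(weights)
    # posterior responsibilities gamma[n, k] (log-domain for stability)
    log_p = np.stack([
        -0.5 * (((X - means[k]) ** 2) / covs[k] + np.log(2 * np.pi * covs[k])).sum(1)
        + np.log(weights[k]) for k in range(K)], axis=1)
    log_p -= log_p.max(1, keepdims=True)
    gamma = np.exp(log_p)
    gamma /= gamma.sum(1, keepdims=True)
    fv = []
    for k in range(K):
        diff = (X - means[k]) / np.sqrt(covs[k])
        g = gamma[:, k:k + 1]
        fv.append((g * diff).sum(0) / (N * np.sqrt(weights[k])))                 # 1st order
        fv.append((g * (diff ** 2 - 1)).sum(0) / (N * np.sqrt(2 * weights[k])))  # 2nd order
    return np.concatenate(fv)
```

Each Gaussian contributes one d-dimensional first-order and one d-dimensional second-order block, which is where the 2 × d × K count comes from.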
step 7, finally, sending the obtained encoded feature descriptors into an SVM classifier for classification.
In the above technical solution, the improved ViBe method includes the following steps:
step 1.1, initializing a background, and selecting the first n frames of a video by using a multi-frame averaging method to construct an initial background B0;
step 1.2, for each pixel (x, y) of the initial background B0, creating a sample set M(x, y) = {v_1, v_2, ..., v_N}, where v_i is a random sample from the 8-neighborhood of (x, y), i = 1, 2, ..., N;
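Steps 1.1 and 1.2 can be sketched as follows; N = 20 samples per pixel is the default from the original ViBe paper, an assumption here:

```python
import numpy as np

rng = np.random.default_rng(0)

def init_background(frames):
    """Step 1.1: multi-frame average of the first n frames as the initial background B0."""
    return np.mean(frames, axis=0)

def init_samples(b0, n_samples=20):
    """Step 1.2: for each pixel, draw N random samples from its 8-neighbourhood."""
    h, w = b0.shape
    # offsets of the 8-neighbourhood
    offs = [(-1, -1), (-1, 0), (-1, 1), (0, -1), (0, 1), (1, -1), (1, 0), (1, 1)]
    dy_tab = np.array([o[0] for o in offs])
    dx_tab = np.array([o[1] for o in offs])
    samples = np.empty((h, w, n_samples), dtype=b0.dtype)
    for i in range(n_samples):
        pick = rng.integers(0, 8, size=(h, w))      # random neighbour per pixel
        yy = np.clip(np.arange(h)[:, None] + dy_tab[pick], 0, h - 1)
        xx = np.clip(np.arange(w)[None, :] + dx_tab[pick], 0, w - 1)
        samples[..., i] = b0[yy, xx]
    return samples
```

The averaging suppresses transient foreground in the first n frames, and neighbourhood sampling gives each pixel a plausible set of background values even on its first appearance.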
step 1.3, for the i-th frame image f_i (i = 2, 3, ..., n), calculating:

TB_i = F_OTSU(abs(f_i − B_rd))

TF_i = F_OTSU(abs(f_i − f_{i-1}))

R_i = α·TF_i + (1 − α)·TB_i

wherein B_rd represents the background composed of the rd-th sample of each sample set, rd being a value randomly selected from {1, 2, ..., N}; F_OTSU(·) represents the segmentation threshold calculated by the OTSU method; TB_i represents the segmentation threshold of the background-difference result calculated by the OTSU method; Inf_i(x, y) represents the binarization result of the i-th frame image at (x, y); TF_i represents the segmentation threshold of the frame-difference result calculated by the OTSU method; R_i represents the value of the radius threshold R for the i-th frame; and α is a weighting coefficient, generally taken as 0.5;
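The adaptive radius threshold of step 1.3 can be sketched with a plain Otsu implementation on 8-bit difference images; α = 0.5 is the assumed default:

```python
import numpy as np

def otsu_threshold(img):
    """Otsu's method: the threshold maximising between-class variance."""
    hist = np.bincount(img.astype(np.uint8).ravel(), minlength=256).astype(float)
    prob = hist / hist.sum()
    best_t, best_var = 0, -1.0
    for t in range(1, 256):
        w0, w1 = prob[:t].sum(), prob[t:].sum()
        if w0 == 0 or w1 == 0:
            continue                                   # one class empty: skip
        mu0 = (np.arange(t) * prob[:t]).sum() / w0
        mu1 = (np.arange(t, 256) * prob[t:]).sum() / w1
        var = w0 * w1 * (mu0 - mu1) ** 2               # between-class variance
        if var > best_var:
            best_var, best_t = var, t
    return best_t

def adaptive_radius(frame, prev_frame, bg_rd, alpha=0.5):
    """R_i = alpha*TF_i + (1 - alpha)*TB_i, per the formula above."""
    tb = otsu_threshold(np.abs(frame - bg_rd))         # background-difference threshold
    tf = otsu_threshold(np.abs(frame - prev_frame))    # frame-difference threshold
    return alpha * tf + (1 - alpha) * tb
```

Blending the frame-difference and background-difference thresholds lets R_i adapt to both illumination change and motion speed.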
If Inf_i(x, y) = 1, the following processing is performed:
step 1.3.1, judging whether the current pixel (x, y) is background by calculating the degree of similarity between the current pixel (x, y) and its corresponding sample set, specifically:

cnt_j = 1 if dis(f_i(x, y), v_j) < R_i, and cnt_j = 0 otherwise

wherein cnt_j represents the comparison result between the current pixel (x, y) and the j-th background sample pixel in the background sample set; if the sum of the comparison results between the current pixel and all background pixels in the sample set is greater than or equal to a threshold T, the current pixel is judged to be a background pixel, and otherwise a foreground pixel; f_i represents the i-th video frame, i.e. the frame containing the current pixel; dis represents the Euclidean distance between two pixels; v_j represents the j-th pixel in the background sample set;

DB_i(x, y) represents the judgment result of whether pixel (x, y) in the i-th frame image is a foreground or background pixel; the current pixel is a foreground pixel when DB_i(x, y) = 1;
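For a grayscale pixel, the background test of step 1.3.1 reduces to counting samples within the radius R_i; the match threshold T = 2 is the original ViBe default, an assumption here:

```python
import numpy as np

def classify_pixel(pixel, samples, radius, t_min=2):
    """Return DB_i(x, y): 1 for foreground, 0 for background.

    pixel   : current grey value f_i(x, y)
    samples : the N background samples of M(x, y)
    radius  : the adaptive threshold R_i
    """
    # for scalar grey values the Euclidean distance is the absolute difference
    cnt = np.sum(np.abs(samples - pixel) < radius)
    return 0 if cnt >= t_min else 1
```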
When the current pixel (x, y) is a background pixel, i.e. DB_i(x, y) = 0, the background is updated with probability 1/θ; the update consists of two parts, updating the current sample set and updating the neighborhood, where θ is a time sampling factor;
First, the sample set update: the pixel value f_i(x, y) of the current pixel (x, y) replaces a randomly selected sample v_rd in its corresponding background sample set M(x, y), i.e. v_rd = f_i(x, y);

Second, the neighborhood update: a position (x_1, y_1) is randomly selected in the 8-neighborhood of the current pixel (x, y), and a randomly selected sample v_rd in its background sample set M(x_1, y_1) is replaced by the current pixel value, i.e. v_rd = f_i(x, y).
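The two-part conservative update can be sketched as follows; the time sampling factor θ = 16 is the original ViBe default, an assumption here:

```python
import numpy as np

rng = np.random.default_rng(1)

def update_background(samples, x, y, value, theta=16):
    """With probability 1/theta, refresh the background model at a background pixel.

    samples : H x W x N array of per-pixel sample sets
    value   : the current pixel value f_i(x, y)
    """
    n = samples.shape[2]
    if rng.random() < 1.0 / theta:
        # current sample-set update: overwrite one random sample at (x, y)
        samples[y, x, rng.integers(n)] = value
        # neighbourhood update: overwrite one random sample of a random 8-neighbour
        h, w = samples.shape[:2]
        dy, dx = rng.choice([-1, 0, 1]), rng.choice([-1, 0, 1])
        while dy == 0 and dx == 0:
            dy, dx = rng.choice([-1, 0, 1]), rng.choice([-1, 0, 1])
        y1 = min(max(y + dy, 0), h - 1)
        x1 = min(max(x + dx, 0), w - 1)
        samples[y1, x1, rng.integers(n)] = value
    return samples
```

The neighbourhood update is what lets the model absorb newly revealed background (for example, after the panda moves away) without explicit ghost handling.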
Claims (2)
1. A giant panda pacing behavior tracking analysis method is characterized by comprising the following steps:
step 1, inputting panda video images, and extracting foreground targets of the video frames by using an improved ViBe method;
step 2, performing morphological erosion and dilation on the extracted foreground template;
step 3, taking the minimum bounding rectangle of the contour with the largest area among the connected regions as the target region, and taking the centroid of the target region as the position of the target;
step 4, performing the same operations as steps 1-3 on each frame of the image, outputting the motion trajectory and motion speed of the panda, and analyzing the behavior;
the improved ViBe method comprises the following steps:
step 1.1, initializing a background, and selecting the first n frames of a video by using a multi-frame averaging method to construct an initial background B0;
step 1.2, for each pixel (x, y) of the initial background B0, creating a sample set M(x, y) = {v_1, v_2, ..., v_N}, where v_i is a random sample from the 8-neighborhood of (x, y), i = 1, 2, ..., N;
step 1.3, for the i-th frame image f_i (i = 2, 3, ..., n), calculating:

TB_i = F_OTSU(abs(f_i − B_rd))

TF_i = F_OTSU(abs(f_i − f_{i-1}))

R_i = α·TF_i + (1 − α)·TB_i

wherein B_rd represents the background composed of the rd-th sample of each sample set, rd being a value randomly selected from {1, 2, ..., N}; F_OTSU(·) represents the segmentation threshold calculated by the OTSU method; TB_i represents the segmentation threshold of the background-difference result calculated by the OTSU method; Inf_i(x, y) represents the binarization result of the i-th frame image at (x, y); TF_i represents the segmentation threshold of the frame-difference result calculated by the OTSU method; R_i represents the value of the radius threshold R for the i-th frame; and α is a weighting coefficient;
step 1.3.1, judging whether the current pixel (x, y) is background by calculating the degree of similarity between the current pixel (x, y) and its corresponding sample set, specifically:

cnt_j = 1 if dis(f_i(x, y), v_j) < R_i, and cnt_j = 0 otherwise

wherein cnt_j represents the comparison result between the current pixel (x, y) and the j-th background sample pixel in the background sample set; if the sum of the comparison results between the current pixel and all background pixels in the sample set is greater than or equal to a threshold T, the current pixel is judged to be a background pixel, and otherwise a foreground pixel; f_i represents the i-th video frame, i.e. the frame containing the current pixel; dis represents the Euclidean distance between two pixels; v_j represents the j-th pixel in the background sample set;

DB_i(x, y) represents the judgment result of whether pixel (x, y) in the i-th frame image is a foreground or background pixel; the current pixel is a foreground pixel when DB_i(x, y) = 1;
when the current pixel (x, y) is a background pixel, i.e. DB_i(x, y) = 0, the background is updated with probability 1/θ; the update consists of two parts, updating the current sample set and updating the neighborhood, where θ is a time sampling factor;
first, the sample set update: the pixel value f_i(x, y) of the current pixel (x, y) replaces a randomly selected sample v_rd in its corresponding background sample set M(x, y), i.e. v_rd = f_i(x, y);
2. A panda bamboo eating and oestrus behavior recognition method is characterized by comprising the following steps:
step 1, inputting panda video images, and extracting foreground targets of the video frames by using an improved ViBe method to obtain foreground target images;
step 2, constructing a multi-scale spatial pyramid in the foreground target image, acquiring candidate points of dense trajectories through dense sampling, and extracting dense trajectories at different spatial scales;
step 3, using u_t to represent the horizontal component and v_t the vertical component of the optical flow field, so that ω_t = (u_t, v_t) represents the dense optical flow field between the t-th frame and the (t+1)-th frame image; for a feature point P_t = (x_t, y_t) on the t-th frame image, the optical flow field ω_t is smoothed by a median filter M, and the position on the (t+1)-th frame corresponding to the smoothed point is defined as:

P_{t+1} = (x_{t+1}, y_{t+1}) = (x_t, y_t) + (M ∗ ω_t)|_(x̄_t, ȳ_t)

wherein (x̄_t, ȳ_t) represents the center of the circular filter region around (x_t, y_t), ω_t is the optical flow field, and M is the median filter; the feature points tracked in the subsequent frames are concatenated to form the motion trajectory (P_t, P_{t+1}, ......);
step 4, tracking the feature points in the optical flow field to form motion trajectories, constraining the tracking length to L frames to avoid the tracking drift caused by long-term tracking; constructing feature descriptors along the dense trajectories, using HOG and the trajectory shape as shape descriptors and HOF and MBH as motion descriptors;
step 5, reducing the dimension of the obtained feature descriptors by principal component analysis (PCA), mapping the data from a high-dimensional space to a low-dimensional space while preserving as much of the principal information as possible, to obtain feature descriptors of feature dimension d after reduction;
step 6, feature encoding based on the Fisher vector: modeling the local features with a Gaussian mixture model (GMM) with K Gaussian clusters, and solving the GMM by training on the local feature set with the EM algorithm; then encoding the dimension-reduced feature descriptors with the Fisher vector, the feature dimension obtained after encoding being 2 × d × K;
step 7, finally, sending the obtained encoded feature descriptors into an SVM classifier for classification;
the improved ViBe method comprises the following steps:
step 1.1, initializing a background, and selecting the first n frames of a video by using a multi-frame averaging method to construct an initial background B0;
step 1.2, for each pixel (x, y) of the initial background B0, creating a sample set M(x, y) = {v_1, v_2, ..., v_N}, where v_i is a random sample from the 8-neighborhood of (x, y), i = 1, 2, ..., N;
step 1.3, for the i-th frame image f_i (i = 2, 3, ..., n), calculating:

TB_i = F_OTSU(abs(f_i − B_rd))

TF_i = F_OTSU(abs(f_i − f_{i-1}))

R_i = α·TF_i + (1 − α)·TB_i

wherein B_rd represents the background composed of the rd-th sample of each sample set, rd being a value randomly selected from {1, 2, ..., N}; F_OTSU(·) represents the segmentation threshold calculated by the OTSU method; TB_i represents the segmentation threshold of the background-difference result calculated by the OTSU method; Inf_i(x, y) represents the binarization result of the i-th frame image at (x, y); TF_i represents the segmentation threshold of the frame-difference result calculated by the OTSU method; R_i represents the value of the radius threshold R for the i-th frame; and α is a weighting coefficient;
step 1.3.1, judging whether the current pixel (x, y) is background by calculating the degree of similarity between the current pixel (x, y) and its corresponding sample set, specifically:

cnt_j = 1 if dis(f_i(x, y), v_j) < R_i, and cnt_j = 0 otherwise

wherein cnt_j represents the comparison result between the current pixel (x, y) and the j-th background sample pixel in the background sample set; if the sum of the comparison results between the current pixel and all background pixels in the sample set is greater than or equal to a threshold T, the current pixel is judged to be a background pixel, and otherwise a foreground pixel; f_i represents the i-th video frame, i.e. the frame containing the current pixel; dis represents the Euclidean distance between two pixels; v_j represents the j-th pixel in the background sample set;

DB_i(x, y) represents the judgment result of whether pixel (x, y) in the i-th frame image is a foreground or background pixel; the current pixel is a foreground pixel when DB_i(x, y) = 1;
when the current pixel (x, y) is a background pixel, i.e. DB_i(x, y) = 0, the background is updated with probability 1/θ; the update consists of two parts, updating the current sample set and updating the neighborhood, where θ is a time sampling factor;
first, the sample set update: the pixel value f_i(x, y) of the current pixel (x, y) replaces a randomly selected sample v_rd in its corresponding background sample set M(x, y), i.e. v_rd = f_i(x, y);
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202010510090.4A CN111666881B (en) | 2020-06-08 | 2020-06-08 | Giant panda pacing, bamboo eating and estrus behavior tracking analysis method |
Publications (2)
Publication Number | Publication Date |
---|---|
CN111666881A CN111666881A (en) | 2020-09-15 |
CN111666881B true CN111666881B (en) | 2023-04-28 |
Family
ID=72386859
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202010510090.4A Active CN111666881B (en) | 2020-06-08 | 2020-06-08 | Giant panda pacing, bamboo eating and estrus behavior tracking analysis method |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN111666881B (en) |
Families Citing this family (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN112016527B (en) * | 2020-10-19 | 2022-02-01 | 成都大熊猫繁育研究基地 | Panda behavior recognition method, system, terminal and medium based on deep learning |
CN113963298A (en) * | 2021-10-25 | 2022-01-21 | 东北林业大学 | Wild animal identification tracking and behavior detection system, method, equipment and storage medium based on computer vision |
Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN108230364A (en) * | 2018-01-12 | 2018-06-29 | 东南大学 | A kind of foreground object motion state analysis method based on neural network |
CN109377517A (en) * | 2018-10-18 | 2019-02-22 | 哈尔滨工程大学 | A kind of animal individual identifying system based on video frequency tracking technology |
CN110060278A (en) * | 2019-04-22 | 2019-07-26 | 新疆大学 | The detection method and device of moving target based on background subtraction |
Family Cites Families (14)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
EP2015252B1 (en) * | 2007-07-08 | 2010-02-17 | Université de Liège | Visual background extractor |
EP2834722A1 (en) * | 2012-04-02 | 2015-02-11 | Thomson Licensing | Method for calibration free gaze tracking using low cost camera |
CN103125443B (en) * | 2013-03-06 | 2014-05-07 | 成都大熊猫繁育研究基地 | Method for timely releasing panda pairs to allow natural mating |
CN104331905A (en) * | 2014-10-31 | 2015-02-04 | 浙江大学 | Surveillance video abstraction extraction method based on moving object detection |
EP4250144A3 (en) * | 2015-10-14 | 2024-01-17 | President And Fellows Of Harvard College | Automatically classifying animal behavior |
CN105741319B (en) * | 2016-01-22 | 2018-05-08 | 浙江工业大学 | Improvement visual background extracting method based on blindly more new strategy and foreground model |
CN106571014A (en) * | 2016-10-24 | 2017-04-19 | 上海伟赛智能科技有限公司 | Method for identifying abnormal motion in video and system thereof |
US10706558B2 (en) * | 2017-08-04 | 2020-07-07 | Université de Liège | Foreground and background detection method |
CN108198207A (en) * | 2017-12-22 | 2018-06-22 | 湖南源信光电科技股份有限公司 | Multiple mobile object tracking based on improved Vibe models and BP neural network |
CN108346160A (en) * | 2017-12-22 | 2018-07-31 | 湖南源信光电科技股份有限公司 | The multiple mobile object tracking combined based on disparity map Background difference and Meanshift |
CN109614928B (en) * | 2018-12-07 | 2020-03-10 | 成都大熊猫繁育研究基地 | Panda face recognition method based on limited training data |
CN109670440B (en) * | 2018-12-14 | 2023-08-08 | 央视国际网络无锡有限公司 | Identification method and device for big bear cat face |
CN111144236B (en) * | 2019-12-10 | 2024-04-26 | 华南师范大学 | Cockroach mating behavior analysis method, system and storage medium |
CN110931024B (en) * | 2020-02-18 | 2020-05-22 | 成都大熊猫繁育研究基地 | Audio-based prediction method and system for natural mating result of captive pandas |
Patent Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN108230364A (en) * | 2018-01-12 | 2018-06-29 | 东南大学 | A kind of foreground object motion state analysis method based on neural network |
CN109377517A (en) * | 2018-10-18 | 2019-02-22 | 哈尔滨工程大学 | A kind of animal individual identifying system based on video frequency tracking technology |
CN110060278A (en) * | 2019-04-22 | 2019-07-26 | 新疆大学 | The detection method and device of moving target based on background subtraction |
Also Published As
Publication number | Publication date |
---|---|
CN111666881A (en) | 2020-09-15 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN106778902A (en) | Milk cow individual discrimination method based on depth convolutional neural networks | |
CN111882560B (en) | Lung parenchyma CT image segmentation method based on weighted full convolution neural network | |
CN111666881B (en) | Giant panda pacing, bamboo eating and estrus behavior tracking analysis method | |
Yin et al. | Recognition of sick pig cough sounds based on convolutional neural network in field situations | |
CN103854027A (en) | Crowd behavior identification method | |
Hu et al. | Dual attention-guided feature pyramid network for instance segmentation of group pigs | |
CN109902564A (en) | A kind of accident detection method based on the sparse autoencoder network of structural similarity | |
Chen et al. | A kinetic energy model based on machine vision for recognition of aggressive behaviours among group-housed pigs | |
Christ et al. | Segmentation of brain tumors using meta heuristic algorithms | |
El-Henawy et al. | Cattle identification using segmentation-based fractal texture analysis and artificial neural networks | |
Yu et al. | A recognition method of soybean leaf diseases based on an improved deep learning model | |
CN115830490A (en) | Multi-target tracking and behavior statistical method for herd health pigs | |
CN114092450A (en) | Real-time image segmentation method, system and device based on gastroscopy video | |
CN114822823B (en) | Tumor fine classification system based on cloud computing and artificial intelligence fusion multi-dimensional medical data | |
Rodriguez et al. | Automated video monitoring of unmarked and marked honey bees at the hive entrance | |
CN112381818B (en) | Medical image identification enhancement method for subclass diseases | |
El-Henawy et al. | A new muzzle classification model using decision tree classifier | |
CN115830078B (en) | Multi-target pig tracking and behavior recognition method, computer equipment and storage medium | |
CN107194352A (en) | A kind of pedestrian counting method of video monitoring, apparatus and system | |
CN113449712B (en) | Goat face identification method based on improved Alexnet network | |
CN116509335A (en) | Gradual evolution automatic sleep stage-dividing method | |
Li et al. | Recognition of fine-grained sow nursing behavior based on the SlowFast and hidden Markov models | |
Kim et al. | Automatic identification of a coughing animal using audio and video data | |
Nasiri et al. | An automated video action recognition-based system for drinking time estimation of individual broilers | |
Johnston et al. | Robust population single neuronal calcium signal extraction using SCOUT allows for longitudinal analysis of behavior-associated neural ensemble dynamics |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||