CN102098449B

CN102098449B - A kind of method utilizing Mark Detection to carry out TV programme automatic inside segmentation

Info

Publication number: CN102098449B
Application number: CN201010574074.8A
Authority: CN
Inventors: 董远; 肖国锐
Original assignee: Beijing University of Posts and Telecommunications
Current assignee: Beijing University of Posts and Telecommunications
Priority date: 2010-12-06
Filing date: 2010-12-06
Publication date: 2016-08-03
Anticipated expiration: 2030-12-06
Also published as: CN102098449A

Abstract

The present invention relates to Video processing and mode identification technology, it is proposed that a kind of method utilizing Mark Detection to carry out TV programme automatic inside segmentation.At present, the inside division of TV programme has urgent needs；It is good structural that the time discontinuity of program sign makes program have.The method of the invention mainly comprises the steps that program video shot segmentation, each camera lens are extracted key frame and the subgraph of program sign region by (1)；(2) extract the characteristic vector of subgraph, use the SVM classifier for program sign to classify；(3) statistical mark classification results, demarcates the flag property of each camera lens；(4) the flag property shear point at adjacent camera lens splits video.Key frame, during Mark Detection, is only processed, thus improves the efficiency of method by the present invention；It addition, the main application of the present invention is the TV programme that program sign has discontinuity, to program content type no requirement (NR), enhance the universality of method application.

Description

A kind of method utilizing Mark Detection to carry out TV programme automatic inside segmentation

Technical field

The invention belongs to Video processing and mode identification technology, be specifically related to a kind of method utilizing Mark Detection to carry out TV programme automatic inside segmentation.

Background technology

At present, radio and television are all producing the video of magnanimity every day, and give electric program menu.Widely available along with Web TV and DTV, more preferably views and admires impression to provide, and many TV programme attempt, by internal paragraph segmentation, providing program-internal rating and instructing.Meanwhile, the inside division of program is also the premise of further content analysis and retrieval.In the face of the video of magnanimity, artificial mark segmentation can not meet timeliness requirement, and what machine completed is partitioned into urgent needs automatically.Video structure analyzing refers to that video flowing carries out shot segmentation, key-frame extraction and scene cut etc. to be processed, thus obtains the structured message of video.Scene cut is concentrated mainly on scene clustering, repeats Video Detection, and shot similarity ratio is on counterpart method, the most more complicated.Currently, increasing TV programme pay close attention to intellectual property when using station symbol or the own mark of program: at the video paragraph of the non-own property right of program-internal, such as advertisement, the vidclip etc. quoted, will not load these marks；And use the video paragraph of mark, it is common that teaser or tail, interview part, or other fragments recorded by this program oneself.It is the strongest structural that mark discontinuity in time series makes TV programme have, and the inside division for TV programme provides foundation.

Summary of the invention

For a certain specific television program, its station symbol or program sign, referred to collectively below as mark, has temporal discontinuity, and the present invention provides a kind of to this kind of TV programme inside division method, reaches segmentation effect quickly and accurately.

The key step of TV programme automatic inside segmentation method of the present invention is as follows:

Step one, utilizes a kind of existing shot segmentation technique that television program video is carried out shot segmentation, it is thus achieved that the shot sequence information of Time Continuous；

Step 2, takes 5 frame key frames, and extracts the subgraph of the rectangular area of special sign position in all key frames each camera lens temporally average mode；

Step 3, extracts the image feature vector of all subgraphs of training set, and the sub-pictures containing mark is positive sample, and the sub-pictures without mark is negative sample, and training obtains SVM classifier；

Step 4, to this program video to be split, all subgraphs are obtained through step (1) and (2), extract the image feature vector identical with step (3), the SVM classifier obtained by step (3) is classified, and obtains the classification results of each subgraph；

Step 5, labelling camera lens flag property, if at least 3 frame subgraphs are judged as existing mark in camera lens, then this camera lens of labelling is mark camera lens, is otherwise labeled as non-mark camera lens；

Step 6, program video inside division, video has the adjacent shot boundary of unlike signal attribute as cut-point, Video segmentation is become paragraph.

Accompanying drawing explanation

Fig. 1 is television program structure exemplary plot of the present invention.

Fig. 2 is the basic flow sheet of the method for the invention.

Detailed description of the invention

As shown in Fig. 2 flow chart, the method for the invention comprises two stages: off-line training grader and online treatment video to be split.Two stages common step is shot segmentation, extracts 5 frame key frames and specific region subgraph thereof.It it is below method detailed description of the invention.

(1) shot segmentation step is to utilize existing a kind of shot segmentation algorithm, as based on rectangular histogram, based on moving and for the algorithm compressing video, specific television program video slicing becoming the shot sequence of Time Continuous.

(2) each camera lens is temporally divided into 6 sections, takes 5 two field pictures of adjacent segment as key frame；For these TV programme, known special sign determining the rectangular area at its place, mark is surrounded by this rectangle just completely, and rectangular coordinates is (x, y, w, h), wherein x, y is respectively the transverse and longitudinal coordinate of rectangle upper left angle point, and w, h are respectively width and the height of rectangle；To this rectangle of all key-frame extraction, referred to as subgraph.

(3) three kinds of image feature vectors of all subgraphs are extracted: HSV space color histogram, edge gradient rectangular histogram, SIFT feature point rectangular histogram based on word bag model；Then three kinds of features are connected, form last image feature vector.Specific features extracting method is as follows:

1. color histogram extracts

Subgraph is extracted hsv color statistic histogram, and wherein H space is divided into 8 intervals, and S space is divided into 3 intervals, and V space is divided into 3 spaces, by rectangular histogram normalization, forms the characteristic vector of 72 dimensions；

2. edge gradient rectangular histogram is extracted

Subgraph is extracted edge gradient rectangular histogram, and every 5 degree is an interval, the gradient in each interval range cumulative, by rectangular histogram normalization, forms the characteristic vector of 72 dimensions；

3. SIFT feature point rectangular histogram based on word bag model is extracted

Extract all subgraph SIFT feature vector；Use the K means clustering algorithm SIFT feature vector clusters to training set data, obtain 64 cluster centres, as the code book of word bag model；By all SIFT feature vector projections of each subgraph to code book, form the rectangular histogram of 64 dimensions and do normalization, obtaining characteristic vector；

4. three of the above characteristic vector is contacted, form the characteristic vector of 208 last dimensions.

(4) SVM classifier of off-line training mark, inputs the training of SVM instrument by the image feature vector of the positive negative sample of training set, and in training, positive and negative collection number of samples is all higher than 1000 herein, and SVM selects kernel function based on card side's distance；The training of this step obtains the grader for this TV programme special sign.

(5) subgraph to video to be split extracts the image feature vector identical with step (3), totally 208 dimension；Wherein, the code book of the needs forming the histogram feature vector of SIFT is the code book used in step (3), training set obtain through K Mean Method cluster.

(6) characteristic vector that step (5) is obtained by the SVM classifier using step (4) to obtain is classified, and classification results demarcates whether each subgraph exists mark.

(7) checked that if greater than equal to 3, then this camera lens of labelling is mark camera lens, and otherwise this camera lens of labelling is non-mark camera lens containing the number of key frames indicated in each camera lens by step (6) result.

(8) check the camera lens labelling of video to be split by camera lens, if adjacent two camera lens labellings are different, then using the border of the two camera lens as a cut-point, until the complete all adjacent camera lenses of sequential search, finally this program video inside division completes.

Claims

1. utilizing the method that Mark Detection carries out TV programme automatic inside segmentation, its feature is, the method includes:

Step 3, extracts the image feature vector of all subgraphs of training set, and the subgraph containing mark is positive sample, and the subgraph without mark is negative sample, and training obtains SVM classifier；

Step 4, to this program video to be split, obtains all subgraphs through step one and step 2, extracts the image feature vector identical with step 3, and the SVM classifier obtained by step 3 is classified, and obtains the classification results of each subgraph；

Step 6, program video inside division, video has the adjacent shot boundary of unlike signal attribute as cut-point, Video segmentation is become paragraph；

Wherein, described step 2 specifically includes:

Step 1, is temporally divided into 6 sections by each camera lens, takes 5 two field pictures of adjacent segment as key frame；

Step 2, for these TV programme, determines the rectangular area at its place to known special sign, and mark is surrounded by this rectangle just completely, rectangular coordinates be (x, y, w, h), wherein x, y are respectively the transverse and longitudinal coordinate of rectangle upper left angle point, and w, h are respectively width and the height of rectangle；

Step 3, to this rectangle of all key-frame extraction, referred to as subgraph；

Wherein, described step 3 specifically includes:

Step 1, extracts hsv color statistic histogram to subgraph, and wherein H space is divided into 8 intervals, and S space is divided into 3 intervals, and V space is divided into 3 spaces, by rectangular histogram normalization, forms the characteristic vector of 72 dimensions；

Step 2, extracts edge gradient rectangular histogram to subgraph, and every 5 degree is an interval, the gradient in each interval range cumulative, by rectangular histogram normalization, forms the characteristic vector of 72 dimensions；

Step 3, extract all subgraph SIFT feature vector, use the K means clustering algorithm SIFT feature vector clusters to training set data, obtain 64 cluster centres, as code book, by all SIFT feature vector projections of each subgraph to code book, form the rectangular histogram of 64 dimensions and do normalization, obtaining characteristic vector；

Step 4, contacts three of the above characteristic vector, forms the characteristic vector of 208 last dimensions；

Step 5, uses the characteristic vector training SVM classifier of training set sample, and in training, positive and negative collection number of samples is all higher than 1000 herein, and SVM selects kernel function based on card side's distance, the training of this step to obtain the grader for this TV programme special sign.