CN102685574A

CN102685574A - System for automatically extracting images from digital television program and application thereof

Info

Publication number: CN102685574A
Application number: CN2011100555321A
Authority: CN
Inventors: 须泽中
Original assignee: Individual
Current assignee: Individual
Priority date: 2011-03-09
Filing date: 2011-03-09
Publication date: 2012-09-19

Abstract

The invention discloses a system for automatically extracting images from a digital television program and applications of the system for automatically extracting the images from the digital television program. The system provided by the invention comprises an image distinguishing module, a subtitle distinguishing module, a voice distinguishing module and a key video frame and voice extracting module, wherein the image distinguishing module analyzes video data through scene change detection, face detection, movement detection and character object detection; the subtitle distinguishing module confirms whether subtitle exists at present; the voice distinguishing module judges whether currently received voice frequency data is voice; and the key video frame and voice extracting module receives analysis and detection results for the video data, distinguishing results for the subtitle and the distinguishing results for the voice and extracts key video frames and relevant voices according to the arrangement of a user and the analysis and detection results for the video data, distinguishing results for the subtitle and the distinguishing results for the voice. The system provided by the invention can automatically extract the 'key frames' representing the understanding of the content from video signals of a digital television. Besides, the system provided by the invention can be applied to electronic books and cartoon and can be used for manufacturing a video document which has low resolution and can display images at fixed period.

Description

The system of Automatic Extraction image and application thereof from digital television program

Technical field

The present invention relates to the digital television techniques field, particularly relate to a kind of system and application thereof of Automatic Extraction image from digital television program.

Background technology

Video is appreciated that and is continuous images sequence in time, finds with test by inquiry, and when the video content of DTV is understood, be not that all images all play a part no less important.The real image that the understanding of digital TV video frequency is played an important role has following several types:

● the frame of corresponding captions is arranged;

● the frame that occurrence scene switches;

● the frame that has people's face to occur;

● the frame that has who object to occur;

● the frame of certain movement amplitude is arranged;

● the frame that has corresponding voice to occur.

Above-mentioned image can be used as and judges whether to be the play an important role condition of image of the understanding to digital TV video frequency; Can make up according to user's demand; Finally define " key frame ", should " key frame " meet one of them conditioned disjunction full terms at least.

Therefore, how Automatic Extraction goes out the understanding of content significant " key frame " from the vision signal of DTV, just the problem paid close attention to of the present invention and insider.

Summary of the invention

The technical problem that the present invention will solve provide a kind of from digital television program the system of Automatic Extraction image, can from the vision signal of DTV, Automatic Extraction go out " key frame " that the understanding of content is had the meaning represented, simple in structure, realize easily.

For solving the problems of the technologies described above; Of the present invention from digital television program the system of Automatic Extraction image; Comprise receiving terminal for digital television, this receiving terminal for digital television comprises demultiplexing module, video decode module, subtitle resolving module, audio decoder module, video display module and audio frequency display module; Wherein, also comprise:

The image discriminating module is connected with said video decode module, is used for the output frame of receiver, video decoder module; Analyze video data through scene change detection, the detection of people's face, motion detection and who object detection algorithm;

The captions discrimination module is connected with said subtitle resolving module, receives the analysis result of subtitle resolving module, confirms whether exist at the current time captions;

The voice discrimination module is connected with said audio decoder module, receives the voice data of audio decoder module output, judges whether the current voice data of receiving is speech;

Key video sequence frame and speech extraction module are connected with said image discriminating module, captions discrimination module, voice discrimination module, video decode module and audio decoder module; The result of receiving video data analyzing and testing, captions discrimination result and speech discrimination result; And Voice & Video data; Judge whether the current video that receives has captions; Whether the current video that receives has corresponding speech, extracts key video sequence frame and associated voice according to user's the setting and result, captions discrimination result and the speech discrimination result of video data analyzing and testing.

Adopt of the present invention from digital television program the system of Automatic Extraction image; Can from the vision signal of DTV, Automatic Extraction go out " key frame " that the understanding of content is had the meaning represented; Can reduce resolution and carry out special effect treatment these key frames, like effects such as " oil painting ", " cartoons "; Be equipped with captions and voice again, under the understandable prerequisite of the substance of DTV, greatly reduce data volume and increase interest, final purposes comprises:

● cooperate captions and voice to be made into e-book;

● cooperate captions to print and be made into cartoon;

● cooperate captions and be made into little resolution with voice after multiplexing, the video file of regular display image is play on mobile device.

Description of drawings

Below in conjunction with accompanying drawing and embodiment the present invention is done further detailed explanation:

Fig. 1 is a structural principle block diagram of the present invention;

Fig. 2-Fig. 7 adopts the present invention to carry out the effect contrast figure of image processing front and back.

Embodiment

Referring to shown in Figure 1, of the present invention from digital television program the system of Automatic Extraction image comprise receiving terminal for digital television and image discriminating module, captions discrimination module, voice discrimination module and key video sequence frame and speech extraction module.

Said receiving terminal for digital television is the prior standard framework, does not wherein comprise channel tuner, rectification part.Said receiving terminal for digital television comprises demultiplexing module, video decode module, subtitle resolving module, audio decoder module, video display module and audio frequency display module.

Said demultiplexing module is used for resolution system layer such as TS stream (MPEG2 System Layer), isolates audio frequency stream, video-frequency basic flow and caption information basically, is input to audio decoder, Video Decoder and subtitle resolving module respectively.

Said video decode module is connected with demultiplexing module, is used for video compression stream decoding back output is used for the video data of reprocessing or broadcast.

Said subtitle resolving module is connected with demultiplexing module, is used for converting caption information to can show form.

Said audio decoder module is connected with demultiplexing module, is used for audio compression stream decoding back output is used for the voice data of reprocessing or broadcast.

Said video display module is connected with subtitle resolving module with the video decode module, is used for video information and caption information are shown to the user.

Said audio frequency display module is connected with the audio decoder module, is used for the broadcast of audio-frequency information.

Said image discriminating module is connected with said video decode module, is used for the output frame of receiver, video decoder module.Detect scheduling algorithm through scene change detection, the detection of people's face, motion detection and who object and analyze video data, and the result of analyzing and testing is delivered to " key video sequence frame and speech extraction module ".Said image discriminating module can be carried out combination in any (promptly realizing one or more algorithms wherein) to scene change detection, the detection of people's face, motion detection and who object detection algorithm; And realize wherein a kind of algorithm at least but and do not require and realize whole algorithms that concrete combination should be confirmed according to actual needs.

Said captions discrimination module is connected with said subtitle resolving module, receives the analysis result of subtitle resolving module, confirms whether exist at the current time captions, and the result is delivered to " key video sequence and speech extraction module ".

Said voice discrimination module is connected with said audio decoder module, receives the voice data of audio decoder module output, judges whether the current voice data of receiving is speech, and the result is delivered to " key video sequence and speech extraction module ".

Image cartoon processing module is connected with the speech extraction module with said key video sequence frame, receives " key frame " that extract, and " key frame " form with image is appeared.Adopt the Boundary Extraction like image, image special effect Processing Algorithm commonly used such as oil paint effect processing is carried out special effect processing to " key frame ".Under the prerequisite that keeps " key frame " substance, increase the interest of image.Thereby for e-book is made and the cartoon print module provides material.

The e-book manufacturing module is connected with said image cartoon processing module, and " key frame " after image cartoon processing module is handled is made into e-book.

The cartoon print module is connected with said image cartoon processing module, and " key frame " after image cartoon processing module is handled is printed as cartoon.

Graphics/audio code multiplexing module is connected with the speech extraction module with said image cartoon processing module and key video sequence frame, and " key frame " after image cartoon processing module is handled is made into little resolution, regularly the video file of display image.

Adopt system of the present invention, after extracting " key frame " and carrying out special effect treatment, can reach effects such as " oil painting ", " cartoon ".Fig. 2-Fig. 7 adopts the final effect comparison instance of realizing of the present invention.Fig. 2 (a)-Fig. 7 is original video image a), and Fig. 2 (b)-Fig. 4 (b) carries out the oil paint effect image with boundary corresponding after the special effect treatment; Fig. 5 (b) carries out the image that colored thick border corresponding after the special effect treatment proposes; Fig. 6 (b) carries out the image that colored thin border corresponding after the special effect treatment proposes; Fig. 7 (b) carries out sketch effect image corresponding after the special effect treatment.

More than through embodiment and embodiment the present invention has been carried out detailed explanation, but these are not to be construed as limiting the invention.Under the situation that does not break away from the principle of the invention, those skilled in the art also can make many distortion and improvement, and these also should be regarded as protection scope of the present invention.

Claims

1. the system of an Automatic Extraction image from digital television program; Comprise; Receiving terminal for digital television, this receiving terminal for digital television comprise demultiplexing module, video decode module, subtitle resolving module, audio decoder module, video display module and audio frequency display module; It is characterized in that, also comprise:

2. the system of claim 1, it is characterized in that: said image discriminating module can be carried out combination in any to scene change detection, the detection of people's face, motion detection and who object detection algorithm, and realizes wherein a kind of algorithm at least.

3. the application of the described system of claim 1 in e-book.

4. the application of the described system of claim 1 in cartoon.

5. the described system of claim 1 is cooperating captions and is being made into little resolution, the application that the video file of regular display image is play with voice after multiplexing on mobile device.