CN101742289A - Method, system and device for compressing video code stream - Google Patents

Publication number
CN101742289A
Authority
CN
China
Prior art keywords
background area
roi
discardable
image
content
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN200810226180A
Other languages
Chinese (zh)
Other versions
CN101742289B (en)
Inventor
王浩
邱嵩
陈朝武
卢京辉
李晓峰
季鹏飞
孙大瑞
于烨
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Vimicro Corp
Original Assignee
Vimicro Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Vimicro Corp
Priority to CN 200810226180
Publication of CN101742289A
Application granted
Publication of CN101742289B
Legal status: Active
Anticipated expiration

Landscapes

  • Compression Or Coding Systems Of Tv Signals (AREA)

Abstract

The invention discloses a method for compressing a video bitstream. The method comprises the following steps: a front-end device captures an original image; the front-end device extracts a region-of-interest (ROI) image and a background-region image from the original image; the front-end device encodes the ROI image and the background-region image, marks any background region that cannot be referenced as discardable, and sends the encoded data to a video server; and the video server discards the content of the background regions marked as discardable when the video bitstream needs to be reduced. The video bit rate is thereby reduced while the receiving end retains the motion details within the ROI when decoding.

Description

Method, system and device for compressing a video bitstream
Technical field
The present invention relates to the field of image technology, and in particular to a method, system and device for compressing a video bitstream.
Background art
A bitstream produced by video compression generally has to be sent and received over a transmission medium such as a network, for example a private network, the Internet, ADSL or a mobile radio network. Depending on the available network bandwidth, the bitstream may need to be adjusted so that it can be sent over a lower-bandwidth link. The prior art offers several ways of adjusting a bitstream, including reducing the spatial resolution, reducing the frame rate (temporal resolution) and reducing the image quality, all of which reduce the size of the bitstream.
Take the common field of video surveillance as an example. A surveillance camera outputs pictures of 720x576 pixels at 25 frames per second; after passing through a video compression module, the bitstream is stored on a local storage medium (for example, a disk array). When a remote user wants to view the bitstream recorded by this camera over an ADSL connection, the bit rate of the compressed 720x576, 25 frames-per-second video exceeds the ADSL bandwidth, so the complete bitstream cannot be transmitted over the current ADSL network. A relatively simple prior-art remedy is to drop frames, for example by discarding the data of B frames or by discarding non-reference P frames. Because a discarded non-reference P frame is not a reference frame of any other frame, decoding of the other frames is unaffected. The advantage of reducing the bit rate by dropping frames is that the bitstream does not have to be compressed again.
A P frame, also called a forward-predicted frame, is decoded with reference to previously decoded picture frames. In general, the decoded data of a P frame can be used as a reference by other frames (for example a B frame, or a later P frame). By adding to each P frame a flag that indicates whether it is a reference picture, P frames can be divided into two classes: reference P frames, which other frames may use as a reference, and non-reference P frames, which they may not. If the current P frame is a non-reference P frame, subsequent frames cannot use it as a reference frame; if it is a reference P frame, they can. Fig. 1 is a schematic diagram of P frames in the prior art; as the figure shows, non-reference P frames can be discarded to lower the frame rate and thereby reduce the video bit rate.
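The prior-art frame dropping described above can be sketched as follows. This is only an illustrative sketch: the `Frame` class and its `is_reference` field are hypothetical names standing in for the reference-picture flag carried by each frame, and no real codec API is implied.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Frame:
    index: int
    kind: str           # "I", "P" or "B"
    is_reference: bool  # hypothetical flag: may other frames predict from this one?


def drop_non_reference_frames(frames: List[Frame]) -> List[Frame]:
    """Keep only the frames that other frames may depend on."""
    return [f for f in frames if f.is_reference]


# Every second P frame is a non-reference frame and can be dropped.
stream = [
    Frame(0, "I", True),
    Frame(1, "P", False),
    Frame(2, "P", True),
    Frame(3, "P", False),
    Frame(4, "P", True),
]
kept = drop_non_reference_frames(stream)
print([f.index for f in kept])  # [0, 2, 4]
```

The frame rate is halved without re-encoding, but everything that happened in frames 1 and 3, including motion of the monitored object, is lost.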
The shortcoming of the prior art is that dropping frames loses some important motion details of the monitored object, so that its motion appears discontinuous.
Summary of the invention
The purpose of the present invention is to solve at least one of the above technical defects, in particular the defect that dropping frames loses some important motion details of the monitored object and makes its motion appear discontinuous.
To achieve the above purpose, one aspect of the present invention proposes a video bitstream compression method comprising the following steps: a front-end device captures an original image; the front-end device extracts a region-of-interest (ROI) image and a background-region image from the original image; the front-end device encodes the ROI image and the background-region image and sends them to a video server, while marking any background region that cannot be referenced as discardable; and the video server discards the content of the background regions marked as discardable when the video bitstream needs to be reduced.
As one embodiment of the present invention, the step in which the front-end device encodes the ROI image and the background-region image and sends them to the video server, while marking any background region that cannot be referenced as discardable, is specifically: the front-end device encodes the ROI image and the background-region image and sends them to the video server, while adding a discardable flag to the frame header of each non-reference frame.
As one embodiment of the present invention, the method further comprises: when the front-end device performs inter-frame motion prediction and compensation on the current frame, it does not reference background-region data marked as discardable in previously reconstructed images.
As one embodiment of the present invention, the method further comprises: when the front-end device performs intra-frame prediction on the current frame, it must not reference background-region data marked as discardable in that frame.
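The two prediction restrictions above amount to a check on every candidate prediction block. The sketch below is one possible reading, not the encoder of the invention: the rectangle representation, the function names and the conservative rule "a block is allowed only if it lies entirely inside one ROI rectangle" are all assumptions made for illustration.

```python
from typing import List, Tuple

# (left, top, right, bottom) in pixels; right and bottom are exclusive.
Rect = Tuple[int, int, int, int]


def inside(block: Rect, region: Rect) -> bool:
    """True if block lies entirely within region."""
    return (block[0] >= region[0] and block[1] >= region[1]
            and block[2] <= region[2] and block[3] <= region[3])


def may_reference(block: Rect, roi_rects: List[Rect],
                  background_discardable: bool) -> bool:
    """Decide whether a candidate prediction block may be used.

    For inter prediction, pass the flag of the reference frame; for intra
    prediction, pass the flag of the current frame.
    """
    if not background_discardable:
        return True  # the background is guaranteed to reach the decoder
    # Assumed conservative rule: accept only blocks fully inside one ROI.
    return any(inside(block, roi) for roi in roi_rects)


rois = [(16, 16, 80, 80)]
print(may_reference((32, 32, 48, 48), rois, True))   # True: inside the ROI
print(may_reference((72, 32, 88, 48), rois, True))   # False: overlaps background
print(may_reference((72, 32, 88, 48), rois, False))  # True: background is kept
```

With this check in place, discarding a background region marked as discardable cannot leave any later prediction pointing at missing data.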
As one embodiment of the present invention, the step in which the video server discards the content of the background regions marked as discardable when the bitstream needs to be reduced specifically comprises: the video server determines whether the video bitstream needs to be reduced; if so, the video server discards the content of the background regions marked as discardable and then sends the remaining data to a video client; if not, the video server sends all of the background-region and ROI image data received from the front-end device to the video client.
As one embodiment of the present invention, after the video server discards the background regions marked as discardable, the method further comprises: the video server adds a discarded flag to the frame header of the discardable frame.
As one embodiment of the present invention, after the video server discards the content of the background regions marked as discardable and sends the data to the video client, the method further comprises: the video client fills in the content of the background regions marked as discarded.
As one embodiment of the present invention, the method further comprises: adding two parameters to the frame header of the frame, which indicate respectively whether the content of the background region is discardable and whether the content of the background region has been discarded.
As one embodiment of the present invention, adding the two parameters to the frame header of a non-reference frame, which indicate respectively whether the content of the background region is discardable and whether it has been discarded, specifically comprises adding the following definitions to the syntax of the picture parameter set:
pic_parameter_set_rbsp(){                         Descriptor
  pic_parameter_set_id                            ue(v)
  seq_parameter_set_id                            ue(v)
  entropy_coding_mode_flag                        u(1)
  pic_init_qp                                     u(6)
  if(roi_flag){
    num_roi                                       ue(v)
    if(num_roi>0){
      background_discardable                      u(1)
      if(background_discardable){
        non_roi_skip_flag                         u(1)
      }
      pic_init_qp_for_roi                         u(6)
      for(iGroup=0;iGroup<num_roi;iGroup++){
        top_left[iGroup]                          ue(v)
        bottom_right[iGroup]                      ue(v)
      }
    }
  }
  else
    fixed_pic_qp
  loop_filter_disable_flag                        u(1)
  rbsp_trailing_bits()
}
Here, background_discardable indicates whether the content of the background region of the current frame may be discarded, and non_roi_skip_flag indicates whether the content of the background region of the current frame has been discarded.
As one embodiment of the present invention, which frames cannot be referenced is determined by the coding scheme of the front-end device, or by whether the video server actually received the background-region data successfully; in the latter case the video server feeds the reception result back to the front-end device.
In another aspect, the present invention also proposes a video bitstream compression system comprising a video server, at least one front-end device that provides video images to the video server, and a video client. The front-end device is configured to capture an original image, extract an ROI image and a background-region image from the original image, encode the ROI image and the background-region image and send them to the video server, while marking any background region that cannot be referenced as discardable. The video server is configured to, when the video bitstream needs to be reduced, discard the content of the background regions marked as discardable, send the remaining data to the video client, and mark those discardable background regions as discarded. The video client is configured to receive the background-region and ROI image data sent by the video server and, when a background region is marked as discarded, to fill in the content of that background region.
As one embodiment of the present invention, two parameters are added to the frame header of a non-reference frame, indicating respectively whether the content of the background region is discardable and whether the content of the background region has been discarded.
As one embodiment of the present invention, the two parameters are added to the frame header of the non-reference frame by means of the following picture parameter set syntax definition:
pic_parameter_set_rbsp(){                         Descriptor
  pic_parameter_set_id                            ue(v)
  seq_parameter_set_id                            ue(v)
  entropy_coding_mode_flag                        u(1)
  pic_init_qp                                     u(6)
  if(roi_flag){
    num_roi                                       ue(v)
    if(num_roi>0){
      background_discardable                      u(1)
      if(background_discardable){
        non_roi_skip_flag                         u(1)
      }
      pic_init_qp_for_roi                         u(6)
      for(iGroup=0;iGroup<num_roi;iGroup++){
        top_left[iGroup]                          ue(v)
        bottom_right[iGroup]                      ue(v)
      }
    }
  }
  else
    fixed_pic_qp
  loop_filter_disable_flag                        u(1)
  rbsp_trailing_bits()
}
Here, background_discardable indicates whether the content of the background region of the current frame may be discarded, and non_roi_skip_flag indicates whether the content of the background region of the current frame has been discarded.
The present invention also proposes a front-end device comprising an image capture module, an extraction module and an encoding module. The image capture module is configured to capture an original image. The extraction module is configured to extract an ROI image and a background-region image from the original image captured by the image capture module. The encoding module is configured to encode the ROI image and the background-region image, mark any background region that cannot be referenced as discardable, and then send the data to a video server, so that the video server discards the content of the background regions marked as discardable when the video bitstream needs to be reduced.
As one embodiment of the present invention, the encoding module comprises a flag-adding submodule configured to add a discardable flag to the frame header of a non-reference frame.
As one embodiment of the present invention, which frames cannot be referenced is determined by the coding scheme of the front-end device, or by whether the video server actually received the background-region data successfully; in the latter case the video server feeds the reception result back to the front-end device.
The present invention also proposes a video server comprising a receiving module, a decision module and a sending module. The receiving module is configured to receive the background-region and ROI image data sent by a front-end device. The decision module is configured to determine whether the video bitstream needs to be reduced. The sending module is configured to, when the decision module determines that the video bitstream needs to be reduced, discard the content of the background regions marked as discardable and send the remaining data to a video client; and, when the decision module determines that the video bitstream does not need to be reduced, send all of the background-region and ROI image data received from the front-end device to the video client.
As one embodiment of the present invention, the video server further comprises a flag-adding module configured to add a discarded flag to the frame header of the discardable frame.
The present invention also proposes a video bitstream compression method comprising the following steps: an encoder captures an original image; the encoder extracts an ROI image and a background-region image from the original image; the encoder encodes the ROI image and the background-region image while discarding the content of any background region that cannot be referenced, and sends the encoded background-region and ROI image data to a decoder.
As one embodiment of the present invention, after the encoder encodes the ROI image and the background-region image while discarding the content of any background region that cannot be referenced, the method further comprises: the encoder adds a discarded flag to the frame header of each non-reference frame.
As one embodiment of the present invention, after the encoder sends the encoded background-region and ROI image data to the decoder, the method further comprises: the decoder fills in the content of the background regions marked as discarded.
The present invention also proposes an encoder comprising an image capture module, an extraction module, an encoding module and a sending module. The image capture module is configured to capture an original image. The extraction module is configured to extract an ROI image and a background-region image from the original image captured by the image capture module. The encoding module is configured to encode the ROI image and the background-region image while discarding the content of any background region that cannot be referenced. The sending module is configured to send the encoded background-region and ROI image data to a decoder.
As one embodiment of the present invention, the encoding module comprises a flag-adding submodule configured to add a discarded flag to the frame header of a non-reference frame.
The present invention reduces the video bit rate while ensuring that the receiving end does not lose motion details within the ROI when decoding.
Additional aspects and advantages of the present invention will be set forth in part in the following description, and in part will become apparent from that description or be learned through practice of the invention.
Description of drawings
The above and/or additional aspects and advantages of the present invention will become apparent and readily understood from the following description of the embodiments taken in conjunction with the accompanying drawings, in which:
Fig. 1 is a schematic diagram of P frames in the prior art;
Fig. 2 is a schematic diagram of Embodiment One of the present invention;
Fig. 3 is a flow chart of the video bitstream compression method of Embodiment One of the present invention;
Fig. 4 is a structural diagram of the video bitstream compression system of Embodiment One of the present invention;
Fig. 5 is a flow chart of the video bitstream compression method of Embodiment Two of the present invention;
Fig. 6 is a structural diagram of the encoder of Embodiment Two of the present invention.
Detailed description of the embodiments
Embodiments of the present invention are described in detail below. Examples of the embodiments are shown in the accompanying drawings, in which the same or similar reference numerals denote the same or similar elements, or elements with the same or similar functions, throughout. The embodiments described below with reference to the drawings are exemplary; they serve only to explain the present invention and are not to be construed as limiting it.
The core idea of the present invention is to mark the non-referenced background regions of the background image as discardable. When the video bit rate needs to be reduced, the content of the non-referenced background regions carrying the discardable flag is discarded, so that the bit rate is reduced while the receiving end still does not lose motion details within the ROI when decoding.
Embodiment One
This embodiment describes an application scenario comprising a video server, at least one front-end device that provides video images to the video server, and a video client. Each front-end device captures images and sends them to the video server, which aggregates them and provides them to the video client. In this embodiment, when encoding the background-region image, the front-end device marks each non-referenced background region as discardable, indicating that discarding the content of that background region will not affect the decoding of other frames. When the video server provides images to the video client, it can therefore discard the corresponding background regions according to the discardable flag and indicate to the video client that the content of those background regions has been removed. It should be noted that using a video server to receive the front-end device data is only one implementation of the present invention; any other receiving end that performs the same function as the video server and achieves the same technical effect without departing from the inventive concept falls within the scope of protection of the present invention.
Fig. 2 is a schematic diagram of Embodiment One of the present invention. The whole frame is divided into an ROI (solid box) and a background region (dashed box). The region of interest is encoded normally, that is, the ROI of the current P frame may reference the ROI of the previous frame. When the front-end device encodes, the background regions of frames k, k+2 and k+4 are marked as not discardable, and the background regions of frames k+1 and k+3 are marked as discardable. When the video server provides images to the video client, it discards the content of the background regions of frames k+1 and k+3 and marks those background regions as discarded.
As one embodiment of the present invention, two parameters are added to the frame header of a frame that cannot be referenced, indicating respectively whether the content of the background region is discardable and whether it has been discarded. Call these two parameters "discardable" and "discarded". If "discardable" is 1, the content of the background region may be discarded when necessary without affecting the decoding of subsequent frames. If "discardable" is 0, the content of the background region must not be discarded, otherwise the decoding of subsequent frames will fail. If "discarded" is 1, the background content of the frame has been discarded and need not be decoded. If "discarded" is 0, the background content of the frame is still in the bitstream and must be decoded to obtain the background-region image. In the schematic of Fig. 2, the "discardable" flag of the background regions in frames k, k+2 and k+4 is set to 0 and their "discarded" flag is also set to 0; the "discardable" flag of the background regions in frames k+1 and k+3 is set to 1 and their "discarded" flag is set to 0.
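The flag assignment of Fig. 2 can be written down as a short sketch. The `BackgroundFlags` class and the `mark_alternating` helper are hypothetical names introduced here for illustration; the strict alternation is simply the pattern shown in the figure, not a requirement of the method.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class BackgroundFlags:
    discardable: int  # 1: background may be dropped without breaking later frames
    discarded: int    # 1: background content has already been removed


def mark_alternating(num_frames: int) -> List[BackgroundFlags]:
    """Mark the background of every second frame as discardable (Fig. 2 pattern)."""
    return [BackgroundFlags(discardable=i % 2, discarded=0)
            for i in range(num_frames)]


flags = mark_alternating(5)  # frames k, k+1, k+2, k+3, k+4
print([f.discardable for f in flags])  # [0, 1, 0, 1, 0]
print([f.discarded for f in flags])    # [0, 0, 0, 0, 0]
```

At the encoder every "discarded" flag is still 0; only a later stage that actually removes the background content sets it to 1.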
More specifically, the present invention proposes a specific way of setting the above parameters, namely adding the following definitions to the syntax of the picture parameter set:
pic_parameter_set_rbsp(){                         Descriptor  Explanation
  pic_parameter_set_id                            ue(v)
  seq_parameter_set_id                            ue(v)
  entropy_coding_mode_flag                        u(1)
  pic_init_qp                                     u(6)
  if(roi_flag){
    num_roi                                       ue(v)
    if(num_roi>0){
      background_discardable                      u(1)        Background "discardable"
      if(background_discardable){
        non_roi_skip_flag                         u(1)        Background "discarded"
      }
      pic_init_qp_for_roi                         u(6)
      for(iGroup=0;iGroup<num_roi;iGroup++){
        top_left[iGroup]                          ue(v)
        bottom_right[iGroup]                      ue(v)
      }
    }
  }
  else
    fixed_pic_qp
  loop_filter_disable_flag                        u(1)
  rbsp_trailing_bits()
}
The semantics of the syntax elements background_discardable and non_roi_skip_flag are as follows. background_discardable indicates whether the background region of the current frame may be discarded; if its value is 1, subsequent frames must not reference the background region of the current frame when they are encoded. non_roi_skip_flag indicates whether the background region of the current frame has been discarded; if its value is 1, the bitstream does not contain the background-region part of the current frame.
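To make the bit layout concrete, the following sketch parses only the fields inside the `if(roi_flag){ ... }` branch of the table above. It assumes the usual meaning of the descriptors: u(n) is an n-bit unsigned integer and ue(v) is an unsigned Exp-Golomb code. The `BitReader` class and the `enc_u`/`enc_ue` helpers are hypothetical utilities written for this example, and `top_left`/`bottom_right` are treated as opaque integers because the text does not define their units.

```python
class BitReader:
    """Minimal MSB-first bit reader (illustrative helper, not a codec API)."""

    def __init__(self, data: bytes):
        self.bits = "".join(f"{b:08b}" for b in data)
        self.pos = 0

    def u(self, n: int) -> int:
        """Read n bits as an unsigned integer."""
        value = int(self.bits[self.pos:self.pos + n], 2)
        self.pos += n
        return value

    def ue(self) -> int:
        """Read an unsigned Exp-Golomb code."""
        zeros = 0
        while self.u(1) == 0:
            zeros += 1
        return (1 << zeros) - 1 + (self.u(zeros) if zeros else 0)


def parse_roi_fields(r: BitReader) -> dict:
    """Parse num_roi and the fields nested under it, in table order."""
    out = {"num_roi": r.ue()}
    if out["num_roi"] > 0:
        out["background_discardable"] = r.u(1)
        # The "discarded" flag is present only when the background is discardable.
        out["non_roi_skip_flag"] = r.u(1) if out["background_discardable"] else 0
        out["pic_init_qp_for_roi"] = r.u(6)
        out["rois"] = [(r.ue(), r.ue()) for _ in range(out["num_roi"])]
    return out


def enc_u(value: int, n: int) -> str:
    return f"{value:0{n}b}"


def enc_ue(value: int) -> str:
    code = bin(value + 1)[2:]
    return "0" * (len(code) - 1) + code


# One ROI, background discardable but not yet discarded, QP 26.
bits = (enc_ue(1) + enc_u(1, 1) + enc_u(0, 1) + enc_u(26, 6)
        + enc_ue(5) + enc_ue(40))
bits += "0" * (-len(bits) % 8)  # pad to a whole number of bytes
data = int(bits, 2).to_bytes(len(bits) // 8, "big")
parsed = parse_roi_fields(BitReader(data))
print(parsed["background_discardable"], parsed["non_roi_skip_flag"], parsed["rois"])
```

Because non_roi_skip_flag is nested under background_discardable, a frame whose background is not discardable carries no "discarded" bit at all, which is why the sketch defaults it to 0.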
Fig. 3 is a flow chart of the video bitstream compression method of Embodiment One of the present invention, which comprises the following steps:
Step S301: the front-end device captures an original image.
Step S302: the front-end device extracts an ROI image and a background-region image from the original image.
Step S303: the front-end device encodes the ROI image and the background-region image separately to obtain ROI and background-region image data, marks any background region that cannot be referenced as discardable, and then sends the encoded data to the video server. Specifically, the front-end device adds a discardable flag to the frame header of each non-reference frame. Which background regions cannot be referenced is determined by the coding scheme of the front-end device, or by whether the video server actually received the background-region data successfully, in which case the video server feeds the reception result back to the front-end device. As one embodiment of the present invention, in practice the "discardable" flag of the background region may also be set to 1 in several consecutive frames, so that the video bitstream can later be reduced further when necessary. As one embodiment of the present invention, when the front-end device performs inter-frame motion prediction and compensation on the current frame, it does not reference background-region data marked as discardable in previously reconstructed images; likewise, when the front-end device performs intra-frame prediction on the current frame, it must not reference background-region data marked as discardable in that frame.
Step S304: the video server determines whether the video bitstream needs to be reduced.
Step S305: if the video server determines that the video bitstream needs to be reduced, it discards the content of the background regions marked as discardable and sends the remaining data to the video client. At the same time, the video server adds a discarded flag to the frame header of each discardable background region.
Step S306: if the video server determines that the video bitstream does not need to be reduced, it sends all of the background-region and ROI image data received from the front-end device to the video client.
Step S307: the video client parses the received data. If the frame header indicates that content has been discarded, the content of the background region has been dropped, and the video client fills in the content of the background region marked as discarded. The present invention allows different methods of filling in the background-region data; for example, the background region obtained from the last successful decode can be copied into the current frame.
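Steps S304 to S307 can be sketched end to end as follows. This is a simplified model under stated assumptions: `CodedFrame`, `server_forward` and `client_backgrounds` are hypothetical names, the byte strings stand in for encoded (and, on the client, decoded) picture data, and the fill-in strategy is the copy-from-last-decode example given in step S307.

```python
from dataclasses import dataclass, replace
from typing import List, Optional


@dataclass(frozen=True)
class CodedFrame:
    roi: bytes                   # encoded ROI data, always forwarded
    background: Optional[bytes]  # encoded background data, None once dropped
    discardable: int = 0         # background_discardable
    discarded: int = 0           # non_roi_skip_flag


def server_forward(frames: List[CodedFrame], must_reduce: bool) -> List[CodedFrame]:
    """Steps S304-S306: drop discardable backgrounds only when reduction is needed."""
    if not must_reduce:
        return list(frames)
    return [replace(f, background=None, discarded=1) if f.discardable else f
            for f in frames]


def client_backgrounds(frames: List[CodedFrame]) -> List[Optional[bytes]]:
    """Step S307: reuse the last received background where one was dropped."""
    shown, last = [], None
    for f in frames:
        if not f.discarded:
            last = f.background
        shown.append(last)
    return shown


frames = [
    CodedFrame(b"roi0", b"bg0"),
    CodedFrame(b"roi1", b"bg1", discardable=1),
    CodedFrame(b"roi2", b"bg2"),
]
sent = server_forward(frames, must_reduce=True)
print([f.discarded for f in sent])  # [0, 1, 0]
print(client_backgrounds(sent))     # [b'bg0', b'bg0', b'bg2']
```

Every ROI payload reaches the client regardless of the server's decision, which is what preserves the motion details; only the background of the flagged frame is replaced by an older copy.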
Fig. 4 is a structural diagram of the video bitstream compression system of Embodiment One of the present invention. The system comprises a video server 100, at least one front-end device 200 that provides video images to video server 100, and a video client 300. Front-end device 200 is configured to capture an original image, extract an ROI image and a background-region image from the captured original image, encode the ROI image and the background-region image and send them to video server 100, while marking any background region that cannot be referenced as discardable. Video server 100 is configured to, when the video bitstream needs to be reduced, discard the content of the background regions marked as discardable, send the remaining data to video client 300, and mark the discardable background regions as discarded. Video client 300 is configured to receive the background-region and ROI image data sent by video server 100 and, when a background region is marked as discarded, to fill in the content of that background region.
Front-end device 200 comprises an image capture module 210, an extraction module 220 and an encoding module 230. Image capture module 210 is configured to capture an original image. Extraction module 220 is configured to extract an ROI image and a background-region image from the original image captured by image capture module 210. Encoding module 230 is configured to encode the ROI image and the background-region image, mark any background region that cannot be referenced as discardable, and then send the data to video server 100, so that video server 100 discards the content of the background regions marked as discardable when the video bitstream needs to be reduced.
As one embodiment of the present invention, encoding module 230 comprises a flag-adding submodule 231 configured to add a discardable flag to the frame header of a non-reference frame.
Video server 100 comprises a receiving module 110, a decision module 120 and a sending module 130. Receiving module 110 is configured to receive the background-region and ROI image data sent by front-end device 200. Decision module 120 is configured to determine whether the video bitstream needs to be reduced. Sending module 130 is configured to, when decision module 120 determines that the video bitstream needs to be reduced, discard the content of the background regions marked as discardable and send the remaining data to video client 300; and, when decision module 120 determines that the video bitstream does not need to be reduced, send all of the background-region and ROI image data received from front-end device 200 to video client 300.
As one embodiment of the present invention, video server 100 further comprises a flag-adding module 140 configured to add a discarded flag to the frame header of each discardable background region.
Embodiment Two
In this embodiment, the encoder directly discards the content of background regions that cannot be referenced before sending the data to the decoder.
Fig. 5 shows the video bitstream compression method of Embodiment Two of the present invention, which comprises the following steps:
Step S501: the encoder captures an original image.
Step S502: the encoder extracts an ROI image and a background-region image from the original image.
Step S503: the encoder encodes the ROI image and the background-region image while discarding the content of any background region that cannot be referenced, and sends the encoded background-region and ROI image data to the decoder. At the same time, the encoder adds a discarded flag to the frame header of each non-reference frame.
Step S504: the decoder parses the received image data. If the frame header indicates that content has been discarded, the content of the background region has been dropped, and the decoder fills in the content of the background region marked as discarded.
Fig. 6 is a structural diagram of the encoder of Embodiment Two of the present invention. The encoder comprises an image capture module 610, an extraction module 620, an encoding module 630 and a sending module 640. Image capture module 610 is configured to capture an original image. Extraction module 620 is configured to extract an ROI image and a background-region image from the original image captured by image capture module 610. Encoding module 630 is configured to encode the ROI image and the background-region image while directly discarding the content of any background region that cannot be referenced. Sending module 640 is configured to send the encoded background-region and ROI image data to the decoder.
As one embodiment of the present invention, encoding module 630 comprises a flag-adding submodule 631 configured to add a discarded flag to the frame header of each background region that cannot be referenced.
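The difference from Embodiment One is that no intermediate server is involved: the encoder itself omits the background and sets the discarded flag. A minimal sketch, assuming the payloads are already-encoded byte strings and that the hypothetical `referenced` list tells the encoder which backgrounds later frames depend on:

```python
from typing import List, Optional, Tuple

# (roi_payload, background_payload, non_roi_skip_flag)
EncodedFrame = Tuple[bytes, Optional[bytes], int]


def encode_with_direct_drop(rois: List[bytes], backgrounds: List[bytes],
                            referenced: List[bool]) -> List[EncodedFrame]:
    """Emit every ROI; emit a background only if later frames reference it."""
    out: List[EncodedFrame] = []
    for roi, bg, is_ref in zip(rois, backgrounds, referenced):
        if is_ref:
            out.append((roi, bg, 0))
        else:
            out.append((roi, None, 1))  # dropped at the encoder, flagged as discarded
    return out


encoded = encode_with_direct_drop(
    [b"roi0", b"roi1"], [b"bg0", b"bg1"], [True, False])
print(encoded)  # [(b'roi0', b'bg0', 0), (b'roi1', None, 1)]
```

The decoder side is unchanged: it sees the discarded flag in the frame header and fills in the missing background.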
The present invention can also make receiving terminal not lose action details in the ROI zone when decoding when guaranteeing the downscaled video code check.
Although illustrated and described embodiments of the invention, for the ordinary skill in the art, be appreciated that without departing from the principles and spirit of the present invention and can carry out multiple variation, modification, replacement and modification that scope of the present invention is by claims and be equal to and limit to these embodiment.

Claims (23)

1. A video bitstream compression method, characterized by comprising the following steps:
a front-end device captures an original image;
the front-end device extracts a region-of-interest (ROI) area image and a background area image from the original image;
the front-end device encodes the ROI area image and the background area image and sends them to a video server, while marking the background area that cannot be referenced as discardable, and the video server discards the content of the background area marked as discardable when the video bitstream needs to be reduced.
2. The video bitstream compression method as claimed in claim 1, characterized in that the front-end device encoding the ROI area image and the background area image and sending them to the video server, while marking the background area that cannot be referenced as discardable, specifically comprises:
the front-end device encodes the ROI area image and the background area image and sends them to the video server, while adding a discardable flag to the frame header of the non-referenceable frame.
3. The video bitstream compression method as claimed in claim 1 or 2, characterized by further comprising:
when the front-end device performs inter-frame motion prediction and compensation on the current frame image, it does not reference the content of any background area marked as discardable in previously reconstructed images.
4. The video bitstream compression method as claimed in claim 1 or 2, characterized by further comprising:
when the front-end device performs intra-frame prediction on the areas of the current frame image other than the background area marked as discardable, it does not reference the content of the background area marked as discardable in this frame image.
5. The video bitstream compression method as claimed in claim 1, characterized in that the video server discarding the content of the background area marked as discardable when the bitstream needs to be reduced specifically comprises:
the video server judges whether the video bitstream needs to be reduced;
if it judges that the video bitstream needs to be reduced, the video server discards the content of the background area marked as discardable and then sends the remaining data to a video client;
if it judges that the video bitstream does not need to be reduced, the video server sends all of the background area and ROI area image data received from the front-end device to the video client.
6. The video bitstream compression method as claimed in claim 5, characterized in that, after the video server discards the background area marked as discardable, the method further comprises:
the video server adds a discarded flag to the frame header of the discardable frame.
7. The video bitstream compression method as claimed in claim 6, characterized in that, after the video server discards the content of the background area marked as discardable and sends the remaining data to the video client, the method further comprises:
the video client fills in the content of the background area marked as discarded.
8. The video bitstream compression method as claimed in claim 1, characterized by further comprising:
adding two parameters to the frame header of the frame, which respectively indicate whether the content of the background area is discardable and whether the content of the background area has been discarded.
9. The video bitstream compression method as claimed in claim 8, characterized in that adding two parameters to the frame header of the non-referenceable frame, which respectively indicate whether the content of the background area is discardable and whether the content of the background area has been discarded, specifically comprises:
adding the following definitions to the syntax definition of the picture parameter set:
pic_parameter_set_rbsp() {                                  Descriptor
    pic_parameter_set_id                                    ue(v)
    seq_parameter_set_id                                    ue(v)
    entropy_coding_mode_flag                                u(1)
    pic_init_qp                                             u(6)
    if (roi_flag) {
        num_roi                                             ue(v)
        if (num_roi > 0) {
            background_discardable                          u(1)
            if (background_discardable) {
                non_roi_skip_flag                           u(1)
            }
            pic_init_qp_for_roi                             u(6)
            for (iGroup = 0; iGroup < num_roi; iGroup++) {
                top_left[iGroup]                            ue(v)
                bottom_right[iGroup]                        ue(v)
            }
        }
    } else
        fixed_pic_qp
    loop_filter_disable_flag                                u(1)
    rbsp_trailing_bits()
}
wherein background_discardable indicates whether the content of the background area of the current frame may be discarded, and non_roi_skip_flag indicates whether the content of the background area of the current frame has been discarded.
10. The video bitstream compression method as claimed in claim 1, characterized in that the non-referenceable frame is determined by the encoding scheme of the front-end device,
or is determined according to whether the video server has actually received the content of the background area successfully, the reception result being fed back to the front-end device by the video server.
11. A video bitstream compression system, characterized by comprising a video server, at least one front-end device that provides video images to the video server, and a video client, wherein:
the front-end device is used to capture an original image, to extract a region-of-interest (ROI) area image and a background area image from the original image, and to encode the ROI area image and the background area image and send them to the video server, while marking the background area that cannot be referenced as discardable;
the video server is used, when the video bitstream needs to be reduced, to discard the content of the background area marked as discardable, send the remaining data to the video client, and mark the discardable background area as discarded;
the video client is used to receive the background area and ROI area image data sent by the video server, and to fill in the content of the background area when the background area is marked as discarded.
12. The video bitstream compression system as claimed in claim 11, characterized in that two parameters are added to the frame header of the non-referenceable frame, which respectively indicate whether the content of the background area is discardable and whether the content of the background area has been discarded.
13. The video bitstream compression system as claimed in claim 12, characterized in that adding the two parameters to the frame header of the non-referenceable frame is implemented by the following syntax definition in the picture parameter set:
pic_parameter_set_rbsp() {                                  Descriptor
    pic_parameter_set_id                                    ue(v)
    seq_parameter_set_id                                    ue(v)
    entropy_coding_mode_flag                                u(1)
    pic_init_qp                                             u(6)
    if (roi_flag) {
        num_roi                                             ue(v)
        if (num_roi > 0) {
            background_discardable                          u(1)
            if (background_discardable) {
                non_roi_skip_flag                           u(1)
            }
            pic_init_qp_for_roi                             u(6)
            for (iGroup = 0; iGroup < num_roi; iGroup++) {
                top_left[iGroup]                            ue(v)
                bottom_right[iGroup]                        ue(v)
            }
        }
    } else
        fixed_pic_qp
    loop_filter_disable_flag                                u(1)
    rbsp_trailing_bits()
}
wherein background_discardable indicates whether the content of the background area of the current frame may be discarded, and non_roi_skip_flag indicates whether the content of the background area of the current frame has been discarded.
14. A front-end device, characterized by comprising an image capture module, an extraction module and a coding module, wherein:
the image capture module is used to capture an original image;
the extraction module is used to extract an ROI area image and a background area image from the original image captured by the image capture module;
the coding module is used to encode the ROI area image and the background area image, to mark the background area that cannot be referenced as discardable, and then to send the data to a video server, the video server discarding the content of the background area marked as discardable when the video bitstream needs to be reduced.
15. The front-end device as claimed in claim 14, characterized in that the coding module comprises a flag adding submodule, which is used to add a discardable flag to the frame header of the non-referenceable frame.
16. The front-end device as claimed in claim 14, characterized in that the non-referenceable frame is determined by the encoding scheme of the front-end device,
or is determined according to whether the video server has actually received the content of the background area successfully, the reception result being fed back to the front-end device by the video server.
17. A video server, characterized by comprising a receiving module, a judging module and a sending module, wherein:
the receiving module is used to receive the background area and ROI area image data sent by a front-end device;
the judging module is used to judge whether the video bitstream needs to be reduced;
the sending module is used, when the judging module judges that the video bitstream needs to be reduced, to discard the content of the background area marked as discardable and then send the remaining data to a video client; and, when the judging module judges that the video bitstream does not need to be reduced, to send all of the background area and ROI area image data received from the front-end device to the video client.
18. The video server as claimed in claim 17, characterized by further comprising a flag adding module, which is used to add a discarded flag to the frame header of the discardable frame.
19. A video bitstream compression method, characterized by comprising the following steps:
an encoder captures an original image;
the encoder extracts an ROI area image and a background area image from the original image;
the encoder encodes the ROI area image and the background area image, discards the content of the background area that cannot be referenced, and sends the encoded background area and ROI area image data to a decoder.
20. The video bitstream compression method as claimed in claim 19, characterized in that, after the encoder encodes the ROI area image and the background area image and discards the content of the background area that cannot be referenced, the method further comprises:
the encoder adds a discarded flag to the frame header of the non-referenceable frame.
21. The video bitstream compression method as claimed in claim 19, characterized in that, after the encoder sends the encoded background area and ROI area image data to the decoder, the method further comprises:
the decoder fills in the content of the background area marked as discarded.
22. An encoder, characterized by comprising an image capture module, an extraction module, a coding module and a sending module, wherein:
the image capture module is used to capture an original image;
the extraction module is used to extract an ROI area image and a background area image from the original image captured by the image capture module;
the coding module is used to encode the ROI area image and the background area image, and to discard the content of the background area that cannot be referenced;
the sending module is used to send the encoded background area and ROI area image data to a decoder.
23. The encoder as claimed in claim 22, characterized in that the coding module comprises a flag adding submodule, which is used to add a discarded flag to the frame header of the non-referenceable frame.
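As an illustration of the picture parameter set syntax recited in claims 9 and 13, the following sketch reads the fields in the listed order. It rests on assumptions that should be checked against the full specification: the descriptors are interpreted as in H.264 (u(n) is an n-bit unsigned integer, ue(v) is an unsigned Exp-Golomb code), `roi_flag` is supplied by the caller because the table does not show where it is signalled, and `rbsp_trailing_bits()` is not parsed. The `else` branch is left unimplemented because the table gives no descriptor for `fixed_pic_qp`. `BitReader` and `parse_pps` are hypothetical names.

```python
class BitReader:
    """Minimal MSB-first bit reader (hypothetical helper)."""

    def __init__(self, data: bytes):
        self.data = data
        self.pos = 0  # current bit position

    def u(self, n: int) -> int:
        """Read n bits as an unsigned integer."""
        value = 0
        for _ in range(n):
            byte = self.data[self.pos >> 3]
            value = (value << 1) | ((byte >> (7 - (self.pos & 7))) & 1)
            self.pos += 1
        return value

    def ue(self) -> int:
        """Read an unsigned Exp-Golomb code: k zero bits, a one bit, then k info bits."""
        zeros = 0
        while self.u(1) == 0:
            zeros += 1
        return (1 << zeros) - 1 + self.u(zeros)


def parse_pps(r: BitReader, roi_flag: bool) -> dict:
    pps = {
        "pic_parameter_set_id": r.ue(),
        "seq_parameter_set_id": r.ue(),
        "entropy_coding_mode_flag": r.u(1),
        "pic_init_qp": r.u(6),
        "background_discardable": 0,
        "non_roi_skip_flag": 0,
        "rois": [],
    }
    if not roi_flag:
        # The source table lists fixed_pic_qp without a descriptor.
        raise NotImplementedError("fixed_pic_qp descriptor is not given")
    pps["num_roi"] = r.ue()
    if pps["num_roi"] > 0:
        pps["background_discardable"] = r.u(1)
        if pps["background_discardable"]:
            pps["non_roi_skip_flag"] = r.u(1)
        pps["pic_init_qp_for_roi"] = r.u(6)
        for _ in range(pps["num_roi"]):
            top_left = r.ue()
            bottom_right = r.ue()
            pps["rois"].append((top_left, bottom_right))
    pps["loop_filter_disable_flag"] = r.u(1)
    return pps
```

Note that `non_roi_skip_flag` is only present when `background_discardable` is set, so a background can be signalled as discarded only if it was first marked as discardable.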
CN 200810226180 2008-11-14 2008-11-14 Method, system and device for compressing video code stream Active CN101742289B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN 200810226180 CN101742289B (en) 2008-11-14 2008-11-14 Method, system and device for compressing video code stream

Publications (2)

Publication Number Publication Date
CN101742289A true CN101742289A (en) 2010-06-16
CN101742289B CN101742289B (en) 2013-10-16

Family

ID=42465069

Family Applications (1)

Application Number Title Priority Date Filing Date
CN 200810226180 Active CN101742289B (en) 2008-11-14 2008-11-14 Method, system and device for compressing video code stream

Country Status (1)

Country Link
CN (1) CN101742289B (en)

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6798834B1 (en) * 1996-08-15 2004-09-28 Mitsubishi Denki Kabushiki Kaisha Image coding apparatus with segment classification and segmentation-type motion prediction circuit
US20060244812A1 (en) * 2005-04-28 2006-11-02 Hyeonkuk Jeong Video processing in a multi-participant video conference
CN101202903A (en) * 2006-12-11 2008-06-18 谢剑斌 Monitoring video coding and decoding method
CN101072344A (en) * 2007-03-30 2007-11-14 腾讯科技(深圳)有限公司 Video coding method, decoding method and device

Cited By (15)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103442213B (en) * 2013-08-21 2016-07-06 海能达通信股份有限公司 Photographic technique method and device
CN103442213A (en) * 2013-08-21 2013-12-11 海能达通信股份有限公司 Method and device for monitoring video in real time
CN106954082A (en) * 2016-01-07 2017-07-14 中兴通讯股份有限公司 Video coding-decoding method, coding and decoding device and system
CN107026827B (en) * 2016-02-02 2020-04-03 上海交通大学 Optimized transmission method for static image in video stream
CN107026827A (en) * 2016-02-02 2017-08-08 上海交通大学 It is a kind of to be used for the optimization transmission mechanism of rest image in video flowing
CN109982085A (en) * 2017-12-28 2019-07-05 新岸线(北京)科技集团有限公司 A kind of method of high precision image mixing compression
CN109120936A (en) * 2018-09-27 2019-01-01 贺禄元 A kind of coding/decoding method and device of video image
CN109120935A (en) * 2018-09-27 2019-01-01 贺禄元 A kind of coding method of video image and device
CN109168032A (en) * 2018-11-12 2019-01-08 广州酷狗计算机科技有限公司 Processing method, terminal, server and the storage medium of video data
CN113556551A (en) * 2020-04-23 2021-10-26 上海高德威智能交通***有限公司 Encoding and decoding methods, devices and equipment
CN113810739A (en) * 2020-06-17 2021-12-17 国基电子(上海)有限公司 Image transmission method, terminal and computer readable storage medium
US11812036B2 (en) 2020-06-17 2023-11-07 Ambit Microsystems (Shanghai) Ltd. Method for image transmitting, transmitting device and receiving device
CN113810739B (en) * 2020-06-17 2024-02-09 富联国基(上海)电子有限公司 Image transmission method, terminal and computer readable storage medium
CN112203090A (en) * 2020-11-30 2021-01-08 浙江智慧视频安防创新中心有限公司 Video encoding and decoding method and device, electronic equipment and medium
WO2024067405A1 (en) * 2022-09-30 2024-04-04 中国电信股份有限公司 Video transmission method and apparatus, and electronic device, storage medium and program product

Also Published As

Publication number Publication date
CN101742289B (en) 2013-10-16

Similar Documents

Publication Publication Date Title
CN101742289B (en) Method, system and device for compressing video code stream
JP4808161B2 (en) Method and apparatus for moving image communication error processing
CN1242623C (en) Video coding
US7957465B2 (en) Moving picture data code conversion/transmission method and device, code conversion/reception method and device
CA2737728C (en) Low latency video encoder
US7958532B2 (en) Method of transmitting layered video-coded information
US6226328B1 (en) Transcoding apparatus for digital video networking
US20110090957A1 (en) Video codec method, video encoding device and video decoding device using the same
CN110392284B (en) Video encoding method, video data processing method, video encoding apparatus, video data processing apparatus, computer device, and storage medium
US6215824B1 (en) Transcoding method for digital video networking
RU2009116472A (en) DYNAMIC MODIFICATION OF VIDEO PROPERTIES
CN111263192A (en) Video processing method and related equipment
JP2011029868A (en) Terminal device, remote conference system, method for controlling terminal device, control program of terminal device, and computer readable recording medium recording control program of terminal device
KR20110058841A (en) Moving-picture image data-distribution method
KR19980081099A (en) Image transfer device and image transfer method
JPH0984019A (en) Image data storage device
US9025672B2 (en) On-demand intra-refresh for end-to end coded video transmission systems
CN101841704B (en) Method and device for compressing and decompressing video bit stream
CN114827669B (en) Video data transmission method, device, medium and equipment
CN106998328B (en) Video transmission method and device
TW202203652A (en) Detection of still picture profile
KR100899666B1 (en) Dispersed multistreaming transmission apparatus
US11490121B2 (en) Transform device, decoding device, transforming method, and decoding method
EP4210332A1 (en) Method and system for live video streaming with integrated encoding and transmission semantics
KR100690710B1 (en) Method for transmitting moving picture

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C14 Grant of patent or utility model
GR01 Patent grant