CN106791864B - Realization method for improving video transcoding rate based on HEVC standard - Google Patents

Realization method for improving video transcoding rate based on HEVC standard Download PDF

Info

Publication number
CN106791864B
CN106791864B CN201611119061.5A CN201611119061A CN106791864B CN 106791864 B CN106791864 B CN 106791864B CN 201611119061 A CN201611119061 A CN 201611119061A CN 106791864 B CN106791864 B CN 106791864B
Authority
CN
China
Prior art keywords
coding unit
coding
video
video stream
training result
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Fee Related
Application number
CN201611119061.5A
Other languages
Chinese (zh)
Other versions
CN106791864A (en
Inventor
伏长虹
陈则希
王允
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Nanjing Tech University
Original Assignee
Nanjing Tech University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Nanjing Tech University filed Critical Nanjing Tech University
Priority to CN201611119061.5A priority Critical patent/CN106791864B/en
Publication of CN106791864A publication Critical patent/CN106791864A/en
Application granted granted Critical
Publication of CN106791864B publication Critical patent/CN106791864B/en
Expired - Fee Related legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/40Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using video transcoding, i.e. partial or full decoding of a coded input stream followed by re-encoding of the decoded output stream
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/70Methods or arrangements for coding, decoding, compressing or decompressing digital video signals characterised by syntax aspects related to video coding, e.g. related to compression standards

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Compression Or Coding Systems Of Tv Signals (AREA)

Abstract

The invention provides a realization method for improving video transcoding rate based on HEVC standard, which comprises the following steps: step 1, according to QP, the original video stream with high bit rate1Extracting the related coding unit division information of the corresponding frame of the original video stream in the process of decoding into a video sequence; step 2, after the decoded video sequence is processed according to QP2Extracting stored related coding unit division information after coding of a previous frame of a time domain in the process of coding the video stream into a low bit rate; step 3, inputting the division information obtained in the step 1 and the step 2 as features into a corresponding naive Bayes classifier which is trained to obtain a training result; step 4, if the training result is 1, continuously dividing the current frame coding unit; and if the training result is 0, stopping dividing the current frame coding unit.

Description

Realization method for improving video transcoding rate based on HEVC standard
Technical Field
The invention relates to a video transcoding technology, in particular to a realization method for improving video transcoding rate based on HEVC standard.
Background
High Efficiency Video Coding (HEVC) is a new generation of Video Coding standard jointly established by ISO/IEC Moving Picture Experts Group (MPEG) and ITU-T Video Coding Experts Group (VCEG). As a successor of h.264, HEVC overcomes the defect of the h.264 macroblock mechanism in processing high definition video, is more suitable for processing high definition video, is a mainstream coding standard for a long time in the future, and will have a wide application prospect in video telephony, video conferencing, and network streaming media video on demand. With the progress of science and technology, a network video playing platform is not limited to a traditional personal computer, mobile terminals such as mobile phones and iPads also occupy a great proportion, and meanwhile, the network video playing platform is subject to network bandwidth limitation factors under different scenes, so that the urgent need of the market for video transcoding is caused at present.
Generally, video transcoding mainly includes: bit rate transcoding, resolution transcoding, video coding format conversion, etc. For bit rate transcoding, the earliest foreign research on transcoding starts with reducing the bit rate of an original compressed code stream, and aims to reduce the code rate under the conditions of low complexity and high quality output on the basis of not changing the resolution. The most direct method is a pixel domain cascade structure (CPDT), which is very simple and has no error propagation, but the computation is very complex, it is difficult to transcode in real time, and information is lost during motion re-estimation, and the video quality is also reduced. The transcoding structure of the current bit rate transcoding is mainly divided into two categories: open-loop and closed-loop configurations. The open-loop transcoding does not need motion estimation and DCT/IDCT conversion, has high efficiency, but has error diffusion, and the effective solution is to adopt an intra-frame block refreshing structure and increase the number of coding macro blocks. The closed loop structure increases algorithm complexity and storage, and can obtain ideal image quality. According to the texture data processing method, the method can be divided into: pixel domain cascade transcoding and DCT domain transcoding. The pixel domain cascade structure is a combination of partial decoding and partial encoding, and is relatively complex, while the pixel domain fast cascade transcoding structure is a prediction residual error obtained by performing motion compensation on the prediction residual error of an original code stream, and the DCT/IDCT operation can be reduced by half even. In the transcoding structure (DDT) of the DCT domain, when the motion vector is a multiple of 0 or 8, no operation is required for motion compensation in the DCT domain. For low-speed moving or still video series, DDT is less computationally intensive, but the application of this structure is limited due to some other factor. The invention is based on the latest video compression standard HEVC, and mainly aims at carrying out speed optimization on the closed-loop structure in the resolution-reducing transcoding. In the process of decoding an input video stream with a high bit rate and then re-encoding a reconstructed sequence into a video stream with a lower bit rate by using a larger Quantization Parameter (QP), the dividing mode of a Coding Unit (CU) during re-encoding is judged in advance by using information of the dividing mode of the CU in the original input video stream and a naive Bayesian classifier which is subjected to a large amount of training, the calculation amount of RDcost is reduced, and the transcoding time is greatly reduced while the video quality is ensured.
Disclosure of Invention
The method mainly utilizes machine learning in the HEVC standard, and reduces the judgment of CU partition modes during recoding through original code stream information and previous frame information, thereby reducing the calculated amount, and effectively improving the speed and efficiency of video transcoding.
A realization method for improving video transcoding rate based on HEVC standard comprises the following steps:
step 1, according to QP, the original video stream with high bit rate1Extracting the related coding unit division information of the corresponding frame of the original video stream in the process of decoding into a video sequence;
step 2, after the decoded video sequence is processed according to QP2Extracting stored related coding unit division information after coding of a previous frame of a time domain in the process of coding the video stream into a low bit rate;
step 3, inputting the division information obtained in the step 1 and the step 2 as features into a corresponding naive Bayes classifier which is trained to obtain a training result;
step 4, if the training result is 1, continuously dividing the current frame coding unit; if the training result is 0, stopping dividing the current frame coding unit;
the relevant coding units are coding units with the same position and size in the corresponding frames;
wherein, QP1<QP2
Compared with the prior art, the invention has the following advantages:
(1) compared with other methods which only judge the division of the CU at the encoding end, the method disclosed by the invention is quicker and more effective by utilizing the prior information of the original code stream;
(2) the invention not only utilizes the original code stream information, but also combines the CU partition information of the adjacent frames in the time domain, so that the training result is more scientific and comprehensive;
(3) the invention uses a naive Bayes classification model, the model is visual and simple, the operation is easy, the effect is very obvious, and a large number of training samples can be trained, thereby being more accurate than the traditional statistical method.
The invention is further described below with reference to the accompanying drawings.
Drawings
Fig. 1 is an HEVC full de-full codec transcoding flow diagram.
Fig. 2 is a transcoding flow diagram of the present invention.
Fig. 3 shows CU partitioning corresponding depth.
FIG. 4 classifier T0、T1、T2Schematic diagrams are used.
Fig. 5 is a flow chart of a method of the present invention.
Detailed Description
With reference to fig. 5, a method for improving video transcoding rate based on the HEVC standard includes the following steps:
step 1, according to QP, the original video stream with high bit rate1Extracting the related coding unit division information of the corresponding frame of the original video stream in the process of decoding into a video sequence;
step 2, after the decoded video sequence is processed according to QP2Extracting stored related coding unit division information after coding of a previous frame of a time domain in the process of coding the video stream into a low bit rate;
step 3, inputting the division information obtained in the step 1 and the step 2 as features into a corresponding naive Bayes classifier which is trained to obtain a training result;
step 4, if the training result is 1, continuously dividing the current frame coding unit; if the training result is 0, stopping dividing the current frame coding unit;
the relevant coding units are coding units with the same position and size in the corresponding frames;
wherein, QP1<QP2
Extracting the partition information of a frame-related Coding Unit (CU) corresponding to an original code stream in the step 1, extracting the stored partition information of the CU after the previous frame is coded in the step 2, inputting the two partition information as features into a trained corresponding naive Bayesian classifier to obtain a class A and a class B, and further judging a CU partition mode according to the classification, thereby reducing re-coding calculation amount and improving transcoding speed.
Specifically, let Δ QP be QP2-QP1The quantization parameter of the video original code stream is QP1The quantization parameter after transcoding is QP2And QP1<QP2Then the quantization parameter variance is Δ QP. For this fixed Δ QP, classifier training is performed.
The training video samples are required to cover various types as much as possible, including low-speed motion, medium-speed motion, high resolution, low resolution and the like. For example, the alternatives ParkScene, basetballdrill, BQMall, BQSquare, FourPeople; each video segment selects 50 frames, and only P-frames and B-frames.
Run the training video sample by QP1Transcoding the high bit rate video stream to QP2The low bit rate video stream of (2) is transcoded by full codec, as shown in fig. 1.
The extracted naive Bayes classifier training sample video is characterized in that:
delta Depth _ corresponding CU (hereinafter referred to as Delta Dc)
Delta Depth _ timeproviousCU (hereinafter referred to as Delta Dt)
Wherein, corresponding CU represents the CU with the strongest correlation with the CU to be processed in the input original video stream, timeproviouscu represents the CU with the strongest correlation with the CU to be processed in the previous frame in the time domain, wherein the CU with the strongest correlation represents the CU with the same position and size in the corresponding LCU in the corresponding frame (the same coding sequence number of the frame), and the position can be determined by the uiabsprotidx in the program; Δ Depth represents the difference between the Depth corresponding to the CU size and the deepest Depth of the partition in the CU. Note: CU depth of 64 × 64 is 0, CU depth of 32 × 32 is 1, CU depth of 16 × 16 is 2, and CU depth of 8 × 8 is 3 (see fig. 3 in particular).
Classifier training result classification:
a type: the split-flag value of the CU to be processed at the recoding end is 1, and the CU continues to be divided;
b type: and the split-flag value of the CU to be processed at the encoding end is 0, and the CU stops dividing.
The classifier model is a naive Bayes classificationMachine, three in total: t is0,T1,T2. For three sizes of CUs 64 × 64, 32 × 32, 16 × 16, respectively.
Training is carried out for different trainers: training classifier T0Extracting features and categories corresponding to all CU with the size of 64 x 64 of the encoding end in the training video; in the same way, training classifier T1Extracting features and categories corresponding to all CU with the size of 32 × 32 at the encoding end in the training video; training classifier T2Then, the features and categories corresponding to all the CUs with the size of 16 × 16 at the encoding end in the training video are extracted for training.
After three classifiers are trained, transcoding is carried out according to the flow shown in fig. 2, if the frame is an I frame during recoding, no processing is carried out, if the frame is a B frame or a P frame, information (delta Dc) in an original code stream and information (delta Dt) of a previous frame in a time domain are extracted according to different CU sizes, and the information (delta Dc) and the information (delta Dt) pass through corresponding classifiers (T) respectively0,T1,T2) And classifying to obtain a predicted split-flag value (pSF) of the corresponding CU, and directly judging the CU division condition without performing a large amount of RDcost calculation, wherein a specific judgment flow is shown in FIG. 4.

Claims (2)

1. A realization method for improving video transcoding rate based on HEVC standard is characterized by comprising the following steps:
step 1, according to QP, the original video stream with high bit rate1Extracting the related coding unit division information of the corresponding frame of the original video stream in the process of decoding into a video sequence;
step 2, after the decoded video sequence is processed according to QP2Extracting stored related coding unit division information after coding of a previous frame of a time domain in the process of coding the video stream into a low bit rate;
step 3, inputting the division information obtained in the step 1 and the step 2 as features into a corresponding naive Bayes classifier which is trained to obtain a training result;
step 4, if the training result is 1, continuously dividing the current frame coding unit; if the training result is 0, stopping dividing the current frame coding unit;
the relevant coding units are coding units with the same position and size in the corresponding frames;
wherein, QP1<QP2
2. The method according to claim 1, wherein the naive Bayes classifier in step 3 is plural, and corresponds to coding units of different sizes respectively.
CN201611119061.5A 2016-12-08 2016-12-08 Realization method for improving video transcoding rate based on HEVC standard Expired - Fee Related CN106791864B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201611119061.5A CN106791864B (en) 2016-12-08 2016-12-08 Realization method for improving video transcoding rate based on HEVC standard

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201611119061.5A CN106791864B (en) 2016-12-08 2016-12-08 Realization method for improving video transcoding rate based on HEVC standard

Publications (2)

Publication Number Publication Date
CN106791864A CN106791864A (en) 2017-05-31
CN106791864B true CN106791864B (en) 2019-12-27

Family

ID=58881250

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201611119061.5A Expired - Fee Related CN106791864B (en) 2016-12-08 2016-12-08 Realization method for improving video transcoding rate based on HEVC standard

Country Status (1)

Country Link
CN (1) CN106791864B (en)

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107734331A (en) * 2017-11-17 2018-02-23 南京理工大学 A kind of video transcoding method based on HEVC standard
CN108600759B (en) * 2018-04-16 2021-11-12 北京工业大学 3D-HEVC (high efficiency video coding) rapid transcoding method based on unbalanced quadtree

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101583036A (en) * 2009-06-22 2009-11-18 浙江大学 Method for determining the relation between movement characteristics and high efficient coding mode in pixel-domain video transcoding
CN101924943A (en) * 2010-08-27 2010-12-22 郭敏 Real-time low-bit rate video transcoding method based on H.264
CN102025999A (en) * 2010-12-31 2011-04-20 北京工业大学 Video transcoding fast intra-frame predicating method based on support vector machine
CN105306947A (en) * 2015-10-27 2016-02-03 中国科学院深圳先进技术研究院 Video transcoding method based on machine learning

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101583036A (en) * 2009-06-22 2009-11-18 浙江大学 Method for determining the relation between movement characteristics and high efficient coding mode in pixel-domain video transcoding
CN101924943A (en) * 2010-08-27 2010-12-22 郭敏 Real-time low-bit rate video transcoding method based on H.264
CN102025999A (en) * 2010-12-31 2011-04-20 北京工业大学 Video transcoding fast intra-frame predicating method based on support vector machine
CN105306947A (en) * 2015-10-27 2016-02-03 中国科学院深圳先进技术研究院 Video transcoding method based on machine learning

Non-Patent Citations (4)

* Cited by examiner, † Cited by third party
Title
Fast coding unit size selection for HEVC based on Bayesian decision rule;Shen X;《Picture Coding Symposium》;20121231;全文 *
Fast transrating for high efficiency video coding based on machine learning;L.P.Van;《Proc. IEEE Int. Conf. Image Process》;20131231;全文 *
Low complexity H.264 encoder using Machine learning;Han D;《Dissertations&Theses-Gradworks》;20101231;全文 *
结合模式对应与机器学习的HEVC降分辨率转码;李雪晴;《电视技术》;20160131;全文 *

Also Published As

Publication number Publication date
CN106791864A (en) 2017-05-31

Similar Documents

Publication Publication Date Title
US11689726B2 (en) Hybrid motion-compensated neural network with side-information based video coding
US9071841B2 (en) Video transcoding with dynamically modifiable spatial resolution
US10666962B2 (en) Training end-to-end video processes
CN102150427B (en) System and method for video encoding using adaptive loop filter
CN110087087A (en) VVC interframe encode unit prediction mode shifts to an earlier date decision and block divides and shifts to an earlier date terminating method
CN108989802B (en) HEVC video stream quality estimation method and system by utilizing inter-frame relation
US9591313B2 (en) Video encoder with transform size preprocessing and methods for use therewith
Shen et al. Ultra fast H. 264/AVC to HEVC transcoder
CN103647972A (en) Moving picture decoding method and moving picture encoding method
AU2012211249A1 (en) Encoding of video stream based on scene type
WO2010132977A1 (en) Method and system for efficient video transcoding
CN105359511A (en) Systems and methods of encoding multiple video streams with adaptive quantization for adaptive bitrate streaming
CN107404648B (en) A kind of multi-channel video code-transferring method based on HEVC
Pessoa et al. End-to-end learning of video compression using spatio-temporal autoencoders
CN104581170A (en) Rapid inter-frame transcoding method for reducing video resolution based on HEVC
CN104811729B (en) A kind of video multi-reference frame coding method
CN112188196A (en) Method for rapid intra-frame prediction of general video coding based on texture
CN106791864B (en) Realization method for improving video transcoding rate based on HEVC standard
WO2020227911A1 (en) Method for accelerating coding/decoding of hevc video sequence
Wang et al. Machine learning-based rate distortion modeling for VVC/H. 266 intra-frame
CN110351552B (en) Fast coding method in video coding
CN114913250A (en) Video coding and decoding system and method based on semantic segmentation and Hungarian algorithm
CA2817790A1 (en) Method for h264 transcoding with code stream information reuse
CN106878754B (en) A kind of 3D video depth image method for choosing frame inner forecast mode
Jeong et al. A fast intra mode decision based on accuracy of rate distortion model for AV1 intra encoding

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant
CF01 Termination of patent right due to non-payment of annual fee

Granted publication date: 20191227

Termination date: 20211208

CF01 Termination of patent right due to non-payment of annual fee