CN106713935A - Fast method for HEVC (High Efficiency Video Coding) block size partition based on Bayes decision - Google Patents

Fast method for HEVC (High Efficiency Video Coding) block size partition based on Bayes decision Download PDF

Info

Publication number
CN106713935A
CN106713935A CN201710014097.5A CN201710014097A CN106713935A CN 106713935 A CN106713935 A CN 106713935A CN 201710014097 A CN201710014097 A CN 201710014097A CN 106713935 A CN106713935 A CN 106713935A
Authority
CN
China
Prior art keywords
decision
current
making
value
risk
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201710014097.5A
Other languages
Chinese (zh)
Other versions
CN106713935B (en
Inventor
姚英彪
贾天婕
杨旭
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Wuhu Qibo Intellectual Property Operation Co.,Ltd.
Zhejiang Zhiduo Network Technology Co ltd
Original Assignee
Hangzhou Dianzi University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Hangzhou Dianzi University filed Critical Hangzhou Dianzi University
Priority to CN201710014097.5A priority Critical patent/CN106713935B/en
Publication of CN106713935A publication Critical patent/CN106713935A/en
Application granted granted Critical
Publication of CN106713935B publication Critical patent/CN106713935B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/60Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using transform coding
    • H04N19/61Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using transform coding in combination with predictive coding
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/102Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or selection affected or controlled by the adaptive coding
    • H04N19/119Adaptive subdivision aspects, e.g. subdivision of a picture into rectangular or non-rectangular coding blocks
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/134Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or criterion affecting or controlling the adaptive coding
    • H04N19/146Data rate or code amount at the encoder output
    • H04N19/147Data rate or code amount at the encoder output according to rate distortion criteria
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/134Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or criterion affecting or controlling the adaptive coding
    • H04N19/146Data rate or code amount at the encoder output
    • H04N19/149Data rate or code amount at the encoder output by estimating the code amount by means of a model, e.g. mathematical model or statistical model
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/169Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding
    • H04N19/17Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object
    • H04N19/176Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object the region being a block, e.g. a macroblock

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Physics & Mathematics (AREA)
  • Algebra (AREA)
  • General Physics & Mathematics (AREA)
  • Mathematical Analysis (AREA)
  • Mathematical Optimization (AREA)
  • Pure & Applied Mathematics (AREA)
  • Compression Or Coding Systems Of Tv Signals (AREA)

Abstract

The invention discloses a fast method for HEVC (High Efficiency Video Coding) block size partition based on a Bayes decision. The fast method comprises the following steps: first of all, dividing a video sequence into an online learning stage and a fast partitioning stage by employing scene change detection based on an average gray scale difference; then, for the online learning stage and a video frame which occurs a scene change, in each partitioning depth, respectively extracting Jinter and Jintra of a CU (Coding Unit) as characteristic values, thereby establishing a mixed Gaussian model, wherein specific parameters of the model are determined according to an EM algorithm initialized by a K-Means algorithm; and for a to-be-partitioned CU in the fast partitioning stage, extracting the characteristic values and finding a conditional probability on whether to partition according to the mixed Gaussian model, and at last, finding the decision with a relatively small risk by employing a Bayes formula of a minimum risk to take as a judgment basis on whether the current CU is partitioned. According to the fast method disclosed by the invention, the algorithm complexity is reduced, and the coding time can be greatly reduced.

Description

A kind of HEVC blocks based on Bayesian decision divide fast method
Technical field
It is the invention belongs to high-definition digital video compression coding and decoding technical field more particularly to a kind of based on Bayesian decision HEVC (High Efficiency Video Coding, efficient video coding) block divides fast method.
Background technology
In the fast development of digital video application industrial chain, in face of Video Applications constantly to fine definition, frame per second high, height The trend that compression ratio direction is developed, previous generation video compression standards agreement limitation H.264/AVC is constantly highlighted.Therefore, by The tissue JCT-VC that International Telecommunication Union (ITU-T) and Motion Picture Experts Group (MPEG) joint are set up proposes video of future generation Encoding and decoding standard --- H.265/HEVC.Its target is that under the premise of identical picture quality, compression ratio is than H.264/AVC high-grade It is secondary to double.
HEVC is in terms of coding principle and basic structure without breakthrough, and H.264/AVC basically identical, i.e. prediction plus conversion Block encoding mode;On Ciphering details and H.264/AVC also very close to comprising infra-frame prediction, inter prediction, estimation With the coding/decoding module such as compensation, orthogonal transformation, quantization, loop filtering, entropy code and reconstruction.But, and H.264/AVC compare Compared with HEVC almost takes important corrective measure in each coding link, and such as infra-frame prediction supports 35 kinds of patterns, frames Between predict that introducing Merge patterns, change quantization supports that being up to 32 × 32 converter units, entropy code uses CABAC (Context Adaptive Binary Arithmetic Coder, the adaptive binary arithmetic coding based on context) algorithm, newly introducing Self adaptation sampling point compensation loop filtering etc..
HEVC encoders employ code tree unit (CTU, Coding Tree Unit), coding unit (CU, Coding Unit), predicting unit (PU, Prediction Unit), converter unit (Transform Unit, TU) structure so that HEVC energy Enough videos to different resolution and applied environment are encoded.When HEVC is encoded, piece image can be divided into multiple mutual Nonoverlapping CTU, each CTU is divided into one or more CU;When frame in or inter prediction encoding is carried out, a CU can To select to be divided into one or more PU;In conversion, quantization operation, a CU can be divided into one or more TU.
CU is the elementary cell of HEVC codings, using recursive dividing mode.HEVC support 8 × 8,16 × 16,32 × 32, 64 × 64 this 4 kinds of CU of size, corresponding division depth is respectively 3,2,1,0.
PU is the elementary cell of HEVC predictions, it is stipulated that the predictive mode of CU, all coded datas related to prediction are all Transmitted by PU.For a size for 2N × 2N CU, its frame in PU can lectotype have 2 kinds:2N × 2N and N × N;Interframe PU can lectotype have 8 kinds, including 4 kinds of symmetric patterns:2N × 2N, 2N × N, N × 2N, N × N and 4 kinds of asymmetric moulds Formula:2N×nD、2N×nU、nR×2N、nL×2N.When using Merge when encoding, residual information is to be coded of, The PU patterns for now using are 2N × 2N skip patterns, and PU partition modes are as shown in Figure 1.
TU is the elementary cell of HEVC transform and quantizations, and its size is also flexible change, is entered in quaternary tree form Row recursive subdivision.When CTU sizes are 64 × 64, the quaternary tree recurrence of TU is divided as shown in Fig. 2 wherein CU's draws in each CU Divide indicated by the solid line, the division of TU is represented by dashed line, and coded sequence is represented with alpha code.
Similar with other international encoding standards, HM encoders are also based on rate-distortion optimization for the decision-making of block size (Rate-Distortion Optimization's, RDO), in HEVC standard encryption algorithm, the division process description of CU is such as Under:
(1) depth for setting current CU is zero;
(2) rate distortion costs (the Rate-Distortion Cost, RD of the Skip patterns under current division depth are calculated Cost)Jskip, and update the rate distortion costs of current optimal partition mode (BestCuMode=Skip) and minimum (MinRDcost=Jskip);
(3) the rate distortion costs J of the Merge_2Nx2N patterns under current division depth is calculatedmerge_2Nx2N, and with before The minimum rate distortion costs for going out are compared, if Jmerge_2Nx2N<MinRDcost, updates current optimal partition mode And minimum rate distortion costs (MinRDcost=J (BestCuMode=Merge_2Nx2N)merge_2Nx2N);
(4) the rate distortion costs J of the Inter_2Nx2N patterns under current division depth is calculatedInter_2Nx2N, and with before The minimum rate distortion costs for going out are compared, if JInter_2Nx2N<MinRDcost, updates current optimal partition mode And minimum rate distortion costs (MinRDcost=J (BestCuMode=Inter_2Nx2N)Inter_2Nx2N);
(5) if the maximum that the depth of current CU is encoder to be allowed divides depth, and the size of CU is bigger than 8 × 8, then ask The rate distortion costs J of Inter_NxN patternslnter_NxN, and compared with the minimum rate distortion costs for drawing before, if Jlnter_NxN<MinRDcost, updates the rate distortion of current optimal partition mode (BestCuMode=Inter_NxN) and minimum Cost (MinRDcost=JInter_NxN), if being unsatisfactory for above-mentioned CU depth and size requirement, skip step 5;
(6) the rate distortion costs J of the Inter_2NxN patterns under current division depth is calculatedInter_2NxN, and with before The minimum rate distortion costs for going out are compared, if Jinter_2NxN<MinRDcost, updates current optimal partition mode And minimum rate distortion costs (MinRDcost=J (BestCuMode=Inter_2NxN)Inter_2NxN);
(7) the rate distortion costs J of the Inter_Nx2N patterns under current division depth is calculatedInter_Nx2NIf, JInter_Nx2N< MinRDcost, updates the rate distortion costs of current optimal partition mode (BestCuMode=Inter_Nx2N) and minimum (MinRDcost=JInter_Nx2N);
(8) calculate whether current CU needs to calculate the rate distortion costs of AMP patterns, if BestCuMode=Inter_ 2Nx2N or BestCuMode=Inter_2NxN, Test_Hor condition are true, and otherwise Test_Hor is false;If BestCuMode=Inter_2Nx2N or BestCuMode=Inter_Nx2N, Test_Ver condition are true, otherwise Test_ Ver is false.
(9) if Test_Hor conditions are true, next step is continued;Otherwise, step 12 is jumped to;
(10) the rate distortion costs J of the Inter_2NxnU patterns under current division depth is calculatedInter_2NxnUIf, JInter_2NxnU<M inRDcost, update the rate of current optimal partition mode (BestCuMode=Inter_2NxnU) and minimum Distortion cost (MinRDcost=JInter_2NxnU);
(11) the rate distortion costs J of the Inter_2NxnD patterns under current division depth is calculatedInter_2NxnDIf, JInter_2NxnD<MinRDcost, updates the rate of current optimal partition mode (BestCuMode=Inter_2NxnD) and minimum Distortion cost (MinRDcost=JInter_2NxnD);
(12) if Test_Ver conditions are true, next step is continued;Otherwise, step 15 is jumped to;
(13) the rate distortion valency J of the Inter_nLx2N patterns under current division depth is calculatedInter_nLx2NIf, JInter_nLx2N < MinRDcost, update the rate distortion costs of current optimal partition mode (BestCuMode=Inter_nLx2N) and minimum (MinRDcost=JInter_nLx2N);
(14) the rate distortion costs J of the Inter_nRx2N patterns under current division depth is calculatedInter_nRx2NIf, JInter_nRx2N<MinRDcost, updates the rate of current optimal partition mode (BestCuMode=Inter_nRx2N) and minimum Distortion cost (MinRDcost=JInter_nRx2N);
(15) the rate distortion costs J of the Intra_2Nx2N patterns under current division depth is calculatedIntra_2Nx2NIf, JIntra_2Nx2N<MinRDcost, updates the rate of current optimal partition mode (BestCuMode=Intra_2Nx2N) and minimum Distortion cost (MinRDcost=JIntra_2Nx2N);
(16) if the current division depth of CU is by the depth capacity for allowing of encoding setting, its rate of lntra_NxN is calculated Distortion cost Jlntra_NxNIf, Jlntra_NxN<MinRDcost, updates current optimal partition mode (BestCuMode=lntra_ ) and minimum rate distortion costs (MinRDcost=J NxNlntra_NxN);
(17) if the maximum that the current division depth ratio PCM patterns of CU are allowed is divided, depth is small, calculates the rate of PCM patterns Distortion cost JPCMIf, JPCM< MinRDcost, then update current optimal partition mode (BestCuMode=PCM) and minimum Rate distortion costs (MinRDcost=JPCM);
(18) if BestCuMode is not more than coding with permitting for centering by the current size of Skip patterns or CU Perhaps minimum dimension, jumps to step 19;Otherwise, current CU will be divided into four sub-blocks, and the depth ratio of each sub-block is current Block is big by 1.Then to the algorithm steps of each Sub-CU iteration (1)-(18), the MinRDcost of sub-block is obtained respectively, after adding up It is J to makesub-CU
(19) rate distortion costs for comparing before dividing and being divided into after sub-block, if MinRDcost<JSub-CU, it is current to divide deep CU under degree does not continue segmentation, updates minimum rate distortion costs (MinCucost=MinRDcost);Otherwise current CU is divided It is four Sub-CU, and updates MinCuCost=JSub-CU
The algorithm steps determined by above-mentioned CU sizes understand that encoder will determine a quad-tree partition mode of CTU, Not only to determine the division depth of CU, also to travel through all possible PU, TU division side under a certain depth, finally therefrom select RD Cost minimum coding mode and corresponding division depth, therefore the computation complexity of HEVC block partition process is high.
The content of the invention
The present invention for technical problem present in above-mentioned prior art, according to video sequence time-space domain correlation, Propose a kind of HEVC blocks based on Bayesian decision and divide fast method.In the present invention, CU division operations are regarded as two points Class problem, i.e. W={ ωN, ωP, wherein ωNRepresent that current CU does not continue to divide, ωpRepresent that current CU continues to divide, and use Least risk Bayes decision-making solves two classification problems.
The technical solution adopted for the present invention to solve the technical problems comprises the following steps:
Step 1, off-line learning obtain loss function CNPAnd CPNAnd prior probability P (ωN) and P (ωP)。
When decision-making is done using the Bayesian decision criterion of minimal error rate, if the division result decision error of CU, i.e. shellfish The judgement whether this decision-making of leaf divides to current depth CU is different from the result of canonical algorithm, under will causing distortion performance Drop.Simultaneously for the size CU different with characteristic value, the degree that the distortion performance that the result of decision of mistake causes declines is not yet Together.Therefore, the present invention proposes the decline of the distortion performance for bringing erroneous decision as bayes risk function, and foundation is based on The Bayes decision-making model of minimum risk, the distortion performance brought with reducing erroneous decision is lost, complete rate distortion costs letter Number is shown below:
JFRD=(SSElumachroma×SSEchroma)+λmode×Bmode
Wherein, SSElumaRepresent the pixel difference error sum of squares between the original picture block and reconstruction image block of brightness; SSEchromaRepresent the pixel difference error sum of squares between the original picture block and reconstruction image block of carrier chrominance signal, ωchromaIt is color Spend the weight coefficient of component, λmodeIt is Lagrange's multiplier, BmodeRequired bit number when being to current coding mode.
For 64 × 64, the CU of 32 × 32,16 × 16 sizes is trained by off-line learning obtain loss matrix C respectively, is damaged Matrix is lost to be defined as follows shown in formula:
C hereinPNAnd CNPIt is defined as follows:
JFRD, NAnd JFRD, PBe respectively current CU to be divided do not continue to divide with continue to be divided into four sub-blocks it is corresponding completely Rate distortion costs.CNPRepresent and assume ωNIt is true, i.e., judges to assume ω when rate distorted result is " not dividing "PSet up, i.e., CU is adopted Take the value of risk brought during " division " decision-making;CPNRepresent and assume ωPIt is true, is brought when taking CU " not dividing " decision-making Value of risk.CNPAnd CPNMore than or equal to 0, (C when decision-making is correctNNOr CPP), the penalty values of decision-making are 0.
Prior probability P (ωR) it is by counting under current depth, making ωRThe proportion that the CU of decision-making accounts for all CU is obtained , i.e.,
P(ωR)=current depth belongs to ωRCU numbers/current depth CU sum;
Wherein R is N or P;
Step 2, for incoming sequence of pictures, judge whether whether it there occurs in on-line study stage or scene Change, if present frame is in on-line study stage or scene change frame, depth division is carried out according to HM canonical algorithms to it, The characteristic value of each depth is extracted and stored, step 7 is jumped to;Otherwise perform step 3.
Scene change be in order to judge the severe degree of video image motion, when there is scene change in video sequence, one As for i.e. show video content there occurs acutely variation, will be lost between adjacent frame of video temporal correlation (depending on Temporal redundancy of the frequency at switching disappears).In this case, the block based on Bayesian decision divides fast algorithm and discomfort With, once because scene is converted, the threshold value for judging whether to divide will change, based on offline statistics and on-line study Bayesian decision correct decisions will can not be made according to the characteristic value extracted.Therefore the present invention is proposed based on scene change inspection The on-line study stage of survey and fast alternate mechanism stage by stage.For preceding N frames, Dou Yaochong after N frames before sequence and scene change New characteristic value of extracting sets up new mixed Gauss model.
Because image intensity value can just be drawn before present frame is encoded by simple computation, if adjacent image is in together One scene, adjacent frame correlation is stronger, and gray value is more or less the same;Conversely, adjacent picture frame is in identical bits when occurrence scene is converted The gray value for putting pixel generally differs larger.
2-1. weighs the severe degree of video motion with average gray difference value, i.e., whether occurrence scene is converted, video sequence The average gray difference value of pixel of adjacent two frame in same position is calculated as follows shown in formula in row:
In formula, P, Q are picture traverse and height, Gn(i, j) is the gray value at n-th frame pixel (i, j) place, Gn-1(i, j) It is the gray value at (n-1)th frame pixel (i, j) place.If without scene switching, G is generally smaller.If occurrence scene is converted, G leads to Often it is increased dramatically.
In sum, a detection threshold value for judging whether occurrence scene conversion can be set, if G is more than detection threshold Value λsc, then it is assumed that scene is converted, otherwise it is assumed that scene is not changed in;λscIt is detection threshold value, its value is often empirical value.Threshold The size of value determines the accuracy of scene change, and crossing conference causes missing inspection and too small, can cause flase drop, therefore λscValue should be closed Reason.The frame number in on-line study stage is set to 6 by the present invention, and threshold value is set to 9, that is, work as G>Think that occurrence scene is converted when 9, otherwise There is no occurrence scene to convert.Once scene is converted, the statistical parameter of characteristic vector need be just reevaluated.
Step 3, judge whether picture is fast the first two field picture stage by stage, if it is not, performing step 4;Otherwise carry The characteristic value of on-line study stage or the storage of scene change frame is taken, and feature based value sets up mixed Gauss model, the system of model Meter parameter is estimated to expectation maximization (EM) algorithm of K-Means algorithm initializations.
If the observation information of each CU n dimensional feature vector X=[x1, x2, x3…xn] describe, p (X | ωR), R ∈ N, P }, it is ωRThe conditional probability density of feature vector, X is observed under state.State ωRPosterior probability p (ωR| X) can be by pattra leaves This formula is tried to achieve
For the risk that the decision-making for weighing mistake brings, present invention introduces loss function CRQ, represent rate-distortion optimization result It is decision-making R, mistakenly classifies as the risk in the case of decision-making Q.Consider that the Bayes rule in the case of risk is changed into:
RNN| X)=CNN p(X|ωN)P(ωN)+CPNp(X|ωP)P(ωP)
RPP| X)=CPPp(X|ωP)P(ωP)+CNP p(X|ωN)P(ωN)
RR(X) it is conditional risk, illustrates characteristic value X and be judged to ωRThe average lost during class, CNPRepresent and assume ωNFor Very, i.e., judge to assume ω when rate distorted result is " not dividing "PSet up, i.e., take CU the risk brought during " division " decision-making Cost.
In order to ensure expected risk minimum, when must ensure to do each decision-making, all cause that conditional risk is minimum, it is such to determine Plan is referred to as Least risk Bayes decision-making, and the rule of decision-making is as follows:If RNN|X)<RPP| X), then ω=ωN, decision-making It is ωN, represent the decision-making that " not dividing " is taken current CU;The decision-making of " division " is otherwise taken current CU.
3-1. feature extractions and selection
As described above, it is necessary to extract characteristic vector from observation to cause decision-making rationally, for the space-time of reflecting video Feature, our selected characteristic vectors are JinterAnd Jintra, obtained by following formula:
JLRD=SAD+ λpred×Bpred
Jinter=argminP ∈ Inter { JLRD(P)}
Jintra=argminP ∈ Intra { JLRD(P)}
λ hereinpredIt is Lagrange's multiplier, BpredTo all information under current prediction mode, (such as dividing mode, pattern are believed Breath, conversion coefficient etc.) bit number required when being encoded, P represents that the candidate of PU divides.The calculating of characteristics extraction is answered Miscellaneous degree is almost negligible, because the information of rate distortion costs can be obtained directly from cataloged procedure.
Feature vector, X is obtained from the picture in on-line study stage, if training picture is I frames, the dimension of characteristic vector Just drop to 1, X=Jintra, because I frames can only use intra prediction mode.
The probability density function of 3-2. on-line studies
In order to Bayesian decision is applied to quick CU partitioning algorithms, it is necessary to each on-line study stage estimate feature to The statistical parameter of amount.In the method for proposing, we are according to RNN| X) and RPP| X) value determine whether to divide current CU to be divided;From Least risk Bayes formula, to compare RRR| X) size, it is necessary to first try to achieve p (X | ωN)P (ωN) and p (X | ωP)P(ωP) value.
Each is divided in the case of " division " or " not dividing " of depth, and likelihood function is all obeyed dimensional Gaussian and divided at random Cloth, i.e. p (X | ωR) by two Gaussian probability-density function weighted sum approximate representations of independent probability, therefore can have:
Wherein i is i-th single Gauss model in mixed Gauss model.The parameter of each Gauss model is θi=(μi, σi), μiAnd σiRepresent class ωiCharacteristic vector mean vector and variance, the mixing probability function of feature vector, X is
Wherein Θ=(θ12) be mixed Gauss model distributed constant collection, P=(p (ω1),p(ω2)) it is corresponding height The prior probability collection of this model.The parameter of mixed Gauss model is determined by the EM algorithms with K-Means algorithm initializations.
Step 4, the characteristic value for extracting current CU, try to achieve p (X | ωR), R ∈ { N, P }.
The characteristic value X of CU to be divided is extracted, the letter on the Gaussian mixtures of same depth is tried to achieve according to residing depth Numerical value, i.e. p (X | ωN) and p (X | ωP)。
Step 5, decide whether to divide current CU, if dividing, mistake described in step 1- steps 5 is performed to each Sub-CU Journey;Do not divide current CU otherwise, jump to step 6.
Calculating is comparedWithSize, ifIt is larger, current CU is divided into four Sub-CU, And step 1-5 is performed to each Sub-CU;Otherwise, current CU is not divided.
Step 6, the CU is encoded, if current frame image CU divided completions, perform step 7;Otherwise, to next Individual CU performs process described in step 1-6.
Fast algorithm is divided according to the block based on Bayes and encodes current CU, then to next CU traversal steps 1-6.If working as All of CU has completed to divide in preceding picture frame, then perform step 7.Step 7,2-7 steps institutes is performed to next two field picture Process is stated, until having encoded all of frame of video.
For a two field picture of new incoming, the same block based on Bayes divides fast algorithm and processes, until having encoded All of frame of video.
So far, the whole HEVC blocks based on Bayesian decision divide fast method and terminate, for the division of a certain depth CU.
Beneficial effect
This method advantage is as follows:
(1) present invention does high-speed decision by Bayesian decision come the dividing mode to current depth CU, reduces CU and draws Divide algorithm complex, the HEVC scramble times can be greatly reduced.
(2) present invention employs the scene change detection mechanism based on average gray difference value, it is ensured that the stability of method And accuracy.
(3) using of the invention test result indicate that, compared with the original algorithms of HM13.0, announcement method of the present invention is reduced 44.5% scramble time, and coding bit rate BD-Rate only increases by 0.94%.
Brief description of the drawings
Fig. 1:H.265/HEVC the partition mode of PU.
Fig. 2:The quad-tree partition of CU, TU.
Fig. 3:On-line study stage and fast replace schematic diagram stage by stage.
Fig. 4:HEVC blocks based on Bayesian decision divide fast algorithm main-process stream.
Specific embodiment
By taking the block partition process of 16 sequences such as People on street as an example, the inventive method to proposing is carried out in detail Explanation.
The step of HEVC blocks based on Bayesian decision divide fast algorithm is as follows:
Step 1, off-line learning obtain loss function CNPAnd CPNAnd prior probability P (ωN) and P (ωP)。
According to below equation respectively to 64 × 64, the CU of 32 × 32,16 × 16 sizes is trained by off-line learning and damaged Lose Matrix C and prior probability:
JFRD=(SSElumachroma×SSEchroma)+λmode×Bmode
Wherein, SSEluma, SSEchromaRepresent the picture between the original picture block and reconstruction image block of luminance and chrominance information Element difference error sum of squares, ωchromaIt is the weight coefficient of chromatic component, λmodeIt is Lagrange's multiplier, BmodeIt is to present encoding Required bit number during pattern.JFRD, NAnd JFRD, PIt is respectively that current CU to be divided does not continue to divide and continue to be divided into four The corresponding complete rate distortion costs of sub-block.
Prior probability P (ωR) it is by counting under current depth, making ωRThe proportion that the CU of decision-making accounts for all CU is obtained , i.e.,
P(ωR)=current depth belongs to ωRCU numbers/current depth CU sum
Step 2, for incoming sequence of pictures, judge it whether in on-line study stage or whether scene there occurs Change, if present frame is in on-line study stage or scene change frame, depth division is carried out according to HM canonical algorithms to it, The characteristic value of each depth is extracted and stored, step 7 is jumped to;Otherwise perform step 3.
Video motion is weighed using the average gray difference value of pixel of adjacent two frame in same position in video sequence Severe degree, is calculated as follows shown in formula:
In formula, P, Q are picture traverse and height, Gn(i, j) is the gray value at n-th frame pixel (i, j) place, Gn-1(i, j) It is the gray value at (n-1)th frame pixel (i, j) place.If without scene switching, G is generally smaller.If occurrence scene is converted, G leads to Often it is increased dramatically.Once scene is converted, the statistical parameter of characteristic vector need be just reevaluated.Judge video according to following formula Whether occurrence scene changes frame:
Scene change be in order to judge the severe degree of video image motion, when there is scene change in video sequence, one As for i.e. show video content there occurs acutely variation, will be lost between adjacent frame of video temporal correlation (depending on Temporal redundancy of the frequency at switching disappears).In this case, the block based on Bayesian decision divides fast algorithm and discomfort With, once because scene is converted, the threshold value for judging whether to divide will change, based on offline statistics and on-line study Bayesian decision correct decisions will can not be made according to the characteristic value extracted.Therefore the present invention is proposed based on scene change inspection The on-line study stage of survey and fast alternate mechanism stage by stage.Schematic diagram is as shown in Figure 3.Become for N frames before sequence and scene N frames, will again extract characteristic value and set up new mixed Gauss model before after changing.
Step 3, judge whether picture is fast the first two field picture stage by stage, if it is not, performing step 4;Otherwise carry The characteristic value of on-line study stage or the storage of scene codes frame is taken, mixed Gauss model, the system of model are set up based on these data Meter parameter is estimated to the EM algorithms of K-Means algorithm initializations.
Assuming that the observation information of each CU n dimensional feature vector X=[x1, x2, x3…xn] describe, our selected characteristics Vector is JinterAnd Jintra, obtained by following formula:
JLRD=SAD+ λpred×Bpred
Jinter=argminP ∈ Inter { JLRD(P)}
Jintra=argminP ∈ Intra { JLRD(P)}
λ hereinpredIt is Lagrange's multiplier, BpredTo all information under current prediction mode, (such as dividing mode, pattern are believed Breath, conversion coefficient etc.) bit number required when being encoded, P represents that the candidate of PU divides.
Feature vector, X is obtained from the picture in on-line study stage, if training picture is I frames, the dimension of characteristic vector Just drop to 1, X=Jintra, because I frames can only use intra prediction mode.
Each is divided in the case of " division " or " not dividing " of depth, and likelihood function is all obeyed dimensional Gaussian and divided at random Cloth, i.e. p (X | ωR) by two Gaussian probability-density function weighted sum approximate representations of independent probability, therefore can have:
Wherein i is i-th single Gauss model in mixed Gauss model.The parameter of each Gauss model is θi=(μi, σi), μiAnd σiRepresent class ωiCharacteristic vector mean vector and variance, the mixing probability function of feature vector, X is
Wherein Θ=(θ12) be mixed Gauss model distributed constant collection, P=(p (ω1),p(ω2)) it is corresponding height The prior probability collection of this model.
In order to obtain the parameter of mixed Gauss model, defining log-likelihood function is
Wherein m represents the dimension of characteristic vector.Parametric solution is to need to obtain model parameter (Θ0, P0) cause formula logarithm Likelihood function value is maximum, then the statistical parameter of mixed model is estimated meet following formula:
The present invention is estimated the statistical parameter of mixed Gauss model with EM algorithms.Solution is divided into two using EM algorithms Individual step:The parameter for assuming initially that Gauss model is known, goes to estimate the weights of each Gauss model using parameter;Second Step, obtains after weights, recycles the weights of Gauss model to go to estimate its parameter.
The initial value for assuming initially that the mixed Gauss model parameter of the characteristic value of CU blocks is Θ(0), prior probability initial value is P(0), the mixed Gauss model estimates of parameters for then setting t steps is Θ(t)And P(t), then the calculating process of t+1 steps is as follows:
(1) E steps (calculate and expect):The estimate Θ of iteration is walked according to current mixed Gauss model parameter t(t)And P(t), calculating in known current characteristic value, the decision-making of CU belongs to ωiPosterior probability estimate:
Obviously, if characteristic value xm(m=1,2) ωiP (the ω of classi|xmi)≥P(ωi|xmj), j ≠ i, then xm Belong to ωiThe possibility of class is just very big.
(2) M steps (maximize and expect):The posterior probability that each data sample belongs to each subclass is obtained, this step profit The statistical parameter estimation formulas of formula mixed model are solved with gradient descent method.So as to obtain Θ, estimations of the P in the step of kth+1.Will be right Number likelihood function brings the posterior probability that above formula is calculated into respectively to Θ and P derivations, can be obtained after arrangement:
It is thereinBelong to the probability number of the characteristic value of the i-th class when representing the step iteration of kth+1.EM algorithms are by continuous The parameter of iteration E steps and M step improved models, until convergence can obtain the parameter of mixed Gauss model.The present invention The initial value of middle EM algorithms has K-Means algorithms to determine, and it is the metric algorithm of distinctiveness ratio to use Euclidean distance, Its formula is as follows:
Wherein A={ a1,a2,…,an, B={ a1,a2,…,anIt is two data element entries, each have n individual mensurable Characteristic attribute.
K-Means algorithms arbitrarily choose center of the k object as initial clustering first from sample.Then according to sample Distinctiveness ratio between cluster centre judges which cluster each sample belongs to.Then the cluster centre of each cluster is updated, is repeated Iteration said process, until criterion function is restrained, comprises the following steps that shown:
The initial cluster center point for randomly selecting k cluster first isThen repeat procedure below until Convergence:
(1) calculate each conditional probability p (X | ωi) belonging to class:
(2) in each class j, such barycenter is recalculated:
K is the number of cluster, it is necessary to be specified before cluster.c(i)Represent conditional probability p (X | ωi) with distance in k class most That near class, c(i)∈{1,2,3,…,k}.Barycenter μjRepresent j-th cluster centre position of class, iteration step (1) With step (2) until cluster centre no longer changes or varies less, in the present invention, k=2 is taken.
Step 4, the characteristic value for extracting current CU, try to achieve p (X | ωR), R ∈ { N, P }.
The characteristic value X of CU to be divided is extracted, the letter on the Gaussian mixtures of same depth is tried to achieve according to residing depth Numerical value, i.e. p (X | ωN) and p (X | ωP)。
Step 5, decide whether to divide current CU, if dividing, process described in step 1-5 is performed to each Sub-CU;It is no Do not divide current CU then, jump to step 6.
Calculating is comparedWithSize, ifIt is larger, current CU is divided into four Sub-CU, And step 1-5 is performed to each Sub-CU;Otherwise, current CU is not divided.
Step 6, the CU is encoded, if current frame image CU divided completions, perform step 7;Otherwise, to next Individual CU performs process described in step 1-6.
Fast algorithm is divided according to the block based on Bayes and encodes current CU, then to next CU traversal steps 1-6.If working as All of CU has completed to divide in preceding picture frame, then perform step 7.
Step 7, to next two field picture perform 2-7 steps described in process, until having encoded all of frame of video.
For a two field picture of new incoming, the same block based on Bayes divides fast algorithm and processes, until having encoded All of frame of video.
So far, the whole HEVC blocks based on Bayesian decision divide fast method and terminate, for the division of a certain depth CU, Algorithm flow chart is as shown in Figure 4.
Experiment test environment of the present invention is compiled using HEVC standard identifying code HM13.0 in the system VS2010 of windows 7 16 cycle tests are counted and tested by operation.With full I frames (All Intra, AI), arbitrary access (Random Access, RA), low latency-P frames (LP), low latency-B frames (LB) be coding environment, set 22,27,32,37 as QP values, it is sharp The block that will be proposed with the Bjontegaard-Delta bit rates (YBD-rate) of brightness divides fast algorithm and is drawn with HEVC standard block Divide algorithm to be contrasted, and count the encoding and decoding time, as a result as shown in table 1.
Algorithm proposed by the present invention is average under the configuration of AI compared to HM13.0 canonical algorithms as shown in Table 1 saves 46% Scramble time above, and BD-Rate only have lost 1.08%;45.9% scramble time, BD- are averagely saved under RA configurations Rate losses 1.12%;Scramble time under LB configurations reduces 42.8%, and BD-Rate only have lost 0.83%;LP is configured Averagely save for 42.8% scramble time down, BD-Rate have lost 0.73%, and coding efficiency declines very little.
The block that table 1 is based on Bayesian decision divides fast algorithm and HEVC standard algorithm effect comparing result
Certainly, those of ordinary skill in the art is it should be appreciated that above example is intended merely to illustrate this hair Bright, and limitation of the invention is not intended as, as long as within the scope of the invention, change, modification to above example are all Protection scope of the present invention will be fallen into.

Claims (4)

1. a kind of HEVC blocks based on Bayesian decision divide fast method, it is characterised in that regard CU division operations as two classification Problem, i.e. W={ ωN, ωP, wherein ωNRepresent that current CU does not continue to divide, ωpRepresent that current CU continues to divide, and using most Small risk Bayesian decision solves two classification problems, implements step as follows:
Step 1, off-line learning obtain loss function CNPAnd CPNAnd prior probability P (ωN) and P (ωP);
Step 2, for incoming sequence of pictures, judge whether whether it there occurs change in on-line study stage or scene Change, if present frame is in on-line study stage or scene change frame, depth division is carried out to it according to HM canonical algorithms, carry The characteristic value of each depth is taken and stored, step 7 is jumped to;Otherwise perform step 3;
Step 3, judge whether picture is fast the first two field picture stage by stage, if it is not, performing step 4;Otherwise it is extracted in Line learns the characteristic value of stage or the storage of scene change frame, and feature based value sets up mixed Gauss model, the statistics ginseng of model Number is estimated to the EM algorithms of K-Means algorithm initializations;
Step 4, the characteristic value for extracting current CU, try to achieve p (X | ωR)R∈{N,P};
The characteristic value X of CU to be divided is extracted, the functional value on the Gaussian mixtures of same depth is tried to achieve according to residing depth, I.e. p (X | ωN) and p (X | ωP);
Step 5, decide whether to divide current CU, if dividing, process described in step 1- steps 5 is performed to each Sub-CU;It is no Do not divide current CU then, jump to step 6;
The judgement divided for current CU is as follows:Calculating is comparedWithSize, ifIt is larger, ought Preceding CU is divided into four Sub-CU, and performs step 1-5 to each Sub-CU;
Step 6, the CU is encoded, if current frame image CU divided completions, perform step 7;Otherwise, to next CU Perform process described in step 1-6;
Step 7, to next two field picture perform 2-7 steps described in process, until having encoded all of frame of video.
2. a kind of HEVC blocks based on Bayesian decision according to claim 1 divide fast method;It is characterized in that step Rapid 1 detailed process is as follows:
The decline of the distortion performance that proposition brings erroneous decision is set up based on minimum risk as bayes risk function Bayes decision-making model, the distortion performance brought with reducing erroneous decision is lost, complete rate distortion costs function such as following formula institute Show:
JFRD=(SSElumachroma×SSEchroma)+λmode×Bmode
Wherein, SSEluma, represent the pixel difference error sum of squares between the original picture block and reconstruction image block of brightness;SSEchroma Represent the pixel difference error sum of squares between the original picture block and reconstruction image block of carrier chrominance signal, ωchromaIt is chromatic component Weight coefficient, λmodeIt is Lagrange's multiplier, BmodeRequired bit number when being to current coding mode;
For 64 × 64, the CU of 32 × 32,16 × 16 sizes is trained by off-line learning obtain loss matrix C respectively, loses square Battle array is defined as follows shown in formula:
C = C N N C N P C P N C P P
C hereinPNAnd CNPIt is defined as follows:
C P N = J F R D , P - J F R D , N J F R D , N
C N P = J F R D , N - J F R D , P J F R D , P
JFRD, NAnd JFRD, PIt is respectively that current CU to be divided does not continue to divide rate completely corresponding with continuing to be divided into four sub-blocks and loses True cost;CNPRepresent and assume ωNIt is true, i.e., judges to assume ω when rate distorted result is " not dividing "PSet up, i.e., CU is taken The value of risk brought during " division " decision-making;CPNRepresent and assume ωPIt is true, takes CU what is brought during " not dividing " decision-making Value of risk;CNPAnd CPNMore than or equal to 0, (C when decision-making is correctNNOr CPP), the penalty values of decision-making are 0;
Prior probability P (ωR) it is by counting under current depth, making ωRWhat the proportion that the CU of decision-making accounts for all CU was obtained, i.e.,
P(ωR)=current depth belongs to ωRCU numbers/current depth CU sum;
Wherein R is N or P.
3. a kind of HEVC blocks based on Bayesian decision according to claim 2 divide fast method;It is characterized in that step Rapid 2 are implemented as follows:
The severe degree of video motion is weighed with average gray difference value, i.e., whether occurrence scene is converted, adjacent in video sequence The average gray difference value of pixel of two frames in same position is calculated as follows shown in formula:
G = 1 P &CenterDot; Q &Sigma; i = 0 P - 1 &Sigma; j = 0 Q - 1 | G n ( i , j ) - G n - 1 ( i , j ) |
In formula, P, Q are picture traverse and height, Gn(i, j) is the gray value at n-th frame pixel (i, j) place, Gn-1(i, j) is the The gray value at n-1 frame pixel (i, j) place;If without scene switching, G is generally smaller;If occurrence scene is converted, G is generally anxious Increase severely big;
If G is more than detection threshold value, then it is assumed that scene is converted, otherwise it is assumed that scene is not changed in, i.e.,
λscIt is detection threshold value, its value is often empirical value, for judging whether that occurrence scene is converted;Detection threshold value is set to 9, that is, work as G> Think that occurrence scene is converted when 9, otherwise converted without occurrence scene.
4. a kind of HEVC blocks based on Bayesian decision according to claim 3 divide fast method;It is characterized in that step Rapid 3 concrete implementation process is as follows:
If the observation information of each CU n dimensional feature vector X=[x1, x2, x3…xn] describe, p (X | ωR), R ∈ { N, P } are ωRThe conditional probability density of feature vector, X is observed under state;State ωRPosterior probability p (ωR| X) can be by Bayesian formula Try to achieve
p ( &omega; R | X ) = p ( X | &omega; R ) P ( &omega; R ) P ( x ) , R &Element; { N , P }
Introduce loss function CRQ, represent that rate-distortion optimization result is decision-making R, mistakenly classify as the risk in the case of decision-making Q;Examine Consider the Bayes rule in the case of risk to be changed into:
RNN| X)=CNNp(X|ωN)P(ωN)+CPNp(X|ωP)P(ωP)
RPP| X)=CPPp(X|ωP)P(ωP)+CNPp(X|ωN)P(ωN)
RR(X) it is conditional risk, illustrates characteristic value X and be judged to ωRThe average lost during class, CNPRepresent and assume ωNBe it is true, i.e., Judge to assume ω when rate distorted result is " not dividing "PSet up, i.e., take CU the value of risk brought during " division " decision-making;
In order to ensure that expected risk is minimum, when must ensure to do each decision-making, all cause that conditional risk is minimum, such decision-making quilt Referred to as Least risk Bayes decision-making, the rule of decision-making is as follows:If RNN|X)<RPP| X), then ω=ωN, decision-making is ωN, represent the decision-making that " not dividing " is taken current CU;The decision-making of " division " is otherwise taken current CU;
3-1. feature extractions and selection
Characteristic vector J is extracted from observationinterAnd Jintra, obtained by following formula:
JLRD=SAD+ λpred×Bpred
Jinter=argminP ∈ Inter { JLRD(P)}
Jintra=argminP ∈ Intra { JLRD(P)}
λ hereinpredIt is Lagrange's multiplier, BpredTo all information under current prediction mode;Required bit when being encoded Number, P represents that the candidate of PU divides;
Feature vector, X is obtained from the picture in on-line study stage, if training picture is I frames, the dimension of characteristic vector just drops To 1, X=Jintra, because I frames can only use intra prediction mode;
The probability density function of 3-2. on-line studies
According to RNN| X) and RPP| X) value determine whether to divide current CU to be divided;By Least risk Bayes formula Understand, to compare RRR| X) size, it is necessary to first try to achieve p (X | ωN)P(ωN) and p (X | ωP)P(ωP) value;
Each is divided in the case of " division " or " not dividing " of depth, and likelihood function all obeys dimensional Gaussian random distribution, i.e. p (X|ωR) by two Gaussian probability-density function weighted sum approximate representations of independent probability, therefore have:
p ( X | &omega; i , &theta; i ) = 1 2 &pi;&sigma; i exp ( - 1 2 &sigma; i 2 ( X - &mu; i ) 2 ) i = 1 , 2
Wherein i is i-th single Gauss model in mixed Gauss model;The parameter of each Gauss model is θi=(μi, σi), μiAnd σi Represent class ωiCharacteristic vector mean vector and variance, the mixing probability function of feature vector, X is
p ( X | &Theta; , P ) = &Sigma; i = 1 2 P ( &omega; i ) p ( X | &omega; i , &theta; i )
Wherein Θ=(θ12) be mixed Gauss model distributed constant collection, P=(p (ω1),p(ω2)) it is corresponding Gaussian mode The prior probability collection of type;The parameter of mixed Gauss model is determined by the EM algorithms with K-Means algorithm initializations.
CN201710014097.5A 2017-01-09 2017-01-09 A kind of HEVC block division fast method based on Bayesian decision Active CN106713935B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201710014097.5A CN106713935B (en) 2017-01-09 2017-01-09 A kind of HEVC block division fast method based on Bayesian decision

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201710014097.5A CN106713935B (en) 2017-01-09 2017-01-09 A kind of HEVC block division fast method based on Bayesian decision

Publications (2)

Publication Number Publication Date
CN106713935A true CN106713935A (en) 2017-05-24
CN106713935B CN106713935B (en) 2019-06-11

Family

ID=58908184

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201710014097.5A Active CN106713935B (en) 2017-01-09 2017-01-09 A kind of HEVC block division fast method based on Bayesian decision

Country Status (1)

Country Link
CN (1) CN106713935B (en)

Cited By (22)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107547902A (en) * 2017-07-19 2018-01-05 南京邮电大学 Towards the adaptive rate distortion optimizing method of monitor video coding
CN108235018A (en) * 2017-12-13 2018-06-29 北京大学 A kind of point cloud intraframe coding optimization method and device based on Lagrange multiplier model
CN108702505A (en) * 2017-12-14 2018-10-23 北京金山云网络技术有限公司 A kind of coding unit divides decision-making technique, device, encoder and storage medium
CN109714584A (en) * 2019-01-11 2019-05-03 杭州电子科技大学 3D-HEVC depth map encoding unit high-speed decision method based on deep learning
CN109743575A (en) * 2018-12-05 2019-05-10 四川大学 A kind of DVC-HEVC video transcoding method based on naive Bayesian
CN109756719A (en) * 2019-01-28 2019-05-14 华侨大学 The 3D-HEVC interframe fast method of Bayesian decision is divided based on CU
CN109859248A (en) * 2018-12-24 2019-06-07 上海大学 A kind of secondary background modeling method based on time-domain difference
CN110581993A (en) * 2019-08-16 2019-12-17 中南大学 Coding unit rapid partitioning method based on intra-frame coding in multipurpose coding
CN111107359A (en) * 2019-12-16 2020-05-05 暨南大学 Intra-frame prediction coding unit dividing method suitable for HEVC standard
WO2020125363A1 (en) * 2018-12-18 2020-06-25 中国科学院深圳先进技术研究院 Low-complexity video signal coding processing method
CN111757129A (en) * 2020-07-07 2020-10-09 福州大学 VVC-oriented fast intra-frame prediction method
CN111837140A (en) * 2018-09-18 2020-10-27 谷歌有限责任公司 Video coded field consistent convolution model
WO2020242260A1 (en) * 2019-05-31 2020-12-03 한국전자통신연구원 Method and device for machine learning-based image compression using global context
CN112465664A (en) * 2020-11-12 2021-03-09 贵州电网有限责任公司 AVC intelligent control method based on artificial neural network and deep reinforcement learning
CN112822490A (en) * 2019-11-15 2021-05-18 北京博雅慧视智能技术研究院有限公司 Coding method for fast decision of intra-frame coding unit size based on perception
CN113129292A (en) * 2021-04-27 2021-07-16 陕西师范大学 Iterative Markov-based synthetic aperture radar image change detection method
CN113767400A (en) * 2019-03-21 2021-12-07 谷歌有限责任公司 Using rate distortion cost as a loss function for deep learning
CN114119789A (en) * 2022-01-27 2022-03-01 电子科技大学 Lightweight HEVC chrominance image quality enhancement method based on online learning
CN115514978A (en) * 2017-08-29 2022-12-23 谷歌有限责任公司 Mixing of probabilities for entropy coding in video compression
WO2023024115A1 (en) * 2021-08-27 2023-03-02 Oppo广东移动通信有限公司 Encoding method, decoding method, encoder, decoder and decoding system
WO2024077767A1 (en) * 2022-10-14 2024-04-18 北京大学深圳研究生院 Learning model-oriented coding decision processing method and apparatus, and device
TWI845197B (en) 2023-03-08 2024-06-11 大陸商星宸科技股份有限公司 Image encoder device and weighted prediction image encoding method

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110730350B (en) * 2019-09-25 2021-08-24 杭州电子科技大学 SHVC (scalable high-speed coding) quick coding method combining coding depth estimation and Bayesian judgment

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20020163532A1 (en) * 2001-03-30 2002-11-07 Koninklijke Philips Electronics N.V. Streaming video bookmarks
CN101833768A (en) * 2009-03-12 2010-09-15 索尼株式会社 Method and system for carrying out reliability classification on motion vector in video
US20110158320A1 (en) * 2008-09-04 2011-06-30 Yunfei Zheng Methods and apparatus for prediction refinement using implicit motion predictions
CN102333213A (en) * 2011-06-15 2012-01-25 夏东 H.264 compressed domain moving object detection algorithm under complex background
CN103237222A (en) * 2013-05-07 2013-08-07 河海大学常州校区 Motion estimation method adopting multi-mode searching manner
CN105430396A (en) * 2015-12-15 2016-03-23 浙江大学 Video coding method capable of deciding sizes of coding blocks by means of classification

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20020163532A1 (en) * 2001-03-30 2002-11-07 Koninklijke Philips Electronics N.V. Streaming video bookmarks
US20110158320A1 (en) * 2008-09-04 2011-06-30 Yunfei Zheng Methods and apparatus for prediction refinement using implicit motion predictions
CN101833768A (en) * 2009-03-12 2010-09-15 索尼株式会社 Method and system for carrying out reliability classification on motion vector in video
CN102333213A (en) * 2011-06-15 2012-01-25 夏东 H.264 compressed domain moving object detection algorithm under complex background
CN103237222A (en) * 2013-05-07 2013-08-07 河海大学常州校区 Motion estimation method adopting multi-mode searching manner
CN105430396A (en) * 2015-12-15 2016-03-23 浙江大学 Video coding method capable of deciding sizes of coding blocks by means of classification

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
XIAOLIN SHEN ET AL.: "Fast Coding Unit Size Selection for HEVC based on Bayesian Decisio Rule", 《2012 PICTURE CODING SYMPOSIUM》 *
XUEFEI FANG ET AL.: "Fast HEVC Intra Coding Unit Size Decision based on an Improved Bayesian Classification Framework", 《PCS2013》 *

Cited By (30)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107547902A (en) * 2017-07-19 2018-01-05 南京邮电大学 Towards the adaptive rate distortion optimizing method of monitor video coding
CN107547902B (en) * 2017-07-19 2020-03-31 南京邮电大学 Adaptive rate distortion optimization method for surveillance video coding
CN115514978B (en) * 2017-08-29 2024-03-26 谷歌有限责任公司 Method and apparatus for mixing probabilities of entropy coding in video compression
CN115514978A (en) * 2017-08-29 2022-12-23 谷歌有限责任公司 Mixing of probabilities for entropy coding in video compression
CN108235018A (en) * 2017-12-13 2018-06-29 北京大学 A kind of point cloud intraframe coding optimization method and device based on Lagrange multiplier model
CN108235018B (en) * 2017-12-13 2019-12-27 北京大学 Point cloud intra-frame coding optimization method and device based on Lagrange multiplier model
CN108702505A (en) * 2017-12-14 2018-10-23 北京金山云网络技术有限公司 A kind of coding unit divides decision-making technique, device, encoder and storage medium
US11375192B2 (en) 2017-12-14 2022-06-28 Beijing Kingsoft Cloud Network Technology Co., Ltd. Coding unit division decision method and device, encoder, and storage medium
CN108702505B (en) * 2017-12-14 2020-12-11 北京金山云网络技术有限公司 Coding unit division decision method, device, coder and storage medium
CN111837140A (en) * 2018-09-18 2020-10-27 谷歌有限责任公司 Video coded field consistent convolution model
CN109743575A (en) * 2018-12-05 2019-05-10 四川大学 A kind of DVC-HEVC video transcoding method based on naive Bayesian
WO2020125363A1 (en) * 2018-12-18 2020-06-25 中国科学院深圳先进技术研究院 Low-complexity video signal coding processing method
CN109859248B (en) * 2018-12-24 2024-03-19 上海大学 Time domain difference-based secondary background modeling method
CN109859248A (en) * 2018-12-24 2019-06-07 上海大学 A kind of secondary background modeling method based on time-domain difference
CN109714584A (en) * 2019-01-11 2019-05-03 杭州电子科技大学 3D-HEVC depth map encoding unit high-speed decision method based on deep learning
CN109756719A (en) * 2019-01-28 2019-05-14 华侨大学 The 3D-HEVC interframe fast method of Bayesian decision is divided based on CU
CN113767400A (en) * 2019-03-21 2021-12-07 谷歌有限责任公司 Using rate distortion cost as a loss function for deep learning
WO2020242260A1 (en) * 2019-05-31 2020-12-03 한국전자통신연구원 Method and device for machine learning-based image compression using global context
CN110581993A (en) * 2019-08-16 2019-12-17 中南大学 Coding unit rapid partitioning method based on intra-frame coding in multipurpose coding
CN112822490A (en) * 2019-11-15 2021-05-18 北京博雅慧视智能技术研究院有限公司 Coding method for fast decision of intra-frame coding unit size based on perception
CN111107359A (en) * 2019-12-16 2020-05-05 暨南大学 Intra-frame prediction coding unit dividing method suitable for HEVC standard
CN111757129B (en) * 2020-07-07 2023-07-07 福州大学 VVC-oriented rapid intra-frame prediction method
CN111757129A (en) * 2020-07-07 2020-10-09 福州大学 VVC-oriented fast intra-frame prediction method
CN112465664B (en) * 2020-11-12 2022-05-03 贵州电网有限责任公司 AVC intelligent control method based on artificial neural network and deep reinforcement learning
CN112465664A (en) * 2020-11-12 2021-03-09 贵州电网有限责任公司 AVC intelligent control method based on artificial neural network and deep reinforcement learning
CN113129292A (en) * 2021-04-27 2021-07-16 陕西师范大学 Iterative Markov-based synthetic aperture radar image change detection method
WO2023024115A1 (en) * 2021-08-27 2023-03-02 Oppo广东移动通信有限公司 Encoding method, decoding method, encoder, decoder and decoding system
CN114119789A (en) * 2022-01-27 2022-03-01 电子科技大学 Lightweight HEVC chrominance image quality enhancement method based on online learning
WO2024077767A1 (en) * 2022-10-14 2024-04-18 北京大学深圳研究生院 Learning model-oriented coding decision processing method and apparatus, and device
TWI845197B (en) 2023-03-08 2024-06-11 大陸商星宸科技股份有限公司 Image encoder device and weighted prediction image encoding method

Also Published As

Publication number Publication date
CN106713935B (en) 2019-06-11

Similar Documents

Publication Publication Date Title
CN106713935A (en) Fast method for HEVC (High Efficiency Video Coding) block size partition based on Bayes decision
US20230199179A1 (en) Image and video coding using machine learning prediction coding models
CN107071438B (en) Encoder and encoding method, and decoder and decoding method
CN110198447A (en) Method and apparatus for sharing candidate list
CN106131546B (en) A method of determining that HEVC merges and skip coding mode in advance
Hu et al. Fast HEVC intra mode decision based on logistic regression classification
CN102893604A (en) Data compression for video
CN108174204B (en) Decision tree-based inter-frame rapid mode selection method
CN111355956A (en) Rate distortion optimization fast decision making system and method based on deep learning in HEVC intra-frame coding
CN107079165A (en) Use the method for video coding and device of prediction residual
CN109361920B (en) Multi-scene-oriented inter-frame rapid prediction algorithm for adaptive decision tree selection
CN108712647A (en) A kind of CU division methods for HEVC
CN108769696A (en) A kind of DVC-HEVC video transcoding methods based on Fisher discriminates
CN110365988B (en) H.265 coding method and device
CN103888763B (en) Intra-frame coding method based on HEVC
CN111741297A (en) Inter-frame prediction method, video coding method and related devices thereof
Blanch et al. Chroma intra prediction with attention-based CNN architectures
CN116489386A (en) VVC inter-frame rapid coding method based on reference block
CN107690069B (en) Data-driven cascade video coding method
CN104601992A (en) SKIP mode quickly selecting method based on Bayesian minimum hazard decision
CN104333755B (en) The CU based on SKIP/Merge RD Cost of B frames shifts to an earlier date terminating method in HEVC
CN110225339A (en) A kind of HEVC video sequence coding/decoding accelerated method
CN110149512A (en) Inter-prediction accelerated method, control device, electronic device, computer storage medium and equipment
CN105791863B (en) 3D-HEVC depth map intra-frame predictive encoding method based on layer
CN109547798B (en) Rapid HEVC inter-frame mode selection method

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant
TR01 Transfer of patent right

Effective date of registration: 20201215

Address after: Room 806, building 5, Wuhu navigation Innovation Park, Wanbi Town, Wanbi District, Wuhu City, Anhui Province

Patentee after: Wuhu Qibo Intellectual Property Operation Co.,Ltd.

Address before: Room 3003-1, building 1, Gaode land center, Jianggan District, Hangzhou City, Zhejiang Province

Patentee before: Zhejiang Zhiduo Network Technology Co.,Ltd.

Effective date of registration: 20201215

Address after: Room 3003-1, building 1, Gaode land center, Jianggan District, Hangzhou City, Zhejiang Province

Patentee after: Zhejiang Zhiduo Network Technology Co.,Ltd.

Address before: 310018 No. 2 street, Xiasha Higher Education Zone, Hangzhou, Zhejiang

Patentee before: HANGZHOU DIANZI University

TR01 Transfer of patent right
EE01 Entry into force of recordation of patent licensing contract

Application publication date: 20170524

Assignee: Hangzhou Qihu Information Technology Co.,Ltd.

Assignor: Wuhu Qibo Intellectual Property Operation Co.,Ltd.

Contract record no.: X2021330000547

Denomination of invention: A fast hevc block partition method based on Bayesian decision

Granted publication date: 20190611

License type: Common License

Record date: 20211028

EE01 Entry into force of recordation of patent licensing contract
EE01 Entry into force of recordation of patent licensing contract

Application publication date: 20170524

Assignee: Hangzhou Julu enterprise management consulting partnership (L.P.)

Assignor: Wuhu Qibo Intellectual Property Operation Co.,Ltd.

Contract record no.: X2021330000726

Denomination of invention: A fast hevc block partition method based on Bayesian decision

Granted publication date: 20190611

License type: Common License

Record date: 20211109

EE01 Entry into force of recordation of patent licensing contract
EC01 Cancellation of recordation of patent licensing contract

Assignee: Hangzhou Julu enterprise management consulting partnership (L.P.)

Assignor: Wuhu Qibo Intellectual Property Operation Co.,Ltd.

Contract record no.: X2021330000726

Date of cancellation: 20221103

Assignee: Hangzhou Qihu Information Technology Co.,Ltd.

Assignor: Wuhu Qibo Intellectual Property Operation Co.,Ltd.

Contract record no.: X2021330000547

Date of cancellation: 20221103

EC01 Cancellation of recordation of patent licensing contract