RU2562414C1

RU2562414C1 - Method for fast selection of spatial prediction mode in hevc coding system

Info

Publication number: RU2562414C1
Application number: RU2014138769/08A
Authority: RU
Inventors: Максим Павлович Шарабайко; Олег Геннадьевич Пономарев
Original assignee: Закрытое акционерное общество "Элекард наноДевайсез"
Priority date: 2014-09-24
Filing date: 2014-09-24
Publication date: 2015-09-10
Also published as: EA201700050A1; WO2016048186A1

Abstract

FIELD: physics.

SUBSTANCE: disclosed is a method for fast selection of spatial prediction mode in a HEVC coding system. Analysis of the modulus of high-frequency horizontal and vertical components of the stationary wavelet of Haar transformation of an image of the encoded region is performed to construct a list of candidate modes. The process of constructing candidate modes consists of four steps. The first and second steps include making decisions to include a Planar mode and a DC mode in the list. The third step includes selecting angular candidate modes based on values in an array of vertical details and the fourth step includes selecting candidate modes based on analysis of the values of an array of horizontal details.

EFFECT: speeding up the encoding process by reducing the volume of computations when searching for an optimum block coding mode.

Description

ОБЛАСТЬ ТЕХНИКИFIELD OF TECHNOLOGY

Изобретение относится к кодированию и декодированию цифровых видеоданных.The invention relates to the encoding and decoding of digital video data.

УРОВЕНЬ ТЕХНИКИBACKGROUND

В основе алгоритмов компрессии видеоданных HEVC лежит несколько простых идей. Если взять некоторую часть изображения, то с большой вероятностью вблизи этого участка в данном кадре или в соседних кадрах окажется участок, содержащий похожее, мало отличающееся по значениям интенсивности пикселей, изображение. Таким образом, для передачи информации об изображении в текущем участке достаточно передать только его отличие от ранее закодированного похожего участка. Процесс поиска похожих участков среди ранее закодированных изображений называют предсказанием (от англ. prediction). Набор разностных значений, определяющих отличие текущего участка от найденного предсказания, называют остатком (от англ. residual). Можно выделить два основных типа предсказания. В первом из них значения Prediction представляют собой набор линейных комбинаций пикселей, примыкающих к текущему участку изображения слева и сверху. Такое предсказание называют пространственным (от англ. Intra Prediction). Во втором - в качестве предсказания используются линейные комбинации пикселей похожих участков изображений ранее закодированных кадров (эти кадры называют ссылочными - от англ. Reference). Такое предсказание называют временным (от англ. Inter Prediction). Для восстановления изображения текущего участка, закодированного с временным предсказанием, при декодировании необходима информация не только об остатке (Residual), но и о номере кадра, на котором находится похожий участок, и координатах этого участка.HEVC video compression algorithms are based on a few simple ideas. If we take a certain part of the image, then with a high probability near this section in this frame or in neighboring frames there will be a section containing a similar image that differs little in terms of pixel intensity. Thus, to transmit information about the image in the current section, it is sufficient to transmit only its difference from a previously encoded similar section. The process of finding similar areas among previously encoded images is called prediction (from the English prediction). A set of difference values that determine the difference between the current section and the found prediction is called the remainder (from the English residual). Two main types of prediction can be distinguished. In the first of them, Prediction values are a set of linear combinations of pixels adjacent to the current image area on the left and top. Such a prediction is called spatial (from the English Intra Prediction). In the second, linear combinations of pixels of similar sections of images of previously encoded frames are used as a prediction (these frames are called reference frames from the English Reference). Such a prediction is called temporary (from the English Inter Prediction). To restore the image of the current section encoded with time prediction, decoding requires information not only about the Residual, but also about the frame number on which the similar section is located and the coordinates of this section.

На следующем этапе кодирования полученные при предсказании значения Residual подвергаются двумерному косинус-преобразованию Фурье с последующим квантованием. Затем полученный набор квантованных спектральных коэффициентов, сопровождаемый информацией, необходимой для выполнения предсказаний при декодировании, подвергается энтропийному кодированию.At the next stage of coding, the Residual values obtained during the prediction are subjected to a two-dimensional Fourier cosine transform with subsequent quantization. Then, the obtained set of quantized spectral coefficients, accompanied by the information necessary to make predictions during decoding, is subjected to entropy encoding.

Основной структурной единицей в HEVC является блок кодирования (CU - сокр. от англ. coding unit). Внутри каждого такого блока выбираются области - блоки предсказания (PU - сокр. от англ. prediction unit). Разбиение видеокадра на CU производится адаптивно, так что есть возможность подстраивать границы CU под границы объектов на изображении, а вложенные CU образуют квадродерево.The basic structural unit in HEVC is the coding unit (CU - abbr. From English coding unit). Inside each such block, areas are selected - prediction blocks (PU - abbr. From English prediction unit). The video frame is divided into CU adaptively, so it is possible to adjust the borders of the CU to the boundaries of the objects in the image, and the embedded CUs form a quad tree.

В пределах каждой CU выбираются области для вычисления предсказания - Prediction Unit (PU). При пространственном предсказании область CU может совпадать с PU (режим 2N×2N) или может быть разбита на 4 квадратных PU вдвое меньшего размера (режим N×N). Стандартом определены минимально и максимально возможные размеры PU - 4×4 и 32×32 соответственно.Within each CU, areas for prediction calculation — the Prediction Unit (PU) —are selected. In spatial prediction, the CU region may coincide with the PU (2N × 2N mode) or may be divided into 4 square PUs half the size (N × N mode). The standard defines the minimum and maximum possible sizes of PUs - 4 × 4 and 32 × 32, respectively.

Пространственное предсказание выполняется в HEVC одним из 35 способов. При этом используются значения пикселей-«соседей», примыкающих к границе кодируемой PU слева и сверху. Способы пространственного предсказания в HEVC можно условно разделить на две неравные группы. К первой группе относятся два способа - Planar и DC. В режиме Planar значения, используемые в качестве предсказания, лежат на плоскости, наклон которой в вертикальном и горизонтальном направлениях определяется по пикселям-«соседям». Этот режим предназначен для предсказания областей PU с линейным изменением значений пикселей в каком-либо направлении. В режиме DC в качестве предсказания всех пикселей из PU используется одно значение, равное среднему арифметическому пикселям-«соседям». Назначение этого режима не требует пояснений.Spatial prediction is performed in HEVC in one of 35 ways. In this case, the values of pixels - "neighbors" are used, adjacent to the border of the encoded PU on the left and top. Spatial prediction methods in HEVC can be divided into two unequal groups. The first group includes two methods - Planar and DC. In Planar mode, the values used as a prediction lie on a plane whose slope in the vertical and horizontal directions is determined by the "neighboring" pixels. This mode is designed to predict areas of the PU with a linear change in pixel values in any direction. In DC mode, a single value equal to the arithmetic average of the neighboring pixels is used as a prediction of all pixels from the PU. The purpose of this mode is self-explanatory.

Режимы второй группы называют угловыми. При вычислении значения, используемого в качестве предсказания, во всех 33-х угловых режимах пиксели-«соседи» сдвигаются в заданном (одном из 33-х) направлении. Если положение предсказываемого пикселя попадает между сдвинутыми копиями пикселей-«соседей», для расчета предсказания используется линейная интерполяция. Точность оценки положения предсказываемого пикселя между сдвинутыми пикселями-«соседями» задана равной 1/32 межпиксельного интервала.The modes of the second group are called angular. When calculating the value used as a prediction, in all 33 angular modes, the neighboring pixels are shifted in a given (one of 33) directions. If the position of the predicted pixel falls between shifted copies of the "neighbor" pixels, linear interpolation is used to calculate the prediction. The accuracy of estimating the position of the predicted pixel between the shifted "neighbor" pixels is set to 1/32 of the pixel interval.

Адаптивность разбиения максимально возможной CU совместно с большим количеством возможных режимов предсказания порождает огромное число возможных вариантов предсказания каждой такой CU так, что процесс кодирования видеокадров становится крайне вычислительно емким. Снижение вычислительных затрат на кодирование возможно, прежде всего, за счет предварительного отбора режимов предсказания, построения списка режимов-кандидатов. После построения такого списка выбор режима предсказания для каждой CU осуществляется только из режимов-кандидатов. Понятно, что чем короче список, тем эффективнее в вычислительном отношении система кодирования. С другой стороны, такой список должен с высокой вероятностью содержать наилучший для каждой CU, то есть обеспечивающий минимальное отличие предсказания от кодируемых пикселей и минимальное количество битов, представляющих CU в закодированном видеопотоке, режим предсказания.The adaptability of splitting the maximum possible CU together with a large number of possible prediction modes gives rise to a huge number of possible prediction options for each such CU so that the process of encoding video frames becomes extremely computationally intensive. Reducing the computational cost of coding is possible, first of all, due to the preliminary selection of prediction modes, building a list of candidate modes. After constructing such a list, the prediction mode for each CU is selected only from candidate modes. It is clear that the shorter the list, the more computationally efficient the coding system. On the other hand, such a list should with high probability contain the best prediction mode for each CU, that is, ensuring the minimum difference between the prediction and the encoded pixels and the minimum number of bits representing the CU in the encoded video stream.

Из предшествующего уровня техники известен способ выбора одного из 35 режимов предсказания при кодировании, реализованных в кодере [HEVC Test Model НМ v. 11.0 [Электронный ресурс]. - Режим доступа: https://hevc.hhi.fraunhofer.de/svn/svn_HEVCSoftware/tags/HM-11.0/], в котором осуществляется перебор всех возможных для каждого кодируемого блока вариантов предсказания. В этом случае выполняется полный цикл кодирования-декодирования каждого блока, что позволяет провести т.н. Rate-Distortion Optimization (RDO). В процессе RDO из всех возможных режимов выбирается тот, который обеспечивает наибольшую степень сжатия видеоданных кодируемого блока (наименьший rate) при наименьшем уровне искажений, вносимых в эти данные в процессе кодирования (наименьший уровень Distortion).From the prior art, a method is known for selecting one of the 35 prediction modes during encoding implemented in the encoder [HEVC Test Model NM v. 11.0 [Electronic resource]. - Access mode: https://hevc.hhi.fraunhofer.de/svn/svn_HEVCSoftware/tags/HM-11.0/], which enumerates all the possible prediction options for each encoded block. In this case, a complete encoding-decoding cycle of each block is performed, which allows the so-called Rate-Distortion Optimization (RDO). In the RDO process, from all possible modes, one is selected that provides the greatest degree of compression of the video data of the encoded block (the lowest rate) at the lowest level of distortion introduced into this data during the encoding (lowest level of Distortion).

Недостатком этого решения является то, что такой подход оказывается крайне затратным в вычислительном отношении, но обеспечивает гарантированный выбор наилучшего режима предсказания.The disadvantage of this solution is that this approach is extremely computationally expensive, but provides a guaranteed choice of the best prediction mode.

Известен также способ, называемый быстрым, [Zhao, L.; Zhang, L.; Ma, S.; Zhao, D. Fast mode decision algorithm for intra prediction in HEVC. Visual Communications and Image Processing (VCIP). IEEE, 2011, pp. 1-4], который реализуется в два этапа. На первом этапе выполняется предсказание кодируемого блока всеми возможными способами и формируется укороченный список режимов-кандидатов. Выбор из сформированного списка наилучшего режима осуществляется в процессе RDO на втором этапе. Отбор режимов-кандидатов на этапе формирования списка может осуществляться по различным критериям, определяемым настройками кодирующей системы. В качестве такого критерия может выступать минимальное значение суммы абсолютных разностей пикселей предсказания и оригинала (SAD или Sum of Absolute Differences), минимальное значение суммы квадратов разностей пикселей предсказания и оригинала (SSE или Sum of Sqared Errors) или минимальное значение суммы модулей коэффициентов преобразования Адамара разностей пикселей предсказания и оригинала (SATD или Sum of Absolute Transformed Differences).There is also a method called fast, [Zhao, L .; Zhang, L .; Ma, S .; Zhao, D. Fast mode decision algorithm for intra prediction in HEVC. Visual Communications and Image Processing (VCIP). IEEE, 2011, pp. 1-4], which is implemented in two stages. At the first stage, the encoded block is predicted by all possible methods and a shortened list of candidate modes is formed. The selection of the best mode from the generated list is carried out in the second stage of the RDO process. The selection of candidate modes at the stage of list formation can be carried out according to various criteria determined by the settings of the coding system. Such a criterion may be the minimum value of the sum of the absolute differences of the prediction pixels and the original (SAD or Sum of Absolute Differences), the minimum value of the sum of the squares of the differences of the prediction pixels and the original (SSE or Sum of Sqared Errors), or the minimum value of the sum of the modules of the Hadamard transform coefficients pixels of the prediction and the original (SATD or Sum of Absolute Transformed Differences).

Недостатком этого способа является то, что для построения списка режимов-кандидатов необходимо выполнить предсказание кодируемой области всеми 35 способами. Быстрый способ опять основан на полном переборе всех 35 режимов предсказания для каждой кодируемой области. Некоторое ускорение здесь достигается не за счет уменьшения количества перебираемых режимов предсказания, а за счет упрощения процедуры оценки качества предсказания в каждом из режимов. На основе таких упрощенных процедур оценивания качества строится сокращенный список режимов-кандидатов, для которых уже проводится полная RDO-оценка качества предсказания, полностью аналогичная той, что используется в первом способе.The disadvantage of this method is that to build a list of candidate modes, it is necessary to predict the encoded area in all 35 ways. The fast method is again based on a complete enumeration of all 35 prediction modes for each coded area. Some acceleration here is achieved not by reducing the number of predicted prediction modes, but by simplifying the procedure for assessing the quality of prediction in each of the modes. Based on such simplified quality assessment procedures, a shortened list of candidate modes is constructed for which a full RDO-assessment of the quality of the prediction is already carried out, completely similar to that used in the first method.

СУЩНОСТЬ ИЗОБРЕТЕНИЯSUMMARY OF THE INVENTION

Технический результат, который решается с помощью предложенного решения, состоит в снижении объемов вычислений при поиске оптимального режима кодирования блока, что позволяет ускорить процесс кодирования в целом за счет предварительного отбора режимов предсказания, построения списка режимов-кандидатов.The technical result, which is solved with the help of the proposed solution, is to reduce the amount of computation when searching for the optimal coding mode of the block, which allows to speed up the coding process as a whole due to the preliminary selection of prediction modes, building a list of candidate modes.

Технический результат достигается тем, что в способе быстрого выбора режима пространственного предсказания в системе кодирования HEVC, заключающемся в построении укороченного списка режимов-кандидатов пространственного предсказания длиной не более шести позиций, основанном на анализе высокочастотных горизонтальной и вертикальной составляющих стационарного вейвлет преобразования Хаара кодируемой области изображения, согласно предложенному решению: - массив значений высокочастотной горизонтальной составляющей преобразования Хаара получается путем вычитания из значения каждого пикселя кодируемой области значения соседнего слева пикселя;The technical result is achieved in that in a method for quickly selecting a spatial prediction mode in the HEVC coding system, which consists in constructing a shortened list of spatial prediction candidate modes with a length of not more than six positions, based on the analysis of the high-frequency horizontal and vertical components of the stationary wavelet transform of the Haar encoded image region, according to the proposed solution: - an array of values of the high-frequency horizontal component of the Haar transformation It is obtained by subtracting the value of each pixel value of the encoded region adjacent left pixel;

- массив значений высокочастотной вертикальной составляющей преобразования Хаара получается путем вычитания из значения каждого пикселя кодируемой области значения соседнего сверху пикселя;- the array of values of the high-frequency vertical component of the Haar transform is obtained by subtracting from the value of each pixel of the encoded region the values of the neighboring neighboring top pixel;

- режим Planar включают в список режимов-кандидатов, если все абсолютные значения горизонтальных и вертикальных высокочастотных составляющих меньше шага квантования при кодировании;- Planar mode is included in the list of candidate modes if all absolute values of horizontal and vertical high-frequency components are less than the quantization step during encoding;

- режим DC включают в список режимов-кандидатов, если среднеквадратическое отклонение значений горизонтальных и вертикальных высокочастотных составляющих меньше шага квантования,- DC mode is included in the list of candidate modes if the standard deviation of the horizontal and vertical high-frequency components is less than the quantization step,

- два угловых режима предсказания, соответствующие взаимному смещению точек пересечения линии минимальных значений модуля градиента, проходящей через точку максимума массива вертикальных высокочастотных составляющих, с вертикальными границами массива высокочастотных составляющих включают в список режимов-кандидатов, - two angular prediction modes corresponding to the mutual displacement of the points of intersection of the line of minimum values of the gradient modulus passing through the maximum point of the array of vertical high-frequency components, with the vertical boundaries of the array of high-frequency components are included in the list of candidate modes,

-два угловых режима предсказания, соответствующие взаимному смещению точек пересечения линии минимальных значений модуля градиента, проходящей через точку максимума массива горизонтальных высокочастотных составляющих, с горизонтальными границами массива высокочастотных составляющих включают в список режимов-кандидатов,- two angular prediction modes corresponding to the mutual displacement of the points of intersection of the line of the minimum values of the gradient module passing through the maximum point of the array of horizontal high-frequency components, with horizontal boundaries of the array of high-frequency components, are included in the list of candidate modes,

- окончательный выбор режима предсказания для кодируемой области изображения из построенного списка режимов-кандидатов осуществляют на основе стандартной RDO (rate-distortion optimization) оценки.- the final choice of the prediction mode for the encoded image area from the constructed list of candidate modes is carried out on the basis of standard RDO (rate-distortion optimization) estimates.

СПОСОБ ОСУЩЕСТВЛЯЕТСЯ СЛЕДУЮЩИМ ОБРАЗОМThe method is carried out as follows.

Для построения списка режимов-кандидатов проводят анализ модуля высокочастотных горизонтальной и вертикальной составляющих (подробностей) стационарного вейвлет преобразования Хаара (СВПХ) изображения кодируемой области.To build a list of candidate modes, an analysis of the module of high-frequency horizontal and vertical components (details) of the stationary Haar wavelet transform (CVH) of the image of the encoded area is carried out.

Для получения горизонтальных и вертикальных подробностей СВПХ формируются массивы hC(x,y), х=-1,0,…,nТ-1, y=1,0,…,nТ-1 и νC(x,y), х=-1,0,…nТ-1, y=-1,0,…,nT-1. Значения элементов этих массивов при x=0,…, nТ-1, y=0,…,nТ-1 равны значениям интенсивности пикселов кодируемой PU. Значения hc(-1,y),y=-1,…nT равны значениям интенсивности пикселей-«соседей», примыкающих к области PU слева, a hc(x,-1),x=0,…,nΤ-1 равны значениям интенсивности пикселей-«соседей», примыкающих к PU сверху. Аналогично, значения νC(x,-1),х=-1,…,nΤ-1 равны значениям интенсивности пикселей-соседей», примыкающих к области PU сверху, а νC(-1,y),y=0,…,nТ-1 - значениям интенсивности пикселей-«соседей», примыкающих к PU слева. За nT обозначен размер кодируемой PU. Горизонтальные и вертикальные подробности H и V формируются как:To obtain horizontal and vertical details of the SVPK, arrays hC (x, y), x = -1.0, ..., nT-1, y = 1.0, ..., nT-1 and νC (x, y), x = -1.0, ... nT-1, y = -1.0, ..., nT-1. The values of the elements of these arrays at x = 0, ..., nТ-1, y = 0, ..., nТ-1 are equal to the pixel intensities of the encoded PU. The values of hc (-1, y), y = -1, ... nT are equal to the intensities of the pixel “neighbors” adjacent to the PU region to the left, and hc (x, -1), x = 0, ..., nΤ-1 are the values of the intensity of the pixels "neighbors" adjacent to the PU from above. Similarly, the values of νC (x, -1), x = -1, ..., nΤ-1 are equal to the intensities of the neighboring pixels ”adjacent to the PU region from above, and νC (-1, y), y = 0, ..., nТ-1 - the intensity values of the pixels - "neighbors" adjacent to the PU on the left. For nT, the size of the encoded PU is indicated. Horizontal and vertical details of H and V are formed as:

H(x,y)=|hC(x,y)-hC(x-1,y)|, x=0,…,nT-1, y=0,…,nT.H (x, y) = | hC (x, y) -hC (x-1, y) |, x = 0, ..., nT-1, y = 0, ..., nT.

V(x,y)=|νC(x,y)-νC(x,y-1)|, x=0,…,nΤ, y=0,…,nT-1.V (x, y) = | νC (x, y) -νC (x, y-1) |, x = 0, ..., nΤ, y = 0, ..., nT-1.

Процесс построения списка режимов-кандидатов состоит из четырех этапов. На первом этапе принимается решение о включении в список режима Planar. Этот режим включается в список, если:The process of building a list of candidate modes consists of four stages. At the first stage, a decision is made to include Planar mode in the list. This mode is included in the list if:

,

где: qStep - шаг квантования.where: qStep is the quantization step.

На втором этапе проверяется режим DC. Этот режим включается в список, если выполняется условие:In the second step, the DC mode is checked. This mode is included in the list if the condition is met:

На третьем этапе выбираются угловые режимы-кандидаты по значениям в массиве вертикальных подробностей V(x,y). Для этого определяется позиция y_max, x_max максимального элемента массива V(x,y). Текущая позиция y_cr устанавливается равной y_max, x_cr - равной x_max. Итеративно для каждого следующего столбца с номером x_cr=х_cr+1 новое значение y_cr устанавливается равным номеру максимального элемента в данном столбце из диапазона [y_cr-2, y_cr+2,].At the third stage, the candidate angular modes are selected by the values in the array of vertical details V (x, y). For this, the position y _max , x _{max of the} maximum element of the array V (x, y) is determined. The current position y _{cr is} set equal to y _max , x _cr equal to x _max . Iteratively for each next column with the number x _cr = x _cr +1, the new value of y _{cr is} set equal to the number of the maximum element in this column from the range [y _cr -2, y _cr +2,].

Обновление позиции продолжается, пока x_cr≤nT и 0<y_cr<nT-1. Точки x_cr, y_cr в процессе итераций проходят по линии минимального градиента значений V(x,y) справа от максимального элемента этого массива. Аналогичный итеративный процесс позволяет проследить линию наименьшего градиента значений V(x,y) слева от позиции x_max, y_max. Начинается этот процесс с установки текущей позиции y_cl равной y_max, х_сl - равной x_max. Итеративно для каждого следующего столбца с номером х_с=x_cl-1 новое значение y_сl устанавливается равным номеру максимального элемента в данном столбце из диапазона [y_cl-2, y_cl+2]. Обновление позиции х_сl, y_сl продолжается, пока х_сl≥0 и 0<у_cr<nТ-1.The position update continues until x _cr ≤nT and 0 <y _cr <nT-1. The points x _cr , y _cr during iterations pass along the line of the minimum gradient of V (x, y) values to the right of the maximum element of this array. A similar iterative process allows us to trace the line of the smallest gradient of the values of V (x, y) to the left of the position x _max , y _max . This process begins by setting the current position y _cl equal to y _max , x _cl - equal to x _max . Iteratively for each next column with the number x _{c =} x _cl -1, the new value of y _{cl is} set equal to the number of the maximum element in this column from the range [y _cl -2, y _cl +2]. The update of the position x _cl , y _cl continues until x _cl ≥ 0 and 0 <y _cr <nT-1.

Если обе позиции y_cr и y_cl попали на одну и ту же границу блока, результаты такого поиска некорректны. В таком случае поиск выполняется вверх и вниз в соответствии с алгоритмом четвертого этапа, применяемого к подробностям V(x,y).If both positions y _cr and y _cl hit the same block boundary, the results of such a search are incorrect. In this case, the search is performed up and down in accordance with the algorithm of the fourth stage, applied to the details of V (x, y).

Взаимное расположение найденных двух точек x_cl, y_cl и x_cr, y_cr задают направление, по которому определяется параметр режимов углового предсказания IntraPredAngle и номер режима предсказания. Определяются значения переменных следующими выражениями:The relative position of the two points x _cl , y _cl and x _cr , y _{cr found} determines the direction in which the parameter of the angular prediction modes IntraPredAngle and the number of the prediction mode are determined. The values of the variables are determined by the following expressions:

.

Если dir=1, то IntraPredAngle1=-α_i, IntraPredAngle2=-α_i+1,If dir = 1, then IntraPredAngle1 = -α _i , IntraPredAngle2 = -α _{i + 1} ,

где α∈{-32, -26, -21, -17, -13, -9, -5, -2, 0, 2, 5, 9, 13, 17, 21, 26, 32} и α_i≤tgphi<α_i+1, а номера соответствующих режимов предсказания лежат в диапазоне 2-17.where α∈ {-32, -26, -21, -17, -13, -9, -5, -2, 0, 2, 5, 9, 13, 17, 21, 26, 32} and α _i ≤ tgphi <α _{i + 1} , and the numbers of the corresponding prediction modes are in the range 2-17.

где α∈{-32, -26, -21, -17, -13_,-9, -5, -2, 0_,2, 5, 9, 13, 17, 21, 26, 32} и α_i≤tgphi<α_i+1, номера соответствующих режимов предсказания лежат в диапазоне 18-34.where α∈ {-32, -26, -21, -17, -13 _, -9, -5, -2, 0 _, 2, 5, 9, 13, 17, 21, 26, 32} and α _i ≤ tgphi <α _{i + 1} , the numbers of the corresponding prediction modes lie in the range 18-34.

Четвертый этап аналогичен третьему, но режимы-кандидаты выбираются на основе анализа значений массива горизонтальных подробностей H(x,y). Для этого определяется позиция y_max, x_max максимального элемента массива Н(x,y) Текущая позиция y_cr устанавливается равной y_max, x_cr - равной x_max. Итеративно для каждой следующей строки с номером y_cr=y_cr+1, новое значение x_cr устанавливается равным номеру максимального элемента в данной строке из диапазона [x_cr-2, x_cr+2]. Обновление позиции продолжается, пока y_cr≤nT и 0<x_cr<nT-1. Точки x_cr, y_cr в процессе итераций проходят по линии минимального градиента значений H(x,y) ниже положения максимального элемента этого массива. Аналогичный итеративный процесс позволяет проследить линию наименьшего градиента значений H(x,y) над позицей x_max, y_max. Начинается этот процесс с установки текущей позиции y_cl равной y_max, x_cl - равной x_max. Итеративно для каждой следующей строки с номером y_cl=y_cl-1, новое значение х_cl устанавливается равным номеру максимального элемента в данной строке из диапазона [x_cl-2, x_cl+2]. Обновление позиции х_сl, y_cl продолжаетсяе пока y_cl≥0 и 0<x_cl<nT-1.The fourth stage is similar to the third, but the candidate modes are selected based on the analysis of the values of the array of horizontal details H (x, y). To do this, determine the position y _max , x _{max of the} maximum element of the array H (x, y). The current position y _{cr is} set to y _max , x _cr equal to x _max . Iteratively for each next line with the number y _cr = y _cr +1, the new value x _{cr is} set equal to the number of the maximum element in this line from the range [x _cr -2, x _cr +2]. The position update continues until y _cr ≤nT and 0 <x _cr <nT-1. The points x _cr , y _cr during iterations pass along the line of the minimum gradient of the values of H (x, y) below the position of the maximum element of this array. A similar iterative process allows us to trace the line of the smallest gradient of the values of H (x, y) over the position x _max , y _max . This process begins by setting the current position y _cl equal to y _max , x _cl equal to x _max . Iteratively for each next line with the number y _cl = y _cl -1, the new value of x _{cl is} set to the number of the maximum element in this line from the range [x _cl -2, x _cl +2]. The update of the position x _cl , y _cl continues until y _cl ≥0 and 0 <x _cl <nT-1.

Если обе позиции х_cr и х_сl попали на одну и ту же границу блока, результаты такого поиска некорректны. В таком случае поиск выполняется влево и вправо в соответствии с алгоритмом третьего этапа, применяемого к массиву значений горизонтальных подробностей H(x,y).If both positions x _cr and x _cl fell on the same block boundary, the results of such a search are incorrect. In this case, the search is performed left and right in accordance with the algorithm of the third stage, applied to the array of values of horizontal details H (x, y).

Взаимное расположение найденных двух точек x_cl, y_cl и x_cr, y_cr задают направление, по которому определяется параметр режимов углового предсказания IntraPredAngle и номер режима предсказания. Определим значения переменных следующими выражениями:The relative position of the two points x _cl , y _cl and x _cr , y _{cr found} determines the direction in which the parameter of the angular prediction modes IntraPredAngle and the number of the prediction mode are determined. We define the values of the variables by the following expressions:

.

Если dir=0, то IntraPredAngle1=α_i, IntraPredAngle2=α_i+1,If dir = 0, then IntraPredAngle1 = α _i , IntraPredAngle2 = α _{i + 1} ,

где α∈{-32, -26, -21, -17, -13, -9, -5, -2, 0, 2, 5, 9, 13, 17, 21, 26, 32} и α_i≤tgphi<α_i+1, а номера соответствующих режимов предсказания лежат в диапазоне 18-34.where α∈ {-32, -26, -21, -17, -13, -9, -5, -2, 0, 2, 5, 9, 13, 17, 21, 26, 32} and α _i ≤ tgphi <α _{i + 1} , and the numbers of the corresponding prediction modes lie in the range 18-34.

Таким образом, предложенный способ позволяет формировать список режимов-кандидатов, качество предсказаний которых оценивается на заключительном этапе по стандартной процедуре RDO. Длина списка ограничена сверху шестью позициями. Формирование списка не требует проведения самой процедуры предсказания, что приводит к существенному сокращению объема вычислений при выполнении пространственного предсказания значений пикселей кодируемого блока.Thus, the proposed method allows you to create a list of candidate modes, the quality of the predictions of which are evaluated at the final stage according to the standard RDO procedure. The list length is limited to six at the top. The formation of the list does not require the prediction procedure itself, which leads to a significant reduction in the amount of computation when performing spatial prediction of the pixel values of the encoded block.

Claims

Способ быстрого выбора режима пространственного предсказания в системе кодирования HEVC, заключающийся в построении укороченного списка режимов-кандидатов пространственного предсказания длиной не более шести позиций, основанный на анализе высокочастотных горизонтальной и вертикальной составляющих стационарного вейвлет преобразования Хаара кодируемой области изображения, при этом массив значений высокочастотной горизонтальной составляющей преобразования Хаара получается путем вычитания из значения каждого пикселя кодируемой области значения соседнего слева пикселя, при этом массив значений высокочастотной вертикальной составляющей преобразования Хаара получается путем вычитания из значения каждого пикселя кодируемой области значения соседнего сверху пикселя, при этом режим Planar включают в список режимов-кандидатов, если все абсолютные значения горизонтальных и вертикальных высокочастотных составляющих меньше шага квантования при кодировании, при этом режим DC включают в список режимов-кандидатов, если среднеквадратическое отклонение значений горизонтальных и вертикальных высокочастотных составляющих меньше шага квантования, при этом два угловых режима предсказания, соответствующие взаимному смещению точек пересечения линии минимальных значений модуля градиента, проходящей через точку максимума массива вертикальных высокочастотных составляющих, с вертикальными границами массива высокочастотных составляющих включают в список режимов-кандидатов, при этом два угловых режима предсказания, соответствующие взаимному смещению точек пересечения линии минимальных значений модуля градиента, проходящей через точку максимума массива горизонтальных высокочастотных составляющих, с горизонтальными границами массива высокочастотных составляющих включают в список режимов-кандидатов, при этом окончательный выбор режима предсказания для кодируемой области изображения из построенного списка режимов-кандидатов осуществляют на основе стандартной RDO (rate-distortion optimization) оценки. A method for quickly selecting the spatial prediction mode in the HEVC coding system, which consists in constructing a shortened list of spatial prediction candidate modes with a length of not more than six positions, based on the analysis of the high-frequency horizontal and vertical components of the stationary Haar wavelet transform of the encoded image region, while the array of values of the high-frequency horizontal component Haar transform is obtained by subtracting from the value of each pixel encoded about the values of the pixel adjacent to the left, while the array of values of the high-frequency vertical component of the Haar transform is obtained by subtracting from the value of each pixel of the encoded region the values of the neighboring neighboring pixels, while the Planar mode is included in the list of candidate modes if all the absolute values of the horizontal and vertical high-frequency components are less quantization steps during encoding, while the DC mode is included in the list of candidate modes if the standard deviation of the horizontal values of vertical and high-frequency components is less than the quantization step, and two angular prediction modes corresponding to the mutual displacement of the points of intersection of the line of minimum values of the gradient modulus passing through the maximum point of the array of vertical high-frequency components with vertical boundaries of the array of high-frequency components are included in the candidate modes for two angular prediction modes corresponding to the mutual displacement of the points of intersection of the line of minimum values of the grad module An element passing through a maximum point of an array of horizontal high-frequency components with horizontal boundaries of an array of high-frequency components is included in the list of candidate modes, and the final choice of the prediction mode for the encoded image area from the constructed list of candidate modes is based on standard RDO (rate-distortion optimization ) estimates.