WO2024145933A1 - Point cloud coding method and apparatus, point cloud decoding method and apparatus, and devices and storage medium - Google Patents

Point cloud coding method and apparatus, point cloud decoding method and apparatus, and devices and storage medium Download PDF

Info

Publication number
WO2024145933A1
WO2024145933A1 PCT/CN2023/071069 CN2023071069W WO2024145933A1 WO 2024145933 A1 WO2024145933 A1 WO 2024145933A1 CN 2023071069 W CN2023071069 W CN 2023071069W WO 2024145933 A1 WO2024145933 A1 WO 2024145933A1
Authority
WO
WIPO (PCT)
Prior art keywords
node
prediction
information
value
nodes
Prior art date
Application number
PCT/CN2023/071069
Other languages
French (fr)
Chinese (zh)
Inventor
孙泽星
Original Assignee
Oppo广东移动通信有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Oppo广东移动通信有限公司 filed Critical Oppo广东移动通信有限公司
Priority to PCT/CN2023/071069 priority Critical patent/WO2024145933A1/en
Publication of WO2024145933A1 publication Critical patent/WO2024145933A1/en

Links

Images

Definitions

  • the surface of the object is collected by the acquisition device to form point cloud data, which includes hundreds of thousands or even more points.
  • the point cloud data is transmitted between the point cloud encoding device and the point cloud decoding device in the form of point cloud media files.
  • the point cloud encoding device needs to compress the point cloud data before transmission.
  • the embodiments of the present application provide a point cloud encoding and decoding method, apparatus, device and storage medium, which increase the selectable context conditions when encoding the number of points of the node, thereby improving the encoding and decoding performance of the point cloud.
  • a point cloud decoder comprising a processor and a memory.
  • the memory is used to store a computer program, and the processor is used to call and run the computer program stored in the memory to execute the method in the first aspect or its implementation manners.
  • a point cloud encoder comprising a processor and a memory.
  • the memory is used to store a computer program, and the processor is used to call and run the computer program stored in the memory to execute the method in the second aspect or its respective implementations.
  • a chip for implementing the method in any one of the first to second aspects or their respective implementations.
  • the chip includes: a processor for calling and running a computer program from a memory, so that a device equipped with the chip executes the method in any one of the first to second aspects or their respective implementations.
  • a computer program product comprising computer program instructions, which enable a computer to execute the method in any one of the first to second aspects or their respective implementations.
  • a code stream is provided, which is generated based on the method of the second aspect.
  • the code stream includes at least one of the first parameter and the second parameter.
  • the inter-frame decoding information corresponding to the current node is determined, and the inter-frame decoding information corresponding to the current node is used as context information to predict the number of points of the current node, thereby increasing the selectable context conditions when encoding and decoding the number of points of the node, reducing the determined conditional entropy, thereby improving the decoding efficiency and performance of the point cloud.
  • Figure 1B is a partial enlarged view of the point cloud
  • FIG2 is a schematic diagram of six viewing angles of a point cloud image
  • FIG3 is a schematic block diagram of a point cloud encoding and decoding system according to an embodiment of the present application.
  • FIG4A is a schematic block diagram of a point cloud encoder provided in an embodiment of the present application.
  • FIG4B is a schematic block diagram of a point cloud decoder provided in an embodiment of the present application.
  • FIG5A is a schematic plan view
  • FIG5B is a schematic diagram of node coding sequence
  • FIG5C is a schematic diagram of a plane mark
  • FIG5D is a schematic diagram of a sibling node
  • FIG5E is a schematic diagram of the intersection of a laser radar and a node
  • FIG5F is a schematic diagram of neighborhood nodes at the same division depth and the same coordinates
  • FIG5G is a schematic diagram of a neighboring node when the node is located at a lower plane position of the parent node;
  • FIG6A is a schematic diagram of IDCM encoding
  • FIG6E is a schematic diagram of predictive coding of the X or Y axis
  • FIG9 is a schematic diagram of an octree partition
  • FIG12 is a schematic diagram of corresponding nodes of a domain node
  • FIG13B is a schematic diagram of the prediction nodes of the current node in two prediction reference frames
  • FIG14B is a schematic diagram of IDCM decoding
  • FIG17 is a schematic block diagram of a point cloud encoding device provided in an embodiment of the present application.
  • FIG18 is a schematic block diagram of an electronic device provided in an embodiment of the present application.
  • the second type of dynamic point cloud the object is moving, but the device that obtains the point cloud is stationary;
  • Category 1 Machine perception point cloud, which can be used in autonomous navigation systems, real-time inspection systems, geographic information systems, visual sorting robots, disaster relief robots, etc.
  • Category 2 Point cloud perceived by the human eye, which can be used in point cloud application scenarios such as digital cultural heritage, free viewpoint broadcasting, 3D immersive communication, and 3D immersive interaction.
  • the above point cloud acquisition technology reduces the cost and time of point cloud data acquisition and improves the accuracy of data.
  • the change in the point cloud data acquisition method makes it possible to acquire a large amount of point cloud data.
  • the processing of massive 3D point cloud data encounters bottlenecks of storage space and transmission bandwidth.
  • a point cloud video with a frame rate of 30fps frames per second
  • the number of points in each point cloud frame is 700,000
  • each point has coordinate information xyz (float) and color information RGB (uchar).
  • the data volume of a 10s point cloud video is approximately 0.7 million X (4ByteX3+1ByteX3)
  • X 30fpsX10s 3.15GB
  • the YUV sampling format is 4:2:0
  • the frame rate is 24fps.
  • the decoding device 120 may further include a display device 123 in addition to the input interface 121 and the point cloud decoder 122 .
  • the voxel unit 202 is also called a quantize and remove points unit, which can reduce the number of coordinates by quantization; after quantization, originally different points may be assigned the same coordinates, based on which, duplicate points can be deleted by deduplication operation; for example, multiple clouds with the same quantized position and different attribute information can be merged into one cloud by attribute conversion.
  • the voxel unit 202 is an optional unit module.
  • the process of generating LOD by the LOD generating unit includes: obtaining the Euclidean distance between points according to the position information of the points in the point cloud; and dividing the points into different detail expression layers according to the Euclidean distance.
  • the Euclidean distances can be sorted and the Euclidean distances in different ranges can be divided into different detail expression layers. For example, a point can be randomly selected as the first detail expression layer. Then the Euclidean distances between the remaining points and the point are calculated, and the points whose Euclidean distances meet the first threshold requirement are classified as the second detail expression layer.
  • PlanePosition i 0 represents that the current node is a plane in the i-axis direction, and the plane position is a low plane, and 1 represents that the current node is a high plane in the i-axis direction.
  • the threshold is adaptively changed. For example, when Prob(0)>Prob(1)>Prob(2), the threshold value is as shown in formula (2):
  • Prob(i) is updated by the following formula (3):
  • Prob(i) new (L x Prob(i) + ⁇ (coded node)) / L + 1 (3)
  • local_node_density is updated by the following formula (4):
  • local_node_density new local_node_density + 4*numSiblings (4)
  • the density of the points in the current layer is used to determine whether to perform plane coding on the nodes in the current layer. Assuming that the number of points in the current point cloud to be coded is pointCount, the number of points reconstructed after IDCM coding is numPointCountRecon, and because the octree is coded in the order of breadth-first traversal, the number of nodes to be coded in the current layer can be obtained as nodeCount. It is assumed that planarEligibleKOctreeDepth is used to indicate whether the current layer starts plane coding. The judgment process of planarEligibleKOctreeDepth is shown in formula (5):
  • planarEligibleKOctreeDepth (pointCount-numPointCountRecon) ⁇ nodeCount*1.3 (5)
  • the plane position information of the current node is predicted to be three elements: predicted as a low plane, predicted as a high plane, and unpredictable;
  • the black node is the current node. If the current node is located at the lower plane of the parent node, the plane position of the current node is determined in the following manner:
  • the black node is the current node. If the node is at a high plane position of the parent node, the plane position of the current node is determined in the following manner:
  • Figure 5I is the predictive coding of the plane position information of the laser radar point cloud.
  • the plane position of the current node is predicted by using the laser radar acquisition parameters, and the position is quantized into four intervals by using the position where the current node intersects with the laser ray, and finally used as the context of the plane position of the current node.
  • the specific calculation process is as follows: Assuming that the coordinates of the laser radar are (x Lidar , y Lidar , z Lidar ), and the geometric coordinates of the current point are (x, y, z), first calculate the vertical tangent value tan ⁇ of the current point relative to the laser radar. The calculation process is shown in formula (6):
  • the corrected tangent value of the current node is used to predict the plane position of the current node. Specifically, assuming that the tangent value of the lower boundary of the current node is tan( ⁇ bottom ), and the tangent value of the upper boundary is tan( ⁇ top ), the plane position is quantized into 4 quantization intervals according to tan ⁇ corr,L , which is the context of the plane position.
  • the octree-based geometric information coding mode only has an efficient compression rate for points with correlation in space.
  • the use of the direct coding model (DCM) can greatly reduce the complexity.
  • the use of DCM is not represented by flag information, but is inferred from the parent node and neighbor information of the current node. There are three ways to determine whether the current node is eligible for DCM coding, as shown in Figure 6A:
  • the current node has no sibling child nodes, that is, the parent node of the current node has only one child node, and the parent node of the parent node of the current node has only two occupied child nodes, that is, the current node has at most one neighbor node.
  • the current node does not have the DCM coding qualification, it will be divided into octrees. If it has the DCM coding qualification, the number of points contained in the node will be further determined. When the number of points is less than the threshold 2, the node will be DCM-encoded, otherwise the octree division will continue.
  • the DCM coding mode it is first necessary to encode whether the current node is a true isolated point, that is, IDCM_flag. When IDCM_flag is true, the current node is encoded using DCM, otherwise it is still encoded using octrees. When the current node meets the DCM coding requirements, it is necessary to encode the DCM coding mode of the current node.
  • the number of points numPoints of the current node is first encoded, and the number of points of the current node is encoded according to different DirectModes, including the following methods:
  • the coordinate information of the points contained in the current node is encoded.
  • the following will introduce the lidar point cloud and the human eye point cloud separately.
  • the priority coded coordinate axis dirextAxis will be obtained by using the geometric coordinates of the points. It should be noted that the currently compared coordinate axes only include the x and y axes, not the z axis. Assuming that the geometric coordinates of the current node are nodePos, the priority coded coordinate axis is determined by the method shown in formula (8):
  • the axis with the smaller node coordinate geometric position is used as the coordinate axis dirextAxis for priority encoding.
  • the geometry information of the dirextAxis coordinate axis of the priority coding is first encoded as follows, assuming that the bit depth of the geometry to be encoded corresponding to the priority coding axis is nodeSizeLog2, and assuming that the coordinates of the two points are pointPos[0] and pointPos[1] respectively:
  • the priority coded coordinate axis dirextAxis will be obtained by using the geometric coordinates of the points. Assuming that the geometric coordinates of the current node are nodePos, the priority coded coordinate axis is determined by the method shown in formula (9):
  • the axis with the smaller node coordinate geometric position is used as the coordinate axis dirextAxis for priority encoding. It should be noted here that the currently compared coordinate axes only include the x and y axes, but not the z axis.
  • the geometry information of the dirextAxis coordinate axis of the priority coding is first encoded as follows, assuming that the bit depth of the geometry to be encoded corresponding to the priority coding axis is nodeSizeLog2, and assuming that the coordinates of the two points are pointPos[0] and pointPos[1] respectively:
  • the geometric coordinate information of the current node can be predicted, so as to further improve the efficiency of the geometric information encoding of the point cloud.
  • the geometric information nodePos of the current node is first used to obtain a directly encoded main axis direction, and then the geometric information of the encoded direction is used to predict the geometric information of another dimension.
  • the axis direction of the direct encoding is directAxis
  • the bit depth to be encoded in the direct encoding is nodeSizeLog2
  • the LaserIdx corresponding to the current point i.e. pointLaserIdx in Figure 6B
  • the LaserIdx of the current node i.e. nodeLaserIdx
  • the LaserIdx of the node i.e. nodeLaserIdx
  • the LaserIdx of the node or point is as follows:
  • the LaserIdx of the current node is first used to predict the pointLaserIdx of the point. After encoding the LaserIdx of the current point, the three-dimensional geometric information of the current point is predictively encoded using the acquisition parameters of the laser radar.
  • the LaserIdx corresponding to the current point is used to obtain the corresponding predicted value of the horizontal azimuth angle, that is, Secondly, the node geometry information corresponding to the current point is used to obtain the horizontal azimuth angle corresponding to the node Among them, the horizontal azimuth
  • the calculation method between the node geometry information is shown in formula (10), assuming that the geometry coordinates of the node are nodePos:
  • the number of rotation points numPoints of each Laser can be obtained, which represents the number of points obtained when each laser ray rotates one circle.
  • the rotation angular velocity deltaPhi of each Laser can then be calculated using the number of rotation points of each Laser, as shown in formula (11):
  • the LaserIdx corresponding to the current point will be used to predict the Z-axis direction of the current point. That is, the depth information radius of the cylindrical coordinate system is calculated by using the x and y information of the current point. Then, the tangent value of the current point and the vertical value are obtained by using the laser LaserIdx of the current point. Then, the predicted value of the Z-axis direction of the current point, namely Z_pred, can be obtained:
  • Z_pred is used to predict the geometric information of the current point in the Z-axis direction to obtain the prediction residual Z_res, and finally Z_res is encoded.
  • G-PCC currently introduces a plane coding mode. In the process of geometric division, it will determine whether the child nodes of the current node are in the same plane. If the child nodes of the current node meet the conditions of the same plane, the child nodes of the current node will be represented by the plane.
  • the decoding end follows the order of breadth-first traversal. Before decoding the placeholder information of each node, it will first use the reconstructed geometric information to determine whether the current node is to be plane decoded or IDCM decoded. If the current node meets the conditions for plane decoding, the plane identification and plane position information of the current node will be decoded first, and then the placeholder information of the current node will be decoded based on the plane information; if the current node meets the conditions for IDCM decoding, it will first decode whether the current node is a real IDCM node.
  • the placeholder information of the current node will be decoded.
  • the current node has no sibling child nodes, that is, the parent node of the current node has only one child node, and the parent node of the parent node of the current node has only two occupied child nodes, that is, the current node has at most one neighbor node.
  • the current node does not meet the requirements of the DCM node, that is, the number of points is greater than 2 points and it is not a duplicate point, exit directly.
  • the priority decoding coordinate axis dirextAxis will be obtained by using the geometric coordinates of the points. It should be noted that the coordinate axes currently compared only include the x and y axes, not the z axis. Assuming that the geometric coordinates of the current node are nodePos, the method shown in formula (13) is used to determine the priority encoding coordinate axis:
  • the axis with the smaller node coordinate geometric position is used as the coordinate axis dirextAxis for priority decoding.
  • the geometry information of the dirextAxis coordinate axis to be decoded is first decoded as follows, assuming that the bit depth of the geometry to be decoded corresponding to the axis to be decoded is nodeSizeLog2, and assuming that the coordinates of the two points are pointPos[0] and pointPos[1] respectively:
  • the priority decoding coordinate axis dirextAxis will be obtained by using the geometric coordinates of the points. Assuming that the geometric coordinates of the current node are nodePos, the priority encoding coordinate axis is determined by the method shown in formula (14):
  • the decoding end Based on the geometric decoding of the prediction tree, the decoding end reconstructs the prediction tree structure by continuously parsing the bit stream, and then obtains the geometric position prediction residual information and quantization parameters of each prediction node through parsing, and dequantizes the prediction residual to recover the reconstructed geometric position information of each node, and finally completes the geometric reconstruction of the decoding end.
  • the embodiment of the present application refers to the inter-frame information when encoding and decoding the points of the node. For example, based on the inter-frame encoding and decoding information corresponding to the current node, the points of the current node are encoded and decoded, thereby increasing the selectable context conditions when encoding and decoding the points, so that the determined conditional entropy is reduced, thereby improving the encoding and decoding efficiency and performance of the point cloud.
  • the encoding end first determines whether the node is qualified for direct encoding. If the node is qualified for direct encoding, it determines whether the number of points of the node is less than or equal to the preset threshold. If the number of points of the node is less than or equal to the preset threshold, it is determined that the node can be decoded by direct encoding. Then, the number of points included in the node and the geometric information of each point are encoded into the bitstream. Correspondingly, after determining that the node is qualified for direct decoding, the decoding end decodes the bitstream, obtains the number of points of the node and the geometric information of each point, and implements geometric decoding of the node.
  • the decoding end first determines the inter-frame decoding information corresponding to the current node.
  • the embodiment of the present application does not limit the number of prediction reference frames of the current frame to be decoded.
  • the current frame to be decoded has one prediction reference frame, or the current frame to be decoded has multiple prediction reference frames.
  • the embodiment of the present application does not limit the number N of prediction nodes of the current node, which is determined according to actual needs.
  • one or several decoded frames before the current frame to be decoded are determined as prediction reference frames of the current frame to be decoded.
  • one or several decoded frames following the current frame to be decoded are determined as prediction reference frames of the current frame to be decoded.
  • one or several decoded frames before the current frame to be decoded, and one or several decoded frames after the current frame to be decoded are determined as prediction reference frames of the current frame to be decoded.
  • the previous frame and the next frame of the current frame to be decoded may be determined as prediction reference frames of the current frame to be decoded.
  • the current frame to be decoded has two prediction reference frames.
  • S101-A2 Determine N prediction nodes of the current node based on at least one prediction node of the current node in K prediction reference frames.
  • the process of the decoding end determining at least one prediction point of the current node in each of the K prediction reference frames is the same.
  • the kth prediction reference frame among the K prediction reference frames is taken as an example for explanation.
  • the embodiment of the present application does not limit the specific manner in which the decoding end determines at least one prediction node of the current node in the kth prediction reference frame.
  • Method 1 In the kth prediction reference frame, a prediction node of the current node is determined. For example, a node in the kth prediction reference frame having the same division depth as the current node is determined as the prediction node of the current node.
  • the nodes located at the third layer of the octree in the kth prediction reference frame can be obtained, and then the prediction node of the current node can be determined from these nodes.
  • node 1 is determined as a prediction node of the current node in the kth prediction reference frame.
  • the node 1 determined above and at least one domain node of node 1 in the kth prediction reference frame are determined as the prediction nodes of the current node in the kth prediction reference frame.
  • determining at least one prediction node of the current node in the kth prediction reference frame includes the following steps S101-A11 to S101-A13:
  • S101-A12 for the i-th domain node among the M domain nodes, determine the corresponding node of the i-th domain node in the k-th prediction reference frame, where i is a positive integer less than or equal to M;
  • the decoding end before determining at least one prediction node of the current node in the kth prediction reference frame, the decoding end first determines M domain nodes of the current node in the current frame to be decoded, and the M domain nodes include the current node itself.
  • the M domain nodes of the current node include at least one domain node among the domain nodes that are coplanar, colinear, and co-pointed with the current node in the current frame to be decoded. As shown in FIG11 , the current node includes 6 coplanar nodes, 12 colinear nodes, and 8 co-pointed nodes.
  • the M domain nodes of the current node may include other nodes within the reference neighborhood in addition to at least one domain node in the current frame to be decoded that is coplanar, colinear, and co-point with the current node. This embodiment of the present application does not impose any restrictions on this.
  • the decoding end determines the M domain nodes of the current node in the current frame to be decoded, and then determines the corresponding node of each of the M domain nodes in the kth prediction reference frame, and then determines at least one prediction node of the current node in the kth prediction reference frame based on the corresponding nodes of the M domain nodes in the kth prediction reference frame.
  • the embodiment of the present application does not limit the specific implementation method of S101-A13.
  • These three domain nodes correspond to a corresponding node in the forward frame, respectively recorded as node 21, node 22 and node 23, and then node 21, node 22 and node 23 are determined as the three predicted nodes of the current node in the forward frame, or 1 or 2 nodes are selected from node 21, node 22 and node 23 to be determined as 1 or 2 predicted nodes of the current node in the forward frame.
  • the K prediction reference frames include the forward frame and the backward frame of the current frame to be decoded.
  • the decoding end can determine at least one prediction node of the current node in the forward frame, and at least one prediction node of the current node in the backward frame.
  • the current node includes three domain nodes, which are respectively recorded as nodes 11, 12, and 13. These three domain nodes correspond to a corresponding node in the forward frame, which are recorded as nodes 21, 22, and 23, respectively. These three domain nodes correspond to a corresponding node in the backward frame, which are recorded as nodes 41, 42, and 43.
  • the decoding end can determine nodes 21, 22, and 23 as the three prediction nodes of the current node in the forward frame, or select 1 or 2 nodes from nodes 21, 22, and 23 to determine as 1 or 2 prediction nodes of the current node in the forward frame.
  • the decoding end can determine node 41, node 42 and node 43 as three prediction nodes of the current node in the backward frame, or select one or two nodes from node 41, node 42 and node 43 as one or two prediction nodes of the current node in the backward frame.
  • the K prediction reference frames include the first prediction reference frame and the second prediction reference frame.
  • the current node has 2 prediction nodes in the first prediction reference frame, and the current node has 3 prediction nodes in the second prediction reference frame. From these 5 prediction nodes, select 3 prediction nodes as the final prediction nodes of the current node. For example, from these 5 prediction nodes, select the 3 prediction nodes whose placeholder information has the smallest difference with the placeholder information of the current node, and determine them as the final prediction nodes of the current node.
  • the decoding end determines the M domain nodes of the current node in the current frame to be decoded, it determines the corresponding node of each of the M domain nodes in the kth prediction reference frame, and then determines at least one prediction point of the current node in the kth prediction reference frame based on the corresponding node of each of the M domain nodes.
  • the process of determining the corresponding node of the i-th domain node in the k-th prediction reference frame in S101-A12 of the second method is basically the same as the process of determining the corresponding node of the current node in the k-th prediction reference frame in S101-B11 of the above-mentioned third method.
  • the above-mentioned i-th domain node and the current node are recorded as the i-th node, and the specific process of determining the corresponding node of the i-th node in the k-th prediction reference frame is introduced below.
  • Method 1 Determine a node in the k-th prediction reference frame that has the same division depth as the i-th node as the corresponding node of the i-th node.
  • the nodes located at the third layer of the octree in the k-th prediction reference frame can be obtained, and then the corresponding node of the i-th node can be determined from these nodes. For example, among the points at which the k-th prediction reference frame and the i-th node are at the same division depth, a node whose placeholder information has the smallest difference with the placeholder information of the i-th node is selected and determined as the corresponding node of the i-th node in the k-th prediction reference frame.
  • S101-A123 determine one of the child nodes of the i matching nodes as the corresponding node of the i-th node in the k-th prediction reference frame.
  • the decoding end determines the parent node of the i-th node in the current frame to be decoded, and then determines the matching node of the parent node of the i-th prediction domain node in the k-th prediction reference frame.
  • the parent node of the i-th node is recorded as the i-th parent node
  • the matching node of the parent node of the i-th node in the k-th prediction reference frame is determined as the i-th matching node.
  • the decoding end performs an XOR operation on the placeholder information of the i-th parent node and the placeholder information of each node in the k-th prediction reference frame, and determines the node with the smallest XOR operation result in the k-th prediction reference frame as the matching node of the i-th parent node in the k-th prediction reference frame.
  • the first information may be sequence-level information, that is, the first information is used to indicate whether the current point cloud sequence allows inter-frame decoding to be started.
  • the current point cloud sequence includes at least one point cloud frame, and the at least one point cloud frame includes the current frame to be decoded. In this way, the decoding end can determine whether the current node starts inter-frame decoding based on the first information at the sequence level.
  • the encoding end may encode the first information corresponding to the current node into a bitstream, so that the decoding end may obtain the first information corresponding to the current node by decoding the bitstream.
  • Method 1 the above S102-11-A1 includes the following steps S102-11-A1-a1 and S102-11-A1-a2:
  • the first numerical value corresponding to the prediction node is determined based on the first direct decoding information of the prediction node, and finally the index of the first context model is determined based on the first numerical values corresponding to the N prediction nodes.
  • the first direct decoding information of the prediction node includes at least one of the second information and the number of points included in the prediction node, wherein the second information is used to indicate whether the prediction node meets the conditions for decoding in the direct decoding manner.
  • the second information includes direct decoding mode information of the prediction node. For example, when the direct decoding mode of the prediction node is direct decoding mode 1 or direct decoding mode 2, it is determined that the prediction node meets the conditions for decoding by direct decoding. If the direct decoding mode of the prediction node is direct decoding mode 0, it is determined that the prediction node does not meet the conditions for decoding by direct decoding.
  • the decoding end may determine the first value corresponding to the prediction node based on the second information of the prediction node and/or the number of points included in the prediction node.
  • the decoding end determines the first value corresponding to the prediction node through the following steps:
  • Step 11 Determine a first sub-value based on the second information
  • Step 12 Determine a second sub-value based on the number of points included in the prediction node
  • Step 13 Determine a first numerical value corresponding to the prediction node based on the first sub-value and/or the second sub-value.
  • the embodiment of the present application does not limit the specific manner in which the decoding end determines the first sub-value based on the second information of the prediction node in step 11.
  • the second information is used to indicate whether the prediction node meets the conditions for decoding by direct decoding, and then the first sub-value can be determined to be a binary value. For example, if the second information indicates that the prediction node meets the conditions for decoding by direct decoding, the value of the first sub-value is determined to be the first value; if the second information indicates that the prediction node does not meet the conditions for decoding by direct decoding, the value of the first sub-value is determined to be the second value.
  • the embodiment of the present application does not limit the specific values of the third value and the fourth value.
  • the first value is 1.
  • the second value is 0.
  • the embodiment of the present application does not limit the manner in which the decoding end determines the second sub-value based on the number of points included in the prediction node in step 12.
  • the second sub-value is not a binary value, so that the decoding end can determine the second sub-value according to the specific number of points included in the prediction node. For example, if the prediction node includes 1 point, the second sub-value is determined to be 1; if the prediction node includes 2 points, the second sub-value is determined to be 2; if the prediction node includes 3 or more points, the second sub-value is determined to be 3, and so on.
  • the above-mentioned second sub-value is a binary value.
  • the number of points included in the prediction node at the decoding end can be determined in a manner as follows: if the number of points included in the prediction node is greater than the first preset value, the value of the second sub-value is determined to be the first value; if the number of points included in the prediction node is less than or equal to the first preset value, the value of the second sub-value is determined to be the second value.
  • the embodiment of the present application does not limit the specific values of the first preset value, the first value and the second value.
  • the first preset value is 2,
  • the first value is 0.
  • the second value is 1.
  • the second sub-value is determined to be 1, and if the number of points included in the prediction node is greater than 2, the second sub-value is determined to be 0.
  • the decoding end determines the first numerical value corresponding to the prediction node based on the first sub-value and/or the second sub-value determined above.
  • determining the first value corresponding to the prediction node specifically includes the following situations:
  • Case 1 If the first direct decoding information of the prediction node includes the second information, but does not include the number of points included in the prediction node, the decoding end determines the first sub-value based on the second information of the prediction node. For example, if the second information indicates that the prediction node meets the conditions for decoding by direct decoding, the value of the first sub-value is determined to be a third value (e.g., 1); if the second information indicates that the prediction node does not meet the conditions for decoding by direct decoding, the value of the first sub-value is determined to be a fourth value (e.g., 0). Then, the first sub-value is determined to be the first numerical value corresponding to the prediction node.
  • the second information indicates that the prediction node meets the conditions for decoding by direct decoding
  • the value of the first sub-value is determined to be a fourth value (e.g., 0).
  • the decoding end determines the second sub-value based on the number of points included in the prediction node. For example, if the number of points included in the prediction node is greater than the first preset value (e.g., 2), the value of the second sub-value is determined to be the first value (e.g., 0); if the number of points included in the prediction node is less than or equal to the first preset value (e.g., 2), the value of the second sub-value is determined to be the second value (e.g., 1). Then, the second sub-value is determined to be the first value corresponding to the prediction node.
  • the first preset value e.g. 2
  • the decoding end determines the first sub-value based on the second information of the prediction node. For example, if the second information indicates that the prediction node meets the conditions for decoding by direct decoding, the value of the first sub-value is determined to be the first value (for example, 1); if the second information indicates that the prediction node does not meet the conditions for decoding by direct decoding, the value of the first sub-value is determined to be the second value (for example, 0). Next, the decoding end determines the second sub-value based on the number of points included in the prediction node.
  • the value of the second sub-value is determined to be the first value (for example, 0); if the number of points included in the prediction node is less than or equal to the first preset value (for example, 2), the value of the second sub-value is determined to be the second value (for example, 1).
  • the first numerical value corresponding to the prediction node is determined, for example, the sum of the first sub-value and the second sub-value is determined as the first numerical value corresponding to the prediction node.
  • the first value corresponding to each prediction node in the N prediction nodes of the current node may be determined.
  • S102-11-A1-a2. Determine the index of the first context model based on the first numerical values corresponding to the N prediction nodes.
  • the decoding end After the decoding end determines the first values corresponding to the N prediction nodes based on the above steps, it determines the index of the first context model based on the first values corresponding to the N prediction nodes.
  • weighted processing is performed on the first values corresponding to the N prediction nodes to obtain a first weighted prediction value
  • a weight i.e., the first weight
  • the first numerical values corresponding to each prediction node can be weighted based on the first weight of each prediction node, and then the index of the first context model can be determined based on the final weighted result, thereby improving the accuracy of determining the index of the first context model based on the geometric decoding information of the N prediction nodes.
  • the first weight of prediction node 1 can be determined based on the distance between domain node 1 and the current node. For example, the inverse of the distance between domain node 1 and the current node is determined as the first weight of prediction node 1.
  • the first weight of the predicted node 1 is 1; if domain node 1 is a colinear node of the current node, the first weight of the predicted node 1 is a preset weight. If domain node 1 is a common node of the current node, the first weight of predicted node 1 is the preset weight
  • a weighted sum is performed on the first numerical values corresponding to the N prediction nodes to obtain a first weighted prediction value.
  • the current node includes at least one prediction node in the jth prediction reference frame, so that the first value of the at least one prediction node is determined based on the first direct decoding information of the at least one prediction node in the jth prediction reference frame.
  • the present application does not limit the specific method of determining the index of the first context model based on the second weighted prediction value corresponding to K prediction reference frames.
  • the second weighted prediction values corresponding to the K prediction reference frames are weighted based on the second weight to obtain the index of the first context model.
  • the second direct decoding information of the N predicted nodes is used as context information to perform predictive decoding on the second flag corresponding to the current node, thereby achieving efficient decoding of the second flag.
  • the above step A includes the following steps:
  • Step A1 determining an index of a second context model based on second direct decoding information of N prediction nodes
  • Step A2 based on the index of the second context model, select the second context model from P context models corresponding to the second flag, where P is a positive integer greater than 1;
  • Method 1 the above step A1 includes the following steps A1-11 and A1-12:
  • Example 1 Determine the third weighted prediction value as the index of the second context model.
  • the decoding end may also adopt the following method 2 to determine the index of the second context model.
  • a weighted average is performed on the second values corresponding to the prediction nodes in the j-th prediction reference frame to obtain a fourth weighted prediction value corresponding to the j-th prediction reference frame.
  • the decoding end After the decoding end determines the fourth weighted prediction value corresponding to each of the K prediction reference frames, it executes the above steps A1-23.
  • the current frame to be decoded includes 2 prediction reference frames, and these 2 prediction reference frames include the forward frame and backward frame of the current frame to be decoded.
  • the second weight corresponding to the forward frame is W1
  • the second weight corresponding to the backward frame is W2
  • the fourth weighted prediction value corresponding to the forward frame and the fourth weighted prediction value corresponding to the backward frame are weighted to obtain the index of the second context model.
  • the fourth weighted prediction values corresponding to the K prediction reference frames are weighted averaged to obtain the index of the second context model.
  • the fourth weighted prediction values corresponding to the K prediction reference frames are weighted summed to obtain the index of the second context model.
  • the decoder determines the index of the second up-down model based on the above steps, it selects the second context model from the P context models corresponding to the second mark. Then, the second context model is used to decode the second mark corresponding to the current node, and the number of points of the current node is determined based on the second mark.
  • the second flag corresponding to the current node indicates that the second point of the current node is not a repeated point, it is determined that the current node includes one point.
  • the method of the embodiment of the present application further includes the following step B:
  • Step B using the third direct decoding information of the N predicted nodes as context information, predictively decoding the third flag corresponding to the current node, where the third flag is used to indicate whether the third point of the current node is a repeated point.
  • the decoding end if the decoding end decodes the second flag corresponding to the current node and obtains that the second flag indicates that the second point of the current node is a repeated point, the decoding end continues to decode the third flag corresponding to the current node to determine whether the third point of the current node is a repeated point.
  • the third direct decoding information of N predicted nodes is used as context information to predictively decode the third flag corresponding to the current node, thereby achieving efficient decoding of the third flag.
  • Step B1 determining an index of a third context model based on third direct decoding information of N prediction nodes
  • Step B2 based on the index of the third context model, select the third context model from R context models corresponding to the third flag, where R is a positive integer greater than 1;
  • the third flag corresponds to multiple context models, for example, R context models.
  • the embodiment of the present application does not limit the specific number of context models corresponding to the third flag, as long as R is greater than 1. That is to say, in the embodiment of the present application, at least one optimal context model is selected from two context models to decode the third flag of the current node, so as to improve the decoding efficiency of the third flag.
  • Method 1 the above step B1 includes the following steps B1-11 and B1-12:
  • Step B1-11 for any prediction node among the N prediction nodes, determine a third value corresponding to the prediction node based on the third direct decoding information of the prediction node.
  • the third numerical value corresponding to the prediction node is determined based on the third direct decoding information of the prediction node, and finally the index of the third context model is determined based on the third numerical values corresponding to the N prediction nodes.
  • the third direct decoding information of the prediction node includes second information and fourth information, wherein the second information indicates whether the prediction node meets the conditions for decoding in a direct decoding manner, and the fourth information indicates whether the third point of the prediction node is a repeated point.
  • the decoding end determines the third value corresponding to the prediction node through the following steps:
  • Step 31 Determine a first sub-value based on the second information
  • Step 32 Determine a fourth sub-value based on the fourth information
  • Step 23 Determine a third value corresponding to the prediction node based on the first sub-value and/or the fourth sub-value.
  • the process of determining the first sub-value based on the second information can refer to the description of step 11 above, which will not be repeated here.
  • the method for determining the fourth sub-value can be: if the fourth information indicates that the third point of the predicted node is a repeated point, then the value of the fourth sub-value is determined to be the first value; if the fourth information indicates that the third point of the predicted node is not a repeated point, then the value of the fourth sub-value is determined to be the second value.
  • the decoding end determines the third value corresponding to the prediction node based on the first sub-value and/or the fourth sub-value determined above.
  • the first sub-value is determined as the third value corresponding to the prediction node.
  • the fourth sub-value is determined as the third value corresponding to the prediction node.
  • the third value corresponding to the prediction node is determined based on the first sub-value and the fourth sub-value, for example, the sum of the first sub-value and the fourth sub-value is determined as the third value corresponding to the prediction node.
  • Step B1-12 determine the index of the third context model based on the third numerical values corresponding to the N prediction nodes.
  • determining the index of the third context model based on the third values corresponding to the N prediction nodes includes at least the following implementation methods:
  • Method 1 Determine the average value of the sum of the third numerical values corresponding to the N prediction nodes as the index of the third context model.
  • step B1-12 includes the following steps B1-121 to B1-123:
  • Step B1-121 determining a first weight corresponding to the prediction node
  • Step B1-122 Based on the first weight, weighted processing is performed on the third values corresponding to the N prediction nodes to obtain a fifth weighted prediction value;
  • a weighted sum is performed on the third values corresponding to the N prediction nodes to obtain a fifth weighted prediction value.
  • the decoding end may also adopt the following method 2 to determine the index of the third context model.
  • a weighted sum is performed on the third values corresponding to the prediction nodes in the j-th prediction reference frame to obtain a sixth weighted prediction value corresponding to the j-th prediction reference frame.
  • the decoding end determines the average value of the sixth weighted prediction values corresponding to the K prediction reference frames as the index of the third context model.
  • the decoding end determines second weights corresponding to K predicted reference frames, and performs weighted processing on sixth weighted prediction values corresponding to the K predicted reference frames based on the second weights to obtain an index of the third context model.
  • the sixth weighted prediction values corresponding to the K prediction reference frames are weighted averaged to obtain the index of the third context model.
  • step 1 the first flag is decoded and if the first flag is true, that is, the number of points of the current node is greater than 1, then step 3 is executed, which means that the current node includes 2 points, and the 2 points are non-repeated points.
  • the second direct decoding information includes the following content:
  • the second point in the prediction node is a duplicate point. If the number of points in the prediction node is less than 2, it is assumed not to be a duplicate point by default.
  • Step 5 Determine whether the second point of the current node is a duplicate point.
  • step 7 is executed, which means that the current node includes multiple repeated points.
  • Step 6 The current node includes 1 point.
  • the point cloud includes geometric information and attribute information
  • the encoding of the point cloud includes geometric encoding and attribute encoding.
  • the embodiment of the present application relates to geometric encoding of point clouds.
  • the encoding end first determines the inter-frame coding information corresponding to the current node.
  • one or several encoded frames following the current frame to be encoded are determined as prediction reference frames of the current frame to be encoded.
  • S201-A1 for a k-th prediction reference frame among K prediction reference frames, determining at least one prediction node of a current node in the k-th prediction reference frame, where k is a positive integer less than or equal to K, and K is a positive integer;
  • S201-A2 Determine N prediction nodes of the current node based on at least one prediction node of the current node in K prediction reference frames.
  • the method of determining the difference between the placeholder information of the corresponding node and the placeholder information of the current node can refer to the above-mentioned process of determining the difference in placeholder information, for example, performing an XOR operation on the placeholder information of the corresponding node and the placeholder information of the current node, and using the XOR operation result as the difference between the placeholder information of the corresponding node and the placeholder information of the current node.
  • the encoder determines at least one prediction node of the current node in each of the K prediction reference frames, it executes the above step S201-B, that is, determines N prediction nodes of the current node based on at least one prediction node of the current node in the K prediction reference frames.
  • the encoding end determines the M domain nodes of the current node in the current frame to be encoded, it determines the corresponding node of each of the M domain nodes in the kth prediction reference frame, and then determines at least one prediction point of the current node in the kth prediction reference frame based on the corresponding node of each of the M domain nodes.
  • determining at least one prediction node of the current node in the kth prediction reference frame includes the following steps S201-B11 to S201-B13:
  • the embodiment of the present application does not limit the specific manner in which the encoder determines the matching node of the i-th parent node in the k-th prediction reference frame.
  • the encoder determines the matching node of the i-th parent node in the k-th prediction reference frame based on the placeholder information of the i-th parent node. Specifically, since the placeholder information of the i-th parent node in the current frame to be encoded has been encoded, and the placeholder information of each node in the k-th prediction reference frame has also been encoded, the encoder can search for the matching node of the i-th parent node in the k-th prediction reference frame based on the placeholder information of the i-th parent node.
  • the above describes the process of determining the corresponding node of the i-th domain node in the M domain nodes in the k-th prediction reference frame, and determining the corresponding node of the current node in the k-th prediction reference frame.
  • the encoder can use the second or third method to determine the N prediction nodes of the current node in the prediction reference frame.
  • the inter-frame coding information corresponding to the current node includes geometric coding information of N prediction nodes of the current node and at least one of the first information corresponding to the current node.
  • the above S202 includes the following step S202-1:
  • the geometric coding information of the prediction node includes direct coding information of the prediction node, wherein the direct coding information of the prediction node can be understood as information related to direct coding of the prediction node.
  • the direct coding information of the prediction node includes the number of points of the prediction node, geometric information of each point included in the prediction node, and information on whether the prediction node is encoded using a direct coding method.
  • step S202-1 includes the following step S202-11:
  • the current point cloud is encoded using the GPCC method
  • the number of points included in the current node is less than or equal to 2
  • the current node is encoded using a direct encoding method.
  • the first flag is used to indicate whether the number of points included in the current node is greater than 1
  • the second flag is used to indicate whether the second point of the current node is a repeated point
  • the third flag is used to indicate whether the third point of the current node is a repeated point.
  • the context information used by the encoding end when encoding the first flag, the second flag, and the third flag may be the same, or different, or may be partially the same and partially different, and the embodiment of the present application does not limit this.
  • at least one of the first flag, the second flag, and the third flag uses inter-frame information (such as direct decoding information of the prediction node) as context information for predictive coding.
  • the direct coding information of the prediction node includes first direct coding information.
  • the predictive coding of the points of the current node based on the direct coding information of the N prediction nodes in S202-11 includes the following step S202-11-A:
  • the encoder may determine the first value corresponding to the prediction node based on the second information of the prediction node and/or the number of points included in the prediction node.
  • Step 11 Determine a first sub-value based on the second information
  • the second information is used to indicate whether the prediction node meets the conditions for encoding by direct encoding, and then the first sub-value can be determined to be a binary value. For example, if the second information indicates that the prediction node meets the conditions for encoding by direct encoding, the value of the first sub-value is determined to be the first value; if the second information indicates that the prediction node does not meet the conditions for encoding by direct encoding, the value of the first sub-value is determined to be the second value.
  • the above-mentioned second sub-value is a binary value.
  • the number of points included in the prediction node of the encoding end can be used to determine the second sub-value: if the number of points included in the prediction node is greater than the first preset value, the value of the second sub-value is determined to be the first value; if the number of points included in the prediction node is less than or equal to the first preset value, the value of the second sub-value is determined to be the second value.
  • the encoder determines the first value corresponding to the prediction node based on the first sub-value and/or the second sub-value determined above.
  • determining the index of the first context model based on the first values corresponding to the N prediction nodes includes at least the following implementation methods:
  • Method 2 If K is greater than 1, determine the second weighted prediction value corresponding to each of the K prediction reference frames, and then determine the index of the first context model based on the second weighted prediction values corresponding to the K prediction reference frames.
  • the above S202-11-A1 includes the following steps S202-11-A1-b1 to S202-11-A1-a4:
  • the encoding end After the encoding end determines the first value of at least one prediction node included in the j-th prediction reference frame, it determines the first weight corresponding to the at least one prediction node, and weightedly processes the first value corresponding to the at least one prediction node based on the first weight to obtain a second weighted prediction value corresponding to the j-th prediction reference frame.
  • the encoding end determines second weights corresponding to K prediction reference frames, and performs weighted processing on second weighted prediction values corresponding to the K prediction reference frames based on the second weights to obtain an index of the first context model.
  • the second weight corresponding to the predicted reference frame is determined.
  • each point cloud includes time information, and the time information may be the time when the point cloud acquisition device acquires the point cloud of the frame. Based on this, if the time difference between the predicted reference frame and the current frame to be encoded is smaller, the inter-frame correlation between the predicted reference frame and the current frame to be encoded is stronger, and thus the second weight corresponding to the predicted reference frame is larger. For example, the inverse of the time difference between the predicted reference frame and the current frame to be encoded can be determined as the second weight corresponding to the predicted reference frame.
  • the second weighted prediction values corresponding to the K prediction reference frames are weighted summed to obtain the index of the first context model.
  • the encoder After determining the index of the first context model based on the above steps, the encoder selects the first context model from the Q context models corresponding to the first flag, and then uses the first context model to encode the first flag corresponding to the current node.
  • Step A Use the second direct coding information of the N predicted nodes as context information to perform predictive coding on the second flag corresponding to the current node, where the second flag is used to indicate whether the second point of the current node is a repeated point.
  • the encoding end needs to encode a second flag to indicate whether the second point of the current block is a repeated point.
  • Step 22 Determine a third sub-value based on the third information
  • the index of the third context model is determined based on the fifth weighted prediction value, that is, the above step B1-123 includes at least the following examples:
  • the point cloud encoding method of the embodiment of the present application includes the following steps:
  • the point cloud encoding method determines the inter-frame encoding information corresponding to the current node when encoding the current node in the current encoding frame, and uses the inter-frame encoding information corresponding to the current node as context information to predict the number of points of the current node, thereby increasing the selectable context conditions when encoding the point number, reducing the determined conditional entropy, and thus improving the encoding efficiency and performance of the point cloud.

Landscapes

  • Compression, Expansion, Code Conversion, And Decoders (AREA)
  • Compression Or Coding Systems Of Tv Signals (AREA)

Abstract

Provided in the present application are a point cloud coding method and apparatus, a point cloud decoding method and apparatus, and devices and a storage medium. The decoding method comprises: when performing decoding on the current node in the current decoded frame, determining inter-frame decoding information corresponding to the current node; and taking the inter-frame decoding information corresponding to the current node as context information, and performing predictive decoding regarding the number of points of the current node. Context conditions for being selected during the decoding of the number of points are enriched, such that a determined conditional entropy is reduced, thereby improving the decoding efficiency and performance of a point cloud.

Description

点云编解码方法、装置、设备及存储介质Point cloud encoding and decoding method, device, equipment and storage medium 技术领域Technical Field
本申请涉及点云技术领域,尤其涉及一种点云编解码方法、装置、设备及存储介质。The present application relates to the field of point cloud technology, and in particular to a point cloud encoding and decoding method, device, equipment and storage medium.
背景技术Background technique
通过采集设备对物体表面进行采集,形成点云数据,点云数据包括几十万甚至更多的点。在视频制作过程中,将点云数据以点云媒体文件的形式在点云编码设备和点云解码设备之间传输。但是,如此庞大的点给传输带来了挑战,因此,点云编码设备需要对点云数据进行压缩后传输。The surface of the object is collected by the acquisition device to form point cloud data, which includes hundreds of thousands or even more points. In the video production process, the point cloud data is transmitted between the point cloud encoding device and the point cloud decoding device in the form of point cloud media files. However, such a large number of points brings challenges to transmission, so the point cloud encoding device needs to compress the point cloud data before transmission.
点云的压缩也称为点云的编码,在点云编码过程中,对于在几何空间中处于孤立位置的点来说,使用推断直接编码方式(Infer Direct Mode Coding,简称IDCM)可以大大降低复杂度。在采用直接编码方式对当前节点进行编解码时,需要对当前节点所包括的点数进行预测编解码。但是,目前对当前节点所包括的点数进行预测编解码的方式较单一,进而降低点云的编解码性能。Point cloud compression is also called point cloud encoding. In the point cloud encoding process, for points that are isolated in the geometric space, the use of infer direct coding (IDCM) can greatly reduce the complexity. When the direct coding method is used to encode and decode the current node, it is necessary to predict the number of points included in the current node. However, the current method of predicting the number of points included in the current node is relatively simple, which reduces the encoding and decoding performance of the point cloud.
发明内容Summary of the invention
本申请实施例提供了一种点云编解码方法、装置、设备及存储介质,增加编码节点的点数时可选择的上下文条件,进而提升点云的编解码性能。The embodiments of the present application provide a point cloud encoding and decoding method, apparatus, device and storage medium, which increase the selectable context conditions when encoding the number of points of the node, thereby improving the encoding and decoding performance of the point cloud.
第一方面,本申请实施例提供一种点云解码方法,包括:In a first aspect, an embodiment of the present application provides a point cloud decoding method, comprising:
确定当前节点对应的帧间解码信息,所述当前节点为当前待解码帧中的待解码节点;Determine inter-frame decoding information corresponding to a current node, wherein the current node is a node to be decoded in a current frame to be decoded;
基于所述当前节点对应的帧间解码信息,对所述当前节点所包括的点数进行预测解码。Based on the inter-frame decoding information corresponding to the current node, predictive decoding is performed on the number of points included in the current node.
第二方面,本申请提供了一种点云编码方法,包括:In a second aspect, the present application provides a point cloud encoding method, comprising:
确定当前节点对应的帧间编码信息,所述当前节点为当前待编码帧中的待编码节点;Determine inter-frame coding information corresponding to a current node, where the current node is a node to be coded in a current frame to be coded;
基于所述当前节点对应的帧间编码信息,对所述当前节点所包括的点数进行预测编码。Based on the inter-frame coding information corresponding to the current node, predictive coding is performed on the points included in the current node.
第三方面,本申请提供了一种点云解码装置,用于执行上述第一方面或其各实现方式中的方法。具体地,该装置包括用于执行上述第一方面或其各实现方式中的方法的功能单元。In a third aspect, the present application provides a point cloud decoding device for executing the method in the first aspect or its respective implementations. Specifically, the device includes a functional unit for executing the method in the first aspect or its respective implementations.
第四方面,本申请提供了一种点云编码装置,用于执行上述第二方面或其各实现方式中的方法。具体地,该装置包括用于执行上述第二方面或其各实现方式中的方法的功能单元。In a fourth aspect, the present application provides a point cloud encoding device for executing the method in the second aspect or its respective implementations. Specifically, the device includes a functional unit for executing the method in the second aspect or its respective implementations.
第五方面,提供了一种点云解码器,包括处理器和存储器。该存储器用于存储计算机程序,该处理器用于调用并运行该存储器中存储的计算机程序,以执行上述第一方面或其各实现方式中的方法。In a fifth aspect, a point cloud decoder is provided, comprising a processor and a memory. The memory is used to store a computer program, and the processor is used to call and run the computer program stored in the memory to execute the method in the first aspect or its implementation manners.
第六方面,提供了一种点云编码器,包括处理器和存储器。该存储器用于存储计算机程序,该处理器用于调用并运行该存储器中存储的计算机程序,以执行上述第二方面或其各实现方式中的方法。In a sixth aspect, a point cloud encoder is provided, comprising a processor and a memory. The memory is used to store a computer program, and the processor is used to call and run the computer program stored in the memory to execute the method in the second aspect or its respective implementations.
第七方面,提供了一种点云编解码***,包括点云编码器和点云解码器。点云解码器用于执行上述第一方面或其各实现方式中的方法,点云编码器用于执行上述第二方面或其各实现方式中的方法。In a seventh aspect, a point cloud encoding and decoding system is provided, comprising a point cloud encoder and a point cloud decoder. The point cloud decoder is used to execute the method in the first aspect or its respective implementations, and the point cloud encoder is used to execute the method in the second aspect or its respective implementations.
第八方面,提供了一种芯片,用于实现上述第一方面至第二方面中的任一方面或其各实现方式中的方法。具体地,该芯片包括:处理器,用于从存储器中调用并运行计算机程序,使得安装有该芯片的设备执行如上述第一方面至第二方面中的任一方面或其各实现方式中的方法。In an eighth aspect, a chip is provided for implementing the method in any one of the first to second aspects or their respective implementations. Specifically, the chip includes: a processor for calling and running a computer program from a memory, so that a device equipped with the chip executes the method in any one of the first to second aspects or their respective implementations.
第九方面,提供了一种计算机可读存储介质,用于存储计算机程序,该计算机程序使得计算机执行上述第一方面至第二方面中的任一方面或其各实现方式中的方法。In a ninth aspect, a computer-readable storage medium is provided for storing a computer program, wherein the computer program enables a computer to execute the method of any one of the first to second aspects or any of their implementations.
第十方面,提供了一种计算机程序产品,包括计算机程序指令,该计算机程序指令使得计算机执行上述第一方面至第二方面中的任一方面或其各实现方式中的方法。In a tenth aspect, a computer program product is provided, comprising computer program instructions, which enable a computer to execute the method in any one of the first to second aspects or their respective implementations.
第十一方面,提供了一种计算机程序,当其在计算机上运行时,使得计算机执行上述第一方面至第二方面中的任一方面或其各实现方式中的方法。In an eleventh aspect, a computer program is provided, which, when executed on a computer, enables the computer to execute the method in any one of the first to second aspects or in each of their implementations.
第十二方面,提供了一种码流,码流是基于上述第二方面的方法生成的,可选的,上述码流包括第一参数和第二参数中的至少一个。In a twelfth aspect, a code stream is provided, which is generated based on the method of the second aspect. Optionally, the code stream includes at least one of the first parameter and the second parameter.
基于以上技术方案,在对当前解码帧中的当前节点进行解码时,确定当前节点对应的帧间解码信息,并将当前节点对应的帧间解码信息作为上下文信息对当前节点的点数进行预测解码,增加了编解码节点的点数时可选择的上下文条件,使得确定出的条件熵减小,从而提升点云的解码效率和性能。Based on the above technical solution, when decoding the current node in the current decoding frame, the inter-frame decoding information corresponding to the current node is determined, and the inter-frame decoding information corresponding to the current node is used as context information to predict the number of points of the current node, thereby increasing the selectable context conditions when encoding and decoding the number of points of the node, reducing the determined conditional entropy, thereby improving the decoding efficiency and performance of the point cloud.
附图说明BRIEF DESCRIPTION OF THE DRAWINGS
图1A为点云示意图;FIG1A is a schematic diagram of a point cloud;
图1B为点云局部放大图;Figure 1B is a partial enlarged view of the point cloud;
图2为点云图像的六个观看角度示意图;FIG2 is a schematic diagram of six viewing angles of a point cloud image;
图3为本申请实施例涉及的一种点云编解码***的示意性框图;FIG3 is a schematic block diagram of a point cloud encoding and decoding system according to an embodiment of the present application;
图4A是本申请实施例提供的点云编码器的示意性框图;FIG4A is a schematic block diagram of a point cloud encoder provided in an embodiment of the present application;
图4B是本申请实施例提供的点云解码器的示意性框图;FIG4B is a schematic block diagram of a point cloud decoder provided in an embodiment of the present application;
图5A为一种平面示意图;FIG5A is a schematic plan view;
图5B为节点编码顺序示意图;FIG5B is a schematic diagram of node coding sequence;
图5C为平面标识示意图;FIG5C is a schematic diagram of a plane mark;
图5D为兄弟姐妹节点示意图;FIG5D is a schematic diagram of a sibling node;
图5E为激光雷达与节点的相交示意图;FIG5E is a schematic diagram of the intersection of a laser radar and a node;
图5F为处于相同划分深度相同坐标的邻域节点示意图;FIG5F is a schematic diagram of neighborhood nodes at the same division depth and the same coordinates;
图5G为当节点位于父节点低平面位置时邻域节点示意图;FIG5G is a schematic diagram of a neighboring node when the node is located at a lower plane position of the parent node;
图5H为当节点位于父节点高平面位置时邻域节点示意图;FIG5H is a schematic diagram of a neighboring node when the node is located at a high plane position of the parent node;
图5I为激光雷达点云平面位置信息的预测编码示意图;FIG5I is a schematic diagram of predictive coding of planar position information of a laser radar point cloud;
图6A为IDCM编码示意图;FIG6A is a schematic diagram of IDCM encoding;
图6B为旋转激光雷达获取的点云的坐标转换示意图;FIG6B is a schematic diagram of coordinate transformation of a point cloud acquired by a rotating laser radar;
图6C为X或Y轴方向的预测编码示意图;FIG6C is a schematic diagram of predictive coding in the X or Y axis direction;
图6D为通过水平方位角来进行预测X或者Y平面的角度示意图;FIG6D is a schematic diagram showing the angle of the X or Y plane predicted by the horizontal azimuth angle;
图6E为X或Y轴的预测编码示意图;FIG6E is a schematic diagram of predictive coding of the X or Y axis;
图7A至图7C为基于三角面片的几何信息编码示意图;7A to 7C are schematic diagrams of geometric information encoding based on triangular facets;
图8为本申请一实施例提供的点云解码方法流程示意图;FIG8 is a schematic diagram of a point cloud decoding method flow chart provided by an embodiment of the present application;
图9为一种八叉树划分示意图;FIG9 is a schematic diagram of an octree partition;
图10为预测节点的一种示意图;FIG10 is a schematic diagram of a prediction node;
图11为领域节点的一种示意图;FIG11 is a schematic diagram of a domain node;
图12为领域节点的对应节点示意图;FIG12 is a schematic diagram of corresponding nodes of a domain node;
图13A为当前节点在一个预测参考帧中的预测节点的示意图;FIG13A is a schematic diagram of a predicted node of a current node in a predicted reference frame;
图13B为当前节点在两个预测参考帧中的预测节点的示意图;FIG13B is a schematic diagram of the prediction nodes of the current node in two prediction reference frames;
图14A为一种IDCM编码示意图;FIG14A is a schematic diagram of IDCM encoding;
图14B为一种IDCM解码示意图;FIG14B is a schematic diagram of IDCM decoding;
图15为本申请一实施例提供的点云编码方法流程示意图;FIG15 is a schematic diagram of a point cloud encoding method flow chart provided by an embodiment of the present application;
图16是本申请实施例提供的点云解码装置的示意性框图;FIG16 is a schematic block diagram of a point cloud decoding device provided in an embodiment of the present application;
图17是本申请实施例提供的点云编码装置的示意性框图;FIG17 is a schematic block diagram of a point cloud encoding device provided in an embodiment of the present application;
图18是本申请实施例提供的电子设备的示意性框图;FIG18 is a schematic block diagram of an electronic device provided in an embodiment of the present application;
图19是本申请实施例提供的点云编解码***的示意性框图。Figure 19 is a schematic block diagram of the point cloud encoding and decoding system provided in an embodiment of the present application.
具体实施方式Detailed ways
本申请可应用于点云上采样技术领域,例如可以应用于点云压缩技术领域。The present application can be applied to the field of point cloud upsampling technology, for example, can be applied to the field of point cloud compression technology.
为了便于理解本申请的实施例,首先对本申请实施例涉及到的相关概念进行如下简单介绍:In order to facilitate understanding of the embodiments of the present application, the relevant concepts involved in the embodiments of the present application are briefly introduced as follows:
点云(Point Cloud)是指空间中一组无规则分布的、表达三维物体或三维场景的空间结构及表面属性的离散点集。图1A为三维点云图像示意图,图1B为图1A的局部放大图,由图1A和图1B可知,点云表面是由分布稠密的点所组成的。Point cloud refers to a set of irregularly distributed discrete points in space that express the spatial structure and surface properties of a three-dimensional object or three-dimensional scene. Figure 1A is a schematic diagram of a three-dimensional point cloud image, and Figure 1B is a partial enlarged view of Figure 1A. It can be seen from Figures 1A and 1B that the point cloud surface is composed of densely distributed points.
二维图像在每一个像素点均有信息表达,分布规则,因此不需要额外记录其位置信息;然而点云中的点在三维空间中的分布具有随机性和不规则性,因此需要记录每一个点在空间中的位置,才能完整地表达一幅点云。与二维图像类似,采集过程中每一个位置均有对应的属性信息。Two-dimensional images have information expressed at each pixel point, and the distribution is regular, so there is no need to record its position information; however, the distribution of points in the point cloud in three-dimensional space is random and irregular, so it is necessary to record the position of each point in space to fully express a point cloud. Similar to two-dimensional images, each position has corresponding attribute information during the acquisition process.
点云数据(Point Cloud Data)是点云的具体记录形式,点云中的点可以包括点的位置信息和点的属性信息。例如,点的位置信息可以是点的三维坐标信息。点的位置信息也可称为点的几何信息。例如,点的属性信息可包括颜色信息、反射率信息、法向量信息等等。颜色信息反映物体的色彩,反射率(reflectance)信息反映物体的表面材质。所述颜色信息可以是任意一种色彩空间上的信息。例如,所述颜色信息可以是(RGB)。再如,所述颜色信息可以是于亮度色度(YcbCr,YUV)信息。例如,Y表示明亮度(Luma),Cb(U)表示蓝色色差,Cr(V)表示红色,U和V表示为色度(Chroma)用于描述色差信息。例如,根据激光测量原理得到的点云,所述点云中的点可以包括点的三维坐标信息和点的激光反射强度(reflectance)。再如,根据摄影测量原理得到的点云,所述点云中的点可以可包括点的三维坐标信息和点的颜色信息。再如,结合激光测量和摄影测量原理得到点云,所述点云中的点可以可包括点的三维坐标信息、点的激光反射强度(reflectance)和点的颜色信息。如图2示出了一幅点云图像,其中,图2示出了点云图像的六个观看角度,表1示出了由文件头信息部分和数据部分组成的点云数据存储格式:Point cloud data is a specific record form of point cloud. Points in the point cloud may include the location information of the point and the attribute information of the point. For example, the location information of the point may be the three-dimensional coordinate information of the point. The location information of the point may also be called the geometric information of the point. For example, the attribute information of the point may include color information, reflectance information, normal vector information, etc. Color information reflects the color of an object, and reflectance information reflects the surface material of an object. The color information may be information in any color space. For example, the color information may be (RGB). For another example, the color information may be information about brightness and chromaticity (YcbCr, YUV). For example, Y represents brightness (Luma), Cb (U) represents blue color difference, Cr (V) represents red, and U and V represent chromaticity (Chroma) for describing color difference information. For example, according to the point cloud obtained by the laser measurement principle, the points in the point cloud may include the three-dimensional coordinate information of the point and the laser reflection intensity (reflectance) of the point. For another example, according to the point cloud obtained by the photogrammetry principle, the points in the point cloud may include the three-dimensional coordinate information of the point and the color information of the point. For another example, a point cloud is obtained by combining the principles of laser measurement and photogrammetry. The points in the point cloud may include the three-dimensional coordinate information of the point, the laser reflection intensity (reflectance) of the point, and the color information of the point. FIG2 shows a point cloud image, where FIG2 shows six viewing angles of the point cloud image. Table 1 shows the point cloud data storage format composed of a file header information part and a data part:
表1Table 1
Figure PCTCN2023071069-appb-000001
Figure PCTCN2023071069-appb-000001
表1中,头信息包含了数据格式、数据表示类型、点云总点数、以及点云所表示的内容,例如,本例中的点云为 “.ply”格式,由ASCII码表示,总点数为207242,每个点具有三维位置信息XYZ和三维颜色信息RGB。In Table 1, the header information includes the data format, data representation type, the total number of point cloud points, and the content represented by the point cloud. For example, the point cloud in this example is in the ".ply" format, represented by ASCII code, with a total number of 207242 points, and each point has three-dimensional position information XYZ and three-dimensional color information RGB.
点云可以灵活方便地表达三维物体或场景的空间结构及表面属性,并且由于点云通过直接对真实物体采样获得,在保证精度的前提下能提供极强的真实感,因而应用广泛,其范围包括虚拟现实游戏、计算机辅助设计、地理信息***、自动导航***、数字文化遗产、自由视点广播、三维沉浸远程呈现、生物组织器官三维重建等。Point clouds can flexibly and conveniently express the spatial structure and surface properties of three-dimensional objects or scenes. Point clouds are obtained by directly sampling real objects, so they can provide a strong sense of reality while ensuring accuracy. Therefore, they are widely used, including virtual reality games, computer-aided design, geographic information systems, automatic navigation systems, digital cultural heritage, free viewpoint broadcasting, three-dimensional immersive remote presentation, and three-dimensional reconstruction of biological tissues and organs.
点云数据的获取途径可以包括但不限于以下至少一种:(1)计算机设备生成。计算机设备可以根据虚拟三维物体及虚拟三维场景的生成点云数据。(2)3D(3-Dimension,三维)激光扫描获取。通过3D激光扫描可以获取静态现实世界三维物体或三维场景的点云数据,每秒可以获取百万级点云数据;(3)3D摄影测量获取。通过3D摄影设备(即一组摄像机或具有多个镜头和传感器的摄像机设备)对现实世界的视觉场景进行采集以获取现实世界的视觉场景的点云数据,通过3D摄影可以获得动态现实世界三维物体或三维场景的点云数据。(4)通过医学设备获取生物组织器官的点云数据。在医学领域可以通过磁共振成像(Magnetic Resonance Imaging,MRI)、电子计算机断层扫描(Computed Tomography,CT)、电磁定位信息等医学设备获取生物组织器官的点云数据。Point cloud data can be obtained by at least one of the following ways: (1) computer equipment generation. Computer equipment can generate point cloud data based on virtual three-dimensional objects and virtual three-dimensional scenes. (2) 3D (3-Dimension) laser scanning acquisition. 3D laser scanning can be used to obtain point cloud data of static real-world three-dimensional objects or three-dimensional scenes, and millions of point cloud data can be obtained per second; (3) 3D photogrammetry acquisition. The visual scene of the real world is collected by 3D photography equipment (i.e., a group of cameras or camera equipment with multiple lenses and sensors) to obtain point cloud data of the visual scene of the real world. 3D photography can be used to obtain point cloud data of dynamic real-world three-dimensional objects or three-dimensional scenes. (4) Point cloud data of biological tissues and organs can be obtained by medical equipment. In the medical field, point cloud data of biological tissues and organs can be obtained by medical equipment such as magnetic resonance imaging (MRI), computed tomography (CT), and electromagnetic positioning information.
点云可以按获取的途径分为:密集型点云和稀疏性点云。Point clouds can be divided into dense point clouds and sparse point clouds according to the way they are acquired.
点云按照数据的时序类型划分为:Point clouds are divided into the following types according to the time series of the data:
第一类静态点云:即物体是静止的,获取点云的设备也是静止的;The first type of static point cloud: the object is stationary, and the device that obtains the point cloud is also stationary;
第二类动态点云:物体是运动的,但获取点云的设备是静止的;The second type of dynamic point cloud: the object is moving, but the device that obtains the point cloud is stationary;
第三类动态获取点云:获取点云的设备是运动的。The third type of dynamic point cloud acquisition: the device that acquires the point cloud is moving.
按点云的用途分为两大类:Point clouds can be divided into two categories according to their uses:
类别一:机器感知点云,其可以用于自主导航***、实时巡检***、地理信息***、视觉分拣机器人、抢险救灾机器人等场景;Category 1: Machine perception point cloud, which can be used in autonomous navigation systems, real-time inspection systems, geographic information systems, visual sorting robots, disaster relief robots, etc.
类别二:人眼感知点云,其可以用于数字文化遗产、自由视点广播、三维沉浸通信、三维沉浸交互等点云应用场景。Category 2: Point cloud perceived by the human eye, which can be used in point cloud application scenarios such as digital cultural heritage, free viewpoint broadcasting, 3D immersive communication, and 3D immersive interaction.
上述点云获取技术降低了点云数据获取成本和时间周期,提高了数据的精度。点云数据获取方式的变革,使大量点云数据的获取成为可能,伴随着应用需求的增长,海量3D点云数据的处理遭遇存储空间和传输带宽限制的瓶颈。The above point cloud acquisition technology reduces the cost and time of point cloud data acquisition and improves the accuracy of data. The change in the point cloud data acquisition method makes it possible to acquire a large amount of point cloud data. With the growth of application demand, the processing of massive 3D point cloud data encounters bottlenecks of storage space and transmission bandwidth.
以帧率为30fps(帧每秒)的点云视频为例,每帧点云的点数为70万,每个点具有坐标信息xyz(float)和颜色信息RGB(uchar),则10s点云视频的数据量大约为0.7millionX(4ByteX3+1ByteX3)X30fpsX10s=3.15GB,而YUV采样格式为4:2:0,帧率为24fps的1280X720二维视频,其10s的数据量约为1280X720X12bitX24framesX10s≈0.33GB,10s的两视角3D视频的数据量约为0.33X2=0.66GB。由此可见,点云视频的数据量远超过相同时长的二维视频和三维视频的数据量。因此,为更好地实现数据管理,节省服务器存储空间,降低服务器与客户端之间的传输流量及传输时间,点云压缩成为促进点云产业发展的关键问题。Taking a point cloud video with a frame rate of 30fps (frames per second) as an example, the number of points in each point cloud frame is 700,000, and each point has coordinate information xyz (float) and color information RGB (uchar). The data volume of a 10s point cloud video is approximately 0.7 million X (4ByteX3+1ByteX3) X 30fpsX10s = 3.15GB, while the YUV sampling format is 4:2:0, and the frame rate is 24fps. The data volume of a 1280X720 two-dimensional video in 10s is approximately 1280X720X12bitX24framesX10s≈0.33GB, and the data volume of a 10s two-view 3D video is approximately 0.33X2 = 0.66GB. It can be seen that the data volume of a point cloud video far exceeds that of a two-dimensional video and a three-dimensional video of the same length. Therefore, in order to better realize data management, save server storage space, and reduce the transmission traffic and transmission time between the server and the client, point cloud compression has become a key issue in promoting the development of the point cloud industry.
下面对点云编解码的相关知识进行介绍。The following is an introduction to the relevant knowledge of point cloud encoding and decoding.
图3为本申请实施例涉及的一种点云编解码***的示意性框图。需要说明的是,图3只是一种示例,本申请实施例的点云编解码***包括但不限于图3所示。如图3所示,该点云编解码***100包含编码设备110和解码设备120。其中编码设备用于对点云数据进行编码(可以理解成压缩)产生码流,并将码流传输给解码设备。解码设备对编码设备编码产生的码流进行解码,得到解码后的点云数据。FIG3 is a schematic block diagram of a point cloud encoding and decoding system involved in an embodiment of the present application. It should be noted that FIG3 is only an example, and the point cloud encoding and decoding system of the embodiment of the present application includes but is not limited to that shown in FIG3. As shown in FIG3, the point cloud encoding and decoding system 100 includes an encoding device 110 and a decoding device 120. The encoding device is used to encode (which can be understood as compression) the point cloud data to generate a code stream, and transmit the code stream to the decoding device. The decoding device decodes the code stream generated by the encoding device to obtain decoded point cloud data.
本申请实施例的编码设备110可以理解为具有点云编码功能的设备,解码设备120可以理解为具有点云解码功能的设备,即本申请实施例对编码设备110和解码设备120包括更广泛的装置,例如包含智能手机、台式计算机、移动计算装置、笔记本(例如,膝上型)计算机、平板计算机、机顶盒、电视、相机、显示装置、数字媒体播放器、点云游戏控制台、车载计算机等。The encoding device 110 of the embodiment of the present application can be understood as a device with a point cloud encoding function, and the decoding device 120 can be understood as a device with a point cloud decoding function, that is, the embodiment of the present application includes a wider range of devices for the encoding device 110 and the decoding device 120, such as smartphones, desktop computers, mobile computing devices, notebook (e.g., laptop) computers, tablet computers, set-top boxes, televisions, cameras, display devices, digital media players, point cloud game consoles, vehicle-mounted computers, etc.
在一些实施例中,编码设备110可以经由信道130将编码后的点云数据(如码流)传输给解码设备120。信道130可以包括能够将编码后的点云数据从编码设备110传输到解码设备120的一个或多个媒体和/或装置。In some embodiments, the encoding device 110 may transmit the encoded point cloud data (such as a code stream) to the decoding device 120 via the channel 130. The channel 130 may include one or more media and/or devices capable of transmitting the encoded point cloud data from the encoding device 110 to the decoding device 120.
在一个实例中,信道130包括使编码设备110能够实时地将编码后的点云数据直接发射到解码设备120的一个或多个通信媒体。在此实例中,编码设备110可根据通信标准来调制编码后的点云数据,且将调制后的点云数据发射到解码设备120。其中通信媒体包含无线通信媒体,例如射频频谱,可选的,通信媒体还可以包含有线通信媒体,例如一根或多根物理传输线。In one example, the channel 130 includes one or more communication media that enable the encoding device 110 to transmit the encoded point cloud data directly to the decoding device 120 in real time. In this example, the encoding device 110 can modulate the encoded point cloud data according to the communication standard and transmit the modulated point cloud data to the decoding device 120. The communication medium includes a wireless communication medium, such as a radio frequency spectrum, and optionally, the communication medium may also include a wired communication medium, such as one or more physical transmission lines.
在另一实例中,信道130包括存储介质,该存储介质可以存储编码设备110编码后的点云数据。存储介质包含多种本地存取式数据存储介质,例如光盘、DVD、快闪存储器等。在该实例中,解码设备120可从该存储介质中获取编码后的点云数据。In another example, the channel 130 includes a storage medium, which can store the point cloud data encoded by the encoding device 110. The storage medium includes a variety of locally accessible data storage media, such as optical disks, DVDs, flash memories, etc. In this example, the decoding device 120 can obtain the encoded point cloud data from the storage medium.
在另一实例中,信道130可包含存储服务器,该存储服务器可以存储编码设备110编码后的点云数据。在此实例中,解码设备120可以从该存储服务器中下载存储的编码后的点云数据。可选的,该存储服务器可以存储编码后的点云数据且可以将该编码后的点云数据发射到解码设备120,例如web服务器(例如,用于网站)、文件传送协议(FTP)服务器等。In another example, the channel 130 may include a storage server that can store the point cloud data encoded by the encoding device 110. In this example, the decoding device 120 can download the stored encoded point cloud data from the storage server. Optionally, the storage server can store the encoded point cloud data and transmit the encoded point cloud data to the decoding device 120, such as a web server (e.g., for a website), a file transfer protocol (FTP) server, etc.
一些实施例中,编码设备110包含点云编码器112及输出接口113。其中,输出接口113可以包含调制器/解调器(调制解调器)和/或发射器。In some embodiments, the encoding device 110 includes a point cloud encoder 112 and an output interface 113. The output interface 113 may include a modulator/demodulator (modem) and/or a transmitter.
在一些实施例中,编码设备110除了包括点云编码器112和输入接口113外,还可以包括点云源111。In some embodiments, the encoding device 110 may further include a point cloud source 111 in addition to the point cloud encoder 112 and the input interface 113 .
点云源111可包含点云采集装置(例如,扫描仪)、点云存档、点云输入接口、计算机图形***中的至少一个,其中,点云输入接口用于从点云内容提供者处接收点云数据,计算机图形***用于产生点云数据。The point cloud source 111 may include at least one of a point cloud acquisition device (e.g., a scanner), a point cloud archive, a point cloud input interface, and a computer graphics system, wherein the point cloud input interface is used to receive point cloud data from a point cloud content provider, and the computer graphics system is used to generate point cloud data.
点云编码器112对来自点云源111的点云数据进行编码,产生码流。点云编码器112经由输出接口113将编码后的点云数据直接传输到解码设备120。编码后的点云数据还可存储于存储介质或存储服务器上,以供解码设备120后 续读取。The point cloud encoder 112 encodes the point cloud data from the point cloud source 111 to generate a code stream. The point cloud encoder 112 directly transmits the encoded point cloud data to the decoding device 120 via the output interface 113. The encoded point cloud data can also be stored in a storage medium or a storage server for subsequent reading by the decoding device 120.
在一些实施例中,解码设备120包含输入接口121和点云解码器122。In some embodiments, the decoding device 120 includes an input interface 121 and a point cloud decoder 122 .
在一些实施例中,解码设备120除包括输入接口121和点云解码器122外,还可以包括显示装置123。In some embodiments, the decoding device 120 may further include a display device 123 in addition to the input interface 121 and the point cloud decoder 122 .
其中,输入接口121包含接收器及/或调制解调器。输入接口121可通过信道130接收编码后的点云数据。The input interface 121 includes a receiver and/or a modem. The input interface 121 can receive the encoded point cloud data through the channel 130 .
点云解码器122用于对编码后的点云数据进行解码,得到解码后的点云数据,并将解码后的点云数据传输至显示装置123。The point cloud decoder 122 is used to decode the encoded point cloud data to obtain decoded point cloud data, and transmit the decoded point cloud data to the display device 123.
显示装置123显示解码后的点云数据。显示装置123可与解码设备120整合或在解码设备120外部。显示装置123可包括多种显示装置,例如液晶显示器(LCD)、等离子体显示器、有机发光二极管(OLED)显示器或其它类型的显示装置。The decoded point cloud data is displayed on the display device 123. The display device 123 may be integrated with the decoding device 120 or may be external to the decoding device 120. The display device 123 may include a variety of display devices, such as a liquid crystal display (LCD), a plasma display, an organic light emitting diode (OLED) display, or other types of display devices.
此外,图3仅为实例,本申请实施例的技术方案不限于图3,例如本申请的技术还可以应用于单侧的点云编码或单侧的点云解码。In addition, Figure 3 is only an example, and the technical solution of the embodiment of the present application is not limited to Figure 3. For example, the technology of the present application can also be applied to unilateral point cloud encoding or unilateral point cloud decoding.
目前的点云编码器可以采用国际标准组织运动图像专家组(Moving Picture Experts Group,MPEG)提出了两种点云压缩编码技术路线,分别是基于投影的点云压缩(Video-based Point Cloud Compression,VPCC)和基于几何的点云压缩(Geometry-based Point Cloud Compression,GPCC)。VPCC通过将三维点云投影到二维,利用现有的二维编码工具对投影后的二维图像进行编码,GPCC利用层级化的结构将点云逐级划分为多个单元,通过编码记录划分过程编码整个点云。The current point cloud encoder can adopt two point cloud compression coding technology routes proposed by the International Standards Organization Moving Picture Experts Group (MPEG), namely Video-based Point Cloud Compression (VPCC) and Geometry-based Point Cloud Compression (GPCC). VPCC projects the three-dimensional point cloud into two dimensions and uses the existing two-dimensional coding tools to encode the projected two-dimensional image. GPCC uses a hierarchical structure to divide the point cloud into multiple units step by step, and encodes the entire point cloud by encoding the division process.
下面以GPCC编解码框架为例,对本申请实施例可适用的点云编码器和点云解码器进行说明。The following uses the GPCC encoding and decoding framework as an example to explain the point cloud encoder and point cloud decoder applicable to the embodiments of the present application.
图4A是本申请实施例提供的点云编码器的示意性框图。FIG. 4A is a schematic block diagram of a point cloud encoder provided in an embodiment of the present application.
由上述可知点云中的点可以包括点的位置信息和点的属性信息,因此,点云中的点的编码主要包括位置编码和属性编码。在一些示例中点云中点的位置信息又称为几何信息,对应的点云中点的位置编码也可以称为几何编码。From the above, we can know that the points in the point cloud can include the location information of the points and the attribute information of the points. Therefore, the encoding of the points in the point cloud mainly includes location encoding and attribute encoding. In some examples, the location information of the points in the point cloud is also called geometric information, and the corresponding location encoding of the points in the point cloud can also be called geometric encoding.
在GPCC编码框架中,点云的几何信息和对应的属性信息是分开编码的。In the GPCC coding framework, the geometric information of the point cloud and the corresponding attribute information are encoded separately.
如下图4A所示,目前G-PCC的几何编解码可分为基于八叉树的几何编解码和基于预测树的几何编解码。As shown in FIG. 4A below, the current geometric coding and decoding of G-PCC can be divided into octree-based geometric coding and decoding and prediction tree-based geometric coding and decoding.
位置编码的过程包括:对点云中的点进行预处理,例如坐标变换、量化和移除重复点等;接着,对预处理后的点云进行几何编码,例如构建八叉树,或构建预测树,基于构建的八叉树或预测树进行几何编码形成几何码流。同时,基于构建的八叉树或预测树输出的位置信息,对点云数据中各点的位置信息进行重建,得到各点的位置信息的重建值。The process of position coding includes: preprocessing the points in the point cloud, such as coordinate transformation, quantization, and removal of duplicate points; then, geometric coding the preprocessed point cloud, such as constructing an octree, or constructing a prediction tree, and geometric coding based on the constructed octree or prediction tree to form a geometric code stream. At the same time, based on the position information output by the constructed octree or prediction tree, the position information of each point in the point cloud data is reconstructed to obtain the reconstructed value of the position information of each point.
属性编码过程包括:通过给定输入点云的位置信息的重建信息和属性信息的原始值,选择三种预测模式的一种进行点云预测,对预测后的结果进行量化,并进行算术编码形成属性码流。The attribute encoding process includes: given the reconstruction information of the input point cloud position information and the original value of the attribute information, selecting one of the three prediction modes for point cloud prediction, quantizing the predicted result, and performing arithmetic coding to form an attribute code stream.
如图4A所示,位置编码可通过以下单元实现:As shown in Figure 4A, position encoding can be achieved by the following units:
坐标转换(Tanmsform coordinates)单元201、体素(Voxelize)单元202、八叉树划分(Analyze octree)单元203、几何重建(Reconstruct geometry)单元204、算术编码(Arithmetic enconde)单元205、表面拟合单元(Analyze surface approximation)206和预测树构建单元207。Coordinate transformation (Tanmsform coordinates) unit 201, voxel (Voxelize) unit 202, octree partition (Analyze octree) unit 203, geometry reconstruction (Reconstruct geometry) unit 204, arithmetic encoding (Arithmetic enconde) unit 205, surface fitting unit (Analyze surface approximation) 206 and prediction tree construction unit 207.
坐标转换单元201可用于将点云中点的世界坐标变换为相对坐标。例如,点的几何坐标分别减去xyz坐标轴的最小值,相当于去直流操作,以实现将点云中的点的坐标从世界坐标转换为相对坐标。The coordinate conversion unit 201 can be used to convert the world coordinates of the point in the point cloud into relative coordinates. For example, the geometric coordinates of the point are respectively subtracted from the minimum value of the xyz coordinate axis, which is equivalent to a DC removal operation, so as to realize the conversion of the coordinates of the point in the point cloud from the world coordinates to the relative coordinates.
体素(Voxelize)单元202也称为量化和移除重复点(Quantize and remove points)单元,可通过量化减少坐标的数目;量化后原先不同的点可能被赋予相同的坐标,基于此,可通过去重操作将重复的点删除;例如,具有相同量化位置和不同属性信息的多个云可通过属性转换合并到一个云中。在本申请的一些实施例中,体素单元202为可选的单元模块。The voxel unit 202 is also called a quantize and remove points unit, which can reduce the number of coordinates by quantization; after quantization, originally different points may be assigned the same coordinates, based on which, duplicate points can be deleted by deduplication operation; for example, multiple clouds with the same quantized position and different attribute information can be merged into one cloud by attribute conversion. In some embodiments of the present application, the voxel unit 202 is an optional unit module.
八叉树划分单元203可利用八叉树(octree)编码方式,编码量化的点的位置信息。例如,将点云按照八叉树的形式进行划分,由此,点的位置可以和八叉树的位置一一对应,通过统计八叉树中有点的位置,并将其标识(flag)记为1,以进行几何编码。The octree division unit 203 may use an octree encoding method to encode the position information of the quantized points. For example, the point cloud is divided in the form of an octree, so that the position of the point can correspond to the position of the octree one by one, and the position of the point in the octree is counted and its flag is recorded as 1 to perform geometric encoding.
在一些实施例中,在基于三角面片集(trianglesoup,trisoup)的几何信息编码过程中,同样也要通过八叉树划分单元203对点云进行八叉树划分,但区别于基于八叉树的几何信息编码,该trisoup不需要将点云逐级划分到边长为1X1X1的单位立方体,而是划分到block(子块)边长为W时停止划分,基于每个block中点云的分布所形成的表面,得到该表面与block的十二条边所产生的至多十二个vertex(交点),通过表面拟合单元206对交点进行表面拟合,对拟合后的交点进行几何编码。In some embodiments, in the process of geometric information encoding based on triangle soup (trisoup), the point cloud is also divided into octrees by the octree division unit 203. However, different from the geometric information encoding based on the octree, the trisoup does not need to divide the point cloud into unit cubes with a side length of 1X1X1 step by step, but stops dividing when the block (sub-block) has a side length of W. Based on the surface formed by the distribution of the point cloud in each block, at most twelve vertices (intersections) generated by the surface and the twelve edges of the block are obtained, and the intersections are surface fitted by the surface fitting unit 206, and the fitted intersections are geometrically encoded.
预测树构建单元207可利用预测树编码方式,编码量化的点的位置信息。例如,将点云按照预测树的形式进行划分,由此,点的位置可以和预测树中节点的位置一一对应,通过统计预测树中有点的位置,通过选取不同的预测模式对节点的几何位置信息进行预测得到预测残差,并且利用量化参数对几何预测残差进行量化。最终通过不断迭代,对预测树节点位置信息的预测残差、预测树结构以及量化参数等进行编码,生成二进制码流。The prediction tree construction unit 207 can use the prediction tree encoding method to encode the position information of the quantized points. For example, the point cloud is divided in the form of a prediction tree, so that the position of the point can correspond to the position of the node in the prediction tree one by one. By counting the positions of the points in the prediction tree, the geometric position information of the node is predicted by selecting different prediction modes to obtain the prediction residual, and the geometric prediction residual is quantized using the quantization parameter. Finally, through continuous iteration, the prediction residual of the prediction tree node position information, the prediction tree structure and the quantization parameter are encoded to generate a binary code stream.
几何重建单元204可以基于八叉树划分单元203输出的位置信息或表面拟合单元206拟合后的交点进行位置重建,得到点云数据中各点的位置信息的重建值。或者,基于预测树构建单元207输出的位置信息进行位置重建,得到点云数据中各点的位置信息的重建值。The geometric reconstruction unit 204 can perform position reconstruction based on the position information output by the octree division unit 203 or the intersection points fitted by the surface fitting unit 206 to obtain the reconstructed value of the position information of each point in the point cloud data. Alternatively, the position reconstruction can be performed based on the position information output by the prediction tree construction unit 207 to obtain the reconstructed value of the position information of each point in the point cloud data.
算术编码单元205可以采用熵编码方式对八叉树分析单元203输出的位置信息或对表面拟合单元206拟合后的交点,或者预测树构建单元207输出的几何预测残差值进行算术编码,生成几何码流;几何码流也可称为几何比特流(geometry bitstream)。The arithmetic coding unit 205 can use entropy coding to perform arithmetic coding on the position information output by the octree analysis unit 203 or the intersection points fitted by the surface fitting unit 206, or the geometric prediction residual values output by the prediction tree construction unit 207 to generate a geometric code stream; the geometric code stream can also be called a geometry bitstream.
属性编码可通过以下单元实现:Attribute encoding can be achieved through the following units:
颜色转换(Transform colors)单元210、重着色(Transfer attributes)单元211、区域自适应分层变换(Region Adaptive Hierarchical Transform,RAHT)单元212、生成LOD(Generate LOD)单元213以及提升(lifting transform)单元214、 量化系数(Quantize coefficients)单元215以及算术编码单元216。A color conversion (Transform colors) unit 210, a recoloring (Transfer attributes) unit 211, a Region Adaptive Hierarchical Transform (RAHT) unit 212, a Generate LOD (Generate LOD) unit 213, a lifting (lifting transform) unit 214, a Quantize coefficients (Quantize coefficients) unit 215 and an arithmetic coding unit 216.
需要说明的是,点云编码器200可包含比图4A更多、更少或不同的功能组件。It should be noted that the point cloud encoder 200 may include more, fewer, or different functional components than those shown in FIG. 4A .
颜色转换单元210可用于将点云中点的RGB色彩空间变换为YCbCr格式或其他格式。The color conversion unit 210 may be used to convert the RGB color space of the points in the point cloud into a YCbCr format or other formats.
重着色单元211利用重建的几何信息,对颜色信息进行重新着色,使得未编码的属性信息与重建的几何信息对应起来。The recoloring unit 211 recolors the color information using the reconstructed geometric information so that the uncoded attribute information corresponds to the reconstructed geometric information.
经过重着色单元211转换得到点的属性信息的原始值后,可选择任一种变换单元,对点云中的点进行变换。变换单元可包括:RAHT变换212和提升(lifting transform)单元214。其中,提升变化依赖生成细节层(level of detail,LOD)。After the original value of the attribute information of the point is converted by the recoloring unit 211, any transformation unit can be selected to transform the points in the point cloud. The transformation unit may include: RAHT transformation 212 and lifting (lifting transform) unit 214. Among them, the lifting transformation depends on generating a level of detail (LOD).
RAHT变换和提升变换中的任一项可以理解为用于对点云中点的属性信息进行预测,以得到点的属性信息的预测值,进而基于点的属性信息的预测值得到点的属性信息的残差值。例如,点的属性信息的残差值可以是点的属性信息的原始值减去点的属性信息的预测值。Any of the RAHT transformation and the lifting transformation can be understood as being used to predict the attribute information of a point in a point cloud to obtain a predicted value of the attribute information of the point, and then obtain a residual value of the attribute information of the point based on the predicted value of the attribute information of the point. For example, the residual value of the attribute information of the point can be the original value of the attribute information of the point minus the predicted value of the attribute information of the point.
在本申请的一实施例中,生成LOD单元生成LOD的过程包括:根据点云中点的位置信息,获取点与点之间的欧式距离;根据欧式距离,将点分为不同的细节表达层。在一个实施例中,可以将欧式距离进行排序后,将不同范围的欧式距离划分为不同的细节表达层。例如,可以随机挑选一个点,作为第一细节表达层。然后计算剩余点与该点的欧式距离,并将欧式距离符合第一阈值要求的点,归为第二细节表达层。获取第二细节表达层中点的质心,计算除第一、第二细节表达层以外的点与该质心的欧式距离,并将欧式距离符合第二阈值的点,归为第三细节表达层。以此类推,将所有的点都归到细节表达层中。通过调整欧式距离的阈值,可以使得每层LOD层的点的数量是递增的。应理解,LOD划分的方式还可以采用其它方式,本申请对此不进行限制。In one embodiment of the present application, the process of generating LOD by the LOD generating unit includes: obtaining the Euclidean distance between points according to the position information of the points in the point cloud; and dividing the points into different detail expression layers according to the Euclidean distance. In one embodiment, the Euclidean distances can be sorted and the Euclidean distances in different ranges can be divided into different detail expression layers. For example, a point can be randomly selected as the first detail expression layer. Then the Euclidean distances between the remaining points and the point are calculated, and the points whose Euclidean distances meet the first threshold requirement are classified as the second detail expression layer. The centroid of the points in the second detail expression layer is obtained, and the Euclidean distances between the points other than the first and second detail expression layers and the centroid are calculated, and the points whose Euclidean distances meet the second threshold are classified as the third detail expression layer. By analogy, all points are classified into the detail expression layer. By adjusting the threshold of the Euclidean distance, the number of points in each LOD layer can be increased. It should be understood that the LOD division method can also be adopted in other ways, and the present application does not limit this.
需要说明的是,可以直接将点云划分为一个或多个细节表达层,也可以先将点云划分为多个点云切块(slice),再将每一个点云切块划分为一个或多个LOD层。It should be noted that the point cloud may be directly divided into one or more detail expression layers, or the point cloud may be first divided into a plurality of point cloud slices, and then each point cloud slice may be divided into one or more LOD layers.
例如,可将点云划分为多个点云切块,每个点云切块的点的个数可以在55万-110万之间。每个点云切块可看成单独的点云。每个点云切块又可以划分为多个细节表达层,每个细节表达层包括多个点。在一个实施例中,可根据点与点之间的欧式距离,进行细节表达层的划分。For example, the point cloud can be divided into multiple point cloud blocks, and the number of points in each point cloud block can be between 550,000 and 1.1 million. Each point cloud block can be regarded as a separate point cloud. Each point cloud block can be divided into multiple detail expression layers, and each detail expression layer includes multiple points. In one embodiment, the detail expression layer can be divided according to the Euclidean distance between points.
量化单元215可用于量化点的属性信息的残差值。例如,若量化单元215和RAHT变换单元212相连,则量化单元215可用于量化RAHT变换单元212输出的点的属性信息的残差值。The quantization unit 215 may be used to quantize the residual value of the attribute information of the point. For example, if the quantization unit 215 is connected to the RAHT transformation unit 212, the quantization unit 215 may be used to quantize the residual value of the attribute information of the point output by the RAHT transformation unit 212.
算术编码单元216可使用零行程编码(Zero run length coding)对点的属性信息的残差值进行熵编码,以得到属性码流。所述属性码流可以是比特流信息。The arithmetic coding unit 216 may use zero run length coding to perform entropy coding on the residual value of the attribute information of the point to obtain an attribute code stream. The attribute code stream may be bit stream information.
图4B是本申请实施例提供的点云解码器的示意性框图。Figure 4B is a schematic block diagram of a point cloud decoder provided in an embodiment of the present application.
如图4B所示,解码器300可以从编码设备获取点云码流,通过解析码得到点云中的点的位置信息和属性信息。点云的解码包括位置解码和属性解码。As shown in Fig. 4B, the decoder 300 can obtain the point cloud code stream from the encoding device, and obtain the position information and attribute information of the points in the point cloud by parsing the code. The decoding of the point cloud includes position decoding and attribute decoding.
位置解码的过程包括:对几何码流进行算术解码;构建八叉树后进行合并,对点的位置信息进行重建,以得到点的位置信息的重建信息;对点的位置信息的重建信息进行坐标变换,得到点的位置信息。点的位置信息也可称为点的几何信息。The process of position decoding includes: performing arithmetic decoding on the geometric code stream; merging after building the octree, reconstructing the position information of the point to obtain the reconstructed information of the point position information; performing coordinate transformation on the reconstructed information of the point position information to obtain the point position information. The point position information can also be called the geometric information of the point.
属性解码过程包括:通过解析属性码流,获取点云中点的属性信息的残差值;通过对点的属性信息的残差值进行反量化,得到反量化后的点的属性信息的残差值;基于位置解码过程中获取的点的位置信息的重建信息,选择如下RAHT逆变换和提升逆变换中的一种进行点云预测,得到预测值,预测值与残差值相加得到点的属性信息的重建值;对点的属性信息的重建值进行颜色空间逆转换,以得到解码点云。The attribute decoding process includes: obtaining the residual value of the attribute information of the point in the point cloud by parsing the attribute code stream; obtaining the residual value of the attribute information of the point after dequantization by dequantizing the residual value of the attribute information of the point; based on the reconstruction information of the point position information obtained in the position decoding process, selecting one of the following RAHT inverse transform and lifting inverse transform to predict the point cloud to obtain the predicted value, and adding the predicted value to the residual value to obtain the reconstructed value of the attribute information of the point; performing color space inverse conversion on the reconstructed value of the attribute information of the point to obtain the decoded point cloud.
如图4B所示,位置解码可通过以下单元实现:As shown in FIG4B , position decoding can be achieved by the following units:
算数解码单元301、八叉树重构(synthesize octree)单元302、表面重构单元(Synthesize suface approximation)303、几何重建(Reconstruct geometry)单元304、坐标系反变换(inverse transform coordinates)单元305和预测树重建单元306。 Arithmetic decoding unit 301, octree reconstruction (synthesize octree) unit 302, surface reconstruction unit (Synthesize suface approximation) 303, geometry reconstruction (Reconstruct geometry) unit 304, inverse transform coordinates (inverse transform coordinates) unit 305 and prediction tree reconstruction unit 306.
属性编码可通过以下单元实现:Attribute encoding can be achieved through the following units:
算数解码单元310、反量化(inverse quantize)单元311、RAHT逆变换单元312、生成LOD(Generate LOD)单元313、提升逆变换(Inverse lifting)单元314以及颜色反变换(inverse trasform colors)单元315。 Arithmetic decoding unit 310, inverse quantize unit 311, RAHT inverse transform unit 312, generate LOD unit 313, inverse lifting unit 314 and inverse trasform colors unit 315.
需要说明的是,解压缩是压缩的逆过程,类似的,解码器300中的各个单元的功能可参见编码器200中相应的单元的功能。另外,点云解码器300可包含比图4B更多、更少或不同的功能组件。It should be noted that decompression is the inverse process of compression. Similarly, the functions of each unit in the decoder 300 can refer to the functions of the corresponding units in the encoder 200. In addition, the point cloud decoder 300 may include more, fewer or different functional components than those in FIG. 4B.
例如,解码器300可根据点云中点与点之间的欧式距离将点云划分为多个LOD;然后,依次对LOD中点的属性信息进行解码;例如,计算零行程编码技术中零的数量(zero_cnt),以基于zero_cnt对残差进行解码;接着,解码框架200可基于解码出的残差值进行反量化,并基于反量化后的残差值与当前点的预测值相加得到该点云的重建值,直到解码完所有的点云。当前点将会作为后续LOD中点的最邻近点,并利用当前点的重建值对后续点的属性信息进行预测。For example, the decoder 300 can divide the point cloud into multiple LODs according to the Euclidean distance between points in the point cloud; then, the attribute information of the points in the LOD is decoded in sequence; for example, the number of zeros (zero_cnt) in the zero-run encoding technology is calculated to decode the residual based on zero_cnt; then, the decoding framework 200 can perform inverse quantization based on the decoded residual value, and add the inverse quantized residual value to the predicted value of the current point to obtain the reconstruction value of the point cloud until all point clouds are decoded. The current point will be used as the nearest point of the subsequent LOD point, and the reconstruction value of the current point will be used to predict the attribute information of the subsequent point.
上述是基于GPCC编解码框架下的点云编解码器的基本流程,随着技术的发展,该框架或流程的一些模块或步骤可能会被优化,本申请适用于该基于GPCC编解码框架下的点云编解码器的基本流程,但不限于该框架及流程。The above is the basic process of the point cloud codec based on the GPCC codec framework. With the development of technology, some modules or steps of the framework or process may be optimized. This application is applicable to the basic process of the point cloud codec based on the GPCC codec framework, but is not limited to the framework and process.
下面对基于八叉树的几何编码和基于预测树的几何编码进行介绍。The following introduces octree-based geometric coding and prediction tree-based geometric coding.
基于八叉树的几何编码包括:首先对几何信息进行坐标转换,使点云全都包含在一个bounding box(包围盒)中。然后再进行量化,这一步量化主要起到缩放的作用,由于量化取整,使得一部分点的几何信息相同,根据参数来决定是否移除重复点,量化和移除重复点这一过程又被称为体素化过程。接下来,按照广度优先遍历的顺序不断对bounding  box进行树划分(八叉树/四叉树/二叉树),对每个节点的占位码进行编码。在一种隐式几何的划分方式中,首先计算点云的包围盒
Figure PCTCN2023071069-appb-000002
假设,该d x>d y>d z包围盒对应为一个长方体。在几何划分时,首先会基于x轴一直进行二叉树划分,得到两个子节点;直到满足d x=d y>d z条件时,才会基于x和y轴一直进行四叉树划分,得到四个子节点;当最终满足d x=d y=d z条件时,会一直进行八叉树划分,直到划分得到的叶子结点为1x1x1的单位立方体时停止划分,对叶子结点中的点进行编码,生成二进制码流。在基于二叉树/四叉树/八叉树划分的过程中,引入两个参数:K、M。参数K指示在进行八叉树划分之前二叉树/四叉树划分的最多次数;参数M用来指示在进行二叉树/四叉树划分时对应的最小块边长为2 M。同时K和M必须满足条件:假设d max=max(d x,d y,d z),d min=min(d x,d y,d z),参数K满足:K>=d max-d min;参数M满足:M>=d min。参数K与M之所以满足上述的条件,是因为目前G-PCC在几何隐式划分的过程中,划分方式的优先级为二叉树、四叉树和八叉树,当节点块大小不满足二叉树/四叉树的条件时,才会对节点一直进行八叉树的划分,直到划分到叶子节点最小单位1X1X1。
Octree-based geometric encoding includes: first, coordinate transformation of geometric information so that all point clouds are contained in a bounding box. Then quantization is performed. This step of quantization mainly plays a role of scaling. Due to quantization rounding, the geometric information of some points is the same. Whether to remove duplicate points is determined based on parameters. The process of quantization and removal of duplicate points is also called voxelization. Next, the bounding box is continuously divided into trees (octree/quadtree/binary tree) in the order of breadth-first traversal, and the placeholder code of each node is encoded. In an implicit geometric division method, the bounding box of the point cloud is first calculated.
Figure PCTCN2023071069-appb-000002
Assume that the bounding box of dx > dy > dz corresponds to a cuboid. During geometric partitioning, binary tree partitioning will be performed based on the x-axis to obtain two child nodes. When the condition of dx = dy > dz is met, quadtree partitioning will be performed based on the x-axis and y-axis to obtain four child nodes. When the condition of dx = dy = dz is finally met, octree partitioning will be performed until the leaf node obtained by partitioning is a 1x1x1 unit cube. The partitioning will be stopped, and the points in the leaf node will be encoded to generate a binary code stream. In the process of binary tree/quadtree/octree partitioning, two parameters are introduced: K and M. Parameter K indicates the maximum number of binary tree/quadtree partitioning before octree partitioning; parameter M is used to indicate that the minimum block side length corresponding to binary tree/quadtree partitioning is 2M . At the same time, K and M must meet the following conditions: Assuming d max = max(d x , dy , d z ), d min = min(d x , dy , d z ), parameter K satisfies: K>=d max -d min ; parameter M satisfies: M>=d min . The reason why parameters K and M meet the above conditions is that in the process of geometric implicit partitioning of G-PCC, the priority of partitioning is binary tree, quadtree and octree. When the node block size does not meet the conditions of binary tree/quadtree, the node will be partitioned into octree until it is partitioned into the smallest unit of leaf node 1X1X1.
基于八叉树的几何信息编码模式可以通过利用空间中相邻点之间的相关性来对点云的几何信息进行有效的编码,但是对于一些较为平坦的节点或者具有平面特性的节点,通过利用平面编码可以进一步提升点云几何信息的编码效率。The octree-based geometric information encoding mode can effectively encode the geometric information of the point cloud by utilizing the correlation between adjacent points in space. However, for some relatively flat nodes or nodes with planar characteristics, the encoding efficiency of the point cloud geometric information can be further improved by using plane coding.
示例性的,如图5A所示,(a)系列属于Z轴方向的低平面位置,(b)轴系列属于Z轴方向的高平面位置。以(a)为例子,可以看到当前节点中被占据的四个子节点都位于当前节点在Z轴方向的低平面位置,那么可以认为当前节点属于一个Z平面并且在Z轴方向是一个低平面。同样的,(b)表示的是当前节点中被占据的子节点位于当前节点在Z轴方向的高平面位置。For example, as shown in FIG5A , the (a) series belongs to the low plane position in the Z-axis direction, and the (b) series belongs to the high plane position in the Z-axis direction. Taking (a) as an example, it can be seen that the four occupied subnodes in the current node are all located in the low plane position of the current node in the Z-axis direction, so it can be considered that the current node belongs to a Z plane and is a low plane in the Z-axis direction. Similarly, (b) indicates that the occupied subnodes in the current node are located in the high plane position of the current node in the Z-axis direction.
下面以(a)为例,对八叉树编码和平面编码效率进行比较。如图5B所示,对图1中的(a)采用八叉树编码方式,那么当前节点的占位信息表示为:11001100。但是如果采用平面编码方式,首先需要编码一个标识符表示当前节点在Z轴方向是一个平面,其次如果当前节点在Z轴方向是一个平面,需要对当前节点的平面位置进行表示。其次仅仅需要对Z轴方向的低平面节点占位信息进行编码(即0246四个子节点的占位信息),因此基于平面编码方式对当前节点进行编码,仅仅需要编码6个bit,相比原本的八叉树编码可以减少2个bit的表示。基于此分析,平面编码相比八叉树编码具有较为明显的编码效率。因此,对于一个被占据的节点,如果在某一个维度上采用平面编码方式进行编码,则如图5C所示,首先需要对当前节点在该维度上的平面标识(planarMode)和平面位置(PlanePos)信息进行表示,其次基于当前节点的平面信息来对当前节点的占位信息进行编码。需要注意的是:PlaneMode i(i=0,1,2):0代表当前节点在i轴方向不是一个平面,当节点在i轴方向是一个平面时,PlanePosition i:0代表当前节点在i轴方向是一个平面,并且平面位置为低平面,1表示当前节点在i轴方向上是一个高平面。示例性的,i=0表示X轴,i=1表示Y轴,i=2表示Z轴。 Taking (a) as an example, the efficiency of octree coding and plane coding is compared. As shown in Figure 5B, if the octree coding method is used for (a) in Figure 1, the placeholder information of the current node is represented as: 11001100. However, if the plane coding method is used, first, an identifier needs to be encoded to indicate that the current node is a plane in the Z-axis direction. Secondly, if the current node is a plane in the Z-axis direction, the plane position of the current node needs to be represented. Secondly, only the placeholder information of the low plane node in the Z-axis direction needs to be encoded (that is, the placeholder information of the four child nodes 0246). Therefore, based on the plane coding method, only 6 bits need to be encoded to encode the current node, which can reduce the representation of 2 bits compared to the original octree coding. Based on this analysis, plane coding has a more obvious coding efficiency than octree coding. Therefore, for an occupied node, if a plane encoding method is used for encoding in a certain dimension, as shown in FIG5C , firstly, the plane identification (planarMode) and plane position (PlanePos) information of the current node in the dimension need to be represented, and secondly, the occupancy information of the current node is encoded based on the plane information of the current node. It should be noted that: PlaneMode i (i=0,1,2): 0 represents that the current node is not a plane in the i-axis direction. When the node is a plane in the i-axis direction, PlanePosition i : 0 represents that the current node is a plane in the i-axis direction, and the plane position is a low plane, and 1 represents that the current node is a high plane in the i-axis direction. Exemplarily, i=0 represents the X-axis, i=1 represents the Y-axis, and i=2 represents the Z-axis.
下面将详细介绍一下当前G-PCC标准中,判断一个节点是否满足平面编码的条件以及当节点满足平面编码条件时,对节点平面标识和平面位置信息的预测编码。The following is a detailed introduction to the current G-PCC standard for determining whether a node meets the plane coding conditions and predictive coding of the node plane identification and plane position information when the node meets the plane coding conditions.
当前G-PCC中存在3种判断节点是否满足平面编码的判断条件,下面逐一进行介绍:Currently, there are three types of judgment conditions in G-PCC to determine whether a node meets the plane coding criteria. The following describes them one by one:
第1种:根据节点在每个维度上的平面概率进行判断。The first type: judge based on the plane probability of the node in each dimension.
首先确定当前节点的局部区域密度(local_node_density),以及当前节点在每个维度上的概率Prob(i)。First, determine the local area density (local_node_density) of the current node and the probability Prob(i) of the current node in each dimension.
当节点的局部区域密度小于阈值Th(Th=3)时,利用当前节点在三维上的平面概率Prob(i)和阈值Th0、Th1和Th2进行比较,其中Th0<Th1<Th2(Th0=0.6,Th1=0.77,Th2=0.88)。下面利用Eligible i(i=0,1,2)表示每个维度上是否启动平面编码,其中Eligible i的判断过程如公式(1)所示,例如若Eligible i>=threshold则表示第i维度上启动平面编码: When the local area density of the node is less than the threshold Th (Th = 3), the plane probability Prob (i) of the current node in three dimensions is compared with the thresholds Th0, Th1 and Th2, where Th0 < Th1 < Th2 (Th0 = 0.6, Th1 = 0.77, Th2 = 0.88). Eligible i (i = 0, 1, 2) is used below to indicate whether plane coding is enabled in each dimension, where the judgment process of Eligible i is shown in formula (1). For example, if Eligible i > = threshold, it means that plane coding is enabled in the i-th dimension:
Eligible i=Prob(i)>=threshold   (1) Eligible i = Prob(i)> = threshold (1)
需要注意的是threshold是进行自适应变化的,例如:当Prob(0)>Prob(1)>Prob(2)时,则threshold取值如公式(2)所示:It should be noted that the threshold is adaptively changed. For example, when Prob(0)>Prob(1)>Prob(2), the threshold value is as shown in formula (2):
Eligible 0=Prob(0)>=Th0 Eligible 0 = Prob(0) > = Th0
Eligible 1=Prob(1)>=Th1 Eligible 1 = Prob(1) > = Th1
Eligible 2=Prob(2)>=Th2   (2) Eligible 2 = Prob(2) > = Th2 (2)
下面介绍local_node_density的更新过程以及Prob(i)的更新。The following describes the update process of local_node_density and the update of Prob(i).
在一种示例中,Prob(i)通过如下公式(3)进行更新:In one example, Prob(i) is updated by the following formula (3):
Prob(i) new=(Lx Prob(i)+δ(coded node))/L+1   (3) Prob(i) new = (L x Prob(i) + δ(coded node)) / L + 1 (3)
其中,L=255,当coded node节点是一个平面时,则为1否则为0。Where L=255, when the coded node is a plane, it is 1, otherwise it is 0.
在一种示例中,local_node_density通过如下公式(4)进行更新:In one example, local_node_density is updated by the following formula (4):
local_node_density new=local_node_density+4*numSiblings    (4) local_node_density new = local_node_density + 4*numSiblings (4)
其中,local_node_density初始化为4,numSiblings为节点的兄弟姐妹节点数目,如图5D所示,当前节点为左侧节点,右侧节点为当前节点的兄弟姐妹节点,则当前节点的兄弟姐妹节点数目为5(包括自身)。Among them, local_node_density is initialized to 4, numSiblings is the number of siblings of the node, as shown in Figure 5D, the current node is the left node, the right node is the sibling of the current node, then the number of siblings of the current node is 5 (including itself).
第2种:根据当前层的点云密度来判断当前层节点是否满足平面编码。The second method: Determine whether the current layer nodes meet the plane coding requirements based on the point cloud density of the current layer.
利用当前层中点的密度来判断是否对当前层的节点进行平面编码。假设当前待编码点云的点数为pointCount,经过IDCM编码已经重建出的点数为numPointCountRecon,又因为八叉树是基于广度优先遍历的顺序进行编码,因此可以得到当前层待编码的节点数目假设为nodeCount,则假设通过planarEligibleKOctreeDepth表示当前层是否启动平面编码。其中,planarEligibleKOctreeDepth的判断过程如公式(5)所示:The density of the points in the current layer is used to determine whether to perform plane coding on the nodes in the current layer. Assuming that the number of points in the current point cloud to be coded is pointCount, the number of points reconstructed after IDCM coding is numPointCountRecon, and because the octree is coded in the order of breadth-first traversal, the number of nodes to be coded in the current layer can be obtained as nodeCount. It is assumed that planarEligibleKOctreeDepth is used to indicate whether the current layer starts plane coding. The judgment process of planarEligibleKOctreeDepth is shown in formula (5):
planarEligibleKOctreeDepth=(pointCount-numPointCountRecon)<nodeCount*1.3   (5)planarEligibleKOctreeDepth=(pointCount-numPointCountRecon)<nodeCount*1.3   (5)
当planarEligibleKOctreeDepth为true时,则当前层中的所有节点都进行平面编码;否则不进行平面编码,仅仅采用八叉树编码。When planarEligibleKOctreeDepth is true, all nodes in the current layer are plane coded; otherwise, no plane coding is performed and only octree coding is used.
第3种:根据激光雷达点云的采集参数来判断当前节点是否满足平面编码。The third method is to determine whether the current node meets the plane coding requirements based on the acquisition parameters of the lidar point cloud.
如图5E所示,可以看到上方大的正方体节点同时被两个Laser穿过,因此当前节点在Z轴垂直方向上不是一个平 面,下方小的正方体节点足够小到不能同时被两个节点同时穿过,因此有可能是一个平面。因此,可以基于当前节点对应的Laser个数,判断当前节点是否满足平面编码。As shown in Figure 5E, we can see that the large cube node on the top is traversed by two lasers at the same time, so the current node is not a plane in the vertical direction of the Z axis, and the small cube node on the bottom is small enough that it cannot be traversed by two nodes at the same time, so it is possible to be a plane. Therefore, we can judge whether the current node meets the plane coding based on the number of lasers corresponding to the current node.
下面将介绍目前针对满足平面编码条件的节点,平面标识信息和平面位置信息的预测编码。The following will introduce the predictive coding of plane identification information and plane position information for nodes that currently meet the plane coding conditions.
一、平面标识信息的预测编码1. Predictive Coding of Plane Marking Information
目前采用三个上下文对平面标识信息进行编码,即各个维度上的平面表示分开进行设计上下文。Currently, three contexts are used to encode the plane identification information, that is, the plane representation in each dimension is separately designed in context.
下面对非激光雷达点云和激光雷达点云的平面位置信息的编码进行分别介绍。The encoding of the planar position information of non-lidar point clouds and lidar point clouds is introduced separately below.
一)、非激光雷达点云平面位置信息的编码1) Coding of non-lidar point cloud planar position information
1、平面位置信息的预测编码。1. Predictive coding of planar position information.
平面位置信息基于如下信息进行预测编码:The plane position information is predictively coded based on the following information:
(1)利用邻域节点占位信息进行预测得到当前节点的平面位置信息为三元素:预测为低平面、预测为高平面和无法预测;(1) Using the occupancy information of neighboring nodes, the plane position information of the current node is predicted to be three elements: predicted as a low plane, predicted as a high plane, and unpredictable;
(2)与当前节点在相同的划分深度以及相同的坐标下的节点与当前节点之间的空间距离“近”和“远”;(2) The spatial distance between the nodes at the same partition depth and the same coordinates as the current node and the current node is “close” or “far”;
(3)与当前节点在相同的划分深度以及相同的坐标下的节点平面位置;(3) The plane position of the node at the same partition depth and the same coordinates as the current node;
(4)坐标维度(i=0,1,2)。(4) Coordinate dimension (i=0, 1, 2).
如图5F所示,当前待编码节点为左侧节点,则在相同的八叉树划分深度等级下,以及相同的垂直坐标下查找邻域节点为右侧节点,判断两个节点之间的距离为“近”和“远”,并且参考节点的平面位置。As shown in Figure 5F, the current node to be encoded is the left node, then the neighboring node is searched for as the right node at the same octree partition depth level and the same vertical coordinate, the distance between the two nodes is judged as "near" and "far", and the plane position of the reference node is used.
在一种示例中,如图5G所示,黑色节点为当前节点,若当前节点位于父节点的低平面时,通过如下方式,确定当前节点的平面位置:In one example, as shown in FIG5G , the black node is the current node. If the current node is located at the lower plane of the parent node, the plane position of the current node is determined in the following manner:
a)、如果斜划线节点的子节点4到7中有任何一个被占用,而所有点状节点都未被占用,则极有可能当前节点中存在一个平面,且该平面位置较低。a) If any of the child nodes 4 to 7 of the oblique line node is occupied, and all the dot nodes are not occupied, it is very likely that there is a plane in the current node, and the plane is at a lower position.
b)、如果斜划线节点的子节点4到7都未被占用,而任何点状节点被占用,则极有可能在当前节点中存在一个平面,且该平面位置较高。b) If the child nodes 4 to 7 of the oblique line node are not occupied, and any dot node is occupied, it is very likely that there is a plane in the current node, and the plane is at a higher position.
c)、如果斜划线节点的子节点4到7均为空节点,点状节点均为空节点,则无法推断平面位置,故标记为未知。c) If the child nodes 4 to 7 of the oblique line node are all empty nodes and the dot nodes are all empty nodes, the plane position cannot be inferred and is therefore marked as unknown.
如果斜划线节点的子节点4到7中有任何一个被占用,而点状节点中有任何一个被占用,则无法推断出平面位置,因此将其标记为未知。If any of the children 4 to 7 of the dashed node are occupied and any of the dotted nodes are occupied, the plane position cannot be inferred and is therefore marked as unknown.
在另一种示例中,如图5H所示,黑色节点为当前节点,若当节点处于父节点高平面位置时,则通过如下方式,确定当前节点的平面位置:In another example, as shown in FIG5H , the black node is the current node. If the node is at a high plane position of the parent node, the plane position of the current node is determined in the following manner:
a)、如果点状节点的子节点4到7中有任何一个节点被占用,而斜划线节点未被占用,则极有可能在当前节点中存在一个平面,且平面位置较低。a) If any of the child nodes 4 to 7 of the dot node is occupied, and the dashed node is not occupied, it is very likely that there is a plane in the current node, and the plane position is lower.
b)、如果点状节点的子节点4~7均未被占用,而斜划线节点被占用,则极有可能在当前节点中存在平面,且平面位置较高。b) If the child nodes 4 to 7 of the dot node are not occupied, but the oblique line node is occupied, it is very likely that there is a plane in the current node, and the plane position is relatively high.
c)、如果点状节点的子节点4~7都是未被占用的,而斜划线节点是未被占用的,无法推断平面位置,因此标记为未知。c) If the child nodes 4 to 7 of the dot node are all unoccupied, and the oblique line node is unoccupied, the plane position cannot be inferred, so it is marked as unknown.
d)、如果点状节点的子节点4-7中有一个被占用,而斜划线节点被占用,无法推断平面位置,因此标记为未知。d) If one of the child nodes 4-7 of the dot node is occupied, and the slash node is occupied, the plane position cannot be inferred and is therefore marked as unknown.
二)、激光雷达点云平面位置信息的编码2) Coding of planar position information of laser radar point cloud
图5I为激光雷达点云平面位置信息的预测编码,通过利用激光雷达采集参数来预测当前节点的平面位置,通过利用当前节点与激光射线相交的位置来将位置量化为四个区间,最终作为当前节点平面位置的上下文。具体计算过程如下:假设激光雷达的坐标为(x Lidar,y Lidar,z Lidar),当前点的几何坐标为(x,y,z),则首先计算当前点相对于激光雷达的垂直正切值tanθ,计算过程如公式(6)所示: Figure 5I is the predictive coding of the plane position information of the laser radar point cloud. The plane position of the current node is predicted by using the laser radar acquisition parameters, and the position is quantized into four intervals by using the position where the current node intersects with the laser ray, and finally used as the context of the plane position of the current node. The specific calculation process is as follows: Assuming that the coordinates of the laser radar are (x Lidar , y Lidar , z Lidar ), and the geometric coordinates of the current point are (x, y, z), first calculate the vertical tangent value tanθ of the current point relative to the laser radar. The calculation process is shown in formula (6):
Figure PCTCN2023071069-appb-000003
Figure PCTCN2023071069-appb-000003
又因为每个Laser会相对于激光雷达有一定偏移角度,因此会计算当前节点相对于Laser的相对正切值tanθ corr,L,具体计算过程如公式(7)所示: Since each laser has a certain offset angle relative to the laser radar, the relative tangent value tanθ corr,L of the current node relative to the laser is calculated. The specific calculation process is shown in formula (7):
Figure PCTCN2023071069-appb-000004
Figure PCTCN2023071069-appb-000004
最终会利用当前节点的修正正切值来对当前节点的平面位置进行预测,具体如下,假设当前节点下边界的正切值为tan(θ底部),上边界的正切值为tan(θ顶部),根据tanθ corr,L将平面位置量化为4个量化区间,即平面位置的上下文。 Finally, the corrected tangent value of the current node is used to predict the plane position of the current node. Specifically, assuming that the tangent value of the lower boundary of the current node is tan(θ bottom ), and the tangent value of the upper boundary is tan(θ top ), the plane position is quantized into 4 quantization intervals according to tanθ corr,L , which is the context of the plane position.
但基于八叉树的几何信息编码模式仅对空间中具有相关性的点有高效的压缩速率,而对于在几何空间中处于孤立位置的点来说,使用直接编码模式(Direct Coding Model,简称DCM)可以大大降低复杂度。对于八叉树中的所有节点,DCM的使用不是通过标志位信息来表示的,而是通过当前节点父节点和邻居信息来进行推断得到。判断当前节点是否具有DCM编码资格的方式有三种,如图6A所示:However, the octree-based geometric information coding mode only has an efficient compression rate for points with correlation in space. For points in isolated positions in the geometric space, the use of the direct coding model (DCM) can greatly reduce the complexity. For all nodes in the octree, the use of DCM is not represented by flag information, but is inferred from the parent node and neighbor information of the current node. There are three ways to determine whether the current node is eligible for DCM coding, as shown in Figure 6A:
(1)当前节点没有兄弟姐妹子节点,即当前节点的父节点只有一个孩子节点,同时当前节点父节点的父节点仅有两个被占据子节点,即当前节点最多只有一个邻居节点。(1) The current node has no sibling child nodes, that is, the parent node of the current node has only one child node, and the parent node of the parent node of the current node has only two occupied child nodes, that is, the current node has at most one neighbor node.
(2)当前节点的父节点仅有当前节点一个占据子节点,同时与当前节点共用一个面的六个邻居节点也都属于空节点。(2) The parent node of the current node has only one child node, the current node. At the same time, the six neighbor nodes that share a face with the current node are also empty nodes.
(3)当前节点的兄弟姐妹节点数目大于1。(3) The number of sibling nodes of the current node is greater than 1.
如果当前节点不具有DCM编码资格将对其进行八叉树划分,若具有DCM编码资格将进一步判断该节点中包含的点数,当点数小于阈值2时,则对该节点进行DCM编码,否则将继续进行八叉树划分。当应用DCM编码模式时,首先需要编码当前节点是否是一个真正的孤立点,即IDCM_flag,当IDCM_flag为true时,则当前节点采用DCM编码,否则仍然采用八叉树编码。当前节点满足DCM编码时,需要编码当前节点的DCM编码模式,目前存在两种DCM模式,分别是:1:仅仅只有一个点存在(或者是多个点,但是属于重复点);2:含有两个点。最后需要编码每个点 的几何信息,假设节点的边长为2 d时,对该节点几何坐标的每一个分量进行编码时需要d比特,该比特信息直接被编进码流中。这里需要注意的是,在对激光雷达点云进行编码时,通过利用激光雷达采集参数来对三个维度的坐标信息进行预测编码,从而可以进一步提升几何信息的编码效率。 If the current node does not have the DCM coding qualification, it will be divided into octrees. If it has the DCM coding qualification, the number of points contained in the node will be further determined. When the number of points is less than the threshold 2, the node will be DCM-encoded, otherwise the octree division will continue. When the DCM coding mode is applied, it is first necessary to encode whether the current node is a true isolated point, that is, IDCM_flag. When IDCM_flag is true, the current node is encoded using DCM, otherwise it is still encoded using octrees. When the current node meets the DCM coding requirements, it is necessary to encode the DCM coding mode of the current node. There are currently two DCM modes: 1: only one point exists (or multiple points, but they are repeated points); 2: contains two points. Finally, it is necessary to encode the geometric information of each point. Assuming that the side length of the node is 2d , d bits are required to encode each component of the geometric coordinates of the node, and the bit information is directly encoded into the bit stream. It should be noted here that when encoding the lidar point cloud, the three-dimensional coordinate information is predictively encoded by using the lidar acquisition parameters, so as to further improve the coding efficiency of the geometric information.
接下来对IDCM编码的过程进行详细的介绍:Next, the IDCM encoding process is introduced in detail:
当前节点满足直接编码模式(DCM)时,首先编码当前节点的点数目numPoints,根据不同的DirectMode来对当前节点的点数目进行编码,具体包括如下方式:When the current node satisfies the direct coding mode (DCM), the number of points numPoints of the current node is first encoded, and the number of points of the current node is encoded according to different DirectModes, including the following methods:
1、如果当前节点不满足DCM节点的要求,则直接退出,(即点数大于2个点,并且不是重复点)。1. If the current node does not meet the requirements of the DCM node, exit directly (that is, the number of points is greater than 2 points and is not a duplicate point).
2、当前节点含有的点数numPonts小于等于2,则编码过程如下:2. If the number of points numPonts in the current node is less than or equal to 2, the encoding process is as follows:
1)、首先编码当前节点的numPonts是否大于1;1) First, encode whether the numPonts of the current node is greater than 1;
2)、如果当前节点只有一个点并且几何编码环境为几何无损编码,则需要编码当前节点的第二个点不是重复点。2) If the current node has only one point and the geometry coding environment is geometry lossless coding, it is necessary to encode that the second point of the current node is not a duplicate point.
3、当前节点含有的点数numPonts大于2,则编码过程如下:3. If the number of points numPonts contained in the current node is greater than 2, the encoding process is as follows:
1)、首先编码当前节点的numPonts小于等于1;1) First, encode the numPonts of the current node to be less than or equal to 1;
2)、其次编码当前节点的第二个点是一个重复点,其次编码当前节点的重复点数目是否大于1,当重复点数目大于1时,需要对剩余的重复点数目进行指数哥伦布解码。2) Secondly, encode whether the second point of the current node is a repeated point, and then encode whether the number of repeated points of the current node is greater than 1. When the number of repeated points is greater than 1, it is necessary to perform exponential Columbus decoding on the remaining number of repeated points.
在编码完当前节点的点数目之后,对当前节点中包含点的坐标信息进行编码。下面将分别对激光雷达点云和面向人眼点云分开介绍。After encoding the number of points in the current node, the coordinate information of the points contained in the current node is encoded. The following will introduce the lidar point cloud and the human eye point cloud separately.
面向人眼点云Point cloud for human eyes
1)、如果当前节点中仅仅只含有一个点,则会对点的三个维度方向的几何信息进行直接编码(Bypass coding)。1) If the current node contains only one point, the geometric information of the point in three dimensions will be directly encoded (Bypass coding).
2)、如果当前节点中含有两个点,则会首先通过利用点的几何坐标得到优先编码的坐标轴dirextAxis,这里需要注意的是,目前比较的坐标轴只包含x和y轴,不包含z轴。假设当前节点的几何坐标为nodePos,则采用如公式(8)所示的方式,确定优先编码的坐标轴:2) If the current node contains two points, the priority coded coordinate axis dirextAxis will be obtained by using the geometric coordinates of the points. It should be noted that the currently compared coordinate axes only include the x and y axes, not the z axis. Assuming that the geometric coordinates of the current node are nodePos, the priority coded coordinate axis is determined by the method shown in formula (8):
dirextAxis=!(nodePos[0]<nodePos[1])   (8)dirextAxis=! (nodePos[0]<nodePos[1])   (8)
也就是将节点坐标几何位置小的轴作为优先编码的坐标轴dirextAxis。That is to say, the axis with the smaller node coordinate geometric position is used as the coordinate axis dirextAxis for priority encoding.
其次按照如下方式首先对优先编码的坐标轴dirextAxis几何信息进行编码,假设优先编码的轴对应的待编码几何bit深度为nodeSizeLog2,并假设两个点的坐标分别为pointPos[0]和pointPos[1]:Secondly, the geometry information of the dirextAxis coordinate axis of the priority coding is first encoded as follows, assuming that the bit depth of the geometry to be encoded corresponding to the priority coding axis is nodeSizeLog2, and assuming that the coordinates of the two points are pointPos[0] and pointPos[1] respectively:
Figure PCTCN2023071069-appb-000005
Figure PCTCN2023071069-appb-000005
在编码完优先编码轴dirextAxis之后,再对当前点的几何坐标进行直接编码。假设每个点的剩余编码bit深度为nodeSizeLog2,则具体编码过程如下:After encoding the priority encoding axis dirextAxis, the geometric coordinates of the current point are directly encoded. Assuming that the remaining encoding bit depth of each point is nodeSizeLog2, the specific encoding process is as follows:
for(int axisIdx=0;axisIdx<3;++axisIdx)for(int axisIdx=0;axisIdx<3;++axisIdx)
       for(int mask=(1<<nodeSizeLog2[axisIdx])>>1;mask;mask>>1)for(int mask=(1<<nodeSizeLog2[axisIdx])>>1;mask;mask>>1)
              encodePosBit(!!(pointPos[axisIdx]&mask));encodePosBit(!!(pointPos[axisIdx]&mask));
面向激光雷达点云For LiDAR point clouds
1)、如果当前节点中含有两个点,则会首先通过利用点的几何坐标得到优先编码的坐标轴dirextAxis,假设当前节点的几何坐标为nodePos,则采用如公式(9)所示的方式,确定优先编码的坐标轴:1) If the current node contains two points, the priority coded coordinate axis dirextAxis will be obtained by using the geometric coordinates of the points. Assuming that the geometric coordinates of the current node are nodePos, the priority coded coordinate axis is determined by the method shown in formula (9):
dirextAxis=!(nodePos[0]<nodePos[1])    (9)dirextAxis=! (nodePos[0]<nodePos[1])    (9)
也就是将节点坐标几何位置小的轴作为优先编码的坐标轴dirextAxis,这里需要注意的是,目前比较的坐标轴只包含x和y轴,不包含z轴。That is to say, the axis with the smaller node coordinate geometric position is used as the coordinate axis dirextAxis for priority encoding. It should be noted here that the currently compared coordinate axes only include the x and y axes, but not the z axis.
其次按照如下方式首先对优先编码的坐标轴dirextAxis几何信息进行编码,假设优先编码的轴对应的待编码几何bit深度为nodeSizeLog2,并假设两个点的坐标分别为pointPos[0]和pointPos[1]:Secondly, the geometry information of the dirextAxis coordinate axis of the priority coding is first encoded as follows, assuming that the bit depth of the geometry to be encoded corresponding to the priority coding axis is nodeSizeLog2, and assuming that the coordinates of the two points are pointPos[0] and pointPos[1] respectively:
Figure PCTCN2023071069-appb-000006
Figure PCTCN2023071069-appb-000006
在编码完优先编码轴dirextAxis之后,再对当前点的几何坐标进行编码。After encoding the priority encoding axis dirextAxis, the geometric coordinates of the current point are encoded.
由于激光雷达点云可以得到激光雷达点云的采集参数,通过利用可以预测当前节点的几何坐标信息,从而可以进一步提升点云的几何信息编码效率。同样的首先利用当前节点的几何信息nodePos得到一个直接编码的主轴方向,其 次利用已经完成编码的方向的几何信息来对另外一个维度的几何信息进行预测编码。同样假设直接编码的轴方向是directAxis,并且假设直接编码中的待编码bit深度为nodeSizeLog2,则编码方式如下:Since the laser radar point cloud can obtain the acquisition parameters of the laser radar point cloud, the geometric coordinate information of the current node can be predicted, so as to further improve the efficiency of the geometric information encoding of the point cloud. Similarly, the geometric information nodePos of the current node is first used to obtain a directly encoded main axis direction, and then the geometric information of the encoded direction is used to predict the geometric information of another dimension. Also, assuming that the axis direction of the direct encoding is directAxis, and assuming that the bit depth to be encoded in the direct encoding is nodeSizeLog2, the encoding method is as follows:
for(int mask=(1<<nodeSizeLog2)>>1;mask;mask>>1)for(int mask=(1<<nodeSizeLog2)>>1;mask;mask>>1)
       encodePosBit(!!(pointPos[directAxis]&mask));encodePosBit(!!(pointPos[directAxis]&mask));
这里需要注意的是,在这里会将directAxis方向的几何精度信息全部编码。It should be noted here that all geometric accuracy information in the directAxis direction will be encoded here.
在编码完directAxis坐标方向的所有精度之后,会首先计算当前点所对应的LaserIdx,即图6B中的pointLaserIdx号,并且计算当前节点的LaserIdx,即nodeLaserIdx。其次会利用节点的LaserIdx即nodeLaserIdx来对点的LaserIdx即pointLaserIdx进行预测编码,其中节点或者点的LaserIdx的计算方式如下:After encoding all the precision of the directAxis coordinate direction, the LaserIdx corresponding to the current point, i.e. pointLaserIdx in Figure 6B, will be calculated first, and the LaserIdx of the current node, i.e. nodeLaserIdx, will be calculated. Then the LaserIdx of the node, i.e. nodeLaserIdx, will be used to predictively encode the LaserIdx of the point, i.e. pointLaserIdx. The calculation method of the LaserIdx of the node or point is as follows:
假设点的几何坐标为pointPos,激光射线的起始坐标为LidarOrigin,并且假设Laser的数目为LaserNum,每个Laser的正切值为tanθ i,每个Laser在垂直方向上的偏移位置为Z i,则: Assume that the geometric coordinates of the point are pointPos, the starting coordinates of the laser ray are LidarOrigin, and the number of Lasers is LaserNum, the tangent value of each Laser is tanθ i , and the offset position of each Laser in the vertical direction is Zi , then:
Figure PCTCN2023071069-appb-000007
Figure PCTCN2023071069-appb-000007
在计算得到当前点的LaserIdx之后,首先会利用当前节点的LaserIdx对点的pointLaserIdx进行预测编码。在编码完当前点的LaserIdx之后,对当前点三个维度的几何信息利用激光雷达的采集参数进行预测编码。After calculating the LaserIdx of the current point, the LaserIdx of the current node is first used to predict the pointLaserIdx of the point. After encoding the LaserIdx of the current point, the three-dimensional geometric information of the current point is predictively encoded using the acquisition parameters of the laser radar.
具体算法如图6C所示,首先利用当前点对应的LaserIdx得到对应的水平方位角的预测值,即
Figure PCTCN2023071069-appb-000008
其次利用当前点对应的节点几何信息得到节点对应的水平方位角度
Figure PCTCN2023071069-appb-000009
其中,水平方位角
Figure PCTCN2023071069-appb-000010
与节点几何信息之间的计算方式如公式(10)所示,假设节点的几何坐标为nodePos:
The specific algorithm is shown in FIG6C . First, the LaserIdx corresponding to the current point is used to obtain the corresponding predicted value of the horizontal azimuth angle, that is,
Figure PCTCN2023071069-appb-000008
Secondly, the node geometry information corresponding to the current point is used to obtain the horizontal azimuth angle corresponding to the node
Figure PCTCN2023071069-appb-000009
Among them, the horizontal azimuth
Figure PCTCN2023071069-appb-000010
The calculation method between the node geometry information is shown in formula (10), assuming that the geometry coordinates of the node are nodePos:
Figure PCTCN2023071069-appb-000011
Figure PCTCN2023071069-appb-000011
通过利用激光雷达的采集参数,可以得到每个Laser的旋转点数numPoints,即代表每个激光射线旋转一圈得到的点数,则可以利用每个Laser的旋转点数计算得到每个Laser的旋转角速度deltaPhi,如公式(11)所示:By using the acquisition parameters of the laser radar, the number of rotation points numPoints of each Laser can be obtained, which represents the number of points obtained when each laser ray rotates one circle. The rotation angular velocity deltaPhi of each Laser can then be calculated using the number of rotation points of each Laser, as shown in formula (11):
Figure PCTCN2023071069-appb-000012
Figure PCTCN2023071069-appb-000012
如图6D所示,利用节点的水平方位角
Figure PCTCN2023071069-appb-000013
以及当前点对应的Laser前一个编码点的水平方位角
Figure PCTCN2023071069-appb-000014
计算得到当前点对应的水平方位角预测值
Figure PCTCN2023071069-appb-000015
具体计算公式如公式(12)所示:
As shown in FIG6D , using the horizontal azimuth angle of the node
Figure PCTCN2023071069-appb-000013
And the horizontal azimuth of the previous Laser code point corresponding to the current point
Figure PCTCN2023071069-appb-000014
Calculate the predicted horizontal azimuth angle corresponding to the current point
Figure PCTCN2023071069-appb-000015
The specific calculation formula is shown in formula (12):
Figure PCTCN2023071069-appb-000016
Figure PCTCN2023071069-appb-000016
最终,如图6E所示,通过利用水平方位角的预测值
Figure PCTCN2023071069-appb-000017
以及当前节点的低平面水平方位角
Figure PCTCN2023071069-appb-000018
和高平面的水平方位角
Figure PCTCN2023071069-appb-000019
来对当前节点的几何信息进行预测编码。具体如下所示:
Finally, as shown in FIG6E , by using the predicted value of the horizontal azimuth
Figure PCTCN2023071069-appb-000017
and the low plane horizontal azimuth of the current node
Figure PCTCN2023071069-appb-000018
and the horizontal azimuth of the high plane
Figure PCTCN2023071069-appb-000019
To predict the geometric information of the current node. The details are as follows:
Figure PCTCN2023071069-appb-000020
Figure PCTCN2023071069-appb-000020
Figure PCTCN2023071069-appb-000021
Figure PCTCN2023071069-appb-000021
int context=(angLel≥0&&angLeR≥0)||(angLel<0&&angLeR<0)?0:2int context=(angLel≥0&&angLeR≥0)||(angLel<0&&angLeR<0)? 0:2
int absAngleL=abs(angLel)int absAngleL=abs(angLel)
int absAngleR=abs(angLeR)int absAngleR=abs(angLeR)
context+=absAngleL>absAngleR?0:1context+=absAngleL>absAngleR? 0:1
context+=maxAngle>minAngle<<1?4:0context+=maxAngle>minAngle<<1?4:0
在编码完点的LaserIdx之后,会利用当前点所对应的LaserIdx对当前点的Z轴方向进行预测编码,即当前通过利用当前点的x和y信息计算得到柱面坐标系的深度信息radius,其次利用当前点的激光LaserIdx得到当前点的正切值以及垂直方向的便宜,则可以得到当前点的Z轴方向的预测值即Z_pred:After encoding the LaserIdx of the point, the LaserIdx corresponding to the current point will be used to predict the Z-axis direction of the current point. That is, the depth information radius of the cylindrical coordinate system is calculated by using the x and y information of the current point. Then, the tangent value of the current point and the vertical value are obtained by using the laser LaserIdx of the current point. Then, the predicted value of the Z-axis direction of the current point, namely Z_pred, can be obtained:
Figure PCTCN2023071069-appb-000022
Figure PCTCN2023071069-appb-000022
int tanTheta=tanθ laserIdx int tanTheta=tanθ laserIdx
int zOffset=Z laserIdx int zOffset = Z laserIdx
Z_pred=radius×tanTheta-zOffsetZ_pred = radius × tanTheta - zOffset
最终利用Z_pred对当前点的Z轴方向的几何信息进行预测编码得到预测残差Z_res,最终对Z_res进行编码。Finally, Z_pred is used to predict the geometric information of the current point in the Z-axis direction to obtain the prediction residual Z_res, and finally Z_res is encoded.
需要注意的是,在节点划分到叶子节点时,在几何无损编码的情况下,需要对叶子节点中的重复点数目进行编码。最终对所有节点的占位信息进行编码,生成二进制码流。另外G-PCC目前引入了一种平面编码模式,在对几何进行划分的过程中,会判断当前节点的子节点是否处于同一平面,如果当前节点的子节点满足同一平面的条件,会用该平面对当前节点的子节点进行表示。It should be noted that when nodes are divided into leaf nodes, in the case of geometric lossless coding, the number of repeated points in the leaf nodes needs to be encoded. Finally, the placeholder information of all nodes is encoded to generate a binary code stream. In addition, G-PCC currently introduces a plane coding mode. In the process of geometric division, it will determine whether the child nodes of the current node are in the same plane. If the child nodes of the current node meet the conditions of the same plane, the child nodes of the current node will be represented by the plane.
在基于八叉树的几何解码,解码端按照广度优先遍历的顺序,在对每个节点的占位信息解码之前,首先会利用已经重建得到的几何信息来判断当前节点是否进行平面解码或者IDCM解码,如果当前节点满足平面解码的条件,则会首先对当前节点的平面标识和平面位置信息进行解码,其次基于平面信息来对当前节点的占位信息进行解码;如果当前节点满足IDCM解码的条件,则会首先解码当前节点是否是一个真正的IDCM节点,如果是一个真正的IDCM解 码,则会继续解析当前节点的DCM解码模式,其次可以得到当前DCM节点中的点数目,最后对每个点的几何信息进行解码。对于既不满足平面解码也不满足DCM解码的节点,会对当前节点的占位信息进行解码。通过按照这样的方式不断解析得到每个节点的占位码,并且依次不断划分节点,直至划分得到1X1X1的单位立方体时停止划分,解析得到每个叶子节点中包含的点数,最终恢复得到几何重构点云信息。In the octree-based geometric decoding, the decoding end follows the order of breadth-first traversal. Before decoding the placeholder information of each node, it will first use the reconstructed geometric information to determine whether the current node is to be plane decoded or IDCM decoded. If the current node meets the conditions for plane decoding, the plane identification and plane position information of the current node will be decoded first, and then the placeholder information of the current node will be decoded based on the plane information; if the current node meets the conditions for IDCM decoding, it will first decode whether the current node is a real IDCM node. If it is a real IDCM decoding, it will continue to parse the DCM decoding mode of the current node, and then the number of points in the current DCM node can be obtained, and finally the geometric information of each point will be decoded. For nodes that do not meet neither plane decoding nor DCM decoding, the placeholder information of the current node will be decoded. By continuously parsing in this way, the placeholder code of each node is obtained, and the nodes are continuously divided in turn until the division is stopped when the unit cube of 1X1X1 is obtained, the number of points contained in each leaf node is obtained by parsing, and finally the geometric reconstructed point cloud information is restored.
下面对IDCM解码的过程进行详细的介绍:The following is a detailed introduction to the IDCM decoding process:
与编码同样的处理,首先利用先验信息来决定节点是否启动IDCM,即IDCM的启动条件如下:The same process as encoding, first use the prior information to decide whether the node starts IDCM, that is, the starting conditions of IDCM are as follows:
(1)当前节点没有兄弟姐妹子节点,即当前节点的父节点只有一个孩子节点,同时当前节点父节点的父节点仅有两个被占据子节点,即当前节点最多只有一个邻居节点。(1) The current node has no sibling child nodes, that is, the parent node of the current node has only one child node, and the parent node of the parent node of the current node has only two occupied child nodes, that is, the current node has at most one neighbor node.
(2)当前节点的父节点仅有当前节点一个占据子节点,同时与当前节点共用一个面的六个邻居节点也都属于空节点。(2) The parent node of the current node has only one child node, the current node. At the same time, the six neighbor nodes that share a face with the current node are also empty nodes.
(3)当前节点的兄弟姐妹节点数目大于1。(3) The number of sibling nodes of the current node is greater than 1.
当节点满足DCM编码的条件时,首先解码当前节点是否是一个真正的DCM节点,即IDCM_flag,当IDCM_flag为true时,则当前节点采用DCM编码,否则仍然采用八叉树编码。When a node meets the conditions for DCM encoding, first decode whether the current node is a real DCM node, that is, IDCM_flag. When IDCM_flag is true, the current node adopts DCM encoding, otherwise it still adopts octree encoding.
其次解码当前节点的点数目numPoints,具体的解码方式如下所示:Secondly, decode the number of points numPoints of the current node. The specific decoding method is as follows:
1)、首先解码当前节点的numPonts是否大于1;1) First, decode whether the numPonts of the current node is greater than 1;
2)、如果解码得到当前节点的numPonts大于1,则继续解码第二个点是否是一个重复点,如果第二个点不是重复点,则这里可以隐性推断出满足DCM模式的第二种,只含有两个点;2) If the numPonts of the current node obtained by decoding is greater than 1, continue decoding to see if the second point is a duplicate point. If the second point is not a duplicate point, it can be implicitly inferred that the second type that satisfies the DCM mode contains only two points.
3)、如果解码得到当前节点的numPonts小于等于1,则继续解码第二个点是否是一个重复点,如果第二个点不是重复点,则这里可以隐性推断出满足DCM模式的第二种,只含有一个点;如果解码得到第二个点是一个重复点,则可以推断出满足DCM模式的第三种,含有多个点,但是都是重复点,则继续解码重复点的数目是否大于1(熵解码),如果大于1,则继续解码剩余重复点的数目(利用指数哥伦布进行解码)。3) If the numPonts of the current node obtained by decoding is less than or equal to 1, continue to decode whether the second point is a repeated point. If the second point is not a repeated point, it can be implicitly inferred that the second type that satisfies the DCM mode contains only one point; if the second point obtained by decoding is a repeated point, it can be inferred that the third type that satisfies the DCM mode contains multiple points, but they are all repeated points, then continue to decode whether the number of repeated points is greater than 1 (entropy decoding). If it is greater than 1, continue to decode the number of remaining repeated points (decoding using exponential Columbus).
如果当前节点不满足DCM节点的要求,即点数大于2个点,并且不是重复点,则直接退出。If the current node does not meet the requirements of the DCM node, that is, the number of points is greater than 2 points and it is not a duplicate point, exit directly.
在解码完当前节点的点数目之后,对当前节点中包含点的坐标信息进行解码。下面将分别对激光雷达点云和面向人眼点云分开介绍。After decoding the number of points in the current node, the coordinate information of the points contained in the current node is decoded. The following will introduce the lidar point cloud and the human eye point cloud separately.
面向人眼点云Point cloud for human eyes
1)、如果当前节点中仅仅只含有一个点,则会对点的三个维度方向的几何信息进行直接解码(Bypass coding);1) If the current node contains only one point, the geometric information of the point in three dimensions will be directly decoded (Bypass coding);
2)、如果当前节点中含有两个点,则会首先通过利用点的几何坐标得到优先解码的坐标轴dirextAxis,这里需要注意的是,目前比较的坐标轴只包含x和y轴,不包含z轴。假设当前节点的几何坐标为nodePos,则采用如公式(13)所示的方式,确定优先编码的坐标轴:2) If the current node contains two points, the priority decoding coordinate axis dirextAxis will be obtained by using the geometric coordinates of the points. It should be noted that the coordinate axes currently compared only include the x and y axes, not the z axis. Assuming that the geometric coordinates of the current node are nodePos, the method shown in formula (13) is used to determine the priority encoding coordinate axis:
dirextAxis=!(nodePos[0]<nodePos[1])(13)dirextAxis=! (nodePos[0]<nodePos[1])(13)
也就是说,将节点坐标几何位置小的轴作为优先解码的坐标轴dirextAxis。That is to say, the axis with the smaller node coordinate geometric position is used as the coordinate axis dirextAxis for priority decoding.
其次按照如下方式首先对优先解码的坐标轴dirextAxis几何信息进行解码,假设优先解码的轴对应的待解码几何bit深度为nodeSizeLog2,并假设两个点的坐标分别为pointPos[0]和pointPos[1]:Secondly, the geometry information of the dirextAxis coordinate axis to be decoded is first decoded as follows, assuming that the bit depth of the geometry to be decoded corresponding to the axis to be decoded is nodeSizeLog2, and assuming that the coordinates of the two points are pointPos[0] and pointPos[1] respectively:
Figure PCTCN2023071069-appb-000023
Figure PCTCN2023071069-appb-000023
在解码完优先解码轴dirextAxis之后,在对当前点的几何坐标进行直接解码。假设每个点的剩余编码bit深度为nodeSizeLog2,则具体解码过程如下,假设点的坐标信息为pointPos:After decoding the priority decoding axis dirextAxis, the geometric coordinates of the current point are directly decoded. Assuming that the remaining encoding bit depth of each point is nodeSizeLog2, the specific decoding process is as follows, assuming that the coordinate information of the point is pointPos:
Figure PCTCN2023071069-appb-000024
Figure PCTCN2023071069-appb-000024
面向激光雷达点云For LiDAR point clouds
1)、如果当前节点中含有两个点,则会首先通过利用点的几何坐标得到优先解码的坐标轴dirextAxis,假设当前 节点的几何坐标为nodePos,则采用如公式(14)所示的方式,确定优先编码的坐标轴:1) If the current node contains two points, the priority decoding coordinate axis dirextAxis will be obtained by using the geometric coordinates of the points. Assuming that the geometric coordinates of the current node are nodePos, the priority encoding coordinate axis is determined by the method shown in formula (14):
dirextAxis=!(nodePos[0]<nodePos[1])(14)dirextAxis=! (nodePos[0]<nodePos[1])(14)
也就是说,将节点坐标几何位置小的轴作为优先解码的坐标轴dirextAxis,这里需要注意的是,目前比较的坐标轴只包含x和y轴,不包含z轴。That is to say, the axis with the smaller node coordinate geometric position is used as the coordinate axis dirextAxis for priority decoding. It should be noted here that the currently compared coordinate axes only include the x and y axes, but not the z axis.
其次按照如下方式首先对优先编码的坐标轴dirextAxis几何信息进行解码,假设优先解码的轴对应的待编码几何bit深度为nodeSizeLog2,并假设两个点的坐标分别为pointPos[0]和pointPos[1]:Secondly, the geometry information of the dirextAxis coordinate axis that is encoded first is decoded as follows, assuming that the bit depth of the geometry to be encoded corresponding to the axis that is decoded first is nodeSizeLog2, and assuming that the coordinates of the two points are pointPos[0] and pointPos[1] respectively:
Figure PCTCN2023071069-appb-000025
Figure PCTCN2023071069-appb-000025
在解码完优先解码轴dirextAxis之后,再对当前点的几何坐标进行解码。After decoding the priority decoding axis dirextAxis, the geometric coordinates of the current point are decoded.
同样的,首先利用当前节点的几何信息nodePos得到一个直接解码的主轴方向,其次利用已经完成解码的方向的几何信息来对另外一个维度的几何信息进行解码。同样假设直接解码的轴方向是directAxis,并且假设直接解码中的待解码bit深度为nodeSizeLog2,则解码方式如下:Similarly, first use the current node's geometry information nodePos to get a direct decoding main axis direction, and then use the geometry information of the decoded direction to decode the geometry information of another dimension. Also assuming that the axis direction of direct decoding is directAxis, and assuming that the bit depth to be decoded in direct decoding is nodeSizeLog2, the decoding method is as follows:
Figure PCTCN2023071069-appb-000026
Figure PCTCN2023071069-appb-000026
这里需要注意的是,在这里会将directAxis方向的几何精度信息全部解码。It should be noted here that all geometric accuracy information in the directAxis direction will be decoded here.
在解码完directAxis坐标方向的所有精度之后,会首先计算当前节点的LaserIdx,即nodeLaserIdx,其次会利用节点的LaserIdx即nodeLaserIdx来对点的LaserIdx即pointLaserIdx进行预测解码,其中节点或者点的LaserIdx的计算方式跟编码端相同。最终对当前点的LaserIdx与节点的LaserIdx预测残差信息进行解码得到ResLaserIdx,则计算公式如公式15所示:After decoding all the precisions of the directAxis coordinate direction, the LaserIdx of the current node, i.e., nodeLaserIdx, is calculated first. Then, the LaserIdx of the node, i.e., nodeLaserIdx, is used to predict and decode the LaserIdx of the point, i.e., pointLaserIdx. The calculation method of the LaserIdx of the node or point is the same as that of the encoder. Finally, the LaserIdx of the current point and the predicted residual information of the LaserIdx of the node are decoded to obtain ResLaserIdx. The calculation formula is shown in Formula 15:
PointLaserIdx=nodeLaserIdx+ResLaserIdx    (15)PointLaserIdx=nodeLaserIdx+ResLaserIdx    (15)
在解码完当前点的LaserIdx之后,对当前点三个维度的几何信息利用激光雷达的采集参数进行预测解码。After decoding the LaserIdx of the current point, the three-dimensional geometric information of the current point is predicted and decoded using the acquisition parameters of the laser radar.
具体的,如图6B所示,首先利用当前点对应的LaserIdx得到对应的水平方位角的预测值,即
Figure PCTCN2023071069-appb-000027
其次利用当前点对应的节点几何信息得到节点对应的水平方位角度
Figure PCTCN2023071069-appb-000028
其中,假设节点的几何坐标为nodePos,水平方位角
Figure PCTCN2023071069-appb-000029
与节点几何信息之间的计算方式如公式(16)所示:
Specifically, as shown in FIG6B , the LaserIdx corresponding to the current point is first used to obtain the corresponding predicted value of the horizontal azimuth, that is,
Figure PCTCN2023071069-appb-000027
Secondly, the node geometry information corresponding to the current point is used to obtain the horizontal azimuth angle corresponding to the node
Figure PCTCN2023071069-appb-000028
Among them, it is assumed that the geometric coordinates of the node are nodePos, and the horizontal azimuth is
Figure PCTCN2023071069-appb-000029
The calculation method between the node geometry information is shown in formula (16):
Figure PCTCN2023071069-appb-000030
Figure PCTCN2023071069-appb-000030
通过利用激光雷达的采集参数,可以得到每个Laser的旋转点数numPoints,即代表每个激光射线旋转一圈得到的点数,则可以利用每个Laser的旋转点数计算得到每个Laser的旋转角速度deltaPhi,如公式(17)所示:By using the acquisition parameters of the laser radar, the number of rotation points numPoints of each Laser can be obtained, which represents the number of points obtained when each laser ray rotates one circle. The rotation angular velocity deltaPhi of each Laser can then be calculated using the number of rotation points of each Laser, as shown in formula (17):
Figure PCTCN2023071069-appb-000031
Figure PCTCN2023071069-appb-000031
接着,如图6D所示,利用节点的水平方位角
Figure PCTCN2023071069-appb-000032
以及当前点对应的Laser前一个编码点的水平方位角
Figure PCTCN2023071069-appb-000033
计算得到当前点对应的水平方位角预测值
Figure PCTCN2023071069-appb-000034
水平方位角的预测值。计算方式如公式(18)所示:
Next, as shown in FIG6D , the horizontal azimuth angle of the node is used
Figure PCTCN2023071069-appb-000032
And the horizontal azimuth of the previous Laser code point corresponding to the current point
Figure PCTCN2023071069-appb-000033
Calculate the predicted horizontal azimuth angle corresponding to the current point
Figure PCTCN2023071069-appb-000034
The predicted value of the horizontal azimuth angle. The calculation method is shown in formula (18):
Figure PCTCN2023071069-appb-000035
Figure PCTCN2023071069-appb-000035
最终通过利用水平方位角的预测值
Figure PCTCN2023071069-appb-000036
以及当前节点的低平面水平方位角
Figure PCTCN2023071069-appb-000037
和高平面的水平方位角
Figure PCTCN2023071069-appb-000038
来对当前节点的几何信息进行预测编码。具体如下所示:
Finally, by using the predicted value of the horizontal azimuth
Figure PCTCN2023071069-appb-000036
and the low plane horizontal azimuth of the current node
Figure PCTCN2023071069-appb-000037
and the horizontal azimuth of the high plane
Figure PCTCN2023071069-appb-000038
To predict the geometric information of the current node. The details are as follows:
Figure PCTCN2023071069-appb-000039
Figure PCTCN2023071069-appb-000039
Figure PCTCN2023071069-appb-000040
Figure PCTCN2023071069-appb-000040
int context=(angLel≥0&&angLeR≥0)||(angLel<0&&angLeR<0)?0:2int context=(angLel≥0&&angLeR≥0)||(angLel<0&&angLeR<0)? 0:2
int absAngleL=abs(angLel)int absAngleL=abs(angLel)
int absAngleR=abs(angLeR)int absAngleR=abs(angLeR)
context+=absAngleL>absAngleR?0:1context+=absAngleL>absAngleR? 0:1
context+=maxAngle>minAngle<<1?4:0context+=maxAngle>minAngle<<1?4:0
在解码完点的LaserIdx之后,会利用当前点所对应的LaserIdx对当前点的Z轴方向进行预测解码,即当前通过利用当前点的x和y信息计算得到柱面坐标系的深度信息radius,其次利用当前点的激光LaserIdx得到当前点的正切值 以及垂直方向的偏移量,则可以得到当前点的Z轴方向的预测值即Z_pred:After decoding the LaserIdx of the point, the LaserIdx corresponding to the current point will be used to predict and decode the Z-axis direction of the current point. That is, the depth information radius of the cylindrical coordinate system is calculated by using the x and y information of the current point. Then, the tangent value of the current point and the vertical offset are obtained by using the laser LaserIdx of the current point. Then, the predicted value of the Z-axis direction of the current point, namely Z_pred, can be obtained:
Figure PCTCN2023071069-appb-000041
Figure PCTCN2023071069-appb-000041
int tanTheta=tanθ laserIdx int tanTheta=tanθ laserIdx
int zOffset=Z laserIdx int zOffset = Z laserIdx
Z_pred=radius×tanTheta-zOffsetZ_pred = radius × tanTheta - zOffset
最终利用解码得到的Z_res和Z_pred来重建恢复得到当前点Z轴方向的几何信息。Finally, the decoded Z_res and Z_pred are used to reconstruct and restore the geometric information of the current point in the Z-axis direction.
在基于trisoup(triangle soup,三角面片集)的几何信息编码框架中,同样也要先进行几何划分,但区别于基于二叉树/四叉树/八叉树的几何信息编码,该方法不需要将点云逐级划分到边长为1x1x1的单位立方体,而是划分到block(子块)边长为W时停止划分,基于每个block中点云的分布所形成的表面,得到该表面与block的十二条边所产生的至多十二个vertex(交点)。依次编码每个block的vertex坐标,生成二进制码流。In the geometric information coding framework based on trisoup (triangle soup, triangle patch set), geometric division must also be performed first, but different from the geometric information coding based on binary tree/quadtree/octree, this method does not need to divide the point cloud into unit cubes with a side length of 1x1x1 step by step, but stops dividing when the block (sub-block) has a side length of W. Based on the surface formed by the distribution of the point cloud in each block, at most twelve vertices (intersection points) generated by the surface and the twelve edges of the block are obtained. The vertex coordinates of each block are encoded in turn to generate a binary code stream.
基于trisoup的点云几何信息重建,在解码端进行点云几何信息重建时,首先解码vertex坐标用于完成三角面片重建,该过程如图7A至图7C所示。图7A所示的block中存在3个vertex(v1,v2,v3),利用这3个vertex按照一定顺序所构成的三角面片集被称为triangle soup,即trisoup,如图7B所示。之后,在该三角面片集上进行采样,将得到的采样点作为该block内的重建点云,如图7C所示。When reconstructing the geometric information of the point cloud based on trisoup, the vertex coordinates are first decoded to complete the reconstruction of the triangle facets at the decoding end. The process is shown in Figures 7A to 7C. There are three vertices (v1, v2, v3) in the block shown in Figure 7A. The triangle facet set formed by these three vertices in a certain order is called triangle soup, i.e., trisoup, as shown in Figure 7B. After that, sampling is performed on the triangle facet set, and the obtained sampling points are used as the reconstructed point cloud in the block, as shown in Figure 7C.
基于预测树的几何编码包括:首先对输入点云进行排序,目前采用的排序方法包括无序、莫顿序、方位角序和径向距离序。在编码端通过利用两种不同的方式建立预测树结构,其中包括:KD-Tree(高时延慢速模式)和利用激光雷达标定信息,将每个点划分到不同的Laser上,按照不同的Laser建立预测结构(低时延快速模式)。接下来基于预测树的结构,遍历预测树中的每个节点,通过选取不同的预测模式对节点的几何位置信息进行预测得到预测残差,并且利用量化参数对几何预测残差进行量化。最终通过不断迭代,对预测树节点位置信息的预测残差、预测树结构以及量化参数等进行编码,生成二进制码流。The geometric coding based on the prediction tree includes: first, sorting the input point cloud. The currently used sorting methods include unordered, Morton order, azimuth order and radial distance order. At the encoding end, the prediction tree structure is established by using two different methods, including: KD-Tree (high-latency slow mode) and using the laser radar calibration information to divide each point into different Lasers, and establish a prediction structure according to different Lasers (low-latency fast mode). Next, based on the structure of the prediction tree, traverse each node in the prediction tree, predict the geometric position information of the node by selecting different prediction modes to obtain the prediction residual, and quantize the geometric prediction residual using the quantization parameter. Finally, through continuous iteration, the prediction residual of the prediction tree node position information, the prediction tree structure and the quantization parameters are encoded to generate a binary code stream.
基于预测树的几何解码,解码端通过不断解析码流,重构预测树结构,其次通过解析得到每个预测节点的几何位置预测残差信息以及量化参数,并且对预测残差进行反量化,恢复得到每个节点的重构几何位置信息,最终完成解码端的几何重构。Based on the geometric decoding of the prediction tree, the decoding end reconstructs the prediction tree structure by continuously parsing the bit stream, and then obtains the geometric position prediction residual information and quantization parameters of each prediction node through parsing, and dequantizes the prediction residual to recover the reconstructed geometric position information of each node, and finally completes the geometric reconstruction of the decoding end.
几何编码完成后,对几何信息进行重建。目前,属性编码主要针对颜色信息进行。首先,将颜色信息从RGB颜色空间转换到YUV颜色空间。然后,利用重建的几何信息对点云重新着色,使得未编码的属性信息与重建的几何信息对应起来。在颜色信息编码中,主要有两种变换方法,一是依赖于LOD(Level of Detail,细节层次)划分的基于距离的提升变换,二是直接进行RAHT(Region Adaptive Hierarchal Transform,区域自适应分层变换)变换,这两种方法都会将颜色信息从空间域转换到频域,通过变换得到高频系数和低频系数,最后对系数进行量化并编码,生成二进制码流。After the geometric encoding is completed, the geometric information is reconstructed. At present, attribute encoding is mainly performed on color information. First, the color information is converted from the RGB color space to the YUV color space. Then, the point cloud is recolored using the reconstructed geometric information so that the unencoded attribute information corresponds to the reconstructed geometric information. In color information encoding, there are two main transformation methods. One is the distance-based lifting transformation that relies on LOD (Level of Detail) division, and the other is to directly perform RAHT (Region Adaptive Hierarchal Transform) transformation. Both methods will convert color information from the spatial domain to the frequency domain, obtain high-frequency coefficients and low-frequency coefficients through transformation, and finally quantize and encode the coefficients to generate a binary code stream.
在利用几何信息来对属性信息进行预测时,可以利用莫顿码进行最近邻居搜索,点云中每点对应的莫顿码可以由该点的几何坐标得到。计算莫顿码的具体方法描述如下所示,对于每一个分量用d比特二进制数表示的三维坐标,其三个分量可以表示为公式(19):When using geometric information to predict attribute information, Morton codes can be used to search for nearest neighbors. The Morton code corresponding to each point in the point cloud can be obtained from the geometric coordinates of the point. The specific method for calculating the Morton code is described as follows. For each component of the three-dimensional coordinate represented by a d-bit binary number, its three components can be expressed as formula (19):
Figure PCTCN2023071069-appb-000042
Figure PCTCN2023071069-appb-000042
其中,x l,y l,z l∈{0,1}分别是x,y,z的最高位(l=1)到最低位(l=d)对应的二进制数值。莫顿码M是对x,y,z从最高位开始,依次交叉排列x l,y l,z l到最低位,M的计算公式如下公式(20)所示: Where x l ,y l ,z l ∈{0,1} are the binary values corresponding to the highest bit (l=1) to the lowest bit (l=d) of x, y, z respectively. The Morton code M is x, y, z starting from the highest bit and arranged in sequence from x l ,y l ,z l to the lowest bit. The calculation formula of M is shown in the following formula (20):
Figure PCTCN2023071069-appb-000043
Figure PCTCN2023071069-appb-000043
其中,m l′∈{0,1}分别是M的最高位(l′=1)到最低位(l′=3d)的值。在得到点云中每个点的莫顿码M后,将点云中的点按莫顿码由小到大的顺序进行排列,并将每个点的权值w设为1。 Among them, m l′ ∈{0,1} are the values of the highest bit (l′=1) to the lowest bit (l′=3d) of M. After obtaining the Morton code M of each point in the point cloud, the points in the point cloud are arranged in order of Morton code from small to large, and the weight w of each point is set to 1.
GPCC的通用测试条件共4种:There are 4 general test conditions for GPCC:
条件1:几何位置有限度有损、属性有损;Condition 1: The geometric position is limitedly lossy and the attributes are lossy;
条件2:几何位置无损、属性有损;Condition 2: The geometric position is lossless, but the attributes are lossy;
条件3:几何位置无损、属性有限度有损;Condition 3: The geometric position is lossless, and the attributes are limitedly lossy;
条件4:几何位置无损、属性无损。Condition 4: The geometric position and attributes are lossless.
通用测试序列包括Cat1A,Cat1B,Cat3-fused,Cat3-frame共四类,其中Cat2-frame点云只包含反射率属性信息,Cat1A、Cat1B点云只包含颜色属性信息,Cat3-fused点云同时包含颜色和反射率属性信息。The general test sequences include Cat1A, Cat1B, Cat3-fused, and Cat3-frame. The Cat2-frame point cloud only contains reflectance attribute information, the Cat1A and Cat1B point clouds only contain color attribute information, and the Cat3-fused point cloud contains both color and reflectance attribute information.
GPCC的技术路线共2种,以几何压缩所采用的算法进行区分,分为八叉树编码分支和预测树编码分支。There are two technical routes of GPCC, which are distinguished by the algorithm used for geometric compression, and are divided into octree coding branch and prediction tree coding branch.
其中,八叉树编码分支中,在编码端,将包围盒依次划分得到子立方体,对非空的(包含点云中的点)的子立方体继续进行划分,直到划分得到的叶子结点为1X1X1的单位立方体时停止划分,在几何无损编码情况下,需要对叶子节点中所包含的点数进行编码,最终完成几何八叉树的编码,生成二进制码流。在解码端,解码端按照广度优先遍历的顺序,通过不断解析得到每个节点的占位码,并且依次不断划分节点,直至划分得到1x1x1的单位立方体时停止划分,在几何无损解码的情况下,需要解析得到每个叶子节点中包含的点数,最终恢复得到几何重构点云信息。Among them, in the octree coding branch, at the encoding end, the bounding box is divided into sub-cubes in sequence, and the non-empty (containing points in the point cloud) sub-cubes are continued to be divided until the leaf node obtained by division is a 1X1X1 unit cube. In the case of geometric lossless coding, it is necessary to encode the number of points contained in the leaf node, and finally complete the encoding of the geometric octree to generate a binary code stream. At the decoding end, the decoding end obtains the placeholder code of each node by continuous parsing in the order of breadth-first traversal, and continuously divides the nodes in sequence until the division is a 1x1x1 unit cube. In the case of geometric lossless decoding, it is necessary to parse the number of points contained in each leaf node, and finally restore the geometric reconstructed point cloud information.
预测树编码分支中,在编码端通过利用两种不同的方式建立预测树结构,其中包括:KD-Tree(高时延慢速模式)和利用激光雷达标定信息,将每个点划分到不同的Laser上,按照不同的Laser建立预测结构(低时延快速模式)。接下来基于预测树的结构,遍历预测树中的每个节点,通过选取不同的预测模式对节点的几何位置信息进行预测得到预测残差,并且利用量化参数对几何预测残差进行量化。最终通过不断迭代,对预测树节点位置信息的预测残差、预测树 结构以及量化参数等进行编码,生成二进制码流。在解码端,解码端通过不断解析码流,重构预测树结构,其次通过解析得到每个预测节点的几何位置预测残差信息以及量化参数,并且对预测残差进行反量化,恢复得到每个节点的重构几何位置信息,最终完成解码端的几何重构。In the prediction tree coding branch, the prediction tree structure is established at the encoding end by using two different methods, including: KD-Tree (high-latency slow mode) and using the laser radar calibration information to divide each point into different lasers and establish a prediction structure according to different lasers (low-latency fast mode). Next, based on the structure of the prediction tree, each node in the prediction tree is traversed, and the geometric position information of the node is predicted by selecting different prediction modes to obtain the prediction residual, and the geometric prediction residual is quantized using the quantization parameter. Finally, through continuous iteration, the prediction residual of the prediction tree node position information, the prediction tree structure, and the quantization parameters are encoded to generate a binary code stream. At the decoding end, the decoding end reconstructs the prediction tree structure by continuously parsing the code stream, and then obtains the geometric position prediction residual information and quantization parameters of each prediction node through parsing, and dequantizes the prediction residual to restore the reconstructed geometric position information of each node, and finally completes the geometric reconstruction of the decoding end.
在对当前节点进行直接编码时,编码端在确定当前节点具备直接编解码的资格后,对当前节点的点数进行编码。对应的,解码端从码流中解码出当前节点的点数。但是,目前在当前节点的点数进行预测编解码时,未考虑时域相关性,可选择的上下文条件少,导致得到的条件熵较大,进而使得点数的编码性能差,导致点云的编码效率低。When directly encoding the current node, the encoder encodes the number of points of the current node after determining that the current node is eligible for direct encoding and decoding. Correspondingly, the decoder decodes the number of points of the current node from the bitstream. However, currently, when predicting and decoding the number of points of the current node, the time domain correlation is not considered, and there are few selectable context conditions, resulting in a large conditional entropy, which in turn makes the encoding performance of the points poor, resulting in low encoding efficiency of the point cloud.
为了解决上述技术问题,本申请实施例在编解码节点的点数时,参照了帧间信息,例如基于当前节点对应的帧间编解码信息,对当前节点的点数进行编解码,进而增加了编解码点数时可选择的上下文条件,使得确定出的条件熵减小,从而提升点云的编解码效率和性能。In order to solve the above technical problems, the embodiment of the present application refers to the inter-frame information when encoding and decoding the points of the node. For example, based on the inter-frame encoding and decoding information corresponding to the current node, the points of the current node are encoded and decoded, thereby increasing the selectable context conditions when encoding and decoding the points, so that the determined conditional entropy is reduced, thereby improving the encoding and decoding efficiency and performance of the point cloud.
下面结合具体的实施例,对本申请实施例涉及的点云编解码方法进行介绍。The point cloud encoding and decoding method involved in the embodiments of the present application is introduced below in conjunction with specific embodiments.
首先,以解码端为例,对本申请实施例提供的点云解码方法进行介绍。First, taking the decoding end as an example, the point cloud decoding method provided in the embodiment of the present application is introduced.
图8为本申请一实施例提供的点云解码方法流程示意图。本申请实施例的点云解码方法可以由上述图3或图4B所示的点云解码设备或点云解码器完成。Fig. 8 is a schematic diagram of a point cloud decoding method according to an embodiment of the present application. The point cloud decoding method according to the embodiment of the present application can be implemented by the point cloud decoding device or point cloud decoder shown in Fig. 3 or Fig. 4B above.
如图8所示,本申请实施例的点云解码方法包括:As shown in FIG8 , the point cloud decoding method of the embodiment of the present application includes:
S101、确定当前节点对应的帧间解码信息。S101. Determine inter-frame decoding information corresponding to a current node.
其中,当前节点为当前待解码帧中的待解码节点。The current node is the node to be decoded in the current frame to be decoded.
由上述可知,点云包括几何信息和属性信息,对点云的解码包括几何解码和属性解码。本申请实施例涉及点云的几何解码。As can be seen from the above, the point cloud includes geometric information and attribute information, and the decoding of the point cloud includes geometric decoding and attribute decoding. The embodiment of the present application relates to geometric decoding of point clouds.
在一些实施例中,点云的几何信息也称为点云的位置信息,因此,点云的几何解码也称为点云的位置解码。In some embodiments, the geometric information of the point cloud is also referred to as the position information of the point cloud. Therefore, the geometric decoding of the point cloud is also referred to as the position decoding of the point cloud.
在基于八叉树的编码方式中,编码端基于点云的几何信息,构建点云的八叉树结构,如图9所示,使用最小长方体包围点云,首先对该包围盒进行八叉树划分,得到8个节点,对这8个节点中被占用的节点,即包括点的节点继续进行八叉树划分,以此类推,直到划分到体素级别位置,例如划分到1X1X1的正方体为止。这样划分得到的点云八叉树结构包括多层节点组成,例如包括N层,在编码时,逐层编码每一层的占位信息,直到编码完最后一层的体素级别的叶子节点为止。也就是说,在八叉树编码中,将点云通八叉树划分,最终将点云中的点划分到八叉树的体素级的叶子节点中,通过对整个八叉树进行编码,实现对点云的编码。In the octree-based encoding method, the encoding end constructs the octree structure of the point cloud based on the geometric information of the point cloud. As shown in Figure 9, the point cloud is enclosed by the smallest cuboid. The enclosing box is first divided into octrees to obtain 8 nodes. The occupied nodes among the 8 nodes, that is, the nodes including the points, continue to be divided into octrees, and so on, until the division is to the voxel level, for example, to a 1X1X1 cube. The point cloud octree structure obtained by such division includes multiple layers of nodes, for example, N layers. When encoding, the placeholder information of each layer is encoded layer by layer until the voxel-level leaf nodes of the last layer are encoded. That is to say, in octree encoding, the point cloud is divided through the octree, and finally the points in the point cloud are divided into the voxel-level leaf nodes of the octree. The encoding of the point cloud is achieved by encoding the entire octree.
对应的,解码端,首先解码点云的几何码流,得到该点云的八叉树的根节点的占位信息,并基于该根节点的占位信息,确定出该根节点所包括的子节点,即八叉树的第2层包括的节点。接着,解码几何码流,得到第2层中的各节点的占位信息,并基于各节点的占位信息,确定出八叉树的第3层所包括的节点,依次类推。Correspondingly, the decoding end first decodes the geometric code stream of the point cloud to obtain the placeholder information of the root node of the octree of the point cloud, and based on the placeholder information of the root node, determines the child nodes included in the root node, that is, the nodes included in the second layer of the octree. Then, the geometric code stream is decoded to obtain the placeholder information of each node in the second layer, and based on the placeholder information of each node, determines the nodes included in the third layer of the octree, and so on.
但基于八叉树的几何信息编码模式对空间中具有相关性的点有高效的压缩速率,而对于在几何空间中处于孤立位置的点来说,使用直接编码方式可以大大降低复杂度,提升编解码效率。However, the octree-based geometric information encoding mode has an efficient compression rate for correlated points in space, and for points in isolated positions in the geometric space, the use of direct encoding can greatly reduce the complexity and improve the encoding and decoding efficiency.
由于直接编码方式是对节点所包括的点的几何信息直接进行编码,若节点所包括的点数较多时,采用直接编码方式时压缩效果差。因此,对于八叉树中的节点,在进行直接编码之前,首先判断该节点是否可以采用直接编码方式。若判断该节点可以采用直接编码方式进行编码时,则采用直接编码方式对该节点所包括的点的几何信息进行直接编码。若判断该节点不可以采用直接编码方式进行编码时,则继续采用八叉树方式对该节点进行划分。Since the direct encoding method directly encodes the geometric information of the points included in the node, if the number of points included in the node is large, the compression effect is poor when the direct encoding method is used. Therefore, for the nodes in the octree, before direct encoding, first determine whether the node can be encoded using the direct encoding method. If it is determined that the node can be encoded using the direct encoding method, the direct encoding method is used to directly encode the geometric information of the points included in the node. If it is determined that the node cannot be encoded using the direct encoding method, the octree method is continued to be used to divide the node.
具体的,编码端首先判断节点是否具备直接编码的资格,若该节点具备直接编码的资格后,判断节点的点数是否小于或等于预设阈值,若节点的点数小于或等于预设阈值,则确定该节点可以采用直接编码方式进行解码。接着,将该节点所包括的点数,以及各点的几何信息编入码流。对应的,解码端在确定该节点具备直接解码的资格后,解码码流,得到该节点的点数,以及各点的几何信息,实现该节点的几何解码。Specifically, the encoding end first determines whether the node is qualified for direct encoding. If the node is qualified for direct encoding, it determines whether the number of points of the node is less than or equal to the preset threshold. If the number of points of the node is less than or equal to the preset threshold, it is determined that the node can be decoded by direct encoding. Then, the number of points included in the node and the geometric information of each point are encoded into the bitstream. Correspondingly, after determining that the node is qualified for direct decoding, the decoding end decodes the bitstream, obtains the number of points of the node and the geometric information of each point, and implements geometric decoding of the node.
由于对语法元素进行解码时,上下文模型选择的条件越多则得到的条件熵越小,解码性能越强。但是,目前对节点的点数进行解码时,未考虑帧间信息,导致可选择的上下文条件少得到的条件熵大,使得点数的熵解码性能差,进行降低点云的解码性能。When decoding syntax elements, the more conditions the context model selects, the smaller the conditional entropy is, and the better the decoding performance is. However, when decoding the points of a node, the inter-frame information is not considered, resulting in a small number of selectable context conditions and a large conditional entropy, which makes the entropy decoding performance of the points poor, and reduces the decoding performance of the point cloud.
为了解决上述问题,本申请实施例中,解码端基于当前节点对应的帧间解码信息,对当前节点的点数进行预测解码,进而增加了解码点数时可选择的上下文条件,使得确定出的条件熵减小,从而提升点云的解码效率和解码性能。In order to solve the above problems, in an embodiment of the present application, the decoding end predicts and decodes the number of points of the current node based on the inter-frame decoding information corresponding to the current node, thereby increasing the selectable context conditions when decoding the points, so that the determined conditional entropy is reduced, thereby improving the decoding efficiency and decoding performance of the point cloud.
具体的,解码端首先确定当前节点对应的帧间解码信息。Specifically, the decoding end first determines the inter-frame decoding information corresponding to the current node.
需要说明的是,当前待解码帧为一个点云帧,在一些实施例中,当前待解码帧也称为当前帧或当前点云帧或当前待解码的点云帧等。当前节点可以理解为当前待解码帧中的任意一个非叶子节点的非空节点。也就是说,当前节点不是当前待解码帧对应的八叉树中的叶子节点,即当前节点为八叉树的中间任意节点,且该当前节点不是非空节点,即至少包括1个点。It should be noted that the current frame to be decoded is a point cloud frame. In some embodiments, the current frame to be decoded is also referred to as the current frame or the current point cloud frame or the point cloud frame to be decoded. The current node can be understood as any non-leaf node in the current frame to be decoded. In other words, the current node is not a leaf node in the octree corresponding to the current frame to be decoded, that is, the current node is any middle node of the octree, and the current node is not a non-empty node, that is, it includes at least 1 point.
另外,本申请实施例所述的当前节点对应的帧间解码信息通常指帧间几何解码信息。In addition, the inter-frame decoding information corresponding to the current node described in the embodiment of the present application generally refers to inter-frame geometric decoding information.
本申请实施例对当前节点对应的帧间解码信息的具体类型不做限制,可以理解为当前节点涉及的任意帧间几何解码信息。例如,当前节点对应的帧间解码信息可以包括当前节点的相关帧间几何解码信息,和/或当前待解码帧的相关帧间几何解码信息。The embodiment of the present application does not limit the specific type of inter-frame decoding information corresponding to the current node, which can be understood as any inter-frame geometric decoding information involved in the current node. For example, the inter-frame decoding information corresponding to the current node may include relevant inter-frame geometric decoding information of the current node and/or relevant inter-frame geometric decoding information of the current frame to be decoded.
在一些实施例中,当前节点对应的帧间解码信息包括当前节点的N个预测节点的几何解码信息和当前节点对应的第一信息中的至少一个。其中,当前节点的N个预测节点为当前节点在帧间预测参考帧中的预测节点,当前节点对应的第一信息用于指示当前节点是否允许启动帧间解码方式。In some embodiments, the inter-frame decoding information corresponding to the current node includes at least one of the geometric decoding information of the N prediction nodes of the current node and the first information corresponding to the current node. The N prediction nodes of the current node are the prediction nodes of the current node in the inter-frame prediction reference frame, and the first information corresponding to the current node is used to indicate whether the current node is allowed to start the inter-frame decoding mode.
此时,上述S101中确定当前节点对应的帧间解码信息包括如下S101-A的步骤:At this time, determining the inter-frame decoding information corresponding to the current node in the above S101 includes the following step S101-A:
S101-A、在当前待解码帧的预测参考帧中,确定当前节点的N个预测节点,和/或解码点云码流,得到第一信息。S101-A. In a prediction reference frame of a current frame to be decoded, determine N prediction nodes of a current node, and/or decode a point cloud code stream to obtain first information.
其中,N为正整数。Wherein, N is a positive integer.
在本申请实施例中,解码端在解码当前待解码帧中的当前节点时,判断该当前节点是否具备直接解码资格,在确定当前节点具备直接解码资格时,对当前节点的点数进行解码。其中,解码端在对当前节点的点数进行解码时,首先确定当前待解码帧的预测参考帧,并在预测参考帧中,确定当前节点的N个预测节点。例如图10示出了当前节点在预测参考帧中的一个预测节点。In an embodiment of the present application, when decoding the current node in the current frame to be decoded, the decoding end determines whether the current node is qualified for direct decoding, and when determining that the current node is qualified for direct decoding, decodes the number of points of the current node. When decoding the number of points of the current node, the decoding end first determines the predicted reference frame of the current frame to be decoded, and determines N predicted nodes of the current node in the predicted reference frame. For example, FIG. 10 shows a predicted node of the current node in the predicted reference frame.
需要说明的是,本申请实施例对当前待解码帧的预测参考帧的个数不做限制,例如,当前待解码帧具有一个预测参考帧,或者当前待解码帧具有多个预测参考帧。同时,本申请实施例对当前节点的预测节点的个数N也不做限制,具体根据实际需要确定。It should be noted that the embodiment of the present application does not limit the number of prediction reference frames of the current frame to be decoded. For example, the current frame to be decoded has one prediction reference frame, or the current frame to be decoded has multiple prediction reference frames. At the same time, the embodiment of the present application does not limit the number N of prediction nodes of the current node, which is determined according to actual needs.
本申请实施例对确定当前待解码帧的预测参考帧的具体方式也不做限制。The embodiment of the present application does not limit the specific method of determining the prediction reference frame of the current frame to be decoded.
在一些实施例中,将当前待解码帧的前一个或前几个已解码帧,确定为该当前待解码帧的预测参考帧。In some embodiments, one or several decoded frames before the current frame to be decoded are determined as prediction reference frames of the current frame to be decoded.
例如,若当前待解码帧为P帧,P帧在帧间参考帧包括P帧的前一帧(即前向帧),因此,可以将当前待解码帧的前一帧(即前向帧),确定为当前待解码帧的预测参考帧。For example, if the current frame to be decoded is a P frame, the inter-frame reference frame of the P frame includes the previous frame of the P frame (ie, the forward frame). Therefore, the previous frame of the current frame to be decoded (ie, the forward frame) can be determined as the predicted reference frame of the current frame to be decoded.
再例如,若当前待解码帧为B帧,B帧的帧间参考帧包括P帧的前一帧(即前向帧)和P帧的后一帧(即后向帧),因此,可以将当前待解码帧的前一帧(即前向帧),确定为当前待解码帧的预测参考帧。For another example, if the current frame to be decoded is a B frame, the inter-frame reference frames of the B frame include the previous frame of the P frame (i.e., the forward frame) and the next frame of the P frame (i.e., the backward frame). Therefore, the previous frame of the current frame to be decoded (i.e., the forward frame) can be determined as the predicted reference frame of the current frame to be decoded.
在一些实施例中,将当前待解码帧的后一个或后几个已解码帧,确定为该当前待解码帧的预测参考帧。In some embodiments, one or several decoded frames following the current frame to be decoded are determined as prediction reference frames of the current frame to be decoded.
例如,若当前待解码帧为B帧,则可以将当前待解码帧的后一帧,确定为当前待解码帧的预测参考帧。For example, if the current frame to be decoded is a B frame, the frame following the current frame to be decoded may be determined as a prediction reference frame of the current frame to be decoded.
在一些实施例中,将当前待解码帧的前一个或前几个已解码帧,以及当前待解码帧的后一个或后几个已解码帧,确定为该当前待解码帧的预测参考帧。In some embodiments, one or several decoded frames before the current frame to be decoded, and one or several decoded frames after the current frame to be decoded, are determined as prediction reference frames of the current frame to be decoded.
例如,若当前待解码帧为B帧,则可以将当前待解码帧的前一帧和后一帧,确定为当前待解码帧的预测参考帧,此时,当前待解码帧具有2个预测参考帧。For example, if the current frame to be decoded is a B frame, the previous frame and the next frame of the current frame to be decoded may be determined as prediction reference frames of the current frame to be decoded. In this case, the current frame to be decoded has two prediction reference frames.
下面以当前待解码帧包括K个预测参考帧为例,对上述S101-A中在当前待解码帧的预测参考帧中,确定当前节点的N个预测节点的具体过程进行介绍。Taking the current frame to be decoded including K prediction reference frames as an example, the specific process of determining N prediction nodes of the current node in the prediction reference frame of the current frame to be decoded in S101-A above is introduced.
在一些实施例中,解码端基于当前待解码帧中节点的占位信息,以及K个预测参考帧中每一个预测参考帧中节点的占位信息,从K个预测参考帧中选出至少一个预测参考帧,进而在该至少一个预测参考帧中,查找当前节点的预测节点。例如,从K个预测参考帧中,选出节点的占位信息与当前待解码帧的节点的占位信息最相近的至少一个预测参考帧,进而在这至少一个预测参考帧中,查找当前节点的预测节点。In some embodiments, the decoding end selects at least one prediction reference frame from the K prediction reference frames based on the placeholder information of the node in the current frame to be decoded and the placeholder information of the node in each of the K prediction reference frames, and then searches for the prediction node of the current node in the at least one prediction reference frame. For example, at least one prediction reference frame whose placeholder information of the node is closest to the placeholder information of the node in the current frame to be decoded is selected from the K prediction reference frames, and then searches for the prediction node of the current node in the at least one prediction reference frame.
在一些实施例中,解码端可以通过如下S101-A1和S101-A2的步骤,确定当前节点的N个预测节点:In some embodiments, the decoding end may determine N predicted nodes of the current node through the following steps S101-A1 and S101-A2:
S101-A1、针对K个预测参考帧中的第k个预测参考帧,确定当前节点在第k个预测参考帧中的至少一个预测节点,k为小于或等于K的正整数,K为正整数;S101-A1, for a k-th prediction reference frame among K prediction reference frames, determining at least one prediction node of a current node in the k-th prediction reference frame, where k is a positive integer less than or equal to K, and K is a positive integer;
S101-A2、基于当前节点在K个预测参考帧中的至少一个预测节点,确定当前节点的N个预测节点。S101-A2. Determine N prediction nodes of the current node based on at least one prediction node of the current node in K prediction reference frames.
在该实施例中,解码端从K个预测参考帧中的每一个预测参考帧中,确定出当前节点的至少一个预测节点,最后将K个预测参考帧中各预测参考帧中的至少一个预测节点进行汇总,得到当前节点的N个预测节点。In this embodiment, the decoding end determines at least one prediction node of the current node from each of the K prediction reference frames, and finally summarizes at least one prediction node in each of the K prediction reference frames to obtain N prediction nodes of the current node.
其中,解码端确定当前节点在K个预测参考帧中每一个预测参考帧中的至少一个预测点的过程相同,为了便于描述,在此以K个预测参考帧中的第k个预测参考帧为例进行说明。Among them, the process of the decoding end determining at least one prediction point of the current node in each of the K prediction reference frames is the same. For the convenience of description, the kth prediction reference frame among the K prediction reference frames is taken as an example for explanation.
下面对上述S101-A1中确定当前节点在第k个预测参考帧中的至少一个预测节点的具体过程进行介绍。The specific process of determining at least one prediction node of the current node in the kth prediction reference frame in the above S101-A1 is introduced below.
本申请实施例对解码端确定当前节点在第k个预测参考帧中的至少一个预测节点的具体方式不做限制。The embodiment of the present application does not limit the specific manner in which the decoding end determines at least one prediction node of the current node in the kth prediction reference frame.
方式一,在第k个预测参考帧中,确定出当前节点的一个预测节点。例如,将第k个预测参考帧中与当前节点的划分深度相同的一个节点,确定为当前节点的预测节点。Method 1: In the kth prediction reference frame, a prediction node of the current node is determined. For example, a node in the kth prediction reference frame having the same division depth as the current node is determined as the prediction node of the current node.
举例说明,假设当前节点位于当前待解码帧的八叉树的第3层,这样可以获取第k个预测参考帧中位于八叉树第3层的各节点,进而从这些节点中,确定出当前节点的预测节点。For example, assuming that the current node is located at the third layer of the octree of the current frame to be decoded, the nodes located at the third layer of the octree in the kth prediction reference frame can be obtained, and then the prediction node of the current node can be determined from these nodes.
在一种示例中,若当前节点在第k个预测参考帧中的预测节点的个数为1个时,则可以将第k个预测参考帧与当前节点处于相同划分深度的这些点中,选出占位信息与当前节点的占位信息差异最小的一个节点,记为节点1,将该节点1确定为当前节点在第k个预测参考帧中的一个预测节点。In one example, if the number of prediction nodes of the current node in the kth prediction reference frame is 1, then among the points at which the kth prediction reference frame and the current node are at the same division depth, a node whose occupancy information is the smallest different from that of the current node can be selected, recorded as node 1, and node 1 is determined as a prediction node of the current node in the kth prediction reference frame.
在另一种示例中,若当前节点在第k个预测参考帧中的预测节点的个数大于1时,则将上述确定的节点1,以及节点1在第k个预测参考帧中的至少一个领域节点,例如与节点1共面、共线、共点等的至少一个领域节点,确定为当前节点在第k个预测参考帧中的预测节点。In another example, if the number of prediction nodes of the current node in the kth prediction reference frame is greater than 1, the node 1 determined above and at least one domain node of node 1 in the kth prediction reference frame, such as at least one domain node that is coplanar, colinear, or co-point with node 1, are determined as the prediction nodes of the current node in the kth prediction reference frame.
方式二,上述S101-A1中确定当前节点在第k个预测参考帧中的至少一个预测节点,包括如下S101-A11至S101-A13的步骤: Mode 2, in the above S101-A1, determining at least one prediction node of the current node in the kth prediction reference frame includes the following steps S101-A11 to S101-A13:
S101-A11、在当前待解码帧中,确定当前节点的M个领域节点,M个领域节点中包括当前节点,M为正整数;S101-A11, in the current frame to be decoded, determine M domain nodes of the current node, the M domain nodes include the current node, and M is a positive integer;
S101-A12、针对M个领域节点中的第i个领域节点,确定第i个领域节点在第k个预测参考帧中的对应节点,i为小于或等于M的正整数;S101-A12, for the i-th domain node among the M domain nodes, determine the corresponding node of the i-th domain node in the k-th prediction reference frame, where i is a positive integer less than or equal to M;
S101-A13、基于M个领域节点在第k个预测参考帧中的对应节点,确定当前节点在第k个预测参考帧中的至少一个预测节点。S101-A13. Determine at least one prediction node of the current node in the kth prediction reference frame based on the corresponding nodes of the M domain nodes in the kth prediction reference frame.
在该实现方式中,解码端在确定当前节点在第k个预测参考帧中的至少一个预测节点之前,首先在当前待解码帧中确定当前节点的M个领域节点,该M个领域节点中包括当前节点自身。In this implementation, before determining at least one prediction node of the current node in the kth prediction reference frame, the decoding end first determines M domain nodes of the current node in the current frame to be decoded, and the M domain nodes include the current node itself.
需要说明的是,在本申请实施例中,对当前节点的M个领域节点的具体确定方式不做限制。It should be noted that in the embodiment of the present application, there is no limitation on the specific method of determining the M domain nodes of the current node.
在一种示例中,当前节点的M个领域节点包括当前待解码帧中,与当前节点共面、共线和共点的领域节点中的至少一个领域节点。如图11所示,当前节点包括6个共面的节点、12个共线的节点和8个共点的节点。In one example, the M domain nodes of the current node include at least one domain node among the domain nodes that are coplanar, colinear, and co-pointed with the current node in the current frame to be decoded. As shown in FIG11 , the current node includes 6 coplanar nodes, 12 colinear nodes, and 8 co-pointed nodes.
在另一种示例中,当前节点的M个领域节点中除了包括当前待解码帧中,与当前节点共面、共线和共点的领域节点中的至少一个领域节点外,还可以包括参考邻域范围内的其他节点,本申请实施例对此不做限制。In another example, the M domain nodes of the current node may include other nodes within the reference neighborhood in addition to at least one domain node in the current frame to be decoded that is coplanar, colinear, and co-point with the current node. This embodiment of the present application does not impose any restrictions on this.
解码端基于上述步骤,在当前待解码帧中确定出当前节点的M个领域节点后,确定出M个领域节点中每一个领域节点在第k个预测参考帧中的对应节点,进而基于M个领域节点在第k个预测参考帧中的对应节点,确定当前节点在第k个预测参考帧中的至少一个预测节点。Based on the above steps, the decoding end determines the M domain nodes of the current node in the current frame to be decoded, and then determines the corresponding node of each of the M domain nodes in the kth prediction reference frame, and then determines at least one prediction node of the current node in the kth prediction reference frame based on the corresponding nodes of the M domain nodes in the kth prediction reference frame.
本申请实施例对S101-A13的具体实现方式不做限制。The embodiment of the present application does not limit the specific implementation method of S101-A13.
在一种可能的实现方式中,从M个领域节点在第k个预测参考帧中的对应节点中,筛选出至少一个对应节点,作为当前节点在第k个预测参考帧中的至少一个预测节点。例如,从M个领域节点在第k个预测参考帧中的对应节点中,筛选出占位信息与当前节点的占位信息差异最小的至少一个对应节点,作为当前节点在第k个预测参考帧中的至少一个预测节点。其中,确定对应节点的占位信息与当前节点的占位信息的差异的方式可以参照上述占位信息的差异的确定过程,例如将对应节点的占位信息与当前节点的占位信息进行异或运算,将异或运算结果作为该对应节点的占位信息与当前节点的占位信息之间的差异。In a possible implementation, at least one corresponding node is selected from the corresponding nodes of the M domain nodes in the k-th prediction reference frame as at least one prediction node of the current node in the k-th prediction reference frame. For example, at least one corresponding node whose placeholder information has the smallest difference with the placeholder information of the current node is selected from the corresponding nodes of the M domain nodes in the k-th prediction reference frame as at least one prediction node of the current node in the k-th prediction reference frame. The method of determining the difference between the placeholder information of the corresponding node and the placeholder information of the current node can refer to the above-mentioned process of determining the difference in placeholder information, for example, performing an XOR operation on the placeholder information of the corresponding node and the placeholder information of the current node, and using the XOR operation result as the difference between the placeholder information of the corresponding node and the placeholder information of the current node.
在另一种可能的实现方式中,解码端将M个领域节点在第k个预测参考帧中的对应节点,确定为当前节点在第k个预测参考帧中的至少一个预测节点。例如,M个领域节点分别在第k个预测参考帧中具有一个对应节点,进而有M个对应节点,将这M个对应节点确定为当前节点在第k个预测参考帧中的预测节点,共有M个预测节点。In another possible implementation, the decoding end determines the corresponding nodes of the M domain nodes in the kth prediction reference frame as at least one prediction node of the current node in the kth prediction reference frame. For example, the M domain nodes each have a corresponding node in the kth prediction reference frame, and then there are M corresponding nodes, and these M corresponding nodes are determined as the prediction nodes of the current node in the kth prediction reference frame, and there are M prediction nodes in total.
上面对确定当前节点在第k个预测参考帧中的至少一个预测节点的过程进行介绍。这样,解码端可以采用与上述相同的方式,确定出当前节点在K个预测参考帧中每一个预测参考帧中的至少一个预测节点。The above describes the process of determining at least one prediction node of the current node in the kth prediction reference frame. In this way, the decoding end can use the same method as above to determine at least one prediction node of the current node in each of the K prediction reference frames.
例如,若当前待解码帧为P帧时,K个预测参考帧包括当前待解码帧的前向帧。此时,解码端基于上述步骤,可以确定出当前节点在前向帧中的至少一个预测节点。示例性的,如图13A所示,假设当前节点包括3个领域节点,分别记为节点11、节点12(为当前节点)和节点13,这3个领域节点在前向帧中分别对应一个对应节点,分别记为节点21、节点22和节点23,进而将节点21、节点22和节点23确定为当前节点在前向帧中的3个预测节点,或者从节点21、节点22和节点23中选出1或2个节点确定为当前节点在前向帧中的1个或2个预测节点。For example, if the current frame to be decoded is a P frame, the K predicted reference frames include the forward frame of the current frame to be decoded. At this point, the decoding end can determine at least one predicted node of the current node in the forward frame based on the above steps. Exemplarily, as shown in FIG13A, it is assumed that the current node includes three domain nodes, which are respectively recorded as node 11, node 12 (current node) and node 13. These three domain nodes correspond to a corresponding node in the forward frame, respectively recorded as node 21, node 22 and node 23, and then node 21, node 22 and node 23 are determined as the three predicted nodes of the current node in the forward frame, or 1 or 2 nodes are selected from node 21, node 22 and node 23 to be determined as 1 or 2 predicted nodes of the current node in the forward frame.
再例如,若当前待解码帧为B帧时,K个预测参考帧包括当前待解码帧的前向帧和后向帧。此时,解码端基于上述步骤,可以确定出当前节点在前向帧中的至少一个预测节点,以及当前节点在后向帧中的至少一个预测节点。示例性的,如图13B所示,假设当前节点包括3个领域节点,分别记为节点11、节点12和节点13,这3个领域节点在前向帧中分别对应一个对应节点记为节点21、节点22和节点23,这3个领域节点在后向帧中分别对应一个对应节点记为节点41、节点42和节点43。这样解码端可以将节点21、节点22和节点23确定为当前节点在前向帧中的3个预测节点,或者从节点21、节点22和节点23中选出1或2个节点确定为当前节点在前向帧中的1个或2个预测节点。同理,解码端可以将节点41、节点42和节点43确定为当前节点在后向帧中的3个预测节点,或者从节点41、节点42和节点43中选出1或2个节点确定为当前节点在后向帧中的1个或2个预测节点。For another example, if the current frame to be decoded is a B frame, the K prediction reference frames include the forward frame and the backward frame of the current frame to be decoded. At this time, based on the above steps, the decoding end can determine at least one prediction node of the current node in the forward frame, and at least one prediction node of the current node in the backward frame. Exemplarily, as shown in FIG13B, it is assumed that the current node includes three domain nodes, which are respectively recorded as nodes 11, 12, and 13. These three domain nodes correspond to a corresponding node in the forward frame, which are recorded as nodes 21, 22, and 23, respectively. These three domain nodes correspond to a corresponding node in the backward frame, which are recorded as nodes 41, 42, and 43. In this way, the decoding end can determine nodes 21, 22, and 23 as the three prediction nodes of the current node in the forward frame, or select 1 or 2 nodes from nodes 21, 22, and 23 to determine as 1 or 2 prediction nodes of the current node in the forward frame. Similarly, the decoding end can determine node 41, node 42 and node 43 as three prediction nodes of the current node in the backward frame, or select one or two nodes from node 41, node 42 and node 43 as one or two prediction nodes of the current node in the backward frame.
解码端确定出当前节点在K个预测参考帧中每一个预测参考帧中的至少一个预测节点后,执行上述S101-B的步骤,即基于当前节点在K个预测参考帧中的至少一个预测节点,确定当前节点的N个预测节点。After the decoding end determines at least one prediction node of the current node in each of the K prediction reference frames, it executes the above step S101-B, that is, based on at least one prediction node of the current node in the K prediction reference frames, determines N prediction nodes of the current node.
在一种示例中,将当前节点在K个预测参考帧中的至少一个预测节点,确定为当前节点的N个预测节点。In an example, at least one prediction node of the current node in K prediction reference frames is determined as N prediction nodes of the current node.
例如,K=2,即K个预测参考帧包括第一个预测参考帧和第二个预测参考帧。假设当前节点在第一个预测参考帧中有2个预测节点,当前节点在第二个预测参考帧中有3个预测节点,这样可以确定当前节点具有5个预测节点,此时N=5。For example, K=2, that is, the K prediction reference frames include the first prediction reference frame and the second prediction reference frame. Assuming that the current node has 2 prediction nodes in the first prediction reference frame and the current node has 3 prediction nodes in the second prediction reference frame, it can be determined that the current node has 5 prediction nodes, and N=5 at this time.
在另一种示例中,从当前节点在K个预测参考帧中的至少一个预测节点中,筛选出当前节点的N个预测节点。In another example, N prediction nodes of the current node are screened out from at least one prediction node of the current node in K prediction reference frames.
继续参照上述示例,假设K=2,即K个预测参考帧包括第一个预测参考帧和第二个预测参考帧。假设当前节点在第一个预测参考帧中有2个预测节点,当前节点在第二个预测参考帧中有3个预测节点。从这5个预测节点中,选出3个预测节点作为当前节点的最终预测节点。例如,从这5个预测节点中,选出占位信息与当前节点的占位信息差异最小的3个预测节点,确定为当前节点的最终预测节点。Continuing with the above example, assume that K=2, that is, the K prediction reference frames include the first prediction reference frame and the second prediction reference frame. Assume that the current node has 2 prediction nodes in the first prediction reference frame, and the current node has 3 prediction nodes in the second prediction reference frame. From these 5 prediction nodes, select 3 prediction nodes as the final prediction nodes of the current node. For example, from these 5 prediction nodes, select the 3 prediction nodes whose placeholder information has the smallest difference with the placeholder information of the current node, and determine them as the final prediction nodes of the current node.
在该方式二中,解码端在当前待解码帧中确定出当前节点的M个领域节点后,在第k个预测参考帧中,确定出这M个领域节点中每一个领域节点的对应节点,进而基于M个领域节点中每一个领域节点的对应节点,确定当前节点在该第k个预测参考帧中的至少一个预测点。In the second method, after the decoding end determines the M domain nodes of the current node in the current frame to be decoded, it determines the corresponding node of each of the M domain nodes in the kth prediction reference frame, and then determines at least one prediction point of the current node in the kth prediction reference frame based on the corresponding node of each of the M domain nodes.
方式三,上述S101-A1中确定当前节点在第k个预测参考帧中的至少一个预测节点,包括如下S101-B11至S101-B13的步骤:Mode 3, in the above S101-A1, determining at least one prediction node of the current node in the kth prediction reference frame includes the following steps S101-B11 to S101-B13:
S101-B11、确定当前节点在第k个预测参考帧中的对应节点;S101-B11, determining a corresponding node of the current node in the kth prediction reference frame;
S101-B12、确定对应节点的至少一个领域节点;S101-B12, determining at least one domain node of the corresponding node;
S101-B13、将至少一个领域节点,确定为当前节点在第k个预测参考帧中的至少一个预测节点。S101-B13. Determine at least one domain node as at least one prediction node of the current node in the kth prediction reference frame.
在该方式3中,针对K个预测参考帧中的每一个预测参考帧,解码端首先在各预测参考帧中确定该当前节点的对应节点。例如,确定当前节点在预测参考帧1中的对应节点1,确定当前节点在预测参考帧2中的对应节点2。接着,解码端确定各对应节点的至少一个领域节点。例如,在预测参考帧1中确定对应节点1的至少一个领域节点,以及在预测参考帧2中确定对应节点2的至少一个领域节点。这样可以将对应节点1在预测参考帧1中的至少一个领域节点,确定为当前节点在预测参考帧1中的至少一个预测节点,将对应节点2在预测参考帧2中的至少一个领域节点,确定为当前节点在预测参考帧2中的至少一个预测节点。In this method 3, for each of the K prediction reference frames, the decoding end first determines the corresponding node of the current node in each prediction reference frame. For example, the corresponding node 1 of the current node in the prediction reference frame 1 is determined, and the corresponding node 2 of the current node in the prediction reference frame 2 is determined. Next, the decoding end determines at least one domain node of each corresponding node. For example, at least one domain node of the corresponding node 1 is determined in the prediction reference frame 1, and at least one domain node of the corresponding node 2 is determined in the prediction reference frame 2. In this way, at least one domain node of the corresponding node 1 in the prediction reference frame 1 can be determined as at least one prediction node of the current node in the prediction reference frame 1, and at least one domain node of the corresponding node 2 in the prediction reference frame 2 can be determined as at least one prediction node of the current node in the prediction reference frame 2.
方式二的S101-A12中确定第i个领域节点在第k个预测参考帧中的对应节点,与上述方式三的S101-B11中确定当前节点在第k个预测参考帧中的对应节点的过程基本相同。为了便于描述,将上述第i个领域节点和当前节点记为第i个节点,下面对确定第i个节点在第k个预测参考帧中的对应节点的具体过程进行介绍。The process of determining the corresponding node of the i-th domain node in the k-th prediction reference frame in S101-A12 of the second method is basically the same as the process of determining the corresponding node of the current node in the k-th prediction reference frame in S101-B11 of the above-mentioned third method. For the convenience of description, the above-mentioned i-th domain node and the current node are recorded as the i-th node, and the specific process of determining the corresponding node of the i-th node in the k-th prediction reference frame is introduced below.
解码端确定第i个节点在第k个预测参考帧中的对应节点至少包括如下几种方式:The decoding end determines the corresponding node of the i-th node in the k-th prediction reference frame in at least the following ways:
方式1,将第k个预测参考帧中与第i个节点的划分深度相同的一个节点,确定为第i个节点的对应节点。Method 1: Determine a node in the k-th prediction reference frame that has the same division depth as the i-th node as the corresponding node of the i-th node.
举例说明,假设第i个节点位于当前待解码帧的八叉树的第3层,这样可以获取第k个预测参考帧中位于八叉树 第3层的各节点,进而从这些节点中,确定出第i个节点的对应节点。例如,将第k个预测参考帧与第i个节点处于相同划分深度的这些点中,选出占位信息与第i个节点的占位信息差异最小的一个节点,确定为第i个节点在第k个预测参考帧中的对应节点。For example, assuming that the i-th node is located at the third layer of the octree of the current frame to be decoded, the nodes located at the third layer of the octree in the k-th prediction reference frame can be obtained, and then the corresponding node of the i-th node can be determined from these nodes. For example, among the points at which the k-th prediction reference frame and the i-th node are at the same division depth, a node whose placeholder information has the smallest difference with the placeholder information of the i-th node is selected and determined as the corresponding node of the i-th node in the k-th prediction reference frame.
方式2、上述S101-A12和S101-B11包括如下步骤:Mode 2: The above S101-A12 and S101-B11 include the following steps:
S101-A121、在当前待解码帧中,确定第i个节点的父节点,作为第i个父节点;S101-A121, in the current frame to be decoded, determine the parent node of the i-th node as the i-th parent node;
S101-A122、确定第i个父节点在第k个预测参考帧中的匹配节点,作为第i个匹配节点;S101-A122, determining a matching node of the i-th parent node in the k-th prediction reference frame as the i-th matching node;
S101-A123、将i个匹配节点的子节点中的一个子节点,确定为第i个节点在第k个预测参考帧中的对应节点。S101-A123, determine one of the child nodes of the i matching nodes as the corresponding node of the i-th node in the k-th prediction reference frame.
在该方式2中,对于第i个节点,解码端在当前待解码帧中,确定该第i个节点的父节点,进而在第k个预测参考帧中,确定第i个预测领域节点的父节点的匹配节点。为了便于描述,将第i个节点的父节点记为第i个父节点,将第i个节点的父节点在第k个预测参考帧中的匹配节点,确定为第i个匹配节点。接着,将第i个匹配节点的子节点的一个子节点,确定为第i个节点在第k个预测参考帧中的对应节点,实现对第i个节点在第k个预测参考帧中的对应节点的准确确定。In the method 2, for the i-th node, the decoding end determines the parent node of the i-th node in the current frame to be decoded, and then determines the matching node of the parent node of the i-th prediction domain node in the k-th prediction reference frame. For the convenience of description, the parent node of the i-th node is recorded as the i-th parent node, and the matching node of the parent node of the i-th node in the k-th prediction reference frame is determined as the i-th matching node. Then, a child node of the child node of the i-th matching node is determined as the corresponding node of the i-th node in the k-th prediction reference frame, thereby accurately determining the corresponding node of the i-th node in the k-th prediction reference frame.
下面对上述S101-A122中确定第i个父节点在第k个预测参考帧中的匹配节点的具体过程进行介绍。The specific process of determining the matching node of the i-th parent node in the k-th prediction reference frame in the above S101-A122 is introduced below.
本申请实施例对解码端确定第i个父节点在第k个预测参考帧中的匹配节点的具体方式不做限制。The embodiment of the present application does not limit the specific manner in which the decoding end determines the matching node of the i-th parent node in the k-th prediction reference frame.
在一些实施例中,确定第i个父节点在当前待解码帧中的划分深度,例如第i个父节点在当前待解码帧的八叉树的第2层。这样,解码端可以将第k个预测参考帧中与第i个父节点的划分深度相同的各节点中的一个节点,确定为第i个父节点在第k个预测参考帧中的匹配节点。例如将第k个预测参考帧中处于第2层的各节点中的一个节点,确定为第i个父节点在第k个预测参考帧中的匹配节点。In some embodiments, the division depth of the i-th parent node in the current frame to be decoded is determined, for example, the i-th parent node is in the second layer of the octree of the current frame to be decoded. In this way, the decoding end can determine one of the nodes in the k-th prediction reference frame that have the same division depth as the i-th parent node as the matching node of the i-th parent node in the k-th prediction reference frame. For example, one of the nodes in the second layer in the k-th prediction reference frame is determined as the matching node of the i-th parent node in the k-th prediction reference frame.
在一些实施例中,解码端基于第i个父节点的占位信息,确定第i个父节点在第k个预测参考帧中的匹配节点。具体的,由于当前待解码帧中的第i个父节点的占位信息已解码,且第k个预测参考帧中的各节点的占位信息也已解码。这样,解码端可以基于第i个父节点的占位信息,在第k个预测参考帧中查找该第i个父节点的匹配节点。In some embodiments, the decoding end determines the matching node of the i-th parent node in the k-th prediction reference frame based on the placeholder information of the i-th parent node. Specifically, since the placeholder information of the i-th parent node in the current frame to be decoded has been decoded, and the placeholder information of each node in the k-th prediction reference frame has also been decoded, the decoding end can search for the matching node of the i-th parent node in the k-th prediction reference frame based on the placeholder information of the i-th parent node.
例如,将第k个预测参考帧中,占位信息与第i个父节点的占位信息之间的差异最小的节点,确定为第i个父节点在第k个预测参考帧中的匹配节点。For example, a node whose placeholder information in the k-th prediction reference frame has the smallest difference with the placeholder information of the i-th parent node is determined as a matching node of the i-th parent node in the k-th prediction reference frame.
举例说明,假设第i个父节点的占位信息为11001101,在第k个预测参考帧中查询占位信息与占位信息11001101差异最小的节点。具体是,解码端将第i个父节点的占位信息,与第k个预测参考帧中的每一个节点的占位信息进行异或运算,将第k个预测参考帧中的异或运算结果最小的一个节点,确定为第i个父节点在第k个预测参考帧中匹配节点。For example, assuming that the placeholder information of the i-th parent node is 11001101, the node whose placeholder information is the smallest different from the placeholder information 11001101 is searched in the k-th prediction reference frame. Specifically, the decoding end performs an XOR operation on the placeholder information of the i-th parent node and the placeholder information of each node in the k-th prediction reference frame, and determines the node with the smallest XOR operation result in the k-th prediction reference frame as the matching node of the i-th parent node in the k-th prediction reference frame.
示例性的,假设第k个预测参考帧中的节点1的占位信息为10001101,将11001101与10001101进行异或运算,其中11001101的第1位与10001101的第1位均为1,因此,两者的第1位的异或运算结果为0,11001101的第2位与10001111的第2位不相同,因此,两者的第2位的异或运算结果为1,依次类推,得到11001101与10001111的异或运算结果为0+1+0+0+0+0+1+0=2。参照该方式,解码端可以确定出第i个父节点的占位信息与第k个预测参考帧中每一个节点的占位信息的异或运算结果,进而将第k个预测参考帧中,与第i个父节点的占位信息的异或运算最小的节点,确定为第i个父节点在第k个预测参考帧中的匹配节点。For example, assuming that the occupancy information of node 1 in the kth predicted reference frame is 10001101, 11001101 and 10001101 are XORed, where the first bit of 11001101 and the first bit of 10001101 are both 1, so the XOR result of the first bit of the two is 0, the second bit of 11001101 is different from the second bit of 10001111, so the XOR result of the second bit of the two is 1, and so on, the XOR result of 11001101 and 10001111 is 0+1+0+0+0+0+1+0=2. According to this method, the decoding end can determine the XOR operation result of the occupancy information of the i-th parent node and the occupancy information of each node in the k-th prediction reference frame, and then determine the node in the k-th prediction reference frame with the smallest XOR operation with the occupancy information of the i-th parent node as the matching node of the i-th parent node in the k-th prediction reference frame.
基于上述步骤,解码端可以确定出第i个父节点在第k个预测参考帧中的匹配节点。为了便于描述,将该匹配节点记为第i个匹配节点。Based on the above steps, the decoding end can determine the matching node of the i-th parent node in the k-th prediction reference frame. For ease of description, the matching node is recorded as the i-th matching node.
接着,解码端将该第i个匹配节点的子节点中的一个子节点,确定为第i个领域节点在第k个预测参考帧中的对应节点。Next, the decoding end determines one of the child nodes of the i-th matching node as the corresponding node of the i-th domain node in the k-th prediction reference frame.
例如,解码端将该第i个匹配节点所包括的子节点中的一个默认子节点,确定为第i个节点在第k个预测参考帧中的对应节点。假设,将第i个匹配节点的第1个子节点,确定为第i个节点在第k个预测参考帧中的对应节点。For example, the decoding end determines a default child node among the child nodes included in the i-th matching node as the corresponding node of the i-th node in the k-th prediction reference frame. Assume that the first child node of the i-th matching node is determined as the corresponding node of the i-th node in the k-th prediction reference frame.
再例如,解码端确定第i个节点在父节点所包括的子节点中的第一序号;将第i个匹配节点的子节点中序号为第一序号的子节点,确定为第i个节点在第k个预测参考帧中的对应节点。示例性的,如图12所示,第i个节点为第i个父节点的第2个子节点,此时第一序号为2。这样可以将第i个匹配节点的第2个子节点,确定为第i个节点的对应节点。For another example, the decoding end determines the first sequence number of the i-th node in the child nodes included in the parent node; the child node with the first sequence number in the child nodes of the i-th matching node is determined as the corresponding node of the i-th node in the k-th prediction reference frame. Exemplarily, as shown in FIG12, the i-th node is the second child node of the i-th parent node, and the first sequence number is 2. In this way, the second child node of the i-th matching node can be determined as the corresponding node of the i-th node.
上面对确定M个领域节点中第i个领域节点,在第k个预测参考帧中的对应节点,以及确定当前节点在第k个预测参考帧中的对应节点的确定过程进行介绍。这样,解码端可以采用方式二或方式三的方式,确定出当前节点在预测参考帧中的N个预测节点。The above describes the process of determining the corresponding node of the i-th domain node in the M domain nodes in the k-th prediction reference frame, and determining the corresponding node of the current node in the k-th prediction reference frame. In this way, the decoding end can use the second or third method to determine the N prediction nodes of the current node in the prediction reference frame.
下面对S101-A中确定第一信息的过程进行介绍。The process of determining the first information in S101-A is introduced below.
由上述可知,该第一信息用于指示当前节点是否允许启动帧间解码方式。示例性的,第一信息可以为predFlag,若predFlag为1时,表示当前节点允许启动帧间解码方式,若predFlag为0时,表示当前节点不允许启动帧间解码方式。As can be seen from the above, the first information is used to indicate whether the current node is allowed to start the inter-frame decoding mode. Exemplarily, the first information can be predFlag, if predFlag is 1, it means that the current node is allowed to start the inter-frame decoding mode, if predFlag is 0, it means that the current node is not allowed to start the inter-frame decoding mode.
在一种示例中,上述第一信息可以为序列级信息,即该第一信息用于指示当前点云序列是否允许启动帧间解码。其中,当前点云序列中包括至少一个点云帧,这至少一个点云帧中包括当前待解码帧。这样,解码端可以基于该序列级的第一信息,确定出当前节点是否启动帧间解码。In one example, the first information may be sequence-level information, that is, the first information is used to indicate whether the current point cloud sequence allows inter-frame decoding to be started. The current point cloud sequence includes at least one point cloud frame, and the at least one point cloud frame includes the current frame to be decoded. In this way, the decoding end can determine whether the current node starts inter-frame decoding based on the first information at the sequence level.
在另一种示例中,上述第一信息为帧级信息,即该第一信息用于指示当前待解码帧是否允许启动帧间解码。这样,解码端可以基于该帧级的第一信息,确定出当前节点是否启动帧间解码。In another example, the first information is frame-level information, that is, the first information is used to indicate whether the current frame to be decoded allows inter-frame decoding to be started. In this way, the decoding end can determine whether the current node starts inter-frame decoding based on the first frame-level information.
在另一种示例中,上述第一信息为节点级信息,即该第一信息用于指示当前节点是否允许启动帧间解码。In another example, the first information is node-level information, that is, the first information is used to indicate whether the current node is allowed to start inter-frame decoding.
本申请实施例对解码端确定第一信息的具体方式不做限制。The embodiment of the present application does not limit the specific method for the decoding end to determine the first information.
在一种可能的实现方式中,编码端可以将当前节点对应的第一信息编入码流,这样解码端可以通过解码码流,得到当前节点对应的第一信息。In a possible implementation manner, the encoding end may encode the first information corresponding to the current node into a bitstream, so that the decoding end may obtain the first information corresponding to the current node by decoding the bitstream.
在另一种可能的实现方式中,解码端可以通过相关信息,推导出当前节点是否启动帧间解码,进而确定出当前节点对应的第一信息。In another possible implementation manner, the decoding end may deduce whether the current node starts inter-frame decoding through relevant information, and further determine the first information corresponding to the current node.
解码端基于上述步骤,确定出当前节点对应的帧间解码信息后,执行如下S102的步骤。After the decoding end determines the inter-frame decoding information corresponding to the current node based on the above steps, it executes the following step S102.
S102、基于当前节点对应的帧间解码信息,对当前节点所包括的点数进行预测解码。S102: Based on the inter-frame decoding information corresponding to the current node, predictive decoding is performed on the number of points included in the current node.
由于点云的相邻帧之间的相关性,本申请实施例基于点云的相邻帧之间的相关性,在对当前节点的点数进行预测编码时,参照了帧间的相关信息。具体是基于当前节点对应的帧间编码信息,对当前节点的点数进行预测编码,增加了编码点数时可选择的上下文条件,减小条件熵,进行减少编码点数所使用的码字,从而提升点云的编解码效率和编解码性能。Due to the correlation between adjacent frames of the point cloud, the embodiment of the present application refers to the relevant information between frames when predicting the number of points of the current node based on the correlation between adjacent frames of the point cloud. Specifically, based on the inter-frame coding information corresponding to the current node, the number of points of the current node is predictively coded, the context conditions that can be selected when coding the number of points are increased, the conditional entropy is reduced, and the codewords used for coding the number of points are reduced, thereby improving the coding and decoding efficiency and coding and decoding performance of the point cloud.
在一种示例中,如图14A所示,编码端对当前节点进行直接编码的过程包括:判断当前节点是否具备直接编码资格,若确定当前节点具备直接解码资格时,则将IDCMEligible置为真。接着,判断当前节点所包括的点数是否小于预设阈值,若小于预设阈值,则确定采用直接编码方式对当前节点进行编码,即对当前节点的点数,以及当前节点中点的几何信息进行直接编码。In one example, as shown in FIG14A , the process of the encoder directly encoding the current node includes: determining whether the current node is qualified for direct encoding, and if it is determined that the current node is qualified for direct decoding, setting IDCMEligible to true. Next, determining whether the number of points included in the current node is less than a preset threshold, and if it is less than the preset threshold, determining to encode the current node in a direct encoding manner, that is, directly encoding the number of points of the current node and the geometric information of the midpoint of the current node.
对应的,解码端在对当前节点进行解码时,如图14B所示,解码端首先判断当前节点是否具备直接解码的资格,若解码端确定当前节点具备直接解码资格后,将IDCMEligible置为真。接着,解码当前节点的点数。Correspondingly, when the decoding end decodes the current node, as shown in FIG14B , the decoding end first determines whether the current node is qualified for direct decoding, and if the decoding end determines that the current node is qualified for direct decoding, it sets IDCMEligible to true. Next, the points of the current node are decoded.
在本申请实施例中,解码端在对当前节点的点数进行解码时,基于当前节点对应的帧间解码信息,对当前节点的点数进行预测解码,以提高对点数的解码性能。In an embodiment of the present application, when decoding the number of points of the current node, the decoding end predictively decodes the number of points of the current node based on the inter-frame decoding information corresponding to the current node to improve the decoding performance of the number of points.
需要说明的是,本申请实施例中,基于当前节点对应的帧间解码信息,对当前节点的点数进行预测解码,可以理解为将当前节点对应的帧间解码信息作为上下文,对当前节点的点数进行预测解码。例如,解码端基于当前节点对应的帧间解码信息,确定上下文模型的索引,进而基于该上下文模型的索引,从预设的多个上下文模型中,确定出目标上下文模型,并使用该上下文模型,对当前节点的点数进行解码,得到当前节点所包括的点数。It should be noted that, in the embodiment of the present application, the number of points of the current node is predicted and decoded based on the inter-frame decoding information corresponding to the current node, which can be understood as using the inter-frame decoding information corresponding to the current node as context to predict and decode the number of points of the current node. For example, the decoding end determines the index of the context model based on the inter-frame decoding information corresponding to the current node, and then determines the target context model from multiple preset context models based on the index of the context model, and uses the context model to decode the number of points of the current node to obtain the number of points included in the current node.
在一些实施例中,当前节点对应的帧间解码信息包括当前节点的N个预测节点的几何解码信息和当前节点对应的第一信息中的至少一个,此时,上述S102包括如下S102-1的步骤:In some embodiments, the inter-frame decoding information corresponding to the current node includes geometric decoding information of N prediction nodes of the current node and at least one of the first information corresponding to the current node. In this case, the above S102 includes the following step S102-1:
S102-1、基于N个预测节点的几何解码信息和/或第一信息,对当前节点的点数进行预测解码。S102-1. Based on the geometric decoding information of N predicted nodes and/or the first information, predict and decode the number of points of the current node.
例如,基于N个预测节点的几何解码信息和/或第一信息,确定上下文模型的索引,进而基于该上下文模型的索引。接着,从预设的多个上下文模型中,确定出对应的上下文模型,并使用该上下文模型,对当前节点的点数进行解码,得到当前节点所包括的点数。即本申请实施例通过增加上下文条件(或上下文模型),以提升对点数进行解码时的性能,进而提升点云的解码效率。For example, based on the geometric decoding information and/or the first information of the N predicted nodes, the index of the context model is determined, and then based on the index of the context model. Next, the corresponding context model is determined from the preset multiple context models, and the context model is used to decode the number of points of the current node to obtain the number of points included in the current node. That is, the embodiment of the present application improves the performance of decoding the number of points by adding context conditions (or context models), thereby improving the decoding efficiency of the point cloud.
本申请实施例中预测节点的几何解码信息可以理解为预测节点的几何解码过程中所涉及的任意信息。例如包括预测节点所包括的点数、预测节点的占位信息、预测节点的解码方式等等。In the embodiment of the present application, the geometric decoding information of the prediction node can be understood as any information involved in the geometric decoding process of the prediction node, such as the number of points included in the prediction node, the placeholder information of the prediction node, the decoding method of the prediction node, etc.
在一些实施例中,预测节点的几何解码信息包括预测节点的直接解码信息,其中,预测节点的直接解码信息可以理解为预测节点的直接解码相关的信息。例如,预测节点的直接解码信息包括预测节点的点数,预测节点所包括的各点的几何信息、以及该预测节点是否采用直接解码方式进行解码的信息等。In some embodiments, the geometric decoding information of the prediction node includes direct decoding information of the prediction node, wherein the direct decoding information of the prediction node can be understood as information related to direct decoding of the prediction node. For example, the direct decoding information of the prediction node includes the number of points of the prediction node, geometric information of each point included in the prediction node, and information on whether the prediction node is decoded using direct decoding.
基于此,上述S102-1包括如下S102-11的步骤:Based on this, the above S102-1 includes the following step S102-11:
S102-11、基于N个预测节点的直接解码信息和/或第一信息,对当前节点的点数进行预测解码。S102-11. Based on the direct decoding information of the N prediction nodes and/or the first information, predict and decode the number of points of the current node.
在该实施例中,解码端将当前节点的N个预测节点的直接解码信息和/或第一信息作为点数的上下文,对当前节点的点数进行预测解码,进而增加了点数对应的上下文模型,保证了对点数的准确解码,进而提升点云的解码效率。In this embodiment, the decoding end uses the direct decoding information and/or the first information of the N predicted nodes of the current node as the context of the point number, and predicts and decodes the point number of the current node, thereby adding a context model corresponding to the point number, ensuring accurate decoding of the point number, and thereby improving the decoding efficiency of the point cloud.
在一种示例中,若当前点云采用GPCC方式进行编码时,当前节点所包括的点数小于或等于2时,采用直接编码方式对当前节点进行编码。In one example, if the current point cloud is encoded using the GPCC method, when the number of points included in the current node is less than or equal to 2, the current node is encoded using a direct encoding method.
情况1,当前节点含有的点数numPonts小于等于2,则编码过程如下:Case 1: The number of points numPonts in the current node is less than or equal to 2, then the encoding process is as follows:
1)、首先编码当前节点的numPonts是否大于1;1) First, encode whether the numPonts of the current node is greater than 1;
2)、如果当前节点只有一个点,则需要编码当前节点的第二个点不是重复点。2) If the current node has only one point, it is necessary to encode that the second point of the current node is not a duplicate point.
情况2,当前节点含有的点数numPonts大于2,则编码过程如下:Case 2: The number of points numPonts in the current node is greater than 2, then the encoding process is as follows:
1)、首先编码当前节点的numPonts是否大于1;1) First, encode whether the numPonts of the current node is greater than 1;
2)、其次编码当前节点的第二个点是一个重复点;2) Secondly, the second point of the current node is encoded as a repeated point;
3)、如果第二个点为重复点,则编码当前节点的第三个点是否为重复点;3) If the second point is a duplicate point, encode whether the third point of the current node is a duplicate point;
4)、如果第三个点为重复点,利用指数哥伦布编码方式编码剩余的重复点数目。4) If the third point is a repeated point, use the Exponential Golomb coding method to encode the remaining number of repeated points.
具体的,在编解码时,使用第一标志表示当前节点所包括的点数是否大于1,使用第二标志表示当前节点的第二个点是否为重复点,使用第三标志表示当前节点的第三个点是否为重复点。Specifically, during encoding and decoding, the first flag is used to indicate whether the number of points included in the current node is greater than 1, the second flag is used to indicate whether the second point of the current node is a repeated point, and the third flag is used to indicate whether the third point of the current node is a repeated point.
需要说明的是,编解码对上述第一标志、第二标志和第三标志进行编解码时所使用的上下文信息可以相同,也可以不相同,或者可以部分相同,部分不同,本申请实施例对此不做限制。示例性的,第一标志、第二标志和第三标志中的至少一个标志使用帧间信息(例如预测节点的直接解码信息)作为上下文信息进行预测编解码。It should be noted that the context information used by the codec for encoding and decoding the above-mentioned first flag, second flag, and third flag may be the same, or different, or may be partially the same and partially different, and the embodiment of the present application does not limit this. Exemplarily, at least one of the first flag, the second flag, and the third flag uses inter-frame information (such as direct decoding information of the prediction node) as context information for predictive encoding and decoding.
在一些实施例中,上述预测节点的直接解码信息包括第一直接解码信息,此时,上述S102-11中基于N个预测节点的直接解码信息,对当前节点的点数进行预测解码包括如下S102-11-A的步骤:In some embodiments, the direct decoding information of the prediction node includes first direct decoding information. In this case, the predictive decoding of the number of points of the current node based on the direct decoding information of the N prediction nodes in S102-11 includes the following step S102-11-A:
S102-11-A、将N个预测节点的第一直接解码信息作为上下文信息,对当前节点对应的第一标志进行预测解码,第一标志用于表示当前节点所包括的点数是否大于1。S102-11-A. Use the first direct decoding information of the N prediction nodes as context information, and perform predictive decoding on the first flag corresponding to the current node, where the first flag is used to indicate whether the number of points included in the current node is greater than 1.
本申请实施例对第一直接解码信息的具体内容不做限制。在本申请实施例中,解码端将N个预测节点的第一直接解码信息作为上下文信息,对当前节点对应的第一标志进行预测解码,提高对第一标志的解码效率,进而提升点云的解码性能。The specific content of the first direct decoding information is not limited in the embodiment of the present application. In the embodiment of the present application, the decoding end uses the first direct decoding information of the N predicted nodes as context information, performs predictive decoding on the first flag corresponding to the current node, improves the decoding efficiency of the first flag, and thus improves the decoding performance of the point cloud.
在一些实施例中,上述S102-11-A包括如下步骤:In some embodiments, the above S102-11-A includes the following steps:
S102-11-A1、基于N个预测节点的第一直接解码信息,确定第一上下文模型的索引;S102-11-A1. Determine an index of a first context model based on first direct decoding information of N prediction nodes;
S102-11-A2、基于第一上下文模型的索引,从第一标志对应的Q个上下文模型中,选出第一上下文模型,Q为大于1的正整数;S102-11-A2. Based on the index of the first context model, select the first context model from Q context models corresponding to the first flag, where Q is a positive integer greater than 1;
S102-11-A3、使用第一上下文模型,对当前节点对应的第一标志进行预测解码。S102-11-A3. Use the first context model to predict and decode the first flag corresponding to the current node.
在本申请实施例中,第一标志对应多个上下文模型,例如Q个上下文模型,本申请实施例对第一标志所对应的上下文模型的具体个数不做限制,只要保证Q大于1即可。也就是说,在本申请实施例中,至少从2个上下文模型中选择一个最优的上下文模型,对当前节点的第一标志进行解码,以提高对第一标志的解码效率。In the embodiment of the present application, the first flag corresponds to multiple context models, for example, Q context models. The embodiment of the present application does not limit the specific number of context models corresponding to the first flag, as long as Q is greater than 1. That is to say, in the embodiment of the present application, at least one optimal context model is selected from two context models to decode the first flag of the current node, so as to improve the decoding efficiency of the first flag.
示例性的,第一标志对应多个上下文模型如表2所示:Exemplarily, the first flag corresponds to multiple context models as shown in Table 2:
表2Table 2
索引 index 上下文模型Context Model
00 上下文模型A Context Model A
11 上下文模型BContext Model B
…… ……
这样,解码端基于N个预测节点的第一直接解码信息,确定第一上下文模型的索引。接着,基于该第一上下文模型的索引,从表2对应的Q个上下文模型中,选出第一上下文模型。In this way, the decoding end determines the index of the first context model based on the first direct decoding information of the N prediction nodes. Then, based on the index of the first context model, the first context model is selected from the Q context models corresponding to Table 2.
下面对上述S102-11-A1中基于N个预测节点的第一直接解码信息,确定第一上下文模型的索引的具体过程进行介绍。The specific process of determining the index of the first context model based on the first direct decoding information of N prediction nodes in the above S102-11-A1 is introduced below.
方式一,上述S102-11-A1包括如下S102-11-A1-a1和S102-11-A1-a2的步骤:Method 1: the above S102-11-A1 includes the following steps S102-11-A1-a1 and S102-11-A1-a2:
S102-11-A1-a1、针对N个预测节点中的任一预测节点,基于预测节点的第一直接解码信息,确定预测节点对应的第一数值。S102-11-A1-a1. For any prediction node among the N prediction nodes, determine a first numerical value corresponding to the prediction node based on first direct decoding information of the prediction node.
在该方式中,针对N个预测节点中的每一个预测节点,基于该预测节点的第一直接解码信息,确定该预测节点对应的第一数值,最后基于N个预测节点对应的第一数值,确定第一上下文模型的索引。In this method, for each of the N prediction nodes, the first numerical value corresponding to the prediction node is determined based on the first direct decoding information of the prediction node, and finally the index of the first context model is determined based on the first numerical values corresponding to the N prediction nodes.
下面对确定预测节点对应的第一数值的过程进行介绍。The following describes the process of determining the first value corresponding to the prediction node.
在一些实施例中,预测节点的第一直接解码信息包括第二信息和预测节点所包括的点数中的至少一个,其中,第二信息用于指示预测节点是否满足直接解码方式进行解码的条件。In some embodiments, the first direct decoding information of the prediction node includes at least one of the second information and the number of points included in the prediction node, wherein the second information is used to indicate whether the prediction node meets the conditions for decoding in the direct decoding manner.
本申请实施例对第二信息的具体内容不做限制,可以为任意指示出预测节点是否满足直接解码条件的信息即可。The embodiment of the present application does not limit the specific content of the second information, which may be any information indicating whether the prediction node satisfies the direct decoding condition.
在一种示例中,上述第二信息包括预测节点的直接解码模式信息。例如,预测节点的直接解码模式为直接解码模式1或直接解码模式2时,则确定该预测节点满足直接解码方式进行解码的条件。若预测节点的直接解码模式为直接解码模式0时,则确定该预测节点不满足直接解码方式进行解码的条件。In one example, the second information includes direct decoding mode information of the prediction node. For example, when the direct decoding mode of the prediction node is direct decoding mode 1 or direct decoding mode 2, it is determined that the prediction node meets the conditions for decoding by direct decoding. If the direct decoding mode of the prediction node is direct decoding mode 0, it is determined that the prediction node does not meet the conditions for decoding by direct decoding.
在另一种示例中,上述第二信息包括预测节点对应的第一标识,例如第一标识为IDCM_flag,若该第一标识的取值为第一值(例如1)时,则确定该预测节点满足直接解码方式进行解码的条件。若该第一标识的取值为第二值(例如0)时,则确定该预测节点不满足直接解码方式进行解码的条件。In another example, the second information includes a first identifier corresponding to the prediction node, for example, the first identifier is IDCM_flag, and if the value of the first identifier is a first value (for example, 1), it is determined that the prediction node meets the conditions for decoding by direct decoding. If the value of the first identifier is a second value (for example, 0), it is determined that the prediction node does not meet the conditions for decoding by direct decoding.
此时,解码端可以基于预测节点的第二信息和/或该预测节点所包括的点数,确定该预测节点对应的第一数值。At this time, the decoding end may determine the first value corresponding to the prediction node based on the second information of the prediction node and/or the number of points included in the prediction node.
示例性的,解码端通过如下步骤,确定出预测节点对应的第一数值:Exemplarily, the decoding end determines the first value corresponding to the prediction node through the following steps:
步骤11、基于第二信息,确定第一子值;Step 11: Determine a first sub-value based on the second information;
步骤12、基于预测节点所包括的点数,确定第二子值;Step 12: Determine a second sub-value based on the number of points included in the prediction node;
步骤13、基于第一子值和/或第二子值,确定预测节点对应的第一数值。Step 13: Determine a first numerical value corresponding to the prediction node based on the first sub-value and/or the second sub-value.
本申请实施例对步骤11中解码端基于预测节点的第二信息,确定第一子值的具体方式不做限制。The embodiment of the present application does not limit the specific manner in which the decoding end determines the first sub-value based on the second information of the prediction node in step 11.
由于上述可知,第二信息用于指示该预测节点是否满足直接解码方式进行解码的条件,进而可以确定出第一子值为二值化数值。例如,若第二信息指示预测节点满足直接解码方式进行解码的条件时,则确定第一子值的取值为第一值;若第二信息指示预测节点不满足直接解码方式进行解码的条件时,则确定第一子值的取值为第二值。As can be known from the above, the second information is used to indicate whether the prediction node meets the conditions for decoding by direct decoding, and then the first sub-value can be determined to be a binary value. For example, if the second information indicates that the prediction node meets the conditions for decoding by direct decoding, the value of the first sub-value is determined to be the first value; if the second information indicates that the prediction node does not meet the conditions for decoding by direct decoding, the value of the first sub-value is determined to be the second value.
本申请实施例对第三值和第四值的具体取值不做限制。The embodiment of the present application does not limit the specific values of the third value and the fourth value.
可选的,第一值为1。Optionally, the first value is 1.
可选的,第二值为0。Optionally, the second value is 0.
本申请实施例对步骤12中解码端基于预测节点所包括的点数,确定第二子值的方式也不做限制。The embodiment of the present application does not limit the manner in which the decoding end determines the second sub-value based on the number of points included in the prediction node in step 12.
在一种示例中,第二子值不是二值化数值,这样解码端可以根据预测节点所包括的点数的具体个数,确定第二子值,例如若该预测节点包括1个点时,则确定第二子值为数值1,若预测节点包括2个点时,则确定第二子值为数值2,若预测节点包括3个或3个以上的点时,则确定第二子值数值3等。In one example, the second sub-value is not a binary value, so that the decoding end can determine the second sub-value according to the specific number of points included in the prediction node. For example, if the prediction node includes 1 point, the second sub-value is determined to be 1; if the prediction node includes 2 points, the second sub-value is determined to be 2; if the prediction node includes 3 or more points, the second sub-value is determined to be 3, and so on.
在另一种示例中,上述第二子值为二值化数值,此时,解码端预测节点所包括的点数,确定第二子值的方式可以是:若预测节点所包括的点数大于第一预设值,则确定第二子值的取值为第一值;若预测节点所包括的点数小于或等于第一预设值,则确定第二子值的取值为第二值。In another example, the above-mentioned second sub-value is a binary value. At this time, the number of points included in the prediction node at the decoding end can be determined in a manner as follows: if the number of points included in the prediction node is greater than the first preset value, the value of the second sub-value is determined to be the first value; if the number of points included in the prediction node is less than or equal to the first preset value, the value of the second sub-value is determined to be the second value.
本申请实施例对上述第一预设值、第一值和第二值的具体取值不做限制。The embodiment of the present application does not limit the specific values of the first preset value, the first value and the second value.
可选的,第一预设值为2,Optionally, the first preset value is 2,
可选的,第一值为0。Optionally, the first value is 0.
可选的,第二值为1。Optionally, the second value is 1.
也就是说,若预测节点所包括的点数小于或等于2时,则确定第二子值为1,若预测节点所包括的点数大于2时,则确定第二子值为0。That is, if the number of points included in the prediction node is less than or equal to 2, the second sub-value is determined to be 1, and if the number of points included in the prediction node is greater than 2, the second sub-value is determined to be 0.
接着,解码端基于上述确定的第一子值和/或第二子值,确定该预测节点对应的第一数值。Next, the decoding end determines the first numerical value corresponding to the prediction node based on the first sub-value and/or the second sub-value determined above.
由上述可知,在本申请实施例中,确定预测节点对应的第一数值具体包括如下几种情况:As can be seen from the above, in the embodiment of the present application, determining the first value corresponding to the prediction node specifically includes the following situations:
情况1,若预测节点的第一直接解码信息包括第二信息,且不包括预测节点所包括的点数时,则解码端基于该预测节点的第二信息,确定第一子值。例如若第二信息指示预测节点满足直接解码方式进行解码的条件时,则确定第一子值的取值为第三值(例如1);若第二信息指示预测节点不满足直接解码方式进行解码的条件时,则确定第一子值的取值为第四值(例如0)。接着,将该第一子值,确定为该预测节点对应的第一数值。Case 1: If the first direct decoding information of the prediction node includes the second information, but does not include the number of points included in the prediction node, the decoding end determines the first sub-value based on the second information of the prediction node. For example, if the second information indicates that the prediction node meets the conditions for decoding by direct decoding, the value of the first sub-value is determined to be a third value (e.g., 1); if the second information indicates that the prediction node does not meet the conditions for decoding by direct decoding, the value of the first sub-value is determined to be a fourth value (e.g., 0). Then, the first sub-value is determined to be the first numerical value corresponding to the prediction node.
情况2,若预测节点的第一直接解码信息包括预测节点所包括的点数,且不包括第二信息时,则解码端基于预测节点所包括的点数,确定第二子值。例如,若预测节点所包括的点数大于第一预设值(例如2),则确定第二子值的取值为第一值(例如0);若预测节点所包括的点数小于或等于第一预设值(例如2),则确定第二子值的取值为第二值(例如1)。接着,将该第二子值,确定为该预测节点对应的第一数值。Case 2: If the first direct decoding information of the prediction node includes the number of points included in the prediction node, and does not include the second information, the decoding end determines the second sub-value based on the number of points included in the prediction node. For example, if the number of points included in the prediction node is greater than the first preset value (e.g., 2), the value of the second sub-value is determined to be the first value (e.g., 0); if the number of points included in the prediction node is less than or equal to the first preset value (e.g., 2), the value of the second sub-value is determined to be the second value (e.g., 1). Then, the second sub-value is determined to be the first value corresponding to the prediction node.
情况3,若预测节点的直接解码信息包括第二信息,且包括预测节点所包括的点数时,则解码端基于该预测节点的第二信息,确定第一子值。例如若第二信息指示预测节点满足直接解码方式进行解码的条件时,则确定第一子值的取值为第一值(例如1);若第二信息指示预测节点不满足直接解码方式进行解码的条件时,则确定第一子值的取值为第二值(例如0)。接着,解码端基于预测节点所包括的点数,确定第二子值。例如,若预测节点所包括的点数大于第一预设值(例如2),则确定第二子值的取值为第一值(例如0);若预测节点所包括的点数小于或等于第一预设值(例如2),则确定第二子值的取值为第二值(例如1)。最后,基于第一子值和第二子值,确定该预测节点对应的第一数值,例如,将第一子值和第二子值之和,确定为预测节点对应的第一数值。Case 3, if the direct decoding information of the prediction node includes the second information and includes the number of points included in the prediction node, the decoding end determines the first sub-value based on the second information of the prediction node. For example, if the second information indicates that the prediction node meets the conditions for decoding by direct decoding, the value of the first sub-value is determined to be the first value (for example, 1); if the second information indicates that the prediction node does not meet the conditions for decoding by direct decoding, the value of the first sub-value is determined to be the second value (for example, 0). Next, the decoding end determines the second sub-value based on the number of points included in the prediction node. For example, if the number of points included in the prediction node is greater than the first preset value (for example, 2), the value of the second sub-value is determined to be the first value (for example, 0); if the number of points included in the prediction node is less than or equal to the first preset value (for example, 2), the value of the second sub-value is determined to be the second value (for example, 1). Finally, based on the first sub-value and the second sub-value, the first numerical value corresponding to the prediction node is determined, for example, the sum of the first sub-value and the second sub-value is determined as the first numerical value corresponding to the prediction node.
基于上述步骤,可以确定出当前节点的N个预测节点中,每一个预测节点对应的第一数值。Based on the above steps, the first value corresponding to each prediction node in the N prediction nodes of the current node may be determined.
S102-11-A1-a2、基于N个预测节点对应的第一数值,确定第一上下文模型的索引。S102-11-A1-a2. Determine the index of the first context model based on the first numerical values corresponding to the N prediction nodes.
解码端基于上述步骤,确定出N个预测节点对应的第一数值后,基于N个预测节点对应的第一数值,确定第一上下文模型的索引。After the decoding end determines the first values corresponding to the N prediction nodes based on the above steps, it determines the index of the first context model based on the first values corresponding to the N prediction nodes.
其中,基于N个预测节点对应的第一数值,确定第一上下文模型的索引至少包括如下实现方式:Wherein, determining the index of the first context model based on the first values corresponding to the N prediction nodes includes at least the following implementation methods:
方式1,将N个预测节点对应的第一数值之和的平均值,确定为第一上下文模型的索引。Method 1: Determine the average value of the sum of the first numerical values corresponding to the N prediction nodes as the index of the first context model.
方式2,S102-11-A1-a2包括如下S102-11-A1-a2-1至S102-11-A1-a2-3的步骤: Mode 2, S102-11-A1-a2 includes the following steps S102-11-A1-a2-1 to S102-11-A1-a2-3:
S102-11-A1-a2-1、确定N个预测节点分别对应的第一权重;S102-11-A1-a2-1. Determine first weights corresponding to the N prediction nodes respectively;
S102-11-A1-a2-2、基于第一权重,对N个预测节点对应的第一数值进行加权处理,得到第一加权预测值;S102-11-A1-a2-2. Based on the first weight, weighted processing is performed on the first values corresponding to the N prediction nodes to obtain a first weighted prediction value;
S102-11-A1-a2-3、基于第一加权预测值,确定第一上下文模型的索引。S102-11-A1-a2-3. Based on the first weighted prediction value, determine the index of the first context model.
在该方式2中,若当前节点包括多个预测节点,即N个预测节点时,在基于N个预测节点对应的第一数值确定第一上下文模型的索引时,可以为N个预测节点中的每一个预测节点确定一个权重,即第一权重,这样可以基于各预测节点的第一权重,对各预测节点对应的第一数值进行加权处理,进而根据最终的加权结果,确定第一上下文模型的索引,从而提高了基于N个预测节点的几何解码信息,确定第一上下文模型的索引的准确性。In method 2, if the current node includes multiple prediction nodes, i.e., N prediction nodes, when determining the index of the first context model based on the first numerical values corresponding to the N prediction nodes, a weight, i.e., the first weight, can be determined for each of the N prediction nodes. In this way, the first numerical values corresponding to each prediction node can be weighted based on the first weight of each prediction node, and then the index of the first context model can be determined based on the final weighted result, thereby improving the accuracy of determining the index of the first context model based on the geometric decoding information of the N prediction nodes.
本申请实施例对确定N个预测节点分别对应的第一权重不做限制。The embodiment of the present application does not limit the determination of the first weights corresponding to the N prediction nodes.
在一些实施例中,上述N个预测节点中各预测节点对应的第一权重为预设值。由上述可知,上述N个预测节点是基于当前节点的M个领域节点确定的,假设预测节点1为领域节点1对应的预测节点,若领域节点1为当前节点的共面节点时,则预测节点1的第一权重为预设权重1,若领域节点1为当前节点的共线节点时,则预测节点1的第一权重为预设权重2,若领域节点1为当前节点的共点节点时,则预测节点1的第一权重为预设权重3。In some embodiments, the first weight corresponding to each prediction node in the above-mentioned N prediction nodes is a preset value. As can be seen from the above, the above-mentioned N prediction nodes are determined based on the M domain nodes of the current node. Assuming that prediction node 1 is the prediction node corresponding to domain node 1, if domain node 1 is a coplanar node of the current node, the first weight of prediction node 1 is the preset weight 1, if domain node 1 is a colinear node of the current node, the first weight of prediction node 1 is the preset weight 2, and if domain node 1 is a co-point node of the current node, the first weight of prediction node 1 is the preset weight 3.
在一些实施例中,对于N个预测节点中的每一个预测节点,基于该预测节点对应的领域节点与当前节点之间的距离,确定该预测节点对应的第一权重。例如,领域节点与当前节点之间的距离越小,则该领域节点对应的预测节点与当前节点的帧间相关性越强,进而该预测节点的第一权重越大。In some embodiments, for each of the N prediction nodes, a first weight corresponding to the prediction node is determined based on the distance between the domain node corresponding to the prediction node and the current node. For example, the smaller the distance between the domain node and the current node, the stronger the inter-frame correlation between the prediction node corresponding to the domain node and the current node, and thus the greater the first weight of the prediction node.
举例说明,以N个预测节点中的预测节点1为例,假设预测节点1为当前节点的M个领域节点中领域节点1在预测参考帧中的对应点,这样可以基于领域节点1与当前节点之间的距离,确定预测节点1的第一权重。例如,将领域节点1与当前节点之间的距离的倒数,确定为预测节点1的第一权重。For example, taking prediction node 1 among N prediction nodes as an example, assuming that prediction node 1 is the corresponding point of domain node 1 among M domain nodes of the current node in the prediction reference frame, the first weight of prediction node 1 can be determined based on the distance between domain node 1 and the current node. For example, the inverse of the distance between domain node 1 and the current node is determined as the first weight of prediction node 1.
在一种示例中,若领域节点1为当前节点的共面节点时,则预测节点1的第一权重为1,若领域节点1为当前节点的共线节点时,则预测节点1的第一权重为预设权重
Figure PCTCN2023071069-appb-000044
若领域节点1为当前节点的共点节点时,则预测节点1的第一权重为预设权重
Figure PCTCN2023071069-appb-000045
In one example, if domain node 1 is a coplanar node of the current node, the first weight of the predicted node 1 is 1; if domain node 1 is a colinear node of the current node, the first weight of the predicted node 1 is a preset weight.
Figure PCTCN2023071069-appb-000044
If domain node 1 is a common node of the current node, the first weight of predicted node 1 is the preset weight
Figure PCTCN2023071069-appb-000045
在一种示例中,若领域节点1为当前节点的共面节点时,则预测节点1的第一权重为
Figure PCTCN2023071069-appb-000046
若领域节点1为当前节点的共线节点时,则预测节点1的第一权重为预设权重
Figure PCTCN2023071069-appb-000047
若领域节点1为当前节点的共点节点时,则预测节点1的第一权重为预设权重
Figure PCTCN2023071069-appb-000048
In one example, if domain node 1 is a coplanar node of the current node, the first weight of predicted node 1 is
Figure PCTCN2023071069-appb-000046
If domain node 1 is a collinear node of the current node, the first weight of predicted node 1 is the preset weight
Figure PCTCN2023071069-appb-000047
If domain node 1 is a common node of the current node, the first weight of predicted node 1 is the preset weight
Figure PCTCN2023071069-appb-000048
在一些实施例中,基于上述步骤,确定出N个预测节点中各预测节点对应的权重后,对该权重进行归一化处理,将归一化处理后的权重作出预测节点的最终第一权重。In some embodiments, based on the above steps, after the weight corresponding to each prediction node in the N prediction nodes is determined, the weight is normalized, and the normalized weight is used as the final first weight of the prediction node.
本申请实施例对基于第一权重,对N个预测节点对应的第一数值进行加权处理,得到第一加权预测值的具体方式不做限制。The embodiment of the present application does not limit the specific method of obtaining the first weighted prediction value by weighting the first numerical values corresponding to N prediction nodes based on the first weight.
在一种示例中,基于第一权重,对N个预测节点对应的第一数值进行加权平均,得到第一加权预测值。In one example, based on the first weight, a weighted average is performed on the first numerical values corresponding to the N prediction nodes to obtain a first weighted prediction value.
在另一种示例中,基于第一权重,对N个预测节点对应的第一数值进行加权求和,得到第一加权预测值。In another example, based on the first weight, a weighted sum is performed on the first numerical values corresponding to the N prediction nodes to obtain a first weighted prediction value.
基于方式步骤,确定出第一加权预测值后,基于该第一加权预测值,确定第一上下文模型的索引,即上述S102-11-A1-a2-3至少包括如下几种示例:After determining the first weighted prediction value based on the method steps, the index of the first context model is determined based on the first weighted prediction value, that is, the above S102-11-A1-a2-3 includes at least the following examples:
示例1,将第一加权预测值,确定为第一上下文模型的索引。Example 1: Determine the first weighted prediction value as the index of the first context model.
示例2,确定第一加权预测值所在加权预测值范围,将该范围对应的索引,确定为第一上下文模型的索引,也就是说一个加权预测值范围对应一个上下文模型。Example 2, determines the weighted prediction value range where the first weighted prediction value is located, and determines the index corresponding to the range as the index of the first context model, that is, one weighted prediction value range corresponds to one context model.
在一些实施例中,解码端还可以采用如下方式二,确定第一上下文模型的索引。In some embodiments, the decoding end may also adopt the following method 2 to determine the index of the first context model.
方式二、若K大于1时,确定K个预测参考帧中每一个预测参考帧对应的第二加权预测值,进而基于K个预测参考帧分别对应的第二加权预测值,确定第一上下文模型的索引。此时,上述S102-11-A1包括如下S102-11-A1-b1至S102-11-A1-a4的步骤:Method 2: If K is greater than 1, determine the second weighted prediction value corresponding to each of the K prediction reference frames, and then determine the index of the first context model based on the second weighted prediction values corresponding to the K prediction reference frames. At this time, the above S102-11-A1 includes the following steps S102-11-A1-b1 to S102-11-A1-a4:
S102-11-A1-b1、针对K个预测参考帧中的第j个预测参考帧,基于当前节点在第j个预测参考帧中预测节点的第一直接解码信息,确定第j个预测参考帧中的预测节点对应的第一数值,j为小于或等于K的正整数;S102-11-A1-b1, for the j-th prediction reference frame among the K prediction reference frames, determine a first value corresponding to the prediction node in the j-th prediction reference frame based on the first direct decoding information of the prediction node of the current node in the j-th prediction reference frame, where j is a positive integer less than or equal to K;
S102-11-A1-b2、确定预测节点对应的第一权重,并基于第一权重对第j个预测参考帧中的预测节点对应的第一数值进行加权处理,得到第j个预测参考帧对应的第二加权预测值;S102-11-A1-b2, determining a first weight corresponding to the prediction node, and performing weighted processing on the first value corresponding to the prediction node in the j-th prediction reference frame based on the first weight, to obtain a second weighted prediction value corresponding to the j-th prediction reference frame;
S102-11-A1-b3、基于K个预测参考帧对应的第二加权预测值,确定第一上下文模型的索引。S102-11-A1-b3. Determine the index of the first context model based on the second weighted prediction values corresponding to the K prediction reference frames.
在该方式二中,在确定第一上下文模型的索引时,对这K个预测参考帧中的每一个预测参考帧作为单独的上下文信息分别进行考虑。具体是,确定K个预测参考帧中每一个预测参考帧所包括的预测节点的第一直接解码信息,确定每一个预测参考帧对应的第二加权预测值,进而基于每一个预测参考帧对应的第二加权预测值,确定第一上下文模型的索引,实现第一上下文模型的准确选择,进而提升点云的解码效率。In the second method, when determining the index of the first context model, each of the K prediction reference frames is considered as separate context information. Specifically, the first direct decoding information of the prediction node included in each of the K prediction reference frames is determined, and the second weighted prediction value corresponding to each prediction reference frame is determined. Then, based on the second weighted prediction value corresponding to each prediction reference frame, the index of the first context model is determined to achieve accurate selection of the first context model, thereby improving the decoding efficiency of the point cloud.
本申请实施例中,解码端确定K个预测参考帧中每一个预测参考帧对应的第二加权预测值的具体方式相同,为了便于描述,在此以K个预测参考帧中的第j个预测参考帧为例进行说明。In the embodiment of the present application, the specific method in which the decoding end determines the second weighted prediction value corresponding to each of the K prediction reference frames is the same. For the sake of ease of description, the jth prediction reference frame among the K prediction reference frames is taken as an example for explanation.
在本申请实施例中,当前节点在第j个预测参考帧中包括至少一个预测节点,这样基于该第j个预测参考帧中的这至少一个预测节点的第一直接解码信息,确定这至少一个预测节点的第一数值。In an embodiment of the present application, the current node includes at least one prediction node in the jth prediction reference frame, so that the first value of the at least one prediction node is determined based on the first direct decoding information of the at least one prediction node in the jth prediction reference frame.
举例说明,第j个预测参考帧中包括当前节点的2个预测节点,分别记为预测节点1和预测节点2,进而基于预测节点1的第一直接解码信息,确定预测节点1的第一数值,基于预测节点2的第一直接解码信息,确定预测节点2的第一数值。For example, the j-th prediction reference frame includes two prediction nodes of the current node, which are respectively recorded as prediction node 1 and prediction node 2, and then based on the first direct decoding information of prediction node 1, the first value of prediction node 1 is determined, and based on the first direct decoding information of prediction node 2, the first value of prediction node 2 is determined.
解码端确定出第j个预测参考帧所包括的至少一个预测节点的第一数值后,确定这至少一个预测节点分别对应的第一权重,并基于第一权重对这至少一个预测节点对应的第一数值进行加权处理,得到第j个预测参考帧对应的第二加权预测值。After the decoding end determines the first value of at least one prediction node included in the j-th prediction reference frame, it determines the first weight corresponding to the at least one prediction node, and weightedly processes the first value corresponding to the at least one prediction node based on the first weight to obtain a second weighted prediction value corresponding to the j-th prediction reference frame.
在一种示例中,基于第一权重,对第j个预测参考帧中的预测节点对应的第一数值进行加权平均,得到第j个预测参考帧对应的第二加权预测值。In one example, based on the first weight, a weighted average is performed on the first values corresponding to the prediction nodes in the j-th prediction reference frame to obtain a second weighted prediction value corresponding to the j-th prediction reference frame.
在另一种示例中,基于第一权重,对第j个预测参考帧中的预测节点对应的第一数值进行加权求和,得到第j个预测参考帧对应的第二加权预测值。In another example, based on the first weight, a weighted sum is performed on the first numerical values corresponding to the prediction nodes in the j-th prediction reference frame to obtain a second weighted prediction value corresponding to the j-th prediction reference frame.
其中,第一权重的确定过程可以参照上述实施例的描述,在此不再赘述。The process of determining the first weight may refer to the description of the above embodiment and will not be repeated here.
上述对确定K个预测参考帧中第j个预测参考帧对应的第二加权预测值的过程进行介绍,K个预测参考帧中其他预测参考帧对应的第二加权预测值参照上述第j个预测参考帧对应的方式进行确定。The above introduces the process of determining the second weighted prediction value corresponding to the j-th prediction reference frame among the K prediction reference frames. The second weighted prediction values corresponding to other prediction reference frames among the K prediction reference frames are determined in accordance with the method corresponding to the j-th prediction reference frame.
解码端确定出K个预测参考帧中每一个预测参考帧对应的第二加权预测值后,执行上述S102-11-A1-b3的步骤。After the decoding end determines the second weighted prediction value corresponding to each of the K prediction reference frames, it executes the above step S102-11-A1-b3.
本申请实施对基于K个预测参考帧对应的第二加权预测值,确定第一上下文模型的索引的具体方式不做限制。The present application does not limit the specific method of determining the index of the first context model based on the second weighted prediction value corresponding to K prediction reference frames.
在一些实施例中,解码端将K个预测参考帧对应的第二加权预测值的平均值,确定为第一上下文模型的索引。In some embodiments, the decoding end determines the average value of the second weighted prediction values corresponding to the K prediction reference frames as the index of the first context model.
在一些实施例中,解码端确定K个预测参考帧对应的第二权重,并基于第二权重对K个预测参考帧对应的第二加权预测值进行加权处理,得到第一上下文模型的索引。In some embodiments, the decoding end determines second weights corresponding to K predicted reference frames, and performs weighted processing on second weighted prediction values corresponding to the K predicted reference frames based on the second weights to obtain an index of the first context model.
在该实施例中,解码端首先确定K个预测参考帧中每一个预测参考帧对应的第二权重。本申请实施例对确定K个预测参考帧中每一个预测参考帧对应的第二权重不做限制。In this embodiment, the decoding end first determines the second weight corresponding to each of the K prediction reference frames. The embodiment of the present application does not limit the determination of the second weight corresponding to each of the K prediction reference frames.
在一些实施例中,上述K个预测参考帧中每一个预测参考帧对应的第二权重为预设值。由上述可知,上述K个预测参考帧为当前待解码帧的前向帧和/或后向帧。假设预测参考帧1为当前待解码帧的前向帧时,则预测参考帧1对应的第二权重为预设权重1,若预测参考帧1为当前待解码帧的后向帧时,则预测参考帧1对应的第二权重为预设权重2。In some embodiments, the second weight corresponding to each of the K predicted reference frames is a preset value. As can be seen from the above, the K predicted reference frames are forward frames and/or backward frames of the current frame to be decoded. Assuming that predicted reference frame 1 is the forward frame of the current frame to be decoded, the second weight corresponding to predicted reference frame 1 is the preset weight 1. If predicted reference frame 1 is the backward frame of the current frame to be decoded, the second weight corresponding to predicted reference frame 1 is the preset weight 2.
在一些实施例中,基于预测参考帧与当前待解码帧的时间差距,确定预测参考帧对应的第二权重。在本申请实施例中,每一张点云包括时间信息,该时间信息可以为点云采集设备采集该帧点云时的时间。基于此,若预测参考帧与当前待解码帧的时间差距越小,则该预测参考帧与当前待解码帧的帧间相关性越强,进而该预测参考帧对应的第二权重越大。例如,可以将预测参考帧与当前待解码帧的时间差距的倒数确定为该预测参考帧对应的第二权重。In some embodiments, the second weight corresponding to the predicted reference frame is determined based on the time difference between the predicted reference frame and the current frame to be decoded. In an embodiment of the present application, each point cloud includes time information, and the time information may be the time when the point cloud acquisition device acquires the point cloud of the frame. Based on this, if the time difference between the predicted reference frame and the current frame to be decoded is smaller, the inter-frame correlation between the predicted reference frame and the current frame to be decoded is stronger, and thus the second weight corresponding to the predicted reference frame is larger. For example, the inverse of the time difference between the predicted reference frame and the current frame to be decoded can be determined as the second weight corresponding to the predicted reference frame.
确定出K个预测参考帧中每一个预测参考帧对应的第二权重后,基于第二权重对K个预测参考帧分别对应的第二加权预测值进行加权处理,得到第一上下文模型的索引。After determining the second weight corresponding to each of the K prediction reference frames, the second weighted prediction values corresponding to the K prediction reference frames are weighted based on the second weight to obtain the index of the first context model.
举例说明,假设K=2,例如当前待解码帧包括2个预测参考帧,这2个预测参考帧包括当前待解码帧的前向帧和后向帧,假设前向帧对应的第二权重为W1、后向帧对应的第二权重为W2,这样基于W1和W2,对前向帧对应的第二加权预测值和后向帧对应的第二加权预测值进行加权,得到第一上下文模型的索引。For example, assuming K=2, for example, the current frame to be decoded includes 2 prediction reference frames, and these 2 prediction reference frames include the forward frame and backward frame of the current frame to be decoded. Assuming that the second weight corresponding to the forward frame is W1, and the second weight corresponding to the backward frame is W2, based on W1 and W2, the second weighted prediction value corresponding to the forward frame and the second weighted prediction value corresponding to the backward frame are weighted to obtain the index of the first context model.
在一种示例中,基于第二权重,对K个预测参考帧分别对应的第二加权预测值进行加权平均,得到第一上下文模型的索引。In one example, based on the second weight, the second weighted prediction values corresponding to the K prediction reference frames are weighted averaged to obtain the index of the first context model.
例如,继续参照上述,假设K个预测参考帧包括前向帧和后向帧,前向帧对应的第二权重为W1、后向帧对应的第二权重为W2,此时可以基于如下公式(21)确定出当前节点对应的第一上下文模型的索引:For example, continuing to refer to the above, assuming that the K prediction reference frames include forward frames and backward frames, the second weight corresponding to the forward frame is W1, and the second weight corresponding to the backward frame is W2, then the index of the first context model corresponding to the current node can be determined based on the following formula (21):
第一上下文模型的索引=(W1X第二加权预测值1+W2X第二加权预测值2)/2(21)Index of the first context model = (W1×second weighted prediction value 1+W2×second weighted prediction value 2)/2(21)
其中,第二加权预测值1为前向帧对应的第二加权预测值,第二加权预测值2为后向帧对应的第二加权预测值。Among them, the second weighted prediction value 1 is the second weighted prediction value corresponding to the forward frame, and the second weighted prediction value 2 is the second weighted prediction value corresponding to the backward frame.
在另一种示例中,基于第二权重,对K个预测参考帧分别对应的第二加权预测值进行加权求和,得到第一上下文模型的索引。In another example, based on the second weight, the second weighted prediction values corresponding to the K prediction reference frames are weighted summed to obtain the index of the first context model.
例如,继续参照上述,假设K个预测参考帧包括前向帧和后向帧,前向帧对应的第二权重为W1、后向帧对应的 第二权重为W2,此时可以基于如下公式(22)确定出当前节点对应的第一上下文模型的索引:For example, continuing to refer to the above, assuming that the K prediction reference frames include forward frames and backward frames, the second weight corresponding to the forward frame is W1, and the second weight corresponding to the backward frame is W2. At this time, the index of the first context model corresponding to the current node can be determined based on the following formula (22):
第一上下文模型的索引=W1X第二加权预测值1+W2X第二加权预测值2(22)Index of the first context model = W1×second weighted prediction value 1+W2×second weighted prediction value 2 (22)
解码端基于上述步骤,确定出第一上下模型的索引后,从第一标志对应的Q个上下文模型中,选出第一上下文模型。接着,使用该第一上下文模型对当前节点对应的第一标志进行解码,进而基于第一标志确定当前节点的点数。After the decoder determines the index of the first up-down model based on the above steps, it selects the first context model from the Q context models corresponding to the first mark. Then, the first context model is used to decode the first mark corresponding to the current node, and the number of points of the current node is determined based on the first mark.
在一些实施例中,若当前节点对应的第一标志表示当前节点所包括的点数大于1时,则确定当前节点包括2个点。In some embodiments, if the first flag corresponding to the current node indicates that the number of points included in the current node is greater than 1, it is determined that the current node includes 2 points.
在一些实施例中,若当前节点对应的第一标志表示当前节点所包括的点数小于或等于1时,则本申请实施例的方法还包括如下步骤A:In some embodiments, if the first flag corresponding to the current node indicates that the number of points included in the current node is less than or equal to 1, the method of the embodiment of the present application further includes the following step A:
步骤A、将N个预测节点的第二直接解码信息作为上下文信息,对当前节点对应的第二标志进行预测解码,第二标志用于表示当前节点的第二个点是否为重复点。Step A: Use the second direct decoding information of the N predicted nodes as context information to perform predictive decoding on the second flag corresponding to the current node, where the second flag is used to indicate whether the second point of the current node is a repeated point.
在该实施例中,若解码端解码当前节点对应的第一标志,得到该第一标志指示当前节点所包括的点数小于或等于1时,则解码端继续对当前节点对应的第二标志进行解码,以确定当前节点的第二个点是否为重复点。In this embodiment, if the decoding end decodes the first flag corresponding to the current node and obtains that the first flag indicates that the number of points included in the current node is less than or equal to 1, the decoding end continues to decode the second flag corresponding to the current node to determine whether the second point of the current node is a duplicate point.
在本申请实施例中,使用N个预测节点的第二直接解码信息作为上下文信息,对当前节点对应的第二标志进行预测解码,实现对第二标志的高效解码。In an embodiment of the present application, the second direct decoding information of the N predicted nodes is used as context information to perform predictive decoding on the second flag corresponding to the current node, thereby achieving efficient decoding of the second flag.
在一些实施例中,上述步骤A包括如下步骤:In some embodiments, the above step A includes the following steps:
步骤A1、基于N个预测节点的第二直接解码信息,确定第二上下文模型的索引;Step A1: determining an index of a second context model based on second direct decoding information of N prediction nodes;
步骤A2、基于第二上下文模型的索引,从第二标志对应的P个上下文模型中,选出第二上下文模型,P为大于1的正整数;Step A2: based on the index of the second context model, select the second context model from P context models corresponding to the second flag, where P is a positive integer greater than 1;
步骤A3、使用第二上下文模型,对当前节点对应的第二标志进行预测解码。Step A3: Use the second context model to predict and decode the second flag corresponding to the current node.
在本申请实施例中,第二标志对应多个上下文模型,例如P个上下文模型,本申请实施例对第二标志所对应的上下文模型的具体个数不做限制,只要保证P大于1即可。也就是说,在本申请实施例中,至少从2个上下文模型中选择一个最优的上下文模型,对当前节点的第二标志进行解码,以提高对第二标志的解码效率。In the embodiment of the present application, the second flag corresponds to multiple context models, for example, P context models. The embodiment of the present application does not limit the specific number of context models corresponding to the second flag, as long as P is greater than 1. That is to say, in the embodiment of the present application, at least one optimal context model is selected from two context models to decode the second flag of the current node, so as to improve the decoding efficiency of the second flag.
下面对上述步骤A1中基于N个预测节点的第二直接解码信息,确定第二上下文模型的索引的具体过程进行介绍。The specific process of determining the index of the second context model based on the second direct decoding information of the N prediction nodes in the above step A1 is introduced below.
方式一,上述步骤A1包括如下步骤A1-11和步骤A1-12的步骤:Method 1: the above step A1 includes the following steps A1-11 and A1-12:
步骤A1-11、针对N个预测节点中的任一预测节点,基于预测节点的第二直接解码信息,确定预测节点对应的第二数值。Step A1-11: for any prediction node among the N prediction nodes, determine a second value corresponding to the prediction node based on the second direct decoding information of the prediction node.
在该方式中,针对N个预测节点中的每一个预测节点,基于该预测节点的第二直接解码信息,确定该预测节点对应的第二数值,最后基于N个预测节点对应的第二数值,确定第二上下文模型的索引。In this method, for each of the N prediction nodes, the second numerical value corresponding to the prediction node is determined based on the second direct decoding information of the prediction node, and finally the index of the second context model is determined based on the second numerical values corresponding to the N prediction nodes.
下面对确定预测节点对应的第二数值的过程进行介绍。The following describes the process of determining the second value corresponding to the prediction node.
在一些实施例中,预测节点的第二直接解码信息包括第二信息和第三信息,其中第二信息表示预测节点是否满足直接解码方式进行解码的条件,第三信息表示预测节点的第二个点是否为重复点。In some embodiments, the second direct decoding information of the prediction node includes second information and third information, wherein the second information indicates whether the prediction node meets the conditions for decoding in a direct decoding manner, and the third information indicates whether the second point of the prediction node is a repeated point.
此时,解码端可以基于预测节点的第二信息和/或第三信息,确定该预测节点对应的第二数值。At this time, the decoding end may determine the second value corresponding to the prediction node based on the second information and/or the third information of the prediction node.
示例性的,解码端通过如下步骤,确定出预测节点对应的第二数值:Exemplarily, the decoding end determines the second value corresponding to the prediction node through the following steps:
步骤21、基于第二信息,确定第一子值;Step 21: Determine a first sub-value based on the second information;
步骤22、基于第三信息,确定第三子值;Step 22: Determine a third sub-value based on the third information;
步骤23、基于第一子值和/或第三子值,确定预测节点对应的第二数值。Step 23: Determine a second numerical value corresponding to the prediction node based on the first sub-value and/or the third sub-value.
基于第二信息,确定第一子值的过程可以参照上述步骤11的描述,在此不再赘述。The process of determining the first sub-value based on the second information can refer to the description of step 11 above, which will not be repeated here.
其中,基于第三信息,确定第三子值的方式可以是:若第三信息表示预测节点的第二个点是重复点,则确定第三子值的取值为第一值;若第三信息表示预测节点的第二个点不是重复点,则确定第三子值的取值为第二值。Among them, based on the third information, the method of determining the third sub-value can be: if the third information indicates that the second point of the predicted node is a repeated point, then the value of the third sub-value is determined to be the first value; if the third information indicates that the second point of the predicted node is not a repeated point, then the value of the third sub-value is determined to be the second value.
接着,解码端基于上述确定的第一子值和/或第三子值,确定该预测节点对应的第二数值。例如,将第一子值确定为该预测节点对应的第二数值。或者,将该第三子值,确定为该预测节点对应的第二数值。或者,基于第一子值和第三子值,确定该预测节点对应的第二数值,例如,将第一子值和第三子值之和,确定为预测节点对应的第二数值。Next, the decoding end determines the second value corresponding to the prediction node based on the first sub-value and/or the third sub-value determined above. For example, the first sub-value is determined as the second value corresponding to the prediction node. Alternatively, the third sub-value is determined as the second value corresponding to the prediction node. Alternatively, the second value corresponding to the prediction node is determined based on the first sub-value and the third sub-value, for example, the sum of the first sub-value and the third sub-value is determined as the second value corresponding to the prediction node.
基于上述步骤,可以确定出当前节点的N个预测节点中,每一个预测节点对应的第二数值。接着,执行如下步骤A1-12:Based on the above steps, the second value corresponding to each prediction node in the N prediction nodes of the current node can be determined. Then, the following steps A1-12 are performed:
步骤A1-12、基于N个预测节点对应的第二数值,确定第二上下文模型的索引。Step A1-12: determine the index of the second context model based on the second numerical values corresponding to the N prediction nodes.
解码端基于上述步骤,确定出N个预测节点对应的第二数值后,基于N个预测节点对应的第二数值,确定第二上下文模型的索引。After the decoding end determines the second values corresponding to the N prediction nodes based on the above steps, it determines the index of the second context model based on the second values corresponding to the N prediction nodes.
其中,基于N个预测节点对应的第二数值,确定第二上下文模型的索引至少包括如下实现方式:Wherein, determining the index of the second context model based on the second values corresponding to the N prediction nodes includes at least the following implementation methods:
方式1,将N个预测节点对应的第二数值之和的平均值,确定为第二上下文模型的索引。Method 1: Determine the average value of the sum of the second numerical values corresponding to the N prediction nodes as the index of the second context model.
方式2,步骤A1-12包括如下步骤A1-121至步骤A1-123的步骤: Mode 2, step A1-12 includes the following steps A1-121 to A1-123:
步骤A1-121、确定预测节点对应的第一权重;Step A1-121, determining a first weight corresponding to the prediction node;
步骤A1-122、基于第一权重,对N个预测节点对应的第二数值进行加权处理,得到第三加权预测值;Step A1-122: Based on the first weight, weighted processing is performed on the second values corresponding to the N prediction nodes to obtain a third weighted prediction value;
步骤A1-123、基于第三加权预测值,确定第二上下文模型的索引。Step A1-123, determine the index of the second context model based on the third weighted prediction value.
在该方式2中,若当前节点包括多个预测节点,即N个预测节点时,在基于N个预测节点对应的第二数值确定第二上下文模型的索引时,可以为N个预测节点中的每一个预测节点确定一个权重,即第一权重,这样可以基于各预测节点的第一权重,对各预测节点对应的第二数值进行加权处理,进而根据最终的加权结果,确定第二上下文模型的索引,从而提高了基于N个预测节点的几何解码信息,确定第二上下文模型的索引的准确性。In method 2, if the current node includes multiple prediction nodes, i.e., N prediction nodes, when determining the index of the second context model based on the second numerical values corresponding to the N prediction nodes, a weight, i.e., the first weight, can be determined for each of the N prediction nodes. In this way, the second numerical values corresponding to each prediction node can be weighted based on the first weight of each prediction node, and then the index of the second context model can be determined based on the final weighted result, thereby improving the accuracy of determining the index of the second context model based on the geometric decoding information of the N prediction nodes.
本申请实施例对基于第一权重,对N个预测节点对应的第二数值进行加权处理,得到第二加权预测值的具体方式不做限制。The embodiment of the present application does not limit the specific method of obtaining the second weighted prediction value by weighting the second numerical values corresponding to the N prediction nodes based on the first weight.
在一种示例中,基于第一权重,对N个预测节点对应的第二数值进行加权平均,得到第三加权预测值。In one example, based on the first weight, a weighted average is performed on the second values corresponding to the N prediction nodes to obtain a third weighted prediction value.
在另一种示例中,基于第一权重,对N个预测节点对应的第二数值进行加权求和,得到第三加权预测值。In another example, based on the first weight, a weighted sum is performed on the second values corresponding to the N prediction nodes to obtain a third weighted prediction value.
基于方式步骤,确定出第三加权预测值后,基于该第三加权预测值,确定第二上下文模型的索引,即上述步骤A1-123至少包括如下几种示例:After the third weighted prediction value is determined based on the method steps, the index of the second context model is determined based on the third weighted prediction value, that is, the above steps A1-123 include at least the following examples:
示例1,将第三加权预测值,确定为第二上下文模型的索引。Example 1: Determine the third weighted prediction value as the index of the second context model.
示例2,确定第三加权预测值所在加权预测值范围,将该范围对应的索引,确定为第二上下文模型的索引。Example 2: Determine the weighted prediction value range in which the third weighted prediction value is located, and determine the index corresponding to the range as the index of the second context model.
在一些实施例中,解码端还可以采用如下方式二,确定第二上下文模型的索引。In some embodiments, the decoding end may also adopt the following method 2 to determine the index of the second context model.
方式二、若K大于1时,确定K个预测参考帧中每一个预测参考帧对应的第四加权预测值,进而基于K个预测参考帧分别对应的第四加权预测值,确定第二上下文模型的索引。此时,上述步骤A1包括如下步骤A1-21至步骤A1-24的步骤:Method 2: If K is greater than 1, determine the fourth weighted prediction value corresponding to each of the K prediction reference frames, and then determine the index of the second context model based on the fourth weighted prediction values corresponding to the K prediction reference frames. At this time, the above step A1 includes the following steps A1-21 to A1-24:
步骤A1-21、针对K个预测参考帧中的第j个预测参考帧,基于当前节点在第j个预测参考帧中预测节点的第二直接解码信息,确定第j个预测参考帧中的预测节点对应的第二数值,j为小于或等于K的正整数;Step A1-21, for the j-th prediction reference frame among the K prediction reference frames, determining a second value corresponding to the prediction node in the j-th prediction reference frame based on the second direct decoding information of the prediction node of the current node in the j-th prediction reference frame, where j is a positive integer less than or equal to K;
步骤A1-22、确定预测节点对应的第一权重,并基于第一权重对第j个预测参考帧中的预测节点对应的第二数值进行加权处理,得到第j个预测参考帧对应的第四加权预测值;Step A1-22, determining a first weight corresponding to the prediction node, and performing weighted processing on the second value corresponding to the prediction node in the j-th prediction reference frame based on the first weight, to obtain a fourth weighted prediction value corresponding to the j-th prediction reference frame;
步骤A1-23、基于K个预测参考帧对应的第四加权预测值,确定第二上下文模型的索引。Step A1-23, determine the index of the second context model based on the fourth weighted prediction values corresponding to the K prediction reference frames.
在该方式二中,在确定第二上下文模型的索引时,对这K个预测参考帧中的每一个预测参考帧作为单独的上下文信息分别进行考虑。具体是,确定K个预测参考帧中每一个预测参考帧所包括的预测节点的第二直接解码信息,确定每一个预测参考帧对应的第四加权预测值,进而基于每一个预测参考帧对应的第四加权预测值,确定第二上下文模型的索引,实现第二上下文模型的准确选择,进而提升点云的解码效率。In the second method, when determining the index of the second context model, each of the K prediction reference frames is considered as separate context information. Specifically, the second direct decoding information of the prediction node included in each of the K prediction reference frames is determined, and the fourth weighted prediction value corresponding to each prediction reference frame is determined. Then, based on the fourth weighted prediction value corresponding to each prediction reference frame, the index of the second context model is determined to achieve accurate selection of the second context model, thereby improving the decoding efficiency of the point cloud.
本申请实施例中,解码端确定K个预测参考帧中每一个预测参考帧对应的第四加权预测值的具体方式相同,为了便于描述,在此以K个预测参考帧中的第j个预测参考帧为例进行说明。In the embodiment of the present application, the specific method in which the decoding end determines the fourth weighted prediction value corresponding to each of the K prediction reference frames is the same. For the sake of ease of description, the jth prediction reference frame among the K prediction reference frames is taken as an example for explanation.
在本申请实施例中,当前节点在第j个预测参考帧中包括至少一个预测节点,这样基于该第j个预测参考帧中的这至少一个预测节点的第二直接解码信息,确定这至少一个预测节点的第二数值。In an embodiment of the present application, the current node includes at least one prediction node in the jth prediction reference frame, so that the second value of the at least one prediction node is determined based on the second direct decoding information of the at least one prediction node in the jth prediction reference frame.
举例说明,第j个预测参考帧中包括当前节点的2个预测节点,分别记为预测节点1和预测节点2,进而基于预测节点1的第二直接解码信息,确定预测节点1的第二数值,基于预测节点2的第二直接解码信息,确定预测节点2的第二数值。For example, the j-th prediction reference frame includes two prediction nodes of the current node, which are respectively recorded as prediction node 1 and prediction node 2, and then based on the second direct decoding information of prediction node 1, the second value of prediction node 1 is determined, and based on the second direct decoding information of prediction node 2, the second value of prediction node 2 is determined.
解码端确定出第j个预测参考帧所包括的至少一个预测节点的第二数值后,确定这至少一个预测节点分别对应的第一权重,并基于第一权重对这至少一个预测节点对应的第二数值进行加权处理,得到第j个预测参考帧对应的第四加权预测值。After the decoding end determines the second value of at least one prediction node included in the j-th prediction reference frame, it determines the first weight corresponding to the at least one prediction node, and weightedly processes the second value corresponding to the at least one prediction node based on the first weight to obtain the fourth weighted prediction value corresponding to the j-th prediction reference frame.
在一种示例中,基于第一权重,对第j个预测参考帧中的预测节点对应的第二数值进行加权平均,得到第j个预测参考帧对应的第四加权预测值。In one example, based on the first weight, a weighted average is performed on the second values corresponding to the prediction nodes in the j-th prediction reference frame to obtain a fourth weighted prediction value corresponding to the j-th prediction reference frame.
在另一种示例中,基于第一权重,对第j个预测参考帧中的预测节点对应的第二数值进行加权求和,得到第j个预测参考帧对应的第四加权预测值。In another example, based on the first weight, a weighted sum is performed on the second values corresponding to the prediction nodes in the j-th prediction reference frame to obtain a fourth weighted prediction value corresponding to the j-th prediction reference frame.
其中,第一权重的确定过程可以参照上述实施例的描述,在此不再赘述。The process of determining the first weight may refer to the description of the above embodiment and will not be repeated here.
上述对确定K个预测参考帧中第j个预测参考帧对应的第四加权预测值的过程进行介绍,K个预测参考帧中其他预测参考帧对应的第四加权预测值参照上述第j个预测参考帧对应的方式进行确定。The above introduces the process of determining the fourth weighted prediction value corresponding to the j-th prediction reference frame among the K prediction reference frames. The fourth weighted prediction values corresponding to other prediction reference frames among the K prediction reference frames are determined in accordance with the method corresponding to the j-th prediction reference frame.
解码端确定出K个预测参考帧中每一个预测参考帧对应的第四加权预测值后,执行上述步骤A1-23的步骤。After the decoding end determines the fourth weighted prediction value corresponding to each of the K prediction reference frames, it executes the above steps A1-23.
本申请实施对基于K个预测参考帧对应的第四加权预测值,确定第二上下文模型的索引的具体方式不做限制。The present application does not limit the specific method of determining the index of the second context model based on the fourth weighted prediction value corresponding to K prediction reference frames.
在一些实施例中,解码端将K个预测参考帧对应的第四加权预测值的平均值,确定为第二上下文模型的索引。In some embodiments, the decoding end determines the average value of the fourth weighted prediction values corresponding to the K prediction reference frames as the index of the second context model.
在一些实施例中,解码端确定K个预测参考帧对应的第二权重,并基于第二权重对K个预测参考帧对应的第四加权预测值进行加权处理,得到第二上下文模型的索引。In some embodiments, the decoding end determines second weights corresponding to K predicted reference frames, and performs weighted processing on fourth weighted prediction values corresponding to the K predicted reference frames based on the second weights to obtain an index of the second context model.
举例说明,假设K=2,例如当前待解码帧包括2个预测参考帧,这2个预测参考帧包括当前待解码帧的前向帧和后向帧,假设前向帧对应的第二权重为W1、后向帧对应的第二权重为W2,这样基于W1和W2,对前向帧对应的第四加权预测值和后向帧对应的第四加权预测值进行加权,得到第二上下文模型的索引。For example, assuming K=2, for example, the current frame to be decoded includes 2 prediction reference frames, and these 2 prediction reference frames include the forward frame and backward frame of the current frame to be decoded. Assuming that the second weight corresponding to the forward frame is W1, and the second weight corresponding to the backward frame is W2, based on W1 and W2, the fourth weighted prediction value corresponding to the forward frame and the fourth weighted prediction value corresponding to the backward frame are weighted to obtain the index of the second context model.
在一种示例中,基于第二权重,对K个预测参考帧分别对应的第四加权预测值进行加权平均,得到第二上下文模型的索引。In one example, based on the second weight, the fourth weighted prediction values corresponding to the K prediction reference frames are weighted averaged to obtain the index of the second context model.
在另一种示例中,基于第二权重,对K个预测参考帧分别对应的第四加权预测值进行加权求和,得到第二上下文模型的索引。In another example, based on the second weight, the fourth weighted prediction values corresponding to the K prediction reference frames are weighted summed to obtain the index of the second context model.
解码端基于上述步骤,确定出第二上下模型的索引后,从第二标志对应的P个上下文模型中,选出第二上下文模型。接着,使用该第二上下文模型对当前节点对应的第二标志进行解码,进而基于第二标志确定当前节点的点数。After the decoder determines the index of the second up-down model based on the above steps, it selects the second context model from the P context models corresponding to the second mark. Then, the second context model is used to decode the second mark corresponding to the current node, and the number of points of the current node is determined based on the second mark.
在一些实施例中,若当前节点对应的第二标志表示当前节点的第二个点不是重复点,则确定当前节点包括一个点。In some embodiments, if the second flag corresponding to the current node indicates that the second point of the current node is not a repeated point, it is determined that the current node includes one point.
在一些实施例中,若当前节点对应的第二标志表示当前节点的第二个点为重复点,则本申请实施例的方法还包括如下步骤B:In some embodiments, if the second flag corresponding to the current node indicates that the second point of the current node is a duplicate point, the method of the embodiment of the present application further includes the following step B:
步骤B、将N个预测节点的第三直接解码信息作为上下文信息,对当前节点对应的第三标志进行预测解码,第三标志用于表示当前节点的第三个点是否为重复点。Step B: using the third direct decoding information of the N predicted nodes as context information, predictively decoding the third flag corresponding to the current node, where the third flag is used to indicate whether the third point of the current node is a repeated point.
在该实施例中,若解码端解码当前节点对应的第二标志,得到该第二标志指示当前节点的第二个点是重复点时,则解码端继续对当前节点对应的第三标志进行解码,以确定当前节点的第三个点是否为重复点。In this embodiment, if the decoding end decodes the second flag corresponding to the current node and obtains that the second flag indicates that the second point of the current node is a repeated point, the decoding end continues to decode the third flag corresponding to the current node to determine whether the third point of the current node is a repeated point.
在本申请实施例中,使用N个预测节点的第三直接解码信息作为上下文信息,对当前节点对应的第三标志进行预 测解码,实现对第三标志的高效解码。In an embodiment of the present application, the third direct decoding information of N predicted nodes is used as context information to predictively decode the third flag corresponding to the current node, thereby achieving efficient decoding of the third flag.
在一些实施例中,上述步骤B包括如下步骤:In some embodiments, the above step B includes the following steps:
步骤B1、基于N个预测节点的第三直接解码信息,确定第三上下文模型的索引;Step B1, determining an index of a third context model based on third direct decoding information of N prediction nodes;
步骤B2、基于第三上下文模型的索引,从第三标志对应的R个上下文模型中,选出第三上下文模型,R为大于1的正整数;Step B2: based on the index of the third context model, select the third context model from R context models corresponding to the third flag, where R is a positive integer greater than 1;
步骤B3、使用第三上下文模型,对当前节点对应的第三标志进行预测解码。Step B3: Use the third context model to predict and decode the third flag corresponding to the current node.
在本申请实施例中,第三标志对应多个上下文模型,例如R个上下文模型,本申请实施例对第三标志所对应的上下文模型的具体个数不做限制,只要保证R大于1即可。也就是说,在本申请实施例中,至少从2个上下文模型中选择一个最优的上下文模型,对当前节点的第三标志进行解码,以提高对第三标志的解码效率。In the embodiment of the present application, the third flag corresponds to multiple context models, for example, R context models. The embodiment of the present application does not limit the specific number of context models corresponding to the third flag, as long as R is greater than 1. That is to say, in the embodiment of the present application, at least one optimal context model is selected from two context models to decode the third flag of the current node, so as to improve the decoding efficiency of the third flag.
下面对上述步骤B1中基于N个预测节点的第三直接解码信息,确定第三上下文模型的索引的具体过程进行介绍。The specific process of determining the index of the third context model based on the third direct decoding information of the N prediction nodes in the above step B1 is introduced below.
方式一,上述步骤B1包括如下步骤B1-11和步骤B1-12的步骤:Method 1: the above step B1 includes the following steps B1-11 and B1-12:
步骤B1-11、针对N个预测节点中的任一预测节点,基于预测节点的第三直接解码信息,确定预测节点对应的第三数值。Step B1-11: for any prediction node among the N prediction nodes, determine a third value corresponding to the prediction node based on the third direct decoding information of the prediction node.
在该方式中,针对N个预测节点中的每一个预测节点,基于该预测节点的第三直接解码信息,确定该预测节点对应的第三数值,最后基于N个预测节点对应的第三数值,确定第三上下文模型的索引。In this method, for each of the N prediction nodes, the third numerical value corresponding to the prediction node is determined based on the third direct decoding information of the prediction node, and finally the index of the third context model is determined based on the third numerical values corresponding to the N prediction nodes.
下面对确定预测节点对应的第三数值的过程进行介绍。The process of determining the third value corresponding to the prediction node is introduced below.
在一些实施例中,预测节点的第三直接解码信息包括第二信息和第四信息,其中第二信息表示预测节点是否满足直接解码方式进行解码的条件,第四信息表示预测节点的第三个点是否为重复点。In some embodiments, the third direct decoding information of the prediction node includes second information and fourth information, wherein the second information indicates whether the prediction node meets the conditions for decoding in a direct decoding manner, and the fourth information indicates whether the third point of the prediction node is a repeated point.
此时,解码端可以基于预测节点的第二信息和/或第四信息,确定该预测节点对应的第三数值。At this time, the decoding end may determine the third value corresponding to the prediction node based on the second information and/or the fourth information of the prediction node.
示例性的,解码端通过如下步骤,确定出预测节点对应的第三数值:Exemplarily, the decoding end determines the third value corresponding to the prediction node through the following steps:
步骤31、基于第二信息,确定第一子值;Step 31: Determine a first sub-value based on the second information;
步骤32、基于第四信息,确定第四子值;Step 32: Determine a fourth sub-value based on the fourth information;
步骤23、基于第一子值和/或第四子值,确定预测节点对应的第三数值。Step 23: Determine a third value corresponding to the prediction node based on the first sub-value and/or the fourth sub-value.
基于第二信息,确定第一子值的过程可以参照上述步骤11的描述,在此不再赘述。The process of determining the first sub-value based on the second information can refer to the description of step 11 above, which will not be repeated here.
其中,基于第四信息,确定第四子值的方式可以是:若第四信息表示预测节点的第三个点是重复点,则确定第四子值的取值为第一值;若第四信息表示预测节点的第三个点不是重复点,则确定第四子值的取值为第二值。Among them, based on the fourth information, the method for determining the fourth sub-value can be: if the fourth information indicates that the third point of the predicted node is a repeated point, then the value of the fourth sub-value is determined to be the first value; if the fourth information indicates that the third point of the predicted node is not a repeated point, then the value of the fourth sub-value is determined to be the second value.
接着,解码端基于上述确定的第一子值和/或第四子值,确定该预测节点对应的第三数值。例如,将第一子值确定为该预测节点对应的第三数值。或者,将该第四子值,确定为该预测节点对应的第三数值。或者,基于第一子值和第四子值,确定该预测节点对应的第三数值,例如,将第一子值和第四子值之和,确定为预测节点对应的第三数值。Next, the decoding end determines the third value corresponding to the prediction node based on the first sub-value and/or the fourth sub-value determined above. For example, the first sub-value is determined as the third value corresponding to the prediction node. Alternatively, the fourth sub-value is determined as the third value corresponding to the prediction node. Alternatively, the third value corresponding to the prediction node is determined based on the first sub-value and the fourth sub-value, for example, the sum of the first sub-value and the fourth sub-value is determined as the third value corresponding to the prediction node.
基于上述步骤,可以确定出当前节点的N个预测节点中,每一个预测节点对应的第三数值。接着,执行如下步骤B1-12:Based on the above steps, the third value corresponding to each prediction node in the N prediction nodes of the current node can be determined. Then, the following steps B1-12 are performed:
步骤B1-12、基于N个预测节点对应的第三数值,确定第三上下文模型的索引。Step B1-12: determine the index of the third context model based on the third numerical values corresponding to the N prediction nodes.
解码端基于上述步骤,确定出N个预测节点对应的第三数值后,基于N个预测节点对应的第三数值,确定第三上下文模型的索引。After the decoding end determines the third values corresponding to the N prediction nodes based on the above steps, it determines the index of the third context model based on the third values corresponding to the N prediction nodes.
其中,基于N个预测节点对应的第三数值,确定第三上下文模型的索引至少包括如下实现方式:Wherein, determining the index of the third context model based on the third values corresponding to the N prediction nodes includes at least the following implementation methods:
方式1,将N个预测节点对应的第三数值之和的平均值,确定为第三上下文模型的索引。Method 1: Determine the average value of the sum of the third numerical values corresponding to the N prediction nodes as the index of the third context model.
方式2,步骤B1-12包括如下步骤B1-121至步骤B1-123的步骤: Mode 2, step B1-12 includes the following steps B1-121 to B1-123:
步骤B1-121、确定预测节点对应的第一权重;Step B1-121, determining a first weight corresponding to the prediction node;
步骤B1-122、基于第一权重,对N个预测节点对应的第三数值进行加权处理,得到第五加权预测值;Step B1-122: Based on the first weight, weighted processing is performed on the third values corresponding to the N prediction nodes to obtain a fifth weighted prediction value;
步骤B1-123、基于第五加权预测值,确定第三上下文模型的索引。Step B1-123, determine the index of the third context model based on the fifth weighted prediction value.
在该方式2中,若当前节点包括多个预测节点,即N个预测节点时,在基于N个预测节点对应的第三数值确定第三上下文模型的索引时,可以为N个预测节点中的每一个预测节点确定一个权重,即第一权重,这样可以基于各预测节点的第一权重,对各预测节点对应的第三数值进行加权处理,进而根据最终的加权结果,确定第三上下文模型的索引,从而提高了基于N个预测节点的几何解码信息,确定第三上下文模型的索引的准确性。In method 2, if the current node includes multiple prediction nodes, i.e., N prediction nodes, when determining the index of the third context model based on the third numerical values corresponding to the N prediction nodes, a weight, i.e., the first weight, can be determined for each of the N prediction nodes. In this way, the third numerical values corresponding to each prediction node can be weighted based on the first weight of each prediction node, and then the index of the third context model can be determined based on the final weighted result, thereby improving the accuracy of determining the index of the third context model based on the geometric decoding information of the N prediction nodes.
本申请实施例对基于第一权重,对N个预测节点对应的第三数值进行加权处理,得到第五加权预测值的具体方式不做限制。The embodiment of the present application does not limit the specific method of obtaining the fifth weighted prediction value by weighting the third numerical values corresponding to the N prediction nodes based on the first weight.
在一种示例中,基于第一权重,对N个预测节点对应的第三数值进行加权平均,得到第五加权预测值。In one example, based on the first weight, a weighted average is performed on the third values corresponding to the N prediction nodes to obtain a fifth weighted prediction value.
在另一种示例中,基于第一权重,对N个预测节点对应的第三数值进行加权求和,得到第五加权预测值。In another example, based on the first weight, a weighted sum is performed on the third values corresponding to the N prediction nodes to obtain a fifth weighted prediction value.
基于方式步骤,确定出第五加权预测值后,基于该第五加权预测值,确定第三上下文模型的索引,即上述步骤B1-123至少包括如下几种示例:After the fifth weighted prediction value is determined based on the method step, the index of the third context model is determined based on the fifth weighted prediction value, that is, the above step B1-123 includes at least the following examples:
示例1,将第五加权预测值,确定为第三上下文模型的索引。Example 1: Determine the fifth weighted prediction value as the index of the third context model.
示例2,确定第五加权预测值所在加权预测值范围,将该范围对应的索引,确定为第三上下文模型的索引。Example 2: Determine the weighted prediction value range in which the fifth weighted prediction value is located, and determine the index corresponding to the range as the index of the third context model.
在一些实施例中,解码端还可以采用如下方式二,确定第三上下文模型的索引。In some embodiments, the decoding end may also adopt the following method 2 to determine the index of the third context model.
方式二、若K大于1时,确定K个预测参考帧中每一个预测参考帧对应的第六加权预测值,进而基于K个预测参考帧分别对应的第六加权预测值,确定第三上下文模型的索引。此时,上述步骤B1包括如下步骤B1-21至步骤B1-24的步骤:Method 2: If K is greater than 1, determine the sixth weighted prediction value corresponding to each of the K prediction reference frames, and then determine the index of the third context model based on the sixth weighted prediction values corresponding to the K prediction reference frames. At this time, the above step B1 includes the following steps B1-21 to B1-24:
步骤B1-21、针对K个预测参考帧中的第j个预测参考帧,基于当前节点在第j个预测参考帧中预测节点的第三直接解码信息,确定第j个预测参考帧中的预测节点对应的第三数值,j为小于或等于K的正整数;Step B1-21, for the j-th prediction reference frame among the K prediction reference frames, determine a third value corresponding to the prediction node in the j-th prediction reference frame based on the third direct decoding information of the prediction node of the current node in the j-th prediction reference frame, where j is a positive integer less than or equal to K;
步骤B1-22、确定预测节点对应的第一权重,并基于第一权重对第j个预测参考帧中的预测节点对应的第三数值 进行加权处理,得到第j个预测参考帧对应的第六加权预测值;Step B1-22, determining a first weight corresponding to the prediction node, and performing weighted processing on the third value corresponding to the prediction node in the j-th prediction reference frame based on the first weight, to obtain a sixth weighted prediction value corresponding to the j-th prediction reference frame;
步骤B1-23、基于K个预测参考帧对应的第六加权预测值,确定第三上下文模型的索引。Step B1-23, determine the index of the third context model based on the sixth weighted prediction value corresponding to the K prediction reference frames.
在该方式二中,在确定第三上下文模型的索引时,对这K个预测参考帧中的每一个预测参考帧作为单独的上下文信息分别进行考虑。具体是,确定K个预测参考帧中每一个预测参考帧所包括的预测节点的第三直接解码信息,确定每一个预测参考帧对应的第六加权预测值,进而基于每一个预测参考帧对应的第六加权预测值,确定第三上下文模型的索引,实现第三上下文模型的准确选择,进而提升点云的解码效率。In the second method, when determining the index of the third context model, each of the K prediction reference frames is considered as separate context information. Specifically, the third direct decoding information of the prediction node included in each of the K prediction reference frames is determined, and the sixth weighted prediction value corresponding to each prediction reference frame is determined. Then, based on the sixth weighted prediction value corresponding to each prediction reference frame, the index of the third context model is determined to achieve accurate selection of the third context model, thereby improving the decoding efficiency of the point cloud.
本申请实施例中,解码端确定K个预测参考帧中每一个预测参考帧对应的第六加权预测值的具体方式相同,为了便于描述,在此以K个预测参考帧中的第j个预测参考帧为例进行说明。In the embodiment of the present application, the specific method in which the decoding end determines the sixth weighted prediction value corresponding to each of the K prediction reference frames is the same. For the sake of ease of description, the jth prediction reference frame among the K prediction reference frames is used as an example for illustration.
在本申请实施例中,当前节点在第j个预测参考帧中包括至少一个预测节点,这样基于该第j个预测参考帧中的这至少一个预测节点的第三直接解码信息,确定这至少一个预测节点的第三数值。In an embodiment of the present application, the current node includes at least one prediction node in the jth prediction reference frame, so that the third value of the at least one prediction node is determined based on the third direct decoding information of the at least one prediction node in the jth prediction reference frame.
举例说明,第j个预测参考帧中包括当前节点的2个预测节点,分别记为预测节点1和预测节点2,进而基于预测节点1的第三直接解码信息,确定预测节点1的第三数值,基于预测节点2的第三直接解码信息,确定预测节点2的第三数值。For example, the j-th prediction reference frame includes two prediction nodes of the current node, which are respectively recorded as prediction node 1 and prediction node 2, and then based on the third direct decoding information of prediction node 1, the third value of prediction node 1 is determined, and based on the third direct decoding information of prediction node 2, the third value of prediction node 2 is determined.
解码端确定出第j个预测参考帧所包括的至少一个预测节点的第三数值后,确定这至少一个预测节点分别对应的第一权重,并基于第一权重对这至少一个预测节点对应的第三数值进行加权处理,得到第j个预测参考帧对应的第六加权预测值。After the decoding end determines the third value of at least one prediction node included in the j-th prediction reference frame, it determines the first weight corresponding to the at least one prediction node, and weightedly processes the third value corresponding to the at least one prediction node based on the first weight to obtain the sixth weighted prediction value corresponding to the j-th prediction reference frame.
在一种示例中,基于第一权重,对第j个预测参考帧中的预测节点对应的第三数值进行加权平均,得到第j个预测参考帧对应的第六加权预测值。In one example, based on the first weight, a weighted average is performed on the third values corresponding to the prediction nodes in the j-th prediction reference frame to obtain a sixth weighted prediction value corresponding to the j-th prediction reference frame.
在另一种示例中,基于第一权重,对第j个预测参考帧中的预测节点对应的第三数值进行加权求和,得到第j个预测参考帧对应的第六加权预测值。In another example, based on the first weight, a weighted sum is performed on the third values corresponding to the prediction nodes in the j-th prediction reference frame to obtain a sixth weighted prediction value corresponding to the j-th prediction reference frame.
其中,第一权重的确定过程可以参照上述实施例的描述,在此不再赘述。The process of determining the first weight may refer to the description of the above embodiment and will not be repeated here.
上述对确定K个预测参考帧中第j个预测参考帧对应的第六加权预测值的过程进行介绍,K个预测参考帧中其他预测参考帧对应的第六加权预测值参照上述第j个预测参考帧对应的方式进行确定。The above introduces the process of determining the sixth weighted prediction value corresponding to the j-th prediction reference frame among the K prediction reference frames. The sixth weighted prediction values corresponding to other prediction reference frames among the K prediction reference frames are determined in accordance with the method corresponding to the j-th prediction reference frame.
解码端确定出K个预测参考帧中每一个预测参考帧对应的第六加权预测值后,执行上述步骤B1-23的步骤。After the decoding end determines the sixth weighted prediction value corresponding to each of the K prediction reference frames, the above steps B1-23 are executed.
本申请实施对基于K个预测参考帧对应的第六加权预测值,确定第三上下文模型的索引的具体方式不做限制。The present application does not limit the specific method of determining the index of the third context model based on the sixth weighted prediction value corresponding to K prediction reference frames.
在一些实施例中,解码端将K个预测参考帧对应的第六加权预测值的平均值,确定为第三上下文模型的索引。In some embodiments, the decoding end determines the average value of the sixth weighted prediction values corresponding to the K prediction reference frames as the index of the third context model.
在一些实施例中,解码端确定K个预测参考帧对应的第二权重,并基于第二权重对K个预测参考帧对应的第六加权预测值进行加权处理,得到第三上下文模型的索引。In some embodiments, the decoding end determines second weights corresponding to K predicted reference frames, and performs weighted processing on sixth weighted prediction values corresponding to the K predicted reference frames based on the second weights to obtain an index of the third context model.
举例说明,假设K=2,例如当前待解码帧包括2个预测参考帧,这2个预测参考帧包括当前待解码帧的前向帧和后向帧,假设前向帧对应的第二权重为W1、后向帧对应的第二权重为W2,这样基于W1和W2,对前向帧对应的第六加权预测值和后向帧对应的第六加权预测值进行加权,得到第三上下文模型的索引。For example, assuming K=2, for example, the current frame to be decoded includes 2 prediction reference frames, and these 2 prediction reference frames include the forward frame and backward frame of the current frame to be decoded. Assuming that the second weight corresponding to the forward frame is W1, and the second weight corresponding to the backward frame is W2, based on W1 and W2, the sixth weighted prediction value corresponding to the forward frame and the sixth weighted prediction value corresponding to the backward frame are weighted to obtain the index of the third context model.
在一种示例中,基于第二权重,对K个预测参考帧分别对应的第六加权预测值进行加权平均,得到第三上下文模型的索引。In one example, based on the second weight, the sixth weighted prediction values corresponding to the K prediction reference frames are weighted averaged to obtain the index of the third context model.
在另一种示例中,基于第二权重,对K个预测参考帧分别对应的第六加权预测值进行加权求和,得到第三上下文模型的索引。In another example, based on the second weight, the sixth weighted prediction values corresponding to the K prediction reference frames are weighted summed to obtain the index of the third context model.
解码端基于上述步骤,确定出第三上下模型的索引后,从第三标志对应的R个上下文模型中,选出第三上下文模型。接着,使用该第三上下文模型对当前节点对应的第三标志进行解码,进而基于第三标志确定当前节点的点数。After the decoder determines the index of the third up-down model based on the above steps, it selects the third context model from the R context models corresponding to the third flag. Then, the third context model is used to decode the third flag corresponding to the current node, and the number of points of the current node is determined based on the third flag.
例如,若当前节点对应的第三标志表示当前节点的第三个点不是重复点,则确定当前节点包括2个点,且这2个点为重复点。For example, if the third flag corresponding to the current node indicates that the third point of the current node is not a repeated point, it is determined that the current node includes 2 points, and the 2 points are repeated points.
再例如,若当前节点对应的第三标志表示当前节点的第三个点是重复点,则继续解码当前节点剩余的重复点。For another example, if the third flag corresponding to the current node indicates that the third point of the current node is a repeated point, then continue to decode the remaining repeated points of the current node.
在一种具体的示例中,本申请实施例的点云解码方法包括如下步骤:In a specific example, the point cloud decoding method of the embodiment of the present application includes the following steps:
步骤1、将N个预测节点的第一直接解码信息作为上下文信息,对当前节点对应的第一标志进行预测解码。Step 1: Use the first direct decoding information of N prediction nodes as context information to perform predictive decoding on the first flag corresponding to the current node.
示例性的,第一直接解码信息包括如下内容:Exemplarily, the first direct decoding information includes the following content:
a)、预测节点的IDCM模式,即PredDCMode(0,1,2);a) IDCM mode of the prediction node, i.e. PredDCMode(0,1,2);
b)、预测节点中的点数PredNumPoints是否大于1。b) Whether the number of points in the prediction node PredNumPoints is greater than 1.
上述步骤1的具体实现过程可以参照上述实施例的描述,在此不再赘述。The specific implementation process of the above step 1 can refer to the description of the above embodiment and will not be repeated here.
步骤2、判断第一标志指示当前节点的点数是否大于1。Step 2: Determine whether the number of points of the current node indicated by the first flag is greater than 1.
基于上述步骤1,解码得到第一标志,若第一标志为真,即表示当前节点的点数大于1时,则执行步骤3,此时说明当前节点包括2个点,且该2个点为非重复点。Based on the above step 1, the first flag is decoded and if the first flag is true, that is, the number of points of the current node is greater than 1, then step 3 is executed, which means that the current node includes 2 points, and the 2 points are non-repeated points.
若第一标志为假,即表示当前节点的点数小于或等于1,则执行步骤4。If the first flag is false, which means the number of points of the current node is less than or equal to 1, then execute step 4.
步骤3、当前节点包括2个非重复点。Step 3: The current node includes 2 non-repeated points.
步骤4、将N个预测节点的第二直接解码信息作为上下文信息,对当前节点对应的第二标志进行预测解码。Step 4: Use the second direct decoding information of the N prediction nodes as context information to perform predictive decoding on the second flag corresponding to the current node.
示例性的,第二直接解码信息包括如下内容:Exemplarily, the second direct decoding information includes the following content:
a)、预测节点的IDCM模式,即PredDCMode(0,1,2);a) IDCM mode of the prediction node, i.e. PredDCMode(0,1,2);
b)、预测节点中的第二个点是否是重复点,如果预测节点的点数小于2,则默认为不是重复点。b) Whether the second point in the prediction node is a duplicate point. If the number of points in the prediction node is less than 2, it is assumed not to be a duplicate point by default.
上述步骤5的具体实现过程可以参照上述实施例的描述,在此不再赘述。The specific implementation process of the above step 5 can refer to the description of the above embodiment, which will not be repeated here.
步骤5、判断当前节点的第二个点是否为重复点。Step 5: Determine whether the second point of the current node is a duplicate point.
若第二标志为假,则表示当前节点的第二点不是重复点,此时执行步骤6,即可以确定出当前节点只包括1个点。If the second flag is false, it means that the second point of the current node is not a duplicate point. In this case, step 6 is executed to determine that the current node includes only one point.
若第二标志为真,即表示当前节点的第二个点为重复点时,则执行步骤7,即说明当前节点包括多个重复点。If the second flag is true, that is, it means that the second point of the current node is a repeated point, then step 7 is executed, which means that the current node includes multiple repeated points.
步骤6、当前节点包括1个点。Step 6: The current node includes 1 point.
步骤7、将N个预测节点的第三直接解码信息作为上下文信息,对当前节点对应的第三标志进行预测解码。Step 7: Use the third direct decoding information of the N prediction nodes as context information to perform predictive decoding on the third flag corresponding to the current node.
示例性的,第三直接解码信息包括如下内容:Exemplarily, the third direct decoding information includes the following content:
a)、预测节点的IDCM模式,即PredDCMode(0,1,2);a) IDCM mode of the prediction node, i.e. PredDCMode(0,1,2);
b)、预测节点中的第三个点是否是重复点,如果预测节点的点数小于2,则默认为不是重复点。b) Whether the third point in the prediction node is a duplicate point. If the number of points in the prediction node is less than 2, it is assumed not to be a duplicate point.
上述步骤7的具体实现过程可以参照上述实施例的描述,在此不再赘述。The specific implementation process of the above step 7 can refer to the description of the above embodiment, which will not be repeated here.
步骤8、判断当前节点的第三个点是否为重复点。Step 8: Determine whether the third point of the current node is a duplicate point.
若第三标志为假,即表示当前节点的第三个点不是重复点,此时执行步骤9,则可以确定出当前节点包括2个点,且这2个点为重复点。If the third flag is false, it means that the third point of the current node is not a duplicate point. In this case, step 9 is executed to determine that the current node includes 2 points, and these 2 points are duplicate points.
若第三标志为真,即表示当前节点的第三个点为重复点时,则执行步骤10,此时说明当前节点可能还存在其他重复点。If the third flag is true, that is, the third point of the current node is a duplicate point, then step 10 is executed, which means that there may be other duplicate points in the current node.
步骤9、当前节点包括2个重复点。Step 9: The current node includes 2 duplicate points.
步骤10、解码当前节点的剩余重复点数目。Step 10: Decode the number of remaining repeated points of the current node.
本申请实施例提供的点云解码方法,在对当前解码帧中的当前节点进行解码时,确定当前节点对应的帧间解码信息,并将当前节点对应的帧间解码信息作为上下文信息对当前节点的点数进行预测解码,增加了解码点数时可选择的上下文条件,使得确定出的条件熵减小,从而提升点云的解码效率和性能。The point cloud decoding method provided in the embodiment of the present application determines the inter-frame decoding information corresponding to the current node when decoding the current node in the current decoding frame, and uses the inter-frame decoding information corresponding to the current node as context information to predict and decode the number of points of the current node, thereby increasing the selectable context conditions when decoding the number of points, reducing the determined conditional entropy, and thus improving the decoding efficiency and performance of the point cloud.
上文以解码端为例,对本申请实施例提供的点云解码方法进行详细介绍,下面以编码端为例,对本申请实施例提供的点云编码方法进行介绍。The above takes the decoding end as an example to introduce in detail the point cloud decoding method provided in the embodiment of the present application. The following takes the encoding end as an example to introduce the point cloud encoding method provided in the embodiment of the present application.
图15为本申请一实施例提供的点云编码方法流程示意图。本申请实施例的点云编码方法可以由上述图3或图4A所示的点云编码设备完成。Fig. 15 is a schematic diagram of a point cloud coding method according to an embodiment of the present application. The point cloud coding method according to the embodiment of the present application can be implemented by the point cloud coding device shown in Fig. 3 or Fig. 4A above.
如图15所示,本申请实施例的点云编码方法包括:As shown in FIG15 , the point cloud encoding method of the embodiment of the present application includes:
S201、确定当前节点对应的帧间编码信息。S201. Determine inter-frame coding information corresponding to the current node.
其中,当前节点为当前待编码帧中的待编码节点。The current node is the node to be encoded in the current frame to be encoded.
由上述可知,点云包括几何信息和属性信息,对点云的编码包括几何编码和属性编码。本申请实施例涉及点云的几何编码。As can be seen from the above, the point cloud includes geometric information and attribute information, and the encoding of the point cloud includes geometric encoding and attribute encoding. The embodiment of the present application relates to geometric encoding of point clouds.
在一些实施例中,点云的几何信息也称为点云的位置信息,因此,点云的几何编码也称为点云的位置编码。In some embodiments, the geometric information of the point cloud is also referred to as the position information of the point cloud, and therefore, the geometric encoding of the point cloud is also referred to as the position encoding of the point cloud.
在基于八叉树的编码方式中,编码端基于点云的几何信息,构建点云的八叉树结构,如图9所示,使用最小长方体包围点云,首先对该包围盒进行八叉树划分,得到8个节点,对这8个节点中被占用的节点,即包括点的节点继续进行八叉树划分,以此类推,直到划分到体素级别位置,例如划分到1X1X1的正方体为止。这样划分得到的点云八叉树结构包括多层节点组成,例如包括N层,在编码时,逐层编码每一层的占位信息,直到编码完最后一层的体素级别的叶子节点为止。也就是说,在八叉树编码中,将点云通八叉树划分,最终将点云中的点划分到八叉树的体素级的叶子节点中,通过对整个八叉树进行编码,实现对点云的编码。In the octree-based encoding method, the encoding end constructs the octree structure of the point cloud based on the geometric information of the point cloud. As shown in Figure 9, the point cloud is enclosed by the smallest cuboid. The enclosing box is first divided into octrees to obtain 8 nodes. The occupied nodes among the 8 nodes, that is, the nodes including the points, continue to be divided into octrees, and so on, until the division is to the voxel level, for example, to a 1X1X1 cube. The point cloud octree structure obtained by such division includes multiple layers of nodes, for example, N layers. When encoding, the placeholder information of each layer is encoded layer by layer until the voxel-level leaf nodes of the last layer are encoded. That is to say, in octree encoding, the point cloud is divided through the octree, and finally the points in the point cloud are divided into the voxel-level leaf nodes of the octree. The encoding of the point cloud is achieved by encoding the entire octree.
但基于八叉树的几何信息编码模式对空间中具有相关性的点有高效的压缩速率,而对于在几何空间中处于孤立位置的点来说,使用直接编码方式可以大大降低复杂度,提升编编码效率。However, the octree-based geometric information encoding mode has an efficient compression rate for correlated points in space, and for points in isolated positions in the geometric space, the use of direct encoding can greatly reduce complexity and improve encoding efficiency.
由于直接编码方式是对节点所包括的点的几何信息直接进行编码,若节点所包括的点数较多时,采用直接编码方式时压缩效果差。因此,对于八叉树中的节点,在进行直接编码之前,首先判断该节点是否可以采用直接编码方式。若判断该节点可以采用直接编码方式进行编码时,则采用直接编码方式对该节点所包括的点的几何信息进行直接编码。若判断该节点不可以采用直接编码方式进行编码时,则继续采用八叉树方式对该节点进行划分。Since the direct encoding method directly encodes the geometric information of the points included in the node, if the number of points included in the node is large, the compression effect is poor when the direct encoding method is used. Therefore, for the nodes in the octree, before direct encoding, first determine whether the node can be encoded using the direct encoding method. If it is determined that the node can be encoded using the direct encoding method, the direct encoding method is used to directly encode the geometric information of the points included in the node. If it is determined that the node cannot be encoded using the direct encoding method, the octree method is continued to be used to divide the node.
具体的,编码端首先判断节点是否具备直接编码的资格,若该节点具备直接编码的资格后,判断节点的点数是否小于或等于预设阈值,若节点的点数小于或等于预设阈值,则确定该节点可以采用直接编码方式进行编码。接着,将该节点所包括的点数,以及各点的几何信息编入码流。对应的,编码端在确定该节点具备直接编码的资格后,编码码流,得到该节点的点数,以及各点的几何信息,实现该节点的几何编码。Specifically, the encoding end first determines whether the node is qualified for direct encoding. If the node is qualified for direct encoding, it determines whether the number of points of the node is less than or equal to the preset threshold. If the number of points of the node is less than or equal to the preset threshold, it is determined that the node can be encoded using direct encoding. Then, the number of points included in the node and the geometric information of each point are encoded into the bitstream. Correspondingly, after determining that the node is qualified for direct encoding, the encoding end encodes the bitstream to obtain the number of points of the node and the geometric information of each point, thereby realizing geometric encoding of the node.
由于对语法元素进行编码时,上下文模型选择的条件越多则得到的条件熵越小,编码性能越强。但是,目前对节点的点数进行编码时,未考虑帧间信息,导致可选择的上下文条件少得到的条件熵大,使得点数的熵编码性能差,进行降低点云的编码性能。When encoding syntax elements, the more conditions the context model selects, the smaller the conditional entropy is, and the better the encoding performance is. However, when encoding the number of points of a node, the inter-frame information is not considered, resulting in a small number of selectable context conditions and a large conditional entropy, which makes the entropy encoding performance of the point number poor, and reduces the encoding performance of the point cloud.
为了解决上述问题,本申请实施例中,编码端基于当前节点对应的帧间编码信息,对当前节点的点数进行预测编码,进而增加了编码点数时可选择的上下文条件,使得确定出的条件熵减小,从而提升点云的编码效率和编码性能。In order to solve the above problems, in an embodiment of the present application, the encoding end predicts the number of points of the current node based on the inter-frame coding information corresponding to the current node, thereby increasing the selectable context conditions when encoding the point number, so that the determined conditional entropy is reduced, thereby improving the coding efficiency and coding performance of the point cloud.
具体的,编码端首先确定当前节点对应的帧间编码信息。Specifically, the encoding end first determines the inter-frame coding information corresponding to the current node.
本申请实施例对当前节点对应的帧间编码信息的具体类型不做限制,可以理解为当前节点涉及的任意帧间几何编码信息。例如,当前节点对应的帧间编码信息可以包括当前节点的相关帧间几何编码信息,和/或当前待编码帧的相关帧间几何编码信息。The embodiment of the present application does not limit the specific type of inter-frame coding information corresponding to the current node, which can be understood as any inter-frame geometric coding information involved in the current node. For example, the inter-frame coding information corresponding to the current node may include relevant inter-frame geometric coding information of the current node and/or relevant inter-frame geometric coding information of the current frame to be coded.
在一些实施例中,当前节点对应的帧间编码信息包括当前节点的N个预测节点的几何编码信息和当前节点对应的第一信息中的至少一个。其中,当前节点的N个预测节点为当前节点在帧间预测参考帧中的预测节点,当前节点对应的第一信息用于指示当前节点是否允许启动帧间编码方式。In some embodiments, the inter-frame coding information corresponding to the current node includes at least one of the geometric coding information of the N prediction nodes of the current node and the first information corresponding to the current node. The N prediction nodes of the current node are the prediction nodes of the current node in the inter-frame prediction reference frame, and the first information corresponding to the current node is used to indicate whether the current node is allowed to start the inter-frame coding mode.
此时,上述S201中确定当前节点对应的帧间编码信息包括如下S201-A的步骤:At this time, determining the inter-frame coding information corresponding to the current node in the above S201 includes the following step S201-A:
S201-A、在当前待编码帧的预测参考帧中,确定当前节点的N个预测节点,和/或编码点云码流,得到第一信息。S201-A. In a prediction reference frame of a current frame to be encoded, determine N prediction nodes of a current node and/or encode a point cloud code stream to obtain first information.
其中,N为正整数。Wherein, N is a positive integer.
本申请实施例对确定当前待编码帧的预测参考帧的具体方式也不做限制。The embodiment of the present application does not limit the specific method of determining the prediction reference frame of the current frame to be encoded.
在一些实施例中,将当前待编码帧的前一个或前几个已编码帧,确定为该当前待编码帧的预测参考帧。In some embodiments, one or several encoded frames before the current frame to be encoded are determined as prediction reference frames of the current frame to be encoded.
在一些实施例中,将当前待编码帧的后一个或后几个已编码帧,确定为该当前待编码帧的预测参考帧。In some embodiments, one or several encoded frames following the current frame to be encoded are determined as prediction reference frames of the current frame to be encoded.
在一些实施例中,将当前待编码帧的前一个或前几个已编码帧,以及当前待编码帧的后一个或后几个已编码帧,确定为该当前待编码帧的预测参考帧。In some embodiments, one or several encoded frames before the current frame to be encoded, and one or several encoded frames after the current frame to be encoded, are determined as prediction reference frames of the current frame to be encoded.
下面以当前待编码帧包括K个预测参考帧为例,对上述S201-A中在当前待编码帧的预测参考帧中,确定当前节点的N个预测节点的具体过程进行介绍。Taking the current frame to be encoded including K prediction reference frames as an example, the specific process of determining N prediction nodes of the current node in the prediction reference frame of the current frame to be encoded in S201-A above is introduced.
在一些实施例中,编码端可以通过如下S201-A1和S201-A2的步骤,确定当前节点的N个预测节点:In some embodiments, the encoder may determine N prediction nodes of the current node through the following steps S201-A1 and S201-A2:
S201-A1、针对K个预测参考帧中的第k个预测参考帧,确定当前节点在第k个预测参考帧中的至少一个预测节点,k为小于或等于K的正整数,K为正整数;S201-A1, for a k-th prediction reference frame among K prediction reference frames, determining at least one prediction node of a current node in the k-th prediction reference frame, where k is a positive integer less than or equal to K, and K is a positive integer;
S201-A2、基于当前节点在K个预测参考帧中的至少一个预测节点,确定当前节点的N个预测节点。S201-A2: Determine N prediction nodes of the current node based on at least one prediction node of the current node in K prediction reference frames.
在该实施例中,编码端从K个预测参考帧中的每一个预测参考帧中,确定出当前节点的至少一个预测节点,最后将K个预测参考帧中各预测参考帧中的至少一个预测节点进行汇总,得到当前节点的N个预测节点。In this embodiment, the encoding end determines at least one prediction node of the current node from each of the K prediction reference frames, and finally summarizes at least one prediction node in each of the K prediction reference frames to obtain N prediction nodes of the current node.
其中,编码端确定当前节点在K个预测参考帧中每一个预测参考帧中的至少一个预测点的过程相同,为了便于描述,在此以K个预测参考帧中的第k个预测参考帧为例进行说明。Among them, the process of the encoding end determining at least one prediction point of the current node in each of the K prediction reference frames is the same. For the sake of ease of description, the kth prediction reference frame among the K prediction reference frames is taken as an example for explanation.
下面对上述S201-A1中确定当前节点在第k个预测参考帧中的至少一个预测节点的具体过程进行介绍。The specific process of determining at least one prediction node of the current node in the kth prediction reference frame in the above S201-A1 is introduced below.
本申请实施例对编码端确定当前节点在第k个预测参考帧中的至少一个预测节点的具体方式不做限制。The embodiment of the present application does not limit the specific manner in which the encoder determines at least one prediction node of the current node in the kth prediction reference frame.
方式一,在第k个预测参考帧中,确定出当前节点的一个预测节点。例如,将第k个预测参考帧中与当前节点的划分深度相同的一个节点,确定为当前节点的预测节点。Method 1: In the kth prediction reference frame, a prediction node of the current node is determined. For example, a node in the kth prediction reference frame having the same division depth as the current node is determined as the prediction node of the current node.
方式二,上述S201-A1中确定当前节点在第k个预测参考帧中的至少一个预测节点,包括如下S201-A11至S201-A13的步骤: Mode 2, in the above S201-A1, determining at least one prediction node of the current node in the kth prediction reference frame includes the following steps S201-A11 to S201-A13:
S201-A11、在当前待编码帧中,确定当前节点的M个领域节点,M个领域节点中包括当前节点,M为正整数;S201-A11, in the current frame to be encoded, determine M domain nodes of the current node, the M domain nodes include the current node, and M is a positive integer;
S201-A12、针对M个领域节点中的第i个领域节点,确定第i个领域节点在第k个预测参考帧中的对应节点,i为小于或等于M的正整数;S201-A12, for the i-th domain node among the M domain nodes, determine the corresponding node of the i-th domain node in the k-th prediction reference frame, where i is a positive integer less than or equal to M;
S201-A13、基于M个领域节点在第k个预测参考帧中的对应节点,确定当前节点在第k个预测参考帧中的至少一个预测节点。S201-A13. Determine at least one prediction node of the current node in the kth prediction reference frame based on the corresponding nodes of the M domain nodes in the kth prediction reference frame.
编码端基于上述步骤,在当前待编码帧中确定出当前节点的M个领域节点后,确定出M个领域节点中每一个领域节点在第k个预测参考帧中的对应节点,进而基于M个领域节点在第k个预测参考帧中的对应节点,确定当前节点在第k个预测参考帧中的至少一个预测节点。Based on the above steps, the encoding end determines the M domain nodes of the current node in the current frame to be encoded, and then determines the corresponding node of each of the M domain nodes in the kth prediction reference frame, and then determines at least one prediction node of the current node in the kth prediction reference frame based on the corresponding nodes of the M domain nodes in the kth prediction reference frame.
本申请实施例对S201-A13的具体实现方式不做限制。The embodiment of the present application does not limit the specific implementation method of S201-A13.
在一种可能的实现方式中,从M个领域节点在第k个预测参考帧中的对应节点中,筛选出至少一个对应节点,作为当前节点在第k个预测参考帧中的至少一个预测节点。例如,从M个领域节点在第k个预测参考帧中的对应节点中,筛选出占位信息与当前节点的占位信息差异最小的至少一个对应节点,作为当前节点在第k个预测参考帧中的至少一个预测节点。其中,确定对应节点的占位信息与当前节点的占位信息的差异的方式可以参照上述占位信息的差异的确定过程,例如将对应节点的占位信息与当前节点的占位信息进行异或运算,将异或运算结果作为该对应节点的占位信息与当前节点的占位信息之间的差异。In a possible implementation, at least one corresponding node is selected from the corresponding nodes of the M domain nodes in the k-th prediction reference frame as at least one prediction node of the current node in the k-th prediction reference frame. For example, at least one corresponding node whose placeholder information has the smallest difference with the placeholder information of the current node is selected from the corresponding nodes of the M domain nodes in the k-th prediction reference frame as at least one prediction node of the current node in the k-th prediction reference frame. The method of determining the difference between the placeholder information of the corresponding node and the placeholder information of the current node can refer to the above-mentioned process of determining the difference in placeholder information, for example, performing an XOR operation on the placeholder information of the corresponding node and the placeholder information of the current node, and using the XOR operation result as the difference between the placeholder information of the corresponding node and the placeholder information of the current node.
在另一种可能的实现方式中,编码端将M个领域节点在第k个预测参考帧中的对应节点,确定为当前节点在第k个预测参考帧中的至少一个预测节点。例如,M个领域节点分别在第k个预测参考帧中具有一个对应节点,进而有M个对应节点,将这M个对应节点确定为当前节点在第k个预测参考帧中的预测节点,共有M个预测节点。In another possible implementation, the encoder determines the corresponding nodes of the M domain nodes in the kth prediction reference frame as at least one prediction node of the current node in the kth prediction reference frame. For example, the M domain nodes each have a corresponding node in the kth prediction reference frame, and thus there are M corresponding nodes. The M corresponding nodes are determined as the prediction nodes of the current node in the kth prediction reference frame, and there are M prediction nodes in total.
编码端确定出当前节点在K个预测参考帧中每一个预测参考帧中的至少一个预测节点后,执行上述S201-B的步骤,即基于当前节点在K个预测参考帧中的至少一个预测节点,确定当前节点的N个预测节点。After the encoder determines at least one prediction node of the current node in each of the K prediction reference frames, it executes the above step S201-B, that is, determines N prediction nodes of the current node based on at least one prediction node of the current node in the K prediction reference frames.
在该方式二中,编码端在当前待编码帧中确定出当前节点的M个领域节点后,在第k个预测参考帧中,确定出这M个领域节点中每一个领域节点的对应节点,进而基于M个领域节点中每一个领域节点的对应节点,确定当前节点在该第k个预测参考帧中的至少一个预测点。In the second method, after the encoding end determines the M domain nodes of the current node in the current frame to be encoded, it determines the corresponding node of each of the M domain nodes in the kth prediction reference frame, and then determines at least one prediction point of the current node in the kth prediction reference frame based on the corresponding node of each of the M domain nodes.
方式三,上述S201-A1中确定当前节点在第k个预测参考帧中的至少一个预测节点,包括如下S201-B11至S201-B13的步骤:Mode 3, in the above S201-A1, determining at least one prediction node of the current node in the kth prediction reference frame includes the following steps S201-B11 to S201-B13:
S201-B11、确定当前节点在第k个预测参考帧中的对应节点;S201-B11, determining the corresponding node of the current node in the kth prediction reference frame;
S201-B12、确定对应节点的至少一个领域节点;S201-B12, determining at least one domain node of the corresponding node;
S201-B13、将至少一个领域节点,确定为当前节点在第k个预测参考帧中的至少一个预测节点。S201-B13. Determine at least one domain node as at least one prediction node of the current node in the kth prediction reference frame.
在该方式3中,针对K个预测参考帧中的每一个预测参考帧,编码端首先在各预测参考帧中确定该当前节点的对应节点。例如,确定当前节点在预测参考帧1中的对应节点1,确定当前节点在预测参考帧2中的对应节点2。接着,编码端确定各对应节点的至少一个领域节点。例如,在预测参考帧1中确定对应节点1的至少一个领域节点,以及在预测参考帧2中确定对应节点2的至少一个领域节点。这样可以将对应节点1在预测参考帧1中的至少一个领域节点,确定为当前节点在预测参考帧1中的至少一个预测节点,将对应节点2在预测参考帧2中的至少一个领域节点,确定为当前节点在预测参考帧2中的至少一个预测节点。In this method 3, for each of the K prediction reference frames, the encoder first determines the corresponding node of the current node in each prediction reference frame. For example, the corresponding node 1 of the current node in the prediction reference frame 1 is determined, and the corresponding node 2 of the current node in the prediction reference frame 2 is determined. Next, the encoder determines at least one domain node of each corresponding node. For example, at least one domain node of the corresponding node 1 is determined in the prediction reference frame 1, and at least one domain node of the corresponding node 2 is determined in the prediction reference frame 2. In this way, at least one domain node of the corresponding node 1 in the prediction reference frame 1 can be determined as at least one prediction node of the current node in the prediction reference frame 1, and at least one domain node of the corresponding node 2 in the prediction reference frame 2 can be determined as at least one prediction node of the current node in the prediction reference frame 2.
方式二的S201-A12中确定第i个领域节点在第k个预测参考帧中的对应节点,与上述方式三的S201-B11中确定当前节点在第k个预测参考帧中的对应节点的过程基本相同。为了便于描述,将上述第i个领域节点和当前节点记为第i个节点,下面对确定第i个节点在第k个预测参考帧中的对应节点的具体过程进行介绍。The process of determining the corresponding node of the i-th domain node in the k-th prediction reference frame in S201-A12 of the second method is basically the same as the process of determining the corresponding node of the current node in the k-th prediction reference frame in S201-B11 of the above-mentioned third method. For the convenience of description, the above-mentioned i-th domain node and the current node are recorded as the i-th node, and the specific process of determining the corresponding node of the i-th node in the k-th prediction reference frame is introduced below.
编码端确定第i个节点在第k个预测参考帧中的对应节点至少包括如下几种方式:The encoder determines the corresponding node of the i-th node in the k-th prediction reference frame in at least the following ways:
方式1,将第k个预测参考帧中与第i个节点的划分深度相同的一个节点,确定为第i个节点的对应节点。Method 1: Determine a node in the k-th prediction reference frame that has the same division depth as the i-th node as the corresponding node of the i-th node.
方式2、上述S201-A12和S201-B11包括如下步骤:Mode 2: The above S201-A12 and S201-B11 include the following steps:
S201-A121、在当前待编码帧中,确定第i个节点的父节点,作为第i个父节点;S201-A121, in the current frame to be encoded, determine the parent node of the i-th node as the i-th parent node;
S201-A122、确定第i个父节点在第k个预测参考帧中的匹配节点,作为第i个匹配节点;S201-A122, determining a matching node of the i-th parent node in the k-th prediction reference frame as the i-th matching node;
S201-A123、将i个匹配节点的子节点中的一个子节点,确定为第i个节点在第k个预测参考帧中的对应节点。S201-A123. Determine one of the child nodes of the i matching nodes as the corresponding node of the i-th node in the k-th prediction reference frame.
在该方式2中,对于第i个节点,编码端在当前待编码帧中,确定该第i个节点的父节点,进而在第k个预测参考帧中,确定第i个预测领域节点的父节点的匹配节点。为了便于描述,将第i个节点的父节点记为第i个父节点,将第i个节点的父节点在第k个预测参考帧中的匹配节点,确定为第i个匹配节点。接着,将第i个匹配节点的子节点的一个子节点,确定为第i个节点在第k个预测参考帧中的对应节点,实现对第i个节点在第k个预测参考帧中的对应节点的准确确定。In the method 2, for the i-th node, the encoding end determines the parent node of the i-th node in the current frame to be encoded, and then determines the matching node of the parent node of the i-th prediction domain node in the k-th prediction reference frame. For the convenience of description, the parent node of the i-th node is recorded as the i-th parent node, and the matching node of the parent node of the i-th node in the k-th prediction reference frame is determined as the i-th matching node. Then, a child node of the child node of the i-th matching node is determined as the corresponding node of the i-th node in the k-th prediction reference frame, thereby accurately determining the corresponding node of the i-th node in the k-th prediction reference frame.
下面对上述S201-A122中确定第i个父节点在第k个预测参考帧中的匹配节点的具体过程进行介绍。The specific process of determining the matching node of the i-th parent node in the k-th prediction reference frame in the above S201-A122 is introduced below.
本申请实施例对编码端确定第i个父节点在第k个预测参考帧中的匹配节点的具体方式不做限制。The embodiment of the present application does not limit the specific manner in which the encoder determines the matching node of the i-th parent node in the k-th prediction reference frame.
在一些实施例中,确定第i个父节点在当前待编码帧中的划分深度,例如第i个父节点在当前待编码帧的八叉树的第2层。这样,编码端可以将第k个预测参考帧中与第i个父节点的划分深度相同的各节点中的一个节点,确定为第i个父节点在第k个预测参考帧中的匹配节点。例如将第k个预测参考帧中处于第2层的各节点中的一个节点,确定为第i个父节点在第k个预测参考帧中的匹配节点。In some embodiments, the division depth of the i-th parent node in the current frame to be encoded is determined, for example, the i-th parent node is in the second layer of the octree of the current frame to be encoded. In this way, the encoder can determine one of the nodes in the k-th prediction reference frame that have the same division depth as the i-th parent node as the matching node of the i-th parent node in the k-th prediction reference frame. For example, one of the nodes in the second layer in the k-th prediction reference frame is determined as the matching node of the i-th parent node in the k-th prediction reference frame.
在一些实施例中,编码端基于第i个父节点的占位信息,确定第i个父节点在第k个预测参考帧中的匹配节点。具体的,由于当前待编码帧中的第i个父节点的占位信息已编码,且第k个预测参考帧中的各节点的占位信息也已编码。这样,编码端可以基于第i个父节点的占位信息,在第k个预测参考帧中查找该第i个父节点的匹配节点。In some embodiments, the encoder determines the matching node of the i-th parent node in the k-th prediction reference frame based on the placeholder information of the i-th parent node. Specifically, since the placeholder information of the i-th parent node in the current frame to be encoded has been encoded, and the placeholder information of each node in the k-th prediction reference frame has also been encoded, the encoder can search for the matching node of the i-th parent node in the k-th prediction reference frame based on the placeholder information of the i-th parent node.
基于上述步骤,编码端可以确定出第i个父节点在第k个预测参考帧中的匹配节点。为了便于描述,将该匹配节点记为第i个匹配节点。Based on the above steps, the encoder can determine the matching node of the i-th parent node in the k-th prediction reference frame. For ease of description, the matching node is recorded as the i-th matching node.
接着,编码端将该第i个匹配节点的子节点中的一个子节点,确定为第i个领域节点在第k个预测参考帧中的对应节点。Next, the encoder determines one of the child nodes of the i-th matching node as the corresponding node of the i-th domain node in the k-th prediction reference frame.
例如,编码端将该第i个匹配节点所包括的子节点中的一个默认子节点,确定为第i个节点在第k个预测参考帧中的对应节点。假设,将第i个匹配节点的第1个子节点,确定为第i个节点在第k个预测参考帧中的对应节点。For example, the encoder determines a default child node among the child nodes included in the i-th matching node as the corresponding node of the i-th node in the k-th prediction reference frame. Assume that the first child node of the i-th matching node is determined as the corresponding node of the i-th node in the k-th prediction reference frame.
再例如,编码端确定第i个节点在父节点所包括的子节点中的第一序号;将第i个匹配节点的子节点中序号为第一序号的子节点,确定为第i个节点在第k个预测参考帧中的对应节点。示例性的,如图12所示,第i个节点为第i个父节点的第2个子节点,此时第一序号为2。这样可以将第i个匹配节点的第2个子节点,确定为第i个节点的对应节点。For another example, the encoding end determines the first sequence number of the i-th node in the child nodes included in the parent node; the child node with the first sequence number in the child nodes of the i-th matching node is determined as the corresponding node of the i-th node in the k-th prediction reference frame. Exemplarily, as shown in FIG12, the i-th node is the second child node of the i-th parent node, and the first sequence number is 2. In this way, the second child node of the i-th matching node can be determined as the corresponding node of the i-th node.
上面对确定M个领域节点中第i个领域节点,在第k个预测参考帧中的对应节点,以及确定当前节点在第k个预测参考帧中的对应节点的确定过程进行介绍。这样,编码端可以采用方式二或方式三的方式,确定出当前节点在预测参考帧中的N个预测节点。The above describes the process of determining the corresponding node of the i-th domain node in the M domain nodes in the k-th prediction reference frame, and determining the corresponding node of the current node in the k-th prediction reference frame. In this way, the encoder can use the second or third method to determine the N prediction nodes of the current node in the prediction reference frame.
下面对S201-A中确定第一信息的过程进行介绍。The process of determining the first information in S201-A is introduced below.
由上述可知,该第一信息用于指示当前节点是否允许启动帧间编码方式。示例性的,第一信息可以为predFlag,若predFlag为1时,表示当前节点允许启动帧间编码方式,若predFlag为0时,表示当前节点不允许启动帧间编码方式。As can be seen from the above, the first information is used to indicate whether the current node is allowed to start the inter-frame coding mode. Exemplarily, the first information can be predFlag, if predFlag is 1, it means that the current node is allowed to start the inter-frame coding mode, if predFlag is 0, it means that the current node is not allowed to start the inter-frame coding mode.
编码端基于上述步骤,确定出当前节点对应的帧间编码信息后,执行如下S202的步骤。After the encoder determines the inter-frame coding information corresponding to the current node based on the above steps, it executes the following step S202.
S202、基于当前节点对应的帧间编码信息,对当前节点所包括的点数进行预测编码。S202: Based on the inter-frame coding information corresponding to the current node, predictive coding is performed on the points included in the current node.
由于点云的相邻帧之间的相关性,本申请实施例基于点云的相邻帧之间的相关性,在对当前节点的点数进行预测编码时,参照了帧间的相关信息。具体是基于当前节点对应的帧间编码信息,对当前节点的点数进行预测编码,增加了编码点数时可选择的上下文条件,减小条件熵,进行减少编码点数所使用的码字,从而提升点云的编编码效率和编编码性能。Due to the correlation between adjacent frames of the point cloud, the embodiment of the present application refers to the relevant information between frames when predicting the number of points of the current node based on the correlation between adjacent frames of the point cloud. Specifically, based on the inter-frame coding information corresponding to the current node, the number of points of the current node is predictively coded, the context conditions that can be selected when coding the number of points are increased, the conditional entropy is reduced, and the codewords used for coding the number of points are reduced, thereby improving the coding efficiency and coding performance of the point cloud.
在一些实施例中,当前节点对应的帧间编码信息包括当前节点的N个预测节点的几何编码信息和当前节点对应的第一信息中的至少一个,此时,上述S202包括如下S202-1的步骤:In some embodiments, the inter-frame coding information corresponding to the current node includes geometric coding information of N prediction nodes of the current node and at least one of the first information corresponding to the current node. In this case, the above S202 includes the following step S202-1:
S202-1、基于N个预测节点的几何编码信息和/或第一信息,对当前节点的点数进行预测编码。S202-1. Based on the geometric coding information of N predicted nodes and/or the first information, predictively encode the number of points of the current node.
在一些实施例中,预测节点的几何编码信息包括预测节点的直接编码信息,其中,预测节点的直接编码信息可以理解为预测节点的直接编码相关的信息。例如,预测节点的直接编码信息包括预测节点的点数,预测节点所包括的各点的几何信息、以及该预测节点是否采用直接编码方式进行编码的信息等。In some embodiments, the geometric coding information of the prediction node includes direct coding information of the prediction node, wherein the direct coding information of the prediction node can be understood as information related to direct coding of the prediction node. For example, the direct coding information of the prediction node includes the number of points of the prediction node, geometric information of each point included in the prediction node, and information on whether the prediction node is encoded using a direct coding method.
基于此,上述S202-1包括如下S202-11的步骤:Based on this, the above S202-1 includes the following step S202-11:
S202-11、基于N个预测节点的直接编码信息和/或第一信息,对当前节点的点数进行预测编码。S202-11. Based on the direct coding information of the N prediction nodes and/or the first information, predictive coding is performed on the number of points of the current node.
在该实施例中,编码端将当前节点的N个预测节点的直接编码信息和/或第一信息作为点数的上下文,对当前节点的点数进行预测编码,进而增加了点数对应的上下文模型,保证了对点数的准确编码,进而提升点云的编码效率。In this embodiment, the encoding end uses the direct encoding information and/or the first information of the N predicted nodes of the current node as the context of the point number, performs predictive encoding on the point number of the current node, and thereby adds a context model corresponding to the point number, ensuring accurate encoding of the point number, thereby improving the encoding efficiency of the point cloud.
在一种示例中,若当前点云采用GPCC方式进行编码时,当前节点所包括的点数小于或等于2时,采用直接编码方式对当前节点进行编码。In one example, if the current point cloud is encoded using the GPCC method, when the number of points included in the current node is less than or equal to 2, the current node is encoded using a direct encoding method.
情况1,当前节点含有的点数numPonts小于等于2,则编码过程如下:Case 1: The number of points numPonts in the current node is less than or equal to 2, then the encoding process is as follows:
1)、首先编码当前节点的numPonts是否大于1;1) First, encode whether the numPonts of the current node is greater than 1;
2)、如果当前节点只有一个点,则需要编码当前节点的第二个点不是重复点。2) If the current node has only one point, it is necessary to encode that the second point of the current node is not a duplicate point.
情况2,当前节点含有的点数numPonts大于2,则编码过程如下:Case 2: The number of points numPonts in the current node is greater than 2, then the encoding process is as follows:
1)、首先编码当前节点的numPonts是否大于1;1) First, encode whether the numPonts of the current node is greater than 1;
2)、其次编码当前节点的第二个点是一个重复点;2) Secondly, the second point of the current node is encoded as a repeated point;
3)、如果第二个点为重复点,则编码当前节点的第三个点是否为重复点;3) If the second point is a duplicate point, encode whether the third point of the current node is a duplicate point;
4)、如果第三个点为重复点,利用指数哥伦布编码方式编码剩余的重复点数目。4) If the third point is a repeated point, use the Exponential Golomb coding method to encode the remaining number of repeated points.
具体的,在编码时,使用第一标志表示当前节点所包括的点数是否大于1,使用第二标志表示当前节点的第二个点是否为重复点,使用第三标志表示当前节点的第三个点是否为重复点。Specifically, during encoding, the first flag is used to indicate whether the number of points included in the current node is greater than 1, the second flag is used to indicate whether the second point of the current node is a repeated point, and the third flag is used to indicate whether the third point of the current node is a repeated point.
需要说明的是,编码端对上述第一标志、第二标志和第三标志进行编码时所使用的上下文信息可以相同,也可以不相同,或者可以部分相同,部分不同,本申请实施例对此不做限制。示例性的,第一标志、第二标志和第三标志中的至少一个标志使用帧间信息(例如预测节点的直接解码信息)作为上下文信息进行预测编码。It should be noted that the context information used by the encoding end when encoding the first flag, the second flag, and the third flag may be the same, or different, or may be partially the same and partially different, and the embodiment of the present application does not limit this. Exemplarily, at least one of the first flag, the second flag, and the third flag uses inter-frame information (such as direct decoding information of the prediction node) as context information for predictive coding.
下面对编码端对第一标志、第二标志和第三标志的预测编码过程进行介绍。The following is an introduction to the predictive encoding process of the first flag, the second flag and the third flag by the encoding end.
在一些实施例中,上述预测节点的直接编码信息包括第一直接编码信息,此时,上述S202-11中基于N个预测节点的直接编码信息,对当前节点的点数进行预测编码包括如下S202-11-A的步骤:In some embodiments, the direct coding information of the prediction node includes first direct coding information. In this case, the predictive coding of the points of the current node based on the direct coding information of the N prediction nodes in S202-11 includes the following step S202-11-A:
S202-11-A、将N个预测节点的第一直接编码信息作为上下文信息,对当前节点对应的第一标志进行预测编码,第一标志用于表示当前节点所包括的点数是否大于1。S202-11-A. Use the first direct encoding information of the N prediction nodes as context information, and perform predictive encoding on the first flag corresponding to the current node, where the first flag is used to indicate whether the number of points included in the current node is greater than 1.
本申请实施例对第一直接编码信息的具体内容不做限制。在本申请实施例中,编码端将N个预测节点的第一直接编码信息作为上下文信息,对当前节点对应的第一标志进行预测编码,提高对第一标志的编码效率,进而提升点云的编码性能。The embodiment of the present application does not limit the specific content of the first direct coding information. In the embodiment of the present application, the encoding end uses the first direct coding information of the N predicted nodes as context information, performs predictive coding on the first flag corresponding to the current node, improves the coding efficiency of the first flag, and thus improves the coding performance of the point cloud.
在一些实施例中,上述S202-11-A包括如下步骤:In some embodiments, the above S202-11-A includes the following steps:
S202-11-A1、基于N个预测节点的第一直接编码信息,确定第一上下文模型的索引;S202-11-A1, determining an index of a first context model based on first direct encoding information of N prediction nodes;
S202-11-A2、基于第一上下文模型的索引,从第一标志对应的Q个上下文模型中,选出第一上下文模型,Q为大于1的正整数;S202-11-A2. Based on the index of the first context model, select the first context model from Q context models corresponding to the first flag, where Q is a positive integer greater than 1;
S202-11-A3、使用第一上下文模型,对当前节点对应的第一标志进行预测编码。S202-11-A3. Use the first context model to predictively encode the first flag corresponding to the current node.
下面对上述S202-11-A1中基于N个预测节点的第一直接编码信息,确定第一上下文模型的索引的具体过程进行介绍。The specific process of determining the index of the first context model based on the first direct encoding information of N prediction nodes in the above S202-11-A1 is introduced below.
方式一,上述S202-11-A1包括如下S202-11-A1-a1和S202-11-A1-a2的步骤:Method 1: the above S202-11-A1 includes the following steps S202-11-A1-a1 and S202-11-A1-a2:
S202-11-A1-a1、针对N个预测节点中的任一预测节点,基于预测节点的第一直接编码信息,确定预测节点对应的第一数值。S202-11-A1-a1. For any prediction node among the N prediction nodes, determine a first numerical value corresponding to the prediction node based on first direct encoding information of the prediction node.
在该方式中,针对N个预测节点中的每一个预测节点,基于该预测节点的第一直接编码信息,确定该预测节点对应的第一数值,最后基于N个预测节点对应的第一数值,确定第一上下文模型的索引。In this method, for each of the N prediction nodes, the first numerical value corresponding to the prediction node is determined based on the first direct encoding information of the prediction node, and finally the index of the first context model is determined based on the first numerical values corresponding to the N prediction nodes.
下面对确定预测节点对应的第一数值的过程进行介绍。The following describes the process of determining the first value corresponding to the prediction node.
在一些实施例中,预测节点的第一直接编码信息包括第二信息和预测节点所包括的点数中的至少一个,其中,第二信息用于指示预测节点是否满足直接编码方式进行编码的条件。In some embodiments, the first direct encoding information of the prediction node includes at least one of the second information and the number of points included in the prediction node, wherein the second information is used to indicate whether the prediction node meets the conditions for encoding in the direct encoding manner.
此时,编码端可以基于预测节点的第二信息和/或该预测节点所包括的点数,确定该预测节点对应的第一数值。At this time, the encoder may determine the first value corresponding to the prediction node based on the second information of the prediction node and/or the number of points included in the prediction node.
示例性的,编码端通过如下步骤,确定出预测节点对应的第一数值:Exemplarily, the encoder determines the first value corresponding to the prediction node through the following steps:
步骤11、基于第二信息,确定第一子值;Step 11: Determine a first sub-value based on the second information;
步骤12、基于预测节点所包括的点数,确定第二子值;Step 12: Determine a second sub-value based on the number of points included in the prediction node;
步骤13、基于第一子值和/或第二子值,确定预测节点对应的第一数值。Step 13: Determine a first numerical value corresponding to the prediction node based on the first sub-value and/or the second sub-value.
由于上述可知,第二信息用于指示该预测节点是否满直接编码方式进行编码的条件,进而可以确定出第一子值为二值化数值。例如,若第二信息指示预测节点满足直接编码方式进行编码的条件时,则确定第一子值的取值为第一值;若第二信息指示预测节点不满足直接编码方式进行编码的条件时,则确定第一子值的取值为第二值。As can be seen from the above, the second information is used to indicate whether the prediction node meets the conditions for encoding by direct encoding, and then the first sub-value can be determined to be a binary value. For example, if the second information indicates that the prediction node meets the conditions for encoding by direct encoding, the value of the first sub-value is determined to be the first value; if the second information indicates that the prediction node does not meet the conditions for encoding by direct encoding, the value of the first sub-value is determined to be the second value.
在另一种示例中,上述第二子值为二值化数值,此时,编码端预测节点所包括的点数,确定第二子值的方式可以是:若预测节点所包括的点数大于第一预设值,则确定第二子值的取值为第一值;若预测节点所包括的点数小于或等于第一预设值,则确定第二子值的取值为第二值。In another example, the above-mentioned second sub-value is a binary value. At this time, the number of points included in the prediction node of the encoding end can be used to determine the second sub-value: if the number of points included in the prediction node is greater than the first preset value, the value of the second sub-value is determined to be the first value; if the number of points included in the prediction node is less than or equal to the first preset value, the value of the second sub-value is determined to be the second value.
接着,编码端基于上述确定的第一子值和/或第二子值,确定该预测节点对应的第一数值。Next, the encoder determines the first value corresponding to the prediction node based on the first sub-value and/or the second sub-value determined above.
基于上述步骤,可以确定出当前节点的N个预测节点中,每一个预测节点对应的第一数值。Based on the above steps, the first value corresponding to each prediction node in the N prediction nodes of the current node may be determined.
S202-11-A1-a2、基于N个预测节点对应的第一数值,确定第一上下文模型的索引。S202-11-A1-a2. Determine the index of the first context model based on the first numerical values corresponding to the N prediction nodes.
编码端基于上述步骤,确定出N个预测节点对应的第一数值后,基于N个预测节点对应的第一数值,确定第一上下文模型的索引。After the encoder determines the first values corresponding to the N prediction nodes based on the above steps, it determines the index of the first context model based on the first values corresponding to the N prediction nodes.
其中,基于N个预测节点对应的第一数值,确定第一上下文模型的索引至少包括如下实现方式:Wherein, determining the index of the first context model based on the first values corresponding to the N prediction nodes includes at least the following implementation methods:
方式1,将N个预测节点对应的第一数值之和的平均值,确定为第一上下文模型的索引。Method 1: Determine the average value of the sum of the first numerical values corresponding to the N prediction nodes as the index of the first context model.
方式2,S202-11-A1-a2包括如下S202-11-A1-a2-1至S202-11-A1-a2-3的步骤: Mode 2, S202-11-A1-a2 includes the following steps S202-11-A1-a2-1 to S202-11-A1-a2-3:
S202-11-A1-a2-1、确定N个预测节点分别对应的第一权重;S202-11-A1-a2-1. Determine first weights corresponding to the N prediction nodes respectively;
S202-11-A1-a2-2、基于第一权重,对N个预测节点对应的第一数值进行加权处理,得到第一加权预测值;S202-11-A1-a2-2. Based on the first weight, weighted processing is performed on the first values corresponding to the N prediction nodes to obtain a first weighted prediction value;
S202-11-A1-a2-3、基于第一加权预测值,确定第一上下文模型的索引。S202-11-A1-a2-3. Determine the index of the first context model based on the first weighted prediction value.
在该方式2中,若当前节点包括多个预测节点,即N个预测节点时,在基于N个预测节点对应的第一数值确定第一上下文模型的索引时,可以为N个预测节点中的每一个预测节点确定一个权重,即第一权重,这样可以基于各预测节点的第一权重,对各预测节点对应的第一数值进行加权处理,进而根据最终的加权结果,确定第一上下文模型的索引,从而提高了基于N个预测节点的几何编码信息,确定第一上下文模型的索引的准确性。In method 2, if the current node includes multiple prediction nodes, i.e., N prediction nodes, when determining the index of the first context model based on the first numerical values corresponding to the N prediction nodes, a weight, i.e., the first weight, can be determined for each of the N prediction nodes. In this way, the first numerical values corresponding to each prediction node can be weighted based on the first weight of each prediction node, and then the index of the first context model can be determined based on the final weighted result, thereby improving the accuracy of determining the index of the first context model based on the geometric coding information of the N prediction nodes.
本申请实施例对确定N个预测节点分别对应的第一权重不做限制。The embodiment of the present application does not limit the determination of the first weights corresponding to the N prediction nodes.
在一些实施例中,对于N个预测节点中的每一个预测节点,基于该预测节点对应的领域节点与当前节点之间的距 离,确定该预测节点对应的第一权重。例如,领域节点与当前节点之间的距离越小,则该领域节点对应的预测节点与当前节点的帧间相关性越强,进而该预测节点的第一权重越大。In some embodiments, for each of the N prediction nodes, a first weight corresponding to the prediction node is determined based on the distance between the domain node corresponding to the prediction node and the current node. For example, the smaller the distance between the domain node and the current node, the stronger the inter-frame correlation between the prediction node corresponding to the domain node and the current node, and thus the greater the first weight of the prediction node.
本申请实施例对基于第一权重,对N个预测节点对应的第一数值进行加权处理,得到第一加权预测值的具体方式不做限制。The embodiment of the present application does not limit the specific method of obtaining the first weighted prediction value by weighting the first numerical values corresponding to N prediction nodes based on the first weight.
在一种示例中,基于第一权重,对N个预测节点对应的第一数值进行加权平均,得到第一加权预测值。In one example, based on the first weight, a weighted average is performed on the first numerical values corresponding to the N prediction nodes to obtain a first weighted prediction value.
在另一种示例中,基于第一权重,对N个预测节点对应的第一数值进行加权求和,得到第一加权预测值。In another example, based on the first weight, a weighted sum is performed on the first numerical values corresponding to the N prediction nodes to obtain a first weighted prediction value.
基于方式步骤,确定出第一加权预测值后,基于该第一加权预测值,确定第一上下文模型的索引,即上述S202-11-A1-a2-3至少包括如下几种示例:After determining the first weighted prediction value based on the method steps, the index of the first context model is determined based on the first weighted prediction value, that is, the above S202-11-A1-a2-3 includes at least the following examples:
示例1,将第一加权预测值,确定为第一上下文模型的索引。Example 1: Determine the first weighted prediction value as the index of the first context model.
示例2,确定第一加权预测值所在加权预测值范围,将该范围对应的索引,确定为第一上下文模型的索引,也就是说一个加权预测值范围对应一个上下文模型。Example 2, determines the weighted prediction value range where the first weighted prediction value is located, and determines the index corresponding to the range as the index of the first context model, that is, one weighted prediction value range corresponds to one context model.
在一些实施例中,编码端还可以采用如下方式二,确定第一上下文模型的索引。In some embodiments, the encoding end may also adopt the following method 2 to determine the index of the first context model.
方式二、若K大于1时,确定K个预测参考帧中每一个预测参考帧对应的第二加权预测值,进而基于K个预测参考帧分别对应的第二加权预测值,确定第一上下文模型的索引。此时,上述S202-11-A1包括如下S202-11-A1-b1至S202-11-A1-a4的步骤:Method 2: If K is greater than 1, determine the second weighted prediction value corresponding to each of the K prediction reference frames, and then determine the index of the first context model based on the second weighted prediction values corresponding to the K prediction reference frames. At this time, the above S202-11-A1 includes the following steps S202-11-A1-b1 to S202-11-A1-a4:
S202-11-A1-b1、针对K个预测参考帧中的第j个预测参考帧,基于当前节点在第j个预测参考帧中预测节点的第一直接编码信息,确定第j个预测参考帧中的预测节点对应的第一数值,j为小于或等于K的正整数;S202-11-A1-b1, for the j-th prediction reference frame among the K prediction reference frames, determine a first value corresponding to the prediction node in the j-th prediction reference frame based on the first direct encoding information of the prediction node of the current node in the j-th prediction reference frame, where j is a positive integer less than or equal to K;
S202-11-A1-b2、确定预测节点对应的第一权重,并基于第一权重对第j个预测参考帧中的预测节点对应的第一数值进行加权处理,得到第j个预测参考帧对应的第二加权预测值;S202-11-A1-b2, determining a first weight corresponding to the prediction node, and performing weighted processing on the first value corresponding to the prediction node in the j-th prediction reference frame based on the first weight, to obtain a second weighted prediction value corresponding to the j-th prediction reference frame;
S202-11-A1-b3、基于K个预测参考帧对应的第二加权预测值,确定第一上下文模型的索引。S202-11-A1-b3. Determine the index of the first context model based on the second weighted prediction values corresponding to the K prediction reference frames.
在该方式二中,在确定第一上下文模型的索引时,对这K个预测参考帧中的每一个预测参考帧作为单独的上下文信息分别进行考虑。具体是,确定K个预测参考帧中每一个预测参考帧所包括的预测节点的第一直接编码信息,确定每一个预测参考帧对应的第二加权预测值,进而基于每一个预测参考帧对应的第二加权预测值,确定第一上下文模型的索引,实现第一上下文模型的准确选择,进而提升点云的编码效率。In the second method, when determining the index of the first context model, each of the K prediction reference frames is considered as separate context information. Specifically, the first direct coding information of the prediction node included in each of the K prediction reference frames is determined, and the second weighted prediction value corresponding to each prediction reference frame is determined. Then, based on the second weighted prediction value corresponding to each prediction reference frame, the index of the first context model is determined to achieve accurate selection of the first context model, thereby improving the coding efficiency of the point cloud.
本申请实施例中,编码端确定K个预测参考帧中每一个预测参考帧对应的第二加权预测值的具体方式相同,为了便于描述,在此以K个预测参考帧中的第j个预测参考帧为例进行说明。In the embodiment of the present application, the specific method in which the encoding end determines the second weighted prediction value corresponding to each of the K prediction reference frames is the same. For the sake of ease of description, the jth prediction reference frame among the K prediction reference frames is taken as an example for illustration.
在本申请实施例中,当前节点在第j个预测参考帧中包括至少一个预测节点,这样基于该第j个预测参考帧中的这至少一个预测节点的第一直接编码信息,确定这至少一个预测节点的第一数值。In an embodiment of the present application, the current node includes at least one prediction node in the jth prediction reference frame, so that the first value of the at least one prediction node is determined based on the first direct encoding information of the at least one prediction node in the jth prediction reference frame.
编码端确定出第j个预测参考帧所包括的至少一个预测节点的第一数值后,确定这至少一个预测节点分别对应的第一权重,并基于第一权重对这至少一个预测节点对应的第一数值进行加权处理,得到第j个预测参考帧对应的第二加权预测值。After the encoding end determines the first value of at least one prediction node included in the j-th prediction reference frame, it determines the first weight corresponding to the at least one prediction node, and weightedly processes the first value corresponding to the at least one prediction node based on the first weight to obtain a second weighted prediction value corresponding to the j-th prediction reference frame.
在一种示例中,基于第一权重,对第j个预测参考帧中的预测节点对应的第一数值进行加权平均,得到第j个预测参考帧对应的第二加权预测值。In one example, based on the first weight, a weighted average is performed on the first values corresponding to the prediction nodes in the j-th prediction reference frame to obtain a second weighted prediction value corresponding to the j-th prediction reference frame.
在另一种示例中,基于第一权重,对第j个预测参考帧中的预测节点对应的第一数值进行加权求和,得到第j个预测参考帧对应的第二加权预测值。In another example, based on the first weight, a weighted sum is performed on the first numerical values corresponding to the prediction nodes in the j-th prediction reference frame to obtain a second weighted prediction value corresponding to the j-th prediction reference frame.
编码端确定出K个预测参考帧中每一个预测参考帧对应的第二加权预测值后,执行上述S202-11-A1-b3的步骤。After the encoding end determines the second weighted prediction value corresponding to each of the K prediction reference frames, it executes the above step S202-11-A1-b3.
本申请实施对基于K个预测参考帧对应的第二加权预测值,确定第一上下文模型的索引的具体方式不做限制。The present application does not limit the specific method of determining the index of the first context model based on the second weighted prediction value corresponding to K prediction reference frames.
在一些实施例中,编码端将K个预测参考帧对应的第二加权预测值的平均值,确定为第一上下文模型的索引。In some embodiments, the encoding end determines the average value of the second weighted prediction values corresponding to the K prediction reference frames as the index of the first context model.
在一些实施例中,编码端确定K个预测参考帧对应的第二权重,并基于第二权重对K个预测参考帧对应的第二加权预测值进行加权处理,得到第一上下文模型的索引。In some embodiments, the encoding end determines second weights corresponding to K prediction reference frames, and performs weighted processing on second weighted prediction values corresponding to the K prediction reference frames based on the second weights to obtain an index of the first context model.
在该实施例中,编码端首先确定K个预测参考帧中每一个预测参考帧对应的第二权重。本申请实施例对确定K个预测参考帧中每一个预测参考帧对应的第二权重不做限制。In this embodiment, the encoding end first determines the second weight corresponding to each of the K prediction reference frames. The embodiment of the present application does not limit the determination of the second weight corresponding to each of the K prediction reference frames.
在一些实施例中,基于预测参考帧与当前待编码帧的时间差距,确定预测参考帧对应的第二权重。在本申请实施例中,每一张点云包括时间信息,该时间信息可以为点云采集设备采集该帧点云时的时间。基于此,若预测参考帧与当前待编码帧的时间差距越小,则该预测参考帧与当前待编码帧的帧间相关性越强,进而该预测参考帧对应的第二权重越大。例如,可以将预测参考帧与当前待编码帧的时间差距的倒数确定为该预测参考帧对应的第二权重。In some embodiments, based on the time difference between the predicted reference frame and the current frame to be encoded, the second weight corresponding to the predicted reference frame is determined. In an embodiment of the present application, each point cloud includes time information, and the time information may be the time when the point cloud acquisition device acquires the point cloud of the frame. Based on this, if the time difference between the predicted reference frame and the current frame to be encoded is smaller, the inter-frame correlation between the predicted reference frame and the current frame to be encoded is stronger, and thus the second weight corresponding to the predicted reference frame is larger. For example, the inverse of the time difference between the predicted reference frame and the current frame to be encoded can be determined as the second weight corresponding to the predicted reference frame.
确定出K个预测参考帧中每一个预测参考帧对应的第二权重后,基于第二权重对K个预测参考帧分别对应的第二加权预测值进行加权处理,得到第一上下文模型的索引。After determining the second weight corresponding to each of the K prediction reference frames, the second weighted prediction values corresponding to the K prediction reference frames are weighted based on the second weight to obtain the index of the first context model.
在一种示例中,基于第二权重,对K个预测参考帧分别对应的第二加权预测值进行加权平均,得到第一上下文模型的索引。In one example, based on the second weight, the second weighted prediction values corresponding to the K prediction reference frames are weighted averaged to obtain the index of the first context model.
在另一种示例中,基于第二权重,对K个预测参考帧分别对应的第二加权预测值进行加权求和,得到第一上下文模型的索引。In another example, based on the second weight, the second weighted prediction values corresponding to the K prediction reference frames are weighted summed to obtain the index of the first context model.
编码端基于上述步骤,确定出第一上下模型的索引后,从第一标志对应的Q个上下文模型中,选出第一上下文模型。接着,使用该第一上下文模型对当前节点对应的第一标志进行编码。After determining the index of the first context model based on the above steps, the encoder selects the first context model from the Q context models corresponding to the first flag, and then uses the first context model to encode the first flag corresponding to the current node.
下面对编码端对第二标志的预测编码过程进行介绍。The following is an introduction to the predictive encoding process of the second flag by the encoding end.
步骤A、将N个预测节点的第二直接编码信息作为上下文信息,对当前节点对应的第二标志进行预测编码,第二标志用于表示当前节点的第二个点是否为重复点。Step A: Use the second direct coding information of the N predicted nodes as context information to perform predictive coding on the second flag corresponding to the current node, where the second flag is used to indicate whether the second point of the current node is a repeated point.
在该实施例中,若当前节点包括一个点,或者当前节点包括多个重复点时,则编码端需要编码第二标志,以指示 当前块的第二个点是否为重复点。In this embodiment, if the current node includes one point, or the current node includes multiple repeated points, the encoding end needs to encode a second flag to indicate whether the second point of the current block is a repeated point.
在本申请实施例中,使用N个预测节点的第二直接编码信息作为上下文信息,对当前节点对应的第二标志进行预测编码,实现对第二标志的高效编码。In an embodiment of the present application, the second direct encoding information of the N prediction nodes is used as context information to perform predictive encoding on the second flag corresponding to the current node, thereby achieving efficient encoding of the second flag.
在一些实施例中,上述步骤A包括如下步骤:In some embodiments, the above step A includes the following steps:
步骤A1、基于N个预测节点的第二直接编码信息,确定第二上下文模型的索引;Step A1: determining an index of a second context model based on second direct encoding information of N prediction nodes;
步骤A2、基于第二上下文模型的索引,从第二标志对应的P个上下文模型中,选出第二上下文模型,P为大于1的正整数;Step A2: based on the index of the second context model, select the second context model from P context models corresponding to the second flag, where P is a positive integer greater than 1;
步骤A3、使用第二上下文模型,对当前节点对应的第二标志进行预测编码。Step A3: Use the second context model to perform predictive coding on the second flag corresponding to the current node.
在本申请实施例中,第二标志对应多个上下文模型,例如P个上下文模型,本申请实施例对第二标志所对应的上下文模型的具体个数不做限制,只要保证P大于1即可。也就是说,在本申请实施例中,至少从2个上下文模型中选择一个最优的上下文模型,对当前节点的第二标志进行编码,以提高对第二标志的编码效率。In the embodiment of the present application, the second flag corresponds to multiple context models, for example, P context models. The embodiment of the present application does not limit the specific number of context models corresponding to the second flag, as long as P is greater than 1. That is to say, in the embodiment of the present application, at least one optimal context model is selected from two context models, and the second flag of the current node is encoded to improve the encoding efficiency of the second flag.
下面对上述步骤A1中基于N个预测节点的第二直接编码信息,确定第二上下文模型的索引的具体过程进行介绍。The specific process of determining the index of the second context model based on the second direct encoding information of the N prediction nodes in the above step A1 is introduced below.
方式一,上述步骤A1包括如下步骤A1-11和步骤A1-12的步骤:Method 1: the above step A1 includes the following steps A1-11 and A1-12:
步骤A1-11、针对N个预测节点中的任一预测节点,基于预测节点的第二直接编码信息,确定预测节点对应的第二数值。Step A1-11: for any prediction node among the N prediction nodes, determine a second value corresponding to the prediction node based on the second direct encoding information of the prediction node.
在该方式中,针对N个预测节点中的每一个预测节点,基于该预测节点的第二直接编码信息,确定该预测节点对应的第二数值,最后基于N个预测节点对应的第二数值,确定第二上下文模型的索引。In this method, for each of the N prediction nodes, the second numerical value corresponding to the prediction node is determined based on the second direct encoding information of the prediction node, and finally the index of the second context model is determined based on the second numerical values corresponding to the N prediction nodes.
下面对确定预测节点对应的第二数值的过程进行介绍。The following describes the process of determining the second value corresponding to the prediction node.
在一些实施例中,预测节点的第二直接编码信息包括第二信息和第三信息,其中第二信息表示预测节点是否满足直接编码方式进行编码的条件,第三信息表示预测节点的第二个点是否为重复点。In some embodiments, the second direct encoding information of the prediction node includes second information and third information, wherein the second information indicates whether the prediction node meets the conditions for encoding in a direct encoding manner, and the third information indicates whether the second point of the prediction node is a repeated point.
此时,编码端可以基于预测节点的第二信息和/或第三信息,确定该预测节点对应的第二数值。At this time, the encoding end may determine the second value corresponding to the prediction node based on the second information and/or the third information of the prediction node.
示例性的,编码端通过如下步骤,确定出预测节点对应的第二数值:Exemplarily, the encoder determines the second value corresponding to the prediction node through the following steps:
步骤21、基于第二信息,确定第一子值;Step 21: Determine a first sub-value based on the second information;
步骤22、基于第三信息,确定第三子值;Step 22: Determine a third sub-value based on the third information;
步骤23、基于第一子值和/或第三子值,确定预测节点对应的第二数值。Step 23: Determine a second numerical value corresponding to the prediction node based on the first sub-value and/or the third sub-value.
基于第二信息,确定第一子值的过程可以参照上述步骤11的描述,在此不再赘述。The process of determining the first sub-value based on the second information can refer to the description of step 11 above, which will not be repeated here.
其中,基于第三信息,确定第三子值的方式可以是:若第三信息表示预测节点的第二个点是重复点,则确定第三子值的取值为第一值;若第三信息表示预测节点的第二个点不是重复点,则确定第三子值的取值为第二值。Among them, based on the third information, the method of determining the third sub-value can be: if the third information indicates that the second point of the predicted node is a repeated point, then the value of the third sub-value is determined to be the first value; if the third information indicates that the second point of the predicted node is not a repeated point, then the value of the third sub-value is determined to be the second value.
接着,编码端基于上述确定的第一子值和/或第三子值,确定该预测节点对应的第二数值。Next, the encoder determines the second value corresponding to the prediction node based on the first sub-value and/or the third sub-value determined above.
基于上述步骤,可以确定出当前节点的N个预测节点中,每一个预测节点对应的第二数值。接着,执行如下步骤A1-12:Based on the above steps, the second value corresponding to each prediction node in the N prediction nodes of the current node can be determined. Then, the following steps A1-12 are performed:
步骤A1-12、基于N个预测节点对应的第二数值,确定第二上下文模型的索引。Step A1-12: determine the index of the second context model based on the second numerical values corresponding to the N prediction nodes.
其中,基于N个预测节点对应的第二数值,确定第二上下文模型的索引至少包括如下实现方式:Wherein, determining the index of the second context model based on the second values corresponding to the N prediction nodes includes at least the following implementation methods:
方式1,将N个预测节点对应的第二数值之和的平均值,确定为第二上下文模型的索引。Method 1: Determine the average value of the sum of the second numerical values corresponding to the N prediction nodes as the index of the second context model.
方式2,步骤A1-12包括如下步骤A1-121至步骤A1-123的步骤: Mode 2, step A1-12 includes the following steps A1-121 to A1-123:
步骤A1-121、确定预测节点对应的第一权重;Step A1-121, determining a first weight corresponding to the prediction node;
步骤A1-122、基于第一权重,对N个预测节点对应的第二数值进行加权处理,得到第三加权预测值;Step A1-122: Based on the first weight, weighted processing is performed on the second values corresponding to the N prediction nodes to obtain a third weighted prediction value;
步骤A1-123、基于第三加权预测值,确定第二上下文模型的索引。Step A1-123, determine the index of the second context model based on the third weighted prediction value.
本申请实施例对基于第一权重,对N个预测节点对应的第二数值进行加权处理,得到第二加权预测值的具体方式不做限制。The embodiment of the present application does not limit the specific method of obtaining the second weighted prediction value by weighting the second numerical values corresponding to the N prediction nodes based on the first weight.
在一种示例中,基于第一权重,对N个预测节点对应的第二数值进行加权平均,得到第三加权预测值。In one example, based on the first weight, a weighted average is performed on the second values corresponding to the N prediction nodes to obtain a third weighted prediction value.
在另一种示例中,基于第一权重,对N个预测节点对应的第二数值进行加权求和,得到第三加权预测值。In another example, based on the first weight, a weighted sum is performed on the second values corresponding to the N prediction nodes to obtain a third weighted prediction value.
基于方式步骤,确定出第三加权预测值后,基于该第三加权预测值,确定第二上下文模型的索引,即上述步骤A1-123至少包括如下几种示例:After the third weighted prediction value is determined based on the method steps, the index of the second context model is determined based on the third weighted prediction value, that is, the above steps A1-123 include at least the following examples:
示例1,将第三加权预测值,确定为第二上下文模型的索引。Example 1: Determine the third weighted prediction value as the index of the second context model.
示例2,确定第三加权预测值所在加权预测值范围,将该范围对应的索引,确定为第二上下文模型的索引。Example 2: Determine the weighted prediction value range in which the third weighted prediction value is located, and determine the index corresponding to the range as the index of the second context model.
在一些实施例中,编码端还可以采用如下方式二,确定第二上下文模型的索引。In some embodiments, the encoding end may also adopt the following method 2 to determine the index of the second context model.
方式二、若K大于1时,确定K个预测参考帧中每一个预测参考帧对应的第四加权预测值,进而基于K个预测参考帧分别对应的第四加权预测值,确定第二上下文模型的索引。此时,上述步骤A1包括如下步骤A1-21至步骤A1-24的步骤:Method 2: If K is greater than 1, determine the fourth weighted prediction value corresponding to each of the K prediction reference frames, and then determine the index of the second context model based on the fourth weighted prediction values corresponding to the K prediction reference frames. At this time, the above step A1 includes the following steps A1-21 to A1-24:
步骤A1-21、针对K个预测参考帧中的第j个预测参考帧,基于当前节点在第j个预测参考帧中预测节点的第二直接编码信息,确定第j个预测参考帧中的预测节点对应的第二数值,j为小于或等于K的正整数;Step A1-21, for the j-th prediction reference frame among the K prediction reference frames, determining a second value corresponding to the prediction node in the j-th prediction reference frame based on the second direct encoding information of the prediction node of the current node in the j-th prediction reference frame, where j is a positive integer less than or equal to K;
步骤A1-22、确定预测节点对应的第一权重,并基于第一权重对第j个预测参考帧中的预测节点对应的第二数值进行加权处理,得到第j个预测参考帧对应的第四加权预测值;Step A1-22, determining a first weight corresponding to the prediction node, and performing weighted processing on the second value corresponding to the prediction node in the j-th prediction reference frame based on the first weight, to obtain a fourth weighted prediction value corresponding to the j-th prediction reference frame;
步骤A1-23、基于K个预测参考帧对应的第四加权预测值,确定第二上下文模型的索引。Step A1-23, determine the index of the second context model based on the fourth weighted prediction values corresponding to the K prediction reference frames.
本申请实施例中,编码端确定K个预测参考帧中每一个预测参考帧对应的第四加权预测值的具体方式相同,为了便于描述,在此以K个预测参考帧中的第j个预测参考帧为例进行说明。In the embodiment of the present application, the specific method in which the encoding end determines the fourth weighted prediction value corresponding to each of the K prediction reference frames is the same. For the sake of ease of description, the jth prediction reference frame among the K prediction reference frames is taken as an example for illustration.
在本申请实施例中,当前节点在第j个预测参考帧中包括至少一个预测节点,这样基于该第j个预测参考帧中的这至少一个预测节点的第二直接编码信息,确定这至少一个预测节点的第二数值。In an embodiment of the present application, the current node includes at least one prediction node in the jth prediction reference frame, so that the second value of the at least one prediction node is determined based on the second direct encoding information of the at least one prediction node in the jth prediction reference frame.
编码端确定出第j个预测参考帧所包括的至少一个预测节点的第二数值后,确定这至少一个预测节点分别对应的第一权重,并基于第一权重对这至少一个预测节点对应的第二数值进行加权处理,得到第j个预测参考帧对应的第四加权预测值。After the encoding end determines the second value of at least one prediction node included in the j-th prediction reference frame, it determines the first weight corresponding to the at least one prediction node, and weightedly processes the second value corresponding to the at least one prediction node based on the first weight to obtain a fourth weighted prediction value corresponding to the j-th prediction reference frame.
在一种示例中,基于第一权重,对第j个预测参考帧中的预测节点对应的第二数值进行加权平均,得到第j个预测参考帧对应的第四加权预测值。In one example, based on the first weight, a weighted average is performed on the second values corresponding to the prediction nodes in the j-th prediction reference frame to obtain a fourth weighted prediction value corresponding to the j-th prediction reference frame.
在另一种示例中,基于第一权重,对第j个预测参考帧中的预测节点对应的第二数值进行加权求和,得到第j个预测参考帧对应的第四加权预测值。In another example, based on the first weight, a weighted sum is performed on the second values corresponding to the prediction nodes in the j-th prediction reference frame to obtain a fourth weighted prediction value corresponding to the j-th prediction reference frame.
编码端确定出K个预测参考帧中每一个预测参考帧对应的第四加权预测值后,执行上述步骤A1-23的步骤。After the encoder determines the fourth weighted prediction value corresponding to each of the K prediction reference frames, it executes the above steps A1-23.
本申请实施对基于K个预测参考帧对应的第四加权预测值,确定第二上下文模型的索引的具体方式不做限制。The present application does not limit the specific method of determining the index of the second context model based on the fourth weighted prediction value corresponding to K prediction reference frames.
在一些实施例中,编码端将K个预测参考帧对应的第四加权预测值的平均值,确定为第二上下文模型的索引。In some embodiments, the encoding end determines the average value of the fourth weighted prediction values corresponding to the K prediction reference frames as the index of the second context model.
在一些实施例中,编码端确定K个预测参考帧对应的第二权重,并基于第二权重对K个预测参考帧对应的第四加权预测值进行加权处理,得到第二上下文模型的索引。In some embodiments, the encoding end determines second weights corresponding to K prediction reference frames, and performs weighted processing on fourth weighted prediction values corresponding to the K prediction reference frames based on the second weights to obtain an index of the second context model.
在一种示例中,基于第二权重,对K个预测参考帧分别对应的第四加权预测值进行加权平均,得到第二上下文模型的索引。In one example, based on the second weight, the fourth weighted prediction values corresponding to the K prediction reference frames are weighted averaged to obtain the index of the second context model.
在另一种示例中,基于第二权重,对K个预测参考帧分别对应的第四加权预测值进行加权求和,得到第二上下文模型的索引。In another example, based on the second weight, the fourth weighted prediction values corresponding to the K prediction reference frames are weighted summed to obtain the index of the second context model.
编码端基于上述步骤,确定出第二上下模型的索引后,从第二标志对应的P个上下文模型中,选出第二上下文模型。接着,使用该第二上下文模型对当前节点对应的第二标志进行编码。After determining the index of the second context model based on the above steps, the encoder selects the second context model from the P context models corresponding to the second flag, and then uses the second context model to encode the second flag corresponding to the current node.
下面对编码端对第三标志的预测编码过程进行介绍。The following is an introduction to the predictive encoding process of the third flag by the encoding end.
步骤B、将N个预测节点的第三直接编码信息作为上下文信息,对当前节点对应的第三标志进行预测编码,第三标志用于表示当前节点的第三个点是否为重复点。Step B: Use the third direct coding information of the N predicted nodes as context information, and perform predictive coding on the third flag corresponding to the current node, where the third flag is used to indicate whether the third point of the current node is a repeated point.
在该实施例中,若当前节点的第二个点为重复点时,则编码端继续对当前节点对应的第三标志进行编码,以指示当前节点的第三个点是否为重复点。In this embodiment, if the second point of the current node is a repeated point, the encoding end continues to encode the third flag corresponding to the current node to indicate whether the third point of the current node is a repeated point.
在一些实施例中,上述步骤B包括如下步骤:In some embodiments, the above step B includes the following steps:
步骤B1、基于N个预测节点的第三直接编码信息,确定第三上下文模型的索引;Step B1, determining an index of a third context model based on third direct encoding information of N prediction nodes;
步骤B2、基于第三上下文模型的索引,从第三标志对应的R个上下文模型中,选出第三上下文模型,R为大于1的正整数;Step B2: based on the index of the third context model, select the third context model from R context models corresponding to the third flag, where R is a positive integer greater than 1;
步骤B3、使用第三上下文模型,对当前节点对应的第三标志进行预测编码。Step B3: Use the third context model to perform predictive encoding on the third flag corresponding to the current node.
下面对上述步骤B1中基于N个预测节点的第三直接编码信息,确定第三上下文模型的索引的具体过程进行介绍。The specific process of determining the index of the third context model based on the third direct encoding information of the N prediction nodes in the above step B1 is introduced below.
方式一,上述步骤B1包括如下步骤B1-11和步骤B1-12的步骤:Method 1: the above step B1 includes the following steps B1-11 and B1-12:
步骤B1-11、针对N个预测节点中的任一预测节点,基于预测节点的第三直接编码信息,确定预测节点对应的第三数值。Step B1-11: for any prediction node among the N prediction nodes, determine a third value corresponding to the prediction node based on the third direct encoding information of the prediction node.
在该方式中,针对N个预测节点中的每一个预测节点,基于该预测节点的第三直接编码信息,确定该预测节点对应的第三数值,最后基于N个预测节点对应的第三数值,确定第三上下文模型的索引。In this method, for each of the N prediction nodes, the third numerical value corresponding to the prediction node is determined based on the third direct encoding information of the prediction node, and finally the index of the third context model is determined based on the third numerical values corresponding to the N prediction nodes.
下面对确定预测节点对应的第三数值的过程进行介绍。The process of determining the third value corresponding to the prediction node is introduced below.
在一些实施例中,预测节点的第三直接编码信息包括第二信息和第四信息,其中第二信息表示预测节点是否满足直接编码方式进行编码的条件,第四信息表示预测节点的第三个点是否为重复点。In some embodiments, the third direct encoding information of the prediction node includes second information and fourth information, wherein the second information indicates whether the prediction node meets the conditions for encoding in a direct encoding manner, and the fourth information indicates whether the third point of the prediction node is a repeated point.
此时,编码端可以基于预测节点的第二信息和/或第四信息,确定该预测节点对应的第三数值。At this time, the encoding end may determine the third value corresponding to the prediction node based on the second information and/or the fourth information of the prediction node.
示例性的,编码端通过如下步骤,确定出预测节点对应的第三数值:Exemplarily, the encoder determines the third value corresponding to the prediction node through the following steps:
步骤31、基于第二信息,确定第一子值;Step 31: Determine a first sub-value based on the second information;
步骤32、基于第四信息,确定第四子值;Step 32: Determine a fourth sub-value based on the fourth information;
步骤23、基于第一子值和/或第四子值,确定预测节点对应的第三数值。Step 23: Determine a third value corresponding to the prediction node based on the first sub-value and/or the fourth sub-value.
基于第二信息,确定第一子值的过程可以参照上述步骤11的描述,在此不再赘述。The process of determining the first sub-value based on the second information can refer to the description of step 11 above, which will not be repeated here.
其中,基于第四信息,确定第四子值的方式可以是:若第四信息表示预测节点的第三个点是重复点,则确定第四子值的取值为第一值;若第四信息表示预测节点的第三个点不是重复点,则确定第四子值的取值为第二值。Among them, based on the fourth information, the method for determining the fourth sub-value can be: if the fourth information indicates that the third point of the predicted node is a repeated point, then the value of the fourth sub-value is determined to be the first value; if the fourth information indicates that the third point of the predicted node is not a repeated point, then the value of the fourth sub-value is determined to be the second value.
接着,编码端基于上述确定的第一子值和/或第四子值,确定该预测节点对应的第三数值。Next, the encoder determines a third value corresponding to the prediction node based on the first sub-value and/or the fourth sub-value determined above.
基于上述步骤,可以确定出当前节点的N个预测节点中,每一个预测节点对应的第三数值。接着,执行如下步骤B1-12:Based on the above steps, the third value corresponding to each prediction node in the N prediction nodes of the current node can be determined. Then, the following steps B1-12 are performed:
步骤B1-12、基于N个预测节点对应的第三数值,确定第三上下文模型的索引。Step B1-12: determine the index of the third context model based on the third numerical values corresponding to the N prediction nodes.
方式1,将N个预测节点对应的第三数值之和的平均值,确定为第三上下文模型的索引。Method 1: Determine the average value of the sum of the third numerical values corresponding to the N prediction nodes as the index of the third context model.
方式2,步骤B1-12包括如下步骤B1-121至步骤B1-123的步骤: Mode 2, step B1-12 includes the following steps B1-121 to B1-123:
步骤B1-121、确定预测节点对应的第一权重;Step B1-121, determining a first weight corresponding to the prediction node;
步骤B1-122、基于第一权重,对N个预测节点对应的第三数值进行加权处理,得到第五加权预测值;Step B1-122: Based on the first weight, weighted processing is performed on the third values corresponding to the N prediction nodes to obtain a fifth weighted prediction value;
步骤B1-123、基于第五加权预测值,确定第三上下文模型的索引。Step B1-123, determine the index of the third context model based on the fifth weighted prediction value.
本申请实施例对基于第一权重,对N个预测节点对应的第三数值进行加权处理,得到第五加权预测值的具体方式不做限制。The embodiment of the present application does not limit the specific method of obtaining the fifth weighted prediction value by weighting the third numerical values corresponding to the N prediction nodes based on the first weight.
在一种示例中,基于第一权重,对N个预测节点对应的第三数值进行加权平均,得到第五加权预测值。In one example, based on the first weight, a weighted average is performed on the third values corresponding to the N prediction nodes to obtain a fifth weighted prediction value.
在另一种示例中,基于第一权重,对N个预测节点对应的第三数值进行加权求和,得到第五加权预测值。In another example, based on the first weight, a weighted sum is performed on the third values corresponding to the N prediction nodes to obtain a fifth weighted prediction value.
基于方式步骤,确定出第五加权预测值后,基于该第五加权预测值,确定第三上下文模型的索引,即上述步骤B1-123至少包括如下几种示例:After the fifth weighted prediction value is determined based on the method step, the index of the third context model is determined based on the fifth weighted prediction value, that is, the above step B1-123 includes at least the following examples:
示例1,将第五加权预测值,确定为第三上下文模型的索引。Example 1: Determine the fifth weighted prediction value as the index of the third context model.
示例2,确定第五加权预测值所在加权预测值范围,将该范围对应的索引,确定为第三上下文模型的索引。Example 2: Determine the weighted prediction value range in which the fifth weighted prediction value is located, and determine the index corresponding to the range as the index of the third context model.
在一些实施例中,编码端还可以采用如下方式二,确定第三上下文模型的索引。In some embodiments, the encoding end may also adopt the following method 2 to determine the index of the third context model.
方式二、若K大于1时,确定K个预测参考帧中每一个预测参考帧对应的第六加权预测值,进而基于K个预测参考帧分别对应的第六加权预测值,确定第三上下文模型的索引。此时,上述步骤B1包括如下步骤B1-21至步骤B1-24的步骤:Method 2: If K is greater than 1, determine the sixth weighted prediction value corresponding to each of the K prediction reference frames, and then determine the index of the third context model based on the sixth weighted prediction values corresponding to the K prediction reference frames. At this time, the above step B1 includes the following steps B1-21 to B1-24:
步骤B1-21、针对K个预测参考帧中的第j个预测参考帧,基于当前节点在第j个预测参考帧中预测节点的第三直接编码信息,确定第j个预测参考帧中的预测节点对应的第三数值,j为小于或等于K的正整数;Step B1-21, for the j-th prediction reference frame among the K prediction reference frames, determine a third value corresponding to the prediction node in the j-th prediction reference frame based on the third direct coding information of the prediction node of the current node in the j-th prediction reference frame, where j is a positive integer less than or equal to K;
步骤B1-22、确定预测节点对应的第一权重,并基于第一权重对第j个预测参考帧中的预测节点对应的第三数值进行加权处理,得到第j个预测参考帧对应的第六加权预测值;Step B1-22, determining a first weight corresponding to the prediction node, and performing weighted processing on the third value corresponding to the prediction node in the j-th prediction reference frame based on the first weight, to obtain a sixth weighted prediction value corresponding to the j-th prediction reference frame;
步骤B1-23、基于K个预测参考帧对应的第六加权预测值,确定第三上下文模型的索引。Step B1-23, determine the index of the third context model based on the sixth weighted prediction value corresponding to the K prediction reference frames.
在该方式二中,在确定第三上下文模型的索引时,对这K个预测参考帧中的每一个预测参考帧作为单独的上下文信息分别进行考虑。具体是,确定K个预测参考帧中每一个预测参考帧所包括的预测节点的第三直接编码信息,确定每一个预测参考帧对应的第六加权预测值,进而基于每一个预测参考帧对应的第六加权预测值,确定第三上下文模型的索引,实现第三上下文模型的准确选择,进而提升点云的编码效率。In the second method, when determining the index of the third context model, each of the K prediction reference frames is considered as separate context information. Specifically, the third direct coding information of the prediction node included in each of the K prediction reference frames is determined, and the sixth weighted prediction value corresponding to each prediction reference frame is determined. Then, based on the sixth weighted prediction value corresponding to each prediction reference frame, the index of the third context model is determined to achieve accurate selection of the third context model, thereby improving the coding efficiency of the point cloud.
本申请实施例中,编码端确定K个预测参考帧中每一个预测参考帧对应的第六加权预测值的具体方式相同,为了便于描述,在此以K个预测参考帧中的第j个预测参考帧为例进行说明。In the embodiment of the present application, the specific method in which the encoding end determines the sixth weighted prediction value corresponding to each of the K prediction reference frames is the same. For the sake of ease of description, the jth prediction reference frame among the K prediction reference frames is taken as an example for illustration.
在本申请实施例中,当前节点在第j个预测参考帧中包括至少一个预测节点,这样基于该第j个预测参考帧中的这至少一个预测节点的第三直接编码信息,确定这至少一个预测节点的第三数值。In an embodiment of the present application, the current node includes at least one prediction node in the jth prediction reference frame, so that the third value of the at least one prediction node is determined based on the third direct encoding information of the at least one prediction node in the jth prediction reference frame.
编码端确定出第j个预测参考帧所包括的至少一个预测节点的第三数值后,确定这至少一个预测节点分别对应的第一权重,并基于第一权重对这至少一个预测节点对应的第三数值进行加权处理,得到第j个预测参考帧对应的第六加权预测值。After the encoding end determines the third value of at least one prediction node included in the j-th prediction reference frame, it determines the first weight corresponding to the at least one prediction node, and weightedly processes the third value corresponding to the at least one prediction node based on the first weight to obtain the sixth weighted prediction value corresponding to the j-th prediction reference frame.
在一种示例中,基于第一权重,对第j个预测参考帧中的预测节点对应的第三数值进行加权平均,得到第j个预测参考帧对应的第六加权预测值。In one example, based on the first weight, a weighted average is performed on the third values corresponding to the prediction nodes in the j-th prediction reference frame to obtain a sixth weighted prediction value corresponding to the j-th prediction reference frame.
在另一种示例中,基于第一权重,对第j个预测参考帧中的预测节点对应的第三数值进行加权求和,得到第j个预测参考帧对应的第六加权预测值。In another example, based on the first weight, a weighted sum is performed on the third values corresponding to the prediction nodes in the j-th prediction reference frame to obtain a sixth weighted prediction value corresponding to the j-th prediction reference frame.
编码端确定出K个预测参考帧中每一个预测参考帧对应的第六加权预测值后,执行上述步骤B1-23的步骤。After the encoder determines the sixth weighted prediction value corresponding to each of the K prediction reference frames, it executes the above steps B1-23.
在一些实施例中,编码端将K个预测参考帧对应的第六加权预测值的平均值,确定为第三上下文模型的索引。In some embodiments, the encoding end determines the average value of the sixth weighted prediction values corresponding to the K prediction reference frames as the index of the third context model.
在一些实施例中,编码端确定K个预测参考帧对应的第二权重,并基于第二权重对K个预测参考帧对应的第六加权预测值进行加权处理,得到第三上下文模型的索引。In some embodiments, the encoding end determines second weights corresponding to K prediction reference frames, and performs weighted processing on sixth weighted prediction values corresponding to the K prediction reference frames based on the second weights to obtain an index of the third context model.
在一种示例中,基于第二权重,对K个预测参考帧分别对应的第六加权预测值进行加权平均,得到第三上下文模型的索引。In one example, based on the second weight, the sixth weighted prediction values corresponding to the K prediction reference frames are weighted averaged to obtain the index of the third context model.
在另一种示例中,基于第二权重,对K个预测参考帧分别对应的第六加权预测值进行加权求和,得到第三上下文模型的索引。In another example, based on the second weight, the sixth weighted prediction values corresponding to the K prediction reference frames are weighted summed to obtain the index of the third context model.
编码端基于上述步骤,确定出第三上下模型的索引后,从第三标志对应的R个上下文模型中,选出第三上下文模型。接着,使用该第三上下文模型对当前节点对应的第三标志进行编码。After determining the index of the third context model based on the above steps, the encoder selects the third context model from the R context models corresponding to the third flag, and then uses the third context model to encode the third flag corresponding to the current node.
在一些实施例中,若当前节点的第三个点是重复点,则编码端继续编码当前节点剩余的重复点。In some embodiments, if the third point of the current node is a repeated point, the encoding end continues to encode the remaining repeated points of the current node.
下面对当前节点的点数小于或等于2,以及当前节点的点数大于2这2种情况分别进行介绍。The following introduces two cases: the number of points of the current node is less than or equal to 2, and the number of points of the current node is greater than 2.
情况1,若当前节点的点数小于或等于2时,则本申请实施例的点云编码方法包括如下步骤:Case 1: if the number of points of the current node is less than or equal to 2, the point cloud encoding method of the embodiment of the present application includes the following steps:
步骤c1、将N个预测节点的第一直接编码信息作为上下文信息,对当前节点对应的第一标志进行预测编码。Step c1: Use the first direct coding information of the N prediction nodes as context information to perform predictive coding on the first flag corresponding to the current node.
示例性的,第一直接编码信息包括如下内容:Exemplarily, the first direct coding information includes the following content:
a)、预测节点的IDCM模式,即PredDCMode(0,1,2);a) IDCM mode of the prediction node, i.e. PredDCMode(0,1,2);
b)、预测节点中的点数PredNumPoints是否大于1。b) Whether the number of points in the prediction node PredNumPoints is greater than 1.
上述步骤c1的具体实现过程可以参照上述实施例的描述,在此不再赘述。The specific implementation process of the above step c1 can refer to the description of the above embodiment, which will not be repeated here.
其中,若当前节点包括2个非重复点时,则将第一标志置为真。If the current node includes two non-repeated points, the first flag is set to true.
若当前节点包括只包括1个点时,则将第一标志置为假,且执行如下步骤02。If the current node includes only one point, the first flag is set to false and the following step 02 is executed.
步骤c2、将N个预测节点的第二直接编码信息作为上下文信息,对当前节点对应的第二标志进行预测编码。Step c2: Use the second direct encoding information of the N prediction nodes as context information to perform predictive encoding on the second flag corresponding to the current node.
示例性的,第二直接编码信息包括如下内容:Exemplarily, the second direct coding information includes the following content:
a)、预测节点的IDCM模式,即PredDCMode(0,1,2);a) IDCM mode of the prediction node, i.e. PredDCMode(0,1,2);
b)、预测节点中的第二个点是否是重复点,如果预测节点的点数小于2,则默认为不是重复点。b) Whether the second point in the prediction node is a duplicate point. If the number of points in the prediction node is less than 2, it is assumed not to be a duplicate point by default.
上述步骤c2的具体实现过程可以参照上述实施例的描述,在此不再赘述。The specific implementation process of the above step c2 can refer to the description of the above embodiment and will not be repeated here.
情况2,若当前节点的点数大于2时,则本申请实施例的点云编码方法包括如下步骤:Case 2: If the number of points of the current node is greater than 2, the point cloud encoding method of the embodiment of the present application includes the following steps:
步骤d1、将N个预测节点的第一直接编码信息作为上下文信息,对当前节点对应的第一标志进行预测编码。Step d1: Use the first direct coding information of the N prediction nodes as context information to perform predictive coding on the first flag corresponding to the current node.
示例性的,第一直接编码信息包括如下内容:Exemplarily, the first direct coding information includes the following content:
a)、预测节点的IDCM模式,即PredDCMode(0,1,2);a) IDCM mode of the prediction node, i.e. PredDCMode(0,1,2);
b)、预测节点中的点数PredNumPoints是否大于1。b) Whether the number of points in the prediction node PredNumPoints is greater than 1.
上述步骤d1的具体实现过程可以参照上述实施例的描述,在此不再赘述。The specific implementation process of the above step d1 can refer to the description of the above embodiment, which will not be repeated here.
其中,若当前节点包括多个重复点时,则将第一标志置为假,且执行如下步骤d2。If the current node includes multiple repeated points, the first flag is set to false, and the following step d2 is executed.
步骤d2、将N个预测节点的第二直接编码信息作为上下文信息,对当前节点对应的第二标志进行预测编码。Step d2: Use the second direct encoding information of the N prediction nodes as context information to perform predictive encoding on the second flag corresponding to the current node.
示例性的,第二直接编码信息包括如下内容:Exemplarily, the second direct coding information includes the following content:
a)、预测节点的IDCM模式,即PredDCMode(0,1,2);a) IDCM mode of the prediction node, i.e. PredDCMode(0,1,2);
b)、预测节点中的第二个点是否是重复点,如果预测节点的点数小于2,则默认为不是重复点。b) Whether the second point in the prediction node is a duplicate point. If the number of points in the prediction node is less than 2, it is assumed not to be a duplicate point by default.
上述步骤d2的具体实现过程可以参照上述实施例的描述,在此不再赘述。The specific implementation process of the above step d2 can refer to the description of the above embodiment and will not be repeated here.
若当前节点的第二个点为重复点时,则将第二标志置为真,则执行步骤d3。If the second point of the current node is a duplicate point, the second flag is set to true, and step d3 is executed.
步骤d3、将N个预测节点的第三直接编码信息作为上下文信息,对当前节点对应的第三标志进行预测编码。Step d3: Use the third direct encoding information of the N prediction nodes as context information to perform predictive encoding on the third flag corresponding to the current node.
示例性的,第三直接编码信息包括如下内容:Exemplarily, the third direct coding information includes the following content:
a)、预测节点的IDCM模式,即PredDCMode(0,1,2);a) IDCM mode of the prediction node, i.e. PredDCMode(0,1,2);
b)、预测节点中的第三个点是否是重复点,如果预测节点的点数小于2,则默认为不是重复点。b) Whether the third point in the prediction node is a duplicate point. If the number of points in the prediction node is less than 2, it is assumed not to be a duplicate point.
上述步骤d3的具体实现过程可以参照上述实施例的描述,在此不再赘述。The specific implementation process of the above step d3 can refer to the description of the above embodiment, which will not be repeated here.
若当前节点的第三个点为重复点时,则将第三标志置为真,则执行步骤d4。If the third point of the current node is a repeated point, the third flag is set to true, and step d4 is executed.
步骤d4、利用指数哥伦布编码方式编码剩余的重复点数目。Step d4: Encode the remaining number of repeated points using Exponential Golomb coding.
本申请实施例提供的点云编码方法,在对当前编码帧中的当前节点进行编码时,确定当前节点对应的帧间编码信息,并将当前节点对应的帧间编码信息作为上下文信息对当前节点的点数进行预测编码,增加了编码点数时可选择的上下文条件,使得确定出的条件熵减小,从而提升点云的编码效率和性能。The point cloud encoding method provided in the embodiment of the present application determines the inter-frame encoding information corresponding to the current node when encoding the current node in the current encoding frame, and uses the inter-frame encoding information corresponding to the current node as context information to predict the number of points of the current node, thereby increasing the selectable context conditions when encoding the point number, reducing the determined conditional entropy, and thus improving the encoding efficiency and performance of the point cloud.
应理解,图8至图15仅为本申请的示例,不应理解为对本申请的限制。It should be understood that Figures 8 to 15 are merely examples of the present application and should not be construed as limitations to the present application.
以上结合附图详细描述了本申请的优选实施方式,但是,本申请并不限于上述实施方式中的具体细节,在本申请的技术构思范围内,可以对本申请的技术方案进行多种简单变型,这些简单变型均属于本申请的保护范围。例如,在上述具体实施方式中所描述的各个具体技术特征,在不矛盾的情况下,可以通过任何合适的方式进行组合,为了避免不必要的重复,本申请对各种可能的组合方式不再另行说明。又例如,本申请的各种不同的实施方式之间也可以进行任意组合,只要其不违背本申请的思想,其同样应当视为本申请所公开的内容。The preferred embodiments of the present application are described in detail above in conjunction with the accompanying drawings. However, the present application is not limited to the specific details in the above embodiments. Within the technical concept of the present application, the technical solution of the present application can be subjected to a variety of simple modifications, and these simple modifications all belong to the protection scope of the present application. For example, the various specific technical features described in the above specific embodiments can be combined in any suitable manner without contradiction. In order to avoid unnecessary repetition, the present application will not further explain various possible combinations. For another example, the various different embodiments of the present application can also be arbitrarily combined, as long as they do not violate the ideas of the present application, they should also be regarded as the contents disclosed in the present application.
还应理解,在本申请的各种方法实施例中,上述各过程的序号的大小并不意味着执行顺序的先后,各过程的执行顺序应以其功能和内在逻辑确定,而不应对本申请实施例的实施过程构成任何限定。另外,本申请实施例中,术语“和/或”,仅仅是一种描述关联对象的关联关系,表示可以存在三种关系。具体地,A和/或B可以表示:单独存在A,同时存在A和B,单独存在B这三种情况。另外,本文中字符“/”,一般表示前后关联对象是一种“或”的关系。It should also be understood that in the various method embodiments of the present application, the size of the sequence number of each process does not mean the order of execution, and the execution order of each process should be determined by its function and internal logic, and should not constitute any limitation on the implementation process of the embodiment of the present application. In addition, in the embodiment of the present application, the term "and/or" is merely a description of the association relationship of associated objects, indicating that three relationships may exist. Specifically, A and/or B can represent: A exists alone, A and B exist at the same time, and B exists alone. In addition, the character "/" in this article generally indicates that the objects associated before and after are in an "or" relationship.
上文结合图8至图15,详细描述了本申请的方法实施例,下文结合图16至图17,详细描述本申请的装置实施例。The above text describes in detail a method embodiment of the present application in combination with Figures 8 to 15 , and the following text describes in detail a device embodiment of the present application in combination with Figures 16 to 17 .
图16是本申请实施例提供的点云解码装置的示意性框图。FIG. 16 is a schematic block diagram of a point cloud decoding device provided in an embodiment of the present application.
如图16所示,该点云解码装置10可以包括:As shown in FIG16 , the point cloud decoding device 10 may include:
确定单元11,用于确定当前节点对应的帧间解码信息,所述当前节点为当前待解码帧中的待解码节点;A determination unit 11, configured to determine inter-frame decoding information corresponding to a current node, wherein the current node is a node to be decoded in a current frame to be decoded;
解码单元12,用于基于所述当前节点对应的帧间解码信息,对所述当前节点所包括的点数进行预测解码。The decoding unit 12 is used to predict and decode the number of points included in the current node based on the inter-frame decoding information corresponding to the current node.
在一些实施例中,所述帧间解码信息包括所述当前节点的N个预测节点的几何解码信息和所述当前节点对应的第一信息中的至少一个,所述第一信息用于指示所述当前节点是否允许启动帧间解码方式,确定单元11,具体用于在所述当前待解码帧的预测参考帧中,确定所述当前节点的N个预测节点,和/或确定所述第一信息,所述N为正整数;In some embodiments, the inter-frame decoding information includes at least one of geometric decoding information of N prediction nodes of the current node and first information corresponding to the current node, the first information is used to indicate whether the current node is allowed to start the inter-frame decoding mode, and the determination unit 11 is specifically used to determine the N prediction nodes of the current node in the prediction reference frame of the current frame to be decoded, and/or determine the first information, where N is a positive integer;
解码单元12,具体用于基于所述N个预测节点的几何解码信息和/或所述第一信息,对所述当前节点所包括的点数进行预测解码。The decoding unit 12 is specifically used to predict and decode the number of points included in the current node based on the geometric decoding information of the N prediction nodes and/or the first information.
在一些实施例中,所述预测节点的几何解码信息包括所述预测节点的直接解码信息,所述解码单元12,具体用于基于所述N个预测节点的直接解码信息和/或所述第一信息,对所述当前节点的点数进行预测解码。In some embodiments, the geometric decoding information of the prediction node includes direct decoding information of the prediction node, and the decoding unit 12 is specifically used to predict and decode the number of points of the current node based on the direct decoding information of the N prediction nodes and/or the first information.
在一些实施例中,所述当前待解码帧包括K个预测参考帧,确定单元11,具体用于针对所述K个预测参考帧中的第k个预测参考帧,确定所述当前节点在所述第k个预测参考帧中的至少一个预测节点,所述k为小于或等于K的正整数,所述K为正整数;基于所述当前节点在所述K个预测参考帧中的至少一个预测节点,确定所述当前节点的N个预测节点。In some embodiments, the current frame to be decoded includes K prediction reference frames, and the determination unit 11 is specifically used to determine at least one prediction node of the current node in the kth prediction reference frame among the K prediction reference frames, where k is a positive integer less than or equal to K, and K is a positive integer; based on at least one prediction node of the current node in the K prediction reference frames, determine N prediction nodes of the current node.
在一些实施例中,确定单元11,具体用于在所述当前待解码帧中,确定所述当前节点的M个领域节点,所述M个领域节点中包括所述当前节点,所述M为正整数;针对所述M个领域节点中的第i个领域节点,确定所述第i个领域节点在所述第k个预测参考帧中的对应节点,所述i为小于或等于M的正整数;基于所述M个领域节点在所述第k个预测参考帧中的对应节点,确定所述当前节点在所述第k个预测参考帧中的至少一个预测节点。In some embodiments, the determination unit 11 is specifically used to determine M domain nodes of the current node in the current frame to be decoded, where the M domain nodes include the current node, and M is a positive integer; for the i-th domain node among the M domain nodes, determine the corresponding node of the i-th domain node in the k-th prediction reference frame, where i is a positive integer less than or equal to M; based on the corresponding nodes of the M domain nodes in the k-th prediction reference frame, determine at least one prediction node of the current node in the k-th prediction reference frame.
在一些实施例中,确定单元11,具体用于确定所述当前节点在所述第k个预测参考帧中的对应节点;确定所述对应节点的至少一个领域节点;将所述至少一个领域节点,确定为所述当前节点在所述第k个预测参考帧中的至少一个预测节点。In some embodiments, the determination unit 11 is specifically used to determine the corresponding node of the current node in the kth prediction reference frame; determine at least one domain node of the corresponding node; and determine the at least one domain node as at least one prediction node of the current node in the kth prediction reference frame.
在一些实施例中,确定单元11,具体用于在所述当前待解码帧中,确定第i个节点的父节点,作为第i个父节点,所述第i个节点为所述第i个领域节点或者为所述当前节点;确定所述第i个父节点在所述第k个预测参考帧中的匹配节点,作为第i个匹配节点;将所述i个匹配节点的子节点中的一个子节点,确定为所述第i个节点在所述第k个预测参考帧中的对应节点。In some embodiments, the determination unit 11 is specifically used to determine the parent node of the i-th node in the current frame to be decoded, as the i-th parent node, the i-th node being the i-th domain node or the current node; determine the matching node of the i-th parent node in the k-th prediction reference frame as the i-th matching node; and determine one of the child nodes of the i-matching node as the corresponding node of the i-th node in the k-th prediction reference frame.
在一些实施例中,确定单元11,具体用于基于所述第i个父节点的占位信息,确定所述第i个父节点在所述第k个预测参考帧中的匹配节点。In some embodiments, the determination unit 11 is specifically configured to determine a matching node of the i-th parent node in the k-th prediction reference frame based on the placeholder information of the i-th parent node.
在一些实施例中,确定单元11,具体用于将所述第k个预测参考帧中,占位信息与所述第i个父节点的占位信息 之间的差异最小的节点,确定为所述第i个父节点在所述第k个预测参考帧中的匹配节点。In some embodiments, the determination unit 11 is specifically used to determine the node whose placeholder information in the k-th prediction reference frame has the smallest difference with the placeholder information of the i-th parent node as the matching node of the i-th parent node in the k-th prediction reference frame.
在一些实施例中,确定单元11,具体用于确定所述第i个节点在所述父节点所包括的子节点中的第一序号;将所述第i个匹配节点的子节点中序号为第一序号的子节点,确定为所述第i个节点在所述第k个预测参考帧中的对应节点。In some embodiments, the determination unit 11 is specifically used to determine the first serial number of the i-th node among the child nodes included in the parent node; and determine the child node with the first serial number among the child nodes of the i-th matching node as the corresponding node of the i-th node in the k-th prediction reference frame.
在一些实施例中,确定单元11,具体用于将所述M个领域节点在所述第k个预测参考帧中的对应节点,确定为所述当前节点在所述第k个预测参考帧中的至少一个预测节点。In some embodiments, the determination unit 11 is specifically configured to determine corresponding nodes of the M domain nodes in the k-th prediction reference frame as at least one prediction node of the current node in the k-th prediction reference frame.
在一些实施例中,确定单元11,具体用于将所述当前节点在所述K个预测参考帧中的至少一个预测节点,确定为所述当前节点的N个预测节点。In some embodiments, the determination unit 11 is specifically configured to determine at least one prediction node of the current node in the K prediction reference frames as N prediction nodes of the current node.
在一些实施例中,若所述当前待解码帧为P帧,则所述K个预测参考帧包括所述当前待解码帧的前向帧。In some embodiments, if the current frame to be decoded is a P frame, the K prediction reference frames include a forward frame of the current frame to be decoded.
在一些实施例中,若所述当前待解码帧为B帧,则所述K个预测参考帧包括所述当前待解码帧的前向帧和后向帧。In some embodiments, if the current frame to be decoded is a B frame, the K prediction reference frames include a forward frame and a backward frame of the current frame to be decoded.
在一些实施例中,所述预测节点的直接解码信息包括第一直接解码信息,解码单元12,具体用于将所述N个预测节点的第一直接解码信息作为上下文信息,对所述当前节点对应的第一标志进行预测解码,所述第一标志用于表示所述当前节点所包括的点数是否大于1。In some embodiments, the direct decoding information of the prediction node includes first direct decoding information, and the decoding unit 12 is specifically used to use the first direct decoding information of the N prediction nodes as context information to perform predictive decoding on a first flag corresponding to the current node, wherein the first flag is used to indicate whether the number of points included in the current node is greater than 1.
在一些实施例中,解码单元12,具体用于基于所述N个预测节点的第一直接解码信息,确定第一上下文模型的索引;基于所述第一上下文模型的索引,从所述第一标志对应的Q个上下文模型中,选出所述第一上下文模型,所述Q为大于1的正整数;使用所述第一上下文模型,对所述当前节点对应的第一标志进行预测解码。In some embodiments, the decoding unit 12 is specifically used to determine the index of the first context model based on the first direct decoding information of the N prediction nodes; based on the index of the first context model, select the first context model from the Q context models corresponding to the first flag, where Q is a positive integer greater than 1; and use the first context model to predict and decode the first flag corresponding to the current node.
在一些实施例中,解码单元12,具体用于针对所述N个预测节点中的任一预测节点,基于所述预测节点的第一直接解码信息,确定所述预测节点对应的第一数值;基于所述N个预测节点对应的第一数值,确定所述第一上下文模型的索引。In some embodiments, the decoding unit 12 is specifically used to determine, for any prediction node among the N prediction nodes, a first numerical value corresponding to the prediction node based on the first direct decoding information of the prediction node; and determine the index of the first context model based on the first numerical values corresponding to the N prediction nodes.
在一些实施例中,解码单元12,具体用于确定所述预测节点对应的第一权重;基于所述第一权重,对所述N个预测节点对应的第一数值进行加权处理,得到第一加权预测值;基于所述第一加权预测值,确定所述第一上下文模型的索引。In some embodiments, the decoding unit 12 is specifically used to determine a first weight corresponding to the prediction node; based on the first weight, weighted processing is performed on the first numerical values corresponding to the N prediction nodes to obtain a first weighted prediction value; based on the first weighted prediction value, the index of the first context model is determined.
在一些实施例中,若所述K大于1时,则解码单元12,具体用于针对所述K个预测参考帧中的第j个预测参考帧,基于所述当前节点在所述第j个预测参考帧中的预测节点的第一直接解码信息,确定所述第j个预测参考帧中的预测节点对应的第一数值,所述j为小于或等于K的正整数;确定所述预测节点对应的第一权重,并基于所述第一权重对所述第j个预测参考帧中的预测节点对应的第一数值进行加权处理,得到所述第j个预测参考帧对应的第二加权预测值;基于所述K个预测参考帧对应的第二加权预测值,确定所述第一上下文模型的索引。In some embodiments, if K is greater than 1, the decoding unit 12 is specifically used to determine, for the j-th prediction reference frame among the K prediction reference frames, a first numerical value corresponding to the prediction node in the j-th prediction reference frame based on the first direct decoding information of the prediction node of the current node in the j-th prediction reference frame, where j is a positive integer less than or equal to K; determine a first weight corresponding to the prediction node, and weightedly process the first numerical value corresponding to the prediction node in the j-th prediction reference frame based on the first weight to obtain a second weighted prediction value corresponding to the j-th prediction reference frame; and determine the index of the first context model based on the second weighted prediction values corresponding to the K prediction reference frames.
在一些实施例中,解码单元12,具体用于确定所述K个预测参考帧对应的第二权重;基于所述第二权重对所述K个预测参考帧分别对应的第二加权预测值进行加权处理,得到所述第一上下文模型的索引。In some embodiments, the decoding unit 12 is specifically used to determine the second weights corresponding to the K prediction reference frames; based on the second weights, weighted processing is performed on the second weighted prediction values corresponding to the K prediction reference frames respectively to obtain the index of the first context model.
在一些实施例中,所述第一直接解码信息包括第二信息和预测节点的点数中的至少一个,所述第二信息表示所述预测节点是否满足所述直接解码方式进行解码的条件,解码单元12,具体用于基于所述第二信息,确定第一子值;基于所述预测节点的点数,确定第二子值;基于所述第一子值和/或所述第二子值,确定所述预测节点对应的第一数值。In some embodiments, the first direct decoding information includes at least one of second information and the number of points of the prediction node, the second information indicates whether the prediction node satisfies the conditions for decoding by the direct decoding method, and the decoding unit 12 is specifically used to determine the first sub-value based on the second information; determine the second sub-value based on the number of points of the prediction node; determine the first numerical value corresponding to the prediction node based on the first sub-value and/or the second sub-value.
在一些实施例中,解码单元12,具体用于若所述预测节点所包括的点数大于第一预设值,则确定所述第二子值的取值为第一值;若所述预测节点所包括的点数小于或等于第一预设值,则确定所述第二子值的取值为第二值。In some embodiments, the decoding unit 12 is specifically used to determine that the value of the second sub-value is a first value if the number of points included in the prediction node is greater than a first preset value; if the number of points included in the prediction node is less than or equal to the first preset value, then determine that the value of the second sub-value is a second value.
在一些实施例中,解码单元12,还用于若所述当前节点对应的第一标志表示所述当前节点所包括的点数大于1时,则确定所述当前节点包括2个点。In some embodiments, the decoding unit 12 is further used to determine that the current node includes 2 points if the first flag corresponding to the current node indicates that the number of points included in the current node is greater than 1.
在一些实施例中,所述预测节点的直接解码信息包括第二直接解码信息,解码单元12,还用于若所述当前节点对应的第一标志表示所述当前节点所包括的点数小于或等于1时,则将所述N个预测节点的第二直接解码信息作为上下文信息,对所述当前节点对应的第二标志进行预测解码,所述第二标志用于表示所述当前节点的第二个点是否为重复点。In some embodiments, the direct decoding information of the predicted node includes second direct decoding information, and the decoding unit 12 is also used to use the second direct decoding information of the N predicted nodes as context information to perform predictive decoding on the second flag corresponding to the current node if the first flag corresponding to the current node indicates that the number of points included in the current node is less than or equal to 1, and the second flag is used to indicate whether the second point of the current node is a repeated point.
在一些实施例中,解码单元12,具体用于基于所述N个预测节点的第二直接解码信息,确定第二上下文模型的索引;基于所述第二上下文模型的索引,从所述第二标志对应的P个上下文模型中,选出所述第二上下文模型,所述P为大于1的正整数;使用所述第二上下文模型,对所述当前节点对应的第二标志进行预测解码。In some embodiments, the decoding unit 12 is specifically used to determine the index of the second context model based on the second direct decoding information of the N prediction nodes; based on the index of the second context model, select the second context model from the P context models corresponding to the second flag, where P is a positive integer greater than 1; and use the second context model to predict and decode the second flag corresponding to the current node.
在一些实施例中,解码单元12,具体用于针对所述N个预测节点中的任一预测节点,基于所述预测节点的第二直接解码信息,确定所述预测节点对应的第二数值;基于所述N个预测节点对应的第二数值,确定所述第二上下文模型的索引。In some embodiments, the decoding unit 12 is specifically used to determine, for any prediction node among the N prediction nodes, a second numerical value corresponding to the prediction node based on the second direct decoding information of the prediction node; and determine an index of the second context model based on the second numerical values corresponding to the N prediction nodes.
在一些实施例中,解码单元12,具体用于确定所述预测节点对应的第一权重;基于所述第一权重,对所述N个预测节点对应的第二数值进行加权处理,得到第三加权预测值;基于所述第三加权预测值,确定所述第二上下文模型的索引。In some embodiments, the decoding unit 12 is specifically used to determine a first weight corresponding to the prediction node; based on the first weight, weighted processing is performed on the second numerical values corresponding to the N prediction nodes to obtain a third weighted prediction value; based on the third weighted prediction value, the index of the second context model is determined.
在一些实施例后,若所述K大于1时,则解码单元12,具体用于针对所述K个预测参考帧中的第j个预测参考帧,基于所述当前节点在所述第j个预测参考帧中的预测节点的第二直接解码信息,确定所述第j个预测参考帧中的预测节点对应的第二数值,所述j为小于或等于K的正整数;确定所述预测节点对应的第一权重,并基于所述第一权重对所述第j个预测参考帧中的预测节点对应的第二数值进行加权处理,得到所述第j个预测参考帧对应的第四加权预测值;基于所述K个预测参考帧对应的第四加权预测值,确定所述第二上下文模型的索引。In some embodiments, if K is greater than 1, the decoding unit 12 is specifically used to determine, for the j-th prediction reference frame among the K prediction reference frames, a second numerical value corresponding to the prediction node in the j-th prediction reference frame based on the second direct decoding information of the prediction node of the current node in the j-th prediction reference frame, where j is a positive integer less than or equal to K; determine a first weight corresponding to the prediction node, and weightedly process the second numerical value corresponding to the prediction node in the j-th prediction reference frame based on the first weight to obtain a fourth weighted prediction value corresponding to the j-th prediction reference frame; and determine the index of the second context model based on the fourth weighted prediction values corresponding to the K prediction reference frames.
在一些实施例中,解码单元12,具体用于确定所述K个预测参考帧对应的第二权重;基于所述第二权重对所述K个预测参考帧分别对应的第四加权预测值进行加权处理,得到所述第二上下文模型的索引。In some embodiments, the decoding unit 12 is specifically used to determine the second weights corresponding to the K prediction reference frames; based on the second weights, weighted processing is performed on the fourth weighted prediction values corresponding to the K prediction reference frames respectively to obtain the index of the second context model.
在一些实施例中,所述第二直接解码信息包括第二信息和第三信息中的至少一个,所述第二信息表示所述预测节点是否满足所述直接解码方式进行解码的条件,所述第三信息表示所述预测节点的第二个点是否为重复点,解码单元 12,具体用于基于所述第二信息,确定第一子值;基于所述第三信息,确定第三子值;基于所述第一子值和/或所述第三子值,确定所述预测节点对应的第二数值。In some embodiments, the second direct decoding information includes at least one of second information and third information, the second information indicates whether the predicted node satisfies the conditions for decoding by the direct decoding method, and the third information indicates whether the second point of the predicted node is a repeated point. The decoding unit 12 is specifically used to determine the first sub-value based on the second information; determine the third sub-value based on the third information; and determine the second numerical value corresponding to the predicted node based on the first sub-value and/or the third sub-value.
在一些实施例中,解码单元12,具体用于若所述第三信息表示所述预测节点的第二个点是重复点,则确定所述第三子值的取值为第一值;若所述第三信息表示所述预测节点的第二个点不是重复点,则确定所述第三子值的取值为第二值。In some embodiments, the decoding unit 12 is specifically used to determine that the value of the third sub-value is the first value if the third information indicates that the second point of the prediction node is a repeated point; if the third information indicates that the second point of the prediction node is not a repeated point, determine that the value of the third sub-value is the second value.
在一些实施例中,解码单元12,还用于若所述当前节点对应的第二标志表示所述当前节点的第二个点不是重复点1时,则确定所述当前节点包括1个点。In some embodiments, the decoding unit 12 is further used to determine that the current node includes 1 point if the second flag corresponding to the current node indicates that the second point of the current node is not the repeated point 1.
在一些实施例中,所述预测节点的直接解码信息包括第三直接解码信息,解码单元12,还用于若所述当前节点对应的第二标志表示所述当前节点的第二个点是重复点时,则将所述N个预测节点的第三直接解码信息作为上下文信息,对所述当前节点对应的第三标志进行预测解码,所述第三标志用于表示所述当前节点的第三个点是否为重复点。In some embodiments, the direct decoding information of the predicted node includes third direct decoding information, and the decoding unit 12 is also used to use the third direct decoding information of the N predicted nodes as context information to perform predictive decoding on the third flag corresponding to the current node if the second flag corresponding to the current node indicates that the second point of the current node is a repeated point. The third flag is used to indicate whether the third point of the current node is a repeated point.
在一些实施例中,解码单元12,具体用于基于所述N个预测节点的第三直接解码信息,确定第三上下文模型的索引;基于所述第三上下文模型的索引,从所述第三标志对应的R个上下文模型中,选出所述第三上下文模型,所述R为大于1的正整数;使用所述第三上下文模型,对所述当前节点对应的第三标志进行预测解码。In some embodiments, the decoding unit 12 is specifically used to determine the index of the third context model based on the third direct decoding information of the N prediction nodes; based on the index of the third context model, select the third context model from the R context models corresponding to the third flag, where R is a positive integer greater than 1; and use the third context model to predict and decode the third flag corresponding to the current node.
在一些实施例中,解码单元12,具体用于针对所述N个预测节点中的任一预测节点,基于所述预测节点的第三直接解码信息,确定所述预测节点对应的第三数值;基于所述N个预测节点对应的第三数值,确定所述第三上下文模型的索引。In some embodiments, the decoding unit 12 is specifically used to determine, for any one of the N prediction nodes, a third numerical value corresponding to the prediction node based on the third direct decoding information of the prediction node; and determine an index of the third context model based on the third numerical values corresponding to the N prediction nodes.
在一些实施例中,解码单元12,具体用于确定所述预测节点对应的第一权重;基于所述第一权重,对所述N个预测节点对应的第三数值进行加权处理,得到第五加权预测值;基于所述第五加权预测值,确定所述第三上下文模型的索引。In some embodiments, the decoding unit 12 is specifically used to determine a first weight corresponding to the prediction node; based on the first weight, weighted processing is performed on the third numerical values corresponding to the N prediction nodes to obtain a fifth weighted prediction value; based on the fifth weighted prediction value, the index of the third context model is determined.
在一些实施例中,若所述K大于1时,则解码单元12,具体用于针对所述K个预测参考帧中的第j个预测参考帧,基于所述当前节点在所述第j个预测参考帧中的预测节点的第三直接解码信息,确定所述第j个预测参考帧中的预测节点对应的第三数值,所述j为小于或等于K的正整数;确定所述预测节点对应的第一权重,并基于所述第一权重对所述第j个预测参考帧中的预测节点对应的第三数值进行加权处理,得到所述第j个预测参考帧对应的第六加权预测值;基于所述K个预测参考帧对应的第六加权预测值,确定所述第三上下文模型的索引。In some embodiments, if K is greater than 1, the decoding unit 12 is specifically used to determine, for the j-th prediction reference frame among the K prediction reference frames, a third numerical value corresponding to the prediction node in the j-th prediction reference frame based on the third direct decoding information of the prediction node of the current node in the j-th prediction reference frame, where j is a positive integer less than or equal to K; determine a first weight corresponding to the prediction node, and weightedly process the third numerical value corresponding to the prediction node in the j-th prediction reference frame based on the first weight to obtain a sixth weighted prediction value corresponding to the j-th prediction reference frame; and determine an index of the third context model based on the sixth weighted prediction values corresponding to the K prediction reference frames.
在一些实施例中,解码单元12,具体用于确定所述K个预测参考帧对应的第二权重;基于所述第二权重对所述K个预测参考帧分别对应的第六加权预测值进行加权处理,得到所述第三上下文模型的索引。In some embodiments, the decoding unit 12 is specifically used to determine the second weights corresponding to the K prediction reference frames; based on the second weights, weighted processing is performed on the sixth weighted prediction values corresponding to the K prediction reference frames respectively to obtain the index of the third context model.
在一些实施例中,所述第三直接解码信息包括第二信息和第四信息中的至少一个,所述第二信息表示所述预测节点是否满足所述直接解码方式进行解码的条件,所述第四信息表示所述预测节点的第三个点是否为重复点,在一些实施例中,解码单元12,具体用于基于所述第二信息,确定第一子值;基于所述第四信息,确定第四子值;基于所述第一子值和/或所述第四子值,确定所述预测节点对应的第三数值。In some embodiments, the third direct decoding information includes at least one of the second information and the fourth information, the second information indicates whether the predicted node satisfies the conditions for decoding by the direct decoding method, and the fourth information indicates whether the third point of the predicted node is a repeated point. In some embodiments, the decoding unit 12 is specifically used to determine the first sub-value based on the second information; determine the fourth sub-value based on the fourth information; and determine the third numerical value corresponding to the predicted node based on the first sub-value and/or the fourth sub-value.
在一些实施例中,解码单元12,具体用于若所述第四信息表示所述预测节点的第三个点是重复点,则确定所述第四子值的取值为第一值;若所述第四信息表示所述预测节点的第三个点不是重复点,则确定所述第四子值的取值为第二值。In some embodiments, the decoding unit 12 is specifically used to determine that the value of the fourth sub-value is the first value if the fourth information indicates that the third point of the prediction node is a repeated point; if the fourth information indicates that the third point of the prediction node is not a repeated point, determine that the value of the fourth sub-value is the second value.
在一些实施例中,解码单元12,还用于若所述第三标志表示所述当前节点的第三个点为重复点时,则对所述当前节点所包括的剩余重复点数目进行解码。In some embodiments, the decoding unit 12 is further configured to decode the number of remaining repeated points included in the current node if the third flag indicates that the third point of the current node is a repeated point.
在一些实施例中,解码单元12,具体用于若所述第二信息指示所述预测节点满足所述直接解码方式进行解码的条件时,则确定所述第一子值的取值为第一值;若所述第二信息指示所述预测节点不满足所述直接解码方式进行解码的条件时,则确定所述第一子值的取值为第二值。In some embodiments, the decoding unit 12 is specifically used to determine that the value of the first sub-value is a first value if the second information indicates that the prediction node meets the conditions for decoding by the direct decoding method; if the second information indicates that the prediction node does not meet the conditions for decoding by the direct decoding method, then determine that the value of the first sub-value is a second value.
在一些实施例中,所述第二信息包括所述预测节点的直接解码模式信息。In some embodiments, the second information includes direct decoding mode information of the prediction node.
在一些实施例中,解码单元12,具体用于基于所述预测节点对应的领域节点与所述当前节点之间的距离,确定所述预测节点对应的第一权重。In some embodiments, the decoding unit 12 is specifically configured to determine a first weight corresponding to the prediction node based on a distance between a domain node corresponding to the prediction node and the current node.
在一些实施例中,解码单元12,具体用于基于所述预测参考帧与所述当前待解码帧之间的时间差距,确定所述预测参考帧对应的第二权重。In some embodiments, the decoding unit 12 is specifically configured to determine a second weight corresponding to the predicted reference frame based on a time difference between the predicted reference frame and the current frame to be decoded.
应理解,装置实施例与方法实施例可以相互对应,类似的描述可以参照方法实施例。为避免重复,此处不再赘述。具体地,图16所示的点云解码装置10可以对应于执行本申请实施例的点云解码方法中的相应主体,并且点云解码装置10中的各个单元的前述和其它操作和/或功能分别为了实现点云解码方法中的相应流程,为了简洁,在此不再赘述。It should be understood that the device embodiment and the method embodiment may correspond to each other, and similar descriptions may refer to the method embodiment. To avoid repetition, no further description is given here. Specifically, the point cloud decoding device 10 shown in FIG. 16 may correspond to the corresponding subject in the point cloud decoding method of the embodiment of the present application, and the aforementioned and other operations and/or functions of each unit in the point cloud decoding device 10 are respectively for implementing the corresponding processes in the point cloud decoding method, and for the sake of brevity, no further description is given here.
图17是本申请实施例提供的点云编码装置的示意性框图。FIG. 17 is a schematic block diagram of a point cloud encoding device provided in an embodiment of the present application.
如图17所示,点云编码装置20包括:As shown in FIG. 17 , the point cloud encoding device 20 includes:
确定单元21,用于确定当前节点对应的帧间编码信息,所述当前节点为当前待编码帧中的待编码节点;A determination unit 21, configured to determine inter-frame coding information corresponding to a current node, wherein the current node is a node to be coded in a current frame to be coded;
编码单元22,用于基于所述当前节点对应的帧间编码信息,对所述当前节点所包括的点数进行预测编码。The encoding unit 22 is used to perform predictive encoding on the points included in the current node based on the inter-frame encoding information corresponding to the current node.
在一些实施例中,所述帧间编码信息包括所述当前节点的N个预测节点的几何编码信息和所述当前节点对应的第一信息中的至少一个,所述第一信息用于指示所述当前节点是否允许启动帧间编码方式,确定单元21,具体用于在所述当前待编码帧的预测参考帧中,确定所述当前节点的N个预测节点,和/或确定所述第一信息,所述N为正整数;对应的,编码单元22,具体用于基于所述N个预测节点的几何编码信息和/或所述第一信息,对所述当前节点所包括的点数进行预测编码。In some embodiments, the inter-frame coding information includes geometric coding information of N prediction nodes of the current node and at least one of the first information corresponding to the current node, the first information is used to indicate whether the current node allows the inter-frame coding mode to be started, and the determination unit 21 is specifically used to determine the N prediction nodes of the current node in the predicted reference frame of the current frame to be encoded, and/or determine the first information, where N is a positive integer; correspondingly, the encoding unit 22 is specifically used to perform predictive coding on the number of points included in the current node based on the geometric coding information of the N prediction nodes and/or the first information.
在一些实施例中,所述预测节点的几何编码信息包括所述预测节点的直接编码信息,编码单元22,具体用于基于所述N个预测节点的直接编码信息和/或所述第一信息,对所述当前节点的点数进行预测编码。In some embodiments, the geometric coding information of the prediction node includes direct coding information of the prediction node, and the coding unit 22 is specifically used to perform predictive coding on the number of points of the current node based on the direct coding information of the N prediction nodes and/or the first information.
在一些实施例中,所述当前待编码帧包括K个预测参考帧,确定单元21,具体用于针对所述K个预测参考帧中 的第k个预测参考帧,确定所述当前节点在所述第k个预测参考帧中的至少一个预测节点,所述k为小于或等于K的正整数,所述K为正整数;基于所述当前节点在所述K个预测参考帧中的至少一个预测节点,确定所述当前节点的N个预测节点。In some embodiments, the current frame to be encoded includes K prediction reference frames, and the determination unit 21 is specifically used to determine at least one prediction node of the current node in the kth prediction reference frame among the K prediction reference frames, where k is a positive integer less than or equal to K, and K is a positive integer; based on at least one prediction node of the current node in the K prediction reference frames, determine N prediction nodes of the current node.
在一些实施例中,确定单元21,具体用于在所述当前待编码帧中,确定所述当前节点的M个领域节点,所述M个领域节点中包括所述当前节点,所述M为正整数;针对所述M个领域节点中的第i个领域节点,确定所述第i个领域节点在所述第k个预测参考帧中的对应节点,所述i为小于或等于M的正整数;基于所述M个领域节点在所述第k个预测参考帧中的对应节点,确定所述当前节点在所述第k个预测参考帧中的至少一个预测节点。In some embodiments, the determination unit 21 is specifically used to determine M domain nodes of the current node in the current frame to be encoded, where the M domain nodes include the current node, and M is a positive integer; for the i-th domain node among the M domain nodes, determine the corresponding node of the i-th domain node in the k-th prediction reference frame, where i is a positive integer less than or equal to M; based on the corresponding nodes of the M domain nodes in the k-th prediction reference frame, determine at least one prediction node of the current node in the k-th prediction reference frame.
在一些实施例中,确定单元21,具体用于确定所述当前节点在所述第k个预测参考帧中的对应节点;确定所述对应节点的至少一个领域节点;将所述至少一个领域节点,确定为所述当前节点在所述第k个预测参考帧中的至少一个预测节点。In some embodiments, the determination unit 21 is specifically used to determine the corresponding node of the current node in the kth prediction reference frame; determine at least one domain node of the corresponding node; and determine the at least one domain node as at least one prediction node of the current node in the kth prediction reference frame.
在一些实施例中,确定单元21,具体用于在所述当前待编码帧中,确定第i个节点的父节点,作为第i个父节点,所述第i个节点为所述第i个领域节点或者为所述当前节点;确定所述第i个父节点在所述第k个预测参考帧中的匹配节点,作为第i个匹配节点;将所述i个匹配节点的子节点中的一个子节点,确定为所述第i个节点在所述第k个预测参考帧中的对应节点。In some embodiments, the determination unit 21 is specifically used to determine the parent node of the i-th node in the current frame to be encoded, as the i-th parent node, the i-th node being the i-th domain node or the current node; determine the matching node of the i-th parent node in the k-th prediction reference frame as the i-th matching node; and determine one of the child nodes of the i-matching node as the corresponding node of the i-th node in the k-th prediction reference frame.
在一些实施例中,确定单元21,具体用于基于所述第i个父节点的占位信息,确定所述第i个父节点在所述第k个预测参考帧中的匹配节点。In some embodiments, the determination unit 21 is specifically configured to determine a matching node of the i-th parent node in the k-th prediction reference frame based on the placeholder information of the i-th parent node.
在一些实施例中,确定单元21,具体用于将所述第k个预测参考帧中,占位信息与所述第i个父节点的占位信息之间的差异最小的节点,确定为所述第i个父节点在所述第k个预测参考帧中的匹配节点。In some embodiments, the determination unit 21 is specifically configured to determine the node whose placeholder information in the kth prediction reference frame has the smallest difference with the placeholder information of the i-th parent node as the matching node of the i-th parent node in the k-th prediction reference frame.
在一些实施例中,确定单元21,具体用于确定所述第i个节点在所述父节点所包括的子节点中的第一序号;将所述第i个匹配节点的子节点中序号为第一序号的子节点,确定为所述第i个节点在所述第k个预测参考帧中的对应节点。In some embodiments, the determination unit 21 is specifically used to determine the first serial number of the i-th node among the child nodes included in the parent node; and determine the child node with the first serial number among the child nodes of the i-th matching node as the corresponding node of the i-th node in the k-th prediction reference frame.
在一些实施例中,确定单元21,具体用于将所述M个领域节点在所述第k个预测参考帧中的对应节点,确定为所述当前节点在所述第k个预测参考帧中的至少一个预测节点。In some embodiments, the determination unit 21 is specifically configured to determine corresponding nodes of the M domain nodes in the k-th prediction reference frame as at least one prediction node of the current node in the k-th prediction reference frame.
在一些实施例中,确定单元21,具体用于将所述当前节点在所述K个预测参考帧中的至少一个预测节点,确定为所述当前节点的N个预测节点。In some embodiments, the determination unit 21 is specifically configured to determine at least one prediction node of the current node in the K prediction reference frames as N prediction nodes of the current node.
在一些实施例中,若所述当前待编码帧为P帧,则所述K个预测参考帧包括所述当前待编码帧的前向帧。In some embodiments, if the current frame to be encoded is a P frame, the K prediction reference frames include a forward frame of the current frame to be encoded.
在一些实施例中,若所述当前待编码帧为B帧,则所述K个预测参考帧包括所述当前待编码帧的前向帧和后向帧。In some embodiments, if the current frame to be encoded is a B frame, the K prediction reference frames include a forward frame and a backward frame of the current frame to be encoded.
在一些实施例中,所述预测节点的直接编码信息包括第一直接编码信息,编码单元22,具体用于将所述N个预测节点的第一直接编码信息作为上下文信息,对所述当前节点对应的第一标志进行预测编码,所述第一标志用于表示所述当前节点所包括的点数是否大于1。In some embodiments, the direct encoding information of the prediction node includes first direct encoding information, and the encoding unit 22 is specifically used to use the first direct encoding information of the N prediction nodes as context information to perform predictive encoding on a first flag corresponding to the current node, where the first flag is used to indicate whether the number of points included in the current node is greater than 1.
在一些实施例中,编码单元22,具体用于基于所述N个预测节点的第一直接编码信息,确定第一上下文模型的索引;基于所述第一上下文模型的索引,从所述第一标志对应的Q个上下文模型中,选出所述第一上下文模型,所述Q为大于1的正整数;使用所述第一上下文模型,对所述当前节点对应的第一标志进行预测编码。In some embodiments, the encoding unit 22 is specifically used to determine the index of the first context model based on the first direct encoding information of the N prediction nodes; based on the index of the first context model, select the first context model from the Q context models corresponding to the first flag, where Q is a positive integer greater than 1; and use the first context model to predictively encode the first flag corresponding to the current node.
在一些实施例中,编码单元22,具体用于针对所述N个预测节点中的任一预测节点,基于所述预测节点的第一直接编码信息,确定所述预测节点对应的第一数值;基于所述N个预测节点对应的第一数值,确定所述第一上下文模型的索引。In some embodiments, the encoding unit 22 is specifically used to determine, for any prediction node among the N prediction nodes, a first numerical value corresponding to the prediction node based on the first direct encoding information of the prediction node; and determine the index of the first context model based on the first numerical values corresponding to the N prediction nodes.
在一些实施例中,编码单元22,具体用于确定所述预测节点对应的第一权重;基于所述第一权重,对所述N个预测节点对应的第一数值进行加权处理,得到第一加权预测值;基于所述第一加权预测值,确定所述第一上下文模型的索引。In some embodiments, the encoding unit 22 is specifically used to determine a first weight corresponding to the prediction node; based on the first weight, weighted processing is performed on the first numerical values corresponding to the N prediction nodes to obtain a first weighted prediction value; based on the first weighted prediction value, the index of the first context model is determined.
在一些实施例中,若所述K大于1时,则编码单元22,具体用于针对所述K个预测参考帧中的第j个预测参考帧,基于所述当前节点在所述第j个预测参考帧中的预测节点的第一直接编码信息,确定所述第j个预测参考帧中的预测节点对应的第一数值,所述j为小于或等于K的正整数;确定所述预测节点对应的第一权重,并基于所述第一权重对所述第j个预测参考帧中的预测节点对应的第一数值进行加权处理,得到所述第j个预测参考帧对应的第二加权预测值;基于所述K个预测参考帧对应的第二加权预测值,确定所述第一上下文模型的索引。In some embodiments, if K is greater than 1, the encoding unit 22 is specifically used to determine, for the j-th prediction reference frame among the K prediction reference frames, a first numerical value corresponding to the prediction node in the j-th prediction reference frame based on the first direct encoding information of the prediction node of the current node in the j-th prediction reference frame, where j is a positive integer less than or equal to K; determine a first weight corresponding to the prediction node, and weightedly process the first numerical value corresponding to the prediction node in the j-th prediction reference frame based on the first weight to obtain a second weighted prediction value corresponding to the j-th prediction reference frame; and determine the index of the first context model based on the second weighted prediction values corresponding to the K prediction reference frames.
在一些实施例中,编码单元22,具体用于确定所述K个预测参考帧对应的第二权重;基于所述第二权重对所述K个预测参考帧分别对应的第二加权预测值进行加权处理,得到所述第一上下文模型的索引。In some embodiments, the encoding unit 22 is specifically used to determine the second weights corresponding to the K prediction reference frames; based on the second weights, weighted processing is performed on the second weighted prediction values corresponding to the K prediction reference frames respectively to obtain the index of the first context model.
在一些实施例中,所述第一直接编码信息包括第二信息和预测节点的点数中的至少一个,所述第二信息表示所述预测节点是否满足所述直接编码方式进行编码的条件,编码单元22,具体用于基于所述第二信息,确定第一子值;基于所述预测节点的点数,确定第二子值;基于所述第一子值和/或所述第二子值,确定所述预测节点对应的第一数值。In some embodiments, the first direct encoding information includes at least one of second information and the number of points of the prediction node, the second information indicates whether the prediction node satisfies the conditions for encoding by the direct encoding method, and the encoding unit 22 is specifically used to determine the first sub-value based on the second information; determine the second sub-value based on the number of points of the prediction node; determine the first numerical value corresponding to the prediction node based on the first sub-value and/or the second sub-value.
在一些实施例中,编码单元22,具体用于若所述预测节点所包括的点数大于第一预设值,则确定所述第二子值的取值为第一值;若所述预测节点所包括的点数小于或等于第一预设值,则确定所述第二子值的取值为第二值。In some embodiments, the encoding unit 22 is specifically used to determine that the value of the second sub-value is a first value if the number of points included in the prediction node is greater than a first preset value; if the number of points included in the prediction node is less than or equal to the first preset value, determine that the value of the second sub-value is a second value.
在一些实施例中,所述预测节点的直接编码信息包括第二直接编码信息,编码单元22,还用于将所述N个预测节点的第二直接编码信息作为上下文信息,对所述当前节点对应的第二标志进行预测编码,所述第二标志用于表示所述当前节点的第二个点是否为重复点。In some embodiments, the direct encoding information of the prediction node includes second direct encoding information, and the encoding unit 22 is further used to use the second direct encoding information of the N prediction nodes as context information to perform predictive encoding on the second flag corresponding to the current node, and the second flag is used to indicate whether the second point of the current node is a repeated point.
在一些实施例中,编码单元22,具体用于基于所述N个预测节点的第二直接编码信息,确定第二上下文模型的索引;基于所述第二上下文模型的索引,从所述第二标志对应的P个上下文模型中,选出所述第二上下文模型,所述P为大于1的正整数;使用所述第二上下文模型,对所述当前节点对应的第二标志进行预测编码。In some embodiments, the encoding unit 22 is specifically used to determine the index of the second context model based on the second direct encoding information of the N prediction nodes; based on the index of the second context model, select the second context model from the P context models corresponding to the second flag, where P is a positive integer greater than 1; and use the second context model to predictively encode the second flag corresponding to the current node.
在一些实施例中,编码单元22,具体用于针对所述N个预测节点中的任一预测节点,基于所述预测节点的第二直 接编码信息,确定所述预测节点对应的第二数值;基于所述N个预测节点对应的第二数值,确定所述第二上下文模型的索引。In some embodiments, the encoding unit 22 is specifically used to determine, for any prediction node among the N prediction nodes, a second numerical value corresponding to the prediction node based on the second direct encoding information of the prediction node; and determine an index of the second context model based on the second numerical values corresponding to the N prediction nodes.
在一些实施例中,编码单元22,具体用于确定所述预测节点对应的第一权重;基于所述第一权重,对所述N个预测节点对应的第二数值进行加权处理,得到第三加权预测值;基于所述第三加权预测值,确定所述第二上下文模型的索引。In some embodiments, the encoding unit 22 is specifically used to determine a first weight corresponding to the prediction node; based on the first weight, weighted processing is performed on the second numerical values corresponding to the N prediction nodes to obtain a third weighted prediction value; based on the third weighted prediction value, the index of the second context model is determined.
在一些实施例中,若所述K大于1时,则编码单元22,具体用于针对所述K个预测参考帧中的第j个预测参考帧,基于所述当前节点在所述第j个预测参考帧中的预测节点的第二直接编码信息,确定所述第j个预测参考帧中的预测节点对应的第二数值,所述j为小于或等于K的正整数;确定所述预测节点对应的第一权重,并基于所述第一权重对所述第j个预测参考帧中的预测节点对应的第二数值进行加权处理,得到所述第j个预测参考帧对应的第四加权预测值;基于所述K个预测参考帧对应的第四加权预测值,确定所述第二上下文模型的索引。In some embodiments, if K is greater than 1, the encoding unit 22 is specifically used to determine, for the j-th prediction reference frame among the K prediction reference frames, a second numerical value corresponding to the prediction node in the j-th prediction reference frame based on the second direct encoding information of the prediction node of the current node in the j-th prediction reference frame, where j is a positive integer less than or equal to K; determine a first weight corresponding to the prediction node, and weightedly process the second numerical value corresponding to the prediction node in the j-th prediction reference frame based on the first weight to obtain a fourth weighted prediction value corresponding to the j-th prediction reference frame; and determine the index of the second context model based on the fourth weighted prediction values corresponding to the K prediction reference frames.
在一些实施例中,编码单元22,具体用于确定所述K个预测参考帧对应的第二权重;基于所述第二权重对所述K个预测参考帧分别对应的第四加权预测值进行加权处理,得到所述第二上下文模型的索引。In some embodiments, the encoding unit 22 is specifically used to determine the second weights corresponding to the K prediction reference frames; based on the second weights, weighted processing is performed on the fourth weighted prediction values corresponding to the K prediction reference frames respectively to obtain the index of the second context model.
在一些实施例中,所述第二直接编码信息包括第二信息和第三信息中的至少一个,所述第二信息表示所述预测节点是否满足所述直接编码方式进行编码的条件,所述第三信息表示所述预测节点的第二个点是否为重复点,编码单元22,具体用于基于所述第二信息,确定第一子值;基于所述第三信息,确定第三子值;基于所述第一子值和/或所述第三子值,确定所述预测节点对应的第二数值。In some embodiments, the second direct encoding information includes at least one of second information and third information, the second information indicates whether the prediction node satisfies the conditions for encoding by the direct encoding method, and the third information indicates whether the second point of the prediction node is a repeated point. The encoding unit 22 is specifically used to determine the first sub-value based on the second information; determine the third sub-value based on the third information; and determine the second numerical value corresponding to the prediction node based on the first sub-value and/or the third sub-value.
在一些实施例中,编码单元22,具体用于若所述第三信息表示所述预测节点的第二个点是重复点,则确定所述第三子值的取值为第一值;若所述第三信息表示所述预测节点的第二个点不是重复点,则确定所述第三子值的取值为第二值。In some embodiments, the encoding unit 22 is specifically used to determine that the value of the third sub-value is the first value if the third information indicates that the second point of the prediction node is a repeated point; if the third information indicates that the second point of the prediction node is not a repeated point, determine that the value of the third sub-value is the second value.
在一些实施例中,所述预测节点的直接编码信息包括第三直接编码信息,编码单元22,还用于若所述当前节点的第二个点是重复点时,则将所述N个预测节点的第三直接编码信息作为上下文信息,对所述当前节点对应的第三标志进行预测编码,所述第三标志用于表示所述当前节点的第三个点是否为重复点。In some embodiments, the direct encoding information of the prediction node includes third direct encoding information, and the encoding unit 22 is also used to use the third direct encoding information of the N prediction nodes as context information to perform predictive encoding on the third flag corresponding to the current node if the second point of the current node is a repeated point. The third flag is used to indicate whether the third point of the current node is a repeated point.
在一些实施例中,编码单元22,具体用于基于所述N个预测节点的第三直接编码信息,确定第三上下文模型的索引;基于所述第三上下文模型的索引,从所述第三标志对应的R个上下文模型中,选出所述第三上下文模型,所述R为大于1的正整数;使用所述第三上下文模型,对所述当前节点对应的第三标志进行预测编码。In some embodiments, the encoding unit 22 is specifically used to determine the index of the third context model based on the third direct encoding information of the N prediction nodes; based on the index of the third context model, select the third context model from the R context models corresponding to the third flag, where R is a positive integer greater than 1; and use the third context model to predict and encode the third flag corresponding to the current node.
在一些实施例中,编码单元22,具体用于针对所述N个预测节点中的任一预测节点,基于所述预测节点的第三直接编码信息,确定所述预测节点对应的第三数值;基于所述N个预测节点对应的第三数值,确定所述第三上下文模型的索引。In some embodiments, the encoding unit 22 is specifically used to determine, for any prediction node among the N prediction nodes, a third numerical value corresponding to the prediction node based on the third direct encoding information of the prediction node; and determine an index of the third context model based on the third numerical values corresponding to the N prediction nodes.
在一些实施例中,编码单元22,具体用于确定所述预测节点对应的第一权重;基于所述第一权重,对所述N个预测节点对应的第三数值进行加权处理,得到第五加权预测值;基于所述第五加权预测值,确定所述第三上下文模型的索引。In some embodiments, the encoding unit 22 is specifically used to determine a first weight corresponding to the prediction node; based on the first weight, weighted processing is performed on the third numerical values corresponding to the N prediction nodes to obtain a fifth weighted prediction value; based on the fifth weighted prediction value, the index of the third context model is determined.
在一些实施例中,若所述K大于1时,则编码单元22,具体用于针对所述K个预测参考帧中的第j个预测参考帧,基于所述当前节点在所述第j个预测参考帧中的预测节点的第三直接编码信息,确定所述第j个预测参考帧中的预测节点对应的第三数值,所述j为小于或等于K的正整数;确定所述预测节点对应的第一权重,并基于所述第一权重对所述第j个预测参考帧中的预测节点对应的第三数值进行加权处理,得到所述第j个预测参考帧对应的第六加权预测值;基于所述K个预测参考帧对应的第六加权预测值,确定所述第三上下文模型的索引。In some embodiments, if K is greater than 1, the encoding unit 22 is specifically used to determine, for the j-th prediction reference frame among the K prediction reference frames, a third numerical value corresponding to the prediction node in the j-th prediction reference frame based on the third direct encoding information of the prediction node of the current node in the j-th prediction reference frame, where j is a positive integer less than or equal to K; determine a first weight corresponding to the prediction node, and weightedly process the third numerical value corresponding to the prediction node in the j-th prediction reference frame based on the first weight to obtain a sixth weighted prediction value corresponding to the j-th prediction reference frame; and determine an index of the third context model based on the sixth weighted prediction values corresponding to the K prediction reference frames.
在一些实施例中,编码单元22,具体用于确定所述K个预测参考帧对应的第二权重;基于所述第二权重对所述K个预测参考帧分别对应的第六加权预测值进行加权处理,得到所述第三上下文模型的索引。In some embodiments, the encoding unit 22 is specifically used to determine the second weights corresponding to the K prediction reference frames; based on the second weights, weighted processing is performed on the sixth weighted prediction values corresponding to the K prediction reference frames respectively to obtain the index of the third context model.
在一些实施例中,所述第三直接编码信息包括第二信息和第四信息中的至少一个,所述第二信息表示所述预测节点是否满足所述直接编码方式进行编码的条件,所述第四信息表示所述预测节点的第三个点是否为重复点,编码单元22,具体用于基于所述第二信息,确定第一子值;基于所述第四信息,确定第四子值;基于所述第一子值和/或所述第四子值,确定所述预测节点对应的第三数值。In some embodiments, the third direct encoding information includes at least one of the second information and the fourth information, the second information indicates whether the prediction node satisfies the conditions for encoding by the direct encoding method, and the fourth information indicates whether the third point of the prediction node is a repeated point. The encoding unit 22 is specifically used to determine the first sub-value based on the second information; determine the fourth sub-value based on the fourth information; and determine the third numerical value corresponding to the prediction node based on the first sub-value and/or the fourth sub-value.
在一些实施例中,编码单元22,具体用于若所述第四信息表示所述预测节点的第三个点是重复点,则确定所述第四子值的取值为第一值;若所述第四信息表示所述预测节点的第三个点不是重复点,则确定所述第四子值的取值为第二值。In some embodiments, the encoding unit 22 is specifically used to determine that the value of the fourth sub-value is the first value if the fourth information indicates that the third point of the prediction node is a repeated point; if the fourth information indicates that the third point of the prediction node is not a repeated point, determine that the value of the fourth sub-value is the second value.
在一些实施例中,编码单元22,还用于若所述第三标志表示所述当前节点的第三个点为重复点时,则对所述当前节点所包括的剩余重复点数目进行编码。In some embodiments, the encoding unit 22 is further configured to encode the number of remaining repeated points included in the current node if the third flag indicates that the third point of the current node is a repeated point.
在一些实施例中,编码单元22,具体用于若所述第二信息指示所述预测节点满足所述直接编码方式进行编码的条件时,则确定所述第一子值的取值为第一值;若所述第二信息指示所述预测节点不满足所述直接编码方式进行编码的条件时,则确定所述第一子值的取值为第二值。In some embodiments, the encoding unit 22 is specifically used to determine that the value of the first sub-value is a first value if the second information indicates that the prediction node satisfies the condition for encoding by the direct encoding method; if the second information indicates that the prediction node does not satisfy the condition for encoding by the direct encoding method, then determine that the value of the first sub-value is a second value.
在一些实施例中,所述第二信息包括所述预测节点的直接编码模式信息。In some embodiments, the second information includes direct encoding mode information of the prediction node.
在一些实施例中,编码单元22,具体用于基于所述预测节点对应的领域节点与所述当前节点之间的距离,确定所述预测节点对应的第一权重。In some embodiments, the encoding unit 22 is specifically configured to determine a first weight corresponding to the prediction node based on a distance between a domain node corresponding to the prediction node and the current node.
在一些实施例中,编码单元22,具体用于基于所述预测参考帧与所述当前待编码帧之间的时间差距,确定所述预测参考帧对应的第二权重。In some embodiments, the encoding unit 22 is specifically configured to determine a second weight corresponding to the predicted reference frame based on a time difference between the predicted reference frame and the current frame to be encoded.
应理解,装置实施例与方法实施例可以相互对应,类似的描述可以参照方法实施例。为避免重复,此处不再赘述。具体地,图17所示的点云编码装置20可以对应于执行本申请实施例的点云编码方法中的相应主体,并且点云编码装置20中的各个单元的前述和其它操作和/或功能分别为了实现点云编码方法中的相应流程,为了简洁,在此不再赘述。It should be understood that the device embodiment and the method embodiment may correspond to each other, and similar descriptions may refer to the method embodiment. To avoid repetition, it will not be repeated here. Specifically, the point cloud encoding device 20 shown in Figure 17 may correspond to the corresponding subject in the point cloud encoding method of the embodiment of the present application, and the aforementioned and other operations and/or functions of each unit in the point cloud encoding device 20 are respectively for implementing the corresponding processes in the point cloud encoding method. For the sake of brevity, they will not be repeated here.
上文中结合附图从功能单元的角度描述了本申请实施例的装置和***。应理解,该功能单元可以通过硬件形式实 现,也可以通过软件形式的指令实现,还可以通过硬件和软件单元组合实现。具体地,本申请实施例中的方法实施例的各步骤可以通过处理器中的硬件的集成逻辑电路和/或软件形式的指令完成,结合本申请实施例公开的方法的步骤可以直接体现为硬件译码处理器执行完成,或者用译码处理器中的硬件及软件单元组合执行完成。可选地,软件单元可以位于随机存储器,闪存、只读存储器、可编程只读存储器、电可擦写可编程存储器、寄存器等本领域的成熟的存储介质中。该存储介质位于存储器,处理器读取存储器中的信息,结合其硬件完成上述方法实施例中的步骤。The above describes the device and system of the embodiment of the present application from the perspective of the functional unit in conjunction with the accompanying drawings. It should be understood that the functional unit can be implemented in hardware form, can be implemented by instructions in software form, and can also be implemented by a combination of hardware and software units. Specifically, the steps of the method embodiment in the embodiment of the present application can be completed by the hardware integrated logic circuit and/or software form instructions in the processor, and the steps of the method disclosed in the embodiment of the present application can be directly embodied as a hardware decoding processor to perform, or a combination of hardware and software units in the decoding processor to perform. Optionally, the software unit can be located in a mature storage medium in the field such as a random access memory, a flash memory, a read-only memory, a programmable read-only memory, an electrically erasable programmable memory, a register, etc. The storage medium is located in a memory, and the processor reads the information in the memory, and completes the steps in the above method embodiment in conjunction with its hardware.
图18是本申请实施例提供的电子设备的示意性框图。FIG. 18 is a schematic block diagram of an electronic device provided in an embodiment of the present application.
如图18所示,该电子设备30可以为本申请实施例所述的点云解码设备,或者点云编码设备,该电子设备30可包括:As shown in FIG. 18 , the electronic device 30 may be a point cloud decoding device or a point cloud encoding device as described in an embodiment of the present application, and the electronic device 30 may include:
存储器33和处理器32,该存储器33用于存储计算机程序34,并将该程序代码34传输给该处理器32。换言之,该处理器32可以从存储器33中调用并运行计算机程序34,以实现本申请实施例中的方法。The memory 33 and the processor 32, the memory 33 is used to store the computer program 34 and transmit the program code 34 to the processor 32. In other words, the processor 32 can call and run the computer program 34 from the memory 33 to implement the method in the embodiment of the present application.
例如,该处理器32可用于根据该计算机程序34中的指令执行上述方法200中的步骤。For example, the processor 32 may be configured to execute the steps in the method 200 according to the instructions in the computer program 34 .
在本申请的一些实施例中,该处理器32可以包括但不限于:In some embodiments of the present application, the processor 32 may include but is not limited to:
通用处理器、数字信号处理器(Digital Signal Processor,DSP)、专用集成电路(Application Specific Integrated Circuit,ASIC)、现场可编程门阵列(Field Programmable Gate Array,FPGA)或者其他可编程逻辑器件、分立门或者晶体管逻辑器件、分立硬件组件等等。General-purpose processor, digital signal processor (DSP), application-specific integrated circuit (ASIC), field programmable gate array (FPGA) or other programmable logic devices, discrete gate or transistor logic devices, discrete hardware components, etc.
在本申请的一些实施例中,该存储器33包括但不限于:In some embodiments of the present application, the memory 33 includes but is not limited to:
易失性存储器和/或非易失性存储器。其中,非易失性存储器可以是只读存储器(Read-Only Memory,ROM)、可编程只读存储器(Programmable ROM,PROM)、可擦除可编程只读存储器(Erasable PROM,EPROM)、电可擦除可编程只读存储器(Electrically EPROM,EEPROM)或闪存。易失性存储器可以是随机存取存储器(Random Access Memory,RAM),其用作外部高速缓存。通过示例性但不是限制性说明,许多形式的RAM可用,例如静态随机存取存储器(Static RAM,SRAM)、动态随机存取存储器(Dynamic RAM,DRAM)、同步动态随机存取存储器(Synchronous DRAM,SDRAM)、双倍数据速率同步动态随机存取存储器(Double Data Rate SDRAM,DDR SDRAM)、增强型同步动态随机存取存储器(Enhanced SDRAM,ESDRAM)、同步连接动态随机存取存储器(synch link DRAM,SLDRAM)和直接内存总线随机存取存储器(Direct Rambus RAM,DR RAM)。Volatile memory and/or non-volatile memory. Among them, the non-volatile memory can be read-only memory (ROM), programmable read-only memory (PROM), erasable programmable read-only memory (EPROM), electrically erasable programmable read-only memory (EEPROM) or flash memory. The volatile memory can be random access memory (RAM), which is used as an external cache. By way of example and not limitation, many forms of RAM are available, such as static RAM (SRAM), dynamic RAM (DRAM), synchronous DRAM (SDRAM), double data rate synchronous dynamic random access memory (DDR SDRAM), enhanced synchronous dynamic random access memory (ESDRAM), synchronous link DRAM (SLDRAM) and direct RAM bus random access memory (Direct Rambus RAM, DR RAM).
在本申请的一些实施例中,该计算机程序34可以被分割成一个或多个单元,该一个或者多个单元被存储在该存储器33中,并由该处理器32执行,以完成本申请提供的方法。该一个或多个单元可以是能够完成特定功能的一系列计算机程序指令段,该指令段用于描述该计算机程序34在该电子设备30中的执行过程。In some embodiments of the present application, the computer program 34 may be divided into one or more units, which are stored in the memory 33 and executed by the processor 32 to complete the method provided by the present application. The one or more units may be a series of computer program instruction segments capable of completing specific functions, and the instruction segments are used to describe the execution process of the computer program 34 in the electronic device 30.
如图18所示,该电子设备30还可包括:As shown in FIG. 18 , the electronic device 30 may further include:
收发器33,该收发器33可连接至该处理器32或存储器33。The transceiver 33 may be connected to the processor 32 or the memory 33 .
其中,处理器32可以控制该收发器33与其他设备进行通信,具体地,可以向其他设备发送信息或数据,或接收其他设备发送的信息或数据。收发器33可以包括发射机和接收机。收发器33还可以进一步包括天线,天线的数量可以为一个或多个。The processor 32 may control the transceiver 33 to communicate with other devices, specifically, to send information or data to other devices, or to receive information or data sent by other devices. The transceiver 33 may include a transmitter and a receiver. The transceiver 33 may further include an antenna, and the number of antennas may be one or more.
应当理解,该电子设备30中的各个组件通过总线***相连,其中,总线***除包括数据总线之外,还包括电源总线、控制总线和状态信号总线。It should be understood that the various components in the electronic device 30 are connected via a bus system, wherein the bus system includes not only a data bus but also a power bus, a control bus and a status signal bus.
图19是本申请实施例提供的点云编解码***的示意性框图。Figure 19 is a schematic block diagram of the point cloud encoding and decoding system provided in an embodiment of the present application.
如图19所示,该点云编解码***40可包括:点云编码器41和点云解码器42,其中点云编码器41用于执行本申请实施例涉及的点云编码方法,点云解码器42用于执行本申请实施例涉及的点云解码方法。As shown in Figure 19, the point cloud encoding and decoding system 40 may include: a point cloud encoder 41 and a point cloud decoder 42, wherein the point cloud encoder 41 is used to execute the point cloud encoding method involved in the embodiment of the present application, and the point cloud decoder 42 is used to execute the point cloud decoding method involved in the embodiment of the present application.
本申请还提供了一种码流,该码流是根据上述编码方法生成的。The present application also provides a code stream, which is generated according to the above encoding method.
本申请还提供了一种计算机存储介质,其上存储有计算机程序,该计算机程序被计算机执行时使得该计算机能够执行上述方法实施例的方法。或者说,本申请实施例还提供一种包含指令的计算机程序产品,该指令被计算机执行时使得计算机执行上述方法实施例的方法。The present application also provides a computer storage medium on which a computer program is stored, and when the computer program is executed by a computer, the computer can perform the method of the above method embodiment. In other words, the present application embodiment also provides a computer program product containing instructions, and when the instructions are executed by a computer, the computer can perform the method of the above method embodiment.
当使用软件实现时,可以全部或部分地以计算机程序产品的形式实现。该计算机程序产品包括一个或多个计算机指令。在计算机上加载和执行该计算机程序指令时,全部或部分地产生按照本申请实施例该的流程或功能。该计算机可以是通用计算机、专用计算机、计算机网络、或者其他可编程装置。该计算机指令可以存储在计算机可读存储介质中,或者从一个计算机可读存储介质向另一个计算机可读存储介质传输,例如,该计算机指令可以从一个网站站点、计算机、服务器或数据中心通过有线(例如同轴电缆、光纤、数字用户线(digital subscriber line,DSL))或无线(例如红外、无线、微波等)方式向另一个网站站点、计算机、服务器或数据中心进行传输。该计算机可读存储介质可以是计算机能够存取的任何可用介质或者是包含一个或多个可用介质集成的服务器、数据中心等数据存储设备。该可用介质可以是磁性介质(例如,软盘、硬盘、磁带)、光介质(例如数字点云光盘(digital video disc,DVD))、或者半导体介质(例如固态硬盘(solid state disk,SSD))等。When software is used for implementation, it can be implemented in whole or in part in the form of a computer program product. The computer program product includes one or more computer instructions. When the computer program instructions are loaded and executed on a computer, the process or function according to the embodiment of the present application is generated in whole or in part. The computer can be a general-purpose computer, a special-purpose computer, a computer network, or other programmable devices. The computer instructions can be stored in a computer-readable storage medium, or transmitted from one computer-readable storage medium to another computer-readable storage medium. For example, the computer instructions can be transmitted from a website site, computer, server or data center by wired (e.g., coaxial cable, optical fiber, digital subscriber line (digital subscriber line, DSL)) or wireless (e.g., infrared, wireless, microwave, etc.) mode to another website site, computer, server or data center. The computer-readable storage medium can be any available medium that a computer can access or a data storage device such as a server or data center that includes one or more available media integrations. The available medium can be a magnetic medium (e.g., a floppy disk, a hard disk, a magnetic tape), an optical medium (e.g., a digital video disc (DVD)), or a semiconductor medium (e.g., a solid state disk (SSD)), etc.
本领域普通技术人员可以意识到,结合本文中所公开的实施例描述的各示例的单元及算法步骤,能够以电子硬件、或者计算机软件和电子硬件的结合来实现。这些功能究竟以硬件还是软件方式来执行,取决于技术方案的特定应用和设计约束条件。专业技术人员可以对每个特定的应用来使用不同方法来实现所描述的功能,但是这种实现不应认为超出本申请的范围。Those of ordinary skill in the art will appreciate that the units and algorithm steps of each example described in conjunction with the embodiments disclosed herein can be implemented in electronic hardware, or a combination of computer software and electronic hardware. Whether these functions are performed in hardware or software depends on the specific application and design constraints of the technical solution. Professional and technical personnel can use different methods to implement the described functions for each specific application, but such implementation should not be considered to be beyond the scope of this application.
在本申请所提供的几个实施例中,应该理解到,所揭露的***、装置和方法,可以通过其它的方式实现。例如,以上所描述的装置实施例仅仅是示意性的,例如,该单元的划分,仅仅为一种逻辑功能划分,实际实现时可以有另外的划分方式,例如多个单元或组件可以结合或者可以集成到另一个***,或一些特征可以忽略,或不执行。另一点,所显示或讨论的相互之间的耦合或直接耦合或通信连接可以是通过一些接口,装置或单元的间接耦合或通信连接,可以是电性,机械或其它的形式。In the several embodiments provided in the present application, it should be understood that the disclosed systems, devices and methods can be implemented in other ways. For example, the device embodiments described above are only schematic. For example, the division of the unit is only a logical function division. There may be other division methods in actual implementation, such as multiple units or components can be combined or integrated into another system, or some features can be ignored or not executed. Another point is that the mutual coupling or direct coupling or communication connection shown or discussed can be through some interfaces, indirect coupling or communication connection of devices or units, which can be electrical, mechanical or other forms.
作为分离部件说明的单元可以是或者也可以不是物理上分开的,作为单元显示的部件可以是或者也可以不是物理单元,即可以位于一个地方,或者也可以分布到多个网络单元上。可以根据实际的需要选择其中的部分或者全部单元来实现本实施例方案的目的。例如,在本申请各个实施例中的各功能单元可以集成在一个处理单元中,也可以是各个单元单独物理存在,也可以两个或两个以上单元集成在一个单元中。The units described as separate components may or may not be physically separated, and the components displayed as units may or may not be physical units, that is, they may be located in one place, or they may be distributed on multiple network units. Some or all of the units may be selected according to actual needs to achieve the purpose of the scheme of this embodiment. For example, each functional unit in each embodiment of the present application may be integrated into a processing unit, or each unit may exist physically separately, or two or more units may be integrated into one unit.
以上内容,仅为本申请的具体实施方式,但本申请的保护范围并不局限于此,任何熟悉本技术领域的技术人员在本申请揭露的技术范围内,可轻易想到变化或替换,都应涵盖在本申请的保护范围之内。因此,本申请的保护范围应以该权利要求的保护范围为准。The above contents are only specific implementation methods of the present application, but the protection scope of the present application is not limited thereto. Any technician familiar with the technical field can easily think of changes or substitutions within the technical scope disclosed in the present application, which should be included in the protection scope of the present application. Therefore, the protection scope of the present application should be based on the protection scope of the claims.

Claims (92)

  1. 一种点云解码方法,其特征在于,包括:A point cloud decoding method, characterized by comprising:
    确定当前节点对应的帧间解码信息,所述当前节点为当前待解码帧中的待解码节点;Determine inter-frame decoding information corresponding to a current node, wherein the current node is a node to be decoded in a current frame to be decoded;
    基于所述当前节点对应的帧间解码信息,对所述当前节点所包括的点数进行预测解码。Based on the inter-frame decoding information corresponding to the current node, predictive decoding is performed on the number of points included in the current node.
  2. 根据权利要求1所述的方法,其特征在于,所述帧间解码信息包括所述当前节点的N个预测节点的几何解码信息和所述当前节点对应的第一信息中的至少一个,所述第一信息用于指示所述当前节点是否允许启动帧间解码方式,所述确定当前节点对应的帧间解码信息,包括:The method according to claim 1 is characterized in that the inter-frame decoding information includes at least one of geometric decoding information of N prediction nodes of the current node and first information corresponding to the current node, the first information is used to indicate whether the current node allows the inter-frame decoding mode to be started, and the determining of the inter-frame decoding information corresponding to the current node includes:
    在所述当前待解码帧的预测参考帧中,确定所述当前节点的N个预测节点,和/或确定所述第一信息,所述N为正整数;In the prediction reference frame of the current frame to be decoded, determine N prediction nodes of the current node, and/or determine the first information, where N is a positive integer;
    所述基于所述当前节点对应的帧间解码信息,对所述当前节点所包括的点数进行预测解码,包括:The predictive decoding of the number of points included in the current node based on the inter-frame decoding information corresponding to the current node includes:
    基于所述N个预测节点的几何解码信息和/或所述第一信息,对所述当前节点所包括的点数进行预测解码。Based on the geometric decoding information of the N predicted nodes and/or the first information, the number of points included in the current node is predicted and decoded.
  3. 根据权利要求2所述的方法,其特征在于,所述预测节点的几何解码信息包括所述预测节点的直接解码信息,所述基于所述N个预测节点的几何解码信息和/或所述第一信息,对所述当前节点所包括的点数进行预测解码,包括:The method according to claim 2, characterized in that the geometric decoding information of the prediction node includes direct decoding information of the prediction node, and the predictive decoding of the number of points included in the current node based on the geometric decoding information of the N prediction nodes and/or the first information comprises:
    基于所述N个预测节点的直接解码信息和/或所述第一信息,对所述当前节点的点数进行预测解码。Based on the direct decoding information of the N prediction nodes and/or the first information, the number of points of the current node is predicted and decoded.
  4. 根据权利要求3所述的方法,其特征在于,所述当前待解码帧包括K个预测参考帧,所述在所述当前待解码帧的预测参考帧中,确定所述当前节点的N个预测节点,包括:The method according to claim 3, characterized in that the current frame to be decoded includes K prediction reference frames, and determining N prediction nodes of the current node in the prediction reference frames of the current frame to be decoded comprises:
    针对所述K个预测参考帧中的第k个预测参考帧,确定所述当前节点在所述第k个预测参考帧中的至少一个预测节点,所述k为小于或等于K的正整数,所述K为正整数;For a k-th prediction reference frame among the K prediction reference frames, determining at least one prediction node of the current node in the k-th prediction reference frame, where k is a positive integer less than or equal to K, and K is a positive integer;
    基于所述当前节点在所述K个预测参考帧中的至少一个预测节点,确定所述当前节点的N个预测节点。Based on at least one prediction node of the current node in the K prediction reference frames, N prediction nodes of the current node are determined.
  5. 根据权利要求4所述的方法,其特征在于,所述确定所述当前节点在所述第k个预测参考帧中的至少一个预测节点,包括:The method according to claim 4, characterized in that the determining of at least one prediction node of the current node in the k-th prediction reference frame comprises:
    在所述当前待解码帧中,确定所述当前节点的M个领域节点,所述M个领域节点中包括所述当前节点,所述M为正整数;In the current frame to be decoded, determine M domain nodes of the current node, the M domain nodes include the current node, and M is a positive integer;
    针对所述M个领域节点中的第i个领域节点,确定所述第i个领域节点在所述第k个预测参考帧中的对应节点,所述i为小于或等于M的正整数;For an i-th domain node among the M domain nodes, determine a corresponding node of the i-th domain node in the k-th prediction reference frame, where i is a positive integer less than or equal to M;
    基于所述M个领域节点在所述第k个预测参考帧中的对应节点,确定所述当前节点在所述第k个预测参考帧中的至少一个预测节点。Based on the corresponding nodes of the M domain nodes in the k-th prediction reference frame, at least one prediction node of the current node in the k-th prediction reference frame is determined.
  6. 根据权利要求4所述的方法,其特征在于,所述确定所述当前节点在所述第k个预测参考帧中的至少一个预测节点,包括:The method according to claim 4, characterized in that the determining of at least one prediction node of the current node in the k-th prediction reference frame comprises:
    确定所述当前节点在所述第k个预测参考帧中的对应节点;Determine a corresponding node of the current node in the k-th prediction reference frame;
    确定所述对应节点的至少一个领域节点;Determining at least one domain node of the corresponding node;
    将所述至少一个领域节点,确定为所述当前节点在所述第k个预测参考帧中的至少一个预测节点。The at least one domain node is determined as at least one prediction node of the current node in the k-th prediction reference frame.
  7. 根据权利要求5或6所述的方法,其特征在于,所述方法还包括:The method according to claim 5 or 6, characterized in that the method further comprises:
    在所述当前待解码帧中,确定第i个节点的父节点,作为第i个父节点,所述第i个节点为所述第i个领域节点或者为所述当前节点;In the current frame to be decoded, determine the parent node of the ith node as the ith parent node, the ith node being the ith domain node or the current node;
    确定所述第i个父节点在所述第k个预测参考帧中的匹配节点,作为第i个匹配节点;Determine a matching node of the i-th parent node in the k-th prediction reference frame as the i-th matching node;
    将所述i个匹配节点的子节点中的一个子节点,确定为所述第i个节点在所述第k个预测参考帧中的对应节点。One of the child nodes of the i matching nodes is determined as the corresponding node of the i-th node in the k-th prediction reference frame.
  8. 根据权利要求7所述的方法,其特征在于,所述确定所述第i个父节点在所述第k个预测参考帧中的匹配节点,包括:The method according to claim 7, characterized in that the determining the matching node of the i-th parent node in the k-th prediction reference frame comprises:
    基于所述第i个父节点的占位信息,确定所述第i个父节点在所述第k个预测参考帧中的匹配节点。Based on the placeholder information of the i-th parent node, a matching node of the i-th parent node in the k-th prediction reference frame is determined.
  9. 根据权利要求8所述的方法,其特征在于,所述基于所述第i个父节点的占位信息,确定所述第i个父节点在所述第k个预测参考帧中的匹配节点,包括:The method according to claim 8, characterized in that the determining, based on the placeholder information of the i-th parent node, a matching node of the i-th parent node in the k-th prediction reference frame comprises:
    将所述第k个预测参考帧中,占位信息与所述第i个父节点的占位信息之间的差异最小的节点,确定为所述第i个父节点在所述第k个预测参考帧中的匹配节点。A node whose placeholder information in the k-th prediction reference frame has the smallest difference with the placeholder information of the i-th parent node is determined as a matching node of the i-th parent node in the k-th prediction reference frame.
  10. 根据权利要求7所述的方法,其特征在于,所述将所述i个匹配节点的子节点中的一个子节点,确定为所述第i个节点在所述第k个预测参考帧中的对应节点,包括:The method according to claim 7, characterized in that the step of determining one of the child nodes of the i matching nodes as the corresponding node of the i-th node in the k-th prediction reference frame comprises:
    确定所述第i个节点在所述父节点所包括的子节点中的第一序号;Determine the first sequence number of the i-th node among the child nodes included in the parent node;
    将所述第i个匹配节点的子节点中序号为第一序号的子节点,确定为所述第i个节点在所述第k个预测参考帧中的对应节点。The child node with the first sequence number among the child nodes of the i-th matching node is determined as the corresponding node of the i-th node in the k-th prediction reference frame.
  11. 根据权利要求5所述的方法,其特征在于,所述基于所述M个领域节点在所述第k个预测参考帧中的对应节点,确定所述当前节点在所述第k个预测参考帧中的至少一个预测节点,包括:The method according to claim 5, characterized in that the determining at least one prediction node of the current node in the kth prediction reference frame based on the corresponding nodes of the M domain nodes in the kth prediction reference frame comprises:
    将所述M个领域节点在所述第k个预测参考帧中的对应节点,确定为所述当前节点在所述第k个预测参考帧中的至少一个预测节点。The corresponding nodes of the M domain nodes in the k-th prediction reference frame are determined as at least one prediction node of the current node in the k-th prediction reference frame.
  12. 根据权利要求4所述的方法,其特征在于,所述基于所述当前节点在所述K个预测参考帧中的至少一个预测节点,确定所述当前节点的N个预测节点,包括:The method according to claim 4, characterized in that the determining the N prediction nodes of the current node based on at least one prediction node of the current node in the K prediction reference frames comprises:
    将所述当前节点在所述K个预测参考帧中的至少一个预测节点,确定为所述当前节点的N个预测节点。At least one prediction node of the current node in the K prediction reference frames is determined as N prediction nodes of the current node.
  13. 根据权利要求4所述的方法,其特征在于,若所述当前待解码帧为P帧,则所述K个预测参考帧包括所述当前待解码帧的前向帧。The method according to claim 4 is characterized in that if the current frame to be decoded is a P frame, the K prediction reference frames include a forward frame of the current frame to be decoded.
  14. 根据权利要求4所述的方法,其特征在于,若所述当前待解码帧为B帧,则所述K个预测参考帧包括所述当前待解码帧的前向帧和后向帧。The method according to claim 4 is characterized in that, if the current frame to be decoded is a B frame, the K prediction reference frames include a forward frame and a backward frame of the current frame to be decoded.
  15. 根据权利要求4所述的方法,其特征在于,所述预测节点的直接解码信息包括第一直接解码信息,基于所述N个预测节点的直接解码信息,对所述当前节点所包括的点数进行预测解码,包括:The method according to claim 4, characterized in that the direct decoding information of the prediction node includes first direct decoding information, and based on the direct decoding information of the N prediction nodes, predicting and decoding the number of points included in the current node comprises:
    将所述N个预测节点的第一直接解码信息作为上下文信息,对所述当前节点对应的第一标志进行预测解码,所述第一标志用于表示所述当前节点所包括的点数是否大于1。The first direct decoding information of the N prediction nodes is used as context information to perform predictive decoding on a first flag corresponding to the current node, where the first flag is used to indicate whether the number of points included in the current node is greater than 1.
  16. 根据权利要求15所述的方法,其特征在于,所述将所述N个预测节点的第一直接解码信息作为上下文信息,对所述当前节点对应的第一标志进行预测解码,包括:The method according to claim 15, characterized in that the step of using the first direct decoding information of the N predicted nodes as context information to perform predictive decoding on the first flag corresponding to the current node comprises:
    基于所述N个预测节点的第一直接解码信息,确定第一上下文模型的索引;Determining an index of a first context model based on first direct decoding information of the N prediction nodes;
    基于所述第一上下文模型的索引,从所述第一标志对应的Q个上下文模型中,选出所述第一上下文模型,所述Q为大于1的正整数;Based on the index of the first context model, selecting the first context model from Q context models corresponding to the first flag, where Q is a positive integer greater than 1;
    使用所述第一上下文模型,对所述当前节点对应的第一标志进行预测解码。Using the first context model, predictive decoding is performed on a first flag corresponding to the current node.
  17. 根据权利要求16所述的方法,其特征在于,所述基于所述N个预测节点的第一直接解码信息,确定所述第一上下文模型的索引,包括:The method according to claim 16, characterized in that the determining the index of the first context model based on the first direct decoding information of the N prediction nodes comprises:
    针对所述N个预测节点中的任一预测节点,基于所述预测节点的第一直接解码信息,确定所述预测节点对应的第一数值;For any prediction node among the N prediction nodes, determining a first value corresponding to the prediction node based on first direct decoding information of the prediction node;
    基于所述N个预测节点对应的第一数值,确定所述第一上下文模型的索引。Based on first numerical values corresponding to the N prediction nodes, an index of the first context model is determined.
  18. 根据权利要求17所述的方法,其特征在于,所述基于所述N个预测节点对应的第一数值,确定所述第一上下文模型的索引,包括:The method according to claim 17, characterized in that determining the index of the first context model based on the first numerical values corresponding to the N prediction nodes comprises:
    确定所述预测节点对应的第一权重;Determining a first weight corresponding to the prediction node;
    基于所述第一权重,对所述N个预测节点对应的第一数值进行加权处理,得到第一加权预测值;Based on the first weight, weighting the first values corresponding to the N prediction nodes to obtain a first weighted prediction value;
    基于所述第一加权预测值,确定所述第一上下文模型的索引。Based on the first weighted prediction value, determine the index of the first context model.
  19. 根据权利要求16所述的方法,其特征在于,若所述K大于1时,则所述基于所述N个预测节点的第一直接解码信息,确定第一上下文模型的索引,包括:The method according to claim 16, characterized in that, if K is greater than 1, determining the index of the first context model based on the first direct decoding information of the N prediction nodes comprises:
    针对所述K个预测参考帧中的第j个预测参考帧,基于所述当前节点在所述第j个预测参考帧中的预测节点的第一直接解码信息,确定所述第j个预测参考帧中的预测节点对应的第一数值,所述j为小于或等于K的正整数;For a j-th prediction reference frame among the K prediction reference frames, determining a first value corresponding to the prediction node in the j-th prediction reference frame based on first direct decoding information of the prediction node of the current node in the j-th prediction reference frame, where j is a positive integer less than or equal to K;
    确定所述预测节点对应的第一权重,并基于所述第一权重对所述第j个预测参考帧中的预测节点对应的第一数值进行加权处理,得到所述第j个预测参考帧对应的第二加权预测值;Determine a first weight corresponding to the prediction node, and perform weighted processing on a first value corresponding to the prediction node in the j-th prediction reference frame based on the first weight to obtain a second weighted prediction value corresponding to the j-th prediction reference frame;
    基于所述K个预测参考帧对应的第二加权预测值,确定所述第一上下文模型的索引。Based on the second weighted prediction values corresponding to the K prediction reference frames, an index of the first context model is determined.
  20. 根据权利要求19所述的方法,其特征在于,所述基于所述K个预测参考帧对应的第二加权预测值,确定所述第一上下文模型的索引,包括:The method according to claim 19, characterized in that the determining the index of the first context model based on the second weighted prediction values corresponding to the K prediction reference frames comprises:
    确定所述K个预测参考帧对应的第二权重;Determine second weights corresponding to the K prediction reference frames;
    基于所述第二权重对所述K个预测参考帧分别对应的第二加权预测值进行加权处理,得到所述第一上下文模型的索引。Based on the second weight, weighted processing is performed on the second weighted prediction values corresponding to the K prediction reference frames respectively to obtain an index of the first context model.
  21. 根据权利要求17或19所述的方法,其特征在于,所述第一直接解码信息包括第二信息和预测节点的点数中的至少一个,所述第二信息表示所述预测节点是否满足所述直接解码方式进行解码的条件,基于所述预测节点的第一直接解码信息,确定所述预测节点对应的第一数值,包括:The method according to claim 17 or 19, characterized in that the first direct decoding information includes at least one of second information and the number of points of the prediction node, the second information indicates whether the prediction node satisfies the condition for decoding by the direct decoding method, and determining the first value corresponding to the prediction node based on the first direct decoding information of the prediction node comprises:
    基于所述第二信息,确定第一子值;Based on the second information, determining a first sub-value;
    基于所述预测节点的点数,确定第二子值;Determining a second sub-value based on the number of points of the prediction node;
    基于所述第一子值和/或所述第二子值,确定所述预测节点对应的第一数值。Based on the first sub-value and/or the second sub-value, a first numerical value corresponding to the prediction node is determined.
  22. 根据权利要求21所述的方法,其特征在于,所述基于所述预测节点的点数,确定第二子值,包括:The method according to claim 21, characterized in that the determining the second sub-value based on the number of points of the prediction node comprises:
    若所述预测节点所包括的点数大于第一预设值,则确定所述第二子值的取值为第一值;If the number of points included in the prediction node is greater than a first preset value, determining that the value of the second sub-value is the first value;
    若所述预测节点所包括的点数小于或等于第一预设值,则确定所述第二子值的取值为第二值。If the number of points included in the prediction node is less than or equal to a first preset value, the value of the second sub-value is determined to be a second value.
  23. 根据权利要求15所述的方法,其特征在于,所述方法还包括:The method according to claim 15, characterized in that the method further comprises:
    若所述当前节点对应的第一标志表示所述当前节点所包括的点数大于1时,则确定所述当前节点包括2个点。If the first flag corresponding to the current node indicates that the number of points included in the current node is greater than 1, it is determined that the current node includes 2 points.
  24. 根据权利要求15所述的方法,其特征在于,所述预测节点的直接解码信息包括第二直接解码信息,所述方法还包括:The method according to claim 15, characterized in that the direct decoding information of the prediction node includes second direct decoding information, and the method further comprises:
    若所述当前节点对应的第一标志表示所述当前节点所包括的点数小于或等于1时,则将所述N个预测节点的第二直接解码信息作为上下文信息,对所述当前节点对应的第二标志进行预测解码,所述第二标志用于表示所述当前节点的第二个点是否为重复点。If the first flag corresponding to the current node indicates that the number of points included in the current node is less than or equal to 1, the second direct decoding information of the N predicted nodes is used as context information to predict and decode the second flag corresponding to the current node, and the second flag is used to indicate whether the second point of the current node is a repeated point.
  25. 根据权利要求24所述的方法,其特征在于,所述将所述N个预测节点的第二直接解码信息作为上下文信息,对所述当前节点对应的第二标志进行预测解码,包括:The method according to claim 24, characterized in that the step of using the second direct decoding information of the N predicted nodes as context information to perform predictive decoding on the second flag corresponding to the current node comprises:
    基于所述N个预测节点的第二直接解码信息,确定第二上下文模型的索引;Determining an index of a second context model based on the second direct decoding information of the N prediction nodes;
    基于所述第二上下文模型的索引,从所述第二标志对应的P个上下文模型中,选出所述第二上下文模型,所述P为大于1的正整数;Based on the index of the second context model, selecting the second context model from P context models corresponding to the second flag, where P is a positive integer greater than 1;
    使用所述第二上下文模型,对所述当前节点对应的第二标志进行预测解码。The second context model is used to predict and decode the second flag corresponding to the current node.
  26. 根据权利要求25所述的方法,其特征在于,所述基于所述N个预测节点的第二直接解码信息,确定所述第二上下文模型的索引,包括:The method according to claim 25, characterized in that the determining the index of the second context model based on the second direct decoding information of the N prediction nodes comprises:
    针对所述N个预测节点中的任一预测节点,基于所述预测节点的第二直接解码信息,确定所述预测节点对应的第 二数值;For any prediction node among the N prediction nodes, determining a second numerical value corresponding to the prediction node based on second direct decoding information of the prediction node;
    基于所述N个预测节点对应的第二数值,确定所述第二上下文模型的索引。Based on second numerical values corresponding to the N prediction nodes, an index of the second context model is determined.
  27. 根据权利要求26所述的方法,其特征在于,所述基于所述N个预测节点对应的第二数值,确定所述第二上下文模型的索引,包括:The method according to claim 26, characterized in that the determining the index of the second context model based on the second numerical values corresponding to the N prediction nodes comprises:
    确定所述预测节点对应的第一权重;Determining a first weight corresponding to the prediction node;
    基于所述第一权重,对所述N个预测节点对应的第二数值进行加权处理,得到第三加权预测值;Based on the first weight, weighting the second values corresponding to the N prediction nodes to obtain a third weighted prediction value;
    基于所述第三加权预测值,确定所述第二上下文模型的索引。Based on the third weighted prediction value, determine the index of the second context model.
  28. 根据权利要求25所述的方法,其特征在于,若所述K大于1时,则所述基于所述N个预测节点的第二直接解码信息,确定第二上下文模型的索引,包括:The method according to claim 25, characterized in that, if the K is greater than 1, the determining the index of the second context model based on the second direct decoding information of the N prediction nodes comprises:
    针对所述K个预测参考帧中的第j个预测参考帧,基于所述当前节点在所述第j个预测参考帧中的预测节点的第二直接解码信息,确定所述第j个预测参考帧中的预测节点对应的第二数值,所述j为小于或等于K的正整数;For a j-th prediction reference frame among the K prediction reference frames, determining a second value corresponding to the prediction node in the j-th prediction reference frame based on second direct decoding information of the prediction node of the current node in the j-th prediction reference frame, where j is a positive integer less than or equal to K;
    确定所述预测节点对应的第一权重,并基于所述第一权重对所述第j个预测参考帧中的预测节点对应的第二数值进行加权处理,得到所述第j个预测参考帧对应的第四加权预测值;Determine a first weight corresponding to the prediction node, and perform weighted processing on the second value corresponding to the prediction node in the j-th prediction reference frame based on the first weight to obtain a fourth weighted prediction value corresponding to the j-th prediction reference frame;
    基于所述K个预测参考帧对应的第四加权预测值,确定所述第二上下文模型的索引。Based on the fourth weighted prediction values corresponding to the K prediction reference frames, an index of the second context model is determined.
  29. 根据权利要求28所述的方法,其特征在于,所述基于所述K个预测参考帧对应的第四加权预测值,确定所述第二上下文模型的索引,包括:The method according to claim 28, characterized in that the determining the index of the second context model based on the fourth weighted prediction values corresponding to the K prediction reference frames comprises:
    确定所述K个预测参考帧对应的第二权重;Determine second weights corresponding to the K prediction reference frames;
    基于所述第二权重对所述K个预测参考帧分别对应的第四加权预测值进行加权处理,得到所述第二上下文模型的索引。Based on the second weight, the fourth weighted prediction values corresponding to the K prediction reference frames are weighted to obtain the index of the second context model.
  30. 根据权利要求26或28所述的方法,其特征在于,所述第二直接解码信息包括第二信息和第三信息中的至少一个,所述第二信息表示所述预测节点是否满足所述直接解码方式进行解码的条件,所述第三信息表示所述预测节点的第二个点是否为重复点,基于所述预测节点的第二直接解码信息,确定所述预测节点对应的第二数值,包括:The method according to claim 26 or 28, characterized in that the second direct decoding information includes at least one of second information and third information, the second information indicates whether the predicted node satisfies the condition for decoding by the direct decoding method, and the third information indicates whether the second point of the predicted node is a repeated point, and determining the second value corresponding to the predicted node based on the second direct decoding information of the predicted node comprises:
    基于所述第二信息,确定第一子值;Based on the second information, determining a first sub-value;
    基于所述第三信息,确定第三子值;Based on the third information, determining a third sub-value;
    基于所述第一子值和/或所述第三子值,确定所述预测节点对应的第二数值。Based on the first sub-value and/or the third sub-value, a second numerical value corresponding to the prediction node is determined.
  31. 根据权利要求30所述的方法,其特征在于,所述基于所述第三信息,确定第三子值,包括:The method according to claim 30, characterized in that the determining the third sub-value based on the third information comprises:
    若所述第三信息表示所述预测节点的第二个点是重复点,则确定所述第三子值的取值为第一值;If the third information indicates that the second point of the predicted node is a duplicate point, determining that the value of the third sub-value is the first value;
    若所述第三信息表示所述预测节点的第二个点不是重复点,则确定所述第三子值的取值为第二值。If the third information indicates that the second point of the predicted node is not a repeated point, the value of the third sub-value is determined to be the second value.
  32. 根据权利要求24所述的方法,其特征在于,所述方法还包括:The method according to claim 24, characterized in that the method further comprises:
    若所述当前节点对应的第二标志表示所述当前节点的第二个点不是重复点1时,则确定所述当前节点包括1个点。If the second flag corresponding to the current node indicates that the second point of the current node is not the repeated point 1, it is determined that the current node includes 1 point.
  33. 根据权利要求24所述的方法,其特征在于,所述预测节点的直接解码信息包括第三直接解码信息,所述方法还包括:The method according to claim 24, characterized in that the direct decoding information of the prediction node includes third direct decoding information, and the method further comprises:
    若所述当前节点对应的第二标志表示所述当前节点的第二个点是重复点时,则将所述N个预测节点的第三直接解码信息作为上下文信息,对所述当前节点对应的第三标志进行预测解码,所述第三标志用于表示所述当前节点的第三个点是否为重复点。If the second flag corresponding to the current node indicates that the second point of the current node is a repeated point, the third direct decoding information of the N predicted nodes is used as context information to predict and decode the third flag corresponding to the current node, and the third flag is used to indicate whether the third point of the current node is a repeated point.
  34. 根据权利要求33所述的方法,其特征在于,所述将所述N个预测节点的第三直接解码信息作为上下文信息,对所述当前节点对应的第三标志进行预测解码,包括:The method according to claim 33, characterized in that the using the third direct decoding information of the N prediction nodes as context information to perform predictive decoding on the third flag corresponding to the current node comprises:
    基于所述N个预测节点的第三直接解码信息,确定第三上下文模型的索引;Determining an index of a third context model based on third direct decoding information of the N prediction nodes;
    基于所述第三上下文模型的索引,从所述第三标志对应的R个上下文模型中,选出所述第三上下文模型,所述R为大于1的正整数;Based on the index of the third context model, selecting the third context model from R context models corresponding to the third flag, where R is a positive integer greater than 1;
    使用所述第三上下文模型,对所述当前节点对应的第三标志进行预测解码。The third context model is used to predict and decode the third flag corresponding to the current node.
  35. 根据权利要求34所述的方法,其特征在于,所述基于所述N个预测节点的第三直接解码信息,确定所述第三上下文模型的索引,包括:The method according to claim 34, characterized in that the determining the index of the third context model based on the third direct decoding information of the N prediction nodes comprises:
    针对所述N个预测节点中的任一预测节点,基于所述预测节点的第三直接解码信息,确定所述预测节点对应的第三数值;For any prediction node among the N prediction nodes, determining a third value corresponding to the prediction node based on third direct decoding information of the prediction node;
    基于所述N个预测节点对应的第三数值,确定所述第三上下文模型的索引。Based on third numerical values corresponding to the N prediction nodes, an index of the third context model is determined.
  36. 根据权利要求35所述的方法,其特征在于,所述基于所述N个预测节点对应的第三数值,确定所述第三上下文模型的索引,包括:The method according to claim 35, characterized in that the determining the index of the third context model based on the third values corresponding to the N prediction nodes comprises:
    确定所述预测节点对应的第一权重;Determining a first weight corresponding to the prediction node;
    基于所述第一权重,对所述N个预测节点对应的第三数值进行加权处理,得到第五加权预测值;Based on the first weight, weighting the third values corresponding to the N prediction nodes to obtain a fifth weighted prediction value;
    基于所述第五加权预测值,确定所述第三上下文模型的索引。Based on the fifth weighted prediction value, an index of the third context model is determined.
  37. 根据权利要求34所述的方法,其特征在于,若所述K大于1时,则所述基于所述N个预测节点的第三直接解码信息,确定第三上下文模型的索引,包括:The method according to claim 34, characterized in that, if the K is greater than 1, then the determining the index of the third context model based on the third direct decoding information of the N prediction nodes comprises:
    针对所述K个预测参考帧中的第j个预测参考帧,基于所述当前节点在所述第j个预测参考帧中的预测节点的第三直接解码信息,确定所述第j个预测参考帧中的预测节点对应的第三数值,所述j为小于或等于K的正整数;For a j-th prediction reference frame among the K prediction reference frames, determining a third value corresponding to the prediction node in the j-th prediction reference frame based on third direct decoding information of the prediction node of the current node in the j-th prediction reference frame, where j is a positive integer less than or equal to K;
    确定所述预测节点对应的第一权重,并基于所述第一权重对所述第j个预测参考帧中的预测节点对应的第三数值进行加权处理,得到所述第j个预测参考帧对应的第六加权预测值;Determine a first weight corresponding to the prediction node, and perform weighted processing on a third value corresponding to the prediction node in the j-th prediction reference frame based on the first weight to obtain a sixth weighted prediction value corresponding to the j-th prediction reference frame;
    基于所述K个预测参考帧对应的第六加权预测值,确定所述第三上下文模型的索引。Based on the sixth weighted prediction values corresponding to the K prediction reference frames, an index of the third context model is determined.
  38. 根据权利要求37所述的方法,其特征在于,所述基于所述K个预测参考帧对应的第六加权预测值,确定所 述第三上下文模型的索引,包括:The method according to claim 37, characterized in that the determining the index of the third context model based on the sixth weighted prediction values corresponding to the K prediction reference frames comprises:
    确定所述K个预测参考帧对应的第二权重;Determine second weights corresponding to the K prediction reference frames;
    基于所述第二权重对所述K个预测参考帧分别对应的第六加权预测值进行加权处理,得到所述第三上下文模型的索引。Based on the second weight, the sixth weighted prediction values corresponding to the K prediction reference frames are weighted to obtain the index of the third context model.
  39. 根据权利要求35或37所述的方法,其特征在于,所述第三直接解码信息包括第二信息和第四信息中的至少一个,所述第二信息表示所述预测节点是否满足所述直接解码方式进行解码的条件,所述第四信息表示所述预测节点的第三个点是否为重复点,其中,基于所述预测节点的第三直接解码信息,确定所述预测节点对应的第三数值,包括:The method according to claim 35 or 37, characterized in that the third direct decoding information includes at least one of second information and fourth information, the second information indicates whether the predicted node satisfies the condition for decoding by the direct decoding method, and the fourth information indicates whether the third point of the predicted node is a repeated point, wherein determining the third value corresponding to the predicted node based on the third direct decoding information of the predicted node includes:
    基于所述第二信息,确定第一子值;Based on the second information, determining a first sub-value;
    基于所述第四信息,确定第四子值;Based on the fourth information, determining a fourth sub-value;
    基于所述第一子值和/或所述第四子值,确定所述预测节点对应的第三数值。Based on the first sub-value and/or the fourth sub-value, a third numerical value corresponding to the prediction node is determined.
  40. 根据权利要求39所述的方法,其特征在于,所述基于所述第四信息,确定第四子值,包括:The method according to claim 39, characterized in that the determining the fourth sub-value based on the fourth information comprises:
    若所述第四信息表示所述预测节点的第三个点是重复点,则确定所述第四子值的取值为第一值;If the fourth information indicates that the third point of the predicted node is a repeated point, determining that the value of the fourth subvalue is the first value;
    若所述第四信息表示所述预测节点的第三个点不是重复点,则确定所述第四子值的取值为第二值。If the fourth information indicates that the third point of the predicted node is not a repeated point, the value of the fourth sub-value is determined to be the second value.
  41. 根据权利要求33所述的方法,其特征在于,所述方法还包括:The method according to claim 33, characterized in that the method further comprises:
    若所述第三标志表示所述当前节点的第三个点为重复点时,则对所述当前节点所包括的剩余重复点数目进行解码。If the third flag indicates that the third point of the current node is a repeated point, the number of remaining repeated points included in the current node is decoded.
  42. 根据权利要求21、30或39所述的方法,其特征在于,所述基于所述第二信息,确定第一子值,包括:The method according to claim 21, 30 or 39, characterized in that determining the first sub-value based on the second information comprises:
    若所述第二信息指示所述预测节点满足所述直接解码方式进行解码的条件时,则确定所述第一子值的取值为第一值;If the second information indicates that the prediction node satisfies the condition for decoding in the direct decoding manner, determining that the value of the first subvalue is a first value;
    若所述第二信息指示所述预测节点不满足所述直接解码方式进行解码的条件时,则确定所述第一子值的取值为第二值。If the second information indicates that the prediction node does not satisfy the condition for decoding in the direct decoding manner, the value of the first sub-value is determined to be a second value.
  43. 根据权利要求21、30或39所述的方法,其特征在于,所述第二信息包括所述预测节点的直接解码模式信息。The method according to claim 21, 30 or 39 is characterized in that the second information includes direct decoding mode information of the prediction node.
  44. 根据权利要求18、19、27、28、36或37所述的方法,其特征在于,确定所述预测节点对应的第一权重,包括:The method according to claim 18, 19, 27, 28, 36 or 37, wherein determining the first weight corresponding to the prediction node comprises:
    基于所述预测节点对应的领域节点与所述当前节点之间的距离,确定所述预测节点对应的第一权重。Based on the distance between the domain node corresponding to the prediction node and the current node, a first weight corresponding to the prediction node is determined.
  45. 根据权利要求20、29或38所述的方法,其特征在于,所述确定所述K个预测参考帧对应的第二权重,包括:The method according to claim 20, 29 or 38, characterized in that the determining the second weights corresponding to the K prediction reference frames comprises:
    基于所述预测参考帧与所述当前待解码帧之间的时间差距,确定所述预测参考帧对应的第二权重。Based on the time difference between the predicted reference frame and the current frame to be decoded, a second weight corresponding to the predicted reference frame is determined.
  46. 一种点云编码方法,其特征在于,包括:A point cloud encoding method, characterized by comprising:
    确定当前节点对应的帧间编码信息,所述当前节点为当前待编码帧中的待编码节点;Determine inter-frame coding information corresponding to a current node, where the current node is a node to be coded in a current frame to be coded;
    基于所述当前节点对应的帧间编码信息,对所述当前节点所包括的点数进行预测编码。Based on the inter-frame coding information corresponding to the current node, predictive coding is performed on the points included in the current node.
  47. 根据权利要求46所述的方法,其特征在于,所述帧间编码信息包括所述当前节点的N个预测节点的几何编码信息和所述当前节点对应的第一信息中的至少一个,所述第一信息用于指示所述当前节点是否允许启动帧间编码方式,所述确定当前节点对应的帧间编码信息,包括:The method according to claim 46 is characterized in that the inter-frame coding information includes at least one of geometric coding information of N prediction nodes of the current node and first information corresponding to the current node, the first information is used to indicate whether the current node allows the inter-frame coding mode to be started, and the determining of the inter-frame coding information corresponding to the current node includes:
    在所述当前待编码帧的预测参考帧中,确定所述当前节点的N个预测节点,和/或确定所述第一信息,所述N为正整数;In a prediction reference frame of the current frame to be encoded, determine N prediction nodes of the current node, and/or determine the first information, where N is a positive integer;
    所述基于所述当前节点对应的帧间编码信息,对所述当前节点所包括的点数进行预测编码,包括:The predictive coding of the number of points included in the current node based on the inter-frame coding information corresponding to the current node includes:
    基于所述N个预测节点的几何编码信息和/或所述第一信息,对所述当前节点所包括的点数进行预测编码。Based on the geometric coding information of the N predicted nodes and/or the first information, predictive coding is performed on the number of points included in the current node.
  48. 根据权利要求47所述的方法,其特征在于,所述预测节点的几何编码信息包括所述预测节点的直接编码信息,所述基于所述N个预测节点的几何编码信息和/或所述第一信息,对所述当前节点所包括的点数进行预测编码,包括:The method according to claim 47, characterized in that the geometric coding information of the prediction node includes direct coding information of the prediction node, and the predictive coding of the number of points included in the current node based on the geometric coding information of the N prediction nodes and/or the first information comprises:
    基于所述N个预测节点的直接编码信息和/或所述第一信息,对所述当前节点的点数进行预测编码。Based on the direct encoding information of the N prediction nodes and/or the first information, predictive encoding is performed on the number of points of the current node.
  49. 根据权利要求48所述的方法,其特征在于,所述当前待编码帧包括K个预测参考帧,所述在所述当前待编码帧的预测参考帧中,确定所述当前节点的N个预测节点,包括:The method according to claim 48, characterized in that the current frame to be encoded includes K prediction reference frames, and determining N prediction nodes of the current node in the prediction reference frames of the current frame to be encoded comprises:
    针对所述K个预测参考帧中的第k个预测参考帧,确定所述当前节点在所述第k个预测参考帧中的至少一个预测节点,所述k为小于或等于K的正整数,所述K为正整数;For a k-th prediction reference frame among the K prediction reference frames, determining at least one prediction node of the current node in the k-th prediction reference frame, where k is a positive integer less than or equal to K, and K is a positive integer;
    基于所述当前节点在所述K个预测参考帧中的至少一个预测节点,确定所述当前节点的N个预测节点。Based on at least one prediction node of the current node in the K prediction reference frames, N prediction nodes of the current node are determined.
  50. 根据权利要求49所述的方法,其特征在于,所述确定所述当前节点在所述第k个预测参考帧中的至少一个预测节点,包括:The method according to claim 49, characterized in that the determining of at least one prediction node of the current node in the k-th prediction reference frame comprises:
    在所述当前待编码帧中,确定所述当前节点的M个领域节点,所述M个领域节点中包括所述当前节点,所述M为正整数;In the current frame to be encoded, determine M domain nodes of the current node, the M domain nodes include the current node, and M is a positive integer;
    针对所述M个领域节点中的第i个领域节点,确定所述第i个领域节点在所述第k个预测参考帧中的对应节点,所述i为小于或等于M的正整数;For an i-th domain node among the M domain nodes, determine a corresponding node of the i-th domain node in the k-th prediction reference frame, where i is a positive integer less than or equal to M;
    基于所述M个领域节点在所述第k个预测参考帧中的对应节点,确定所述当前节点在所述第k个预测参考帧中的至少一个预测节点。Based on the corresponding nodes of the M domain nodes in the k-th prediction reference frame, at least one prediction node of the current node in the k-th prediction reference frame is determined.
  51. 根据权利要求49所述的方法,其特征在于,所述确定所述当前节点在所述第k个预测参考帧中的至少一个预测节点,包括:The method according to claim 49, characterized in that the determining of at least one prediction node of the current node in the k-th prediction reference frame comprises:
    确定所述当前节点在所述第k个预测参考帧中的对应节点;Determine a corresponding node of the current node in the k-th prediction reference frame;
    确定所述对应节点的至少一个领域节点;Determining at least one domain node of the corresponding node;
    将所述至少一个领域节点,确定为所述当前节点在所述第k个预测参考帧中的至少一个预测节点。The at least one domain node is determined as at least one prediction node of the current node in the k-th prediction reference frame.
  52. 根据权利要求50或51所述的方法,其特征在于,所述方法还包括:The method according to claim 50 or 51, characterized in that the method further comprises:
    在所述当前待编码帧中,确定第i个节点的父节点,作为第i个父节点,所述第i个节点为所述第i个领域节点或者为所述当前节点;In the current frame to be encoded, determine the parent node of the ith node as the ith parent node, the ith node being the ith domain node or the current node;
    确定所述第i个父节点在所述第k个预测参考帧中的匹配节点,作为第i个匹配节点;Determine a matching node of the i-th parent node in the k-th prediction reference frame as the i-th matching node;
    将所述i个匹配节点的子节点中的一个子节点,确定为所述第i个节点在所述第k个预测参考帧中的对应节点。One of the child nodes of the i matching nodes is determined as the corresponding node of the i-th node in the k-th prediction reference frame.
  53. 根据权利要求52所述的方法,其特征在于,所述确定所述第i个父节点在所述第k个预测参考帧中的匹配节点,包括:The method according to claim 52, characterized in that the determining the matching node of the i-th parent node in the k-th prediction reference frame comprises:
    基于所述第i个父节点的占位信息,确定所述第i个父节点在所述第k个预测参考帧中的匹配节点。Based on the placeholder information of the i-th parent node, a matching node of the i-th parent node in the k-th prediction reference frame is determined.
  54. 根据权利要求53所述的方法,其特征在于,所述基于所述第i个父节点的占位信息,确定所述第i个父节点在所述第k个预测参考帧中的匹配节点,包括:The method according to claim 53, characterized in that the determining, based on the placeholder information of the i-th parent node, a matching node of the i-th parent node in the k-th prediction reference frame comprises:
    将所述第k个预测参考帧中,占位信息与所述第i个父节点的占位信息之间的差异最小的节点,确定为所述第i个父节点在所述第k个预测参考帧中的匹配节点。A node whose placeholder information in the k-th prediction reference frame has the smallest difference with the placeholder information of the i-th parent node is determined as a matching node of the i-th parent node in the k-th prediction reference frame.
  55. 根据权利要求54所述的方法,其特征在于,所述将所述i个匹配节点的子节点中的一个子节点,确定为所述第i个节点在所述第k个预测参考帧中的对应节点,包括:The method according to claim 54, characterized in that the step of determining one of the child nodes of the i matching nodes as the corresponding node of the i-th node in the k-th prediction reference frame comprises:
    确定所述第i个节点在所述父节点所包括的子节点中的第一序号;Determine the first sequence number of the i-th node among the child nodes included in the parent node;
    将所述第i个匹配节点的子节点中序号为第一序号的子节点,确定为所述第i个节点在所述第k个预测参考帧中的对应节点。The child node with the first sequence number among the child nodes of the i-th matching node is determined as the corresponding node of the i-th node in the k-th prediction reference frame.
  56. 根据权利要求50所述的方法,其特征在于,所述基于所述M个领域节点在所述第k个预测参考帧中的对应节点,确定所述当前节点在所述第k个预测参考帧中的至少一个预测节点,包括:The method according to claim 50, characterized in that the determining at least one prediction node of the current node in the kth prediction reference frame based on the corresponding nodes of the M domain nodes in the kth prediction reference frame comprises:
    将所述M个领域节点在所述第k个预测参考帧中的对应节点,确定为所述当前节点在所述第k个预测参考帧中的至少一个预测节点。The corresponding nodes of the M domain nodes in the k-th prediction reference frame are determined as at least one prediction node of the current node in the k-th prediction reference frame.
  57. 根据权利要求49所述的方法,其特征在于,所述基于所述当前节点在所述K个预测参考帧中的至少一个预测节点,确定所述当前节点的N个预测节点,包括:The method according to claim 49, characterized in that the determining the N prediction nodes of the current node based on at least one prediction node of the current node in the K prediction reference frames comprises:
    将所述当前节点在所述K个预测参考帧中的至少一个预测节点,确定为所述当前节点的N个预测节点。At least one prediction node of the current node in the K prediction reference frames is determined as N prediction nodes of the current node.
  58. 根据权利要求49所述的方法,其特征在于,若所述当前待编码帧为P帧,则所述K个预测参考帧包括所述当前待编码帧的前向帧。The method according to claim 49 is characterized in that if the current frame to be encoded is a P frame, the K prediction reference frames include a forward frame of the current frame to be encoded.
  59. 根据权利要求49所述的方法,其特征在于,若所述当前待编码帧为B帧,则所述K个预测参考帧包括所述当前待编码帧的前向帧和后向帧。The method according to claim 49 is characterized in that if the current frame to be encoded is a B frame, the K prediction reference frames include a forward frame and a backward frame of the current frame to be encoded.
  60. 根据权利要求49所述的方法,其特征在于,所述预测节点的直接编码信息包括第一直接编码信息,基于所述N个预测节点的直接编码信息,对所述当前节点所包括的点数进行预测编码,包括:The method according to claim 49, characterized in that the direct coding information of the prediction node includes first direct coding information, and based on the direct coding information of the N prediction nodes, predictive coding is performed on the number of points included in the current node, comprising:
    将所述N个预测节点的第一直接编码信息作为上下文信息,对所述当前节点对应的第一标志进行预测编码,所述第一标志用于表示所述当前节点所包括的点数是否大于1。The first direct encoding information of the N prediction nodes is used as context information to perform predictive encoding on a first flag corresponding to the current node, where the first flag is used to indicate whether the number of points included in the current node is greater than 1.
  61. 根据权利要求60所述的方法,其特征在于,所述将所述N个预测节点的第一直接编码信息作为上下文信息,对所述当前节点对应的第一标志进行预测编码,包括:The method according to claim 60, characterized in that the step of using the first direct encoding information of the N prediction nodes as context information to perform predictive encoding on the first flag corresponding to the current node comprises:
    基于所述N个预测节点的第一直接编码信息,确定第一上下文模型的索引;Determining an index of a first context model based on the first direct encoding information of the N prediction nodes;
    基于所述第一上下文模型的索引,从所述第一标志对应的Q个上下文模型中,选出所述第一上下文模型,所述Q为大于1的正整数;Based on the index of the first context model, selecting the first context model from Q context models corresponding to the first flag, where Q is a positive integer greater than 1;
    使用所述第一上下文模型,对所述当前节点对应的第一标志进行预测编码。Using the first context model, predictive encoding is performed on a first flag corresponding to the current node.
  62. 根据权利要求61所述的方法,其特征在于,所述基于所述N个预测节点的第一直接编码信息,确定所述第一上下文模型的索引,包括:The method according to claim 61, characterized in that the determining the index of the first context model based on the first direct encoding information of the N prediction nodes comprises:
    针对所述N个预测节点中的任一预测节点,基于所述预测节点的第一直接编码信息,确定所述预测节点对应的第一数值;For any prediction node among the N prediction nodes, determining a first value corresponding to the prediction node based on first direct encoding information of the prediction node;
    基于所述N个预测节点对应的第一数值,确定所述第一上下文模型的索引。Based on first numerical values corresponding to the N prediction nodes, an index of the first context model is determined.
  63. 根据权利要求62所述的方法,其特征在于,所述基于所述N个预测节点对应的第一数值,确定所述第一上下文模型的索引,包括:The method according to claim 62, characterized in that the determining the index of the first context model based on the first values corresponding to the N prediction nodes comprises:
    确定所述预测节点对应的第一权重;Determining a first weight corresponding to the prediction node;
    基于所述第一权重,对所述N个预测节点对应的第一数值进行加权处理,得到第一加权预测值;Based on the first weight, weighting the first values corresponding to the N prediction nodes to obtain a first weighted prediction value;
    基于所述第一加权预测值,确定所述第一上下文模型的索引。Based on the first weighted prediction value, determine the index of the first context model.
  64. 根据权利要求61所述的方法,其特征在于,若所述K大于1时,则所述基于所述N个预测节点的第一直接编码信息,确定第一上下文模型的索引,包括:The method according to claim 61, characterized in that if K is greater than 1, then determining the index of the first context model based on the first direct encoding information of the N prediction nodes comprises:
    针对所述K个预测参考帧中的第j个预测参考帧,基于所述当前节点在所述第j个预测参考帧中的预测节点的第一直接编码信息,确定所述第j个预测参考帧中的预测节点对应的第一数值,所述j为小于或等于K的正整数;For a j-th prediction reference frame among the K prediction reference frames, determining a first value corresponding to the prediction node in the j-th prediction reference frame based on first direct encoding information of the prediction node of the current node in the j-th prediction reference frame, where j is a positive integer less than or equal to K;
    确定所述预测节点对应的第一权重,并基于所述第一权重对所述第j个预测参考帧中的预测节点对应的第一数值进行加权处理,得到所述第j个预测参考帧对应的第二加权预测值;Determine a first weight corresponding to the prediction node, and perform weighted processing on a first value corresponding to the prediction node in the j-th prediction reference frame based on the first weight to obtain a second weighted prediction value corresponding to the j-th prediction reference frame;
    基于所述K个预测参考帧对应的第二加权预测值,确定所述第一上下文模型的索引。Based on the second weighted prediction values corresponding to the K prediction reference frames, an index of the first context model is determined.
  65. 根据权利要求64所述的方法,其特征在于,所述基于所述K个预测参考帧对应的第二加权预测值,确定所述第一上下文模型的索引,包括:The method according to claim 64, characterized in that the determining the index of the first context model based on the second weighted prediction values corresponding to the K prediction reference frames comprises:
    确定所述K个预测参考帧对应的第二权重;Determine second weights corresponding to the K prediction reference frames;
    基于所述第二权重对所述K个预测参考帧分别对应的第二加权预测值进行加权处理,得到所述第一上下文模型的索引。Based on the second weight, weighted processing is performed on the second weighted prediction values corresponding to the K prediction reference frames respectively to obtain an index of the first context model.
  66. 根据权利要求62或64所述的方法,其特征在于,所述第一直接编码信息包括第二信息和预测节点的点数中的至少一个,所述第二信息表示所述预测节点是否满足所述直接编码方式进行编码的条件,基于所述预测节点的第一直接编码信息,确定所述预测节点对应的第一数值,包括:The method according to claim 62 or 64, characterized in that the first direct encoding information includes at least one of second information and the number of points of the prediction node, the second information indicates whether the prediction node satisfies the condition for encoding by the direct encoding method, and determining the first value corresponding to the prediction node based on the first direct encoding information of the prediction node comprises:
    基于所述第二信息,确定第一子值;Based on the second information, determining a first sub-value;
    基于所述预测节点的点数,确定第二子值;Determining a second sub-value based on the number of points of the prediction node;
    基于所述第一子值和/或所述第二子值,确定所述预测节点对应的第一数值。Based on the first sub-value and/or the second sub-value, a first numerical value corresponding to the prediction node is determined.
  67. 根据权利要求66所述的方法,其特征在于,所述基于所述预测节点的点数,确定第二子值,包括:The method according to claim 66, characterized in that the determining the second sub-value based on the number of points of the prediction node comprises:
    若所述预测节点所包括的点数大于第一预设值,则确定所述第二子值的取值为第一值;If the number of points included in the prediction node is greater than a first preset value, determining that the value of the second sub-value is the first value;
    若所述预测节点所包括的点数小于或等于第一预设值,则确定所述第二子值的取值为第二值。If the number of points included in the prediction node is less than or equal to a first preset value, the value of the second sub-value is determined to be a second value.
  68. 根据权利要求60所述的方法,其特征在于,所述预测节点的直接编码信息包括第二直接编码信息,所述方法还包括:The method according to claim 60, characterized in that the direct encoding information of the prediction node includes second direct encoding information, and the method further comprises:
    将所述N个预测节点的第二直接编码信息作为上下文信息,对所述当前节点对应的第二标志进行预测编码,所述第二标志用于表示所述当前节点的第二个点是否为重复点。The second direct encoding information of the N predicted nodes is used as context information to perform predictive encoding on a second flag corresponding to the current node, where the second flag is used to indicate whether the second point of the current node is a repeated point.
  69. 根据权利要求68所述的方法,其特征在于,所述将所述N个预测节点的第二直接编码信息作为上下文信息,对所述当前节点对应的第二标志进行预测编码,包括:The method according to claim 68, characterized in that the step of using the second direct encoding information of the N prediction nodes as context information to perform predictive encoding on the second flag corresponding to the current node comprises:
    基于所述N个预测节点的第二直接编码信息,确定第二上下文模型的索引;Determining an index of a second context model based on the second direct encoding information of the N prediction nodes;
    基于所述第二上下文模型的索引,从所述第二标志对应的P个上下文模型中,选出所述第二上下文模型,所述P为大于1的正整数;Based on the index of the second context model, selecting the second context model from P context models corresponding to the second flag, where P is a positive integer greater than 1;
    使用所述第二上下文模型,对所述当前节点对应的第二标志进行预测编码。The second context model is used to perform predictive encoding on a second flag corresponding to the current node.
  70. 根据权利要求69所述的方法,其特征在于,所述基于所述N个预测节点的第二直接编码信息,确定所述第二上下文模型的索引,包括:The method according to claim 69, characterized in that the determining the index of the second context model based on the second direct encoding information of the N prediction nodes comprises:
    针对所述N个预测节点中的任一预测节点,基于所述预测节点的第二直接编码信息,确定所述预测节点对应的第二数值;For any prediction node among the N prediction nodes, determining a second value corresponding to the prediction node based on second direct encoding information of the prediction node;
    基于所述N个预测节点对应的第二数值,确定所述第二上下文模型的索引。Based on second numerical values corresponding to the N prediction nodes, an index of the second context model is determined.
  71. 根据权利要求70所述的方法,其特征在于,所述基于所述N个预测节点对应的第二数值,确定所述第二上下文模型的索引,包括:The method according to claim 70, characterized in that the determining the index of the second context model based on the second values corresponding to the N prediction nodes comprises:
    确定所述预测节点对应的第一权重;Determining a first weight corresponding to the prediction node;
    基于所述第一权重,对所述N个预测节点对应的第二数值进行加权处理,得到第三加权预测值;Based on the first weight, weighting the second values corresponding to the N prediction nodes to obtain a third weighted prediction value;
    基于所述第三加权预测值,确定所述第二上下文模型的索引。Based on the third weighted prediction value, determine the index of the second context model.
  72. 根据权利要求69所述的方法,其特征在于,若所述K大于1时,则所述基于所述N个预测节点的第二直接编码信息,确定第二上下文模型的索引,包括:The method according to claim 69, characterized in that if the K is greater than 1, then the determining the index of the second context model based on the second direct encoding information of the N prediction nodes comprises:
    针对所述K个预测参考帧中的第j个预测参考帧,基于所述当前节点在所述第j个预测参考帧中的预测节点的第二直接编码信息,确定所述第j个预测参考帧中的预测节点对应的第二数值,所述j为小于或等于K的正整数;For a j-th prediction reference frame among the K prediction reference frames, determining a second value corresponding to the prediction node in the j-th prediction reference frame based on second direct encoding information of the prediction node of the current node in the j-th prediction reference frame, where j is a positive integer less than or equal to K;
    确定所述预测节点对应的第一权重,并基于所述第一权重对所述第j个预测参考帧中的预测节点对应的第二数值进行加权处理,得到所述第j个预测参考帧对应的第四加权预测值;Determine a first weight corresponding to the prediction node, and perform weighted processing on the second value corresponding to the prediction node in the j-th prediction reference frame based on the first weight to obtain a fourth weighted prediction value corresponding to the j-th prediction reference frame;
    基于所述K个预测参考帧对应的第四加权预测值,确定所述第二上下文模型的索引。Based on the fourth weighted prediction values corresponding to the K prediction reference frames, an index of the second context model is determined.
  73. 根据权利要求72所述的方法,其特征在于,所述基于所述K个预测参考帧对应的第四加权预测值,确定所述第二上下文模型的索引,包括:The method according to claim 72, characterized in that the determining the index of the second context model based on the fourth weighted prediction values corresponding to the K prediction reference frames comprises:
    确定所述K个预测参考帧对应的第二权重;Determine second weights corresponding to the K prediction reference frames;
    基于所述第二权重对所述K个预测参考帧分别对应的第四加权预测值进行加权处理,得到所述第二上下文模型的索引。Based on the second weight, the fourth weighted prediction values corresponding to the K prediction reference frames are weighted to obtain the index of the second context model.
  74. 根据权利要求70或72所述的方法,其特征在于,所述第二直接编码信息包括第二信息和第三信息中的至少一个,所述第二信息表示所述预测节点是否满足所述直接编码方式进行编码的条件,所述第三信息表示所述预测节点的第二个点是否为重复点,基于所述预测节点的第二直接编码信息,确定所述预测节点对应的第二数值,包括:The method according to claim 70 or 72, characterized in that the second direct encoding information includes at least one of second information and third information, the second information indicates whether the predicted node meets the condition for encoding by the direct encoding method, and the third information indicates whether the second point of the predicted node is a repeated point, and determining the second value corresponding to the predicted node based on the second direct encoding information of the predicted node comprises:
    基于所述第二信息,确定第一子值;Based on the second information, determining a first sub-value;
    基于所述第三信息,确定第三子值;Based on the third information, determining a third sub-value;
    基于所述第一子值和/或所述第三子值,确定所述预测节点对应的第二数值。Based on the first sub-value and/or the third sub-value, a second numerical value corresponding to the prediction node is determined.
  75. 根据权利要求74所述的方法,其特征在于,所述基于所述第三信息,确定第三子值,包括:The method according to claim 74, characterized in that the determining the third sub-value based on the third information comprises:
    若所述第三信息表示所述预测节点的第二个点是重复点,则确定所述第三子值的取值为第一值;If the third information indicates that the second point of the predicted node is a duplicate point, determining that the value of the third sub-value is the first value;
    若所述第三信息表示所述预测节点的第二个点不是重复点,则确定所述第三子值的取值为第二值。If the third information indicates that the second point of the predicted node is not a repeated point, the value of the third sub-value is determined to be the second value.
  76. 根据权利要求68所述的方法,其特征在于,所述预测节点的直接编码信息包括第三直接编码信息,所述方法还包括:The method according to claim 68, characterized in that the direct encoding information of the prediction node includes third direct encoding information, and the method further comprises:
    若所述当前节点的第二个点是重复点时,则将所述N个预测节点的第三直接编码信息作为上下文信息,对所述当前节点对应的第三标志进行预测编码,所述第三标志用于表示所述当前节点的第三个点是否为重复点。If the second point of the current node is a repeated point, the third direct encoding information of the N predicted nodes is used as context information to perform predictive encoding on the third flag corresponding to the current node, and the third flag is used to indicate whether the third point of the current node is a repeated point.
  77. 根据权利要求76所述的方法,其特征在于,所述将所述N个预测节点的第三直接编码信息作为上下文信息,对所述当前节点对应的第三标志进行预测编码,包括:The method according to claim 76, characterized in that the step of using the third direct encoding information of the N prediction nodes as context information to perform predictive encoding on the third flag corresponding to the current node comprises:
    基于所述N个预测节点的第三直接编码信息,确定第三上下文模型的索引;Determining an index of a third context model based on the third direct encoding information of the N prediction nodes;
    基于所述第三上下文模型的索引,从所述第三标志对应的R个上下文模型中,选出所述第三上下文模型,所述R为大于1的正整数;Based on the index of the third context model, selecting the third context model from R context models corresponding to the third flag, where R is a positive integer greater than 1;
    使用所述第三上下文模型,对所述当前节点对应的第三标志进行预测编码。The third flag corresponding to the current node is predictively encoded using the third context model.
  78. 根据权利要求77所述的方法,其特征在于,所述基于所述N个预测节点的第三直接编码信息,确定所述第三上下文模型的索引,包括:The method according to claim 77, characterized in that the determining the index of the third context model based on the third direct encoding information of the N prediction nodes comprises:
    针对所述N个预测节点中的任一预测节点,基于所述预测节点的第三直接编码信息,确定所述预测节点对应的第三数值;For any prediction node among the N prediction nodes, determining a third value corresponding to the prediction node based on third direct encoding information of the prediction node;
    基于所述N个预测节点对应的第三数值,确定所述第三上下文模型的索引。Based on third numerical values corresponding to the N prediction nodes, an index of the third context model is determined.
  79. 根据权利要求78所述的方法,其特征在于,所述基于所述N个预测节点对应的第三数值,确定所述第三上下文模型的索引,包括:The method according to claim 78, characterized in that the determining the index of the third context model based on the third values corresponding to the N prediction nodes comprises:
    确定所述预测节点对应的第一权重;Determining a first weight corresponding to the prediction node;
    基于所述第一权重,对所述N个预测节点对应的第三数值进行加权处理,得到第五加权预测值;Based on the first weight, weighting the third values corresponding to the N prediction nodes to obtain a fifth weighted prediction value;
    基于所述第五加权预测值,确定所述第三上下文模型的索引。Based on the fifth weighted prediction value, an index of the third context model is determined.
  80. 根据权利要求77所述的方法,其特征在于,若所述K大于1时,则所述基于所述N个预测节点的第三直接编码信息,确定第三上下文模型的索引,包括:The method according to claim 77, characterized in that if the K is greater than 1, then the determining the index of the third context model based on the third direct encoding information of the N prediction nodes comprises:
    针对所述K个预测参考帧中的第j个预测参考帧,基于所述当前节点在所述第j个预测参考帧中的预测节点的第三直接编码信息,确定所述第j个预测参考帧中的预测节点对应的第三数值,所述j为小于或等于K的正整数;For a j-th prediction reference frame among the K prediction reference frames, determining a third value corresponding to the prediction node in the j-th prediction reference frame based on third direct encoding information of the prediction node of the current node in the j-th prediction reference frame, where j is a positive integer less than or equal to K;
    确定所述预测节点对应的第一权重,并基于所述第一权重对所述第j个预测参考帧中的预测节点对应的第三数值进行加权处理,得到所述第j个预测参考帧对应的第六加权预测值;Determine a first weight corresponding to the prediction node, and perform weighted processing on a third value corresponding to the prediction node in the j-th prediction reference frame based on the first weight to obtain a sixth weighted prediction value corresponding to the j-th prediction reference frame;
    基于所述K个预测参考帧对应的第六加权预测值,确定所述第三上下文模型的索引。Based on the sixth weighted prediction values corresponding to the K prediction reference frames, an index of the third context model is determined.
  81. 根据权利要求80所述的方法,其特征在于,所述基于所述K个预测参考帧对应的第六加权预测值,确定所述第三上下文模型的索引,包括:The method according to claim 80, characterized in that the determining the index of the third context model based on the sixth weighted prediction values corresponding to the K prediction reference frames comprises:
    确定所述K个预测参考帧对应的第二权重;Determine second weights corresponding to the K prediction reference frames;
    基于所述第二权重对所述K个预测参考帧分别对应的第六加权预测值进行加权处理,得到所述第三上下文模型的索引。Based on the second weight, the sixth weighted prediction values corresponding to the K prediction reference frames are weighted to obtain the index of the third context model.
  82. 根据权利要求78或80所述的方法,其特征在于,所述第三直接编码信息包括第二信息和第四信息中的至少一个,所述第二信息表示所述预测节点是否满足所述直接编码方式进行编码的条件,所述第四信息表示所述预测节点的第三个点是否为重复点,其中,基于所述预测节点的第三直接编码信息,确定所述预测节点对应的第三数值,包括:The method according to claim 78 or 80, characterized in that the third direct encoding information includes at least one of second information and fourth information, the second information indicates whether the predicted node meets the condition for encoding by the direct encoding method, and the fourth information indicates whether the third point of the predicted node is a repeated point, wherein determining the third value corresponding to the predicted node based on the third direct encoding information of the predicted node includes:
    基于所述第二信息,确定第一子值;Based on the second information, determining a first sub-value;
    基于所述第四信息,确定第四子值;Based on the fourth information, determining a fourth sub-value;
    基于所述第一子值和/或所述第四子值,确定所述预测节点对应的第三数值。Based on the first sub-value and/or the fourth sub-value, a third numerical value corresponding to the prediction node is determined.
  83. 根据权利要求82所述的方法,其特征在于,所述基于所述第四信息,确定第四子值,包括:The method according to claim 82, characterized in that the determining the fourth sub-value based on the fourth information comprises:
    若所述第四信息表示所述预测节点的第三个点是重复点,则确定所述第四子值的取值为第一值;If the fourth information indicates that the third point of the predicted node is a repeated point, determining that the value of the fourth subvalue is the first value;
    若所述第四信息表示所述预测节点的第三个点不是重复点,则确定所述第四子值的取值为第二值。If the fourth information indicates that the third point of the predicted node is not a repeated point, the value of the fourth sub-value is determined to be the second value.
  84. 根据权利要求76所述的方法,其特征在于,所述方法还包括:The method according to claim 76, characterized in that the method further comprises:
    若所述第三标志表示所述当前节点的第三个点为重复点时,则对所述当前节点所包括的剩余重复点数目进行编码。If the third flag indicates that the third point of the current node is a repeated point, the number of remaining repeated points included in the current node is encoded.
  85. 根据权利要求66、74或82所述的方法,其特征在于,所述基于所述第二信息,确定第一子值,包括:The method according to claim 66, 74 or 82, characterized in that the determining the first sub-value based on the second information comprises:
    若所述第二信息指示所述预测节点满足所述直接编码方式进行编码的条件时,则确定所述第一子值的取值为第一值;If the second information indicates that the prediction node satisfies the condition for encoding in the direct encoding mode, determining that the value of the first subvalue is a first value;
    若所述第二信息指示所述预测节点不满足所述直接编码方式进行编码的条件时,则确定所述第一子值的取值为第二值。If the second information indicates that the prediction node does not meet the condition for encoding in the direct encoding manner, the value of the first sub-value is determined to be a second value.
  86. 根据权利要求66、74或82所述的方法,其特征在于,所述第二信息包括所述预测节点的直接编码模式信息。The method according to claim 66, 74 or 82 is characterized in that the second information includes direct encoding mode information of the prediction node.
  87. 根据权利要求63、64、71、72、79或80所述的方法,其特征在于,确定所述预测节点对应的第一权重,包括:The method according to claim 63, 64, 71, 72, 79 or 80, characterized in that determining the first weight corresponding to the prediction node comprises:
    基于所述预测节点对应的领域节点与所述当前节点之间的距离,确定所述预测节点对应的第一权重。Based on the distance between the domain node corresponding to the prediction node and the current node, a first weight corresponding to the prediction node is determined.
  88. 根据权利要求65、73或81所述的方法,其特征在于,所述确定所述K个预测参考帧对应的第二权重,包括:The method according to claim 65, 73 or 81, characterized in that the determining the second weights corresponding to the K prediction reference frames comprises:
    基于所述预测参考帧与所述当前待编码帧之间的时间差距,确定所述预测参考帧对应的第二权重。Based on the time difference between the predicted reference frame and the current frame to be encoded, a second weight corresponding to the predicted reference frame is determined.
  89. 一种点云解码装置,其特征在于,包括:A point cloud decoding device, characterized by comprising:
    确定单元,用于确定当前节点对应的帧间解码信息,所述当前节点为当前待解码帧中的待解码节点;A determination unit, configured to determine inter-frame decoding information corresponding to a current node, wherein the current node is a node to be decoded in a current frame to be decoded;
    解码单元,用于基于所述当前节点对应的帧间解码信息,对所述当前节点所包括的点数进行预测解码。A decoding unit is used to predict and decode the number of points included in the current node based on the inter-frame decoding information corresponding to the current node.
  90. 一种点云编码装置,其特征在于,包括:A point cloud encoding device, characterized by comprising:
    确定单元,用于确定当前节点对应的帧间编码信息,所述当前节点为当前待编码帧中的待编码节点;A determination unit, configured to determine inter-frame coding information corresponding to a current node, wherein the current node is a node to be coded in a current frame to be coded;
    编码单元,用于基于所述当前节点对应的帧间编码信息,对所述当前节点所包括的点数进行预测编码。The encoding unit is used to perform predictive encoding on the number of points included in the current node based on the inter-frame encoding information corresponding to the current node.
  91. 一种电子设备,其特征在于,包括:处理器和存储器;An electronic device, characterized in that it comprises: a processor and a memory;
    所述存储器用于存储计算机程序;The memory is used to store computer programs;
    所述处理器用于调用并运行所述存储器中存储的计算机程序,以执行如权利要求1至45或46至88任一项所述的方法。The processor is used to call and run the computer program stored in the memory to perform the method according to any one of claims 1 to 45 or 46 to 88.
  92. 一种计算机可读存储介质,其特征在于,用于存储计算机程序,所述计算机程序使得计算机执行如权利要求1至45或46至88任一项所述的方法。A computer-readable storage medium, characterized in that it is used to store a computer program, wherein the computer program enables a computer to execute the method according to any one of claims 1 to 45 or 46 to 88.
PCT/CN2023/071069 2023-01-06 2023-01-06 Point cloud coding method and apparatus, point cloud decoding method and apparatus, and devices and storage medium WO2024145933A1 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
PCT/CN2023/071069 WO2024145933A1 (en) 2023-01-06 2023-01-06 Point cloud coding method and apparatus, point cloud decoding method and apparatus, and devices and storage medium

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
PCT/CN2023/071069 WO2024145933A1 (en) 2023-01-06 2023-01-06 Point cloud coding method and apparatus, point cloud decoding method and apparatus, and devices and storage medium

Publications (1)

Publication Number Publication Date
WO2024145933A1 true WO2024145933A1 (en) 2024-07-11

Family

ID=91803374

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2023/071069 WO2024145933A1 (en) 2023-01-06 2023-01-06 Point cloud coding method and apparatus, point cloud decoding method and apparatus, and devices and storage medium

Country Status (1)

Country Link
WO (1) WO2024145933A1 (en)

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109196559A (en) * 2016-05-28 2019-01-11 微软技术许可有限责任公司 The motion compensation of dynamic voxelization point cloud is compressed
CN112565764A (en) * 2020-12-03 2021-03-26 西安电子科技大学 Point cloud geometric information interframe coding and decoding method
WO2022042538A1 (en) * 2020-08-24 2022-03-03 北京大学深圳研究生院 Block-based point cloud geometric inter-frame prediction method and decoding method
TW202234884A (en) * 2020-12-29 2022-09-01 美商高通公司 Inter prediction coding for geometry point cloud compression

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109196559A (en) * 2016-05-28 2019-01-11 微软技术许可有限责任公司 The motion compensation of dynamic voxelization point cloud is compressed
WO2022042538A1 (en) * 2020-08-24 2022-03-03 北京大学深圳研究生院 Block-based point cloud geometric inter-frame prediction method and decoding method
CN112565764A (en) * 2020-12-03 2021-03-26 西安电子科技大学 Point cloud geometric information interframe coding and decoding method
TW202234884A (en) * 2020-12-29 2022-09-01 美商高通公司 Inter prediction coding for geometry point cloud compression

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
D. MEHLEM (RWTH-AACHEN), C. ROHLFING (RWTH): "Versatile Video Coding for VPCC", 130. MPEG MEETING; 20200420 - 20200424; ALPBACH; (MOTION PICTURE EXPERT GROUP OR ISO/IEC JTC1/SC29/WG11), 8 April 2020 (2020-04-08), XP030287077 *

Similar Documents

Publication Publication Date Title
US11910017B2 (en) Method for predicting point cloud attribute, encoder, decoder, and storage medium
TWI806481B (en) Method and device for selecting neighboring points in a point cloud, encoding device, decoding device and computer device
TW202249488A (en) Point cloud attribute prediction method and apparatus, and codec
WO2024145933A1 (en) Point cloud coding method and apparatus, point cloud decoding method and apparatus, and devices and storage medium
WO2024145934A1 (en) Point cloud coding/decoding method and apparatus, and device and storage medium
WO2024145913A1 (en) Point cloud encoding and decoding method and apparatus, device, and storage medium
WO2024145935A1 (en) Point cloud encoding method and apparatus, point cloud decoding method and apparatus, device, and storage medium
WO2024145912A1 (en) Point cloud coding method and apparatus, point cloud decoding method and apparatus, device, and storage medium
WO2024145911A1 (en) Point cloud encoding/decoding method and apparatus, device and storage medium
WO2024065269A1 (en) Point cloud encoding and decoding method and apparatus, device, and storage medium
WO2024065270A1 (en) Point cloud encoding method and apparatus, point cloud decoding method and apparatus, devices, and storage medium
WO2024065271A1 (en) Point cloud encoding/decoding method and apparatus, and device and storage medium
WO2024065272A1 (en) Point cloud coding method and apparatus, point cloud decoding method and apparatus, and device and storage medium
WO2023024840A1 (en) Point cloud encoding and decoding methods, encoder, decoder and storage medium
WO2023024842A1 (en) Point cloud encoding/decoding method, apparatus and device, and storage medium
WO2024026712A1 (en) Point cloud coding method and apparatus, point cloud decoding method and apparatus, and device and storage medium
TW202425635A (en) Point cloud encoding and decoding method, device, equipment and storage medium
WO2024065406A1 (en) Encoding and decoding methods, bit stream, encoder, decoder, and storage medium
TW202425653A (en) Point cloud encoding and decoding method, device, equipment and storage medium
TW202425629A (en) Point cloud encoding and decoding method, device, equipment and storage medium
WO2024011381A1 (en) Point cloud encoding method and apparatus, point cloud decoding method and apparatus, device and storage medium
WO2022257150A1 (en) Point cloud encoding and decoding methods and apparatus, point cloud codec, and storage medium
US20230082456A1 (en) Point cloud attribute prediction method and apparatus, and related device
US20240244242A1 (en) Reduced Memory Coding
WO2024148598A1 (en) Encoding method, decoding method, encoder, decoder, and storage medium