CN115456900A - Improved Transformer-based Qin tomb terracotta warrior fragment denoising method - Google Patents


Info

Publication number: CN115456900A
Application number: CN202211133859.0A
Authority: CN (China)
Prior art keywords: point, point cloud, sampling, manifold, points
Legal status: Pending (the legal status is an assumption and is not a legal conclusion; Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed)
Other languages: Chinese (zh)
Inventors: 徐雪丽, 耿国华, 王红珍, 王敬禹, 周明全, 曹欣
Current assignee: Northwest University; Yanan University (the listed assignees may be inaccurate; Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list)
Original assignee: Northwest University; Yanan University
Application filed by Northwest University and Yanan University
Priority to CN202211133859.0A
Publication of CN115456900A

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06T IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T5/00 Image enhancement or restoration
    • G06T5/70 Denoising; Smoothing
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06V IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00 Arrangements for image or video recognition or understanding
    • G06V10/70 Arrangements for image or video recognition or understanding using pattern recognition or machine learning
    • G06V10/764 Arrangements for image or video recognition or understanding using pattern recognition or machine learning using classification, e.g. of video objects
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06V IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00 Arrangements for image or video recognition or understanding
    • G06V10/70 Arrangements for image or video recognition or understanding using pattern recognition or machine learning
    • G06V10/77 Processing image or video features in feature spaces; using data integration or data reduction, e.g. principal component analysis [PCA] or independent component analysis [ICA] or self-organising maps [SOM]; Blind source separation
    • G06V10/774 Generating sets of training patterns; Bootstrap methods, e.g. bagging or boosting
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06T IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00 Indexing scheme for image analysis or image enhancement
    • G06T2207/10 Image acquisition modality
    • G06T2207/10028 Range image; Depth image; 3D point clouds

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Physics & Mathematics (AREA)
  • Evolutionary Computation (AREA)
  • Databases & Information Systems (AREA)
  • Health & Medical Sciences (AREA)
  • General Health & Medical Sciences (AREA)
  • Medical Informatics (AREA)
  • Software Systems (AREA)
  • Computing Systems (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Artificial Intelligence (AREA)
  • Multimedia (AREA)
  • Image Processing (AREA)

Abstract

The invention discloses an improved Transformer-based method for denoising Qin tomb terracotta warrior fragments, which comprises the following steps: 1. preprocess the point cloud samples of the terracotta warrior data; 2. import the preprocessed point cloud samples as a training set into the input embedding module and map them to a high-dimensional space; 3. import the high-dimensional point cloud into the adaptive down-sampling module of the Transformer encoder, obtain relatively uniform points with farthest point sampling (FPS) as the original sampling points, and let the adaptive sampling (AS) module automatically learn the offset of each sampling point and update its position information, so as to reduce the data volume while keeping the structural attributes of the original point cloud model; 4. import the down-sampled result into the Transformer encoder module and enhance the point cloud features through the relative attention (RA) module so as to extract features effectively; 5. based on the output of the Transformer decoder, select points closer to the clean point cloud with an adaptive sampling method to reconstruct the three-dimensional surface; 6. iteratively train on the imported data until the loss value is small and stable, obtaining the denoised clean point cloud. The method has better robustness to high noise.

Description

Improved Transformer-based Qin tomb terracotta warrior fragment denoising method
Technical Field
The invention belongs to the technical field of cultural relic protection, and particularly relates to an improved Transformer-based method for denoising fragments of Qin tomb terracotta warriors.
Background
In the field of cultural relic excavation and protection, the initial digital acquisition of relic fragments is affected by many factors, such as the measuring equipment, the external environment and the surface characteristics of the measured object, so the initial point cloud model obtained by scanning often contains a large number of noise points. The more noise points there are, the greater the impact on point cloud quality, which directly affects the accuracy and efficiency of later tasks such as feature extraction, registration, surface reconstruction and visualization. Denoising the acquired initial digitized point cloud data is therefore an important research topic in this field.
Among traditional denoising methods, point cloud denoising based on surface fitting first fits a surface to the three-dimensional scan data of the object, then computes the distance between each point and the fitted surface, and finally deletes gross errors or outliers of the point cloud data according to a given criterion. This is a simple and effective estimation method, but its accuracy is limited, and large computational errors arise for complex models and models containing noise. The moving robust principal component analysis method based on sparse representation theory computes the estimated position of each point by local averaging, preserves sharp features through a weighted minimization, and updates point positions by weighting the similarity between normal vectors in local neighborhoods to eliminate noise. When the noise level is high, however, its performance tends to degrade due to over-smoothing or over-sharpening.
In recent years, artificial intelligence methods represented by deep learning have achieved a series of important breakthroughs and received unprecedented attention. PointNet pioneered feature learning directly on point clouds with a deep model; to guarantee permutation invariance it applies a learned transformation matrix to the point cloud, which makes the points overly independent, and to achieve order independence the network extracts a global feature from all point cloud data with a global pooling operation, which ignores the geometric correlation between points and loses part of the local feature information. Improved networks based on PointNet, such as PointNet++, Neural Projection, PointCleanNet and Total Denoising, take the local properties of points into account to improve model performance. These methods can infer the displacement of noisy points from the underlying surface and reconstruct points, but the reconstructed points are not constrained to restore an explicit surface, which may lead to sub-optimal denoising results.
Disclosure of Invention
Aiming at the defects in the prior art, the invention aims to provide an improved Transformer-based method for denoising Qin tomb terracotta warrior fragments, which learns the potential manifold of the noisy point cloud and captures its intrinsic structure to recover the surface and reconstruct the manifold, with better robustness to high noise.
In order to achieve the purpose, the invention adopts the following technical scheme:
a method for denoising Qin warriors fragments based on an improved Transformer comprises the following steps:
step 1, preprocessing a point cloud sample of Qin warrior data so as to realize data enhancement and labeling processing;
step 2, all the preprocessed point cloud samples are used as training sets, and are led into an input embedding module in batches and are mapped to a high-dimensional space;
step 3, importing the point cloud of the high-dimensional space into an adaptive down-sampling module in an improved transform encoder, firstly obtaining relatively uniform points by using a maximum-distance point sampling algorithm (FPS) AS original sampling points, and then automatically learning the offset of each sampling point by using an adaptive neighborhood sampling Algorithm (AS) and updating the position information of the sampling points, thereby reducing the data volume and retaining the structural attribute of an original point cloud model;
step 4, importing the down-sampled result into an improved Transformer encoder module, and enhancing the characteristics of the point cloud through a relative attention RA module of the point cloud so as to effectively extract the characteristics;
step 5, taking the output of an improved transform decoder as a basis, reconstructing the manifold of each point, sampling the manifold structure corresponding to each point in proportion, and selecting the point closer to the clean point cloud by using a self-adaptive sampling method to reconstruct the three-dimensional surface;
and 6, continuously performing iterative training of the steps 3 to 5 on the imported data by utilizing an improved Transformer encoder-decoder framework until the loss value of the loss function is small and tends to be stable, and obtaining the denoised clean point cloud.
Further, the data enhancement of step 1 includes rotating, translating and scaling the data.
Further, the adaptive neighborhood sampling algorithm AS of step 3 includes the following steps:
Step 3.1, let $P_s$ be a set of $N_s$ points sampled from the $N$ input points, with $N_s < N$; $x_i \in P_s$ is a sampling point and $f_i \in F_s$ is the feature of $x_i$. The neighbors of sampling point $x_i$ are queried by k-NN and grouped, and a general self-attention mechanism is used to update the features;
Step 3.2, the features $f_{i,1},\ldots,f_{i,k}$ corresponding to the $k$ neighbors $x_{i,1},\ldots,x_{i,k}$ of sampling point $x_i$ are updated as:
$\tilde{f}_i = \mathcal{A}\big(R(x_i, x_{i,j})\,\gamma(f_{i,j})\big),\ j \in [1,k]$ (1)
where $\mathcal{A}$ is used for aggregation, $R$ describes the high-level relation between the sampling point $x_i$ and the neighbor point $x_{i,j}$, and $\gamma$ changes the feature dimension of each neighbor point; to reduce the amount of computation, let $\gamma(x_{i,j}) = W_\gamma f_{i,j}$. The relation function $R$ is expressed as:
$R(x_i, x_{i,j}) = \mathrm{Softmax}\big((W_\phi f_i)^{T}(W_\psi f_{i,j})/\sqrt{D'}\big)$ (2)
where $D'$ is the output channel of the Conv;
Step 3.3, for each sampling point $x_i$, MLP + Softmax is used to obtain the normalized weights $W_p$ and $W_f$ of the coordinates and feature channels of each point in the group, expressed as:
$W_p = \mathrm{Softmax}(\mathrm{mlp}(x_{i,j}))$ (3)
$W_f = \mathrm{Softmax}(\mathrm{mlp}(f_{i,j}))$ (4)
in formulas (3) and (4), $j \in [1,k]$;
Step 3.4, the adaptive update of the sampling point $x_i$ and its feature $f_i$ is realized by a weighted summation, i.e. the updated point information is expressed as:
$x_i^{\ast} = \sum_{j=1}^{k} W_{p,j}\, x_{i,j}$ (5)
$f_i^{\ast} = \sum_{j=1}^{k} W_{f,j}\, f_{i,j}$ (6)
further, the relative attention RA module of the point cloud in step 4 is used to calculate the relative attention feature between the self-attention SA module feature and the input feature, and is expressed as:
F ra =F in -F sa (7)
in the formula, F ra For relative attention features, F in As input features, F sa Is a self-attention SA module feature;
finally, the relative attention feature F ra Inputting features F over a network in Relative attention RA Module Final output feature F as the entire Point cloud out Is represented as:
F out =RA(F in )=relu(bn(mlp(F ra )))+F in 。 (8)
further, the self-attention SA module is characterized by:
Figure BDA0003851057360000041
wherein (Q, K, V) = F in .(W q ,W k ,W v );W q 、W k And W v To share the learned weight matrix, Q, K, and V are the Query, key, and Value matrices, respectively, generated by linear transformation of the input features,
Figure BDA0003851057360000042
one dimension for the query and key vectors.
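The scaled dot-product form of Eq. (9) can be sketched in a few lines of NumPy. The shapes, weight initializations and function name below are illustrative, not taken from the patent, and a single attention head without batching is assumed:

```python
import numpy as np

def self_attention(F_in, W_q, W_k, W_v):
    """Scaled dot-product self-attention over point features (Eq. (9), sketched).

    F_in: (N, D) input point features; W_q, W_k, W_v: (D, d_k) shared
    projection matrices (hypothetical shapes)."""
    Q, K, V = F_in @ W_q, F_in @ W_k, F_in @ W_v
    d_k = Q.shape[-1]                                    # dimension of query/key vectors
    scores = Q @ K.T / np.sqrt(d_k)                      # (N, N) pairwise similarities
    attn = np.exp(scores - scores.max(axis=-1, keepdims=True))
    attn /= attn.sum(axis=-1, keepdims=True)             # row-wise Softmax
    return attn @ V                                      # weighted sum of Values

rng = np.random.default_rng(0)
F_in = rng.normal(size=(8, 16))
W_q, W_k, W_v = (rng.normal(size=(16, 16)) * 0.1 for _ in range(3))
F_sa = self_attention(F_in, W_q, W_k, W_v)
```

Because the attention weights are computed pairwise from the features themselves, permuting the input rows simply permutes the output rows, which is the order independence the encoder relies on.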
Further, the calculation formula (7) of the relative attention feature acts as a discrete Laplacian operator. In a graph $G$ with $N$ nodes, the $N$-dimensional vector $f$ is expressed as:
$f = (f_1, f_2, \ldots, f_N)$ (10)
where $f_i$ is the value of the function $f$ at node $i$;
Perturbing node $i$ may change it into any node $j$ adjacent to it; the gain caused by changing any adjacent node $j$ to node $i$ is expressed as:
$\Delta f_i = \sum_{j \in N_i} (f_i - f_j)$ (11)
When edge $E_{ij}$ has a weight $w_{ij}$:
$\Delta f_i = \sum_{j \in N_i} w_{ij}(f_i - f_j)$ (12)
When $w_{ij} = 0$, node $i$ and node $j$ are not adjacent, so the sum can be extended to all nodes:
$\Delta f_i = \sum_{j=1}^{N} w_{ij}(f_i - f_j)$ (13)
Finally:
$\Delta f_i = d_i f_i - w_{i:} f$ (14)
where $d_i = \sum_j w_{ij}$ is the degree of vertex $i$; $w_{i:} = (w_{i1}, \ldots, w_{iN})$ is an $N$-dimensional row vector; $f$ is an $N$-dimensional column vector; and $w_{i:} f$ denotes the inner product of the two vectors. The gain accumulated over all $N$ nodes is expressed as:
$\Delta f = (\Delta f_1, \ldots, \Delta f_N)^{T} = Df - Wf = (D - W)f = Lf$ (15)
where $w_{i:}$ represents the weight corresponding to the $i$-th point, $W$ represents the weights corresponding to all points, and $D - W$ is the Laplacian matrix $L$.
Further, in step 5, the decoder first converts the embedded features of each sampling point and its neighborhood into a local curved surface centered on that point to infer the potential manifold, and then repeatedly samples the inferred manifold to generate the noise-reduced point set $\hat{P}$, reconstructing a clean point cloud.
Further, step 5 specifically includes the following steps:
Step 5.1, first define the 2D manifold $M$ embedded in 3D space, parameterized by the feature vector $y$, as:
$M(u,v;y): [-1,1]\times[-1,1] \rightarrow \mathbb{R}^{3}$ (16)
where $(u,v)$ is a point in the 2D rectangular region $[-1,1]^{2}$;
The 2D rectangle of formula (16) is mapped by the function approximator MLP to a patch manifold of arbitrary shape parameterized by $y$, expressed as:
$M_i(u,v;y_i) = \mathrm{MLP}_M([u,v,y_i])$ (17)
where $M_i(u,v;y_i)$ represents a parameterized patch manifold;
Step 5.2, the patch manifold $M_i$ corresponding to a point $p_i$ in the adaptively down-sampled set $\hat{P}_s$ is defined as:
$M_i(u,v;y_i) = p_i + M(u,v;y_i)$ (18)
Formula (18) moves the constructed manifold $M(u,v;y_i)$ to a local surface centered at $p_i$; the patch manifolds corresponding to all points in $\hat{P}_s$ can be expressed as $\{M_i\}_{i=1}^{M}$, i.e. they characterize the potential surface of the point cloud;
Step 5.3, the adaptive down-sampling of steps 5.1 and 5.2 halves the number of input points, i.e. $M = N/2$; each patch manifold $M_i(u,v;y_i)$ is resampled, and two points are collected on each patch manifold to obtain the denoised point cloud $\hat{P}$, expressed as:
$\hat{P} = \bigcup_{i=1}^{M} \{\, p_i + M(u_t, v_t; y_i) \mid t = 1, 2 \,\}$ (19)
further, the loss function loss in the step (6) includes a loss function Las and a loss function Lus;
the loss function Las is used for quantizing the adaptive down-sampling set
Figure BDA0003851057360000062
Distance from ground route point cloud Pgt due to
Figure BDA0003851057360000063
And Pgt contains a different number of dots, and
Figure BDA0003851057360000064
therefore, the chamfer distance CD is selected as L as Expressed as:
Figure BDA0003851057360000065
the loss function Lus is used to quantify the final reconstructed point cloud
Figure BDA0003851057360000066
The distance from the ground channel Pgt, using the earth movement distance EMD as Lus, is expressed as:
Figure BDA0003851057360000067
in the formula (I), the compound is shown in the specification,
Figure BDA0003851057360000068
phi is bijective;
finally, the network is trained end-to-end with supervision, and the minimum total loss function is expressed as:
L denoise =λL as +(1-λ)L us (22)
in the formula, λ is an empirical value of 0.01.
Compared with the prior art, the invention has the following technical effects:
based on the high performance of a Transformer in natural language processing and the sequence independence of all operations, the method is improved and is very suitable for point cloud feature learning, the problem that a relative attention RA module in the improved Transformer is very sensitive to abnormal points in a widely used FPS (remote data system) which is a farthest point sampling method, so that point cloud data in the real world are very unstable when being processed, and the problem that the sampling points from the FPS are required to be subsets of original point clouds, so that the inference of the original data and original geometric information is increased, basic point cloud information can be extracted in a self-adaptive mode, a more valuable basis is provided for smooth development of subsequent work, and meanwhile, an attention mechanism and global pooling operation are used in the feature extraction process, so that not only global information can be extracted, but also the integrity of local detailed information can be kept. 
Specifically, abundant high-dimensional characteristics of a point cloud sample of Qin terracotta warrior data are obtained by utilizing an improved Transformer structure, potential manifold of noise point cloud is learned from sampling points, namely, the sampling points obtained by an FPS are self-adapted through a self-adaptive down-sampling module, and are closer to the surface where the points are located; and then converting each sampling point and the embedded neighborhood characteristics thereof into a local surface to infer the surface manifold, reconstructing a clean point cloud capturing an internal structure by sampling on each surface manifold without being influenced by abnormal values, recovering the surface to reconstruct the manifold, realizing denoising, having good robustness under synthetic noise and real noise, and having good propulsion effect on the virtual recovery work of the computer-aided Qin warriors.
Drawings
FIG. 1: structure of the point cloud denoising network based on the improved Transformer;
FIG. 2: qualitative comparison of different denoising methods;
FIG. 3: schematic diagram of the adaptive down-sampling principle;
FIG. 4: structure of the relative attention module RA; the dashed box shows the structure of the self-attention module SA;
FIG. 5: schematic diagram of patch manifold reconstruction and resampling.
Detailed Description
The present invention will be explained in further detail with reference to examples.
As shown in fig. 1, a method for denoising Qin tomb terracotta warrior fragments based on an improved Transformer includes the following steps:
Step 1, preprocess the point cloud samples of the terracotta warrior data: realize data enhancement through rotation, translation and scaling, and label the data;
Step 2, use all the preprocessed point cloud samples as the training set, import them in batches into the input embedding module, and map them to a high-dimensional space;
Step 3, import the high-dimensional point cloud into the adaptive down-sampling module in the improved Transformer encoder, so as to reduce the data volume while keeping the structural attributes of the original point cloud model as much as possible;
The specific process is shown in fig. 3: first, the farthest point sampling algorithm FPS is used to obtain relatively uniform points as the original sampling points, and then the adaptive neighborhood sampling algorithm AS automatically learns the offset of each sampling point and updates its position information;
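The FPS step above can be sketched as a greedy loop that repeatedly picks the point farthest from the set already chosen. This is a generic FPS implementation under assumed array shapes, not code from the patent:

```python
import numpy as np

def farthest_point_sampling(points, n_samples):
    """Greedy farthest point sampling.

    points: (N, 3) point cloud; returns indices of n_samples points that
    cover the cloud relatively uniformly."""
    N = points.shape[0]
    chosen = [0]                                   # start from an arbitrary point
    dist = np.full(N, np.inf)                      # distance to nearest chosen point
    for _ in range(n_samples - 1):
        d = np.linalg.norm(points - points[chosen[-1]], axis=1)
        dist = np.minimum(dist, d)                 # update nearest-chosen distances
        chosen.append(int(np.argmax(dist)))        # pick the farthest remaining point
    return np.array(chosen)

rng = np.random.default_rng(1)
pts = rng.uniform(size=(1024, 3))
idx = farthest_point_sampling(pts, 256)
```

The distance of every already-chosen point to the chosen set is zero, so the argmax never re-selects a point for distinct inputs; the AS module then refines the positions of these seeds.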
The adaptive neighborhood sampling algorithm AS specifically comprises the following steps:
Step 3.1, let $P_s$ be a set of $N_s$ points sampled from the $N$ input points, with $N_s < N$; $x_i \in P_s$ is a sampling point and $f_i \in F_s$ is the feature of $x_i$, with corresponding feature dimension $D$. The neighbors of sampling point $x_i$ are queried by k-NN and grouped, and a general self-attention mechanism is used to update the features;
Step 3.2, the features $f_{i,1},\ldots,f_{i,k}$ corresponding to the $k$ neighbors $x_{i,1},\ldots,x_{i,k}$ of sampling point $x_i$ are updated as:
$\tilde{f}_i = \mathcal{A}\big(R(x_i, x_{i,j})\,\gamma(f_{i,j})\big),\ j \in [1,k]$ (1)
where $\mathcal{A}$ is used for aggregation, $R$ describes the high-level relation between the sampling point $x_i$ and the neighbor point $x_{i,j}$, and $\gamma$ changes the feature dimension of each neighbor point; to reduce the amount of computation, let $\gamma(x_{i,j}) = W_\gamma f_{i,j}$. The relation function $R$ is expressed as:
$R(x_i, x_{i,j}) = \mathrm{Softmax}\big((W_\phi f_i)^{T}(W_\psi f_{i,j})/\sqrt{D'}\big)$ (2)
where $D'$ is the output channel of the Conv;
Step 3.3, for each sampling point $x_i$, MLP + Softmax is used to obtain the normalized weights $W_p$ and $W_f$ of the coordinates and feature channels of each point in the group, expressed as:
$W_p = \mathrm{Softmax}(\mathrm{mlp}(x_{i,j}))$ (3)
$W_f = \mathrm{Softmax}(\mathrm{mlp}(f_{i,j}))$ (4)
in formulas (3) and (4), $j \in [1,k]$;
Step 3.4, the adaptive update of the sampling point $x_i$ and its feature $f_i$ is realized by a weighted summation, i.e. the updated point information is expressed as:
$x_i^{\ast} = \sum_{j=1}^{k} W_{p,j}\, x_{i,j}$ (5)
$f_i^{\ast} = \sum_{j=1}^{k} W_{f,j}\, f_{i,j}$ (6)
The adaptive down-sampling operation obtains points closer to the potential surface, i.e. points less disturbed by noise, which helps reduce the search space when reconstructing the potential manifold;
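The weighted-sum update of Eqs. (3)-(6) can be sketched for a single sampling point and its k-NN group. The two per-point score networks below are stand-ins (random linear maps) for the trained MLPs, and all names and shapes are illustrative:

```python
import numpy as np

def softmax(z, axis=0):
    e = np.exp(z - z.max(axis=axis, keepdims=True))
    return e / e.sum(axis=axis, keepdims=True)

def adaptive_update(x_nbrs, f_nbrs, mlp_p, mlp_f):
    """Adaptive update of one sampling point (Eqs. (3)-(6), sketched).

    x_nbrs: (k, 3) coordinates of the k-NN group of a sampling point;
    f_nbrs: (k, D) their features; mlp_p / mlp_f: placeholder score
    networks returning one score per neighbor."""
    W_p = softmax(mlp_p(x_nbrs), axis=0)          # (k, 1) normalized coordinate weights
    W_f = softmax(mlp_f(f_nbrs), axis=0)          # (k, 1) normalized feature weights
    x_new = (W_p * x_nbrs).sum(axis=0)            # Eq. (5): weighted-sum coordinate
    f_new = (W_f * f_nbrs).sum(axis=0)            # Eq. (6): weighted-sum feature
    return x_new, f_new

rng = np.random.default_rng(2)
k, D = 16, 32
x_nbrs, f_nbrs = rng.normal(size=(k, 3)), rng.normal(size=(k, D))
Wp, Wf = rng.normal(size=(3, 1)) * 0.1, rng.normal(size=(D, 1)) * 0.1
x_new, f_new = adaptive_update(x_nbrs, f_nbrs, lambda x: x @ Wp, lambda f: f @ Wf)
```

Because the Softmax weights are non-negative and sum to one over the group, the updated coordinate is a convex combination of the neighbors, i.e. it stays inside their bounding region, which is what pulls each sampling point toward the underlying surface.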
Step 4, import the down-sampled result into the improved Transformer encoder module, and enhance the point cloud features through its relative attention RA module so as to extract features effectively;
Attention screens a small amount of important information out of a large amount of information and focuses on it, ignoring most of the unimportant information; the larger the weight, the more the corresponding Value is attended to. Self-attention (SA) in the original Transformer is a mechanism that attends to the other words of an input sentence while encoding each word; the architecture of the SA layer is shown in the dashed box of fig. 4. When switching to point data, Q, K and V are, by convention, the Query, Key and Value matrices generated by linear transformation of the input features. A weighting coefficient is first calculated from Query and Key, and Value is then summed with these weights. The most common ways to obtain the weighting coefficient are the vector dot product of the two, their cosine similarity, or an additional neural network; this embodiment uses the vector dot product. To prevent the result from becoming too large, it is divided by the scale $\sqrt{d_k}$, where $d_k$ is the dimension of the query and key vectors; Softmax then normalizes the result into a probability distribution, which is multiplied by the Value matrix to obtain the weighted-sum representation, i.e. the self-attention SA module feature is:
$F_{sa} = \mathrm{Softmax}\big(QK^{T}/\sqrt{d_k}\big)V$ (9)
where $(Q,K,V) = F_{in}\cdot(W_q,W_k,W_v)$, and $W_q$, $W_k$ and $W_v$ are shared learnable weight matrices.
As can be seen from the calculation of formula (9), the whole self-attention process is permutation invariant, which makes it well suited to the disorder and irregularity of point clouds. However, the absolute coordinates of the same point cloud after a rigid transformation differ greatly from those before the transformation. To describe the intrinsic characteristics of the point cloud, this embodiment introduces the relative attention feature, inspired by the use of the Laplacian matrix $L = D - A$ in place of the adjacency matrix $A$ in graph convolution networks, where $D$ is a diagonal matrix and each diagonal element $D_{ii}$ represents the degree of the $i$-th node. The self-attention SA module in the original Transformer is replaced by a relative attention RA module to enhance the feature representation of the point cloud. As shown in fig. 4, the RA module calculates the relative attention feature between the self-attention SA feature and the input feature, expressed as:
$F_{ra} = F_{in} - F_{sa}$ (7)
where $F_{ra}$ is the relative attention feature, $F_{in}$ is the input feature, and $F_{sa}$ is the self-attention SA module feature;
Finally, the relative attention feature passes through the network and, together with the input feature, forms the final output feature $F_{out}$ of the whole RA module:
$F_{out} = \mathrm{RA}(F_{in}) = \mathrm{relu}(\mathrm{bn}(\mathrm{mlp}(F_{ra}))) + F_{in}$ (8)
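Eqs. (7)-(8) compose into a short residual block. The sketch below omits batch normalization, uses a single-layer MLP, and assumes all weight shapes for illustration; it is not the patent's trained module:

```python
import numpy as np

def relative_attention(F_in, W_q, W_k, W_v, W_mlp):
    """Relative attention block F_out = relu(mlp(F_in - F_sa)) + F_in
    (Eqs. (7)-(8), sketched without batch norm)."""
    Q, K, V = F_in @ W_q, F_in @ W_k, F_in @ W_v
    s = Q @ K.T / np.sqrt(K.shape[-1])
    a = np.exp(s - s.max(axis=-1, keepdims=True))
    a /= a.sum(axis=-1, keepdims=True)
    F_sa = a @ V                                  # self-attention feature (Eq. (9))
    F_ra = F_in - F_sa                            # relative attention feature (Eq. (7))
    return np.maximum(F_ra @ W_mlp, 0.0) + F_in   # relu(mlp(F_ra)) + residual (Eq. (8))

rng = np.random.default_rng(3)
F_in = rng.normal(size=(10, 8))
W_q, W_k, W_v, W_mlp = (rng.normal(size=(8, 8)) * 0.1 for _ in range(4))
F_out = relative_attention(F_in, W_q, W_k, W_v, W_mlp)
```

The residual connection keeps the input feature intact while the relu(mlp(·)) branch adds the Laplacian-like difference signal.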
$F_{in} - F_{sa}$ is similar to the discrete Laplacian operator. In a graph $G$ with $N$ nodes, the $N$-dimensional vector $f$ is represented as:
$f = (f_1, f_2, \ldots, f_N)$ (10)
where $f_i$ is the value of the function $f$ at node $i$;
Node $i$ may be perturbed into any node $j$ adjacent to it; since the Laplacian operator calculates the gain of a tiny disturbance of a point in all degrees of freedom, the gain caused by changing any adjacent node $j$ to node $i$ is, on a graph:
$\Delta f_i = \sum_{j \in N_i} (f_i - f_j)$ (11)
When edge $E_{ij}$ has a weight $w_{ij}$:
$\Delta f_i = \sum_{j \in N_i} w_{ij}(f_i - f_j)$ (12)
When $w_{ij} = 0$, node $i$ and node $j$ are not adjacent, so the sum can be extended to all nodes:
$\Delta f_i = \sum_{j=1}^{N} w_{ij}(f_i - f_j)$ (13)
Finally:
$\Delta f_i = d_i f_i - w_{i:} f$ (14)
where $d_i = \sum_j w_{ij}$ is the degree of vertex $i$; $w_{i:} = (w_{i1}, \ldots, w_{iN})$ is an $N$-dimensional row vector; $f$ is an $N$-dimensional column vector; and $w_{i:} f$ denotes the inner product of the two vectors. The gain accumulated over all $N$ nodes is expressed as:
$\Delta f = (\Delta f_1, \ldots, \Delta f_N)^{T} = Df - Wf = (D - W)f = Lf$ (15)
where $w_{i:}$ represents the weight corresponding to the $i$-th point, $W$ represents the weights corresponding to all points, and $D - W$ is the Laplacian matrix $L$.
The $i$-th row of the Laplacian matrix actually reflects the accumulated gain at the $i$-th node when it perturbs all other nodes. Intuitively, the graph Laplacian describes how a potential applied to node $i$ flows smoothly to the other nodes, which supervises and guides the iterative optimization of the model. Relative attention increases the attention weighting and reduces the effect of noise, which helps downstream tasks.
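The identity derived in Eqs. (10)-(15) is easy to verify numerically: for a symmetric non-negative weight matrix, the per-node gains $\sum_j w_{ij}(f_i - f_j)$ coincide with $(D - W)f$. The matrix size and random values below are illustrative only:

```python
import numpy as np

# Verify Eqs. (10)-(15): the per-node perturbation gains equal L f with L = D - W.
rng = np.random.default_rng(4)
N = 6
W = rng.uniform(size=(N, N))
W = (W + W.T) / 2                   # symmetric edge weights
np.fill_diagonal(W, 0.0)            # no self-loops
D = np.diag(W.sum(axis=1))          # degree matrix, d_i = sum_j w_ij
L = D - W                           # graph Laplacian
f = rng.normal(size=N)

# Gain at node i from perturbing toward every node j: sum_j w_ij (f_i - f_j)
gain = np.array([sum(W[i, j] * (f[i] - f[j]) for j in range(N)) for i in range(N)])
print(np.allclose(gain, L @ f))     # prints: True
```

The quadratic form $f^{T} L f = \frac{1}{2}\sum_{i,j} w_{ij}(f_i - f_j)^2$ is non-negative for non-negative weights, which is why the Laplacian acts as a smoothness measure.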
Step 5, taking the output of an improved transform decoder as a basis, reconstructing the manifold of each point, sampling the manifold structure corresponding to each point in proportion, and selecting the point closer to the clean point cloud by using a self-adaptive sampling method to reconstruct the three-dimensional surface;
after the decoder acquires the high-dimensional characteristic representation of the point cloud, the decoder can be used for processing a denoising task, the previous denoising task mostly depends on the idea that points are displaced from potential surfaces, but the points are not specified for restoring the surfaces, which may result in suboptimal denoising effect, the point cloud generally represents some potential surfaces or 2D manifolds of a set of sampling points, and in order to achieve the robustness of the denoising effect, the present embodiment can learn the potential manifolds of the noise point cloud, capture the inherent structures of the original point cloud, and reconstruct and restore the surfaces, and the process is as shown in (b) the decoder part in fig. 1;
the decoder transforms the embedded features of each sample point and its neighborhood into a local surface centered on the point to infer a potential manifold, and then samples the inferred manifold multiple times to produce a set of noise reduction points
Figure BDA0003851057360000111
I.e. a clean point cloud is reconstructed
Figure BDA0003851057360000112
The whole process is shown in fig. 5, and specifically comprises the following steps:
Step 5.1, first define the 2D manifold $M$ embedded in 3D space, parameterized by some feature vector $y$, as:
$M(u,v;y): [-1,1]\times[-1,1] \rightarrow \mathbb{R}^{3}$ (16)
where $(u,v)$ is a point in the 2D rectangular region $[-1,1]^{2}$;
Formula (16) maps the 2D rectangle to a patch manifold of arbitrary shape parameterized by $y$. The parameterized patch manifold $M_i(u,v;y_i)$ is realized by an MLP, because the MLP is a universal function approximator expressive enough to approximate a manifold of arbitrary shape, expressed as:
$M_i(u,v;y_i) = \mathrm{MLP}_M([u,v,y_i])$ (17)
where $M_i(u,v;y_i)$ represents a parameterized patch manifold;
Step 5.2, with the manifold $M$ defined, the patch manifold $M_i$ corresponding to a point $p_i$ in the adaptively down-sampled set $\hat{P}_s$ is defined as:
$M_i(u,v;y_i) = p_i + M(u,v;y_i)$ (18)
Formula (18) moves the constructed manifold $M(u,v;y_i)$ to a local surface centered at $p_i$; the patch manifolds corresponding to all points in $\hat{P}_s$ can be expressed as $\{M_i\}_{i=1}^{M}$, i.e. they characterize the underlying surface of the point cloud;
Step 5.3, the adaptive down-sampling of steps 5.1 and 5.2 halves the number of input points, i.e. $M = N/2$; each patch manifold $M_i(u,v;y_i)$ is resampled, and two points are sampled on each patch manifold to obtain the denoised point cloud $\hat{P}$, expressed as:
$\hat{P} = \bigcup_{i=1}^{M} \{\, p_i + M(u_t, v_t; y_i) \mid t = 1, 2 \,\}$ (19)
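The patch-manifold decoding and resampling of Eqs. (17)-(19) can be sketched with a small two-layer MLP standing in for the trained manifold decoder MLP_M. All shapes, weights and names here are illustrative assumptions:

```python
import numpy as np

def reconstruct_patches(p, y, W1, b1, W2, b2, samples_per_patch=2, rng=None):
    """Sample each patch manifold M_i(u, v; y_i) = p_i + MLP([u, v, y_i])
    (Eqs. (17)-(19), sketched).

    p: (M, 3) down-sampled points; y: (M, C) their feature vectors;
    W1/b1/W2/b2: stand-in weights for the two-layer manifold decoder."""
    rng = rng or np.random.default_rng()
    out = []
    for i in range(p.shape[0]):
        for _ in range(samples_per_patch):
            u, v = rng.uniform(-1.0, 1.0, size=2)       # (u, v) in [-1, 1]^2
            h = np.maximum(np.concatenate(([u, v], y[i])) @ W1 + b1, 0.0)
            out.append(p[i] + h @ W2 + b2)              # local patch around p_i
    return np.array(out)

rng = np.random.default_rng(5)
M, C, H = 64, 16, 32
p, y = rng.normal(size=(M, 3)), rng.normal(size=(M, C))
W1, b1 = rng.normal(size=(2 + C, H)) * 0.1, np.zeros(H)
W2, b2 = rng.normal(size=(H, 3)) * 0.1, np.zeros(3)
cloud = reconstruct_patches(p, y, W1, b1, W2, b2, rng=rng)
```

Sampling two (u, v) parameters per patch restores the original point count N = 2M, as step 5.3 requires.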
step 6, iteratively train steps 3 to 5 on the imported data with the improved Transformer encoder-decoder framework until the loss value of the loss function becomes small and stabilizes, obtaining the denoised clean point cloud.
To measure the reconstruction quality of the final point cloud, the loss function Loss consists of two parts: 1) the loss function L_as quantifies the distance between the adaptively downsampled set P̃ and the ground truth point cloud P_gt; 2) the loss function L_us quantifies the distance between the final reconstructed point cloud P̂ and the ground truth P_gt.
Since P̃ and P_gt contain different numbers of points (|P̃| = N/2 while |P_gt| = N), the chamfer distance CD is chosen as L_as, expressed as:
L_as = CD(P̃, P_gt) = (1/|P̃|) Σ_{x∈P̃} min_{y∈P_gt} ‖x − y‖₂ + (1/|P_gt|) Σ_{y∈P_gt} min_{x∈P̃} ‖x − y‖₂ (20)
The Earth Mover's Distance (EMD) is used as L_us to measure the distance between the denoised point cloud P̂ and the ground truth point cloud P_gt, expressed as:
L_us = EMD(P̂, P_gt) = min_{φ: P̂→P_gt} (1/|P̂|) Σ_{x∈P̂} ‖x − φ(x)‖₂ (21)
where φ: P̂ → P_gt is a bijection;
finally, the network is trained end-to-end with supervision, and the minimized total loss function is expressed as:
L_denoise = λL_as + (1 − λ)L_us (22)
where λ is empirically set to 0.01.
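A minimal sketch of the two loss terms and equation (22), assuming NumPy and SciPy are available; the Hungarian assignment in `linear_sum_assignment` supplies one concrete choice of the bijection φ for the EMD. This is an illustration of the loss definitions, not the patent's training code.

```python
import numpy as np
from scipy.optimize import linear_sum_assignment

def chamfer_distance(a, b):
    """Eq. (20): symmetric chamfer distance between point sets a and b,
    which may contain different numbers of points."""
    d = np.linalg.norm(a[:, None, :] - b[None, :, :], axis=-1)  # pairwise
    return d.min(axis=1).mean() + d.min(axis=0).mean()

def earth_mover_distance(a, b):
    """Eq. (21): EMD under the optimal bijection phi (Hungarian
    assignment); a and b must have the same number of points."""
    d = np.linalg.norm(a[:, None, :] - b[None, :, :], axis=-1)
    rows, cols = linear_sum_assignment(d)
    return d[rows, cols].mean()

def denoise_loss(p_ds, p_hat, p_gt, lam=0.01):
    """Eq. (22): L_denoise = lam * L_as + (1 - lam) * L_us."""
    return lam * chamfer_distance(p_ds, p_gt) + \
        (1 - lam) * earth_mover_distance(p_hat, p_gt)

p_gt = np.array([[0.0, 0, 0], [1, 0, 0], [0, 1, 0], [1, 1, 0]])
print(denoise_loss(p_gt, p_gt, p_gt))  # 0.0 for a perfect reconstruction
```

Note that CD tolerates the size mismatch between the half-size downsampled set and the ground truth, while EMD requires equal cardinality, which is why CD is used for L_as and EMD for L_us.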
The Terracotta Warriors dataset consists of models scanned and collected on site at the warrior museum by researchers of the National and Local Joint Engineering Research Center for Cultural Heritage Digitization at Northwest University; more than 500 existing models have been accurately annotated. Fig. 2 qualitatively compares the denoising effect of different methods on the head and hand datasets of the terracotta warriors; compared with the three other deep-learning-based methods NPD, TlDn and PCNet, the proposed denoising method is more robust to outliers and produces cleaner results.
In this embodiment, the Transformer-based point cloud denoising network serves as the feature extractor and has a stronger grasp of point cloud structure and semantics. Compared with the three other denoising methods, the advantage of the proposed method grows as the noise level increases; as shown in Table 1, it outperforms previous deep learning methods and is more robust to high noise.
TABLE 1 Comparison of CD (chamfer distance) for each denoising method at different noise ratios

Method            0.25%   0.5%    1%      2%      3%
NPD               0.24    0.62    1.28    2.32    3.27
PCNet             0.18    0.46    0.97    1.42    2.91
TlDn              0.34    0.78    1.15    2.26    3.12
TDNet-RA (ours)   0.16    0.39    0.83    1.20    2.15

Claims (9)

1. A method for denoising Qin Terracotta Warrior fragments based on an improved Transformer, characterized by comprising the following steps:
step 1, preprocessing the Terracotta Warrior point cloud samples to perform data enhancement and annotation;
step 2, taking all preprocessed point cloud samples as the training set, importing them in batches into the input embedding module and mapping them to a high-dimensional space;
step 3, importing the high-dimensional point cloud into the adaptive down-sampling module in the improved Transformer encoder: first obtaining relatively uniform points as initial sampling points with the farthest point sampling algorithm (FPS), then automatically learning an offset for each sampling point with the adaptive neighborhood sampling algorithm (AS) and updating the position information of the sampling points, thereby reducing the data volume while retaining the structural attributes of the original point cloud model;
step 4, importing the down-sampled result into the improved Transformer encoder module and enhancing the point cloud features through the point cloud's relative attention (RA) module so as to extract features effectively;
step 5, reconstructing the manifold of each point based on the output of the improved Transformer decoder, sampling the manifold structure corresponding to each point in proportion, and selecting points closer to the clean point cloud to reconstruct the three-dimensional surface;
step 6, iteratively training steps 3 to 5 on the imported data with the improved Transformer encoder-decoder framework until the loss value of the loss function becomes small and stabilizes, obtaining the denoised clean point cloud.
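The FPS step named in step 3 admits a short greedy sketch: repeatedly pick the point farthest from everything already selected. The function below is an illustrative NumPy implementation under that standard definition, not the patent's code.

```python
import numpy as np

def farthest_point_sampling(points, n_samples, seed_idx=0):
    """Greedy FPS: at each round, select the point whose distance to the
    nearest already-selected point is largest, giving relatively uniform
    coverage of the cloud."""
    selected = [seed_idx]
    # distance from every point to its nearest selected point so far
    dist = np.linalg.norm(points - points[seed_idx], axis=1)
    for _ in range(n_samples - 1):
        nxt = int(dist.argmax())           # farthest remaining point
        selected.append(nxt)
        dist = np.minimum(dist, np.linalg.norm(points - points[nxt], axis=1))
    return np.array(selected)

pts = np.array([[0.0, 0], [0.1, 0], [1, 0], [1, 1]])
print(farthest_point_sampling(pts, 3))  # [0 3 2]
```

Starting from point 0, FPS skips the nearby point 1 and reaches the far corners first, which is exactly the "relatively uniform points" behavior the claim relies on before the adaptive offsets are learned.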
2. The improved transform-based Qin warriors fragment denoising method according to claim 1, wherein the data enhancement of step 1 comprises rotating, translating and scaling the data.
3. The improved Transformer-based Qin Terracotta Warrior fragment denoising method according to claim 1, wherein the adaptive neighborhood sampling algorithm AS of step 3 comprises the following steps:
step 3.1, let P_s be a set of N_s points sampled from the N input points, with N_s < N; let x_i ∈ P_s be a sampling point and f_i ∈ F_s its feature; the neighbors of x_i are grouped by a k-NN query and the features are updated with a general self-attention mechanism;
step 3.2, the k neighbors x_{i,1},...,x_{i,k} of sampling point x_i, with corresponding features f_{i,1},...,f_{i,k}, are aggregated as:
f̃_i = A(R(x_i, x_{i,j}) · γ(f_{i,j}), ∀j ∈ [1,k]) (1)
where A aggregates the neighborhood features, R describes the high-level relation between the sampling point x_i and neighbor point x_{i,j}, and γ changes the feature dimension of each neighbor point; to reduce computation, γ(x_{i,j}) = W_γ f_{i,j}; the relation function R is expressed as:
R(x_i, x_{i,j}) = Softmax((W_θ f_i)ᵀ(W_φ f_{i,j})/√D′) (2)
where D′ is the output channel of the Conv layer;
step 3.3, for each sampling point x_i, the normalized weights W_p and W_f over the coordinates and feature channels of each point in the group are obtained with an MLP followed by Softmax, expressed as:
W_p = Softmax(MLP(x_{i,j})) (3)
W_f = Softmax(MLP(f_{i,j})) (4)
where j ∈ [1, k] in equations (3) and (4);
step 3.4, the adaptive update of sampling point x_i and its feature f_i, yielding x̂_i and f̂_i, is realized by weighted summation, i.e., the updated point information is expressed as:
x̂_i = Σ_{j=1}^{k} W_p · x_{i,j} (5)
f̂_i = Σ_{j=1}^{k} W_f · f_{i,j} (6)
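The weighted update of equations (3) to (6) can be sketched as follows; the single-layer linear maps `Wp` and `Wf` are illustrative stand-ins for the MLPs, and all weights are random, so this only demonstrates the mechanics of the softmax-weighted re-estimation.

```python
import numpy as np

rng = np.random.default_rng(1)

def softmax(x, axis=0):
    e = np.exp(x - x.max(axis=axis, keepdims=True))
    return e / e.sum(axis=axis, keepdims=True)

def adaptive_update(x_nbrs, f_nbrs, Wp, Wf):
    """Eqs. (3)-(6): per-neighbor scores are normalized with softmax over
    the k neighbors, then the sampling point and its feature are
    re-estimated as weighted sums of the neighborhood."""
    w_p = softmax(x_nbrs @ Wp, axis=0)   # Eq. (3): weights over neighbors
    w_f = softmax(f_nbrs @ Wf, axis=0)   # Eq. (4)
    x_new = (w_p * x_nbrs).sum(axis=0)   # Eq. (5): updated coordinates
    f_new = (w_f * f_nbrs).sum(axis=0)   # Eq. (6): updated feature
    return x_new, f_new

k, D = 4, 8                              # k neighbors, feature dimension D
x_nbrs = rng.normal(size=(k, 3))         # k-NN coordinates of one sampling point
f_nbrs = rng.normal(size=(k, D))         # and their features
x_new, f_new = adaptive_update(x_nbrs, f_nbrs,
                               rng.normal(size=(3, 3)), rng.normal(size=(D, D)))
print(x_new.shape, f_new.shape)          # (3,) (8,)
```

Because the softmax weights sum to one over the neighbors, the updated point is a convex combination of its neighborhood per channel, which is how the sampling point drifts toward the underlying surface.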
4. The improved Transformer-based Qin Terracotta Warrior fragment denoising method according to claim 1, wherein the relative attention RA module of the point cloud in step 4 computes the relative attention feature between the self-attention SA module feature and the input feature, expressed as:
F_ra = F_in − F_sa (7)
where F_ra is the relative attention feature, F_in the input feature, and F_sa the self-attention SA module feature;
finally, the relative attention feature F_ra and the network input feature F_in give the final output feature F_out of the point cloud's relative attention RA module, expressed as:
F_out = RA(F_in) = ReLU(BN(MLP(F_ra))) + F_in. (8)
5. The improved Transformer-based Terracotta Warrior fragment denoising method according to claim 4, wherein the self-attention SA module feature is expressed as:
F_sa = Softmax(QKᵀ/√d_k)V (9)
where (Q, K, V) = F_in · (W_q, W_k, W_v); W_q, W_k and W_v are shared learnable weight matrices; Q, K and V are the Query, Key and Value matrices generated by linear transformation of the input features; and d_k is the dimension of the query and key vectors.
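Claims 4 and 5 can be sketched in plain NumPy; the mlp/bn/relu stack of equation (8) is reduced here to a single linear map plus ReLU (batch normalization omitted), and all weight matrices are random illustrative stand-ins.

```python
import numpy as np

rng = np.random.default_rng(2)

def softmax(x):
    e = np.exp(x - x.max(axis=-1, keepdims=True))
    return e / e.sum(axis=-1, keepdims=True)

def self_attention(F_in, Wq, Wk, Wv):
    """Eq. (9): scaled dot-product self-attention over the N points."""
    Q, K, V = F_in @ Wq, F_in @ Wk, F_in @ Wv
    d_k = Q.shape[-1]
    return softmax(Q @ K.T / np.sqrt(d_k)) @ V

def relative_attention(F_in, Wq, Wk, Wv, W_out):
    """Eqs. (7)-(8): F_ra = F_in - F_sa, then a linear map + ReLU
    (standing in for mlp/bn) with a residual connection back to F_in."""
    F_ra = F_in - self_attention(F_in, Wq, Wk, Wv)   # Eq. (7)
    return np.maximum(F_ra @ W_out, 0.0) + F_in      # Eq. (8)

N, D = 6, 16
F_in = rng.normal(size=(N, D))
Ws = [rng.normal(size=(D, D)) * 0.1 for _ in range(4)]
F_out = relative_attention(F_in, *Ws)
print(F_out.shape)  # (6, 16)
```

Subtracting the smoothed (attention-averaged) feature from the input, as in equation (7), is what gives the module its Laplacian-like, detail-emphasizing behavior that claim 6 elaborates.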
6. The improved Transformer-based Qin Terracotta Warrior fragment denoising method according to claim 4, wherein the relative attention feature of equation (7) is a discrete Laplacian; in a graph G with N nodes, an N-dimensional vector f is expressed as:
f = (f_1, f_2, ..., f_N) (10)
where f_i is the value of the function f at node i;
perturbing node i toward any adjacent node j, the gain caused by the change from node j to node i is expressed as:
Δf_{ij} = f_i − f_j (11)
when edge E_ij carries a weight w_ij:
Δf_{ij} = w_ij(f_i − f_j) (12)
when w_ij = 0, nodes i and j are not adjacent, so the corresponding term vanishes and the gain at node i can be summed over all nodes:
Δf_i = Σ_{j∈N} w_ij(f_i − f_j) (13)
finally:
Δf_i = Σ_{j∈N} w_ij f_i − Σ_{j∈N} w_ij f_j = d_i f_i − w_i: f (14)
where d_i = Σ_{j∈N} w_ij is the degree of vertex i; w_i: = (w_i1, ..., w_iN) is an N-dimensional row vector; f = (f_1, ..., f_N)ᵀ is an N-dimensional column vector; and w_i: f denotes the inner product of the two vectors; accumulating the gain over all N nodes gives:
Δf = (Δf_1, ..., Δf_N)ᵀ = diag(d_1, ..., d_N)f − Wf = (D − W)f = Lf (15)
where w_i: gives the weights associated with the ith point, W collects the weights of all points, and D − W is the Laplacian matrix L.
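The derivation in claim 6 can be checked numerically: accumulating the per-node gain of equation (14) over all nodes reproduces (D − W)f = Lf of equation (15). A small NumPy check on a 3-node example graph:

```python
import numpy as np

# Symmetric adjacency weights w_ij of a 3-node graph and a node function f.
W = np.array([[0.0, 1, 2],
              [1, 0, 0],
              [2, 0, 0]])
f = np.array([3.0, -1.0, 2.0])

# Eq. (14): gain at each node i is sum_j w_ij * (f_i - f_j).
gains = np.array([sum(W[i, j] * (f[i] - f[j]) for j in range(3))
                  for i in range(3)])

D = np.diag(W.sum(axis=1))   # degree matrix, d_i = sum_j w_ij
L = D - W                    # Laplacian matrix of Eq. (15)
print(gains, L @ f)          # both equal [ 6. -4. -2.]
```

The two vectors coincide, confirming that the elementwise "input minus neighborhood-weighted average" operation of equation (7) acts on the point features like a graph Laplacian.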
7. The improved Transformer-based Qin Terracotta Warrior fragment denoising method according to claim 1, wherein the decoder in step 5 transforms the embedded features of each sampling point and its neighborhood into a local surface centered on that point to infer the potential manifold, and then resamples the inferred manifold to generate a set of denoised points, i.e., to reconstruct a clean point cloud P̂.
8. The improved Transformer-based Terracotta Warrior fragment denoising method according to claim 7, wherein the step 5 comprises the following steps:
step 5.1, first define the 2D manifold M embedded in 3D space, parameterized by the feature vector y, as:
M(u,v;y): [-1,1]×[-1,1] → R³ (16)
where (u,v) is a point in the 2D rectangular region [-1,1]²;
equation (16) maps the 2D rectangle, via the function approximator MLP, to a patch manifold of arbitrary shape parameterized by y, expressed as:
M_i(u,v;y_i) = MLP_M([u,v,y_i]) (17)
where M_i(u,v;y_i) represents the parameterized patch manifold;
step 5.2, the patch manifold M_i corresponding to a point p_i in the adaptively downsampled set P̃ is defined as:
M_i(u,v;y_i) = p_i + M(u,v;y_i) (18)
equation (18) moves the constructed manifold M(u,v;y_i) to a local surface centered at p_i; the patch manifolds corresponding to all points in the set, {M_i | i = 1,...,M}, characterize the underlying surface of the point cloud;
step 5.3, steps 5.1 and 5.2 halve the number of input points during adaptive downsampling, i.e., M = N/2; each patch manifold M_i(u,v;y_i) is resampled, taking two points on each patch manifold, to obtain the denoised point cloud P̂, expressed as:
P̂ = ∪_{i=1}^{M} {M_i(u_j,v_j;y_i) | j = 1,2} (19)
9. The improved Transformer-based Terracotta Warrior fragment denoising method according to claim 1, wherein the loss function Loss in step 6 comprises a loss function L_as and a loss function L_us;
the loss function L_as quantifies the distance between the adaptively downsampled set P̃ and the ground truth point cloud P_gt; since P̃ and P_gt contain different numbers of points (|P̃| = N/2 while |P_gt| = N), the chamfer distance CD is chosen as L_as, expressed as:
L_as = CD(P̃, P_gt) = (1/|P̃|) Σ_{x∈P̃} min_{y∈P_gt} ‖x − y‖₂ + (1/|P_gt|) Σ_{y∈P_gt} min_{x∈P̃} ‖x − y‖₂ (20)
the loss function L_us quantifies the distance between the final reconstructed point cloud P̂ and the ground truth P_gt, using the Earth Mover's Distance (EMD) as L_us, expressed as:
L_us = EMD(P̂, P_gt) = min_{φ: P̂→P_gt} (1/|P̂|) Σ_{x∈P̂} ‖x − φ(x)‖₂ (21)
where φ: P̂ → P_gt is a bijection;
finally, the network is trained end-to-end with supervision, and the minimized total loss function is expressed as:
L_denoise = λL_as + (1 − λ)L_us (22)
where λ is empirically set to 0.01.
CN202211133859.0A 2022-09-19 2022-09-19 Improved transform-based Qinhong tomb warrior fragment denoising method Pending CN115456900A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202211133859.0A CN115456900A (en) 2022-09-19 2022-09-19 Improved transform-based Qinhong tomb warrior fragment denoising method


Publications (1)

Publication Number Publication Date
CN115456900A true CN115456900A (en) 2022-12-09

Family

ID=84304288

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202211133859.0A Pending CN115456900A (en) 2022-09-19 2022-09-19 Improved transform-based Qinhong tomb warrior fragment denoising method

Country Status (1)

Country Link
CN (1) CN115456900A (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN116012266A (en) * 2023-03-29 2023-04-25 中国科学技术大学 Image denoising method, system, equipment and storage medium



Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination