CN115421158B - Self-supervision learning solid-state laser radar three-dimensional semantic mapping method and device - Google Patents

Self-supervision learning solid-state laser radar three-dimensional semantic mapping method and device Download PDF

Info

Publication number
CN115421158B
CN115421158B CN202211387608.5A CN202211387608A CN115421158B CN 115421158 B CN115421158 B CN 115421158B CN 202211387608 A CN202211387608 A CN 202211387608A CN 115421158 B CN115421158 B CN 115421158B
Authority
CN
China
Prior art keywords
point cloud
dimensional
semantic
state
map
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN202211387608.5A
Other languages
Chinese (zh)
Other versions
CN115421158A (en)
Inventor
陈昶昊
李方言
王雄飞
何晓峰
褚超群
潘献飞
毛军
张礼廉
范晨
胡小平
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
National University of Defense Technology
Original Assignee
National University of Defense Technology
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by National University of Defense Technology filed Critical National University of Defense Technology
Priority to CN202211387608.5A priority Critical patent/CN115421158B/en
Publication of CN115421158A publication Critical patent/CN115421158A/en
Application granted granted Critical
Publication of CN115421158B publication Critical patent/CN115421158B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Classifications

    • GPHYSICS
    • G01MEASURING; TESTING
    • G01SRADIO DIRECTION-FINDING; RADIO NAVIGATION; DETERMINING DISTANCE OR VELOCITY BY USE OF RADIO WAVES; LOCATING OR PRESENCE-DETECTING BY USE OF THE REFLECTION OR RERADIATION OF RADIO WAVES; ANALOGOUS ARRANGEMENTS USING OTHER WAVES
    • G01S17/00Systems using the reflection or reradiation of electromagnetic waves other than radio waves, e.g. lidar systems
    • G01S17/88Lidar systems specially adapted for specific applications
    • G01S17/89Lidar systems specially adapted for specific applications for mapping or imaging
    • GPHYSICS
    • G01MEASURING; TESTING
    • G01CMEASURING DISTANCES, LEVELS OR BEARINGS; SURVEYING; NAVIGATION; GYROSCOPIC INSTRUMENTS; PHOTOGRAMMETRY OR VIDEOGRAMMETRY
    • G01C21/00Navigation; Navigational instruments not provided for in groups G01C1/00 - G01C19/00
    • G01C21/38Electronic maps specially adapted for navigation; Updating thereof
    • G01C21/3804Creation or updating of map data
    • G01C21/3833Creation or updating of map data characterised by the source of data
    • G01C21/3841Data obtained from two or more sources, e.g. probe vehicles
    • GPHYSICS
    • G01MEASURING; TESTING
    • G01SRADIO DIRECTION-FINDING; RADIO NAVIGATION; DETERMINING DISTANCE OR VELOCITY BY USE OF RADIO WAVES; LOCATING OR PRESENCE-DETECTING BY USE OF THE REFLECTION OR RERADIATION OF RADIO WAVES; ANALOGOUS ARRANGEMENTS USING OTHER WAVES
    • G01S17/00Systems using the reflection or reradiation of electromagnetic waves other than radio waves, e.g. lidar systems
    • G01S17/86Combinations of lidar systems with systems other than lidar, radar or sonar, e.g. with direction finders
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T7/00Image analysis
    • G06T7/10Segmentation; Edge detection
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T7/00Image analysis
    • G06T7/50Depth or shape recovery
    • G06T7/521Depth or shape recovery from laser ranging, e.g. using interferometry; from the projection of structured light
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T7/00Image analysis
    • G06T7/70Determining position or orientation of objects or cameras
    • G06T7/73Determining position or orientation of objects or cameras using feature-based methods
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00Indexing scheme for image analysis or image enhancement
    • G06T2207/10Image acquisition modality
    • G06T2207/10028Range image; Depth image; 3D point clouds
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00Indexing scheme for image analysis or image enhancement
    • G06T2207/20Special algorithmic details
    • G06T2207/20081Training; Learning
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00Indexing scheme for image analysis or image enhancement
    • G06T2207/20Special algorithmic details
    • G06T2207/20084Artificial neural networks [ANN]
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02ATECHNOLOGIES FOR ADAPTATION TO CLIMATE CHANGE
    • Y02A90/00Technologies having an indirect contribution to adaptation to climate change
    • Y02A90/10Information and communication technologies [ICT] supporting adaptation to climate change, e.g. for weather forecasting or climate simulation

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • Radar, Positioning & Navigation (AREA)
  • Remote Sensing (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Evolutionary Computation (AREA)
  • Computational Linguistics (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Electromagnetism (AREA)
  • Health & Medical Sciences (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Artificial Intelligence (AREA)
  • Biomedical Technology (AREA)
  • Biophysics (AREA)
  • Software Systems (AREA)
  • Data Mining & Analysis (AREA)
  • Mathematical Physics (AREA)
  • General Health & Medical Sciences (AREA)
  • Molecular Biology (AREA)
  • Computing Systems (AREA)
  • General Engineering & Computer Science (AREA)
  • Automation & Control Theory (AREA)
  • Optics & Photonics (AREA)
  • Optical Radar Systems And Details Thereof (AREA)

Abstract

The invention discloses a self-supervised learning solid-state lidar three-dimensional semantic mapping method and device. A lightweight lidar/inertial combined mapping algorithm based on Kalman filtering performs three-dimensional scene reconstruction, and the three-dimensional point cloud map is semantically segmented by a point cloud semantic segmentation model, so that good semantic segmentation of the three-dimensional coordinate point cloud is achieved and both the point cloud understanding capability and the semantic segmentation accuracy are improved.

Description

Self-supervision learning solid-state laser radar three-dimensional semantic mapping method and device
Technical Field
The invention relates to the technical field of three-dimensional point cloud semantic segmentation, in particular to a solid-state laser radar three-dimensional semantic mapping method and device for self-supervised learning.
Background
Semantic segmentation is a typical computer vision problem that takes raw data (e.g., planar images) as input and converts it into segmented regions with semantic meaning, forming a semantic map that can be understood by both robots and humans. The emergence of lightweight solid-state lidar has reduced the cost, weight, and size of lidar; unlike rotating lidar, small solid-state lidar has a narrow field of view and an irregular scanning pattern. The invention relates to a dense three-dimensional point cloud semantic mapping method and device for light, small solid-state lidar. The method and apparatus may be used on mobile robots as well as handheld devices.
With the rapid popularization of robots, applications such as logistics delivery, household robots, and medical robots require that robots navigate and build maps autonomously in a variety of complex indoor and outdoor environments. Executing robot tasks requires the capabilities of autonomous navigation, map building, and scene understanding. Lidar can directly acquire information such as the three-dimensional geometric position and reflection intensity of a target, with a measuring range of hundreds of meters and high measurement accuracy. However, obtaining only a three-dimensional point cloud map is not enough for autonomous indoor robot navigation. A human can distinguish roads, walls, and other objects in a point cloud map, but to a robot the point cloud map is merely a set of points with unknown meaning. For a robot to effectively identify objects and regions with semantic meaning in the point cloud, semantic segmentation and mapping are required. In recent years, the rapid growth of computing power has allowed deep-learning-based three-dimensional point cloud semantic segmentation algorithms to develop quickly and improve in performance, making this area a research hotspot.
Three-dimensional semantic mapping can be divided into two steps: semantic segmentation (understanding) of the three-dimensional point cloud and dense three-dimensional mapping. Recent research results have focused on three-dimensional point cloud mapping and on point cloud semantic segmentation. On the mapping side, Zhang et al. proposed a lidar odometry and mapping algorithm based on batch optimization that achieves real-time three-dimensional point cloud mapping. Lin et al. further proposed a localization and mapping algorithm suited to small solid-state lidar, realizing feature extraction under a limited field of view as well as motion compensation and point cloud matching under irregular sampling. Xu et al. fused lidar point cloud features with inertial and visual data using extended Kalman filtering, achieving more efficient and robust localization and mapping. On the point cloud segmentation side, Qi et al. project the point cloud into a strip-shaped image, perform semantic segmentation on the image, and map the result back to the point cloud to realize point cloud mapping and segmentation; however, that algorithm is only suitable for rotating lidar. At present, there is no dense three-dimensional semantic mapping method for small solid-state lidar.
Disclosure of Invention
Aiming at the defects in the prior art, the invention provides a method and a device for building a three-dimensional semantic map of a solid-state laser radar for self-supervision learning, which can be applied to mobile robots and handheld equipment. The method constructs a coding-decoding deep neural network, semantically segments real-time input three-dimensional point cloud based on a self-supervision learning mode, and adopts Kalman filtering to fuse laser radar point cloud data, semantic segmentation results and inertial data to construct a map so as to realize scene three-dimensional reconstruction. In addition, the reflectivity of the laser radar signal is introduced into a point cloud segmentation network of self-supervision learning, and therefore the point cloud segmentation understanding capability is effectively improved.
In order to achieve the purpose, the invention provides a solid-state laser radar three-dimensional semantic mapping method for self-supervision learning, which comprises the following steps:
step 1, constructing a point cloud semantic segmentation model of a coding-decoding structure, and carrying out self-supervision training on the point cloud semantic segmentation model based on a synchronously acquired RGB image and a three-dimensional point cloud data set;
step 2, collecting real-time three-dimensional point cloud and real-time inertia data through a small solid-state laser radar and an inertia measurement element, fusing the real-time three-dimensional point cloud and the real-time inertia data based on a Kalman filter, and outputting a dense three-dimensional point cloud map composed of the real-time point cloud data under global coordinates;
step 3, inputting the real-time three-dimensional point cloud into the trained point cloud semantic segmentation model to obtain a point cloud segmentation result corresponding to each point cloud coordinate at the current moment, and corresponding the point cloud segmentation result corresponding to each point cloud coordinate with the dense three-dimensional point cloud map coordinate to generate a three-dimensional semantic map;
and step 4, deploying the trained point cloud semantic segmentation model on computing equipment with an ARM + GPU architecture, constructing a semantic mapping system together with the small solid-state laser radar and the inertial measurement element, acquiring three-dimensional point clouds and inertial data and updating the map at a fixed frequency, performing real-time semantic segmentation, and generating a three-dimensional semantic map. Because the point cloud segmentation network has already been trained, synchronized acquisition of RGB images is no longer required.
In order to achieve the above object, the present invention further provides a self-supervised learning solid-state lidar three-dimensional semantic mapping apparatus, comprising:
the system comprises an RGB camera, a laser radar and an inertia measuring device, wherein the RGB camera, the laser radar and the inertia measuring device are respectively used for collecting RGB images, three-dimensional point cloud data and inertia data;
the data acquisition module is connected with the RGB camera, the laser radar and the inertia measurement device and is used for acquiring RGB images, three-dimensional point cloud data and inertia data;
the point cloud semantic segmentation model is connected with the data acquisition module and is used for performing point cloud semantic segmentation on the three-dimensional point cloud to obtain a point cloud segmentation result corresponding to each point cloud coordinate;
the self-supervision training module is connected with the data acquisition module and the point cloud semantic segmentation model and is used for carrying out self-supervision training on the point cloud semantic segmentation model according to the synchronously acquired RGB image and the three-dimensional point cloud data set;
the three-dimensional point cloud map building module is connected with the data acquisition module and used for fusing the real-time three-dimensional point cloud and the real-time inertia data and outputting a dense three-dimensional point cloud map consisting of the real-time point cloud data under the global coordinate;
and the three-dimensional semantic map building module, which is connected with the point cloud semantic segmentation model and the three-dimensional point cloud map building module and is used for generating a three-dimensional semantic map by associating the point cloud segmentation result corresponding to each point cloud coordinate with the dense three-dimensional point cloud map coordinates.
The invention provides a method and a device for building a three-dimensional semantic map of a solid-state laser radar for self-supervised learning. In addition, the reflectivity of the laser radar signal is introduced into the point cloud segmentation network, so that the point cloud segmentation understanding capability is improved.
Drawings
In order to more clearly illustrate the embodiments of the present invention or the technical solutions in the prior art, the drawings used in the description of the embodiments or the prior art will be briefly described below, it is obvious that the drawings in the following description are only some embodiments of the present invention, and for those skilled in the art, other drawings can be obtained according to the structures shown in the drawings without creative efforts.
FIG. 1 is a flow chart of a solid-state lidar three-dimensional semantic mapping method in an embodiment of the invention;
fig. 2 is a block diagram of a three-dimensional semantic map building apparatus for solid-state lidar in an embodiment of the invention.
The implementation, functional features and advantages of the objects of the present invention will be further explained with reference to the accompanying drawings.
Detailed Description
The technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the drawings in the embodiments of the present invention, and it is obvious that the described embodiments are only a part of the embodiments of the present invention, and not all of the embodiments. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present invention.
In addition, descriptions such as "first", "second", etc. in the present invention are used for descriptive purposes only and are not to be construed as indicating or implying relative importance or implicitly indicating the number of technical features indicated. Thus, a feature defined as "first" or "second" may explicitly or implicitly include at least one such feature. In the description of the present invention, "a plurality" means at least two, e.g., two, three, etc., unless specifically limited otherwise.
In addition, the technical solutions in the embodiments of the present invention may be combined with each other, but it must be based on the realization of those skilled in the art, and when the technical solutions are contradictory or cannot be realized, such a combination of technical solutions should not be considered to exist, and is not within the protection scope of the present invention.
Example 1
Fig. 1 shows a three-dimensional semantic mapping method for a solid-state lidar for self-supervised learning disclosed in this embodiment, which specifically includes the following steps:
step 1, constructing a point cloud semantic segmentation model of a coding-decoding structure, and carrying out self-supervision training on the point cloud semantic segmentation model based on a synchronously acquired RGB image and a three-dimensional point cloud data set;
step 2, collecting real-time three-dimensional point cloud and real-time Inertial data through a small solid-state laser radar and an Inertial Measurement Unit (IMU), fusing the real-time three-dimensional point cloud and the real-time Inertial data based on a Kalman filter, and outputting a dense three-dimensional point cloud map composed of the real-time point cloud data under the global coordinate;
step 3, inputting the real-time three-dimensional point cloud into the trained point cloud semantic segmentation model to obtain a point cloud segmentation result corresponding to each point cloud coordinate at the current moment, and corresponding the point cloud segmentation result corresponding to each point cloud coordinate with the dense three-dimensional point cloud map coordinate to generate a three-dimensional semantic map;
and step 4, deploying the trained point cloud semantic segmentation model on computing equipment with an ARM + GPU architecture, constructing a semantic mapping system together with the small solid-state laser radar and the inertial measurement element, acquiring three-dimensional point clouds and inertial data and updating the map at a frequency of 50 Hz, performing real-time semantic segmentation, and generating a three-dimensional semantic map.
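The overall real-time pipeline of steps 2 to 4 can be illustrated with the following minimal sketch. It only shows the data flow at the fixed acquisition frequency; the object interfaces (lidar, imu, odometry, seg_model, semantic_map) and their method names are hypothetical placeholders, not the actual implementation of the system:

    import time

    def run_semantic_mapping(lidar, imu, odometry, seg_model, semantic_map, rate_hz=50.0):
        # Minimal sketch of the fixed-frequency mapping loop; all interfaces are hypothetical.
        period = 1.0 / rate_hz
        while True:
            t_start = time.time()
            scan = lidar.acquire_scan()            # N x 4 array: x, y, z, reflectivity
            imu_samples = imu.read_since_last()    # inertial samples since the previous scan
            # Kalman-filter fusion (step 2): pose estimate and dense global point cloud
            global_points = odometry.update(scan, imu_samples)
            # Trained segmentation network (step 3): one semantic label per input point
            labels = seg_model.predict(scan)
            # Attach the labels to the corresponding global map coordinates
            semantic_map.insert(global_points, labels)
            time.sleep(max(0.0, period - (time.time() - t_start)))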
In this embodiment, the point cloud semantic segmentation model includes an encoding layer, a decoding layer, and a semantic prediction layer. The input of the point cloud semantic segmentation model is a point set $P \in \mathbb{R}^{N \times d}$, where N is the number of points and d is the feature dimension of each point; in this example d = 4, comprising the three-dimensional coordinates (x, y, z) and the reflectivity r.
The encoding layer adopts a 3-layer encoder and is mainly used to extract features from the input three-dimensional point cloud. It processes the point cloud data stage by stage, reducing the number of points while increasing the dimension of each point: the point count N is reduced to a small subset and the point feature dimension is raised from 4 to 128. The encoder consists of a local feature aggregator and a random sampling layer, specifically comprising the following parts:
the local feature aggregator consists of a local space encoder and an attention mechanism and aims to expand the receptive field of each point so as to extract more effective features;
the random sampling layer is used for accelerating the extraction of point features and improving the operation efficiency.
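As an illustration of the encoder structure described above (a local spatial encoder with attention-based aggregation followed by random downsampling), the following sketch shows one possible encoder stage. The neighbourhood size, layer widths, downsampling ratio, and class/function names are assumptions made for the example, not the exact patented architecture:

    import torch
    import torch.nn as nn

    class EncoderStage(nn.Module):
        # One encoder stage: local feature aggregation with attention pooling,
        # followed by random sampling. A sketch under assumed shapes.
        def __init__(self, d_in, d_out, k=16, keep_ratio=0.25):
            super().__init__()
            self.k = k
            self.keep_ratio = keep_ratio
            self.mlp = nn.Sequential(nn.Linear(d_in + 3, d_out), nn.ReLU())
            self.attn = nn.Linear(d_out, d_out)  # attention scores over the k neighbours

        def forward(self, xyz, feats):
            # xyz: (N, 3) point coordinates; feats: (N, d_in) per-point features
            dist = torch.cdist(xyz, xyz)                          # (N, N) pairwise distances
            idx = dist.topk(self.k, largest=False).indices        # (N, k) nearest neighbours
            rel_xyz = xyz[idx] - xyz.unsqueeze(1)                 # relative coordinates enlarge the receptive field
            local = torch.cat([feats[idx], rel_xyz], dim=-1)      # (N, k, d_in + 3)
            local = self.mlp(local)                               # (N, k, d_out)
            weights = torch.softmax(self.attn(local), dim=1)      # attention over the neighbours
            agg = (weights * local).sum(dim=1)                    # (N, d_out) aggregated feature
            # Random sampling layer: keep a random subset of points to speed up later stages
            keep = torch.randperm(xyz.shape[0])[: max(1, int(xyz.shape[0] * self.keep_ratio))]
            return xyz[keep], agg[keep]

Stacking three such stages while widening d_out would reproduce the described behaviour of reducing the point count while raising the feature dimension from 4 to 128.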
The decoding layer uses a 3-layer decoder. For each input point, the K nearest neighbouring points are found with a K-nearest-neighbour algorithm, and the point cloud feature set is up-sampled by nearest-neighbour interpolation. Finally, the up-sampled features are concatenated with the features obtained by the encoder to give the feature vector output by the decoding layer.
The semantic prediction layer maps the feature vector obtained from the decoding layer, through a fully connected layer and the rectified linear unit (ReLU) activation, to a score matrix $S \in \mathbb{R}^{N \times c}$, where c is the number of semantic categories and N is the number of points in the three-dimensional point cloud.
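The decoding layer and semantic prediction layer described above can be sketched as follows: nearest-neighbour interpolation copies each dense point's feature from its closest point in the downsampled set, the result is concatenated with the matching encoder feature (a skip connection), and fully connected layers with ReLU produce the N x c score matrix. Shapes and names are illustrative assumptions, not the patented network:

    import torch
    import torch.nn as nn

    def nn_interpolate(sparse_xyz, sparse_feats, dense_xyz):
        # Nearest-neighbour interpolation: each dense point takes the feature of its
        # closest point in the sparse (downsampled) set.
        idx = torch.cdist(dense_xyz, sparse_xyz).argmin(dim=1)   # (N_dense,)
        return sparse_feats[idx]                                 # (N_dense, d)

    class DecoderStage(nn.Module):
        # Upsample, concatenate with the skip connection from the encoder, and fuse.
        def __init__(self, d_sparse, d_skip, d_out):
            super().__init__()
            self.fuse = nn.Sequential(nn.Linear(d_sparse + d_skip, d_out), nn.ReLU())

        def forward(self, sparse_xyz, sparse_feats, dense_xyz, skip_feats):
            up = nn_interpolate(sparse_xyz, sparse_feats, dense_xyz)
            return self.fuse(torch.cat([up, skip_feats], dim=-1))

    class PredictionHead(nn.Module):
        # Fully connected layers with ReLU mapping decoded features to per-point class scores.
        def __init__(self, d_feat, n_classes):
            super().__init__()
            self.fc = nn.Sequential(nn.Linear(d_feat, d_feat), nn.ReLU(),
                                    nn.Linear(d_feat, n_classes))

        def forward(self, feats):     # feats: (N, d_feat) decoder output
            return self.fc(feats)     # (N, n_classes) semantic score matrix S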
In the specific implementation process, the self-supervision training of the point cloud semantic segmentation model specifically comprises the following steps:
step 1.1, establishing a one-to-one correspondence between the three-dimensional point cloud coordinates and the RGB image pixels through their geometric relationship, obtaining the semantic segmentation result of each pixel in the RGB image with a trained RGB image segmentation model, and obtaining a point cloud semantic label through the correspondence between the three-dimensional point cloud coordinates and the RGB image pixels; in this embodiment the label obtained through the RGB image correspondence is defined as the self-supervised semantic label;
step 1.2, comparing the point features output by the point cloud segmentation network with the self-supervised semantic labels obtained through the RGB image correspondence, and iteratively adjusting the parameters of each level of the point cloud semantic segmentation model so that the semantic results output by the model approach the self-supervised semantic labels; this process of iteratively adjusting the parameters is the self-supervised learning training;
and step 1.3, after a certain number of iterations, retaining the model whose output scores are closest to the self-supervised semantic labels, thereby forming the point cloud semantic segmentation model.
It should be noted that the RGB image is only used for self-supervised training of the point cloud semantic segmentation model, and the trained point cloud semantic segmentation model can perform point cloud semantic segmentation only by inputting point cloud data, so as to obtain a point cloud segmentation result corresponding to each point cloud coordinate at the current time.
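The label-generation step 1.1 can be sketched as follows: each lidar point is projected into the synchronized RGB image using camera intrinsics and a lidar-to-camera transform (both assumed to be available from calibration), and the semantic label predicted for the pixel it lands on becomes the point's self-supervised label. The names K and T_cam_lidar are placeholders for that calibration data:

    import numpy as np

    def self_supervised_labels(points_xyz, image_labels, K, T_cam_lidar):
        # points_xyz: (N, 3) lidar points; image_labels: (H, W) per-pixel labels from a
        # trained RGB segmentation model; K: (3, 3) camera intrinsics;
        # T_cam_lidar: (4, 4) lidar-to-camera transform. Points that do not project
        # into the image get label -1 and can be ignored during training.
        H, W = image_labels.shape
        pts_h = np.hstack([points_xyz, np.ones((len(points_xyz), 1))])
        cam = (T_cam_lidar @ pts_h.T).T[:, :3]               # points in the camera frame
        in_front = cam[:, 2] > 1e-6                          # keep points in front of the camera
        uv = (K @ cam.T).T
        uv = uv[:, :2] / np.clip(uv[:, 2:3], 1e-6, None)     # perspective division -> pixel coordinates
        u = np.round(uv[:, 0]).astype(int)
        v = np.round(uv[:, 1]).astype(int)
        valid = in_front & (u >= 0) & (u < W) & (v >= 0) & (v < H)
        labels = np.full(len(points_xyz), -1, dtype=int)
        labels[valid] = image_labels[v[valid], u[valid]]
        return labels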
In this embodiment, the map building process of the dense three-dimensional point cloud map specifically includes:
step 2.1, performing inertial integration on the real-time inertial data to obtain a first initial state quantity (including position, attitude, speed and IMU error quantity) of the system, wherein the implementation process comprises the following steps:
setting the IMU measurement noise $w_{i}$ to 0, the inertial integration is:

$$\hat{x}_{i+1} = \hat{x}_{i} \boxplus \left( \Delta t \, f(\hat{x}_{i}, u_{i}, 0) \right)$$

where $f(\hat{x}_{i}, u_{i}, 0)$ denotes the inertial integration function relating $\hat{x}_{i}$, $u_{i}$ and the noise (here set to 0), $\boxplus$ denotes the exponential-map operator, and their combination represents the inertial-integration state update process; $\Delta t$ is the difference between two adjacent IMU sampling instants $\tau_{i+1}$ and $\tau_{i}$ within one radar scan (the IMU sampling interval); i is the index of the IMU measurement sample; $u_{i}$ is the measured inertial data of the i-th IMU sample; $\hat{x}$ denotes the system state, where $\hat{x}_{i}$ denotes the first state quantity of the system at the i-th sampling instant and $\hat{x}_{i+1}$ denotes the first state quantity of the system at the (i+1)-th sampling instant;
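The structure of the inertial integration above can be illustrated with the simplified propagation step below, where the state is reduced to position, velocity, and attitude and the bias/error terms of the full state are omitted; it is a sketch of zero-noise forward integration, not the patented filter:

    import numpy as np

    def so3_exp(phi):
        # Rodrigues formula: rotation vector -> rotation matrix (exponential map on SO(3)).
        theta = np.linalg.norm(phi)
        if theta < 1e-12:
            return np.eye(3)
        a = phi / theta
        A = np.array([[0.0, -a[2], a[1]],
                      [a[2], 0.0, -a[0]],
                      [-a[1], a[0], 0.0]])
        return np.eye(3) + np.sin(theta) * A + (1.0 - np.cos(theta)) * (A @ A)

    def imu_propagate(state, gyro, accel, dt, gravity=np.array([0.0, 0.0, -9.81])):
        # One zero-noise integration step x_{i+1} = x_i [+] dt * f(x_i, u_i, 0).
        # state: dict with 'p' (3,), 'v' (3,), 'R' (3, 3); gyro/accel: the i-th IMU sample.
        a_world = state['R'] @ accel + gravity          # specific force rotated to the world frame
        p = state['p'] + state['v'] * dt + 0.5 * a_world * dt ** 2
        v = state['v'] + a_world * dt
        R = state['R'] @ so3_exp(gyro * dt)             # attitude update on the rotation manifold
        return {'p': p, 'v': v, 'R': R}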
step 2.2, based on the real-time three-dimensional point cloud, performing state updating on the first state quantity by adopting Kalman filtering to obtain a second state quantity of the system, wherein the implementation process comprises the following steps:
firstly, the point cloud residual z of the lidar three-dimensional point cloud is calculated as:

$$z_{j} = u_{j}^{T} \left( \hat{p}_{j}^{G} - q_{j} \right)$$

where $q_{j}$ is the position of the nearest feature point on the map, $u_{j}$ is the normal vector of the plane (or the direction of the edge) on which the point lies, and $\hat{p}_{j}^{G}$ is the point cloud position estimated from the IMU;

an iterated Kalman filter is then used to update the state estimate until the point cloud residual z converges, the iterative update of the state estimate being:

$$\hat{x}_{k}^{\kappa+1} = \hat{x}_{k}^{\kappa} \boxplus \left( -K z_{k}^{\kappa} - (I - K H)\, J^{-1} \left( \hat{x}_{k}^{\kappa} \boxminus \hat{x}_{k} \right) \right)$$

where $\hat{x}_{k}^{\kappa+1}$ denotes the estimate of the state after the (κ+1)-th Kalman filtering iteration at time $t_{k}$ ($t_{k}$ denotes the scan end time of the k-th lidar scan), $\hat{x}_{k}^{\kappa}$ denotes the estimate after the κ-th Kalman filtering iteration at time $t_{k}$, I denotes the identity matrix, H denotes the observation matrix, J denotes the partial derivative of the error state with respect to the state estimate, $\boxplus$ is the exponential-map operator and $\boxminus$ the corresponding logarithmic-map operator, $x_{k}$ denotes the true state at time $t_{k}$, $x_{k} \boxminus \hat{x}_{k}$ is the error state between the true state and the propagated value $\hat{x}_{k}$, and $\hat{x}_{k}^{\kappa} \boxminus \hat{x}_{k}$ denotes the error between the propagated value $\hat{x}_{k}$ and the current estimate $\hat{x}_{k}^{\kappa}$;

K denotes the Kalman filter gain matrix, whose calculation process is:

$$\hat{P}_{i+1} = F_{\tilde{x}} \hat{P}_{i} F_{\tilde{x}}^{T} + F_{w} Q F_{w}^{T}$$

$$P = J^{-1} \hat{P}_{k} J^{-T}$$

$$K = \left( H^{T} R^{-1} H + P^{-1} \right)^{-1} H^{T} R^{-1}$$

where $\hat{P}$ denotes the covariance matrix, $\hat{P}_{i+1}$ denotes the IMU state covariance at the (i+1)-th sampling instant, $\hat{P}_{i}$ denotes the IMU state covariance at the i-th sampling instant, P denotes the intermediate variable generated from the covariance matrix $\hat{P}_{k}$ and the partial derivative matrix J, $F_{\tilde{x}}$ and $F_{w}$ respectively denote the partial derivative matrices of f with respect to the state and the noise w, Q denotes the covariance of the IMU noise w, R denotes the measurement noise covariance, and the superscript T denotes the matrix transpose;

in the above iterative update of the state estimate, after the point cloud residual z converges, the optimal state estimate, i.e. the second state quantity, can be obtained as follows:

$$\bar{x}_{k} = \hat{x}_{k}^{\kappa+1}$$

where $\bar{x}_{k}$ denotes the second state quantity of the system at the k-th sampling instant;
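The shape of the iterated update in step 2.2 can be illustrated with the generic iterated Kalman step below: the point-to-plane residuals are recomputed at every iteration, a gain is formed from the covariance and the measurement Jacobian, and iteration stops once the correction stops changing. The Euclidean state update, the convergence test, and the function names are simplifications made for illustration, not the exact manifold expressions above:

    import numpy as np

    def point_to_plane_residuals(pred_points, map_points, map_normals):
        # z_j = u_j^T (p_hat_j - q_j): signed distance of each predicted point to the
        # plane of its nearest map feature point; all inputs are (M, 3) arrays.
        return np.einsum('ij,ij->i', map_normals, pred_points - map_points)

    def iterated_kalman_update(x, P, residual_fn, jacobian_fn, R_meas,
                               max_iters=10, tol=1e-4):
        # Generic iterated Kalman measurement update (sketch; state treated as Euclidean).
        # residual_fn(x) -> (M,) residuals; jacobian_fn(x) -> (M, n) measurement Jacobian.
        x_prior = x.copy()
        for _ in range(max_iters):
            z = residual_fn(x)                                  # residuals at the current estimate
            H = jacobian_fn(x)
            S = H @ P @ H.T + R_meas                            # innovation covariance
            K = P @ H.T @ np.linalg.inv(S)                      # Kalman gain
            dx = K @ (-z) + (np.eye(len(x)) - K @ H) @ (x_prior - x)
            x = x + dx
            if np.linalg.norm(dx) < tol:                        # correction converged
                break
        P = (np.eye(len(x)) - K @ H) @ P                        # posterior covariance
        return x, P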
and step 2.3, performing a backward state update on the second state quantity to obtain the third state quantity of the system, so as to refine the position and attitude estimates and improve the positioning accuracy, the backward state update being:

$$\check{x}_{i-1} = \check{x}_{i} \boxplus \left( -\Delta t \, f(\check{x}_{i}, u_{i}, 0) \right)$$

where $\check{x}_{i-1}$ denotes the third state quantity of the system at the (i−1)-th sampling instant and $\check{x}_{i}$ denotes the third state quantity of the system at the i-th sampling instant;
step 2.4, based on the third state quantity, obtaining the transformation matrix $T_{L}^{I}$ from the lidar coordinate system to the IMU coordinate system and the estimated, updated transformation matrix $T_{I_{k}}^{G}$ from the IMU coordinate system $I_{k}$ at time $t_{k}$ to the global coordinate system G, and using $T_{L}^{I}$ and $T_{I_{k}}^{G}$ to convert the point cloud coordinates of each frame, expressed in its own coordinate system, within one lidar scan into the coordinate system at the scan end time, obtaining the global coordinates:

$${}^{G}p_{f_{j}} = T_{I_{k}}^{G} \, T_{L}^{I} \, {}^{L_{k}}p_{f_{j}}, \qquad j = 1, \ldots, m$$

where ${}^{L_{k}}p_{f_{j}}$ denotes the laser point cloud coordinates, $L_{k}$ denotes the lidar frame of the k-th scan, G denotes the global coordinate system, $f_{j}$ denotes the j-th feature point, ${}^{L_{k}}p_{f_{j}}$ denotes the coordinates of the j-th feature point in the lidar frame of the k-th scan, and ${}^{G}p_{f_{j}}$ denotes the global coordinates of the j-th feature point after coordinate conversion; T denotes the coordinate transformation matrix from the coordinate system in its lower-right index to the coordinate system in its upper-left index, I denotes the IMU coordinate system, L denotes the lidar coordinate system, $T_{L}^{I}$ denotes the transformation matrix from the lidar coordinate system to the IMU coordinate system, $T_{I_{k}}^{G}$ denotes the updated transformation matrix from the IMU coordinate system $I_{k}$ at time $t_{k}$ to the global coordinate system G, and m is the number of points in the point cloud;
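The coordinate chain of step 2.4 (lidar frame to IMU frame to global frame) can be sketched as below; the two 4 x 4 homogeneous transforms, the extrinsic calibration T_imu_lidar and the filtered pose T_global_imu, are assumed inputs:

    import numpy as np

    def lidar_points_to_global(points_lidar, T_imu_lidar, T_global_imu):
        # p_global = T_global_imu @ T_imu_lidar @ p_lidar for every point of one frame.
        # points_lidar: (m, 3); both transforms are (4, 4) homogeneous matrices.
        pts_h = np.hstack([points_lidar, np.ones((len(points_lidar), 1))])
        T = T_global_imu @ T_imu_lidar
        return (T @ pts_h.T).T[:, :3]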
and 2.5, adding all the feature points at each time to the existing map according to the global coordinate to obtain the three-dimensional point cloud map under the global coordinate system.
In the global mapping process of step 2.5, each point in the cloud has a fixed serial number and a feature quantity; the feature quantity is the point cloud segmentation result obtained by the semantic segmentation model in step 3. By associating the semantic segmentation result of the local point cloud obtained in step 3 with the coordinates of the generated global three-dimensional point cloud map, a global three-dimensional semantic map is obtained.
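The association of per-scan segmentation results with the global map (steps 2.5 and 3) can be sketched as follows: every inserted point keeps a running serial number, its global coordinate, and the semantic label predicted for it in the lidar frame. The flat array-based container is an illustration only, not the data structure used by the patent:

    import numpy as np

    class GlobalSemanticMap:
        # Minimal global semantic map: serial number, global coordinate, and one label per point.
        def __init__(self):
            self._points = []    # list of (3,) global coordinates
            self._labels = []    # list of int semantic labels, aligned with self._points

        def insert(self, global_points, labels):
            # global_points: (m, 3) from the mapping module; labels: (m,) from the segmentation model
            for p, c in zip(global_points, labels):
                self._points.append(np.asarray(p, dtype=float))
                self._labels.append(int(c))

        def as_arrays(self):
            # Returns (M, 3) coordinates, (M,) labels, and (M,) serial numbers.
            pts = np.vstack(self._points) if self._points else np.empty((0, 3))
            lab = np.asarray(self._labels, dtype=int)
            return pts, lab, np.arange(len(lab))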
Example 2
On the basis of the solid-state laser radar three-dimensional semantic map building method for the self-supervised learning in the embodiment 1, the embodiment also discloses a solid-state laser radar three-dimensional semantic map building device for the self-supervised learning, and referring to fig. 2, the device mainly comprises an RGB camera, a laser radar, an inertia measuring device, a data acquisition module, a point cloud semantic segmentation model, a self-supervised training module, a three-dimensional point cloud map building module and a three-dimensional semantic map building module. Specifically, the method comprises the following steps:
the system comprises an RGB camera, a laser radar and an inertia measuring device, wherein the RGB camera, the laser radar and the inertia measuring device are arranged on carriers such as a mobile robot or handheld equipment and are respectively used for collecting RGB images, three-dimensional point cloud data and inertia data;
the data acquisition module is connected with the RGB camera, the laser radar and the inertia measurement device and is used for acquiring the acquired RGB image, the three-dimensional point cloud data and the inertia data;
the point cloud semantic segmentation model is connected with the data acquisition module and is used for performing point cloud semantic segmentation on the three-dimensional point cloud to obtain a point cloud segmentation result corresponding to each point cloud coordinate;
the self-supervision training module is connected with the data acquisition module and the point cloud semantic segmentation model and is used for carrying out self-supervision training on the point cloud semantic segmentation model according to the synchronously acquired RGB image and the three-dimensional point cloud data set;
the three-dimensional point cloud map building module is connected with the data acquisition module and used for fusing the real-time three-dimensional point cloud and the real-time inertial data and outputting a dense three-dimensional point cloud map consisting of the real-time point cloud data under the global coordinate;
the three-dimensional semantic map building module is connected with the point cloud semantic segmentation model and the three-dimensional point cloud map building module and is used for enabling the point cloud segmentation result corresponding to each point cloud coordinate to correspond to the dense three-dimensional point cloud map coordinate to generate the three-dimensional semantic map.
In a specific application process, each functional module of the solid-state lidar three-dimensional semantic map building device is the same as that in embodiment 1, and therefore, the description thereof is omitted in this embodiment.
The above description is only a preferred embodiment of the present invention, and is not intended to limit the scope of the present invention, and all modifications and equivalents of the present invention, which are made by the contents of the present specification and the accompanying drawings, or directly/indirectly applied to other related technical fields, are included in the scope of the present invention.

Claims (8)

1. A solid-state laser radar three-dimensional semantic mapping method for self-supervision learning is characterized by comprising the following steps:
step 1, constructing a point cloud semantic segmentation model of a coding-decoding structure, and carrying out self-supervision training on the point cloud semantic segmentation model based on a synchronously acquired RGB image and a three-dimensional point cloud data set;
step 2, collecting real-time three-dimensional point cloud and real-time inertial data through a small solid-state laser radar and an inertial measurement element, fusing the real-time three-dimensional point cloud and the real-time inertial data based on a Kalman filter, and outputting a dense three-dimensional point cloud map consisting of the real-time point cloud data under a global coordinate, wherein the dense three-dimensional point cloud map specifically comprises the following steps:
step 2.1, performing inertia integration on the real-time inertia data to obtain a first initial state quantity of the system;
2.2, based on the real-time three-dimensional point cloud, performing state updating on the first state quantity by adopting Kalman filtering to obtain a second state quantity;
step 2.3, updating the reverse state of the second state quantity to obtain a third state quantity of the system;
step 2.4, obtaining, based on the third state quantity, a conversion matrix from the laser radar coordinate system to an inertial coordinate system and an estimated, updated conversion matrix from the inertial coordinate system to a global coordinate system, and converting the point cloud coordinates of each frame, expressed in its own coordinate system, within one scan of the laser radar into the coordinate system at the scan end time to obtain the global coordinates;
step 2.5, adding all the feature points at each sampling time to the existing map according to their global coordinates to obtain a dense three-dimensional point cloud map under the global coordinate system;
step 3, inputting the real-time three-dimensional point cloud into the trained point cloud semantic segmentation model to obtain a point cloud segmentation result corresponding to each point cloud coordinate at the current moment, and corresponding the point cloud segmentation result corresponding to each point cloud coordinate with the dense three-dimensional point cloud map coordinate to generate a three-dimensional semantic map;
and step 4, deploying the trained point cloud semantic segmentation model on computing equipment with an ARM + GPU architecture, constructing a semantic mapping system together with the small solid-state laser radar and the inertial measurement element, acquiring three-dimensional point clouds and inertial data and updating the map at a fixed frequency, performing real-time semantic segmentation, and generating a three-dimensional semantic map.
2. The self-supervised learning solid-state lidar three-dimensional semantic mapping method according to claim 1, wherein in the step 1, the point cloud semantic segmentation model comprises:
the encoding layer, which is used for extracting features from the input three-dimensional point cloud;
the decoding layer, which is used for finding the K nearest neighbouring points of each input point on the basis of the features extracted by the encoding layer, up-sampling the point cloud feature set by nearest-neighbour interpolation, and concatenating the up-sampled features with the features obtained by the encoder to obtain the feature vector output by the decoding layer;
and the semantic prediction layer, which is used for mapping the feature vector obtained from the decoding layer, through a fully connected layer and the rectified linear unit (ReLU) activation, to a score matrix $S \in \mathbb{R}^{N \times c}$, where c is the number of semantic categories and N is the number of points in the three-dimensional point cloud.
3. The self-supervised learning solid-state lidar three-dimensional semantic mapping method according to claim 2, wherein in step 1, the self-supervised training is performed on the point cloud semantic segmentation model, specifically:
step 1.1, corresponding three-dimensional point cloud coordinates and RGB image pixels one by one through a geometric relationship, obtaining a semantic segmentation result of each pixel in an RGB image by adopting a trained RGB image segmentation model, and obtaining a self-supervision semantic label through the corresponding relationship between the three-dimensional point cloud coordinates and the RGB image pixels;
step 1.2, comparing the point features output by the point cloud segmentation network with the self-supervised semantic labels obtained through the RGB image correspondence, and making the semantic results output by the point cloud semantic segmentation model approach the self-supervised semantic labels by iteratively adjusting the parameters of each level of the point cloud semantic segmentation model;
and step 1.3, after a certain number of iterations, retaining the model whose output scores are closest to the self-supervised semantic labels, thereby forming the point cloud semantic segmentation model.
4. The self-supervised learning solid-state lidar three-dimensional semantic mapping method according to claim 1, 2 or 3, wherein in step 2.1, the inertial measurement noise $w_{i}$ is set to 0 and the inertial integration is:

$$\hat{x}_{i+1} = \hat{x}_{i} \boxplus \left( \Delta t \, f(\hat{x}_{i}, u_{i}, 0) \right)$$

in the formula, $f(\hat{x}_{i}, u_{i}, 0)$ is the inertial integration function of $\hat{x}_{i}$, $u_{i}$ and the noise (set to 0), $\boxplus$ represents the exponential operator, and their combination represents the inertial-integration state update process; $\Delta t$ is the difference between two adjacent sampling instants $\tau_{i+1}$ and $\tau_{i}$; i is the label of the inertial data sample; $u_{i}$ denotes the real-time inertial data of the i-th IMU sample; $\hat{x}$ indicates the state of the system, wherein $\hat{x}_{i}$ indicates the first state quantity of the system at the i-th sampling instant and $\hat{x}_{i+1}$ indicates the first state quantity of the system at the (i+1)-th sampling instant.
5. The self-supervised learning solid-state lidar three-dimensional semantic mapping method according to claim 4, wherein the step 2.2 is specifically as follows:

firstly, calculating the point cloud residual z of the real-time three-dimensional point cloud as:

$$z_{j} = u_{j}^{T} \left( \hat{p}_{j}^{G} - q_{j} \right)$$

in the formula, $q_{j}$ is the location of the nearest feature point on the map, $u_{j}$ is the normal vector of the plane (or the direction of the edge) on which the point lies, and $\hat{p}_{j}^{G}$ is the point cloud position estimated by the IMU;

then adopting an iterated Kalman filter to iteratively update the state estimate until the point cloud residual z converges, wherein the process of iteratively updating the state estimate is:

$$\hat{x}_{k}^{\kappa+1} = \hat{x}_{k}^{\kappa} \boxplus \left( -K z_{k}^{\kappa} - (I - K H)\, J^{-1} \left( \hat{x}_{k}^{\kappa} \boxminus \hat{x}_{k} \right) \right)$$

in the formula, $\hat{x}_{k}^{\kappa+1}$ represents the estimated state value after the (κ+1)-th Kalman filtering iteration at time $t_{k}$, $\hat{x}_{k}^{\kappa}$ represents the estimated state value after the κ-th Kalman filtering iteration at time $t_{k}$, $t_{k}$ represents the scan end time of the k-th lidar scan, I represents the identity matrix, H represents the observation matrix, J represents the partial derivative of the error state with respect to the state estimate, $\boxminus$ is the logarithmic-map operator corresponding to the exponential operator $\boxplus$, $x_{k}$ represents the true state at time $t_{k}$, $x_{k} \boxminus \hat{x}_{k}$ is the error state between the true state and the propagated value $\hat{x}_{k}$, and $\hat{x}_{k}^{\kappa} \boxminus \hat{x}_{k}$ indicates the error between the propagated value $\hat{x}_{k}$ and the estimated value $\hat{x}_{k}^{\kappa}$;

K represents the Kalman filter gain matrix, and the calculation process is as follows:

$$\hat{P}_{i+1} = F_{\tilde{x}} \hat{P}_{i} F_{\tilde{x}}^{T} + F_{w} Q F_{w}^{T}$$

$$P = J^{-1} \hat{P}_{k} J^{-T}$$

$$K = \left( H^{T} R^{-1} H + P^{-1} \right)^{-1} H^{T} R^{-1}$$

in the formula, $\hat{P}$ represents the covariance matrix, $\hat{P}_{i+1}$ represents the IMU state covariance at the (i+1)-th sampling instant, $\hat{P}_{i}$ represents the IMU state covariance at the i-th sampling instant, P represents the intermediate variable generated from the covariance matrix $\hat{P}_{k}$ and the partial derivative matrix J, $F_{\tilde{x}}$ and $F_{w}$ respectively represent the partial derivative matrices of f with respect to the state and the noise w, Q indicates the covariance of the IMU noise w, R indicates the measurement noise covariance, and the superscript T represents the matrix transpose;

in the above iterative update of the state estimate, after the point cloud residual z converges, the optimal state estimate, i.e. the second state quantity, can be obtained as follows:

$$\bar{x}_{k} = \hat{x}_{k}^{\kappa+1}$$

in the formula, $\bar{x}_{k}$ represents the second state quantity of the system at the k-th sampling instant.
6. The self-supervised learning solid-state lidar three-dimensional semantic mapping method according to claim 5, wherein in step 2.3, the backward state update of the second state quantity is specifically:

$$\check{x}_{i-1} = \check{x}_{i} \boxplus \left( -\Delta t \, f(\check{x}_{i}, u_{i}, 0) \right)$$

in the formula, $\check{x}_{i-1}$ indicates the third state quantity of the system at the (i−1)-th sampling instant, and $\check{x}_{i}$ indicates the third state quantity of the system at the i-th sampling instant.
7. The self-supervised learning solid-state lidar three-dimensional semantic mapping method according to claim 1, 2 or 3, wherein in step 2.4, converting the point cloud coordinates of the own coordinate system of each frame in one scan of the lidar into the coordinates of the global coordinate system is specifically:

$${}^{G}p_{f_{j}} = T_{I_{k}}^{G} \, T_{L}^{I} \, {}^{L_{k}}p_{f_{j}}, \qquad j = 1, \ldots, m$$

in the formula, ${}^{L_{k}}p_{f_{j}}$ represents the laser point cloud coordinates, $L_{k}$ represents the lidar frame of the k-th scan, G represents the global coordinate system, $f_{j}$ indicates the j-th feature point, ${}^{L_{k}}p_{f_{j}}$ indicates the coordinates of the j-th feature point in the lidar frame of the k-th scan, and ${}^{G}p_{f_{j}}$ represents the global coordinates of the j-th feature point after coordinate conversion; T represents the coordinate transformation matrix from the coordinate system in its lower-right index to the coordinate system in its upper-left index, I represents the inertial coordinate system, L represents the lidar coordinate system, $T_{L}^{I}$ represents the transformation matrix from the lidar coordinate system to the inertial coordinate system, $T_{I_{k}}^{G}$ represents the updated transformation matrix from the inertial coordinate system $I_{k}$ at time $t_{k}$ to the global coordinate system G, and m is the number of points in the lidar point cloud.
8. An apparatus for building a three-dimensional semantic map of a solid-state lidar for self-supervised learning, wherein the method of any one of claims 1 to 7 is adopted, and the apparatus comprises:
the system comprises an RGB camera, a laser radar and an inertia measuring device, wherein the RGB camera, the laser radar and the inertia measuring device are respectively used for collecting RGB images, three-dimensional point cloud data and inertia data;
the data acquisition module is connected with the RGB camera, the laser radar and the inertia measurement device and is used for acquiring RGB images, three-dimensional point cloud data and inertia data;
the point cloud semantic segmentation model is connected with the data acquisition module and is used for performing point cloud semantic segmentation on the three-dimensional point cloud to obtain a point cloud segmentation result corresponding to each point cloud coordinate;
the self-supervision training module is connected with the data acquisition module and the point cloud semantic segmentation model and is used for carrying out self-supervision training on the point cloud semantic segmentation model according to the synchronously acquired RGB image and the three-dimensional point cloud data set;
the three-dimensional point cloud map building module is connected with the data acquisition module and used for fusing the real-time three-dimensional point cloud and the real-time inertial data and outputting a dense three-dimensional point cloud map consisting of the real-time point cloud data under the global coordinate;
and the three-dimensional semantic map building module, which is connected with the point cloud semantic segmentation model and the three-dimensional point cloud map building module and is used for generating a three-dimensional semantic map by associating the point cloud segmentation result corresponding to each point cloud coordinate with the dense three-dimensional point cloud map coordinates.
CN202211387608.5A 2022-11-07 2022-11-07 Self-supervision learning solid-state laser radar three-dimensional semantic mapping method and device Active CN115421158B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202211387608.5A CN115421158B (en) 2022-11-07 2022-11-07 Self-supervision learning solid-state laser radar three-dimensional semantic mapping method and device

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202211387608.5A CN115421158B (en) 2022-11-07 2022-11-07 Self-supervision learning solid-state laser radar three-dimensional semantic mapping method and device

Publications (2)

Publication Number Publication Date
CN115421158A CN115421158A (en) 2022-12-02
CN115421158B true CN115421158B (en) 2023-04-07

Family

ID=84207166

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202211387608.5A Active CN115421158B (en) 2022-11-07 2022-11-07 Self-supervision learning solid-state laser radar three-dimensional semantic mapping method and device

Country Status (1)

Country Link
CN (1) CN115421158B (en)

Families Citing this family (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN116229057B (en) * 2022-12-22 2023-10-27 之江实验室 Method and device for three-dimensional laser radar point cloud semantic segmentation based on deep learning
CN115638788B (en) * 2022-12-23 2023-03-21 安徽蔚来智驾科技有限公司 Semantic vector map construction method, computer equipment and storage medium
CN116778162A (en) * 2023-06-25 2023-09-19 南京航空航天大学 Weak supervision large aircraft appearance point cloud semantic segmentation method based on geometric feature guidance
CN117517864B (en) * 2023-11-08 2024-04-26 南京航空航天大学 Laser radar-based power transmission line near electricity early warning method and device

Family Cites Families (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106688017B (en) * 2016-11-28 2019-03-01 深圳市大疆创新科技有限公司 Generate method, computer system and the device of point cloud map
CN110097553B (en) * 2019-04-10 2023-05-02 东南大学 Semantic mapping system based on instant positioning mapping and three-dimensional semantic segmentation
CN111274976B (en) * 2020-01-22 2020-09-18 清华大学 Lane detection method and system based on multi-level fusion of vision and laser radar
CN112634451B (en) * 2021-01-11 2022-08-23 福州大学 Outdoor large-scene three-dimensional mapping method integrating multiple sensors
CN113128591B (en) * 2021-04-14 2023-12-05 中山大学 Rotary robust point cloud classification method based on self-supervision learning
CN113763423A (en) * 2021-08-03 2021-12-07 中国北方车辆研究所 Multi-mode data based systematic target recognition and tracking method
CN114926469A (en) * 2022-04-26 2022-08-19 中南大学 Semantic segmentation model training method, semantic segmentation method, storage medium and terminal
CN114898322A (en) * 2022-06-13 2022-08-12 中国第一汽车股份有限公司 Driving environment identification method and device, vehicle and storage medium
CN115222919A (en) * 2022-07-27 2022-10-21 徐州徐工矿业机械有限公司 Sensing system and method for constructing color point cloud map of mobile machine

Also Published As

Publication number Publication date
CN115421158A (en) 2022-12-02

Similar Documents

Publication Publication Date Title
CN115421158B (en) Self-supervision learning solid-state laser radar three-dimensional semantic mapping method and device
CN112014857B (en) Three-dimensional laser radar positioning and navigation method for intelligent inspection and inspection robot
Velas et al. CNN for IMU assisted odometry estimation using velodyne LiDAR
JP7236565B2 (en) POSITION AND ATTITUDE DETERMINATION METHOD, APPARATUS, ELECTRONIC DEVICE, STORAGE MEDIUM AND COMPUTER PROGRAM
CN110675418B (en) Target track optimization method based on DS evidence theory
CN113393522B (en) 6D pose estimation method based on monocular RGB camera regression depth information
CN109947097B (en) Robot positioning method based on vision and laser fusion and navigation application
JP6456141B2 (en) Generating map data
US7747106B2 (en) Method and system for filtering, registering, and matching 2.5D normal maps
CN104881029B (en) Mobile Robotics Navigation method based on a point RANSAC and FAST algorithms
CN115655262A (en) Deep learning perception-based multi-level semantic map construction method and device
CN115900710A (en) Dynamic environment navigation method based on visual information
CN116222577B (en) Closed loop detection method, training method, system, electronic equipment and storage medium
CN112183247A (en) Laser point cloud data classification method based on multispectral image
CN117036300A (en) Road surface crack identification method based on point cloud-RGB heterogeneous image multistage registration mapping
CN117808689A (en) Depth complement method based on fusion of millimeter wave radar and camera
CN113804182B (en) Grid map creation method based on information fusion
CN117570968A (en) Map construction and maintenance method and device based on visual road sign and storage medium
CN114384486A (en) Data processing method and device
CN116704032A (en) Outdoor visual SLAM method based on monocular depth estimation network and GPS
US11741631B2 (en) Real-time alignment of multiple point clouds to video capture
CN113554705A (en) Robust positioning method for laser radar in changing scene
CN115908564A (en) Storage line inspection method of automatic transportation equipment and automatic transportation equipment
CN116895014A (en) Semantic map construction method and device, electronic equipment and storage medium
CN117765025A (en) Motion trail prediction method, medium and system for long-distance bird target

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant