CN111915672B - Target labeling method and device based on 3D virtual driving scene - Google Patents


Info

Publication number
CN111915672B
CN111915672B (application CN202010695891.2A)
Authority
CN
China
Prior art keywords: virtual, scene, virtual driving, driving, background picture
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN202010695891.2A
Other languages
Chinese (zh)
Other versions
CN111915672A
Inventor
朱向雷
陈辰
任女尔
程旭
梅俊宇
陈荣杰
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
China Automotive Technology and Research Center Co Ltd
Automotive Data of China Tianjin Co Ltd
Original Assignee
China Automotive Technology and Research Center Co Ltd
Automotive Data of China Tianjin Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by China Automotive Technology and Research Center Co Ltd, Automotive Data of China Tianjin Co Ltd filed Critical China Automotive Technology and Research Center Co Ltd
Priority to CN202010695891.2A priority Critical patent/CN111915672B/en
Publication of CN111915672A publication Critical patent/CN111915672A/en
Application granted granted Critical
Publication of CN111915672B publication Critical patent/CN111915672B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06T IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T7/00 Image analysis
    • G06T7/70 Determining position or orientation of objects or cameras
    • G06T17/00 Three dimensional [3D] modelling, e.g. data description of 3D objects
    • G06T3/00 Geometric image transformations in the plane of the image
    • G06T3/60 Rotation of whole images or parts thereof
    • G06T3/604 Rotation of whole images or parts thereof using coordinate rotation digital computer [CORDIC] devices
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00 Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/20 Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
    • H04N21/23 Processing of content or additional data; Elementary server operations; Server middleware
    • H04N21/235 Processing of additional data, e.g. scrambling of additional data or processing content descriptors
    • H04N21/2353 Processing of additional data specifically adapted to content descriptors, e.g. coding, compressing or processing of metadata
    • G06T2207/00 Indexing scheme for image analysis or image enhancement
    • G06T2207/10 Image acquisition modality
    • G06T2207/10004 Still image; Photographic image
    • G06T2207/10012 Stereo images
    • G06T2207/30 Subject of image; Context of image processing
    • G06T2207/30204 Marker
    • Y GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02 TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02D CLIMATE CHANGE MITIGATION TECHNOLOGIES IN INFORMATION AND COMMUNICATION TECHNOLOGIES [ICT], I.E. INFORMATION AND COMMUNICATION TECHNOLOGIES AIMING AT THE REDUCTION OF THEIR OWN ENERGY USE
    • Y02D10/00 Energy efficient computing, e.g. low power processors, power management or thermal management

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Geometry (AREA)
  • Computer Graphics (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Software Systems (AREA)
  • Library & Information Science (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Traffic Control Systems (AREA)
  • Processing Or Creating Images (AREA)

Abstract

The invention provides a target labeling method and device based on a 3D virtual driving scene. The method comprises: constructing the 3D virtual driving scene; setting virtual scene elements to be labeled at preset positions in the scene; obtaining the spatial three-dimensional coordinates of the four vertexes of the minimum circumscribed rectangle tangent to the front surface of each virtual scene element; performing virtual driving shooting along a virtual driving route with a virtual vehicle equipped with a virtual camera; defining the shot pictures that contain the virtual scene elements to be labeled as key background pictures; and labeling, in batches, the virtual scene elements inside the minimum circumscribed rectangles in the key background pictures. Relying on the 3D virtual scene, the invention realizes batch labeling and batch export of targets and improves labeling efficiency and precision.

Description

Target labeling method and device based on 3D virtual driving scene
Technical Field
The invention relates to the field of intelligent driving of automobiles, in particular to a target labeling method and device based on a 3D virtual driving scene.
Background
Currently, most data annotation targets video images. Traditional labeling methods generally extract each frame of the video first and then record the coordinate positions of the key factors of interest in each image; this is inefficient and consumes considerable system CPU resources.
In addition, patent CN106791937A proposes a method and system for labeling video images. A canvas is created whose pixels correspond to the pixels of the video image to be labeled; labeling content is drawn at designated positions on the canvas according to the received labeling command and the pixel correspondence; the canvas is format-converted to obtain a first labeled image; and the pixel values of the labeling content on the first labeled image are assigned to the corresponding positions on the video image according to the pixel correspondence, yielding the labeled video image. Drawing the labeling content on a canvas reduces system CPU consumption during video image labeling, but the labeling efficiency remains relatively low.
Moreover, intelligent driving is the development direction of the future automobile industry: relevant policies and regulations are gradually taking shape, the technology in the industry is continuously improving, and the development prospects are promising. In both real-vehicle and simulation experiments on intelligent driving, the vehicle-mounted camera acquires road-condition images during driving to assist driving decisions. Labeling driving-image data is therefore the starting point for machine perception of the world. At present, driving images are labeled by manually annotating a large number of captured road-condition images one by one, and the labeled data is used for learning by the vehicle-mounted camera.
Current data labeling operates on two-dimensional images; there are few results on labeling designated regions in a 3D virtual scene that reconstructs the objective world, and existing labeling methods offer limited labeling precision and efficiency.
In view of the above, the present invention is particularly proposed.
Disclosure of Invention
The invention aims to provide a target labeling method based on a 3D virtual driving scene, which realizes batch labeling and batch export of targets by means of the 3D virtual scene and improves the efficiency and the precision of labeling.
In order to achieve the purpose, the invention provides the following technical scheme:
in a first aspect, the present invention provides a target labeling method based on a 3D virtual driving scene, the method including the steps of:
step 1: building a 3D virtual driving scene, wherein the 3D virtual driving scene comprises a virtual road surface and first virtual scene elements arranged at a plurality of preset positions along a virtual path formed by the virtual road surface;
step 2: creating a minimum circumscribed rectangle tangent to the front surface of the first virtual scene element model, and defining the spatial three-dimensional coordinates P_in = (X_in, Y_in, Z_in) of the four vertexes of the minimum circumscribed rectangle, where i is an integer and n = 1, 2, 3, 4, as the four slot points of the first virtual scene element;
step 3: in the 3D virtual driving scene, performing virtual driving shooting along a virtual driving route with a virtual vehicle equipped with a virtual camera, and determining, for each first virtual scene element, the first position L_i1 at which it enters the virtual camera's picture and the second position L_i2 at which it leaves the picture;
step 4: setting a preset movement speed of the virtual vehicle on the virtual driving route, and calculating, based on the preset movement speed, the first time node T_i1 at which the virtual vehicle passes the first position L_i1 and the second time node T_i2 at which it passes the second position L_i2, so as to obtain a labeling configuration table [T_i1, T_i2, P_i1, P_i2, P_i3, P_i4] for each first virtual scene element;
step 5: in the 3D virtual driving scene, performing virtual driving shooting with the virtual vehicle-mounted camera at the preset speed along the virtual driving route, defining the background pictures shot in the time interval between the first time node T_i1 and the second time node T_i2 as key background pictures, labeling the first virtual scene element inside the rectangle formed by the four slot points corresponding to the spatial three-dimensional coordinates P_in in each key background picture to obtain labeling information, and storing the key background pictures and the labeling information;
preferably, labeling the first virtual scene element in the rectangle formed by the four slot points corresponding to the spatial three-dimensional coordinates P_in in the key background picture in step 5 specifically comprises the following steps:
step 5-1: converting the spatial three-dimensional coordinates P_in = (X_in, Y_in, Z_in) of the four slot points in the 3D virtual driving scene into view coordinates S_in = (x_in, y_in) in the key background picture through coordinate transformation;
step 5-2: setting the labeling information format of the key background picture as [x_i1, y_i1, x_i2, y_i2, x_i3, y_i3, x_i4, y_i4, type caption of the first virtual scene element];
preferably, after step 5-2 the view coordinates S_in of the four slot points of the first virtual scene element are also corrected, specifically comprising the following steps:
step 5-1-1: comparing the view coordinates S_in = (x_in, y_in) of the four slot points to obtain Min(x_in), Min(y_in), Max(x_in), Max(y_in); the corrected view coordinates of the four slot points are then S'_i1 = (x_imin, y_imin), S'_i2 = (x_imax, y_imin), S'_i3 = (x_imax, y_imax), S'_i4 = (x_imin, y_imax);
preferably, the method further comprises, after step 5-1-1: if the corrected view coordinates S'_i1 = (x_imin, y_imin), S'_i2 = (x_imax, y_imin), S'_i3 = (x_imax, y_imax), S'_i4 = (x_imin, y_imax) of the four slot points exceed the boundary of the key background picture, correcting the coordinates of the slot points that exceed the boundary to the coordinate values of the intersection points of the minimum circumscribed rectangle and the boundary of the key background picture;
preferably, the first virtual scene element comprises a plurality of different category models, and the arrangement position of each category model in the space is set according to the requirement of diversification of the 3D virtual driving scene;
preferably, the 3D virtual driving scene further includes a plurality of other virtual scene elements, and the configuration relationship of the other virtual scene elements with respect to the first virtual scene element is set by the program.
On the other hand, the invention provides a target labeling device based on a 3D virtual driving scene, which labels the virtual scene elements of the 3D virtual driving scene using the above target labeling method. The target labeling device comprises:
a scene building module for building a 3D virtual driving scene, wherein the 3D virtual driving scene comprises a virtual road surface and first virtual scene elements arranged at a plurality of preset positions along a virtual path formed by the virtual road surface;
a model editing module for creating a minimum circumscribed rectangle tangent to the front surface of the first virtual scene element and defining the spatial three-dimensional coordinates P_in = (X_in, Y_in, Z_in) of the four vertexes of the minimum circumscribed rectangle as the four slot points of the first virtual scene element, where i is an integer and n = 1, 2, 3, 4;
a position determining module for performing virtual driving shooting along a virtual driving route with a virtual vehicle equipped with a virtual camera in the 3D virtual driving scene, and determining, for each first virtual scene element, the first position L_i1 at which it enters the virtual camera's picture and the second position L_i2 at which it leaves the picture;
a labeling configuration module for setting a preset movement speed of the virtual vehicle on the virtual driving route, and calculating, based on the preset movement speed, the first time node T_i1 at which the virtual vehicle passes the first position L_i1 and the second time node T_i2 at which it passes the second position L_i2, so as to obtain a labeling configuration table [T_i1, T_i2, P_i1, P_i2, P_i3, P_i4] for each first virtual scene element;
a picture labeling module for, in the 3D virtual driving scene, performing virtual driving shooting with the virtual vehicle-mounted camera at the preset speed along the virtual driving route, defining the background pictures shot in the time interval between the first time node T_i1 and the second time node T_i2 as key background pictures, labeling the first virtual scene element inside the rectangle formed by the four slot points corresponding to the spatial three-dimensional coordinates P_in in each key background picture to obtain labeling information, and storing the key background pictures and the labeling information;
the scene building module, the model editing module, the position determining module, the labeling configuration module and the picture labeling module are sequentially in bidirectional communication connection.
Preferably, the target labeling device further comprises a coordinate correction module for correcting the view coordinates S_in of the four slot points of the first virtual scene element.
Compared with the prior art, the invention has the following beneficial technical effects:
(1) compared with the traditional approach of collecting a large number of road-condition images with a real vehicle and labeling them manually one by one, the invention builds a 3D virtual driving scene and performs virtual vehicle-mounted camera shooting within it, thereby realizing batch labeling and batch export of images with high efficiency; the export speed can reach 100 pictures per minute;
(2) the virtual scene element configuration in the 3D virtual driving scene can be conveniently set by a program, making the labeled image set richer and closer to reality;
(3) the data labeling precision is high: for traffic element targets labeled via the minimum circumscribed rectangle, the coordinate error is within 1 pixel;
(4) the exported pictures have high resolution: each picture is 3200 x 1800 pixels and about 10 MB in size.
Drawings
In order to more clearly illustrate the embodiments of the present invention or the technical solutions in the prior art, the drawings used in the description of the embodiments or the prior art will be briefly described below, and it is obvious that the drawings in the following description are some embodiments of the present invention, and other drawings can be obtained by those skilled in the art without creative efforts.
FIG. 1 is a key background picture taken by a virtual vehicle-mounted camera in a 3D virtual driving scene according to the present invention;
fig. 2 is a traffic sign board photographed by a virtual vehicle-mounted camera in a 3D virtual driving scene in a non-frontal manner according to the present invention.
Detailed Description
In order to make the objects, technical solutions and advantages of the present invention more apparent, the technical solutions of the present invention will be clearly and completely described below. It is to be understood that the described embodiments are merely exemplary of the invention, and not restrictive of the full scope of the invention. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present invention.
According to one aspect of the invention, a target labeling method based on a 3D virtual driving scene is provided, and the method comprises the following steps:
step 1: the method comprises the steps of building a 3D virtual driving scene, wherein the 3D virtual driving scene comprises a virtual road surface and first virtual scene elements arranged at a plurality of preset positions along a virtual path formed by the virtual road surface.
Specifically, intelligent driving is the development direction of the future automobile industry: relevant policies and regulations are gradually taking shape, the technology in the industry is continuously improving, and the development prospects are promising. In both real-vehicle and simulation experiments on intelligent driving, the vehicle-mounted camera acquires road-condition images during driving to assist driving decisions. Labeling driving-image data is therefore the starting point for machine perception of the world. In this embodiment, a 3D virtual driving scene is constructed, comprising a virtual road surface and first virtual scene elements arranged at a plurality of preset positions along a virtual path formed by the virtual road surface. The first virtual scene elements may be any traffic participation elements that can appear in an actual traffic scene, including but not limited to traffic signs, vehicles, pedestrians, and buildings; in this embodiment, the first virtual scene elements are traffic signs. The 3D virtual driving scene comprises a plurality of virtual road surfaces, and traffic signs are arranged at preset positions of the virtual road surfaces according to the requirements of subsequent picture labeling; that is, the spatial position of each traffic sign within the picture frames in which it appears is set manually and is therefore known.
Step 2: creating a minimum circumscribed rectangle tangent to the front surface of the first virtual scene element model, and defining the spatial three-dimensional coordinates P_in = (X_in, Y_in, Z_in) of the four vertexes of the minimum circumscribed rectangle, where i is an integer and n = 1, 2, 3, 4, as the four slot points of the first virtual scene element.
Specifically, after a first virtual scene element is arranged at a preset position in the 3D virtual driving scene, a minimum circumscribed rectangle tangent to the front surface of the first virtual scene element model is created manually, and the spatial three-dimensional coordinates P_in = (X_in, Y_in, Z_in) of its four vertexes, where i is an integer and n = 1, 2, 3, 4, are defined as the four slot points of the first virtual scene element. The picture content located inside the minimum circumscribed rectangle is the content the machine subsequently needs to learn.
Step 3: in the 3D virtual driving scene, performing virtual driving shooting along a virtual driving route with a virtual vehicle equipped with a virtual camera, and determining, for each first virtual scene element, the first position L_i1 at which it enters the virtual camera's picture and the second position L_i2 at which it leaves the picture.
Specifically, a virtual driving route is set according to the 3D virtual driving scene. The route can be selected according to the specific situation of the scene; the selection principle includes, but is not limited to, covering as many first virtual scene elements as possible. In this embodiment, a route containing as many traffic signs as possible is preferred as the virtual driving route. After the route is selected, virtual driving shooting is performed along it by a virtual vehicle equipped with a virtual camera. In the invention, the pictures in which traffic signs appear among those shot by the virtual camera are the pictures that need subsequent labeling, and they are defined as key background pictures. For each traffic sign arranged along the way, the first position L_i1 at which it enters the virtual camera's picture and the second position L_i2 at which it leaves the picture are determined; all pictures shot by the virtual camera within the position interval between L_i1 and L_i2 are key background pictures and need subsequent labeling.
Step 4: setting a preset movement speed of the virtual vehicle on the virtual driving route, and calculating, based on the preset movement speed, the first time node T_i1 at which the virtual vehicle passes the first position L_i1 and the second time node T_i2 at which it passes the second position L_i2, so as to obtain a labeling configuration table [T_i1, T_i2, P_i1, P_i2, P_i3, P_i4] for each first virtual scene element.
Specifically, after the first position L_i1 and the second position L_i2 corresponding to each traffic sign are determined in step 3, a preset movement speed of the virtual vehicle on the virtual driving route is set, and based on it the first time node T_i1 at which the virtual vehicle passes L_i1 and the second time node T_i2 at which it passes L_i2 are calculated, yielding the labeling configuration table [T_i1, T_i2, P_i1, P_i2, P_i3, P_i4] for each first virtual scene element. According to this table, when the virtual vehicle travels along the virtual driving route at the preset speed, all background pictures shot within the time interval (T_i1, T_i2) are key background pictures, and the picture content inside the rectangular frame formed by the four slot points corresponding to the spatial three-dimensional coordinates P_i1, P_i2, P_i3, P_i4 is the content to be labeled in each key background picture.
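To make step 4 concrete, the labeling configuration table can be sketched in code. This is an illustrative sketch only: it assumes each element's entry and exit are given as distances along the route, and all names (`ElementConfig`, `build_config_table`, `speed_mps`) are hypothetical rather than taken from the patent.

```python
from dataclasses import dataclass

@dataclass
class ElementConfig:
    t_enter: float     # T_i1: time node at which the element enters the camera picture
    t_exit: float      # T_i2: time node at which the element leaves the camera picture
    slot_points: list  # P_i1..P_i4: spatial 3D coordinates of the four slot points

def build_config_table(elements, speed_mps):
    """elements: (entry_distance_m, exit_distance_m, slot_points) per element.

    With a constant preset speed, each time node is simply distance / speed.
    """
    return [ElementConfig(d_in / speed_mps, d_out / speed_mps, pts)
            for d_in, d_out, pts in elements]

# One traffic sign that enters the picture 50 m along the route and leaves at 80 m,
# with a vehicle speed of 10 m/s: T_i1 = 5 s, T_i2 = 8 s.
table = build_config_table([(50.0, 80.0, [(1.0, 2.0, 3.0)] * 4)], speed_mps=10.0)
```

With the table in hand, any frame timestamped inside (T_i1, T_i2) can be flagged as a key background picture without inspecting its pixels.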
Step 5: in the 3D virtual driving scene, the virtual vehicle-mounted camera performs virtual driving shooting at the preset speed along the virtual driving route; the background pictures shot in the time interval between the first time node T_i1 and the second time node T_i2 are defined as key background pictures; the first virtual scene element inside the rectangle formed by the four slot points corresponding to the spatial three-dimensional coordinates P_in in each key background picture is labeled to obtain labeling information; and the key background pictures and the labeling information are stored.
Specifically, in the 3D virtual driving scene, virtual driving shooting is performed along the virtual driving route by the virtual vehicle-mounted camera. As shown in fig. 1, the background pictures shot within the time interval between the first time node T_i1 and the second time node T_i2 contain a traffic sign; these picture frames are defined as key background pictures. The traffic sign inside the rectangular frame formed by the four slot points corresponding to the spatial three-dimensional coordinates P_i1, P_i2, P_i3, P_i4 in each key background picture is labeled to obtain labeling information, and the key background picture and the labeling information are stored.
Preferably, labeling the first virtual scene element in the rectangle formed by the four slot points corresponding to the spatial three-dimensional coordinates P_in in the key background picture in step 5 specifically comprises the following steps:
Step 5-1: converting the spatial three-dimensional coordinates P_in = (X_in, Y_in, Z_in) of the four slot points in the 3D virtual driving scene into view coordinates S_in = (x_in, y_in) in the key background picture through coordinate transformation;
Step 5-2: setting the labeling information format of the key background picture as [x_i1, y_i1, x_i2, y_i2, x_i3, y_i3, x_i4, y_i4, type caption of the first virtual scene element];
Specifically, in this embodiment the first virtual scene element is a traffic sign. Traffic signs fall into multiple categories, such as maximum speed signs, minimum speed signs, no-left-turn signs, and warning signs. For a key background picture containing a traffic sign, the labeling information consists of the view coordinates of the sign's four slot points and the sign's category. For example, if the traffic sign reads "vehicle speed must not exceed 50 km/h", the labeling information of the key background picture is [x_i1, y_i1, x_i2, y_i2, x_i3, y_i3, x_i4, y_i4, vehicle speed must not exceed 50 km/h].
Preferably, after step 5-2 the view coordinates S_in of the four slot points of the first virtual scene element are also corrected, which specifically comprises the following steps:
Step 5-1-1: comparing the view coordinates S_in = (x_in, y_in) of the four slot points to obtain Min(x_in), Min(y_in), Max(x_in), Max(y_in); the corrected view coordinates of the four slot points are then S'_i1 = (x_imin, y_imin), S'_i2 = (x_imax, y_imin), S'_i3 = (x_imax, y_imax), S'_i4 = (x_imin, y_imax);
Preferably, the method also comprises the step of correcting the coordinate S 'if the view of the four slot points is corrected after the step 5-1-1' i1 (x imin ,y imin ),S′ i2 (x imax ,y imin ),S′ i3 (x imax ,y imax ),S′ i4 (x imin ,y imax ) If the boundary of the key background picture is exceeded, correcting the coordinates of the slot points exceeding the boundary into the coordinate values of the intersection points of the minimum external rectangle and the boundary of the key background picture;
specifically, as shown in fig. 2, the view coordinates S of the four slot points obtained in step 5-2 are due to the problem of the shooting angle of the virtual in-vehicle camera in the 3D virtual driving environment in (x in ,y in ) The quadrilateral formed may be an irregular quadrilateral. In order to ensure that the shape of the marking frame is always the minimum circumscribed rectangle, view coordinates S of four groove points are compared in (x in ,y in ) Separately obtaining Min (x) in ),Min(y in ),Max(x in ),Max(y in ) And then the view correction coordinates of the four slot points are S' i1 (x imin ,y imin ),S′ i2 (x imax ,y imin ),S′ i3 (x imax ,y imax ),S′ i4 (x imin ,y imax );
Preferably, the method further comprises, after step 5-1-1: if the corrected view coordinates S'_i1 = (x_imin, y_imin), S'_i2 = (x_imax, y_imin), S'_i3 = (x_imax, y_imax), S'_i4 = (x_imin, y_imax) of the four slot points exceed the boundary of the key background picture, correcting the coordinates of the slot points that exceed the boundary to the coordinate values of the intersection points of the minimum circumscribed rectangle and the boundary of the key background picture.
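For an axis-aligned rectangle, replacing out-of-bounds slot points with the intersection of the rectangle and the picture border is equivalent to clamping each coordinate to the picture bounds; a sketch with assumed names:

```python
def clamp_to_picture(rect_points, width, height):
    """Clamp corrected slot points S'_in to the key background picture boundary."""
    return [(min(max(x, 0), width - 1), min(max(y, 0), height - 1))
            for x, y in rect_points]

# A rectangle spilling over all four edges of a 3200 x 1800 picture is
# pulled back onto the border.
clamped = clamp_to_picture([(-5, 10), (3210, 10), (3210, 1900), (-5, 1900)],
                           width=3200, height=1800)
```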
In the invention, the motion trajectory and speed of the virtual vehicle-mounted camera in the 3D virtual driving scene are planned according to the virtual driving route. The number of key background pictures extracted from the camera and their time interval are set based on the movement speed; finally, the labeled key background pictures and the labeling information in csv format are exported in bulk at a rate of 100 pictures per minute, with the labeled coordinate error of each traffic sign within one pixel.
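The batch csv export could be sketched as below. The column names and file layout are assumptions; only the record content (four corrected view coordinates plus the type caption, one row per key background picture) follows the labeling information format described above.

```python
import csv
import io

def export_annotations(rows):
    """rows: (picture_name, x1, y1, x2, y2, x3, y3, x4, y4, type_caption) tuples."""
    buf = io.StringIO()
    writer = csv.writer(buf)
    writer.writerow(["picture", "x1", "y1", "x2", "y2",
                     "x3", "y3", "x4", "y4", "type"])
    writer.writerows(rows)
    return buf.getvalue()

out = export_annotations([
    ("frame_0001.png", 8, 8, 42, 8, 42, 30, 8, 30,
     "vehicle speed must not exceed 50 km/h"),
])
```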
Preferably, the first virtual scene element comprises a plurality of models of different categories, and the spatial arrangement position of each category of model is set reasonably according to the diversification requirements of the 3D virtual driving scene;
Specifically, in this embodiment the first virtual scene element is a traffic sign, and traffic signs include multiple types, such as maximum speed signs, minimum speed signs, no-left-turn signs, and warning signs. Multiple types of traffic signs are configured on the virtual road according to the diversification requirements of the 3D virtual driving scene, and traffic signs of the same type are arranged at different azimuth angles;
preferably, the 3D virtual driving scene further includes a plurality of other virtual scene elements, and the configuration relationship of the other virtual scene elements with respect to the first virtual scene element is set by the program.
Specifically, in this embodiment the 3D virtual driving scene may further include traffic participants such as trees, buildings, and pedestrians, as well as weather. The weather setting applies different lighting effects for different weather and, combined with particle effects, creates an environmental effect similar to the real world. To overcome the scene monotony of images labeled in a static scene and achieve diversity in the labeled images, the method controls, by program and on top of the static scene, the positions, orientations, and presence or absence of participants such as trees, buildings, and pedestrians, so that different pictures can be exported in batches. Meanwhile, the weather setting in the scene can be modified so that different weather effects are reflected during labeling, making the labeled image set richer and closer to reality.
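The program-controlled diversification described above can be sketched as randomizing the presence, position, and weather of scene participants on top of a static base scene. All names, probabilities, and ranges here are illustrative assumptions, not the patent's implementation.

```python
import random

def diversify_scene(base_elements, weathers, seed=None):
    """Produce one scene variant: drop some elements, jitter positions, and pick a
    weather (which in the engine would drive lighting and particle effects)."""
    rng = random.Random(seed)
    elements = []
    for name in base_elements:
        if rng.random() < 0.8:  # element present in this variant
            elements.append({"name": name,
                             "offset_m": rng.uniform(-2.0, 2.0)})  # position jitter
    return {"elements": elements, "weather": rng.choice(weathers)}

# Each call with a different seed yields a different scene variant for export.
variant = diversify_scene(["tree", "building", "pedestrian"],
                          ["sunny", "rain", "fog"], seed=42)
```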
In another aspect, the invention further provides a target labeling device based on a 3D virtual driving scene, which labels the virtual scene elements of the 3D virtual driving scene by the target labeling method described above. The target labeling device comprises:
a scene building module for building a 3D virtual driving scene, wherein the 3D virtual driving scene comprises a virtual road surface and first virtual scene elements arranged at a plurality of preset positions along a virtual path formed by the virtual road surface;
a model editing module for creating a minimum bounding rectangle tangent to the front surface of the first virtual scene element and defining the spatial three-dimensional coordinates P_in of the four vertices of the minimum bounding rectangle as the four slot points of the first virtual scene element, the spatial three-dimensional coordinate P_in being (X_in, Y_in, Z_in), wherein i is an integer and n = 1, 2, 3, 4;
a position determining module for performing, in the 3D virtual driving scene, virtual driving shooting by a virtual vehicle equipped with a virtual camera along a virtual driving route, and determining the first position L_i1 at which each first virtual scene element enters the shooting picture of the virtual camera and the second position L_i2 at which it leaves the shooting picture of the virtual camera;
a labeling configuration module for setting a preset movement rate of the virtual vehicle on the virtual driving route and calculating, based on the preset movement rate, the first time node T_i1 at which the virtual vehicle passes the first position L_i1 and the second time node T_i2 at which it passes the second position L_i2, so as to obtain a labeling configuration table [T_i1, T_i2, P_i1, P_i2, P_i3, P_i4] for each first virtual scene element;
a picture labeling module for performing, in the 3D virtual driving scene, virtual driving shooting by the virtual camera along the virtual driving route at the preset movement rate, defining the background pictures shot within the interval between the first time node T_i1 and the second time node T_i2 as key background pictures, labeling the first virtual scene element within the rectangle formed by the four slot points corresponding to the spatial three-dimensional coordinates P_in in each key background picture to obtain labeling information, and storing the key background pictures and the labeling information;
the scene building module, the model editing module, the position determining module, the labeling configuration module and the picture labeling module are sequentially in bidirectional communication connection.
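The computation the labeling configuration module performs can be sketched in a few lines: since the vehicle moves at a constant preset rate v along the route, the time node at which it passes a position is simply that position's route distance divided by v, and the configuration table row is [T_i1, T_i2, P_i1, P_i2, P_i3, P_i4]. This is an illustrative sketch, not the patent's code; the data layout and function names are assumptions.

```python
def time_node(route_distance_m, preset_rate_mps):
    """Time (s) at which the virtual vehicle reaches a given route distance."""
    return route_distance_m / preset_rate_mps

def build_config_table(elements, preset_rate_mps):
    """elements: dicts with route distances of L_i1/L_i2 and slot points P_i1..P_i4."""
    table = []
    for e in elements:
        t1 = time_node(e["dist_L1"], preset_rate_mps)  # T_i1: element enters the picture
        t2 = time_node(e["dist_L2"], preset_rate_mps)  # T_i2: element leaves the picture
        table.append([t1, t2] + e["slots"])            # [T_i1, T_i2, P_i1, P_i2, P_i3, P_i4]
    return table

# One traffic sign whose L_i1 / L_i2 lie 30 m and 50 m along the route;
# the four slot points are the 3D vertices of its minimum bounding rectangle.
signs = [{"dist_L1": 30.0, "dist_L2": 50.0,
          "slots": [(1.0, 2.0, 30.0), (2.0, 2.0, 30.0),
                    (2.0, 3.0, 30.0), (1.0, 3.0, 30.0)]}]
table = build_config_table(signs, preset_rate_mps=10.0)
print(table[0][0], table[0][1])  # 3.0 5.0
```

During rendering, every frame whose timestamp falls in [T_i1, T_i2] is then a key background picture for element i, and its slot points are projected to view coordinates to form the label.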
Preferably, the target labeling device further comprises a coordinate correction module for correcting the view coordinates S_in of the four slot points of the first virtual scene element.
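A minimal sketch of the two-stage slot-point correction described for the coordinate correction module (and detailed in claims 3 and 4): the four projected view coordinates are first replaced by the axis-aligned rectangle of their min/max components, and any corner falling outside the key background picture is then clamped to the picture boundary. Function names and the pixel coordinate convention (origin at the top-left, x right, y down) are assumptions.

```python
def correct_slot_points(points, width, height):
    """points: four (x, y) view coordinates; returns the 4 corrected corners."""
    xs = [p[0] for p in points]
    ys = [p[1] for p in points]
    x_min, x_max = min(xs), max(xs)
    y_min, y_max = min(ys), max(ys)
    # Axis-aligned corners S'_i1..S'_i4 from the min/max components
    corners = [(x_min, y_min), (x_max, y_min), (x_max, y_max), (x_min, y_max)]
    # Clamp corners that extend beyond the key background picture boundary
    clamp = lambda v, lo, hi: max(lo, min(v, hi))
    return [(clamp(x, 0, width - 1), clamp(y, 0, height - 1)) for x, y in corners]

# A sign partially out of frame on the left: the two left corners are clamped to x = 0.
pts = [(-10, 40), (120, 35), (118, 200), (-8, 205)]
print(correct_slot_points(pts, width=3200, height=1800))
```

Clamping to the boundary is what keeps the labeled rectangle valid for elements that are only partly visible at the edge of the picture.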
From the above description, it can be seen that the above-described embodiments of the present invention achieve the following technical effects:
(1) compared with the traditional approach of collecting a large number of road condition images with a real vehicle and labeling them manually one by one, the invention builds a 3D virtual driving scene and performs virtual vehicle-mounted camera shooting within it, thereby realizing batch labeling and batch export of images with high efficiency; the export speed can reach 100 images per minute;
(2) the virtual scene element configuration in the 3D virtual driving scene can be conveniently set through a program, so that the labeled image set is richer and closer to reality;
(3) the data labeling precision is high: by labeling traffic element targets with the minimum circumscribed rectangle, the coordinate error is kept within 1 pixel;
(4) the exported pictures have high resolution: each picture is 3200 × 1800 pixels with a size of about 10 MB.
The foregoing is considered as illustrative of the preferred embodiments of the invention and the technical principles employed. It will be understood by those skilled in the art that the present invention is not limited to the particular embodiments illustrated herein, but is capable of various obvious changes, rearrangements and substitutions as will now become apparent to those skilled in the art without departing from the scope of the invention. Therefore, although the present invention has been described in greater detail by the above embodiments, the present invention is not limited to the above embodiments, and may include other equivalent embodiments without departing from the spirit of the present invention, and the scope of the present invention is determined by the scope of the appended claims.

Claims (8)

1. A target labeling method based on a 3D virtual driving scene is characterized by comprising the following steps,
step 1: building a 3D virtual driving scene, wherein the 3D virtual driving scene comprises a virtual road surface and first virtual scene elements arranged at a plurality of preset positions along a virtual path formed by the virtual road surface;
Step 2: creating a minimum bounding rectangle tangent to the front surface of the first virtual scene element and defining the spatial three-dimensional coordinates P_in of the four vertices of the minimum bounding rectangle as the four slot points of the first virtual scene element, the spatial three-dimensional coordinate P_in being (X_in, Y_in, Z_in), wherein i is an integer and n = 1, 2, 3, 4;
Step 3: in the 3D virtual driving scene, performing virtual driving shooting by a virtual vehicle equipped with a virtual camera along a virtual driving route, and determining the first position L_i1 at which each first virtual scene element enters the shooting picture of the virtual camera and the second position L_i2 at which it leaves the shooting picture of the virtual camera;
Step 4: setting a preset movement rate of the virtual vehicle on the virtual driving route, and calculating, based on the preset movement rate, the first time node T_i1 at which the virtual vehicle passes the first position L_i1 and the second time node T_i2 at which it passes the second position L_i2, so as to obtain a labeling configuration table [T_i1, T_i2, P_i1, P_i2, P_i3, P_i4] for each first virtual scene element;
Step 5: in the 3D virtual driving scene, performing virtual driving shooting by the virtual camera along the virtual driving route at the preset movement rate, defining the background pictures shot within the interval between the first time node T_i1 and the second time node T_i2 as key background pictures, labeling the first virtual scene element within the rectangle formed by the four slot points corresponding to the spatial three-dimensional coordinates P_in in each key background picture to obtain labeling information, and storing the key background pictures and the labeling information.
2. The target labeling method based on a 3D virtual driving scene as claimed in claim 1, characterized in that in step 5, labeling the first virtual scene element within the rectangle formed by the four slot points corresponding to the spatial three-dimensional coordinates P_in in the key background picture to obtain labeling information specifically comprises the following steps:
Step 5-1: converting the spatial three-dimensional coordinates P_in of the four slot points in the 3D virtual driving scene into view coordinates S_in in the key background picture through coordinate transformation, the view coordinate S_in being (x_in, y_in);
Step 5-2: setting the labeling information format of the key background picture as [x_i1, y_i1, x_i2, y_i2, x_i3, y_i3, x_i4, y_i4, type text description of the first virtual scene element].
3. The target labeling method based on a 3D virtual driving scene as claimed in claim 2, characterized in that after step 5-1, the view coordinates S_in of the four slot points of the first virtual scene element are further corrected, specifically comprising the following step:
Step 5-1-1: comparing the view coordinates S_in of the four slot points to obtain Min(x_in), Min(y_in), Max(x_in), and Max(y_in), respectively, so that the corrected view coordinates of the four slot points are S'_i1(x_imin, y_imin), S'_i2(x_imax, y_imin), S'_i3(x_imax, y_imax), and S'_i4(x_imin, y_imax).
4. The target labeling method based on a 3D virtual driving scene as claimed in claim 3, further comprising, after step 5-1-1: if the corrected view coordinates S'_i1(x_imin, y_imin), S'_i2(x_imax, y_imin), S'_i3(x_imax, y_imax), S'_i4(x_imin, y_imax) of the four slot points extend beyond the boundary of the key background picture, correcting the coordinates of the slot points beyond the boundary to the coordinate values of the intersection points of the minimum circumscribed rectangle and the boundary of the key background picture.
5. The target labeling method based on a 3D virtual driving scene as claimed in any one of claims 1 to 4, wherein the first virtual scene element comprises a plurality of models of different categories, and the spatial arrangement orientation of each category model is set according to the diversification requirements of the 3D virtual driving scene.
6. The target labeling method based on the 3D virtual driving scene as claimed in claim 5, wherein the 3D virtual driving scene further comprises a plurality of other virtual scene elements, and the configuration relationship of the other virtual scene elements with respect to the first virtual scene element is set through a program.
7. A target labeling device based on a 3D virtual driving scene is characterized in that batch labeling of virtual scene elements of the 3D virtual driving scene is realized by adopting the target labeling method of any one of claims 1 to 6, the target labeling device comprises,
a scene building module for building a 3D virtual driving scene, wherein the 3D virtual driving scene comprises a virtual road surface and first virtual scene elements arranged at a plurality of preset positions along a virtual path formed by the virtual road surface;
a model editing module for creating a minimum bounding rectangle tangent to the front surface of the first virtual scene element and defining the spatial three-dimensional coordinates P_in of the four vertices of the minimum bounding rectangle as the four slot points of the first virtual scene element, the spatial three-dimensional coordinate P_in being (X_in, Y_in, Z_in), wherein i is an integer and n = 1, 2, 3, 4;
a position determining module for performing, in the 3D virtual driving scene, virtual driving shooting by a virtual vehicle equipped with a virtual camera along a virtual driving route, and determining the first position L_i1 at which each first virtual scene element enters the shooting picture of the virtual camera and the second position L_i2 at which it leaves the shooting picture of the virtual camera;
a labeling configuration module for setting a preset movement rate of the virtual vehicle on the virtual driving route and calculating, based on the preset movement rate, the first time node T_i1 at which the virtual vehicle passes the first position L_i1 and the second time node T_i2 at which it passes the second position L_i2, so as to obtain a labeling configuration table [T_i1, T_i2, P_i1, P_i2, P_i3, P_i4] for each first virtual scene element;
a picture labeling module for performing, in the 3D virtual driving scene, virtual driving shooting by the virtual camera along the virtual driving route at the preset movement rate, defining the background pictures shot within the interval between the first time node T_i1 and the second time node T_i2 as key background pictures, labeling the first virtual scene element within the rectangle formed by the four slot points corresponding to the spatial three-dimensional coordinates P_in in each key background picture to obtain labeling information, and storing the key background pictures and the labeling information;
the scene building module, the model editing module, the position determining module, the labeling configuration module and the picture labeling module are sequentially in bidirectional communication connection.
8. The target labeling device based on a 3D virtual driving scene as claimed in claim 7, further comprising a coordinate correction module for correcting the view coordinates S_in of the four slot points of the first virtual scene element.
CN202010695891.2A 2020-07-20 2020-07-20 Target labeling method and device based on 3D virtual driving scene Active CN111915672B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202010695891.2A CN111915672B (en) 2020-07-20 2020-07-20 Target labeling method and device based on 3D virtual driving scene

Publications (2)

Publication Number Publication Date
CN111915672A CN111915672A (en) 2020-11-10
CN111915672B true CN111915672B (en) 2022-08-16

Family

ID=73281134

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202010695891.2A Active CN111915672B (en) 2020-07-20 2020-07-20 Target labeling method and device based on 3D virtual driving scene

Country Status (1)

Country Link
CN (1) CN111915672B (en)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112819969A (en) * 2021-02-08 2021-05-18 广东三维家信息科技有限公司 Virtual scene path generation method and device, electronic equipment and storage medium

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105739704A (en) * 2016-02-02 2016-07-06 上海尚镜信息科技有限公司 Remote guidance method and system based on augmented reality
CN108932749A (en) * 2017-05-26 2018-12-04 上海为森车载传感技术有限公司 The method for generating object sample based on CG
CN111160261A (en) * 2019-12-30 2020-05-15 北京每日优鲜电子商务有限公司 Sample image labeling method and device for automatic sales counter and storage medium
CN111274927A (en) * 2020-01-17 2020-06-12 北京三快在线科技有限公司 Training data generation method and device, electronic equipment and storage medium

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US10268201B2 (en) * 2017-02-28 2019-04-23 Mitsubishi Electric Research Laboratories, Inc. Vehicle automated parking system and method

Non-Patent Citations (3)

* Cited by examiner, † Cited by third party
Title
PedX: Benchmark Dataset for Metric 3D Pose Estimation of Pedestrians in Complex Urban Intersections; Wonhui Kim et al.; arXiv:1809.03605v1 [cs.CV]; 2018-09-10; pp. 1-8 *
Synthesizing Image and Automated Annotation Tool for CNN based Under Water Object Detection; MyungHwan Jeon et al.; Journal of Korea Robotics Society; 2019-12-31; Vol. 14, No. 2; pp. 139-149 *
Detection of Distant Small Road-Surface Obstacles Based on the Road Vanishing Point; Yu Junwei et al.; Journal of Tongji University (Natural Science); 2019-12-31; Vol. 47; pp. 213-216 *

Similar Documents

Publication Publication Date Title
Geyer et al. A2d2: Audi autonomous driving dataset
US20150138310A1 (en) Automatic scene parsing
CN114413881B (en) Construction method, device and storage medium of high-precision vector map
CN102222341A (en) Method and device for detecting motion characteristic point and method and device for detecting motion target
CN113128507B (en) License plate recognition method and device, electronic equipment and storage medium
CN109544635B (en) Camera automatic calibration method based on enumeration heuristic
Ma et al. Crlf: Automatic calibration and refinement based on line feature for lidar and camera in road scenes
CN114821507A (en) Multi-sensor fusion vehicle-road cooperative sensing method for automatic driving
CN110009675A (en) Generate method, apparatus, medium and the equipment of disparity map
CN113763569A (en) Image annotation method and device used in three-dimensional simulation and electronic equipment
CN116524364B (en) Heterogeneous multi-source data processing system and method based on feature recognition
CN112257668A (en) Main and auxiliary road judging method and device, electronic equipment and storage medium
CN111915672B (en) Target labeling method and device based on 3D virtual driving scene
CN114972767A (en) Vehicle track and course angle extraction method based on high-altitude unmanned aerial vehicle video
CN114898353A (en) License plate identification method based on video sequence image characteristics and information
CN110415299B (en) Vehicle position estimation method based on set guideboard under motion constraint
CN112115737B (en) Vehicle orientation determining method and device and vehicle-mounted terminal
CN114820931B (en) Virtual reality-based CIM (common information model) visual real-time imaging method for smart city
CN115565155A (en) Training method of neural network model, generation method of vehicle view and vehicle
CN112101268B (en) Vehicle line pressing detection method based on geometric projection
Babahajiani et al. Semantic parsing of street scene images using 3d lidar point cloud
CN114648549A (en) Traffic scene target detection and positioning method fusing vision and laser radar
Du et al. Validation of vehicle detection and distance measurement method using virtual vehicle approach
CN106856013A (en) The method and system that a kind of augmented reality identification figure off card shows
CN113269088A (en) Scene description information determining method and device based on scene feature extraction

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant