CN113989627B

CN113989627B - Urban prevention and control image detection method and system based on asynchronous federated learning

Info

Publication number: CN113989627B
Application number: CN202111633287.8A
Authority: CN
Inventors: 袁戟; 常可欣
Original assignee: Shenzhen Wanwuyun Technology Co ltd
Current assignee: Shenzhen Wanwuyun Technology Co ltd
Priority date: 2021-12-29
Filing date: 2021-12-29
Publication date: 2022-05-27
Anticipated expiration: 2041-12-29
Also published as: CN113989627A

Abstract

The invention discloses an urban prevention and control image detection method based on asynchronous federated learning, which comprises the following steps: a cloud server initializes a global model; each end device performs scene labeling on urban prevention and control image data, divides the data into a training set and a test set, and screens out from the training set a data set whose distribution is close to that of the test set samples as a to-be-trained set; each end device acquires the global model from the cloud server and performs local training on the to-be-trained set to obtain a local model; each end device uploads the local model to the cloud server after homomorphic encryption; and the cloud server performs global model training on the local models by adopting an asynchronous federated computing strategy to obtain an updated global model.

Description

Urban prevention and control image detection method and system based on asynchronous federated learning

Technical Field

The invention relates to the technical field of image recognition, in particular to an urban prevention and control image detection method and system based on asynchronous federated learning.

Background

At present, image recognition scenes for urban prevention and control mainly include road surface construction damage, illegal vehicle parking, floating objects on river surfaces, garbage dumping and the like. Since these application scenes are widely distributed across a city, collaboration between different companies is required. For example, company A acquires image data with a camera, while company B acquires images with a camera mounted on an unmanned aerial vehicle. Because the pictures acquired by different companies cannot be shared directly for reasons of data security, privacy protection and the like, a federated training network needs to be established: data features are extracted locally at each company, the data is pushed to the cloud for global training using homomorphic encryption, and the trained network parameters are transmitted back to company A and company B respectively and decrypted for subsequent model application.

Because different companies update images and train models at different frequencies, globally synchronous federated computing consumes considerable manpower and material resources for the unified management of edge-end equipment and is inefficient; moreover, the angles of images acquired by unmanned aerial vehicles, fixed cameras and the like differ significantly, which makes training inaccurate.

Disclosure of Invention

The invention aims to provide an urban prevention and control image detection method and system based on asynchronous federated learning, so as to solve the problems in the prior art that a globally synchronous federated computing strategy consumes considerable manpower and material resources for the unified management of edge-end equipment and is inefficient, and that differences in image angles and the like make training inaccurate.

In a first aspect, an embodiment of the present invention provides an urban prevention and control image detection method based on asynchronous federated learning, including:

S101, a cloud server initializes a global model;

S102, each end device performs scene labeling on urban prevention and control image data, divides the data into a training set and a test set, and screens out from the training set a data set whose distribution is close to that of the test set samples as a to-be-trained set;

S103, each end device acquires the global model from the cloud server and performs local training on the to-be-trained set to obtain a local model;

S104, each end device uploads the local model to the cloud server after homomorphic encryption;

S105, the cloud server performs global model training on the local models by adopting an asynchronous federated computing strategy to obtain an updated global model;

S106, returning to step S102 until the global model on the cloud server reaches the expected performance.

In a second aspect, an embodiment of the present invention provides an urban prevention and control image detection system based on asynchronous federated learning, including a cloud server and end devices;

the cloud server is used for initializing a global model, and for performing global model training on the local models by adopting an asynchronous federated computing strategy to obtain an updated global model;

the end devices are used for performing scene labeling on the urban prevention and control image data, dividing the data into a training set and a test set, and screening out from the training set a data set whose distribution is close to that of the test set samples as a to-be-trained set; acquiring the global model from the cloud server and performing local training on the to-be-trained set to obtain a local model; and uploading the local model to the cloud server after homomorphic encryption.

According to the embodiment of the invention, confining the urban prevention and control image data to each end device for labeling and training guarantees security. Through the asynchronous federated computing strategy, the cloud server calculates weights for the local models trained by the end devices and selects the local models with larger weights for global training, so that the local models participating in training are more closely related. Compared with a synchronous federated computing strategy, the cloud server does not need to wait until all end devices have trained their local models for the current round before performing global training, so efficiency is higher, and end devices that are not in use can perform other work instead of remaining occupied;

meanwhile, each end device screens the training set in its urban prevention and control image data to obtain a to-be-trained set whose data distribution is close to that of the test set, which improves the training accuracy of the model.

Drawings

In order to illustrate the technical solutions of the embodiments of the present invention more clearly, the drawings used in the description of the embodiments are briefly introduced below. The drawings in the following description show some embodiments of the present invention, and those skilled in the art can obtain other drawings based on them without creative effort.

Fig. 1 is a schematic flow chart of an urban prevention and control image detection method based on asynchronous federated learning according to an embodiment of the present invention;

Fig. 2 is a system block diagram of an urban prevention and control image detection system based on asynchronous federated learning according to an embodiment of the present invention.

Detailed Description

The technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the drawings in the embodiments of the present invention, and it is obvious that the described embodiments are some, not all, embodiments of the present invention. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present invention.

It will be understood that the terms "comprises" and/or "comprising," when used in this specification and the appended claims, specify the presence of stated features, integers, steps, operations, elements, and/or components, but do not preclude the presence or addition of one or more other features, integers, steps, operations, elements, components, and/or groups thereof.

It is also to be understood that the terminology used in the description of the invention herein is for the purpose of describing particular embodiments only and is not intended to be limiting of the invention. As used in the specification of the present invention and the appended claims, the singular forms "a," "an," and "the" are intended to include the plural forms as well, unless the context clearly indicates otherwise.

It should be further understood that the term "and/or" as used in this specification and the appended claims refers to and includes any and all possible combinations of one or more of the associated listed items.

Referring to Fig. 1, an urban prevention and control image detection method based on asynchronous federated learning includes:

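S101, a cloud server initializes a global model;

S102, each end device performs scene labeling on urban prevention and control image data, divides the data into a training set and a test set, and screens out from the training set a data set whose distribution is close to that of the test set samples as a to-be-trained set;

S103, each end device acquires the global model from the cloud server and performs local training on the to-be-trained set to obtain a local model;

S104, each end device uploads the local model to the cloud server after homomorphic encryption;

S105, the cloud server performs global model training on the local models by adopting an asynchronous federated computing strategy to obtain an updated global model;

S106, returning to step S102 until the global model on the cloud server reaches the expected performance.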

In this embodiment, confining the urban prevention and control image data to each end device for labeling and training guarantees security. Through the asynchronous federated computing strategy, the cloud server calculates weights for the local models trained by the end devices and selects the local models with larger weights for global training, so that the local models participating in training are more closely related. Compared with a synchronous federated computing strategy, the cloud server can perform global training without waiting for all end devices to train their local models for the current round, so efficiency is higher, and end devices that are not in use can perform other work instead of remaining occupied.

The types of urban prevention and control images include road surface construction damage, illegal vehicle parking, floating objects on river surfaces, garbage dumping and the like.

The end devices may be processors distributed in different companies or in the same company; they may be distributed in different corners of the same city or centralized in one machine room.

Because each end device obtains a large number of images through its image acquisition terminals, each end device needs to label the images after obtaining them, execute step S102, and execute steps S102-S105 cyclically. The interval between step S105 and the next start of step S102 may be a preset time, or may last until a certain preset condition is reached. In this way, the whole sequence of steps need not be run every time a single image is obtained, which would waste the running resources of the end device unnecessarily; labeling and local training are performed only after enough urban prevention and control image data has been obtained, and this may be set according to the actual situation.

In an embodiment, the screening out from the training set a data set whose distribution is close to that of the test set samples as a to-be-trained set includes:

calculating the data distribution […] of the test set;

calculating the KL divergence between the feature distributions of the training set and the test set by adopting the following formula:

[formula omitted];

judging, according to the KL divergence, whether the sample distribution of the training set is close to that of the test set;

if yes, the training set does not need to be sampled;

and if not, sampling the training set to obtain the to-be-trained set;

wherein […] represents the normalized features corresponding to the partially sampled data in the k-th end device, […] represents the data distribution of the training set, and […] represents the normalized data features collected by an end device.

In this embodiment, the KL divergence is used to measure the distance between the two distributions;

wherein the data distribution […] is calculated from the test set and can be determined in the calculation as a probability density function (pdf), for example the kernel density estimate displayed by kdeplot (a function of the seaborn library, which is built on matplotlib).

Whether the sample distribution of the training set is close to that of the test set is judged from the KL divergence mainly by setting a threshold: if the KL divergence is smaller than the threshold, the two distributions are judged to be close enough; otherwise, the training set is judged to require sampling, and samples that are close enough are obtained and used as the to-be-trained set. The threshold is not a fixed value and needs to be adjusted according to the actual shape of the distributions and the training result on the test set.
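The threshold test described above can be sketched as follows. This is a minimal illustration, not the patented computation: it assumes scalar features normalized to [0, 1], estimates both distributions with equal-width histograms instead of a kernel density estimate, and uses the textbook discrete definition KL(P‖Q) = Σ P(x) log(P(x)/Q(x)) with the test set as P. The direction of the divergence, the bin count and all function names (`histogram`, `kl_divergence`, `needs_sampling`) are assumptions made for the example.

```python
import math


def histogram(values, bins, lo, hi):
    """Normalized histogram of `values` over `bins` equal-width bins on [lo, hi]."""
    counts = [0] * bins
    width = (hi - lo) / bins
    for v in values:
        # Clamp the index so that out-of-range values land in the edge bins.
        idx = min(int((v - lo) / width), bins - 1)
        counts[max(idx, 0)] += 1
    total = len(values)
    return [c / total for c in counts]


def kl_divergence(p, q, eps=1e-9):
    """Discrete KL(p || q); eps smoothing avoids log(0) and division by zero."""
    return sum(
        pi * math.log((pi + eps) / (qi + eps))
        for pi, qi in zip(p, q)
        if pi > 0
    )


def needs_sampling(train_feats, test_feats, threshold, bins=10, lo=0.0, hi=1.0):
    """Return True when the training set is NOT close enough to the test set.

    Assumption: KL(test || train) is compared against a tunable threshold.
    """
    p_test = histogram(test_feats, bins, lo, hi)
    q_train = histogram(train_feats, bins, lo, hi)
    return kl_divergence(p_test, q_train) >= threshold
```

A call such as `needs_sampling(train_feats, test_feats, threshold=0.05)` returns `False` when the two histograms coincide, in which case the training set would be used as is; the value 0.05 is only a placeholder, since the threshold has to be tuned as described above.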

Specifically, the samples in the training set are screened and sampled by the Metropolis algorithm.

In an embodiment, the sampling the training set to obtain the to-be-trained set includes:

randomly sampling an image data feature […] from the data distribution […] of the training set;

sampling according to the proposal distribution […] to obtain an image data feature […];

calculating the first intermediate value […] using the following formula:

[formula omitted];

sampling from the uniform distribution […] to obtain a second intermediate value u;

judging whether the first intermediate value is greater than or equal to the second intermediate value;

if so, accepting the image data feature […] into the to-be-trained set, i.e. […];

if not, accepting the image data feature […] into the to-be-trained set, i.e. […];

repeating the above operations to screen all image data features […], and obtaining, after T rounds of iteration, the set of image data features obeying the data distribution […];

carrying out reverse iteration on the image data feature set to obtain the to-be-trained set;

wherein […] represents the data distribution of […], and […] represents the data distribution of […].
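The accept/reject loop above follows the shape of a textbook Metropolis sampler, which can be sketched as below. Since the formulas are not reproduced in this text, the sketch relies on several assumptions: features are scalars, the proposal is a symmetric Gaussian random walk (so the proposal terms cancel in the acceptance ratio), the target density is supplied by the caller, and on rejection the chain keeps its current feature, as in the standard algorithm. The names `metropolis`, `target_pdf` and `proposal_std` are hypothetical.

```python
import math
import random


def metropolis(target_pdf, x0, steps, proposal_std=1.0, seed=0):
    """Draw `steps` samples whose distribution approaches `target_pdf`.

    target_pdf: unnormalized density, must be positive at x0.
    """
    rng = random.Random(seed)
    x = x0
    chain = []
    for _ in range(steps):
        # Candidate drawn from a symmetric proposal centred on the current state.
        candidate = rng.gauss(x, proposal_std)
        # First intermediate value: acceptance probability of the candidate.
        alpha = min(1.0, target_pdf(candidate) / target_pdf(x))
        # Second intermediate value: uniform draw on [0, 1).
        u = rng.random()
        if alpha >= u:
            x = candidate  # accept the candidate
        # Otherwise keep the current state (textbook rejection behaviour).
        chain.append(x)
    return chain


def gaussian_target(x, mean=2.0):
    """Example target: unnormalized unit-variance Gaussian centred on `mean`."""
    return math.exp(-0.5 * (x - mean) ** 2)
```

For instance, `metropolis(gaussian_target, x0=0.0, steps=20000)` yields a chain whose later samples concentrate around the target mean. In the setting of this document the target would be the test set distribution, so that the retained features resemble the test samples.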

In this embodiment, after the samples […] are obtained, because […] are generated samples, a reverse iteration is required to find, in […], the sample […] that best matches […];

it is calculated using the following formula:

[formula omitted];

wherein […] represents the […] norm.
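One plausible reading of this matching step is a nearest-neighbour search: each generated feature is replaced by the real training feature closest to it under a norm. The sketch below assumes the Euclidean norm and feature vectors stored as tuples; the norm actually used is not recoverable from this text, and the function names are hypothetical.

```python
import math


def nearest_training_sample(generated, training_set):
    """Return the training feature vector closest to `generated` (Euclidean norm)."""
    return min(training_set, key=lambda x: math.dist(x, generated))


def match_to_training_set(generated_samples, training_set):
    """Map every generated feature vector back to its best-matching real sample."""
    return [nearest_training_sample(g, training_set) for g in generated_samples]
```

With `training_set = [(0.0, 0.0), (1.0, 1.0), (5.0, 5.0)]`, the generated feature `(0.9, 1.2)` is mapped to `(1.0, 1.0)`. Mapping back to real samples matters because local training needs labeled images, not synthetic feature vectors.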

In an embodiment, the obtaining the global model from the cloud server, and performing local training on the to-be-trained set to obtain a local model includes:

each end device adopts a Sparse R-CNN model to carry out local training on the labeled urban prevention and control image data.

In this embodiment, whereas the prior art adopts the single-stage YOLO model, the Sparse R-CNN model is adopted because it has higher precision than the single-stage YOLO model.

In an embodiment, the training the local model by using an asynchronous federated calculation strategy to obtain an updated global model includes:

the cloud server trains the local model by adopting a Sparse R-CNN model to obtain an updated global model.

In an embodiment, the training of the local model by using an asynchronous federated computing strategy to obtain an updated global model (step S105) includes:

S201, the cloud server counts the number of local models participating in the global model training, and judges whether each local model has participated in past training within a specified time window;

S202, if not, the local model does not join the global model training;

S203, if so, a corresponding weight is assigned to the local model, and the local model joins the global model training.

In this embodiment, each local model is checked to determine whether it has participated in past training within the specified time window, that is, whether the global training round in which the local model was trained differs from the current global training round by no more than a specific number of rounds. Within the specified time window, the local models are more closely related, so the trained global model is more accurate.

If so, a corresponding weight is assigned to the local model: the closer the round of the local model is to the current global training round, the larger the weight.

The specified time window may be measured in rounds of global training.

In one embodiment, the determining whether each of the local models has participated in past training within a specified time window comprises:

determining whether each local model has participated in past training within a specified time window using the following formula:

[formula omitted];

wherein s is the number of rounds of the current global model update, […] is the number of rounds in which a local model has taken part in training, and […] is the time window, a hyperparameter that sets how many versions of local models the global model training can accommodate.

In this embodiment, generally, the number of rounds in which the local model participates in training is equal to the number of rounds updated by the global model in the previous round.

In an embodiment, the assigning, if so, a corresponding weight to the local model and joining the global model training includes:

S301, assigning a corresponding weight to the local model by adopting the following formula:

[formula omitted];

S302, calculating the influence weight of each local model in the s-th round on the global model by adopting the following formula:

[formula omitted];

wherein […] is the amount of data of a local model participating in the update in the same round, […] is the total amount of data of all local models participating in the update in the same round, […] is the weight value of the local model for the s-th round, […] is the weight value of the global model for the s-th round, […] is the set of local model indices for the s-th round, […] is the learning rate, and […] is the hyperparameter that determines the decay rate of the weights.

In this embodiment, if […] holds, the local model can participate in the global model training, and a suitable weight value calculated by the formula sets the degree to which the local model participates in training, so as to achieve higher precision.
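The window test, the weight formula and the global update rule are not reproduced in this text, so the sketch below only illustrates the general idea under stated assumptions: a local model is admitted when its round lags the current round by at most `window` rounds, its staleness weight decays exponentially with that lag, each admitted model is further scaled by its share of the data, and the global parameters move toward the weighted average at learning rate `lr`. The exponential form, the mixing rule and every name in the code are assumptions, not the patented formulas.

```python
import math
from dataclasses import dataclass
from typing import List


@dataclass
class LocalUpdate:
    weights: List[float]  # flattened local model parameters
    trained_round: int    # global round the local model was trained from
    num_samples: int      # amount of local data used for training


def in_window(current_round, trained_round, window):
    """Assumed window test: the lag must lie between 0 and `window` rounds."""
    return 0 <= current_round - trained_round <= window


def staleness_weight(current_round, trained_round, decay):
    """Assumed weight: exponential decay in the number of rounds of lag."""
    return math.exp(-decay * (current_round - trained_round))


def aggregate(global_weights, updates, current_round, window, decay, lr):
    """Blend admitted local models into the global model."""
    accepted = [u for u in updates
                if in_window(current_round, u.trained_round, window)]
    if not accepted:
        return list(global_weights)  # nothing fresh enough: keep the model
    # Combine data share and staleness weight into one coefficient per model.
    coeffs = [u.num_samples * staleness_weight(current_round, u.trained_round, decay)
              for u in accepted]
    total = sum(coeffs)
    merged = [sum(c * u.weights[i] for c, u in zip(coeffs, accepted)) / total
              for i in range(len(global_weights))]
    # Move the global parameters toward the weighted average.
    return [(1 - lr) * g + lr * m for g, m in zip(global_weights, merged)]
```

In this sketch a model that lags by more than `window` rounds is simply ignored for the current update, which mirrors the rule that such a model does not join the global model training, while a larger `decay` makes the server favour the most recent local models more strongly.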

In an embodiment, after the assigning, if so, a corresponding weight to the local model and joining the global model training, the method includes:

S401, after the updated global model is obtained through training, the cloud server sends the global model to each end device;

S402, returning to step S102 until the global model on the cloud server reaches the expected performance.

In this embodiment, the updated global model is sent back to each end device and the local model on the end device is updated, which ensures that the local model uploaded to the cloud server each time has been retrained and improves the prediction accuracy.

Referring to Fig. 2, an urban prevention and control image detection system based on asynchronous federated learning includes a cloud server and end devices;

In an embodiment, the end device is further configured to locally train the labeled city prevention and control image data using a Sparse R-CNN model.

In an embodiment, the cloud server is configured to train the local model by using a Sparse R-CNN model to obtain an updated global model.

In an embodiment, the cloud server is configured to count the number of local models participating in the global model training, and determine whether each local model has participated in past training within a specified time window;

if not, the local model does not join the global model training;

if so, a corresponding weight is assigned to the local model, and the local model joins the global model training.

In one embodiment, the cloud server is configured to determine whether each local model has participated in past training within a specified time window using the following formula:

[formula omitted];

wherein s is the number of rounds of the current global model update, […] is the number of rounds in which a local model has taken part in training, and […] is the time window, a hyperparameter that sets how many versions of local models the global model training can accommodate.

In an embodiment, the cloud server is configured to assign a corresponding weight to the local model by using the following formula:

[formula omitted];

and to calculate the influence weight of each local model on the global model in the s-th round by adopting the following formula:

[formula omitted];

wherein […] is the weight value of the local model for the s-th round, […] is the weight value of the global model for the s-th round, […] is the set of local model indices for the s-th round, […] is the learning rate, and […] is the hyperparameter that determines the decay rate of the weights.

While the invention has been described with reference to specific embodiments, the invention is not limited thereto, and various equivalent modifications and substitutions can be easily made by those skilled in the art within the technical scope of the invention. Therefore, the protection scope of the present invention shall be subject to the protection scope of the claims.

Claims

1. An urban prevention and control image detection method based on asynchronous federated learning, characterized by comprising the following steps:

S101, a cloud server initializes a global model;

S106, returning to step S102 until the global model on the cloud server reaches the expected performance;

in step S102, the screening out from the training set a data set whose distribution is close to that of the test set samples as a to-be-trained set includes:

calculating a data distribution […] of the test set;

[formula omitted];

judging, according to the KL divergence, whether the sample distribution of the training set is close to that of the test set;

if yes, the training set does not need to be sampled;

if not, sampling the training set to obtain the to-be-trained set;

wherein […] represents the normalized data features corresponding to the partially sampled data in the k-th end device, […] represents the data distribution of the training set, and […] represents the normalized data features collected by an end device;

wherein the sampling the training set to obtain the to-be-trained set includes:

randomly sampling an image data feature […] from the data distribution […] of the training set;

sampling according to the proposal distribution […] to obtain an image data feature […];

calculating the first intermediate value […] using the following formula:

[formula omitted];

sampling from the uniform distribution […] to obtain a second intermediate value u;

if so, accepting the image data feature […] into the to-be-trained set, i.e. […];

if not, accepting the image data feature […] into the to-be-trained set, i.e. […];

repeating the above operations on all image data features […] to obtain the set of image data features;

wherein […] represents the data distribution of […], and […] represents the data distribution of […].

2. The urban prevention and control image detection method based on asynchronous federated learning according to claim 1, wherein the step of obtaining the global model from a cloud server and performing local training on the to-be-trained set to obtain a local model comprises:

each end device adopts a Sparse R-CNN model to carry out local training on the labeled to-be-trained set.

3. The urban prevention and control image detection method based on asynchronous federated learning according to claim 1, wherein the step of training the local model by adopting an asynchronous federated computing strategy to obtain an updated global model comprises:

the cloud server counts the number of local models participating in the global model training, and judges whether each local model has participated in past training within a specified time window;

if not, the local model does not join the global model training;

4. The urban prevention and control image detection method based on asynchronous federated learning according to claim 3, wherein the determining whether each local model has participated in past training within a specified time window comprises:

[formula omitted];

wherein s is the number of rounds of the current global model update, and […] is the number of rounds in which a local model has taken part in training,

5. The urban prevention and control image detection method based on asynchronous federated learning according to claim 3, wherein the assigning, if so, a corresponding weight to the local model and joining the global model training comprises:

assigning a corresponding weight to the local model by adopting the following formula:

[formula omitted];

[formula omitted];

wherein […] is the weight value of the local model for the s-th round, […] is the weight value of the global model for the s-th round, […] is the set of local model indices for the s-th round, […] is the learning rate, and […] is the hyperparameter that determines the decay rate of the weights.

6. An urban prevention and control image detection system based on asynchronous federated learning, characterized by comprising a cloud server and end devices;

the end equipment is used for carrying out scene labeling on the urban prevention and control image data, dividing the urban prevention and control image data into a training set and a testing set, and screening out a data set which is distributed close to a sample of the testing set from the training set to serve as a to-be-trained set; acquiring the global model from a cloud server, and carrying out local training on the set to be trained to obtain a local model; uploading the local model to a cloud server after homomorphic encryption;

wherein the end device is further configured to:

calculate a data distribution […] of the test set;

[formula omitted];

if yes, the training set does not need to be sampled;

if not, sample the training set to obtain the to-be-trained set;

wherein […] represents the data distribution of the training set, and […] represents the normalized data features collected by an end device;

wherein the end device is further configured to:

randomly sample an image data feature […] from the data distribution […] of the training set;

sample according to the proposal distribution […] to obtain an image data feature […];

calculate the first intermediate value […] using the following formula:

[formula omitted];

sample from the uniform distribution […] to obtain a second intermediate value u;

if so, accept the image data feature […] into the to-be-trained set, i.e. […];

if not, accept the image data feature […] into the to-be-trained set, i.e. […];

repeat the above operations on all image data features […] to obtain the set of image data features;

wherein […] represents the data distribution of […], and […] represents the data distribution of […].

7. The urban prevention and control image detection system based on asynchronous federated learning according to claim 6, wherein the end device is further used for locally training the labeled urban prevention and control image data by adopting a Sparse R-CNN model.

8. The urban prevention and control image detection system based on asynchronous federated learning according to claim 6, wherein:

the cloud server is used for counting the number of local models participating in the global model training and judging whether each local model has participated in past training within a specified time window;

if not, the local model does not join the global model training;