CN116205311A - Federated learning method based on Shapley value

Federated learning method based on Shapley value

Info

Publication number
CN116205311A
Authority
CN
China
Prior art keywords
model parameters
clients
client
federated learning
graph
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202310124072.6A
Other languages
Chinese (zh)
Inventor
朱亚萍
赵生捷
Current Assignee
Tongji University
Original Assignee
Tongji University
Priority date
Filing date
Publication date
Application filed by Tongji University
Priority to CN202310124072.6A
Publication of CN116205311A

Classifications

    • G: PHYSICS
    • G06: COMPUTING; CALCULATING OR COUNTING
    • G06N: COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N20/00: Machine learning
    • H: ELECTRICITY
    • H04: ELECTRIC COMMUNICATION TECHNIQUE
    • H04L: TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L67/00: Network arrangements or protocols for supporting network services or applications
    • H04L67/01: Protocols
    • H04L67/10: Protocols in which an application is distributed across nodes in the network
    • Y: GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02: TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02D: CLIMATE CHANGE MITIGATION TECHNOLOGIES IN INFORMATION AND COMMUNICATION TECHNOLOGIES [ICT], I.E. INFORMATION AND COMMUNICATION TECHNOLOGIES AIMING AT THE REDUCTION OF THEIR OWN ENERGY USE
    • Y02D10/00: Energy efficient computing, e.g. low power processors, power management or thermal management


Abstract

The invention provides a federated learning method based on Shapley values. The method takes into account the differences in data distribution across clients in federated learning: when the global model parameters are computed, the clients' local model parameters are weighted and aggregated according to each local model's contribution to the global training objective. After each training iteration, a weighted graph is constructed from the cosine similarities between the clients' local model parameters, and the Shapley value of each client vertex in the graph is calculated. Based on these Shapley values, the server assigns a weight coefficient to each client's model parameters and aggregates them accordingly to obtain the global model parameters for the next round, until the training objective is reached.

Description

Federated learning method based on Shapley value
Technical Field
The invention belongs to the field of machine learning, and in particular relates to a federated machine learning method.
Background
With the large-scale growth of intelligent terminals and Internet-of-Things devices, processing massive amounts of data has become an essential technology in the era of digital transformation, and for large-scale application scenarios federated machine learning has become one of the key techniques. As a novel distributed learning method, federated learning alleviates the heavy computational load a single server would incur when processing large-scale data by distributing training across multiple clients that share the cost. At the same time, because the clients model jointly without sharing their original data, privacy is protected and data security is guaranteed.
Federated learning trains local models simultaneously on multiple clients and aggregates the local models from different clients into a global model. However, the data in federated learning is typically distributed unevenly across clients and is usually non-IID (not independent and identically distributed), which gives rise to a data heterogeneity problem. If the local models of different clients are aggregated indiscriminately, this heterogeneity degrades the overall training effect. A well-designed and effective method is therefore needed to cope with data heterogeneity in federated learning and improve the overall training result.
Disclosure of Invention
Technical problem: in federated learning, the data on the clients participating in training is typically non-IID, and the effect of each client's locally trained model on the overall training objective often differs across clients and across iterations. Therefore, when the local model parameters from different clients are aggregated, they must be treated differently: the differences among the local models trained by the different clients should be fully taken into account, so as to mitigate the adverse effect of data heterogeneity on the overall federated learning result.
Technical scheme: to solve the above technical problem, the invention provides a federated learning method based on Shapley values. When the local model parameters of the clients are aggregated in federated learning, a weight coefficient is set based on each client's Shapley value, and the local model parameters are then weighted and aggregated according to these coefficients to obtain the global model parameters.
Further, a weighted graph is constructed at the end of each training iteration of federated learning: the vertices of the graph are all clients participating in that round, every pair of clients is connected by an edge, and the weight of an edge is the cosine similarity between the local model parameters of the two clients it connects.
The Shapley value of each client is calculated from the constructed graph. For a vertex (i.e. client) i in the graph, its Shapley value (denoted φ_i^t) is computed as

φ_i^t = Σ_{s∈S_i} [(|s|−1)!(n−|s|)!/n!] · Σ_{j∈s, j≠i} e_ij

where S_i denotes the set of all vertex subsets that contain vertex i, |s| is the number of elements in subset s, n is the number of vertices in the graph, j ranges over the vertices of s other than i, and e_ij is the weight of the edge connecting vertices i and j.
The global model parameters after each iteration are the weighted sum of the local model parameters of all clients participating in the current round, where the weighting coefficient of client i's local model parameters is a function of φ_i^t. Specifically,

w^t = Σ_{i∈L_t} f(φ_i^t) · w_i^t

where w^t denotes the global model parameters after round t, L_t is the set of all clients participating in round t, f(φ_i^t) is a function of the Shapley value φ_i^t, and w_i^t is the local model parameter vector obtained by client i in round t.
Beneficial effects: the method fully considers the possible data differences between clients in federated learning. When the local model parameters are aggregated, the contribution of each client's local model to the overall training objective in each iteration is measured by its Shapley value, and different weighting coefficients are set according to these contributions, thereby reducing the adverse effect of data heterogeneity on federated learning.
Drawings
Fig. 1 is a flow chart of the federated learning method based on Shapley values according to the present invention.
Detailed Description
A federated learning method based on Shapley values: when the local model parameters of the clients are aggregated in federated learning, a weight coefficient is set based on each client's Shapley value, and the local model parameters are then weighted and aggregated according to these coefficients to obtain the global model parameters.
The design of the scheme of the invention is described in further detail below with reference to fig. 1 and the related formulas.
When federated learning starts, the central server randomly selects the clients that will participate in the next round of iterative training and sends the initial global model parameters to all selected clients. Each client then trains locally, starting from the global model parameters, to obtain a new round of local model parameters.
Assume that in the t-th round of federated training, n clients participate, denoted by the set L_t. Client i (i ∈ L_t) obtains the local model parameters w_i^t in this round of training.
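This local training step can be illustrated with a minimal Python sketch. It is not part of the patent: the `local_update` name, the linear model, and the squared loss are assumptions chosen purely for illustration. Each selected client starts from the broadcast global parameters and runs a few epochs of gradient descent on its own data.

```python
import numpy as np

def local_update(global_w: np.ndarray, X: np.ndarray, y: np.ndarray,
                 lr: float = 0.1, epochs: int = 5) -> np.ndarray:
    """Return client-local parameters w_i^t, trained from the broadcast global model."""
    w = global_w.copy()
    for _ in range(epochs):
        # Gradient of the mean squared error (1/m) * ||Xw - y||^2 on this client's data
        grad = 2.0 * X.T @ (X @ w - y) / len(y)
        w -= lr * grad
    return w
```

After this step, each client holds its own w_i^t, which is what gets uploaded to the server in the next paragraph.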
After the round of training ends, each client uploads its locally trained model parameters to the server, and the server constructs a graph from them: the vertices of the graph are the clients participating in this round, every pair of clients is connected by an edge, and the weight of an edge is the cosine similarity between the local model parameters of the two clients it connects. Specifically, the weight e_ij of the edge formed between vertices i and j is

e_ij = (w_i^t · w_j^t) / (‖w_i^t‖ ‖w_j^t‖)

where the numerator is the dot product of the vectors w_i^t and w_j^t, and ‖w_i^t‖ and ‖w_j^t‖ are their norms.
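The graph construction above can be sketched in Python as follows. The function names and the representation of each client's parameters as a single flattened vector are illustrative assumptions, not from the patent.

```python
import numpy as np

def cosine_similarity(w_i: np.ndarray, w_j: np.ndarray) -> float:
    """Dot product of the two parameter vectors divided by the product of their norms."""
    return float(w_i @ w_j / (np.linalg.norm(w_i) * np.linalg.norm(w_j)))

def build_weighted_graph(params: list) -> np.ndarray:
    """Return the n x n symmetric edge-weight matrix E with E[i, j] = e_ij."""
    n = len(params)
    edges = np.zeros((n, n))
    for i in range(n):
        for j in range(i + 1, n):
            edges[i, j] = edges[j, i] = cosine_similarity(params[i], params[j])
    return edges  # diagonal stays zero: a vertex has no edge to itself
```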
The server then calculates the Shapley value of each vertex in the constructed graph. Specifically, for a vertex (i.e. client) i in the graph, its Shapley value (denoted φ_i^t) is computed as

φ_i^t = Σ_{s∈S_i} [(|s|−1)!(n−|s|)!/n!] · Σ_{j∈s, j≠i} e_ij

where S_i denotes the set of all vertex subsets that contain vertex i, |s| is the number of elements in subset s, n is the number of vertices in the graph, j ranges over the vertices of s other than i, and e_ij is the weight of the edge connecting vertices i and j.
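An exact-enumeration sketch of this per-vertex Shapley computation is given below. It assumes, as the formula suggests, that a vertex's marginal contribution to a coalition is the total weight of its edges into that coalition; the `shapley_values` name and the matrix representation are illustrative.

```python
from itertools import combinations
from math import factorial

def shapley_values(edges):
    """Shapley value of every vertex, given an n x n symmetric edge-weight matrix."""
    n = len(edges)
    phi = [0.0] * n
    for i in range(n):
        others = [v for v in range(n) if v != i]
        # Enumerate every subset s containing i (represented here by s \ {i})
        for size in range(len(others) + 1):
            for subset in combinations(others, size):
                s = size + 1                                 # |s|, counting vertex i itself
                coef = factorial(s - 1) * factorial(n - s) / factorial(n)
                marginal = sum(edges[i][j] for j in subset)  # i's edges into the subset
                phi[i] += coef * marginal
    return phi
```

Note that for this particular marginal (sum of edge weights into the coalition), the enumeration collapses algebraically to half the vertex's weighted degree, φ_i = (1/2) Σ_j e_ij, which gives an O(n²) shortcut when the number of clients is large.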
After the Shapley value of every client vertex has been calculated, the server computes a weighted sum of the local model parameters of all clients participating in this round to obtain the new global model parameters:

w^t = Σ_{i∈L_t} f(φ_i^t) · w_i^t

where w^t denotes the global model parameters after round t, and f(φ_i^t) is a function of φ_i^t.
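A sketch of this aggregation step follows. The patent states only that each coefficient is a function of the client's Shapley value; the shift-and-normalize choice below is an illustrative assumption (made because cosine-similarity-based Shapley values can be negative), not the patent's exact formula.

```python
import numpy as np

def aggregate(local_params, shapley):
    """Weighted sum of local parameter vectors, coefficients derived from Shapley values."""
    s = np.asarray(shapley, dtype=float)
    s = s - s.min()  # cosine similarities lie in [-1, 1], so Shapley values may be negative
    if s.sum() == 0.0:
        weights = np.full(len(s), 1.0 / len(s))  # degenerate case: plain averaging
    else:
        weights = s / s.sum()
    return sum(w * p for w, p in zip(weights, local_params))
```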
After each iteration, the server sends the aggregated global model parameters to the clients selected to participate in the next round, and those clients begin a new round of training from these parameters, until the overall convergence target is reached.
The above description covers only preferred embodiments of the present invention; the scope of the invention is not limited to these embodiments, and all equivalent modifications or variations made according to the present disclosure fall within the scope of the claims.

Claims (5)

1. A federated learning method based on Shapley values, characterized in that, when the local model parameters of the clients are aggregated in federated learning, a weight coefficient is set based on each client's Shapley value, and the local model parameters are then weighted and aggregated according to these coefficients to obtain the global model parameters.
2. The Shapley-value-based federated learning method according to claim 1, characterized in that a weighted graph is constructed at the end of each training iteration of federated learning, the vertices of the graph are all clients participating in that round, every pair of clients is connected by an edge, and the weight of an edge is the cosine similarity between the local model parameters of the two clients it connects.
3. The Shapley-value-based federated learning method according to claim 1, characterized in that the weight e_ij of the edge formed between vertices i and j is

e_ij = (w_i^t · w_j^t) / (‖w_i^t‖ ‖w_j^t‖)

where the numerator is the dot product of the local parameter vectors w_i^t and w_j^t, and ‖w_i^t‖ and ‖w_j^t‖ are their norms.
4. The Shapley-value-based federated learning method according to claim 2, characterized in that the Shapley value of each client is calculated from the constructed graph; for a vertex (i.e. client) i in the graph, its Shapley value (denoted φ_i^t) is computed as

φ_i^t = Σ_{s∈S_i} [(|s|−1)!(n−|s|)!/n!] · Σ_{j∈s, j≠i} e_ij

where S_i denotes the set of all vertex subsets that contain vertex i, |s| is the number of elements in subset s, n is the number of vertices in the graph, j ranges over the vertices of s other than i, and e_ij is the weight of the edge connecting vertices i and j.
5. The Shapley-value-based federated learning method according to claim 1, characterized in that the global model parameters after each iteration are the weighted sum of the local model parameters of all clients participating in that round, where the weighting coefficient of client i's local model parameters is a function of its Shapley value φ_i^t; specifically,

w^t = Σ_{i∈L_t} f(φ_i^t) · w_i^t

where w^t denotes the global model parameters after round t, L_t is the set of all clients participating in round t, f(φ_i^t) is a function of φ_i^t, and w_i^t is the local model parameters obtained by client i in round t.
CN202310124072.6A 2023-02-16 2023-02-16 Federated learning method based on Shapley value Pending CN116205311A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202310124072.6A CN116205311A (en) Federated learning method based on Shapley value


Publications (1)

Publication Number Publication Date
CN116205311A true CN116205311A (en) 2023-06-02

Family

ID=86508965

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202310124072.6A Pending CN116205311A (en) Federated learning method based on Shapley value

Country Status (1)

Country Link
CN (1) CN116205311A (en)

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN116522988A * 2023-07-03 2023-08-01 粤港澳大湾区数字经济研究院(福田) Federated learning method, system, terminal and medium based on graph structure learning
CN116522988B * 2023-07-03 2023-10-31 粤港澳大湾区数字经济研究院(福田) Federated learning method, system, terminal and medium based on graph structure learning
CN117057442A * 2023-10-09 2023-11-14 之江实验室 Model training method, device and equipment based on federated multi-task learning


Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination