CN116595589B - Secret sharing mechanism-based distributed support vector machine training method and system

Secret sharing mechanism-based distributed support vector machine training method and system

Info

Publication number
CN116595589B
Authority
CN
China
Prior art keywords
participants
participant
training
support vector
secret
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN202310869601.5A
Other languages
Chinese (zh)
Other versions
CN116595589A (en)
Inventor
Zhang Jifeng
Wang Jimin
Zhao Yanlong
Guo Jin
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
University of Science and Technology Beijing USTB
Academy of Mathematics and Systems Science of CAS
Original Assignee
University of Science and Technology Beijing USTB
Academy of Mathematics and Systems Science of CAS
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by University of Science and Technology Beijing USTB and Academy of Mathematics and Systems Science of CAS
Priority to CN202310869601.5A
Publication of CN116595589A
Application granted
Publication of CN116595589B

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F21/00Security arrangements for protecting computers, components thereof, programs or data against unauthorised activity
    • G06F21/60Protecting data
    • G06F21/62Protecting access to data via a platform, e.g. using keys or access control rules
    • G06F21/6218Protecting access to data via a platform, e.g. using keys or access control rules to a system of files or objects, e.g. local or distributed file system or database
    • G06F21/6245Protecting personal data, e.g. for financial or medical purposes
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/21Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
    • G06F18/214Generating training patterns; Bootstrap methods, e.g. bagging or boosting
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/24Classification techniques
    • G06F18/241Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches
    • G06F18/2411Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches based on the proximity to a decision surface, e.g. support vector machines
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N20/00Machine learning
    • G06N20/10Machine learning using kernel methods, e.g. support vector machines [SVM]
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L9/00Cryptographic mechanisms or cryptographic arrangements for secret or secure communications; Network security protocols
    • H04L9/08Key distribution or management, e.g. generation, sharing or updating, of cryptographic keys or passwords
    • H04L9/0816Key establishment, i.e. cryptographic processes or cryptographic protocols whereby a shared secret becomes available to two or more parties, for subsequent use
    • H04L9/085Secret sharing or secret splitting, e.g. threshold schemes
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02TCLIMATE CHANGE MITIGATION TECHNOLOGIES RELATED TO TRANSPORTATION
    • Y02T10/00Road transport of goods or passengers
    • Y02T10/10Internal combustion engine [ICE] based vehicles
    • Y02T10/40Engine management systems

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • Software Systems (AREA)
  • Artificial Intelligence (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Evolutionary Computation (AREA)
  • Computer Security & Cryptography (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Health & Medical Sciences (AREA)
  • Evolutionary Biology (AREA)
  • Medical Informatics (AREA)
  • General Health & Medical Sciences (AREA)
  • Bioethics (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Computer Hardware Design (AREA)
  • Mathematical Physics (AREA)
  • Computing Systems (AREA)
  • Databases & Information Systems (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Signal Processing (AREA)
  • Complex Calculations (AREA)

Abstract

The invention relates to the technical field of support vector machines, and in particular to a distributed support vector machine training method and system based on a secret sharing mechanism, wherein the method comprises the following steps: S1, decomposing the optimization problem of a support vector machine (SVM) into N sub-optimization problems by using the alternating direction method of multipliers (ADMM) algorithm; S2, solving the N sub-optimization problems locally by N participants respectively, wherein the N participants each hold a horizontally distributed data set, and the data sets have the same features; S3, training and outputting a global classifier by the N participants through ADMM parameter interaction, wherein the interaction parameters are protected by an FSVM-C privacy protection scheme based on a secret sharing mechanism. With the method and system, the goals of collaborative training and privacy protection in a fully distributed scenario can be achieved.

Description

Secret sharing mechanism-based distributed support vector machine training method and system
Technical Field
The invention relates to the technical field of support vector machines, in particular to a distributed support vector machine training method and system based on a secret sharing mechanism.
Background
A support vector machine (SVM) is a widely used supervised learning method, mainly used to construct a generalized linear classifier for the binary classification of data, and is now popular in text classification, handwritten character recognition and bioinformatics. In real scenarios, the training sample set can be huge: the hardware of a single machine cannot support operating on all the data directly, and the accuracy of the algorithm's classification result is then hard to guarantee; in addition, how to guarantee privacy and security during the implementation of a support vector machine is also a problem to be solved urgently.
Disclosure of Invention
The embodiment of the invention provides a distributed support vector machine training method and system based on a secret sharing mechanism. The technical scheme is as follows:
In one aspect, a distributed support vector machine training method based on a secret sharing mechanism is provided, the method including:
S1, decomposing the optimization problem of a support vector machine (SVM) into N sub-optimization problems by using the alternating direction method of multipliers (ADMM) algorithm;
S2, solving the N sub-optimization problems locally by N participants respectively, wherein the N participants each hold a horizontally distributed data set, and the data sets have the same features;
S3, training and outputting a global classifier by the N participants through ADMM parameter interaction, wherein the interaction parameters are protected by an FSVM-C privacy protection scheme based on a secret sharing mechanism when the N participants exchange the ADMM parameters.
Optionally, in the SVM classification problem, the feature space of the input samples and labels is given, and the hyperplane to be solved is denoted as $w^\top x + b = 0$, where w is the normal vector of the hyperplane and b is the intercept; the model of the SVM is:

$$\min_{w,b,\xi}\ \frac{1}{2}\|w\|^2 + C\sum_{i=1}^{N}\xi_i \quad \text{s.t.}\quad y_i(w^\top x_i + b) \ge 1 - \xi_i,\ \xi_i \ge 0,\ i = 1,\dots,N \qquad (1)$$

where N is the number of samples, x_i denotes the i-th sample, y_i denotes the label corresponding to the i-th sample, and ξ_i is a relaxation variable; each sample corresponds to one relaxation variable, the relaxation variables of non-outlier points are all 0, and ξ_i indicates the degree to which a sample fails to satisfy the constraint, thereby introducing fault tolerance; C is a penalty factor, and w and b are learned and output by the training method;

the Lagrangian function L for solving the model is shown in equation (2):

$$L(w,b,\xi,\alpha,\mu) = \frac{1}{2}\|w\|^2 + C\sum_{i=1}^{N}\xi_i - \sum_{i=1}^{N}\alpha_i\big[y_i(w^\top x_i + b) - 1 + \xi_i\big] - \sum_{i=1}^{N}\mu_i\xi_i \qquad (2)$$

where α and µ are Lagrangian multipliers, α_i ≥ 0, µ_i ≥ 0;

by equation (2), the support vector machine problem is converted into the min-max problem of L, the goal being to find the values of w, b and ξ that minimize L and the α that maximizes L. Lagrangian duality is applied here: solving the dual problem yields the optimal solution of the original problem and reduces the solving difficulty. The dual of the original problem is a max-min problem: first solve for the w, b and ξ that minimize L, then maximize L with respect to α. The dual problem is shown in equation (3):

$$\max_{\alpha}\ \sum_{i=1}^{N}\alpha_i - \frac{1}{2}\sum_{i=1}^{N}\sum_{j=1}^{N}\alpha_i\alpha_j y_i y_j x_i^\top x_j \quad \text{s.t.}\quad \sum_{i=1}^{N}\alpha_i y_i = 0,\ 0 \le \alpha_i \le C \qquad (3)$$

where α = (α_1, α_2, ..., α_N) is the Lagrangian multiplier vector;
in the model training stage, the optimization problem is solved by the steepest descent method combined with the exterior-point penalty method, with the following steps (a numerical sketch of this double loop is given after the steps):
(1) Outer loop
Step 1: select an initial point α^0 and an initial penalty factor M_1 = 1, set a precision ε_1 > 0, and let k := 1;
Step 2: taking α^{k-1} as the initial point, solve the constrained optimization problem as in equation (4):
(4)
enter the inner-loop steepest descent method to solve F(α, M_k) and obtain its minimum point, denoted α^k, where:
(5)
Step 3: if the precision requirement ε_1 is met, stop the calculation and take α^k as the approximate minimum point; otherwise let M_{k+1} = cM_k, k := k+1, and go to Step 2;
(2) Inner loop
Step 1: take α^{k-1} as the inner-loop initial point α_0, set a precision ε_2 > 0, and let t := 1;
Step 2: determine the search direction; for each α_i compute:
(6)
where:
(7)
finally obtaining the search direction d_t:
(8)
Step 3: starting from the current point, find the step length λ_t along the direction d_t;
Step 4: compute α_{t+1} = α_t + λ_t d_t;
Step 5: if the precision requirement ε_2 is met, stop the iteration; otherwise let t := t+1 and return to Step 2.
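As a concrete illustration of the double loop above, the following minimal Python sketch applies the same outer-penalty / inner-steepest-descent structure to a toy problem; the objective f, constraint g, penalty form F(α, M) = f(α) + M·g(α)², and the backtracking step-size rule are all illustrative assumptions, not the patent's equations (4)–(8).

```python
import numpy as np

# Toy stand-ins for the patent's F(alpha, M_k): minimize f(alpha) subject to
# g(alpha) = 0 via the exterior penalty F(alpha, M) = f(alpha) + M * g(alpha)**2.
# f and g are illustrative assumptions, not the patent's equations (4)-(8).
f = lambda a: 0.5 * np.dot(a, a) - a.sum()        # toy objective
g = lambda a: a.sum() - 1.0                       # toy equality constraint
F = lambda a, M: f(a) + M * g(a) ** 2
grad_F = lambda a, M: (a - 1.0) + 2.0 * M * g(a) * np.ones_like(a)

def inner_steepest_descent(a, M, eps2=1e-8, max_t=2000):
    for _ in range(max_t):
        d = -grad_F(a, M)                         # Step 2: search direction d_t
        if np.linalg.norm(d) < eps2:              # Step 5: precision reached
            return a
        lam = 1.0                                 # Step 3: backtracking step length
        while F(a + lam * d, M) > F(a, M) and lam > 1e-15:
            lam *= 0.5
        a = a + lam * d                           # Step 4: a_{t+1} = a_t + lam_t * d_t
    return a

def outer_penalty(a, M=1.0, c=10.0, eps1=1e-10, max_k=12):
    for _ in range(max_k):                        # outer loop over penalty factors M_k
        a = inner_steepest_descent(a, M)          # Step 2: minimize F(., M_k)
        if M * g(a) ** 2 < eps1:                  # Step 3: approximate minimum found
            return a
        M *= c                                    # M_{k+1} = c * M_k
    return a

print(outer_penalty(np.zeros(3)))                 # -> approximately [1/3, 1/3, 1/3]
```

The growing penalty factor M_k forces the iterates toward feasibility from outside the feasible set, which is the defining behavior of the exterior-point method.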
Optionally, assuming N = 2, the S1 specifically includes: the optimization problem of the SVM is decomposed into two sub-optimization problems f(x) and g(z):

$$\min_{x,z}\ f(x) + g(z) \quad \text{s.t.}\quad Ax + Bz = c \qquad (9)$$

where x ∈ R^n and z ∈ R^M are the optimization variables, A and B are constraint matrices and c is a constant vector; the objective function consists of the two parts f(x) and g(z); as can be seen, the originally coupled variables are split and treated as different variables, the constraint is handled in the same way, and the iterative process then optimizes the variables alternately;

the augmented Lagrangian function of equation (9) is shown below:

$$L_\rho(x,z,\lambda) = f(x) + g(z) + \lambda^\top(Ax + Bz - c) + \frac{\rho}{2}\,\|Ax + Bz - c\|_2^2 \qquad (10)$$

where λ is the dual variable and ρ is the coefficient of the quadratic penalty term;

the specific iteration steps of the ADMM algorithm are as follows:

$$x^{t+1} = \arg\min_x L_\rho(x, z^t, \lambda^t),\qquad z^{t+1} = \arg\min_z L_\rho(x^{t+1}, z, \lambda^t),\qquad \lambda^{t+1} = \lambda^t + \rho\,(Ax^{t+1} + Bz^{t+1} - c) \qquad (11)$$
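The iteration (11) can be exercised on a small instance. The sketch below runs 2-block ADMM on a toy consensus problem min f(x) + g(z) s.t. x − z = 0 (so A = I, B = −I, c = 0) with quadratic f and g; the problem choice and the value of ρ are illustrative assumptions.

```python
import numpy as np

# 2-block ADMM for min f(x) + g(z) s.t. x - z = 0 (A = I, B = -I, c = 0),
# with f(x) = 0.5||x - a||^2 and g(z) = 0.5||z - b||^2; the problem and rho
# are illustrative assumptions, not the patent's SVM sub-problems.
a, b, rho = np.array([1.0, 2.0]), np.array([3.0, 0.0]), 1.0
x, z, lam = np.zeros(2), np.zeros(2), np.zeros(2)

for t in range(100):
    x = (a - lam + rho * z) / (1.0 + rho)   # x-update: argmin_x L_rho(x, z^t, lam^t)
    z = (b + lam + rho * x) / (1.0 + rho)   # z-update: argmin_z L_rho(x^{t+1}, z, lam^t)
    lam = lam + rho * (x - z)               # dual update: lam += rho * (Ax + Bz - c)

print(x, z)                                 # both converge to (a + b) / 2 = [2.0, 1.0]
```

Because each sub-minimization here is quadratic, the x- and z-updates have the closed forms shown in the comments; in the SVM setting the same alternation is what lets each participant solve only its own sub-problem.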
Optionally, the step S3 specifically includes:
S31, all the participants reach consensus in advance, the consensus including the number of nodes N and the penalty factor C, and each participant initializes v^0 and λ^0 and sets a threshold ε_i, where v_i = [ω_i^T, b_i]^T ∈ R^{D+1}, D is the number of features of the data set, and V = [v_1^T, v_2^T, ..., v_N^T] denotes the collection of the v_i;
S32, all the participants and their neighboring nodes generate in advance the ρ-increment sequences and the multiplication-triple tables for secret-sharing multiplication, and pre-compute the matrix operations that remain unchanged during the iterative process;
S33, in the t-th iteration, participant i first interacts with all neighbor nodes j, e_ij ∈ E, computes Σ_j ρ^t [v_i^t + v_j^t] using the FSVM-C privacy protection scheme, and then computes the Lagrangian multiplier λ_i^t;
S34, participant i updates v_i^{t+1} according to equation (12) and broadcasts it to all neighbor nodes j, e_ij ∈ E;
(12)
S35, participant i first interacts with all neighbor nodes j, e_ij ∈ E, to perform collaborative computation, carries out the secure computation with the FSVM-C privacy protection scheme, and updates λ^{t+1};
S36, compute the stopping condition R^{t+1} as shown in equation (13); if it is satisfied, stop the iteration, otherwise return to step S33 and continue iterating;
(13)
After iterative training, all the participants obtain the global optimal solution, and thereby the global maximum-margin classification function.
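To make the message pattern of S31–S36 concrete, the following single-process simulation sketches the decentralized loop on an assumed ring topology; plain consensus averaging stands in for the patent's updates (12)–(13), and the sums that FSVM-C would secret-share are computed in the clear, so this shows only the communication structure, not the security mechanism.

```python
import numpy as np

# Message-pattern skeleton of S31-S36 on a ring of N nodes. Plain consensus
# averaging stands in for the patent's updates (12)-(13); the neighbor sums
# that FSVM-C would secret-share are computed in the clear here.
N, D, eps = 4, 3, 1e-12
rng = np.random.default_rng(0)
v = rng.normal(size=(N, D + 1))                      # S31: each node's v_i^0
neighbors = {i: [(i - 1) % N, (i + 1) % N] for i in range(N)}

for t in range(10000):
    v_old = v.copy()
    for i in range(N):
        s = sum(v_old[j] for j in neighbors[i])      # S33: neighbor interaction
        v[i] = (v_old[i] + s) / (1 + len(neighbors[i]))  # S34-style local update
    if np.max(np.abs(v - v_old)) < eps:              # S36: stopping condition R^{t+1}
        break

print(t, v[0])   # all rows converge to the mean of the initial v_i
```

The point of the skeleton is the locality: every node touches only its own state and its neighbors' broadcasts, which is the property the secret-sharing layer then protects.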
Optionally, the FSVM-C privacy protection scheme specifically includes:
let [N] be the set of participants, B_i = [X_i, 1_i], 1_i ∈ R^{M_i}, where X_i denotes the sample matrix, Y_i = diag([y_{i1}, ..., y_{iM_i}]) is the diagonal matrix of labels (which simplifies the subsequent model), Ξ_i = [ξ_{i1}, ..., ξ_{iM_i}] denotes the relaxation variables of the i-th participant, and C is the support vector machine penalty factor; in addition, let I_{D+1} denote a unit vector of dimension D+1; the N participants output the maximum-margin linear classification function through a consensus-based support vector machine model; assume a network of N participants and let the graph G = {N, E} represent the communication pattern of the network, where E denotes the set of communication links between the participants, any node i ∈ [N] interacts only with its neighboring nodes, and the connection between two nodes is denoted e_ij ∈ E;
participant P_i and its neighbor node P_j locally compute, from pre-generated parameters, the multiplication-triple table;
participant P_i performs additive secret splitting on its local inputs;
participant P_i sends a sub-secret to participant P_j;
participant P_j performs additive secret splitting on its local inputs;
participant P_j sends a sub-secret to participant P_i;
participant P_i locally computes its share of the result;
participant P_j locally computes its share of the result;
both sides reconstruct and output the secret.
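The two-party flow above follows the classic Beaver-triple pattern for multiplying secret-shared values. The sketch below is a minimal illustration of that pattern, not the patent's exact FSVM-C messages: additive sharing over a prime field (the modulus is an assumption), a pre-generated triple, local share computations, and final reconstruction.

```python
import secrets

P = 2**61 - 1                       # prime modulus for additive sharing (assumption)

def share(x):
    """Additive 2-of-2 secret split: x = x0 + x1 (mod P)."""
    x0 = secrets.randbelow(P)
    return x0, (x - x0) % P

def reconstruct(s0, s1):
    return (s0 + s1) % P

# Pre-generated multiplication triple c = a * b (normally produced in a setup
# phase, like the patent's pre-computed triple tables; in the clear here).
a, b = secrets.randbelow(P), secrets.randbelow(P)
c = (a * b) % P
a0, a1 = share(a); b0, b1 = share(b); c0, c1 = share(c)

def beaver_multiply(x, y):
    """Compute x*y from additive shares using the triple (a, b, c)."""
    x0, x1 = share(x)               # party i splits its input
    y0, y1 = share(y)               # party j splits its input
    # Each party opens the masked values d = x - a and e = y - b.
    d = reconstruct((x0 - a0) % P, (x1 - a1) % P)
    e = reconstruct((y0 - b0) % P, (y1 - b1) % P)
    # Local share computation; only party 0 adds the public d*e term.
    z0 = (c0 + d * b0 + e * a0 + d * e) % P
    z1 = (c1 + d * b1 + e * a1) % P
    return reconstruct(z0, z1)      # both sides reconstruct the product

print(beaver_multiply(12345, 67890) == (12345 * 67890) % P)  # True
```

The opened values d and e are uniformly random masks, so nothing about x or y leaks before the final reconstruction; correctness follows from xy = (a+d)(b+e) = c + d·b + e·a + d·e.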
In another aspect, a distributed support vector machine training system based on a secret sharing mechanism is provided, the system comprising:
a decomposition module, configured to decompose the optimization problem of the support vector machine (SVM) into N sub-optimization problems using the alternating direction method of multipliers (ADMM) algorithm;
a local solving module, configured for the N participants to solve the N sub-optimization problems locally respectively, wherein the N participants each hold a horizontally distributed data set, and the data sets have the same features;
and a training module, configured for the N participants to train and output a global classifier through ADMM parameter interaction, wherein the interaction parameters are protected by an FSVM-C privacy protection scheme based on a secret sharing mechanism.
Optionally, in the SVM classification problem, the feature space of the input samples and labels is given, and the hyperplane to be solved is denoted as $w^\top x + b = 0$, where w is the normal vector of the hyperplane and b is the intercept; the model of the SVM is:

$$\min_{w,b,\xi}\ \frac{1}{2}\|w\|^2 + C\sum_{i=1}^{N}\xi_i \quad \text{s.t.}\quad y_i(w^\top x_i + b) \ge 1 - \xi_i,\ \xi_i \ge 0,\ i = 1,\dots,N \qquad (1)$$

where N is the number of samples, x_i denotes the i-th sample, y_i denotes the label corresponding to the i-th sample, and ξ_i is a relaxation variable; each sample corresponds to one relaxation variable, the relaxation variables of non-outlier points are all 0, and ξ_i indicates the degree to which a sample fails to satisfy the constraint, thereby introducing fault tolerance; C is a penalty factor, and w and b are learned and output by the training method;

the Lagrangian function L for solving the model is shown in equation (2):

$$L(w,b,\xi,\alpha,\mu) = \frac{1}{2}\|w\|^2 + C\sum_{i=1}^{N}\xi_i - \sum_{i=1}^{N}\alpha_i\big[y_i(w^\top x_i + b) - 1 + \xi_i\big] - \sum_{i=1}^{N}\mu_i\xi_i \qquad (2)$$

where α and µ are Lagrangian multipliers, α_i ≥ 0, µ_i ≥ 0;

by equation (2), the support vector machine problem is converted into the min-max problem of L, the goal being to find the values of w, b and ξ that minimize L and the α that maximizes L. Lagrangian duality is applied here: solving the dual problem yields the optimal solution of the original problem and reduces the solving difficulty. The dual of the original problem is a max-min problem: first solve for the w, b and ξ that minimize L, then maximize L with respect to α. The dual problem is shown in equation (3):

$$\max_{\alpha}\ \sum_{i=1}^{N}\alpha_i - \frac{1}{2}\sum_{i=1}^{N}\sum_{j=1}^{N}\alpha_i\alpha_j y_i y_j x_i^\top x_j \quad \text{s.t.}\quad \sum_{i=1}^{N}\alpha_i y_i = 0,\ 0 \le \alpha_i \le C \qquad (3)$$

where α = (α_1, α_2, ..., α_N) is the Lagrangian multiplier vector;
in the model training stage, the training module is specifically configured to solve the optimization problem by the steepest descent method combined with the exterior-point penalty method, with the following steps:
(1) Outer loop
Step 1: select an initial point α^0 and an initial penalty factor M_1 = 1, set a precision ε_1 > 0, and let k := 1;
Step 2: taking α^{k-1} as the initial point, solve the constrained optimization problem as in equation (4):
(4)
enter the inner-loop steepest descent method to solve F(α, M_k) and obtain its minimum point, denoted α^k, where:
(5)
Step 3: if the precision requirement ε_1 is met, stop the calculation and take α^k as the approximate minimum point; otherwise let M_{k+1} = cM_k, k := k+1, and go to Step 2;
(2) Inner loop
Step 1: take α^{k-1} as the inner-loop initial point α_0, set a precision ε_2 > 0, and let t := 1;
Step 2: determine the search direction; for each α_i compute:
(6)
where:
(7)
finally obtaining the search direction d_t:
(8)
Step 3: starting from the current point, find the step length λ_t along the direction d_t;
Step 4: compute α_{t+1} = α_t + λ_t d_t;
Step 5: if the precision requirement ε_2 is met, stop the iteration; otherwise let t := t+1 and return to Step 2.
Optionally, assuming N = 2, the decomposition module is specifically configured to decompose the optimization problem of the SVM into two sub-optimization problems f(x) and g(z):

$$\min_{x,z}\ f(x) + g(z) \quad \text{s.t.}\quad Ax + Bz = c \qquad (9)$$

where x ∈ R^n and z ∈ R^M are the optimization variables, A and B are constraint matrices and c is a constant vector; the objective function consists of the two parts f(x) and g(z); as can be seen, the originally coupled variables are split and treated as different variables, the constraint is handled in the same way, and the iterative process then optimizes the variables alternately;

the augmented Lagrangian function of equation (9) is shown below:

$$L_\rho(x,z,\lambda) = f(x) + g(z) + \lambda^\top(Ax + Bz - c) + \frac{\rho}{2}\,\|Ax + Bz - c\|_2^2 \qquad (10)$$

where λ is the dual variable and ρ is the coefficient of the quadratic penalty term;

the specific iteration steps of the ADMM algorithm are as follows:

$$x^{t+1} = \arg\min_x L_\rho(x, z^t, \lambda^t),\qquad z^{t+1} = \arg\min_z L_\rho(x^{t+1}, z, \lambda^t),\qquad \lambda^{t+1} = \lambda^t + \rho\,(Ax^{t+1} + Bz^{t+1} - c) \qquad (11)$$
Optionally, the training module is specifically configured such that:
all the participants reach consensus in advance, the consensus including the number of nodes N and the penalty factor C, and each participant initializes v^0 and λ^0 and sets a threshold ε_i, where v_i = [ω_i^T, b_i]^T ∈ R^{D+1}, D is the number of features of the data set, and V = [v_1^T, v_2^T, ..., v_N^T] denotes the collection of the v_i;
all the participants and their neighboring nodes generate in advance the ρ-increment sequences and the multiplication-triple tables for secret-sharing multiplication, and pre-compute the matrix operations that remain unchanged during the iterative process;
in the t-th iteration, participant i first interacts with all neighbor nodes j, e_ij ∈ E, computes Σ_j ρ^t [v_i^t + v_j^t] using the FSVM-C privacy protection scheme, and then computes the Lagrangian multiplier λ_i^t;
participant i updates v_i^{t+1} according to equation (12) and broadcasts it to all neighbor nodes j, e_ij ∈ E;
(12)
participant i first interacts with all neighbor nodes j, e_ij ∈ E, to perform collaborative computation, carries out the secure computation with the FSVM-C privacy protection scheme, and updates λ^{t+1};
the stopping condition R^{t+1} is computed as shown in equation (13); if it is satisfied, the iteration stops, otherwise the process returns to the neighbor-interaction step and continues iterating;
(13)
after iterative training, all the participants obtain the global optimal solution, and thereby the global maximum-margin classification function.
Optionally, the FSVM-C privacy protection scheme specifically includes:
let [N] be the set of participants, B_i = [X_i, 1_i], 1_i ∈ R^{M_i}, where X_i denotes the sample matrix, Y_i = diag([y_{i1}, ..., y_{iM_i}]) is the diagonal matrix of labels (which simplifies the subsequent model), Ξ_i = [ξ_{i1}, ..., ξ_{iM_i}] denotes the relaxation variables of the i-th participant, and C is the support vector machine penalty factor; in addition, let I_{D+1} denote a unit vector of dimension D+1; the N participants output the maximum-margin linear classification function through a consensus-based support vector machine model; assume a network of N participants and let the graph G = {N, E} represent the communication pattern of the network, where E denotes the set of communication links between the participants, any node i ∈ [N] interacts only with its neighboring nodes, and the connection between two nodes is denoted e_ij ∈ E;
participant P_i and its neighbor node P_j locally compute, from pre-generated parameters, the multiplication-triple table;
participant P_i performs additive secret splitting on its local inputs;
participant P_i sends a sub-secret to participant P_j;
participant P_j performs additive secret splitting on its local inputs;
participant P_j sends a sub-secret to participant P_i;
participant P_i locally computes its share of the result;
participant P_j locally computes its share of the result;
both sides reconstruct and output the secret.
In another aspect, an electronic device is provided that includes a processor and a memory having at least one instruction stored therein that is loaded and executed by the processor to implement the above-described secret sharing mechanism based distributed support vector machine training method.
In another aspect, a computer readable storage medium having stored therein at least one instruction that is loaded and executed by a processor to implement the above-described secret sharing mechanism based distributed support vector machine training method is provided.
The technical scheme provided by the embodiment of the invention has at least the following beneficial effects:
The design of the invention constructs the SVM classifier in a distributed and privacy-preserving manner. The main idea is to combine the ADMM algorithm with secret sharing technology, exploiting the decomposability of the ADMM algorithm and the secure-computation property of secret sharing, and to solve the optimization problem in the model training stage by the steepest descent method combined with the exterior-point penalty method, thereby achieving the goals of collaborative training and privacy protection in a fully distributed scenario.
Drawings
In order to more clearly illustrate the technical solutions of the embodiments of the present invention, the drawings required for the description of the embodiments will be briefly described below, and it is apparent that the drawings in the following description are only some embodiments of the present invention, and other drawings may be obtained according to these drawings without inventive effort for a person skilled in the art.
FIG. 1 is a flowchart of a training method of a distributed support vector machine based on a secret sharing mechanism according to an embodiment of the present invention;
FIG. 2 is a schematic diagram of a lateral distribution of data provided by an embodiment of the present invention;
FIG. 3 is a schematic diagram of a data longitudinal distribution provided by an embodiment of the present invention;
FIG. 4 is a flowchart of another training method of a distributed support vector machine based on a secret sharing mechanism according to an embodiment of the present invention;
FIG. 5 is a block diagram of a distributed support vector machine training system based on a secret sharing mechanism provided by an embodiment of the present invention;
fig. 6 is a schematic structural diagram of an electronic device according to an embodiment of the present invention.
Detailed Description
In order to make the technical problems to be solved, the technical solutions and the advantages clearer, a detailed description is given below with reference to the accompanying drawings and specific embodiments.
The embodiment of the invention provides a distributed support vector machine training method based on a secret sharing mechanism, which can be implemented by an electronic device, where the electronic device may be a terminal or a server. As shown in the flowchart of FIG. 1, the processing flow of the method may include the following steps:
S1, decomposing the optimization problem of a support vector machine (SVM) into N sub-optimization problems by using the alternating direction method of multipliers (ADMM) algorithm;
S2, solving the N sub-optimization problems locally by N participants respectively, wherein the N participants each hold a horizontally distributed data set, and the data sets have the same features;
S3, training and outputting a global classifier by the N participants through ADMM parameter interaction, wherein the interaction parameters are protected by an FSVM-C privacy protection scheme based on a secret sharing mechanism when the N participants exchange the ADMM parameters.
The traditional multi-data-source support vector machine model gathers the data of each data source together, after which a central server trains the classification model; the related security research has all been developed around this model. Assume there are M data samples, each containing multiple features; for the i-th sample there are x_i ∈ R^D and a label y_i ∈ {−1, 1}, and let ξ denote the relaxation variables. The classification model, namely the maximum-margin linear discriminant function g(x) = sign(w^T x + b), is obtained by solving a convex optimization problem of the form of equation (1).
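For contrast with the distributed scheme developed below, the centralized baseline just described can be reproduced in a few lines with an off-the-shelf solver; scikit-learn's LinearSVC and the synthetic data here are illustrative stand-ins for the convex-optimization step, not the patent's solver.

```python
import numpy as np
from sklearn.svm import LinearSVC

# Two separable Gaussian blobs as stand-in data (sizes and centers assumed).
rng = np.random.default_rng(1)
X = np.vstack([rng.normal(-2.0, 1.0, (50, 2)), rng.normal(2.0, 1.0, (50, 2))])
y = np.array([-1] * 50 + [1] * 50)

clf = LinearSVC(C=1.0).fit(X, y)     # central server trains on the pooled data
w, b = clf.coef_[0], clf.intercept_[0]
pred = np.sign(X @ w + b)            # g(x) = sign(w^T x + b)
print((pred == y).mean())            # close to 1.0 on this toy set
```

The point of the baseline is its requirement: all M samples must sit on one machine, which is exactly the assumption the distributed model removes.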
In real scenarios, the training sample set is huge and the data are collected and stored in a distributed manner; using the traditional multi-data-source support vector machine model increases the communication cost of data transmission and imposes a huge computational load on the central server. Therefore, the embodiment of the invention provides a fully distributed support vector machine model in which different data holders train collaboratively by means of the alternating direction method of multipliers (ADMM) algorithm; it is better suited to scenarios where data are collected and stored in a distributed manner and can obtain more accurate results.
In addition, the embodiment of the invention further provides a privacy protection scheme for the fully distributed support vector machine model. In the model training stage, the embodiment of the invention solves the optimization problem by the steepest descent method combined with the exterior-point penalty method.
The classification model and the training method of the distributed support vector machine based on a secret sharing mechanism according to the embodiment of the present invention are described in detail below with reference to FIG. 2 to FIG. 4, the training method including:
S1, decomposing the optimization problem of a support vector machine (SVM) into N sub-optimization problems by using the alternating direction method of multipliers (ADMM) algorithm;
Optionally, in the SVM classification problem, the feature space of the input samples and labels is given, and the hyperplane to be solved is denoted as $w^\top x + b = 0$, where w is the normal vector of the hyperplane and b is the intercept; the model of the SVM is:

$$\min_{w,b,\xi}\ \frac{1}{2}\|w\|^2 + C\sum_{i=1}^{N}\xi_i \quad \text{s.t.}\quad y_i(w^\top x_i + b) \ge 1 - \xi_i,\ \xi_i \ge 0,\ i = 1,\dots,N \qquad (1)$$

where N is the number of samples, x_i denotes the i-th sample, y_i denotes the label corresponding to the i-th sample, and ξ_i is a relaxation variable; each sample corresponds to one relaxation variable, the relaxation variables of non-outlier points are all 0, and ξ_i indicates the degree to which a sample fails to satisfy the constraint, thereby introducing fault tolerance; C is a penalty factor, and w and b are learned and output by the training method;

the Lagrangian function L for solving the model is shown in equation (2):

$$L(w,b,\xi,\alpha,\mu) = \frac{1}{2}\|w\|^2 + C\sum_{i=1}^{N}\xi_i - \sum_{i=1}^{N}\alpha_i\big[y_i(w^\top x_i + b) - 1 + \xi_i\big] - \sum_{i=1}^{N}\mu_i\xi_i \qquad (2)$$

where α and µ are Lagrangian multipliers, α_i ≥ 0, µ_i ≥ 0;

by equation (2), the support vector machine problem is converted into the min-max problem of L, the goal being to find the values of w, b and ξ that minimize L and the α that maximizes L. Lagrangian duality is applied here: solving the dual problem yields the optimal solution of the original problem and reduces the solving difficulty. The dual of the original problem is a max-min problem: first solve for the w, b and ξ that minimize L, then maximize L with respect to α. The dual problem is shown in equation (3):

$$\max_{\alpha}\ \sum_{i=1}^{N}\alpha_i - \frac{1}{2}\sum_{i=1}^{N}\sum_{j=1}^{N}\alpha_i\alpha_j y_i y_j x_i^\top x_j \quad \text{s.t.}\quad \sum_{i=1}^{N}\alpha_i y_i = 0,\ 0 \le \alpha_i \le C \qquad (3)$$

where α = (α_1, α_2, ..., α_N) is the Lagrangian multiplier vector;
in the model training stage, the optimization problem is solved by the steepest descent method combined with the exterior-point penalty method, with the following steps:
(1) Outer loop
Step 1: select an initial point α^0 and an initial penalty factor M_1 = 1, set a precision ε_1 > 0, and let k := 1;
Step 2: taking α^{k-1} as the initial point, solve the constrained optimization problem as in equation (4):
(4)
enter the inner-loop steepest descent method to solve F(α, M_k) and obtain its minimum point, denoted α^k, where:
(5)
Step 3: if the precision requirement ε_1 is met, stop the calculation and take α^k as the approximate minimum point; otherwise let M_{k+1} = cM_k, k := k+1, and go to Step 2;
(2) Inner loop
Step 1: take α^{k-1} as the inner-loop initial point α_0, set a precision ε_2 > 0, and let t := 1;
Step 2: determine the search direction; for each α_i compute:
(6)
where:
(7)
finally obtaining the search direction d_t:
(8)
Step 3: starting from the current point, find the step length λ_t along the direction d_t;
Step 4: compute α_{t+1} = α_t + λ_t d_t;
Step 5: if the precision requirement ε_2 is met, stop the iteration; otherwise let t := t+1 and return to Step 2.
Optionally, assuming N = 2, the S1 specifically includes: the optimization problem of the SVM is decomposed into two sub-optimization problems f(x) and g(z):

$$\min_{x,z}\ f(x) + g(z) \quad \text{s.t.}\quad Ax + Bz = c \qquad (9)$$

where x ∈ R^n and z ∈ R^M are the optimization variables, A and B are constraint matrices and c is a constant vector; the objective function consists of the two parts f(x) and g(z); as can be seen, the originally coupled variables are split and treated as different variables, the constraint is handled in the same way, and the iterative process then optimizes the variables alternately;

the augmented Lagrangian function of equation (9) is shown below:

$$L_\rho(x,z,\lambda) = f(x) + g(z) + \lambda^\top(Ax + Bz - c) + \frac{\rho}{2}\,\|Ax + Bz - c\|_2^2 \qquad (10)$$

where λ is the dual variable and ρ is the coefficient of the quadratic penalty term;

the specific iteration steps of the ADMM algorithm are as follows:

$$x^{t+1} = \arg\min_x L_\rho(x, z^t, \lambda^t),\qquad z^{t+1} = \arg\min_z L_\rho(x^{t+1}, z, \lambda^t),\qquad \lambda^{t+1} = \lambda^t + \rho\,(Ax^{t+1} + Bz^{t+1} - c) \qquad (11)$$
the ADMM algorithm is that the iteration of x and z is changed into independent iteration of x and z on the basis of the original augmentation Lagrangian algorithm, which means that the ADMM algorithm is divided into 2-blocks, and the ADMM algorithm can be popularized to N-blocks after deformation, namely the scenes of N participants.
S2, respectively carrying out local solution on the N sub-optimization problems by N participants, wherein the N participants respectively have data sets with data distributed transversely, and the data sets have the same characteristics;
When the training samples of the data set are too large, the samples can also be partitioned for collaborative training. Assume there are N participants in the federated support vector machine (FSVM) scheme; the i-th participant holds M_i data samples, the matrix X_i is its feature matrix, x_ij denotes one data sample, and y_ij ∈ {−1, 1} denotes the label corresponding to x_ij. In addition, the total number of features considered in the scheme of the embodiment of the invention is D.
In a scenario where multiple data owners co-train a model, there are two types of data partitioning: by sample and by feature.
The first is partitioning by sample, i.e., horizontal distribution, as shown in FIG. 2. This means that each participant holds all the features but only part of the data; it suits scenarios in which the data sources of multiple parties share the same sample features, for example collaborative training among several e-commerce platforms, which cooperate by sharing shopping data sets of different users with the same feature types to obtain a better training effect.
Thus X_i ∈ R^{M_i×D} and x_ij ∈ R^D, where i = 1, 2, ..., N and j = 1, 2, ..., M_i.
The second is partitioning by feature, i.e., vertical distribution. As shown in FIG. 3, each participant holds the same set of samples, but the number or type of the features it holds for those samples differs; for example, units or companies serving the same customer group but running different businesses, such as banks, securities firms and property agencies, may wish to cooperate to obtain a global classifier that takes more comprehensive factors into account.
The embodiment of the invention is suitable for horizontally distributed data sets, i.e., the N participants each hold a data set, and the data sets have the same features.
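A quick numeric illustration of this horizontal (by-sample) partition, with assumed sizes: every participant's block shares the same D feature columns but holds its own M_i rows.

```python
import numpy as np

# Horizontal partition: one global sample matrix split by rows.
# N = 3 participants, D = 5 shared features, M_i rows each (assumed sizes).
D, sizes = 5, [4, 6, 2]
X = np.arange(sum(sizes) * D, dtype=float).reshape(-1, D)

offsets = np.cumsum([0] + sizes)
parts = [X[offsets[i]:offsets[i + 1]] for i in range(len(sizes))]

for i, X_i in enumerate(parts):
    print(f"participant {i}: X_i shape {X_i.shape}")   # (M_i, D), same D for all
```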
S3, training and outputting a global classifier by the N participants through ADMM parameter interaction, wherein the interaction parameters are protected by the FSVM-C privacy protection scheme based on a secret sharing mechanism.
Security and privacy protection are very important propositions for a distributed system, in which a large amount of data and information is exchanged. Communication among mutually untrusting users on different machines must ensure that sensitive information is not leaked, guarantee confidentiality, and prevent information from being stolen by malicious nodes.
There are two types of privacy protection schemes. The first is the data perturbation method: controllable noise is added to the intermediate states so that differential privacy is satisfied, mainly using differential privacy technology; the added noise, however, inevitably compromises the accuracy of the optimization result. The second is the cryptographic method; traditional cryptographic techniques, however, cannot be applied directly in a fully distributed scenario without third-party assistance, while two cryptographic methods do suit fully distributed scenarios: secure multi-party computation and homomorphic encryption. Homomorphic encryption is divided into fully homomorphic encryption, which cannot be used for real-time optimization, and partially homomorphic encryption, which introduces a large computational overhead. Secure multi-party computation, broadly speaking, means that computation in the ciphertext state gives the same result as computation in the plaintext state, i.e., processing the data does not affect the final result to be computed; today it generally refers to Yao's garbled circuits and secret sharing mechanisms. The embodiment of the invention finds that the data perturbation method is efficient but can neither provide the strong security provable by cryptography nor avoid damaging the accuracy of the optimization result, and that the computation and communication costs of multi-data-source collaborative training schemes realized with homomorphic encryption are excessive.
The idea of the secret sharing mechanism is to split one secret, distribute the resulting sub-secrets to the other participants, and let the secret be recovered only when a suitable number of participants cooperate. The secret is split by an appropriate splitting means (the embodiment of the invention adopts additive secret splitting); after the sub-secrets are split and distributed, each party can perform local operations independently, and the result obtained by finally reconstructing the secrets is consistent with the result of computing directly on the unsplit secrets, so the mechanism suits computation performed collaboratively on inputs split among multiple parties, and the information of the input data is not revealed when the computation result is obtained. The mathematical definition is as follows: a secret x is split into n shares, with a recovery threshold t and a recovery function R such that for any m ≥ t shares, R(x_{i_1}, ..., x_{i_m}) = x; that is, the secret is divided into n shares, and it can be reconstructed only when no fewer than t participants cooperate. According to the type of computation, secret sharing is classified into Boolean sharing and arithmetic sharing, which correspond to Boolean-level and arithmetic-level computation respectively (Boolean sharing is employed in the embodiments of the present invention).
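The additive splitting described here can be demonstrated directly: an n-of-n additive scheme in which parties add their shares of two secrets locally, and reconstruction yields the plaintext sum — exactly the property that lets each party operate on shares independently. The modulus and share count below are illustrative assumptions.

```python
import secrets

P = 2**61 - 1                                # working modulus (assumption)

def split(x, n):
    """n-of-n additive sharing: x = s_1 + ... + s_n (mod P)."""
    shares = [secrets.randbelow(P) for _ in range(n - 1)]
    shares.append((x - sum(shares)) % P)
    return shares

def recover(shares):                         # the recovery function R
    return sum(shares) % P

x, y, n = 41, 1301, 5
sx, sy = split(x, n), split(y, n)
# Each party adds its own two shares locally -- no communication needed --
# and reconstructing the local results equals the plaintext sum.
local = [(a + b) % P for a, b in zip(sx, sy)]
assert recover(local) == (x + y) % P
print(recover(local))                        # 1342
```

Any subset of fewer than n shares is uniformly random, so the split reveals nothing about x or y on its own.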
As shown in FIG. 4, in the training phase, the N participants respectively solve their local sub-optimization problems and use the ADMM algorithm to exchange parameters and solve the global classifier. Further, the embodiment of the invention provides the FSVM-C privacy protection scheme, which combines the ADMM algorithm with Boolean sharing and additive secret sharing to solve the global classifier securely.
The embodiment of the invention defines privacy as the confidentiality of the intermediate states of the training process and of the local support vector machine sub-problems. Because ADMM requires parameter interaction among the entities in a distributed scenario, the embodiment of the invention protects the privacy of the objective function by protecting the interaction parameters.
Optionally, the step S3 specifically includes:
s31, all the participants reach consensus in advance, wherein the consensus comprises the number N of nodes and a penalty factor C, and each participant initializes v 0 and λ0 Setting a threshold epsilon i, wherein vi = [ω T , b i ] T ∈ R D+1 D is the feature quantity of the data set, v= [ V ] T , v T , . . . , v T ]Representing v i Is a collection of (3);
s32, all the participants and the adjacent nodes generate a multiplication triplet table of rho increment sequences and secret sharing multiplication calculation in advance, and simultaneously calculate matrix operation which is not changed in the iterative process in advance;
s33, in the t-th iteration, the participant i first interacts with all neighbor nodes j, e ij E, computing Σρ using FSCM-C privacy protection scheme t [v t + v t ]Then calculate Lagrangian multiplier
S34, the party i updates v according to the formula (12) t+1 And broadcastTo all neighbor nodes j, e ij ∈ E;
(12)
S35, the party i first interacts with all neighboring nodes j, e ij E, performing cooperative computing, and performing security computing by adopting FSCM-C privacy protection scheme, and updating lambda t+1
S36, calculating a stop condition R t+1 If, as shown in formula (13)Stopping iteration, otherwise, returning to the step (3) to continue iteration;
(13)
after iterative training, all the participants obtain global optimal solutions, and then obtain global maximum interval classification functions
Optionally, the FSCM-C privacy protection scheme specifically includes:
let [ N ]]As a collection of participants, B i = [X i , 1 i ],1 i ∈ R M i,X i Representing a matrix of samples, Y i = diag ([y i1 , . . . , y iM ]) Diagonal matrix expressed as label, convenient for subsequent simplified model, xi i = [ξ i1 , . . . , ξ iM i ]A relaxation variable representing the ith-th party, C being a support vector machine penalty factor, and additionally, setting I D+1 Representing a unit vector with dimension D+1, N participants output a maximum interval linear classification function through a support vector machine model based on consensus, a network with N participants is assumed, a graph G= { N, E } represents a communication mode of the network, E represents a set of communication connection among the participants, and any node i epsilon [ N ]]And only its neighbor nodesInteraction is performed and the connection between two nodes is denoted as e ij ∈ E;
Participant(s)Neighbor node local computation +.>Pre-generated parameters->Performing the calculation of a multiplication triplet table;
participant(s)For-> and />Performing addition secret segmentation;
participant(s) Send sub-secret to participant->
Participant(s)For-> and />Performing addition secret segmentation;
participant(s)Send sub-secret to participant->
Participant(s)Local calculation +.>
Participant(s)Local calculation +.>
Both sides reconstruct and output the secret
The Algorithm is shown in Algorithm 1:
As shown in Algorithm 1, for the t-th iteration, introducing the secret sharing technique increases the interaction between the nodes of party i and party j; however, many computations requiring interaction, such as the ρ-increment sequences and the multiplication triples, can be pre-computed: they are independent of the iterative process and do not increase the computational overhead of the training process. In addition, the computational overhead of secret sharing is far smaller than that of homomorphic encryption. Although secret sharing is relatively inefficient compared with differential privacy, it is chosen as the basis of the security scheme here because of its high security and the large amount of pre-computation, which increases the efficiency of the training process. After ρ^{t+1}(v_i^{t+1} − v_j^{t+1}) is computed securely, the iterative computation for solving the optimization problem continues.
The privacy protection proof of the embodiment of the invention is given below:
The scheme of the embodiment of the invention is considered secure if, throughout the execution of the scheme, all participants can obtain only the final global classifier and no additional information from the training process. The privacy protection of the scheme is proved below: if, for any attacker on the real protocol, the ideal protocol and the real protocol are indistinguishable, i.e., the information revealed by the real protocol does not exceed that of the ideal protocol, then the scheme achieves the goal of privacy protection.
The privacy-preserving proof is analyzed with the universally composable model. Let the information obtained by a real-protocol participant in the interactive execution of the scheme with the other participants be View_real[π, s]. Assume a trusted simulator S that can randomly simulate the users' inputs so as to simulate the execution of the secure training scheme and run the ideal protocol, and let View_ideal[π, s] denote the information obtained from executing the ideal protocol. If the quantity in equation (14) is negligible, i.e., the two views are indistinguishable, then the protocol π can be considered to reveal no additional data information:

$$\big|\, View_{\mathrm{real}}[\pi, s] - View_{\mathrm{ideal}}[\pi, s] \,\big| \qquad (14)$$

The training scheme of the FSVM-C privacy protection scheme involves N participants. Assume there is a semi-honest attacker A who can corrupt other participants to obtain their original data; here, assume attacker A corrupts N−1 participants P_1, P_2, ..., P_{N−1}. Security is defined such that attacker A can learn only the data of the corrupted participants and the final global classifier, and cannot learn the information of the remaining participant.
The simulator S feeds training samples into the training scheme of the FSVM-C privacy protection scheme, which finally solves the global optimal solution, namely the maximum-margin linear classification function, collaboratively. First, since the training scheme of the FSVM-C privacy protection scheme requires no interaction of original data, each participant solves its local optimization sub-problem locally with its own training samples, so no information about the original data can be leaked; at this stage neither the simulator nor the real-protocol participants obtain any information, while the goal of collaboratively updating the training parameters of the support vector machine is achieved. Finally, both the simulator and the real-protocol participants obtain the global classifier g; let the information obtained at this step be denoted T(g). The information obtained by the simulator and by the real-protocol participants is shown in the formula:

$$View_{\mathrm{real}}[\pi, s] = View_{\mathrm{ideal}}[\pi, s] = T(g) \qquad (15)$$

That is, the real protocol and the simulator are indistinguishable. Comparing the information view of the simulator S with that of the real protocol, the information revealed by the FSVM-C real protocol does not exceed that of the ideal protocol, so the privacy protection scheme for the support vector machine with horizontally distributed data achieves the privacy protection target.
The embodiment of the invention provides the FSVM-C privacy protection scheme, based on a secret sharing mechanism, for the distributed support vector machine in the horizontally distributed data scenario; by combining ADMM with a variable penalty matrix and the secret sharing technique, privacy protection of the original data and of the training intermediate states is realized in the pre-computation stage and the training stage. Meanwhile, the security of the scheme is analyzed under the honest-but-curious security model, proving that the scheme can securely accomplish the goal of multi-party collaborative training of the support vector machine.
As shown in fig. 5, an embodiment of the present invention further provides a distributed support vector machine training system based on a secret sharing mechanism, where the system includes:
a decomposition module 510, configured to decompose the optimization problem of the support vector machine (SVM) into N sub-optimization problems using the alternating direction method of multipliers (ADMM) algorithm;
a local solving module 520, configured for the N participants to solve the N sub-optimization problems locally respectively, wherein the N participants each hold a horizontally distributed data set, and the data sets have the same features;
and a training module 530, configured for the N participants to train and output a global classifier through ADMM parameter interaction, wherein the interaction parameters are protected by the FSVM-C privacy protection scheme based on a secret sharing mechanism.
Optionally, in the SVM classification problem, the feature space of the input samples and labels is given, and the hyperplane to be solved is denoted as $w^\top x + b = 0$, where w is the normal vector of the hyperplane and b is the intercept; the model of the SVM is:

$$\min_{w,b,\xi}\ \frac{1}{2}\|w\|^2 + C\sum_{i=1}^{N}\xi_i \quad \text{s.t.}\quad y_i(w^\top x_i + b) \ge 1 - \xi_i,\ \xi_i \ge 0,\ i = 1,\dots,N \qquad (1)$$

where N is the number of samples, x_i denotes the i-th sample, y_i denotes the label corresponding to the i-th sample, and ξ_i is a relaxation variable; each sample corresponds to one relaxation variable, the relaxation variables of non-outlier points are all 0, and ξ_i indicates the degree to which a sample fails to satisfy the constraint, thereby introducing fault tolerance; C is a penalty factor, and w and b are learned and output by the training method;

the Lagrangian function L for solving the model is shown in equation (2):

$$L(w,b,\xi,\alpha,\mu) = \frac{1}{2}\|w\|^2 + C\sum_{i=1}^{N}\xi_i - \sum_{i=1}^{N}\alpha_i\big[y_i(w^\top x_i + b) - 1 + \xi_i\big] - \sum_{i=1}^{N}\mu_i\xi_i \qquad (2)$$

where α and µ are Lagrangian multipliers, α_i ≥ 0, µ_i ≥ 0;

by equation (2), the support vector machine problem is converted into the min-max problem of L, the goal being to find the values of w, b and ξ that minimize L and the α that maximizes L. Lagrangian duality is applied here: solving the dual problem yields the optimal solution of the original problem and reduces the solving difficulty. The dual of the original problem is a max-min problem: first solve for the w, b and ξ that minimize L, then maximize L with respect to α. The dual problem is shown in equation (3):

$$\max_{\alpha}\ \sum_{i=1}^{N}\alpha_i - \frac{1}{2}\sum_{i=1}^{N}\sum_{j=1}^{N}\alpha_i\alpha_j y_i y_j x_i^\top x_j \quad \text{s.t.}\quad \sum_{i=1}^{N}\alpha_i y_i = 0,\ 0 \le \alpha_i \le C \qquad (3)$$

where α = (α_1, α_2, ..., α_N) is the Lagrangian multiplier vector;
in the model training stage, the optimization problem is solved by the steepest descent method combined with the exterior-point penalty method, with the following steps:
(1) Outer loop
Step 1: select an initial point α^0 and an initial penalty factor M_1 = 1, set a precision ε_1 > 0, and let k := 1;
Step 2: taking α^{k-1} as the initial point, solve the constrained optimization problem as in equation (4):
(4)
enter the inner-loop steepest descent method to solve F(α, M_k) and obtain its minimum point, denoted α^k, where:
(5)
Step 3: if the precision requirement ε_1 is met, stop the calculation and take α^k as the approximate minimum point; otherwise let M_{k+1} = cM_k, k := k+1, and go to Step 2;
(2) Inner loop
Step 1: take α^{k-1} as the inner-loop initial point α_0, set a precision ε_2 > 0, and let t := 1;
Step 2: determine the search direction; for each α_i compute:
(6)
where:
(7)
finally obtaining the search direction d_t:
(8)
Step 3: starting from the current point, find the step length λ_t along the direction d_t;
Step 4: compute α_{t+1} = α_t + λ_t d_t;
Step 5: if the precision requirement ε_2 is met, stop the iteration; otherwise let t := t+1 and return to Step 2.
Optionally, assuming N = 2, the S1 specifically includes: the optimization problem of the SVM is decomposed into two sub-optimization problems f(x) and g(z):

$$\min_{x,z}\ f(x) + g(z) \quad \text{s.t.}\quad Ax + Bz = c \qquad (9)$$

where x ∈ R^n and z ∈ R^M are the optimization variables, A and B are constraint matrices and c is a constant vector; the objective function consists of the two parts f(x) and g(z); as can be seen, the originally coupled variables are split and treated as different variables, the constraint is handled in the same way, and the iterative process then optimizes the variables alternately;

the augmented Lagrangian function of equation (9) is shown below:

$$L_\rho(x,z,\lambda) = f(x) + g(z) + \lambda^\top(Ax + Bz - c) + \frac{\rho}{2}\,\|Ax + Bz - c\|_2^2 \qquad (10)$$

where λ is the dual variable and ρ is the coefficient of the quadratic penalty term;

the specific iteration steps of the ADMM algorithm are as follows:

$$x^{t+1} = \arg\min_x L_\rho(x, z^t, \lambda^t),\qquad z^{t+1} = \arg\min_z L_\rho(x^{t+1}, z, \lambda^t),\qquad \lambda^{t+1} = \lambda^t + \rho\,(Ax^{t+1} + Bz^{t+1} - c) \qquad (11)$$
Optionally, the training module is specifically configured such that:
all the participants reach consensus in advance, the consensus including the number of nodes N and the penalty factor C, and each participant initializes v^0 and λ^0 and sets a threshold ε_i, where v_i = [ω_i^T, b_i]^T ∈ R^{D+1}, D is the number of features of the data set, and V = [v_1^T, v_2^T, ..., v_N^T] denotes the collection of the v_i;
all the participants and their neighboring nodes generate in advance the ρ-increment sequences and the multiplication-triple tables for secret-sharing multiplication, and pre-compute the matrix operations that remain unchanged during the iterative process;
in the t-th iteration, participant i first interacts with all neighbor nodes j, e_ij ∈ E, computes Σ_j ρ^t [v_i^t + v_j^t] using the FSVM-C privacy protection scheme, and then computes the Lagrangian multiplier λ_i^t;
participant i updates v_i^{t+1} according to equation (12) and broadcasts it to all neighbor nodes j, e_ij ∈ E;
(12)
participant i first interacts with all neighbor nodes j, e_ij ∈ E, to perform collaborative computation, carries out the secure computation with the FSVM-C privacy protection scheme, and updates λ^{t+1};
the stopping condition R^{t+1} is computed as shown in equation (13); if it is satisfied, the iteration stops, otherwise the process returns to the neighbor-interaction step and continues iterating;
(13)
after iterative training, all the participants obtain the global optimal solution, and thereby the global maximum-margin classification function.
Optionally, the FSCM-C privacy protection scheme specifically includes:
let [ N ]]As a collection of participants, B i = [X i , 1 i ],1 i ∈ R M i,X i Representing a matrix of samples, Y i = diag ([y i1 , . . . , y iM ]) Diagonal matrix expressed as label, convenient for subsequent simplified model, xi i = [ξ i1 , . . . , ξ iM i ]A relaxation variable representing the ith-th party, C being a support vector machine penalty factor, and additionally, setting I D+1 The unit vector with dimension D+1 is represented, N participants output the maximum interval linear classification function through a support vector machine model based on consensus, a network with N participants is assumed, the communication mode of the network is represented by a graph G= { N, E }, and E table Illustrating a collection of communication connections between participants, arbitrary node i e N]Interact only with its neighboring nodes, the connection between the two nodes being denoted as e ij ∈ E;
Participant(s)Local computation with neighbor node->Pre-generated parameters->Performing the calculation of a multiplication triplet table;
participant(s)For-> and />Performing addition secret segmentation;
participant(s)Send sub-secret to participant->
Participant(s)For-> and />Performing row addition secret segmentation;
participant(s)Send sub-secret to participant->
Participant(s)Local calculation +.>
Participant(s)Local calculation +.>
Both sides reconstruct and output the secret
The functional structure of the distributed support vector machine training system based on the secret sharing mechanism provided by the embodiment of the invention corresponds to the distributed support vector machine training method based on the secret sharing mechanism provided by the embodiment of the invention, and is not described herein.
Fig. 6 is a schematic structural diagram of an electronic device 600 according to an embodiment of the present invention, where the electronic device 600 may have a relatively large difference due to different configurations or performances, and may include one or more processors (central processing units, CPU) 601 and one or more memories 602, where at least one instruction is stored in the memories 602, and the at least one instruction is loaded and executed by the processors 601 to implement the steps of the above-described distributed support vector machine training method based on a secret sharing mechanism.
In an exemplary embodiment, a computer-readable storage medium is further provided, such as a memory comprising instructions executable by a processor in a terminal to perform the above distributed support vector machine training method based on a secret sharing mechanism. For example, the computer-readable storage medium may be a ROM, a random access memory (RAM), a CD-ROM, a magnetic tape, a floppy disk, an optical data storage device, or the like.
It will be understood by those skilled in the art that all or part of the steps for implementing the above embodiments may be implemented by hardware, or may be implemented by a program for instructing relevant hardware, where the program may be stored in a computer readable storage medium, and the storage medium may be a read-only memory, a magnetic disk or an optical disk, etc.
The foregoing description of the preferred embodiments of the invention is not intended to limit the invention to the precise form disclosed, and any such modifications, equivalents, and alternatives falling within the spirit and scope of the invention are intended to be included within the scope of the invention.

Claims (2)

1. A distributed support vector machine training method based on a secret sharing mechanism, the method comprising:
S1, decomposing an optimization problem of a support vector machine (SVM) into N sub-optimization problems by using the alternating direction method of multipliers (ADMM) algorithm;
S2, the N participants respectively solving the N sub-optimization problems locally, wherein the N participants each hold a horizontally partitioned data set, and the data sets share the same features;
S3, the N participants exchanging parameters via the ADMM algorithm to train and output a global classifier, wherein, when the N participants exchange parameters via the ADMM algorithm, the parameters are protected by the FSVM-C privacy protection scheme based on a secret sharing mechanism;
in the SVM classification problem, given input samples and labels in the feature space, the separating hyperplane to be solved is denoted w^T x + b = 0, where w is the normal vector of the hyperplane and b is the intercept; the model of the SVM is:
min_{w,b,ξ} (1/2)‖w‖² + C Σ_{i=1}^{N} ξ_i,  s.t.  y_i(w^T x_i + b) ≥ 1 − ξ_i,  ξ_i ≥ 0,  i = 1, . . . , N    (1)
where N is the number of samples, x_i denotes the i-th sample, y_i denotes the label of the i-th sample, and ξ_i is a slack variable; each sample has its own slack variable, which measures the degree to which that sample violates the constraint and thereby introduces fault tolerance, and the slack variables of non-outlier points are all 0; C is a penalty factor, and w and b are learned and output by the training method;
the Lagrangian function L used to solve the model is shown in equation (2):
L(w, b, ξ, α, µ) = (1/2)‖w‖² + C Σ_{i=1}^{N} ξ_i − Σ_{i=1}^{N} α_i [y_i(w^T x_i + b) − 1 + ξ_i] − Σ_{i=1}^{N} µ_i ξ_i    (2)
where α and µ are Lagrange multipliers, α_i ≥ 0, µ_i ≥ 0;
equation (2) converts the support vector machine problem into a min-max problem on L: the goal is to find the α that maximizes the L function and the values of w, b, and ξ that minimize it. Lagrangian duality is applied here: the optimal solution of the original problem is obtained by solving the dual problem, which reduces the solving difficulty. The dual of the original problem is a max-min problem: first solve for the w, b, and ξ that minimize the L function, and then maximize L over α; the dual problem is shown in equation (3):
max_α Σ_{i=1}^{N} α_i − (1/2) Σ_{i=1}^{N} Σ_{j=1}^{N} α_i α_j y_i y_j x_i^T x_j,  s.t.  Σ_{i=1}^{N} α_i y_i = 0,  0 ≤ α_i ≤ C,  i = 1, . . . , N    (3)
where α = (α_1, α_2, . . . , α_N) is the Lagrange multiplier vector;
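The stationarity step that takes (2) to (3) is left implicit above; for readability, the standard conditions (textbook Lagrangian duality, nothing patent-specific) are:

∂L/∂w = 0 ⇒ w = Σ_{i=1}^{N} α_i y_i x_i;  ∂L/∂b = 0 ⇒ Σ_{i=1}^{N} α_i y_i = 0;  ∂L/∂ξ_i = 0 ⇒ C = α_i + µ_i.

Substituting these back into L eliminates w, b, and ξ, and combining α_i ≥ 0 and µ_i ≥ 0 with C = α_i + µ_i gives the box constraint 0 ≤ α_i ≤ C in (3).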
in the model training stage, the optimization problem is solved by the steepest descent method combined with an exterior-point penalty method, with the following steps:
(1) Outer loop
Step 1: select an initial point α^0, an initial penalty factor M_1 = 1, and a precision ε_1 > 0; set k := 1;
Step 2: with α^{k−1} as the initial point, solve the constrained optimization problem of equation (4):
(4)
enter the inner-loop steepest descent method to solve F(α, M_k), obtaining the minimum point, denoted α^k, wherein:
(5)
Step 3: if the stopping criterion with precision ε_1 is satisfied, stop the calculation and take α^k as the approximate minimum point; otherwise let M_{k+1} = cM_k, set k := k + 1, and return to Step 2;
(2) Inner loop
Step 1: take α^{k−1} as the inner-loop initial point α^0, with precision ε_2 > 0; set t := 1;
Step 2: determine the search direction; for each α_i compute:
(6)
wherein:
(7)
finally obtaining the search direction d_t:
(8)
Step 3: starting from α_t, compute the step length λ_t along the direction d_t;
Step 4: compute α_{t+1} = α_t + λ_t d_t;
Step 5: if the precision requirement ε_2 is satisfied, stop the iteration; otherwise set t := t + 1 and return to Step 2;
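Equations (4)-(8) survive only as bare numbers in this text, so the following Python sketch shows only the two-level structure the steps describe: an outer exterior-penalty loop wrapping an inner steepest-descent loop. The quadratic penalty in violation(), the numerical gradient, and the backtracking step rule are illustrative assumptions, not the patent's formulas (6)-(8).

```python
import numpy as np

def dual_neg(alpha, Q):
    """Negative of the dual objective in (3) (the part to minimize)."""
    return -(alpha.sum() - 0.5 * alpha @ Q @ alpha)

def violation(alpha, y, C):
    """Quadratic exterior penalty on the dual constraints (assumed form)."""
    return (alpha @ y) ** 2 \
        + np.sum(np.minimum(alpha, 0.0) ** 2) \
        + np.sum(np.maximum(alpha - C, 0.0) ** 2)

def num_grad(f, x, h=1e-6):
    """Central-difference gradient; adequate for a sketch."""
    g = np.zeros_like(x)
    for i in range(x.size):
        e = np.zeros_like(x); e[i] = h
        g[i] = (f(x + e) - f(x - e)) / (2 * h)
    return g

def train_dual(Q, y, C, c=10.0, eps1=1e-8, eps2=1e-4):
    alpha, M = np.zeros(y.size), 1.0           # outer Step 1
    for _ in range(30):                        # outer loop over M_k
        F = lambda a: dual_neg(a, Q) + M * violation(a, y, C)
        for _ in range(500):                   # inner steepest descent
            d = -num_grad(F, alpha)            # search direction d_t
            if np.linalg.norm(d) < eps2:       # inner stopping test
                break
            lam = 1.0                          # backtracking line search (Step 3)
            while F(alpha + lam * d) > F(alpha) and lam > 1e-12:
                lam *= 0.5
            alpha = alpha + lam * d            # Step 4: alpha_{t+1} = alpha_t + lam*d_t
        if M * violation(alpha, y, C) < eps1:  # outer stopping test
            return alpha
        M *= c                                 # M_{k+1} = c * M_k, k := k + 1
    return alpha
```

Here Q stands for the N×N matrix with entries y_i y_j x_i^T x_j from the dual (3).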
assuming N = 2, step S1 specifically includes: decomposing the optimization problem of the SVM into two sub-optimization problems f(x) and g(z):
min_{x,z} f(x) + g(z),  s.t.  Ax + Bz = c    (9)
where x ∈ R^n and z ∈ R^M are the optimization variables, and the matrices A ∈ R^{s×n}, B ∈ R^{s×M}, and c ∈ R^s; the objective function consists of the two parts f(x) and g(z). In this form, the decomposed variables are treated as different variables while the constraint couples them, and the iterative process optimizes the variables alternately;
the augmented lagrangian function of equation (9) is shown below:
L_ρ(x, z, λ) = f(x) + g(z) + λ^T(Ax + Bz − c) + (ρ/2)‖Ax + Bz − c‖²    (10)
where λ is the dual variable and ρ is the coefficient of the quadratic penalty term;
the specific iteration steps of the ADMM algorithm are as follows:
x^{k+1} = argmin_x L_ρ(x, z^k, λ^k);  z^{k+1} = argmin_z L_ρ(x^{k+1}, z, λ^k);  λ^{k+1} = λ^k + ρ(Ax^{k+1} + Bz^{k+1} − c)    (11);
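As a concrete toy instance of iteration (11) (an illustration, not the patent's SVM subproblems), take f(x) = ‖x − a‖² and g(z) = ‖z − b‖² with the consensus constraint x − z = 0, i.e. A = I, B = −I, c = 0, so both subproblems have closed-form minimizers:

```python
import numpy as np

def admm_consensus(a, b, rho=1.0, iters=100):
    """ADMM iteration (11) for: min ||x-a||^2 + ||z-b||^2  s.t.  x - z = 0."""
    x = np.zeros_like(a); z = np.zeros_like(b); lam = np.zeros_like(a)
    for _ in range(iters):
        # x-update: argmin_x ||x-a||^2 + lam^T(x-z) + (rho/2)||x-z||^2
        x = (2 * a - lam + rho * z) / (2 + rho)
        # z-update: argmin_z ||z-b||^2 - lam^T z + (rho/2)||x-z||^2
        z = (2 * b + lam + rho * x) / (2 + rho)
        # dual update: lam^{k+1} = lam^k + rho*(x - z)
        lam = lam + rho * (x - z)
    return x, z

x, z = admm_consensus(np.array([1.0, 2.0]), np.array([3.0, 4.0]))
# x and z converge to the consensus minimizer (a + b) / 2 = [2.0, 3.0]
```

Replacing f and g with the participants' local SVM objectives, and tying their local variables together through the consensus constraint, yields the distributed iteration of steps S31 to S36 below.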
the step S3 specifically comprises the following steps:
S31, all participants reach consensus in advance, including the number of nodes N and the penalty factor C; each participant initializes v^0 and λ^0 and sets a threshold ε_i, where v_i = [ω^T, b_i]^T ∈ R^{D+1}, D is the number of features of the data set, and V = [v_1^T, v_2^T, . . . , v_N^T] denotes the collection of the v_i;
S32, all participants and their adjacent nodes generate, in advance, the increasing sequence of ρ and the multiplication triple tables used by secret-sharing multiplication, and pre-compute the matrix operations that do not change during the iterations;
S33, in the t-th iteration, participant i first interacts with all neighbor nodes j, e_ij ∈ E, and securely computes the sum ρ_t Σ_j (v_i^t + v_j^t) using the FSVM-C privacy protection scheme, and then computes the Lagrange multiplier;
S34, participant i updates v^{t+1} according to equation (12) and broadcasts it to all neighbor nodes j, e_ij ∈ E:
(12)
S35, participant i first performs collaborative computation with all neighbor nodes j, e_ij ∈ E, where the secure computation likewise uses the FSVM-C privacy protection scheme, and then updates λ^{t+1};
S36, compute the stopping condition R^{t+1} as shown in equation (13); if it is satisfied, stop the iteration, otherwise return to step S33 and continue iterating;
(13)
after iterative training, all participants obtain the globally optimal solution and thereby obtain the global maximum-margin classification function.
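Once consensus on the global v = [w^T, b]^T is reached, each participant can apply the maximum-margin classifier locally; a trivial sketch with assumed array names:

```python
import numpy as np

def classify(w, b, X):
    """Global maximum-margin linear classifier: sign(w^T x + b) per row of X."""
    return np.sign(X @ w + b)
```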
The FSVM-C privacy protection scheme specifically comprises the following steps:
let [N] be the set of participants, B_i = [X_i, 1_i] with 1_i ∈ R^{M_i}, where X_i denotes the sample matrix of participant i, Y_i = diag([y_{i1}, . . . , y_{iM_i}]) is the diagonal label matrix introduced to simplify the subsequent model, ξ_i = [ξ_{i1}, . . . , ξ_{iM_i}] denotes the slack variables of the i-th participant, and C is the support vector machine penalty factor; in addition, let I_{D+1} denote the unit vector of dimension D+1. The N participants output the maximum-margin linear classification function through a consensus-based support vector machine model. Assume a network of N participants whose communication topology is represented by the graph G = {N, E}, where E denotes the set of communication links between the participants; any node i ∈ [N] interacts only with its neighboring nodes, and the link between two nodes is denoted e_ij ∈ E;
participant i, together with its neighbor node j, locally computes the multiplication triple table from parameters generated in advance;
participant i performs row-wise additive secret sharing on its local quantities;
participant i sends the resulting sub-secrets to participant j;
participant j performs additive secret sharing on its local quantities;
participant j sends the resulting sub-secrets to participant i;
participant i locally computes its share of the result;
participant j locally computes its share of the result;
both parties reconstruct and output the secret.
2. A distributed support vector machine training system based on a secret sharing mechanism, the system comprising:
the decomposition module is used for decomposing the optimization problem of the support vector machine (SVM) into N sub-optimization problems by using the alternating direction method of multipliers (ADMM) algorithm;
the local solving module is used for the N participants to respectively solve the N sub-optimization problems locally, wherein the N participants each hold a horizontally partitioned data set, and the data sets share the same features;
the training module is used for the N participants to exchange parameters via the ADMM algorithm to train and output a global classifier, wherein, when the N participants exchange parameters via the ADMM algorithm, the parameters are protected by the FSVM-C privacy protection scheme based on a secret sharing mechanism;
in the SVM classification problem, given input samples and labels in the feature space, the separating hyperplane to be solved is denoted w^T x + b = 0, where w is the normal vector of the hyperplane and b is the intercept; the model of the SVM is:
min_{w,b,ξ} (1/2)‖w‖² + C Σ_{i=1}^{N} ξ_i,  s.t.  y_i(w^T x_i + b) ≥ 1 − ξ_i,  ξ_i ≥ 0,  i = 1, . . . , N    (1)
where N is the number of samples, x_i denotes the i-th sample, y_i denotes the label of the i-th sample, and ξ_i is a slack variable; each sample has its own slack variable, which measures the degree to which that sample violates the constraint and thereby introduces fault tolerance, and the slack variables of non-outlier points are all 0; C is a penalty factor, and w and b are learned and output by the training method;
the Lagrangian function L used to solve the model is shown in equation (2):
L(w, b, ξ, α, µ) = (1/2)‖w‖² + C Σ_{i=1}^{N} ξ_i − Σ_{i=1}^{N} α_i [y_i(w^T x_i + b) − 1 + ξ_i] − Σ_{i=1}^{N} µ_i ξ_i    (2)
where α and µ are both Lagrange multipliers, α_i ≥ 0, µ_i ≥ 0;
equation (2) converts the support vector machine problem into a min-max problem on L: the goal is to find the α that maximizes the L function and the values of w, b, and ξ that minimize it. Lagrangian duality is applied here: the optimal solution of the original problem is obtained by solving the dual problem, which reduces the solving difficulty. The dual of the original problem is a max-min problem: first solve for the w, b, and ξ that minimize the L function, and then maximize L over α; the dual problem is shown in equation (3):
max_α Σ_{i=1}^{N} α_i − (1/2) Σ_{i=1}^{N} Σ_{j=1}^{N} α_i α_j y_i y_j x_i^T x_j,  s.t.  Σ_{i=1}^{N} α_i y_i = 0,  0 ≤ α_i ≤ C,  i = 1, . . . , N    (3)
where α = (α_1, α_2, . . . , α_N) is the Lagrange multiplier vector;
in the model training stage, the training module is specifically configured to solve the optimization problem by the steepest descent method combined with an exterior-point penalty method, with the following steps:
(1) Outer loop
Step 1: select an initial point α^0, an initial penalty factor M_1 = 1, and a precision ε_1 > 0; set k := 1;
Step 2: with α^{k−1} as the initial point, solve the constrained optimization problem of equation (4):
(4)
enter the inner-loop steepest descent method to solve F(α, M_k), obtaining the minimum point, denoted α^k, wherein:
(5)
Step 3: if the stopping criterion with precision ε_1 is satisfied, stop the calculation and take α^k as the approximate minimum point; otherwise let M_{k+1} = cM_k, set k := k + 1, and return to Step 2;
(2) Inner loop
Step 1: take α^{k−1} as the inner-loop initial point α^0, with precision ε_2 > 0; set t := 1;
Step 2: determine the search direction; for each α_i compute:
(6)
wherein:
(7)
finally obtaining the search direction d_t:
(8)
Step 3: starting from α_t, compute the step length λ_t along the direction d_t;
Step 4: compute α_{t+1} = α_t + λ_t d_t;
Step 5: if the precision requirement ε_2 is satisfied, stop the iteration; otherwise set t := t + 1 and return to Step 2;
assuming N = 2, the decomposition module is specifically configured to decompose the optimization problem of the SVM into two sub-optimization problems f(x) and g(z):
min_{x,z} f(x) + g(z),  s.t.  Ax + Bz = c    (9)
where x ∈ R^n and z ∈ R^M are the optimization variables, and the matrices A ∈ R^{s×n}, B ∈ R^{s×M}, and c ∈ R^s; the objective function consists of the two parts f(x) and g(z). In this form, the decomposed variables are treated as different variables while the constraint couples them, and the iterative process optimizes the variables alternately;
The augmented lagrangian function of equation (9) is shown below:
L_ρ(x, z, λ) = f(x) + g(z) + λ^T(Ax + Bz − c) + (ρ/2)‖Ax + Bz − c‖²    (10)
where λ is the dual variable and ρ is the coefficient of the quadratic penalty term;
the specific iteration steps of the ADMM algorithm are as follows:
x^{k+1} = argmin_x L_ρ(x, z^k, λ^k);  z^{k+1} = argmin_z L_ρ(x^{k+1}, z, λ^k);  λ^{k+1} = λ^k + ρ(Ax^{k+1} + Bz^{k+1} − c)    (11);
the training module is specifically configured to:
all participants reach consensus in advance, including the number of nodes N and the penalty factor C; each participant initializes v^0 and λ^0 and sets a threshold ε_i, where v_i = [ω^T, b_i]^T ∈ R^{D+1}, D is the number of features of the data set, and V = [v_1^T, v_2^T, . . . , v_N^T] denotes the collection of the v_i;
all participants and their adjacent nodes generate, in advance, the increasing sequence of ρ and the multiplication triple tables used by secret-sharing multiplication, and pre-compute the matrix operations that do not change during the iterations;
in the t-th iteration, participant i first interacts with all neighbor nodes j, e_ij ∈ E, and securely computes the sum ρ_t Σ_j (v_i^t + v_j^t) using the FSVM-C privacy protection scheme, and then computes the Lagrange multiplier;
participant i updates v^{t+1} according to equation (12) and broadcasts it to all neighbor nodes j, e_ij ∈ E:
(12)
participant i first performs collaborative computation with all neighbor nodes j, e_ij ∈ E, where the secure computation likewise uses the FSVM-C privacy protection scheme, and then updates λ^{t+1};
compute the stopping condition R^{t+1} as shown in equation (13); if it is satisfied, stop the iteration, otherwise return to the neighbor-interaction step above and continue iterating;
(13)
after iterative training, all participants obtain the globally optimal solution and thereby obtain the global maximum-margin classification function.
The FSVM-C privacy protection scheme specifically comprises the following steps:
let [N] be the set of participants, B_i = [X_i, 1_i] with 1_i ∈ R^{M_i}, where X_i denotes the sample matrix of participant i, Y_i = diag([y_{i1}, . . . , y_{iM_i}]) is the diagonal label matrix introduced to simplify the subsequent model, ξ_i = [ξ_{i1}, . . . , ξ_{iM_i}] denotes the slack variables of the i-th participant, and C is the support vector machine penalty factor; in addition, let I_{D+1} denote the unit vector of dimension D+1. The N participants output the maximum-margin linear classification function through a consensus-based support vector machine model. Assume a network of N participants whose communication topology is represented by the graph G = {N, E}, where E denotes the set of communication links between the participants; any node i ∈ [N] interacts only with its neighboring nodes, and the link between two nodes is denoted e_ij ∈ E;
participant i, together with its neighbor node j, locally computes the multiplication triple table from parameters generated in advance;
participant i performs row-wise additive secret sharing on its local quantities;
participant i sends the resulting sub-secrets to participant j;
participant j performs additive secret sharing on its local quantities;
participant j sends the resulting sub-secrets to participant i;
participant i locally computes its share of the result;
participant j locally computes its share of the result;
both parties reconstruct and output the secret.
CN202310869601.5A 2023-07-17 2023-07-17 Secret sharing mechanism-based distributed support vector machine training method and system Active CN116595589B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202310869601.5A CN116595589B (en) 2023-07-17 2023-07-17 Secret sharing mechanism-based distributed support vector machine training method and system


Publications (2)

Publication Number Publication Date
CN116595589A (en) 2023-08-15
CN116595589B (en) 2023-10-10

Family

ID=87612038

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202310869601.5A Active CN116595589B (en) 2023-07-17 2023-07-17 Secret sharing mechanism-based distributed support vector machine training method and system

Country Status (1)

Country Link
CN (1) CN116595589B (en)

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106371610A (en) * 2016-09-23 2017-02-01 重庆金瓯科技发展有限责任公司 Method for detecting driving fatigue based on electroencephalogram
CN113627485A (en) * 2021-07-10 2021-11-09 南京理工大学 Unbalanced big data distributed classification method based on ADMM
CN114925786A (en) * 2022-07-08 2022-08-19 蓝象智联(杭州)科技有限公司 Longitudinal federal linear support vector classification method based on secret sharing
CN115225516A (en) * 2022-07-18 2022-10-21 重庆邮电大学 Least square support vector machine network flow prediction method based on improved ABC-VMD

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8719194B2 (en) * 2011-09-19 2014-05-06 Siemens Aktiengesellschaft Hybrid interior-point alternating directions algorithm for support vector machines and feature selection


Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
Privacy-preserving support vector machine for vertically distributed data; Liu Xiaohong et al.; Journal of Jiamusi University (Natural Science Edition); Vol. 29, No. 03; full text *
Research progress on privacy-preserving methods for SVM; Peng Xiaobing et al.; Journal of Jiangsu University (Natural Science Edition); Vol. 38, No. 01; full text *


Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant