CN115277696A - Cross-network federated learning system and method - Google Patents

Cross-network federated learning system and method

Info

Publication number
CN115277696A
CN115277696A (application CN202210823096.6A)
Authority
CN
China
Prior art keywords
network
external network
participant
intranet
gradient
Prior art date
Legal status
Granted
Application number
CN202210823096.6A
Other languages
Chinese (zh)
Other versions
CN115277696B (English)
Inventor
王济平
黎刚
汤克云
周健雄
杨劲业
高俊杰
Current Assignee
Jingxin Data Technology Co ltd
Original Assignee
Jingxin Data Technology Co ltd
Priority date
Filing date
Publication date
Application filed by Jingxin Data Technology Co ltd
Priority to CN202210823096.6A
Publication of CN115277696A
Application granted
Publication of CN115277696B
Legal status: Active


Classifications

    • H - ELECTRICITY
    • H04 - ELECTRIC COMMUNICATION TECHNIQUE
    • H04L - TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L 67/00 - Network arrangements or protocols for supporting network services or applications
    • H04L 67/01 - Protocols
    • H04L 67/10 - Protocols in which an application is distributed across nodes in the network
    • G - PHYSICS
    • G06 - COMPUTING; CALCULATING OR COUNTING
    • G06N - COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N 20/00 - Machine learning
    • H - ELECTRICITY
    • H04 - ELECTRIC COMMUNICATION TECHNIQUE
    • H04L - TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L 41/00 - Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks
    • H04L 41/08 - Configuration management of networks or network elements
    • H04L 41/0803 - Configuration setting
    • H04L 41/0806 - Configuration setting for initial configuration or provisioning, e.g. plug-and-play
    • H - ELECTRICITY
    • H04 - ELECTRIC COMMUNICATION TECHNIQUE
    • H04L - TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L 41/00 - Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks
    • H04L 41/14 - Network analysis or design
    • H04L 41/145 - Network analysis or design involving simulating, designing, planning or modelling of a network
    • H - ELECTRICITY
    • H04 - ELECTRIC COMMUNICATION TECHNIQUE
    • H04L - TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L 67/00 - Network arrangements or protocols for supporting network services or applications
    • H04L 67/01 - Protocols
    • H04L 67/12 - Protocols specially adapted for proprietary or special-purpose networking environments, e.g. medical networks, sensor networks, networks in vehicles or remote metering networks
    • H - ELECTRICITY
    • H04 - ELECTRIC COMMUNICATION TECHNIQUE
    • H04L - TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L 67/00 - Network arrangements or protocols for supporting network services or applications
    • H04L 67/14 - Session management
    • H04L 67/141 - Setup of application sessions

Landscapes

  • Engineering & Computer Science (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Signal Processing (AREA)
  • Medical Informatics (AREA)
  • Theoretical Computer Science (AREA)
  • Computing Systems (AREA)
  • Software Systems (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Evolutionary Computation (AREA)
  • Physics & Mathematics (AREA)
  • Data Mining & Analysis (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Mathematical Physics (AREA)
  • Artificial Intelligence (AREA)
  • Health & Medical Sciences (AREA)
  • General Health & Medical Sciences (AREA)
  • Computer And Data Communications (AREA)
  • Data Exchanges In Wide-Area Networks (AREA)

Abstract

The invention discloses a cross-network federated learning system and method comprising at least two completely physically isolated networks and an offline transmission module. The extranet initiator and the intranet participant together complete a federated learning joint modeling task. The intranet participant provides local data, accepts the invitation of the extranet initiator, and participates in the federated learning joint modeling task. The extranet coordinator performs aggregation optimization on the intermediate parameters produced while the extranet initiator and the intranet participant execute federated learning iterative computation, generating new intermediate parameters. The extranet transmission monitoring module reads intermediate parameters from the offline transmission module and sends them to the extranet coordinator, and writes intermediate parameters into the offline transmission module after receiving them from the extranet coordinator. The offline transmission module interactively transmits the encrypted intermediate parameters between the two networks. The invention realizes effective transmission of the intermediate data of a federated learning computation task across isolated networks, thereby completing the cross-network federated learning computation task.

Description

Cross-network federated learning system and method
Technical Field
The invention relates to a federated learning system, and in particular to a cross-network federated learning system and method.
Background
Federated learning is defined as a mode in which multiple participants collaboratively complete a machine learning task on the premise that each party's original private data does not cross the privacy boundary defined by that data party. Federated learning mainly involves three roles: participant, central coordinator, and initiator. The participant and the initiator jointly build a machine learning model to perform joint computation tasks, the coordinator transmits the intermediate parameters, and multi-party data fusion analysis applications are realized without data leaving its local domain.
In the prior art, federated learning joint modeling and computation can be realized only when the networks of all data resource participants are mutually reachable; it cannot handle the case where two or more data resource participants, separated by network isolation, wish to realize data fusion applications through federated learning computation. For example, the government affairs field is divided into a government extranet and a government intranet that are completely physically isolated, so the existing mainstream technology cannot establish a federated learning task across these networks and cannot meet application requirements.
Disclosure of Invention
The technical problem to be solved by the invention is, in view of the above defects of the prior art, to provide a system and method that realize effective transmission of the intermediate data of a federated learning computation task across isolated networks, thereby completing the cross-network federated learning computation task.
In order to solve the technical problems, the invention adopts the following technical scheme.
The invention provides a cross-network federated learning system, which includes at least two completely physically isolated networks and an offline transmission module connected between the two networks. The two networks are defined as an extranet and an intranet respectively, and each network includes an initiator, a participant, a coordinator, and a transmission monitoring module, wherein: the extranet initiator is used for completing a federated learning joint modeling task together with the intranet participant; the intranet participant is used for providing local data, accepting the invitation of the extranet initiator, and participating in the federated learning joint modeling task; the extranet coordinator is used for performing aggregation optimization on the intermediate parameters produced while the extranet initiator and the intranet participant execute federated learning iterative computation, generating new intermediate parameters; the extranet transmission monitoring module is used for reading intermediate parameters from the offline transmission module and sending them to the extranet coordinator, and for writing intermediate parameters into the offline transmission module after receiving them from the extranet coordinator; the offline transmission module is used for interactively transmitting the encrypted intermediate parameters between the two networks.
Preferably, the offline transmission module is further configured to store intermediate parameters interactively transmitted between the two networks.
A cross-network federated learning method is implemented based on such a system, which includes at least two completely physically isolated networks and an offline transmission module connected between them; the two networks are defined as an extranet and an intranet respectively, and each network includes an initiator, a participant, a coordinator, and a transmission monitoring module. The method includes the following steps: step S10, initializing the transmission monitoring modules; step S20, the extranet initiator and the intranet participant together complete the creation of a federated learning model, and the extranet coordinator performs environment initialization and starts the first batch iterative computation task; step S30, the extranet coordinator sends the two parties' single-side gradients, iteratively optimized in step S20, to the extranet initiator and the extranet transmission monitoring module respectively, sets the data interval for executing the next iteration batch, and starts the next round of iterative batch computation; step S40, repeating step S30 iteratively until the federated learning task is finished.
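The four steps above can be sketched as a simple driver loop. The callback names below are hypothetical stand-ins for the per-step logic described in the embodiments, not identifiers from the patent.

```python
def run_cross_network_task(init_monitors, first_batch, run_batch, finished):
    """Orchestrate steps S10-S40 of the cross-network method.

    init_monitors -- S10: initialize both transmission monitoring modules
    first_batch   -- S20: create the model and run the first batch iteration
    run_batch     -- S30: one round of gradient exchange and iteration
    finished      -- S40: predicate deciding when the task ends
    """
    init_monitors()                # step S10
    state = first_batch()          # step S20
    while not finished(state):     # step S40: repeat S30 until done
        state = run_batch(state)   # step S30
    return state
```

Each callback would wrap the offline-transmission exchanges detailed in embodiments one to three.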
In the cross-network federated learning system disclosed by the invention, after initialization of the transmission monitoring modules is completed, the extranet initiator and the intranet participant together complete the creation of a federated learning model; the extranet coordinator performs environment initialization, starts the first batch iterative computation task, and sends the two parties' iteratively optimized single-side gradients to the extranet initiator and the extranet transmission monitoring module respectively; it then sets the data interval for executing the next iteration batch, starts the next round of iterative batch computation, and repeats the iterative computation until the federated learning task is finished. Compared with the prior art, the invention enables any data node to create a federated learning task with the federated learning computation nodes of the isolated party's network, even under network isolation. The invention provides a cross-network federated learning offline transmission module, which realizes offline transmission of intermediate parameters during the iteration of a federated learning task and completes the interaction of intermediate parameters in cross-network federated learning task computation.
Drawings
FIG. 1 is a block diagram of the components of the cross-network federated learning system of the present invention;
FIG. 2 is a flow chart of a cross-network federated learning method of the present invention;
FIG. 3 is a flow chart of a first embodiment of the present invention;
FIG. 4 is a flow chart of a second embodiment of the present invention;
FIG. 5 is a first flowchart illustrating a third embodiment of the present invention;
FIG. 6 is a second flowchart illustrating a third embodiment of the present invention.
Detailed Description
The invention is described in more detail below with reference to the figures and examples.
The invention discloses a cross-network federated learning system. Referring to fig. 1, it comprises at least two completely physically isolated networks 1 and an offline transmission module 2 connected between the two networks 1; the two networks 1 are defined as an extranet and an intranet respectively, and each network includes an initiator 10, a participant 11, a coordinator 12 and a transmission monitoring module 13, wherein:
the extranet initiator 10 is used for completing a federated learning joint modeling task together with the intranet participant 11;
the intranet participant 11 is used for providing local data, accepting the invitation of the extranet initiator 10, and participating in the federated learning joint modeling task;
the extranet coordinator 12 is used for performing aggregation optimization on the intermediate parameters produced while the extranet initiator 10 and the intranet participant 11 execute federated learning iterative computation, generating new intermediate parameters;
the external network transmission monitoring module 13 is configured to read the intermediate parameter from the offline transmission module 2, send the intermediate parameter to the external network coordinator 12, receive the intermediate parameter from the external network coordinator 12, and write the intermediate parameter into the offline transmission module 2;
the offline transmission module 2 is configured to interactively transmit the encrypted intermediate parameters between the two networks 1. In addition, the offline transmission module 2 is further configured to store intermediate parameters interactively transmitted between the two networks 1.
In the above system, after initialization of the transmission monitoring modules 13 is completed, the extranet initiator 10 and the intranet participant 11 together complete the creation of the federated learning model; the extranet coordinator 12 performs environment initialization, starts the first batch iterative computation task, and sends the two parties' iteratively optimized single-side gradients to the extranet initiator 10 and the extranet transmission monitoring module 13 respectively; it then sets the data interval for executing the next iteration batch, starts the next round of iterative batch computation, and repeats the iterative computation until the federated learning task is finished. Compared with the prior art, under network isolation either data node can create a federated learning task with the federated learning computation nodes of the isolated party's network: for example, when an intranet node acts as the initiator and an extranet node as the participant, the intranet central side acts as the federated learning coordination node; when an extranet node acts as the initiator and an intranet node as the participant, the extranet central side acts as the federated learning coordination node. The invention provides a cross-network federated learning offline transmission module, which realizes offline transmission of intermediate parameters during the iteration of a federated learning task and completes the interaction of intermediate parameters in cross-network federated learning task computation.
On the basis of the above system, the invention further relates to a cross-network federated learning method, implemented based on the system shown in fig. 1 and fig. 2, wherein the system includes at least two completely physically isolated networks 1 and an offline transmission module 2 connected between the two networks 1; the two networks 1 are defined as an extranet and an intranet respectively, and each network includes an initiator 10, a participant 11, a coordinator 12 and a transmission monitoring module 13. The method includes the following steps:
step S10, initializing the transmission monitoring module 13;
step S20, the extranet initiator 10 and the intranet participant 11 together complete the creation of a federated learning model, and the extranet coordinator 12 performs environment initialization and starts the first batch iterative computation task;
step S30, the extranet coordinator 12 sends the two parties' single-side gradients, iteratively optimized in step S20, to the extranet initiator 10 and the extranet transmission monitoring module 13 respectively, sets the data interval for executing the next iteration batch, and starts the next round of iterative batch computation;
step S40, repeating step S30 iteratively until the federated learning task is finished.
In the above method, under the completely physically isolated environment of the two networks, federated learning clusters are deployed in the respective network environments, with an initiator, a participant, a central node and a transmission monitoring module deployed in each. Transmission of the intermediate data of the federated learning computation task across the isolated networks is realized through the offline transmission module, finally completing a cross-network federated learning computation task. Each module is defined as follows:
the initiator: the method refers to a data application party of the federated learning calculation task, and jointly completes the federated learning joint modeling task under the cooperation of data provided by participants;
the participation method comprises the following steps: the system comprises a data provider of a federated learning calculation task, an assistance initiator, a data processing module and a data processing module, wherein the assistance initiator provides local data, adds the federated learning task and assists the initiator to jointly complete the federated learning joint modeling task;
a central node: the system refers to a coordinator of a federated learning calculation task, and is used for aggregating and optimizing intermediate parameters in the iterative calculation process of an initiator and a participant to generate new intermediate parameters;
intermediate parameters: the method refers to intermediate factors such as a model gradient, a loss value and the like generated in the iterative calculation process of the federal learning;
the transmission monitoring module means: the intermediate parameters are used for receiving or sending the Federal learning iterative computation process to the middle coordinator, and writing the intermediate parameters into the offline transmission module or reading the intermediate parameters from the offline transmission module.
The offline transmission module refers to: the device is used for storing and transmitting the encrypted intermediate parameters calculated by each federal learning iteration, and the offline transmission module comprises but is not limited to a mobile U disk, a gate, a gatekeeper and the like, and can be used for offline data transmission media in a heterogeneous network.
The method comprises the steps of constructing a transmission monitoring module, monitoring intermediate parameters of a federation learning task iteration process of a coordinator under an initiator network environment, writing the intermediate parameters into an offline transmission module, verifying the legality of the offline transmission module, reading the intermediate parameters of the offline transmission module in a participant network environment, and sending the intermediate parameters to a participant for model updating.
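As a minimal sketch of this monitoring workflow, the snippet below uses a plain dict to stand in for the offline medium and trivial reversible callables for encryption; all names and the path layout are illustrative, taken from the storage-path convention described later.

```python
import json

def monitor_write(medium, node_id, n, payload, encrypt):
    """Initiator-side monitor: serialize and encrypt the round-n
    intermediate parameters, then write them at the agreed path on
    the offline medium (a dict stands in for the physical device)."""
    path = f"/fldir/{node_id}/task/{n:02d}/"
    medium[path] = encrypt(json.dumps(payload))
    return path

def monitor_read(medium, node_id, n, decrypt):
    """Participant-side monitor: scan the agreed path, decrypt, and
    return the intermediate parameters, or None if nothing is present."""
    blob = medium.get(f"/fldir/{node_id}/task/{n:02d}/")
    return None if blob is None else json.loads(decrypt(blob))
```

In the patent's setting, `encrypt`/`decrypt` would be the asymmetric operations of steps S102-S103, and `medium` a mounted USB device or gatekeeper staging area.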
Please refer to the first embodiment to the third embodiment for the specific implementation process of the method of the present invention.
Example one
Referring to fig. 3, this embodiment is a further explanation of the process of initializing the transmission monitoring module 13 in step S10. In this embodiment, the step S10 includes the following steps:
Step S101, starting the transmission monitoring modules 13 of the intranet and the extranet respectively. Specifically, in step S101 the transmission monitoring modules 13 of the intranet and the extranet each establish a secure transmission channel based on the TLCP cryptographic protocol, join their own party's federated learning cluster network environment, and establish secure connections with each computing node and the central node of that party's federated learning cluster using Netty, so as to store and transmit the intermediate parameters of the federated learning batch iteration process.
Step S102, the transmission monitoring modules 13 of the intranet and the extranet each adopt the RSA or SM2 asymmetric algorithm and, based on the identifiers of all participating nodes of their own party's federated learning cluster network, generate a key pair set S and a public key set X.
Further, in step S102 the node identifier set is defined as U = {h_1, h_2, h_3, ..., h_i}, where U is the set of node identifiers in the network environment, h_i denotes a federated learning node identifier, and i is the number of nodes in the federated learning cluster. Based on these node identifiers, the RSA or SM2 encryption algorithm generates a public-private key pair and a public key corresponding to each node identifier. The key pair set S is defined as:
S = {h_1: (p_1, k_1), h_2: (p_2, k_2), ..., h_i: (p_i, k_i)}
and the public key set X is defined as:
X = {h_1: p_1, h_2: p_2, ..., h_i: p_i}
where h_i denotes a federated learning node identifier, p_i denotes the public key of node h_i, and k_i denotes the private key of node h_i.
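The construction of S and X can be sketched as follows. A tiny textbook RSA keypair stands in for the production-grade RSA or SM2 keys a real deployment would generate; the function names are ours.

```python
def toy_rsa_keypair():
    """Textbook RSA with tiny fixed primes -- for illustration only."""
    p, q = 61, 53
    n, phi = p * q, (p - 1) * (q - 1)
    e = 17                    # public exponent, coprime with phi
    d = pow(e, -1, phi)       # private exponent (modular inverse)
    return (e, n), (d, n)

def build_key_sets(node_ids):
    """Per step S102: for every node identifier h_i in U, generate a
    keypair, collecting the key-pair set S and public-key set X."""
    S, X = {}, {}
    for h in node_ids:
        pub, priv = toy_rsa_keypair()
        S[h] = (pub, priv)    # S maps h_i -> (p_i, k_i)
        X[h] = pub            # X maps h_i -> p_i
    return S, X
```

The set X is what step S103 exchanges across the air gap, while S stays inside its own network.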
Step S103, the transmission monitoring modules 13 of the intranet and the extranet each load the public key set X generated by the other party's network in step S102 and mark it as X_0. For example, the intranet transmission monitoring module loads the public key set X generated by the extranet transmission monitoring module and marks it as X_0, in order to distinguish it from the set X generated locally in step S102; likewise, the extranet transmission monitoring module loads the public key set X generated by the intranet transmission monitoring module and marks it as X_0 for the same reason.
Step S104, starting the scanning and monitoring process of the offline transmission module 2, monitoring in real time the reading and writing of the offline federated learning iteration-batch intermediate parameters on the offline transmission module 2.
Further, in step S104 the offline transmission rule of the offline transmission module 2 combines an intermediate-parameter ciphertext data storage path with intermediate-parameter ciphertext data verification. Both parties define the storage path rule as:
[drive#:][/]fldir/h_i/task/n/
where fldir denotes the device root directory, h_i denotes the participant node identifier, task is a fixed folder name, and n denotes the round of task iterative computation, increasing sequentially as 01, 02, and so on. Using the node public key p_i, data decryption and signature verification are performed on the intermediate parameters read from the offline transmission module, judging whether they decrypt normally. If verification shows that the accessed offline transmission module has been tampered with, the task is stopped; this guarantees the legality of the transmission device and avoids the risk of data leakage caused by an illegal participant accessing and intercepting the intermediate parameters.
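The storage-path rule can be made concrete with a small helper; the zero-padding of the round number follows the 01, 02, ... convention above, and the function and parameter names are ours.

```python
def batch_path(node_id, n, fldir="fldir", drive=""):
    """Build the agreed path [drive#:][/]fldir/h_i/task/n/ for
    iteration round n, zero-padding the round number as 01, 02, ..."""
    prefix = f"{drive}#:/" if drive else "/"
    return f"{prefix}{fldir}/{node_id}/task/{n:02d}/"
```

Both monitors derive the same path independently, so no path metadata needs to cross the air gap.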
Example two
In this embodiment, please refer to fig. 4, where the step S20 includes the following steps:
Step S201, the extranet coordinator 12 generates an initialized public-private key pair based on the paillier algorithm, and sends the public key PubKey and the intranet participant node identifier h_i to the extranet initiator 10 and the extranet transmission monitoring module 13 respectively, for encrypting the intermediate parameters provided by the extranet initiator and the intranet participant in the iterative batch; the extranet coordinator 12 stores the private key PrivKey for decrypting the intermediate parameters after receiving an iterative batch, and sets the data interval with which the current iterative batch participates in training;
Step S202, the extranet initiator 10 receives the PubKey, sets the training batch with which local data participates each time, creates a paillier encryption processor, initializes the model and starts the first iterative computation, marking the single-side gradient generated in the current iteration round as dt_{j1}^n and the loss value as loss_{j1}^n to obtain the intermediate parameters, where n is the round of task iterative computation and j1 is the parameter identifier belonging to the extranet initiator; the intermediate parameters (dt_{j1}^n, loss_{j1}^n) are encrypted with PubKey through the paillier encryption processor to generate ciphertext data CT0_n, which is sent to the extranet coordinator 12;
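Steps S201-S202 rely on the paillier cryptosystem, whose additive homomorphism lets a coordinator combine encrypted intermediate parameters. Below is a minimal, insecure textbook implementation (small keys, simplified generator g = n + 1), intended only to show how such ciphertexts are produced, combined, and decrypted; none of these identifiers come from the patent.

```python
import math
import random

def _probable_prime(bits, rounds=25):
    """Random probable prime via Miller-Rabin (demo-grade only)."""
    while True:
        n = random.getrandbits(bits) | 1 | (1 << (bits - 1))
        d, r = n - 1, 0
        while d % 2 == 0:
            d //= 2
            r += 1
        for _ in range(rounds):
            a = random.randrange(2, n - 1)
            x = pow(a, d, n)
            if x in (1, n - 1):
                continue
            for _ in range(r - 1):
                x = pow(x, 2, n)
                if x == n - 1:
                    break
            else:
                break          # composite witness found
        else:
            return n           # all rounds passed

def paillier_keygen(bits=128):
    """Return (public key n, private key (n, lam, mu)); g = n + 1."""
    p = _probable_prime(bits // 2)
    q = _probable_prime(bits // 2)
    while q == p:
        q = _probable_prime(bits // 2)
    n = p * q
    lam = (p - 1) * (q - 1) // math.gcd(p - 1, q - 1)
    mu = pow(lam, -1, n)       # valid because g = n + 1
    return n, (n, lam, mu)

def paillier_encrypt(n, m):
    """Encrypt integer 0 <= m < n under public key n."""
    n2 = n * n
    r = random.randrange(1, n)
    while math.gcd(r, n) != 1:
        r = random.randrange(1, n)
    return ((1 + m * n) % n2) * pow(r, n, n2) % n2

def paillier_decrypt(priv, c):
    n, lam, mu = priv
    n2 = n * n
    return (pow(c, lam, n2) - 1) // n * mu % n

def paillier_add(n, c1, c2):
    """Homomorphic addition: Dec(c1 * c2 mod n^2) = m1 + m2 mod n."""
    return c1 * c2 % (n * n)
```

In practice, gradients would be fixed-point encoded into integers before encryption, and a vetted library would replace this sketch.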
Step S203, the offline transmission module 2 is connected to the extranet federated learning cluster network; the extranet transmission monitoring module 13 receives the PubKey transmitted in step S201 and the node identifier h_i of the intranet participant 11, encrypts (PubKey, h_i) with the public key p_i from the intranet public key set X_0 received in step S103, generates an encrypted file priFile, and writes it into the offline transmission module 2 at the file path [drive#:][/]fldir/h_i/task/n/; the current task is the first batch iteration, so n takes the value 01;
s204, the off-line transmission module 2 is disconnected from the outer network and is accessed into the inner network federal learning cluster network, the inner network transmission monitoring module 13 scans the file path of the off-line transmission module 2 and reads the encrypted file priFile generated in the step S203, and the node identifier h is used for collecting S through the key pair generated in the step S102iPrivate key k ofiDecrypting the encrypted file to obtain the public key PubKey of the coordinator, and sending the obtained PukKey to the intranet participant 11 (node identification h)i);
Step S205, the intranet participant 11 receives the public key PubKey decrypted in step S204, synchronously sets the training batch with which local data participates in each iteration, creates a paillier encryption processor, initializes the model, and starts the first iterative computation, marking the single-side gradient generated by the current iteration as dt_{j2}^n and the loss value as loss_{j2}^n to obtain the intermediate parameters, where n is the round of task iterative computation and j2 is the parameter identifier belonging to the intranet participant 11; the intermediate parameters (dt_{j2}^n, loss_{j2}^n) are encrypted with PubKey through the paillier encryption processor to generate intermediate-parameter ciphertext data CT1_n, which is sent to the intranet transmission monitoring module 13;
Step S206, the intranet transmission monitoring module 13 receives the ciphertext data CT1_n from step S205 and, based on the key pair set S generated in step S102, signs CT1_n with the private key k_i corresponding to the participant node identifier h_i to generate a signature file CTFile_n, which is written into the offline transmission module 2 at the file path [drive#:][/]fldir/h_i/task/n/CTFile_n;
Step S207, after step S206 is completed, the offline transmission module 2 is disconnected from the intranet and connected to the extranet federated learning cluster network; the extranet transmission monitoring module 13 scans the file path of the offline transmission module 2, reads the signature file CTFile_n generated in step S206, and verifies its signature with the public key p_i corresponding to participant identifier h_i in the intranet public key set X_0 received in step S103. If verification succeeds, the ciphertext data CT1_n generated in step S205 is obtained and sent to the extranet coordinator 12; if verification fails, the offline transmission module 2 is at risk of having been tampered with, and the privacy computation task is terminated;
Step S208, the extranet coordinator 12 receives the intermediate-parameter ciphertext data CT1_n from step S207 and CT0_n from step S202, and decrypts both parties' intermediate parameters with the private key PrivKey generated in step S201; after decryption, the extranet coordinator 12 obtains the intermediate-parameter plaintext data composed of the single-side gradient dt_{j1}^n and loss value loss_{j1}^n of the extranet initiator 10 and the single-side gradient dt_{j2}^n and loss value loss_{j2}^n of the intranet participant 11 in the first round of iterative computation;
Step S209, the extranet coordinator 12 optimizes and aggregates the two parties' gradients obtained in step S208 to obtain the total gradient after the first iterative optimization, marked total_dt_n, and splits the optimized total gradient to obtain the optimized single-side gradient of the extranet initiator 10, marked w_{j1}^n, and the optimized single-side gradient of the intranet participant 11, marked w_{j2}^n; the optimized gradients are sent to the extranet initiator 10 and the extranet transmission monitoring module 13 respectively, while the two parties' loss values obtained in step S208 are aggregated into the loss value after the first iteration, marked iter_loss_n.
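Step S209's aggregate-then-split of the two single-side gradients can be sketched as below. The patent leaves the optimizer unspecified, so a plain learning-rate scaling stands in for it, and the layout (initiator segment followed by participant segment) is our assumption.

```python
def aggregate_and_split(grad_initiator, grad_participant,
                        loss_initiator, loss_participant, lr=0.5):
    """Concatenate the two single-side gradients into total_dt_n,
    apply a stand-in optimizer step (scaling by lr), split the result
    back into per-party segments, and average the loss into iter_loss_n."""
    total_dt = grad_initiator + grad_participant     # concatenation
    optimized = [lr * g for g in total_dt]
    cut = len(grad_initiator)
    w_j1, w_j2 = optimized[:cut], optimized[cut:]    # per-party segments
    iter_loss = (loss_initiator + loss_participant) / 2
    return w_j1, w_j2, iter_loss
```

Only the party-specific segment leaves the coordinator, which is what step S301 then routes to each side.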
Example three
This embodiment further explains step S30. As shown in fig. 5 and fig. 6, step S30 includes:
step S301, the external network coordinator sends the optimized unilateral gradient dt_j1_n to the external network initiator node, and sends the optimized unilateral gradient dt_j2_n together with the intranet participant node identifier h_i to the external network transmission monitoring module; the external network initiator node receives the unilateral gradient dt_j1_n, updates the local model parameters according to the optimized gradient, performs the next iteration batch calculation, marks the unilateral gradient generated in the current iteration round as grad_j1_{n+1} and the loss value as loss_j1_{n+1} as intermediate parameters, and encrypts the intermediate parameters (grad_j1_{n+1}, loss_j1_{n+1}) with PubKey through the paillier encryption processor to generate ciphertext data CT0_{n+1};
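The initiator's part of step S301 — apply the optimized gradient, run the next batch, produce new intermediates — can be sketched as below. A toy least-squares objective and plain gradient descent are assumed, since the patent leaves the model unspecified; the resulting grad/loss pair is what would then be Paillier-encrypted with PubKey into CT0_{n+1}.

```python
def initiator_update(w, dt_j1):
    # Step S301 sketch: apply the optimized unilateral gradient dt_j1_n
    # to the local model parameters (plain gradient descent assumed).
    return [wi - gi for wi, gi in zip(w, dt_j1)]

def next_batch(w, batch):
    # Next iteration batch on a toy least-squares objective: produces the
    # new unilateral gradient grad_j1_{n+1} and loss value loss_j1_{n+1}.
    x, y = batch
    pred = sum(wi * xi for wi, xi in zip(w, x))
    err = pred - y
    grad = [2 * err * xi for xi in x]   # d(err^2)/dw_i
    loss = err * err
    return grad, loss

w = initiator_update([1.0, 1.0], [0.2, -0.1])
grad_next, loss_next = next_batch(w, ([1.0, 2.0], 3.0))
```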
Step S302, the offline transmission module accesses the external network federated learning cluster network; the external network transmission monitoring module receives the optimized unilateral gradient dt_j2_n and participant node identifier h_i transmitted in step S301, encrypts dt_j2_n and h_i with the public key P_i in the intranet public key set X_0 received in step S103 to generate an encrypted file marked wFile_n, and writes the encrypted file into the offline transmission module at the file path: [drive#:][/]fldir/h_i/task/n/wFile_n;
step S303, after step S302 is completed, the offline transmission module disconnects from the external network and accesses the intranet federated learning cluster network; the intranet transmission monitoring module scans the file path of the offline transmission module, reads the encrypted file wFile_n generated in step S302, decrypts the encrypted file with the private key k_i corresponding to node identifier h_i from the key pair set S generated in step S102 to obtain the decrypted unilateral gradient dt_j2_n, and sends the obtained gradient dt_j2_n to the intranet participant;
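The write/scan/decrypt roundtrip of steps S302/S303 through the offline module can be sketched as a filesystem exchange under the agreed path rule. A byte-wise XOR is used here as a deliberately trivial stand-in for the P_i/k_i public-key encryption, just to show the path layout and the scan-read-decrypt flow; paths and names follow the [drive#:][/]fldir/h_i/task/n/ convention.

```python
import os
import tempfile

KEY = 0x5A  # toy symmetric stand-in for the P_i / k_i keypair of step S102

def xor_bytes(data: bytes) -> bytes:
    return bytes(b ^ KEY for b in data)

def write_wfile(root, h_i, n, payload):
    # Step S302 sketch: write the encrypted file wFile_n under the
    # agreed path rule [drive#:][/]fldir/h_i/task/n/wFile_n
    d = os.path.join(root, "fldir", h_i, "task", f"{n:02d}")
    os.makedirs(d, exist_ok=True)
    path = os.path.join(d, f"wFile{n}")
    with open(path, "wb") as f:
        f.write(xor_bytes(payload))
    return path

def scan_and_decrypt(root, h_i, n):
    # Step S303 sketch: the intranet monitor scans the same path and
    # decrypts with the matching key to recover the unilateral gradient
    d = os.path.join(root, "fldir", h_i, "task", f"{n:02d}")
    with open(os.path.join(d, f"wFile{n}"), "rb") as f:
        return xor_bytes(f.read())

root = tempfile.mkdtemp()
write_wfile(root, "h1", 1, b"dt_j2_n gradient bytes")
recovered = scan_and_decrypt(root, "h1", 1)
```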
step S304, the intranet participant receives the unilateral gradient dt_j2_n decrypted in step S303, updates the local model parameters according to the received unilateral gradient, performs the next iteration batch calculation, marks the unilateral gradient generated in the current iteration round as grad_j2_{n+1} and the loss value as loss_j2_{n+1} as intermediate parameters, encrypts the intermediate parameters (grad_j2_{n+1}, loss_j2_{n+1}) with PubKey through the paillier encryption processor to generate intermediate parameter ciphertext data CT1_{n+1}, and sends the ciphertext data to the intranet transmission monitoring module;
step S305, the intranet transmission monitoring module receives the intermediate parameter ciphertext data CT1_{n+1} of step S304, signs the ciphertext data CT1_{n+1} with the private key k_i corresponding to the intranet participant node identifier h_i in the key pair set S generated in step S102 to generate a signature file CTFile_{n+1}, and writes the signature file into the offline transmission module at the file path:
[drive#:][/]fldir/h_i/task/n/CTFile_{n+1};
step S306, after step S305 is completed, the offline transmission module disconnects from the intranet and accesses the external network federated learning cluster network;
step S307, the external network transmission monitoring module scans the file path of the offline transmission module, reads the signature file CTFile_{n+1} generated in step S305, and verifies its signature with the public key P_i corresponding to participant identifier h_i in the intranet public key set X_0 received in step S103; after the signature is verified successfully, the ciphertext data CT1_{n+1} generated in step S304 is obtained and sent to the external network coordinator; if signature verification fails, the offline transmission module is at risk of having been tampered with, and the privacy computation task is terminated;
step S308, the external network coordinator receives the intermediate parameter ciphertext data CT1_{n+1} and the intermediate parameter ciphertext data CT0_{n+1} respectively, and decrypts both parties' intermediate parameters with the private key PrivKey generated in step S201; after decryption, the external network coordinator obtains the intermediate parameter plaintext data composed of the unilateral gradient grad_j1_{n+1} and loss value loss_j1_{n+1} of the external network initiator and the unilateral gradient grad_j2_{n+1} and loss value loss_j2_{n+1} of the intranet participant from the current round of iterative computation;
and the external network coordinator optimizes and aggregates the obtained gradients of the two parties into a total gradient after the current round of iterative optimization, marked Total_dt_{n+1}, and segments the optimized total gradient into the optimized unilateral gradient dt_j1_{n+1} of the external network initiator and the optimized unilateral gradient dt_j2_{n+1} of the intranet participant; the optimized gradients are sent respectively to the external network initiator and the external network transmission monitoring module. At the same time, the obtained loss values of the two parties are aggregated into the loss value after the current iteration round, marked iter_loss_{n+1}; the external network coordinator then computes a convergence threshold σ² over the loss values iter_loss_i of all iteration batches, for example as their variance:

σ² = (1/n) · Σ_{i=1}^{n} (iter_loss_i − (1/n) · Σ_{i=1}^{n} iter_loss_i)²

where n is the iteration round, i is the current-round variable, and σ² is the convergence threshold; whether the model has converged is judged by whether σ² reaches a preset threshold.
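One possible reading of the σ² computation — the patent does not spell the formula out — is the variance of the per-batch aggregated loss values, which shrinks as the losses stabilize. A minimal sketch under that assumption:

```python
def convergence_sigma2(iter_losses):
    # One reading of sigma^2: the variance of the per-batch loss values
    # iter_loss_1 .. iter_loss_n; training is judged converged when
    # sigma^2 falls to (or below) a preset threshold.
    n = len(iter_losses)
    mean = sum(iter_losses) / n
    return sum((l - mean) ** 2 for l in iter_losses) / n

sigma2 = convergence_sigma2([0.9, 0.5, 0.3, 0.2])
converged = sigma2 <= 0.08   # hypothetical preset threshold
```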
In this embodiment, in federated learning cluster networks under two or more isolated networks, the central node of each cluster network may act as coordinator depending on the network environment of the federated learning initiator. Referring to fig. 5, when an extranet node acts as initiator and the intranet acts as participant, the extranet central node acts as coordinator of the current task; similarly, referring to fig. 6, when an intranet node acts as initiator and the extranet acts as participant, the intranet central node acts as coordinator of the current task. The specific process is realized according to step S30, and the calculation flow can be iteratively trained in batches according to steps S10 to S40.
The above description is only a preferred embodiment of the present invention and should not be taken as limiting the invention, and any modification, equivalent replacement or improvement made within the technical scope of the present invention should be included in the protection scope of the present invention.

Claims (10)

1. The cross-network federal learning system is characterized by comprising at least two completely physically isolated networks and an offline transmission module connected between the two networks, wherein the two networks are respectively defined as an extranet and an intranet, and each network comprises an initiator, a participant, a coordinator and a transmission monitoring module, wherein:
the outer network initiator is used for completing a federated learning joint modeling task together with the inner network participant;
the intranet participant is used for providing local data, accepting the invitation of the extranet initiator and participating in the federated learning joint modeling task;
the outer network coordinator is used for performing aggregation optimization on intermediate parameters in the process of executing federated learning iterative computation by the outer network initiator and the inner network participant and generating new intermediate parameters;
the outer network transmission monitoring module is used for reading the intermediate parameters from the off-line transmission module and then sending the intermediate parameters to the outer network coordinator, and receiving the intermediate parameters from the outer network coordinator and then writing the intermediate parameters into the off-line transmission module;
the off-line transmission module is used for interactively transmitting the encrypted intermediate parameters between the two networks.
2. The cross-network federated learning system of claim 1, wherein the offline transmission module is further configured to store intermediate parameters for inter-transmission between two networks.
3. A cross-network federated learning method is characterized in that the method is realized based on a system, the system comprises at least two completely physically isolated networks and an offline transmission module connected between the two networks, the two networks are respectively defined as an extranet and an intranet, each network comprises an initiator, a participant, a coordinator and a transmission monitoring module, and the method comprises the following steps:
step S10, initializing the transmission monitoring module;
step S20, the external network initiator and the internal network participant complete the creation of a federal learning model together, the external network coordinator performs environment initialization and starts a first batch iterative computation task;
step S30, the external network coordinator respectively sends the two-party unilateral gradient after the iterative optimization in the step S20 to an external network initiator and an external network transmission monitoring module, sets a data interval for executing a next iteration batch, and starts the next iteration batch calculation;
and step S40, repeatedly and iteratively executing the step S30 until the federal learning task is finished.
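The overall control flow of steps S10 to S40 can be sketched as a small orchestration skeleton; the callables and the convergence signal are placeholders for the concrete steps defined in the dependent claims.

```python
def run_cross_network_task(init_monitors, first_batch, iterate, max_rounds=100):
    # Claim 3 sketch: S10 initialize the transmission monitors, S20 run the
    # first iteration batch, then repeat S30 until convergence (S40).
    init_monitors()                 # step S10
    state = first_batch()           # step S20
    for _ in range(max_rounds):     # steps S30 / S40
        state, done = iterate(state)
        if done:
            break
    return state

log = []
result = run_cross_network_task(
    lambda: log.append("S10"),
    lambda: (log.append("S20") or 0),      # initial state 0
    lambda s: (s + 1, s + 1 >= 3),         # toy iterate: stop after 3 rounds
)
```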
4. A cross-network federal learning method as claimed in claim 3, wherein said step S10 comprises the following procedures:
step S101, respectively starting transmission monitoring modules of an internal network and an external network;
step S102, adopting RSA or SM2 asymmetric algorithm by transmission monitoring modules of the internal network and the external network respectively, and generating a key pair set S and a public key set X based on each participating node identification of the local federal learning cluster network;
step S103, the transmission monitoring modules of the internal network and the external network respectively load the public key set X generated by the other side's network in step S102 and mark it as X_0;
Step S104, starting the scanning monitoring process of the offline transmission module, and monitoring in real time the reading and writing of offline federated learning iteration-batch intermediate parameters on the offline transmission module.
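The scanning monitoring process of step S104 can be sketched as a simple polling scan of the offline module's directory; the watch directory and file names are illustrative, and a real monitor would loop this with a sleep interval.

```python
import os
import tempfile

def scan_once(watch_dir, seen):
    # One pass of the S104 scanning process: report files not yet handled.
    new = []
    for name in sorted(os.listdir(watch_dir)):
        if name not in seen:
            seen.add(name)
            new.append(name)
    return new

watch = tempfile.mkdtemp()
seen = set()
open(os.path.join(watch, "CTFile1"), "wb").close()   # simulate a written file
first = scan_once(watch, seen)
second = scan_once(watch, seen)   # nothing new on the second pass
```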
5. The cross-network federated learning method of claim 4, wherein in the step S101, the transmission monitoring modules of the internal network and the external network establish secure transmission channels based on a TLCP cryptographic protocol, respectively join each party 'S federated learning cluster network environment, and adopt a Netty protocol to establish secure connections with each computing node and a central node of the party' S federated learning cluster network, for storing and transmitting intermediate parameters in the federated learning computing batch iterative process.
6. The cross-network federated learning method of claim 5, wherein in the step S102, the node identifiers are defined as: U = {h_1, h_2, h_3, ..., h_i}, where U is the node identifier set in the network environment, h_i is a federated learning node identifier, and i represents the number of federated learning cluster nodes; based on the node identifiers, a public-private key set and a public key set corresponding to each node identifier are generated with the RSA or SM2 encryption algorithm, wherein the key pair set S is defined as:

S = {h_1: (P_1, k_1), h_2: (P_2, k_2), h_3: (P_3, k_3), ..., h_i: (P_i, k_i)}

and the public key set X is defined as:

X = {h_1: P_1, h_2: P_2, h_3: P_3, ..., h_i: P_i}

wherein P_i represents the public key of federated learning node identifier h_i, and k_i represents the private key of federated learning node identifier h_i.
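Building S and X from the node identifier set U can be sketched as follows, using textbook RSA with toy primes as a stand-in for the RSA/SM2 key generation named in the claim; the prime choices are purely illustrative.

```python
def toy_rsa_keypair(p, q, e=17):
    # Textbook RSA on toy primes; a stand-in for RSA/SM2 generation in S102.
    n = p * q
    d = pow(e, -1, (p - 1) * (q - 1))   # modular inverse of e mod φ(n)
    return (e, n), (d, n)               # (public key P_i, private key k_i)

U = ["h1", "h2", "h3"]                  # node identifier set U of the claim
PRIMES = [(61, 53), (67, 71), (73, 79)] # toy primes, one pair per node
S = {}                                  # key pair set S: h_i -> (P_i, k_i)
X = {}                                  # public key set X: h_i -> P_i
for h, pq in zip(U, PRIMES):
    P_i, k_i = toy_rsa_keypair(*pq)
    S[h] = (P_i, k_i)
    X[h] = P_i
```

X is what each side exports to the peer network in step S103, while S stays local for decryption and signing.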
7. The cross-network federated learning method of claim 6, wherein in step S104, the offline transmission rule of the offline transmission module combines an intermediate parameter ciphertext data storage path with intermediate parameter ciphertext data verification, and both parties define the storage path rule as:

[drive#:][/]fldir/h_i/task/n/;

where fldir denotes the device root directory, h_i represents the participant node identifier, task is a fixed folder name, and n represents the round of task iterative computation, increasing sequentially from 01, 02, ...; the transmission monitoring modules of both parties use the private key k_i and the node public key to decrypt and verify the signature of the intermediate parameters read from the offline transmission module, and judge whether decryption is normal.
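The storage path rule above can be expressed as a small helper; the zero-padded round number follows the 01, 02, ... convention the claim describes, and the drive prefix is left empty by default since [drive#:] is optional.

```python
def storage_path(h_i: str, n: int, filename: str, drive: str = "") -> str:
    # Claim 7's path rule: [drive#:][/]fldir/h_i/task/n/<file>, with the
    # round number n zero-padded (01, 02, ...).
    return f"{drive}/fldir/{h_i}/task/{n:02d}/{filename}"

p = storage_path("h3", 1, "CTFile1")
```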
8. The cross-network federal learning method as claimed in claim 7, wherein said step S20 comprises the following procedures:
step S201, the external network coordinator generates an initialized public-private key pair based on the paillier algorithm, and sends the public key PubKey and the intranet participant node identifier h_i to the external network initiator and the external network transmission monitoring module respectively, for encrypting the iteration-batch intermediate parameters provided by the external network initiator and the intranet participant; the external network coordinator keeps the private key PrivKey for decrypting the received iteration-batch intermediate parameters, and at the same time sets the data interval of the current iteration batch participating in training;
step S202, the external network initiator receives PubKey, sets the training batch in which local data participates in each iteration, creates a paillier encryption processor, initializes the model and starts the first iteration calculation, marking the unilateral gradient generated in the current iteration round as grad_j1_n and the loss value as loss_j1_n to obtain the intermediate parameters, where n is the round of task iterative computation and j1 is the parameter identifier belonging to the external network initiator; the paillier encryption processor encrypts the intermediate parameters (grad_j1_n, loss_j1_n) with PubKey to generate ciphertext data CT0_n, which is sent to the external network coordinator;
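The paillier encryption processor of steps S201/S202 can be sketched with a minimal textbook Paillier implementation. The primes here are toy-sized for illustration (real deployments use moduli of 2048 bits or more), and only the encrypt/decrypt primitives the patent relies on are shown; Paillier's additive homomorphism (ciphertext product decrypts to plaintext sum) is what makes it a common choice for federated intermediate parameters.

```python
import math
import random

def keygen(p=2011, q=2003):
    # Toy primes for illustration; real Paillier uses >= 2048-bit moduli.
    n = p * q
    lam = (p - 1) * (q - 1) // math.gcd(p - 1, q - 1)  # lcm(p-1, q-1)
    g = n + 1
    n2 = n * n
    mu = pow((pow(g, lam, n2) - 1) // n, -1, n)
    return (n, g), (lam, mu, n)        # (PubKey, PrivKey)

def encrypt(pub, m):
    # c = g^m * r^n mod n^2, with random r coprime to n
    n, g = pub
    n2 = n * n
    r = random.randrange(2, n)
    while math.gcd(r, n) != 1:
        r = random.randrange(2, n)
    return (pow(g, m, n2) * pow(r, n, n2)) % n2

def decrypt(priv, c):
    # m = L(c^lam mod n^2) * mu mod n, where L(x) = (x - 1) // n
    lam, mu, n = priv
    n2 = n * n
    return ((pow(c, lam, n2) - 1) // n) * mu % n

pub, priv = keygen()
ct0 = encrypt(pub, 123)   # e.g. one quantized intermediate-parameter value
```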
step S203, the off-line transmission module accesses the external network federated learning cluster network; the external network transmission monitoring module receives PubKey and the intranet participant node identifier h_i transmitted in step S201, encrypts the coordinator's PubKey and h_i with the public key P_i in the intranet public key set X_0 received in step S103 to generate an encrypted file priFile, and writes the encrypted file into the off-line transmission module at the file path: [drive#:][/]fldir/h_i/task/n/priFile; the current task is the first batch iteration task, and the value of n is 01;
step S204, the off-line transmission module disconnects from the external network and accesses the intranet federated learning cluster network; the intranet transmission monitoring module scans the file path of the off-line transmission module, reads the encrypted file priFile generated in step S203, decrypts the encrypted file with the private key k_i corresponding to node identifier h_i from the key pair set S generated in step S102 to obtain the coordinator's public key PubKey, and sends the obtained PubKey to the intranet participant;
Step S205, the intranet participant receives the public key PubKey decrypted in step S204, synchronously sets the training batch in which local data participates in each iteration, creates a paillier encryption processor, initializes the model and starts the first iteration calculation, marking the unilateral gradient generated in the current iteration round as grad_j2_n and the loss value as loss_j2_n to obtain the intermediate parameters, where n is the round of task iterative computation and j2 is the parameter identifier belonging to the intranet participant; the paillier encryption processor encrypts the intermediate parameters (grad_j2_n, loss_j2_n) with PubKey to generate intermediate parameter ciphertext data CT1_n, which is sent to the intranet transmission monitoring module;
step S206, the intranet transmission monitoring module receives the ciphertext data CT1_n of step S205, signs the ciphertext data CT1_n with the private key k_i corresponding to participant node identifier h_i in the key pair set S generated in step S102 to generate a signature file CTFile_n, and writes the signature file into the off-line transmission module at the file path: [drive#:][/]fldir/h_i/task/n/CTFile_n.
9. the cross-network federal learning method as claimed in claim 8, wherein said step S20 further comprises:
step S207, after step S203, the off-line transmission module disconnects from the intranet and accesses the external network federated learning cluster network; the external network transmission monitoring module scans the file path of the off-line transmission module, reads the signature file CTFile_n generated in step S206, and verifies its signature with the public key P_i corresponding to participant identifier h_i in the intranet public key set X_0 received in step S103; after the signature is verified successfully, the ciphertext data CT1_n generated in step S205 is obtained and sent to the coordinator; if signature verification fails, the off-line transmission module is at risk of having been tampered with, and the privacy computation task is terminated;
step S208, the external network coordinator receives the intermediate parameter ciphertext data CT1_n of step S207 and the intermediate parameter ciphertext data CT0_n of step S202 respectively, and decrypts both parties' intermediate parameters with the private key PrivKey generated in step S201; after decryption, the external network coordinator obtains the intermediate parameter plaintext data composed of the unilateral gradient grad_j1_n and loss value loss_j1_n of the external network initiator and the unilateral gradient grad_j2_n and loss value loss_j2_n of the intranet participant from the first round of iterative computation;
step S209, the external network coordinator optimizes and aggregates the two parties' gradients obtained in step S208 into a total gradient after the first iteration optimization, marked Total_dt_n, and segments the optimized total gradient into the optimized unilateral gradient dt_j1_n of the external network initiator and the optimized unilateral gradient dt_j2_n of the intranet participant; the optimized gradients are sent respectively to the external network initiator and the external network transmission monitoring module, and at the same time the two parties' loss values obtained in step S208 are aggregated into the loss value after the first iteration, marked iter_loss_n.
10. The cross-network federal learning method as claimed in claim 9, wherein said step S30 includes:
step S301, the external network coordinator sends the optimized unilateral gradient dt_j1_n to the external network initiator node, and sends the optimized unilateral gradient dt_j2_n together with the intranet participant node identifier h_i to the external network transmission monitoring module; the external network initiator node receives the unilateral gradient dt_j1_n, updates the local model parameters according to the optimized gradient, performs the next iteration batch calculation, marks the unilateral gradient generated in the current iteration round as grad_j1_{n+1} and the loss value as loss_j1_{n+1} as intermediate parameters, and encrypts the intermediate parameters (grad_j1_{n+1}, loss_j1_{n+1}) with PubKey through the paillier encryption processor to generate ciphertext data CT0_{n+1};
Step S302, the offline transmission module accesses the external network federated learning cluster network; the external network transmission monitoring module receives the optimized unilateral gradient dt_j2_n and participant node identifier h_i transmitted in step S301, encrypts dt_j2_n and h_i with the public key P_i in the intranet public key set X_0 received in step S103 to generate an encrypted file marked wFile_n, and writes the encrypted file into the offline transmission module at the file path: [drive#:][/]fldir/h_i/task/n/wFile_n;
step S303, after step S302 is completed, the offline transmission module disconnects from the external network and accesses the intranet federated learning cluster network; the intranet transmission monitoring module scans the file path of the offline transmission module, reads the encrypted file wFile_n generated in step S302, decrypts the encrypted file with the private key k_i corresponding to node identifier h_i from the key pair set S generated in step S102 to obtain the decrypted unilateral gradient dt_j2_n, and sends the obtained gradient dt_j2_n to the intranet participant;
step S304, the intranet participant receives the unilateral gradient dt_j2_n decrypted in step S303, updates the local model parameters according to the received unilateral gradient, performs the next iteration batch calculation, marks the unilateral gradient generated in the current iteration round as grad_j2_{n+1} and the loss value as loss_j2_{n+1} as intermediate parameters, encrypts the intermediate parameters (grad_j2_{n+1}, loss_j2_{n+1}) with PubKey through the paillier encryption processor to generate intermediate parameter ciphertext data CT1_{n+1}, and sends the ciphertext data to the intranet transmission monitoring module;
step S305, the intranet transmission monitoring module receives the intermediate parameter ciphertext data CT1_{n+1} of step S304, signs the ciphertext data CT1_{n+1} with the private key k_i corresponding to the intranet participant node identifier h_i in the key pair set S generated in step S102 to generate a signature file CTFile_{n+1}, and writes the signature file into the offline transmission module at the file path:
[drive#:][/]fldir/h_i/task/n/CTFile_{n+1};
step S306, after step S305 is completed, the offline transmission module disconnects from the intranet and accesses the external network federated learning cluster network;
step S307, the external network transmission monitoring module scans the file path of the offline transmission module, reads the signature file CTFile_{n+1} generated in step S305, and verifies its signature with the public key P_i corresponding to participant identifier h_i in the intranet public key set X_0 received in step S103; after the signature is verified successfully, the ciphertext data CT1_{n+1} generated in step S304 is obtained and sent to the external network coordinator; if signature verification fails, the offline transmission module is at risk of having been tampered with, and the privacy computation task is terminated;
step S308, the external network coordinator receives the intermediate parameter ciphertext data CT1_{n+1} and the intermediate parameter ciphertext data CT0_{n+1} respectively, and decrypts both parties' intermediate parameters with the private key PrivKey generated in step S201; after decryption, the external network coordinator obtains the intermediate parameter plaintext data composed of the unilateral gradient grad_j1_{n+1} and loss value loss_j1_{n+1} of the external network initiator and the unilateral gradient grad_j2_{n+1} and loss value loss_j2_{n+1} of the intranet participant from the current round of iterative computation;
and the external network coordinator optimizes and aggregates the obtained gradients of the two parties into a total gradient after the current round of iterative optimization, marked Total_dt_{n+1}, and segments the optimized total gradient into the optimized unilateral gradient dt_j1_{n+1} of the external network initiator and the optimized unilateral gradient dt_j2_{n+1} of the intranet participant; the optimized gradients are sent respectively to the external network initiator and the external network transmission monitoring module. At the same time, the obtained loss values of the two parties are aggregated into the loss value after the current iteration round, marked iter_loss_{n+1}; the external network coordinator computes a convergence threshold σ² over the loss values iter_loss_i of all iteration batches, for example as their variance:

σ² = (1/n) · Σ_{i=1}^{n} (iter_loss_i − (1/n) · Σ_{i=1}^{n} iter_loss_i)²

where n is the iteration round, i is the current-round variable, and σ² is the convergence threshold; whether the model has converged is judged by whether σ² reaches a preset threshold.
CN202210823096.6A 2022-07-13 2022-07-13 Cross-network federal learning system and method Active CN115277696B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202210823096.6A CN115277696B (en) 2022-07-13 2022-07-13 Cross-network federal learning system and method

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202210823096.6A CN115277696B (en) 2022-07-13 2022-07-13 Cross-network federal learning system and method

Publications (2)

Publication Number Publication Date
CN115277696A true CN115277696A (en) 2022-11-01
CN115277696B CN115277696B (en) 2023-04-18

Family

ID=83765984

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202210823096.6A Active CN115277696B (en) 2022-07-13 2022-07-13 Cross-network federal learning system and method

Country Status (1)

Country Link
CN (1) CN115277696B (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN115766189A (en) * 2022-11-10 2023-03-07 贵州电网有限责任公司 Multi-channel isolation safety protection method and system

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113487042A (en) * 2021-06-28 2021-10-08 海光信息技术股份有限公司 Federated learning method and device and federated learning system
CN114003950A (en) * 2021-10-19 2022-02-01 南京三眼精灵信息技术有限公司 Federal machine learning method, device, equipment and medium based on safety calculation
CN114139722A (en) * 2021-11-29 2022-03-04 广发银行股份有限公司 Block chain-based federal learning task scheduling method, system, device and medium
WO2022060264A1 (en) * 2020-09-18 2022-03-24 Telefonaktiebolaget Lm Ericsson (Publ) Methods and systems for updating machine learning models
CN114490704A (en) * 2020-11-13 2022-05-13 深圳前海微众银行股份有限公司 Data processing method, device, equipment and storage medium
CN114640498A (en) * 2022-01-27 2022-06-17 天津理工大学 Network intrusion cooperative detection method based on federal learning


Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN115766189A (en) * 2022-11-10 2023-03-07 贵州电网有限责任公司 Multi-channel isolation safety protection method and system
CN115766189B (en) * 2022-11-10 2024-05-03 贵州电网有限责任公司 Multichannel isolation safety protection method and system

Also Published As

Publication number Publication date
CN115277696B (en) 2023-04-18

Similar Documents

Publication Publication Date Title
Cui et al. RSMA: Reputation system-based lightweight message authentication framework and protocol for 5G-enabled vehicular networks
Khan et al. An efficient and provably secure certificateless key-encapsulated signcryption scheme for flying ad-hoc network
US20210336792A1 (en) Leveraging multiple devices to enhance security of biometric authentication
CN113194469A (en) 5G unmanned aerial vehicle cross-domain identity authentication method, system and terminal based on block chain
RU2009120689A (en) DISTRIBUTED CANCELLATION OF AUTHORITY OF DEVICES
CN110113334A (en) Contract processing method, equipment and storage medium based on block chain
CN115277696B (en) Cross-network federal learning system and method
CN116957064A (en) Knowledge distillation-based federal learning privacy protection model training method and system
CN115935438A (en) Data privacy intersection system and method
Zhao et al. Fuzzy identity-based dynamic auditing of big data on cloud storage
CN116011014A (en) Privacy computing method and privacy computing system
CN105610872A (en) Internet of Things terminal encryption method and Internet of Things terminal encryption device
CN111709053B (en) Operation method and operation device based on loose coupling transaction network
CN114124347A (en) Safe multi-party computing method and system based on block chain
CN113328854A (en) Service processing method and system based on block chain
Zhou et al. VDFChain: Secure and verifiable decentralized federated learning via committee-based blockchain
CN114257419B (en) Device authentication method, device, computer device and storage medium
Zeng et al. Concurrently Deniable Group Key Agreement and Its Application to Privacy‐Preserving VANETs
Kiefer et al. Universally composable two-server PAKE
CN115361196A (en) Service interaction method based on block chain network
Ayad et al. An efficient authenticated group key agreement protocol for dynamic UAV fleets in untrusted environments
He et al. Efficient group key management for secure big data in predictable large‐scale networks
CN114172742A (en) Layered authentication method for power internet of things terminal equipment based on node map and edge authentication
CN116614273B (en) Federal learning data sharing system and model construction method in peer-to-peer network based on CP-ABE
Tian et al. A new construction for linkable secret handshake

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant