CN112818585B

CN112818585B - Method and device for dividing iterative computation parallel particles of integrated circuit interlayer coupling

Info

Publication number: CN112818585B
Application number: CN202110425202.0A
Authority: CN
Inventors: 唐章宏; 邹军; 王芬; 黄承清; 汲亚飞
Original assignee: Beijing Wisechip Simulation Technology Co Ltd
Current assignee: Beijing Wisechip Simulation Technology Co Ltd
Priority date: 2021-04-20
Filing date: 2021-04-20
Publication date: 2021-07-13
Anticipated expiration: 2041-04-20
Also published as: CN112818585A

Abstract

The invention provides a partitioning method and a partitioning device for iterative computation parallel particles of integrated circuit interlayer coupling, which divide iterative computation into three types of computing units: a basic calculation unit, an integrated circuit layer-layer calculation unit and an electromagnetic field and current distribution calculation unit of each layer; secondly, dividing the iterative computation of the interlayer coupling of the integrated circuit into non-overlapping computation particles according to the three types of computation units; thirdly, based on one complete serial iterative computation, obtaining weighted CPU time and total CPU time of each computation particle, and combining the computation particles into different parallel particles according to the proportion of the weighted CPU time; and finally, classifying the parallel particles, wherein the similar parallel particles are mutually independent, and the corresponding calculation task sequences can be randomly disordered to form new calculation task sequences which are dynamically distributed to different calculation processes. The invention determines a parallel particle division method of efficient parallel computation of the interlayer coupling computation of the integrated circuit, and reduces the simulation time of the integrated circuit.

Description

Method and device for dividing iterative computation parallel particles of integrated circuit interlayer coupling

Technical Field

The invention relates to the technical field of integrated circuit interlayer coupling iterative computation, in particular to a method and a device for dividing iterative computation parallel particles of integrated circuit interlayer coupling.

Background

When the integrated circuit works, high-frequency alternating electromagnetic fields are formed on a multilayer layout of the integrated circuit due to transmission of high-speed signals, and the high-frequency alternating electromagnetic fields form high-frequency radiation sources, so that crosstalk and electromagnetic radiation are formed on other signal layers or other integrated circuits and chips, and normal work of the other signal layers or other integrated circuits and chips is influenced, and therefore, the influence of the spatial electromagnetic radiation among the layers of the integrated circuit needs to be calculated during design. In the traditional method, strict three-dimensional numerical calculation is generally adopted for solving the space electromagnetic radiation of the integrated circuit, but the multilayer super-large-scale integrated circuit has a complex structure and very complex three-dimensional numerical calculation, and the used CPU time and memory are very large. In the large-scale numerical calculation, different calculation examples have different structures, so that the calculation complexity of the different calculation examples is unequal, and for the unequal mass calculation, a high-efficiency parallel calculation method design is needed, the unequal calculation complexity of the different examples is fully considered, and the parallel calculation efficiency is improved as much as possible.

The conventional parallel computing is basically parallel to a single computing example, the parallel is realized in a large number of circulating computing parts, and parallel particles are usually fine, so that a large number of data exchange exists among different processes, and the parallel efficiency is reduced; secondly, due to different calculation schedules of different processes, a large amount of waiting is inevitable when data sharing and synchronization are needed, so that the overall parallel efficiency is low; moreover, since the calculation processes of a considerable part of the calculation processes of a single instance have a sequence and data have dependency, when the single calculation instance is parallel, the calculation of a considerable part cannot be parallelized, which also seriously reduces the overall parallel efficiency.

Disclosure of Invention

Objects of the invention

Based on the above, in order to reduce the influence of the point source described by the dyadic Green function on any point in space and approximate the coupling between each layer of the equivalent multilayer integrated circuit, the original three-dimensional electromagnetic field problem of the integrated circuit is reduced to the calculation complexity of the iterative method of the superposition of the two-dimensional electromagnetic field problem and the Green function. In order to reduce communication among processes to the maximum extent in the iterative computation process of interlayer coupling of a multilayer very large scale integrated circuit, avoid hard disk read-write bottleneck caused by the fact that a memory peak value is larger than an available physical memory during multi-process parallel computation, and perfectly solve the problem of process waiting caused by unequal complexity of different computation examples, thereby greatly improving the parallel computation efficiency, the application discloses the following technical scheme.

(II) technical scheme

As a first embodiment of the present invention, the present invention discloses a partitioning method for iterative computation parallel particles of integrated circuit interlayer coupling, comprising the following steps:

s1, dividing the iterative computation of the interlayer coupling of the multilayer ultra-large scale integrated circuit into three types of computation units: a basic calculation unit, a layer-to-layer calculation unit of an integrated circuit, and an electromagnetic field and current distribution calculation unit of each layer of the integrated circuit;

s2, dividing iterative computation of integrated circuit interlayer coupling into non-overlapping computation particles according to the three types of computation cells, wherein the computation particles are one or more computation cells executing all independent operations of the same type;

s3, obtaining the weighted CPU time and the total CPU time of each calculation particle based on one-time complete serial iterative calculation, and combining the calculation particles into different parallel particles according to the proportion of the weighted CPU time;

and S4, classifying the parallel particles, wherein the similar parallel particles are mutually independent, the corresponding calculation task sequences can be randomly disordered, in the process of executing the parallel particles, the sequences of all calculation tasks executed by the similar parallel particles are randomly disordered to form a new calculation task sequence, and the new calculation task sequence is dynamically distributed to different calculation processes to complete the parallel calculation of the calculation tasks.

Further, the basic computing unit includes two types: the first kind of basic calculation unit is used for calculating the field generated by the current in the simple polygon of the integrated circuit in any simple polygon on other layers of the integrated circuit by utilizing a dyadic Green function, and the second kind of basic calculation unit is used for calculating the grid unit in the single-layer two-dimensional finite element calculation of the integrated circuit; the calculation of the grid unit in the integrated circuit single-layer two-dimensional finite element calculation comprises the following steps: calculating the area of the grid unit, calculating the shape function of the grid unit, calculating the finite element rigidity matrix of the grid unit, and calculating the field intensity and the current density of any point in the grid unit.

Further, the first-class basic computing unit specifically operates as follows:

the method comprises the steps of firstly, calculating an electric field generated by a point current source at a field point, wherein the electric field generated by the point current source at the field point is a special analytical expression formed according to a special layered structure of an integrated circuit, and the current sources of a multilayer integrated circuit are layered, namely the current density distributed on each metal layer of an integrated circuit layout with a complex shape is only equal to that of each metal layerxAndyis related tozIndependently, the current density distribution is onlyx, yAs a function of (c).

And secondly, taking an electric field expression generated by the point current source at the field point as an integrand function of two-dimensional Gaussian integration, and calculating fields generated by the surface current source of the simple-shaped polygon at the same position based on a linear superposition principle of the fields, wherein the method comprises the following steps: the field generated by the current source in the two-dimensional plane S at any point in space can be calculated by the two-dimensional gaussian integral:

；

wherein the content of the first and second substances,

at any point in space for the current source within the two-dimensional plane S (x,y,z) The field that is generated is,

is an arbitrary position within the two-dimensional surface S: (u,v) At any point in space (a)x,y,z) The expression of the field that is generated,

representing a gaussian integration point corresponding to a two-dimensional gaussian integration in the two-dimensional plane S,p,qrespectively representu,vIn the first directionpA first, aqThe number of the Gaussian integration points is equal to the total number of the points,

is the weight factor corresponding to the gaussian integral point;

and thirdly, calculating fields generated by the current on the simple-shaped polygon at different positions of other layers of the integrated circuit, and determining the fields generated by the current on the simple-shaped polygon divided on the layout of other layers of the integrated circuit based on the linear superposition principle of the fields.

Further, the integrated circuit layer-layer computation unit includes: combining the first type of basic calculation units, and calculating fields generated by currents distributed on the complex integrated circuit layout filled by the simple polygons on the complex layouts of other layers of the integrated circuit based on the field linear superposition principle; the electromagnetic field and current distribution calculating unit of each layer of the integrated circuit comprises: and combining the second type of basic computing units, taking the influence of other layers on the source layer to be computed as an additional source item, and computing the electromagnetic field and current distribution of the source layer to be computed by adopting a two-dimensional finite element method.

Further, the calculation formula for calculating the weighted CPU time of the particle is:

in the formula:

is as followsiThe weighted CPU time of each calculated grain,

is as followsiEach calculation particle has a single calculated CPU time,

is as followsiThe number of particle executions was counted.

Further, the calculation formula of the total CPU time in the whole calculation process is:

wherein, in the step (A),Tfor the total CPU time of the entire calculation process,mthe number of calculation particles divided for the entire calculation program,

is as followsiThe weighted CPU time of the particles is calculated.

Further, the weighted CPU time of each calculation particle is sorted in descending order and accumulated in sequence until the accumulated sum exceeds 90% of the total CPU time, and each calculation particle in the accumulated sum is taken as a parallel particle.

The second embodiment discloses a partitioning device for iterative computation parallel particles of integrated circuit interlayer coupling, which comprises a computing unit partitioning module, a computing particle partitioning module, a parallel particle partitioning module and a parallel particle operation module,

the calculation unit division module is used for dividing the iterative calculation of the interlayer coupling of the multilayer ultra-large scale integrated circuit into three types of calculation units: a basic calculation unit, a layer-to-layer calculation unit of an integrated circuit, and an electromagnetic field and current distribution calculation unit of each layer of the integrated circuit;

the calculation particle dividing module is used for dividing the three types of calculation units into calculation particles which are not overlapped with each other;

the parallel particle division module obtains weighted CPU time of each calculation particle and total CPU time of an iteration method of integral multilayer ultra-large scale integrated circuit interlayer coupling based on one-time complete serial iterative calculation, the calculation particles are combined into different parallel particles according to the ratio of the weighted CPU time to the total CPU time, the similar parallel particles are mutually independent, and corresponding calculation task sequences can be randomly disordered;

the parallel particle operation module is used for randomly disordering the sequences of all the calculation tasks executed by the same type of parallel particles in the process of executing the parallel particles to form a new calculation task sequence, and dynamically distributing the new calculation task sequence to different calculation processes to complete the parallel calculation of the calculation tasks.

Further, the first-class basic computing unit specifically operates as follows:

the method comprises the steps of firstly, calculating an electric field generated by a point current source at a field point, wherein the electric field generated by the point current source at the field point is a special analytical expression formed according to a special layered structure of an integrated circuit, and the current sources of a multilayer integrated circuit are layered, namely the current density distributed on each metal layer of an integrated circuit layout with a complex shape is only equal to that of each metal layerxAndyis related tozIndependently, the current density distribution is onlyx, yA function of (a);

；

wherein the content of the first and second substances,

is the weight factor corresponding to the gaussian integral point;

(III) advantageous effects

The invention realizes different particle parallels in the iterative computation of the coupling between the integrated circuit layers, greatly reduces the communication between the processes and the waiting time caused by synchronization, simultaneously, adopts a random dynamic allocation method of the computation tasks, ensures that computation models with unequal complexity are randomly and uniformly distributed on each computation node, and avoids the bottleneck of hard disk read-write caused by virtual memory access due to overhigh peak value memory.

Drawings

The embodiments described below with reference to the drawings are exemplary and intended to be used for explaining and illustrating the present invention and should not be construed as limiting the scope of the present invention.

FIG. 1 is a block diagram of the main steps of a first embodiment of the present invention;

FIG. 2 is a principal flow diagram of a first class of basic computing unit of the present invention;

FIG. 3 is a block diagram of the modules of a second embodiment of the present invention;

fig. 4 is an exploded view of the electric field generated at the field point by the point current source of the present invention.

Detailed Description

In order to make the implementation objects, technical solutions and advantages of the present invention clearer, the technical solutions in the embodiments of the present invention will be described in more detail below with reference to the accompanying drawings in the embodiments of the present invention.

It should be noted that: in the drawings, the same or similar reference numerals denote the same or similar elements or elements having the same or similar functions throughout. The embodiments described are some embodiments of the present invention, not all embodiments, and features in embodiments and embodiments in the present application may be combined with each other without conflict. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present invention.

In the description of the present invention, it is to be understood that the terms "central," "longitudinal," "lateral," "front," "rear," "left," "right," "vertical," "horizontal," "top," "bottom," "inner," "outer," and the like are used in the orientation or positional relationship indicated in the drawings, which are used for convenience in describing the invention and for simplicity in description, and are not intended to indicate or imply that the referenced device or element must have a particular orientation, be constructed and operated in a particular orientation, and are not to be considered limiting of the scope of the invention.

The following describes in detail a first embodiment of the partitioning method and apparatus for iterative computation of parallel grains for interlayer coupling of an integrated circuit according to the present invention with reference to fig. 1. The partitioning method for iterative computation parallel particles of integrated circuit interlayer coupling provided by the embodiment comprises the following steps:

before parallel computing, the number of processes needs to be determined manually, and one process is taken as a main process.

The calculation particles are defined according to the problem operation characteristics. The problem operation characteristics are different from industry to industry. For example, for large scale integrated circuit electromagnetic field distribution calculation, when a multilayer integrated circuit board with a certain structure and an external circuit thereof are subjected to field-path coupling, the operational characteristics comprise: the influence of the point source on any point in the space is along with the distance relation between the point source and the point, the number of separated medium layers, the electromagnetic field distribution and the current distribution of each layer, and the time point of the electromagnetic field distribution and the current distribution of the influenced layer is updated.

in the formula:

is as followsiThe weighted CPU time of each calculated grain,

is as followsiEach calculation particle has a single calculated CPU time,

is as followsiThe number of particle executions was counted.

is as followsiThe weighted CPU time of the particles is calculated.

Specifically, if the iterative computation of the interlayer coupling of the multilayer very large scale integrated circuit is divided into 3 computing particles of c1, c2 and c3 according to the definition of the computing particles, 3 computing particles can execute the computing task of the whole computing process; if c1 executes 500 operation tasks, c2 executes 200 operation tasks, and c3 executes 5 operation tasks; then 705 operation tasks constitute the whole operation process, which only needs 3 computation particles of c1, c2 and c 3. The whole operation process is executed by 3 computing particles of c1, c2 and c3, and each of c1, c2 and c3 comprises at least 1 independent operation (operation task).

Sorting according to the weighted CPU time obtained by each calculated particle operation, wherein if the c1 weighted CPU time is 0.1s, the c2 weighted CPU time is 100s and the c3 weighted CPU time is 0.2s, the final sorting result is c2> c3> c 1; the weighted CPU times for the 3 calculated particles add sequentially from large to small, i.e., T (c2) + T (c3) + … until the sum of times is greater than 90% of the total CPU time; if T (c2) + T (c3) > 90%, then c2, c3 are each as a parallel particle; if T (c2) > 90% of the total CPU time, then c2 is a parallel particle.

Specifically, the way of randomly scrambling the operation task sequence is as follows:

firstly, the sequence of operation tasks

Correspondingly generating random number sequences

，m＝1,2,3,…,M. Then to the sequence

The sequences are sorted from small to large, and the sorted sequences are

. Finally, generating new non-repeated operation task sequence

，

Is composed of

In that

Of (c) is used.

The key point is to make all the operation tasks in the parallel particles in sequence

Randomly disorganized to generate new non-repetitive operation task sequence

Then distributing operation tasks according to the sequence, namely equivalently distributing the original operation tasks randomly, wherein the random distribution strategy is characterized in that the random distribution scheme can completely disturb the distribution sequence of all the operation tasks, thereby realizing that the sum of the peak value memory occupied by the tasks operated by all the operation nodes simultaneously is formed by the average value of the peak value memory occupied by the process number and all the models (calculation particles) rather than the highest valueAnd (4) determining the value.

And the main process distributes all the operation tasks required to be executed by the parallel particles to all the processes including the main process according to the formed new calculation task sequence, and completes the parallel operation of all the operation tasks executed by the parallel particles.

In addition, if a certain operation task in the parallel particles is distributed to a process, a mark file which is used for indicating that the operation task is already distributed to the operation task is generated; when applying for distributing a certain calculation task, the other process tries to generate a mark file of the calculation task, and automatically applies for distributing the next calculation task by the other process under the condition that the mark file exists.

In the multi-process parallel operation process, the chances of allocating a certain operation task to each process are equal, if no measure is taken, multiple processes may be allocated to the same operation task, and the waste of operation resources is caused, so that some measure must be taken, and all operation tasks are uniquely allocated to a certain process. The simplest and most intuitive measure for achieving this is to assign a task a time stamp, i.e. a task is assigned to a process at the same time as it is marked so that other processes are no longer assigned the task. However, because the variables of each process are generally independent of each other during parallel operation, the operation tasks are asymmetric, the operation states of each process are different, and information distributed by any process through the variable marking task cannot be immediately transmitted to other processes, an external explicit marking method is needed to be adopted so that all processes can obtain the information once the operation tasks are marked. Therefore, if the operation task in the parallel particles is distributed to the process, the mark file of the operation task is immediately generated; when a process applies for distributing a certain operation task, the process will try to generate a mark file of the operation task, if the mark file exists, the operation task is indicated to be distributed, and the process will automatically apply for distributing the next operation task.

The specific implementation steps for realizing the correct allocation of the operation tasks by utilizing the marker files are as follows:

step A1, a process applies for distributioniAn arithmetic task;

step A2, judgmentiSign file of individual operation taskFiIf the current state does not exist, jumping to the step A8, and if the current state does not exist, jumping to the step A3;

step A3, judging the mark fileFiWhether the lock is locked or not, jumping to the step A8 if the lock is locked, and jumping to the step A4 if the lock is not locked;

step A4, locking the logo fileFi；

Step A5, generating a logo fileFi；

Step A6, marking fileFiUnlocking;

step A7, completing the first stepiCalculating the operation tasks;

step A8, judging whether all the operation tasks in the parallel particles are completed or not, if not, determining whether all the operation tasks in the parallel particles are completedi＝i+1 and returning to step a1, if finished, jumping to step a 9;

all the operation tasks required to be executed by the parallel particles are all distributed to all the processes, and the distribution of the parallel particles is finished; it returns to executing all the computational tasks that the other parallel grains need to perform to distribute their respective execution.

Further, the basic calculation unit, the layer-to-layer calculation unit of the integrated circuit, and the electromagnetic field and current distribution calculation units of the layers of the integrated circuit can realize the coupling among the layers of the approximately equivalent multilayer integrated circuit through different combination modes.

Specifically, for example, each iteration method can select the parallelism of layer-layer computing units of the integrated circuit, calculate the influence of different layers on other layers in parallel, and apply two-dimensional finite elements to different layers by using other parallel particles to calculate the electromagnetic field distribution and the current distribution of the layers, so as to update the electromagnetic field distribution and the current distribution of the layers; the electromagnetic field distribution and the current distribution of each layer of the integrated circuit can be calculated in parallel, each parallel particle comprises the influence of other layers on the layer, and the influence is used as a source item of the layer to apply a two-dimensional finite element to the layer to calculate the electromagnetic field distribution and the current distribution of the layer.

Further, as shown in fig. 2, the first-class basic computing unit specifically operates as follows:

first, as shown in fig. 4, calculating an electric field generated by a point current source at a field point, where an electric field expression generated by the point current source at the field point is a special analytical expression formed according to a special structure of an integrated circuit layer, and a specific expression of the analytical expression is as follows: aiming at the frequency domain electromagnetic field of the multilayer integrated circuit layout, the electric field intensity generated by a point source at any layer field point is calculated by adopting a dyadic Green function, and the electric field intensity of nine azimuths of any point at any layer of the multilayer integrated circuit layout can be solved through the following formula to represent the electric field intensity.

The electric field generated by the point current source at the field point is expressed as:

wherein the content of the first and second substances,

iis the unit of an imaginary number,i ²=-1；

representing a Bessel function of order 0;

representing a Bessel function of order 1;

expressed as a function of the Bessel integral coefficient,

；x, y, zthe coordinates of the field points are represented,

,

,

representing source point coordinates; angular frequency

，

Represents a frequency;

indicating that the site is at the second

A layer of a material selected from the group consisting of,

is as follows

At layer boundarieszCoordinates;

,

respectively represent

The number of complex waves in the horizontal and vertical directions of the layer;

respectively represent

A layer horizontal dielectric constant, a vertical dielectric constant;

,

respectively representlHorizontal magnetic conductivity and vertical magnetic conductivity of the layer;

is shown aslThe anisotropy coefficient of the layer;

,

respectively representlIntegral coefficients of complex wave numbers of the horizontal and vertical layers;

respectively representlThe undetermined coefficient of a layer,A _l, B _lthe following linear equation is solved:

T₁is 2n×2nThe complex number matrix of (a) is,

is of length 2nA complex vector of (a);

the following linear equation is solved:

T₂is 2n×2nThe complex number matrix of (a) is,

is of length 2nA complex vector of (a);

the following linear equation is solved:

T₃is 2n×2nThe complex number matrix of (a) is,

is of length 2nA complex vector of (a);

to representxOriented electric dipole in the second placelOf said electric field generated by said field points of the layerxA component;

to representxOriented electric dipole in the second placelOf said electric field generated by said field points of the layeryA component;

to representxOriented electric dipole in the second placelOf said electric field generated by said field points of the layerzA component;

to representyOriented electric dipole in the second placelOf said electric field generated by said field points of the layerxA component;

to representyOriented electric dipole in the second placelOf said electric field generated by said field points of the layeryA component;

to representyOriented electric dipole in the second placelOf said electric field generated by said field points of the layerzA component;

to representzOriented electric dipole in the second placelOf said electric field generated by said field points of the layerxA component;

to representzOriented electric dipole in the second placelOf said electric field generated by said field points of the layeryA component;

to representzOriented electric dipole in the second placelOf said electric field generated by said field points of the layerzAnd (4) components.

The current sources of the multi-layer integrated circuit are distributed in a layered manner, namely the current density distributed on each metal layer of the integrated circuit layout with a complex shape is only equal to that of the current sourcexAndythe axial direction is related tozAxial direction independence, current density distribution of onlyx，yAs a function of (c).

；

wherein the content of the first and second substances,E(x,y,z) At any point in space for the current source in the two-dimensional plane Sx,y,z) The field that is generated is,

is an arbitrary position within the two-dimensional surface S: (u,v) At any point in space (x,y,z) The expression of the dyadic green function of the generated field,

is the weight factor corresponding to the gaussian integral point;

Further, the specific method for calculating the two-dimensional finite element comprises the following steps:

for the direct current electric field model, the three-dimensional model of the multilayer integrated circuit refers to the conductivity in the direct current electric field model

Potential of the electrodeuAll the distributions of (A) and (B) are three-dimensional space coordinatesx,y,z) I.e.:

，

the function of the three-dimensional model satisfies the following equation (1):

in the equation (1),

and boundary condition (2):

in the formula

Is a boundary of the first type and is,nis normal to the boundary of the second type,

represents a potentialuAt the first kind boundary

Value of above, using

It is shown that,

bulk current density for external circuits;

the dimension of an actual PCB or a chip packaged board in the multilayer super large scale integrated circuit is far larger than the thickness of the metal layer, so that the three-dimensional direct current field problem of the multilayer integrated circuit is simplified into a two-dimensional direct current field problem;

the field solving equation set established by the finite element method for the two-dimensional model is an equation set (3):

in the formula (I), theI(u) In order to be a functional function,tis the thickness of the metal layer or layers,

as a grid celleThe electrical conductivity of (a) a (b),

as a grid celleThe potential of (a) is set to be,

as a grid celleThe area of (a) is,

as the density of the surface current, the current density,

representing grid cellseThe edge of (1);

for the alternating electromagnetic field model, the three-dimensional model of the multilayer integrated circuit refers to the dielectric constant in the three-dimensional model of the electromagnetic response characteristic in the frequency domain simulation of the multilayer VLSI

Magnetic permeability of

Electric field intensityEMagnetic field intensityHAll the distributions of (A) and (B) are three-dimensional space coordinatesx,y,z) I.e.:

,

,

，

the function of the three-dimensional model satisfies the following equation:

in the formulaJFor the purpose of the applied current density distribution,

for the angular frequency simulated for the integrated circuit,

indicating the strength of the magnetic fieldHThe degree of rotation of the screw is reduced,

indicates the electric field intensityEThe degree of rotation of the screw is reduced,jis the unit of an imaginary number,j ²=-1；

the board size of the actual PCB or chip package in the multilayer VLSI is far larger than the metal layer spacing, the three-dimensional model of the electromagnetic response characteristics in the frequency domain simulation of the multilayer VLSI is simplified into a two-dimensional model, and the dielectric constant in the model is at the moment

Magnetic permeability of

Electric field intensityEMagnetic field intensityHAll the distributions are two-dimensional plane coordinates (x,y) I.e.:

，

，

，

distribution thereof andzindependent of and potential in the fielduAnd surface current densityJ _sSatisfies the following conditions:

in the formula (I), the compound is shown in the specification,

respectively representx, y, zThe unit vector of the direction is,E _zof electric field strengthzThe direction component of the light beam is,H _xandH _yrespectively of magnetic field strengthxAndythe direction component of the light beam is,his the metal layer spacing;

through the simplification from the three-dimensional model to the two-dimensional model, the two-dimensional finite element functional extreme value formula corresponding to the two-dimensional model is obtained as follows:

in the formula (I), the compound is shown in the specification,

in order to be a functional function,

it is shown that the extreme value is taken for the functional,

as a grid celliThe surface admittance of the first and second electrodes,

is a boundary

The boundary condition of the opening of (a),u _kis a boundary

The distribution of the electric potential on the upper side,

indicating a position to the right of the boundary and infinitely close to the boundary,

indicating a position to the left of the boundary and infinitely close to the boundary,

representing grid cellsiThe area of (a) is,

as a grid celliThe current density of (a) is,

as a grid celliThe surface resistance of the glass substrate is higher than the surface resistance of the glass substrate,

as a grid celliThe potential of (a) is set to be,kis referred to askAnd (4) a boundary.

The method can realize different particle paralleling in the coupled iterative calculation among the layers of the multilayer very large scale integrated circuit, greatly reduces the communication among the processes and the waiting time generated by synchronization, simultaneously, adopts a random dynamic allocation method of the calculation tasks, ensures that the calculation models with unequal complexity are randomly and uniformly distributed on each calculation node, and avoids the bottleneck of hard disk read-write caused by virtual memory access due to overhigh peak value memory.

A second embodiment of the partitioning method and apparatus for iteratively calculating parallel particles for interlayer coupling of an integrated circuit according to the present invention is described in detail with reference to fig. 3. The partitioning apparatus for iterative parallel particle computation in integrated circuit interlayer coupling provided in this embodiment comprises a computing unit partitioning module, a computing particle partitioning module, a parallel particle operation module,

the calculation unit division module is used for dividing the iterative calculation of the interlayer coupling of the multilayer ultra-large scale integrated circuit into three types of calculation units: a basic calculation unit, a layer-to-layer calculation unit of the integrated circuit, and an electromagnetic field and current distribution calculation unit of each layer of the integrated circuit.

The calculation particle dividing module is used for dividing the three types of calculation units into calculation particles which are not overlapped with each other.

The calculation formula for calculating the weighted CPU time of the particles is as follows:

in the formula:

is as followsiThe weighted CPU time of each calculated grain,

is as followsiEach calculation particle has a single calculated CPU time,

is as followsiThe number of particle executions was counted.

The calculation formula of the total CPU time in the whole calculation process is as follows:

is as followsiThe weighted CPU time of the particles is calculated.

And sequencing the weighted CPU time of each calculation particle according to the descending order and sequentially accumulating until the accumulated sum exceeds 90% of the total CPU time, and taking each calculation particle in the accumulated sum as a parallel particle.

The parallel particle operation module is used for randomly disordering the sequences of all operation tasks executed by the same parallel particle to form a new operation task sequence, and distributing all operation tasks executed by the parallel particle to all processes according to the new operation task sequence to complete the parallel operation of the operation tasks.

firstly, the sequence of operation tasks

Correspondingly generating random number sequences

，m＝1,2,3,…,M. Then to the sequence

The sequences are sorted from small to large, and the sorted sequences are

. Finally, generating new non-repeated operation task sequence

，

Is composed of

In that

Of (c) is used.

Randomly disorganized to generate new non-repetitive operation task sequence

And then distributing the operation tasks according to the sequence, namely equivalently distributing the original operation tasks randomly, wherein the random distribution strategy is characterized in that a random distribution scheme can completely disturb the distribution sequence of all the operation tasks, so that the sum of the peak value memory occupied by the tasks operated by all the operation nodes at the same time is determined by the average value of the process number and the peak value memory occupied by all the models (calculation particles) rather than the maximum value.

step A1, a process applies for distributioniAn arithmetic task;

step A4, locking the logo fileFi；

Step A5, generating a logo fileFi；

Step A6, marking fileFiUnlocking;

step A7, completing the first stepiCalculating the operation tasks;

wherein the content of the first and second substances,

iis the unit of an imaginary number,i ²=-1；

representing a Bessel function of order 0;

representing a Bessel function of order 1;

expressed as a function of the Bessel integral coefficient,

；x, y, zthe coordinates of the field points are represented,

,

,

representing source point coordinates; angular frequency

，

Represents a frequency;

indicating that the site is at the second

A layer of a material selected from the group consisting of,

is as follows

At layer boundarieszCoordinates;

,

respectively represent

respectively represent

A layer horizontal dielectric constant, a vertical dielectric constant;

,

is shown aslThe anisotropy coefficient of the layer;

,

T₁is 2n×2nThe complex number matrix of (a) is,

is of length 2nA complex vector of (a);

the following linear equation is solved:

T₂is 2n×2nThe complex number matrix of (a) is,

is of length 2nA complex vector of (a);

the following linear equation is solved:

T₃is 2n×2nThe complex number matrix of (a) is,

is of length 2nA complex vector of (a);

to representyOriented electric dipole in the second placelSaid field point generation of a layerOf said electric fieldzA component;

；

is the twoAny position in the dimension S: (u,v) At any point in space (x,y,z) The expression of the dyadic green function of the generated field,

is the weight factor corresponding to the gaussian integral point;

，

in the equation (1),

and boundary condition (2):

in the formula

represents a potentialuAt the first kind boundary

Value of above, using

It is shown that,

bulk current density for external circuits;

as a grid celleThe electrical conductivity of (a) a (b),

as a grid celleThe potential of (a) is set to be,

as a grid celleThe area of (a) is,

as the density of the surface current, the current density,

representing grid cellseThe edge of (1);

Magnetic permeability of

,

,

，

the function of the three-dimensional model satisfies the following equation:

in the formulaJFor the purpose of the applied current density distribution,

for the angular frequency simulated for the integrated circuit,

Magnetic permeability of

，

，

，

in the formula (I), the compound is shown in the specification,

in the formula (I), the compound is shown in the specification,

in order to be a functional function,

it is shown that the extreme value is taken for the functional,

as a grid celliThe surface admittance of the first and second electrodes,

is a boundary

The boundary condition of the opening of (a),u _kis a boundary

The distribution of the electric potential on the upper side,

representing grid cellsiThe area of (a) is,

as a grid celliThe current density of (a) is,

The device can realize different particle parallels in the coupled iterative computation among the layers of the multilayer very large scale integrated circuit, greatly reduces the communication among the processes and the waiting time generated by synchronization, simultaneously, adopts a random dynamic allocation method of computation tasks, ensures that computation models with unequal complexity are randomly and uniformly distributed on each computation node, and avoids the bottleneck of hard disk read-write caused by virtual memory access due to overhigh peak memory.

The above description is only for the specific embodiment of the present invention, but the scope of the present invention is not limited thereto, and any changes or substitutions that can be easily conceived by those skilled in the art within the technical scope of the present invention are included in the scope of the present invention. Therefore, the protection scope of the present invention shall be subject to the protection scope of the appended claims.

Claims

1. A partitioning method for iterative computation parallel particles of integrated circuit interlayer coupling is characterized by comprising the following steps:

the basic computing unit includes two types: the first kind of basic calculation unit is used for calculating the field generated by the current in the simple polygon of the integrated circuit in any simple polygon on other layers of the integrated circuit by utilizing a dyadic Green function, and the second kind of basic calculation unit is used for calculating the grid unit in the single-layer two-dimensional finite element calculation of the integrated circuit; the calculation of the grid unit in the integrated circuit single-layer two-dimensional finite element calculation comprises the following steps: calculating the area of the grid unit, calculating a shape function of the grid unit, calculating a finite element rigidity matrix of the grid unit, and calculating the field intensity and the current density of any point in the grid unit;

the layer-to-layer computation unit of the integrated circuit comprises: combining the first type of basic calculation units, and calculating fields generated by currents distributed on the complex integrated circuit layout filled by the simple polygons on the complex layouts of other layers of the integrated circuit based on the field linear superposition principle; the electromagnetic field and current distribution calculating unit of each layer of the integrated circuit comprises: combining the second type of basic computing units, taking the influence of other layers on the source layer to be computed as an additional source item, and computing the electromagnetic field and current distribution of the source layer to be computed by adopting a two-dimensional finite element method;

2. The method of partitioning iterative computation parallel particles of integrated circuit inter-layer coupling according to claim 1, wherein said first class of basic computation elements are specifically operated as:

；

wherein the content of the first and second substances,

is the weight factor corresponding to the gaussian integral point;

3. The method of partitioning iterative computation parallel granularity of integrated circuit layer-to-layer coupling according to claim 1, wherein said computing a weighted CPU time of the granularity is represented by the formula:

in the formula:

is as followsiThe weighted CPU time of each calculated grain,

is as followsiEach calculation particle has a single calculated CPU time,

is as followsiA counting particleThe number of executions.

4. The method of partitioning iterative computation of parallel particles of integrated circuit layer-to-layer coupling as recited in claim 3, wherein the total CPU time over the entire computation process is calculated by the formula:

is as followsiThe weighted CPU time of the particles is calculated.

5. The method of partitioning iterative computation parallel grains of integrated circuit layer coupling as recited in claim 4, wherein weighted CPU times for each compute grain are sorted in descending order and accumulated sequentially until an accumulated sum exceeds 90% of said total CPU time, each compute grain in said accumulated sum being treated as a parallel grain.

6. The device for partitioning iterative computation parallel particles coupled among integrated circuit layers is characterized by comprising a computing unit partitioning module, a computing particle partitioning module, a parallel particle partitioning module and a parallel particle operation module,

7. The apparatus for partitioning parallel particles for iterative computation of coupling between layers of an integrated circuit according to claim 6, wherein said first class of basic computation elements are specifically operated as:

first, calculate the electric field generated by the point current source at the field pointThe electric field expression of the point current source generated at the field point is a special analytical expression formed according to the layered special structure of the integrated circuit, and the current sources of the multilayer integrated circuit are distributed in a layered mode, namely the current density distributed on each metal layer of the integrated circuit layout with a complex shape is only equal to that of the integrated circuit layout with a complex shapexAndyis related tozIndependently, the current density distribution is onlyx, yA function of (a);

；

wherein the content of the first and second substances,

is the weight factor corresponding to the gaussian integral point;