JPH03256117A

JPH03256117A - Multiplier

Info

Publication number: JPH03256117A
Application number: JP2053791A
Authority: JP
Inventors: Sukehiro Ootsuka; 大塚　左洋
Original assignee: Fujitsu Ltd
Current assignee: Fujitsu Ltd
Priority date: 1990-03-07
Filing date: 1990-03-07
Publication date: 1991-11-14

Abstract

PURPOSE:To reduce the quantity of hardware and to shorten a delay time by dividing multiplication into several steps by a reduced instruction set computer architecture, and executing the multiplication of each step by the same circuit. CONSTITUTION:A multiplication multiplier part instructing means 5 partitions off a multiplier into plural continuous bit parts in order to divide multiplication into plural steps and specifies which bit part out of plural ones is to be multiplied by a multiplicand. The sum of a product of a multiplicand stored in a multiplicand register 1 and one of plural bit parts of a multiplier stored in a multiplier register 2 and a part not defined as a multiplied result out of the contents of a multiplied result storing register 3 is shifted in accordance with a specification outputted from a multiplication multiplier part instructing means 5 and the shifted result is stored in the register 3. The step is repeatedly executed from the lower bit side of the plural bit parts of the multiplication to reduce quantity of hardware and to shorten the delay time.

Description

【発明の詳細な説明】〔概　　　要〕整数演算器における乗算を命令の機能を縮小したＲＩＳ
Ｃ（リデュースドインストラクションセントコンピュー
タ）アーキテクチャで実行する乗算器に関し、ＲＩＳＣアーキテクチャにより乗算を何ステップかに分
割し、それぞれのステップの乗算を同一の回路で実行す
ることによってハードウェア量を削減し、遅延時間を小
さくすることを目的とし、被乗数レジスタ、乗数レジス
タ、および乗算結果保持レジスタを有する乗算器におい
て、乗算を複数のステップに分割するために乗数を複数
の連続するビット部分に区分し、該複数のビット部分の
うちどの部分と被乗数とを乗算しているかを示す乗算乗
数部分指示手段を設け、前記被乗数レジスタ内の被乗数
と前記乗数レジスタ内の乗数の前記複数のビット部分の
１つとの積と、乗算結果保持レジスタの内容のうち乗算
結果として確定されていない部分との和を該乗算乗数部
分指示手段の指示に従ってシフトした結果を該乗算結果
保持レジスタに格納するステップを前記乗数の複数のビ
ット部分の下位ビット側から繰り返すように構成する。[Detailed Description of the Invention] [Summary] RIS with reduced instruction function for multiplication in an integer arithmetic unit
Regarding multipliers that run on the C (Reduced Instruction Cent Computer) architecture, the RISC architecture divides multiplication into several steps and executes each step of multiplication in the same circuit, reducing the amount of hardware and reducing delays. For the purpose of reducing time, in a multiplier having a multiplicand register, a multiplier register, and a multiplication result holding register, the multiplier is partitioned into a plurality of consecutive bit parts in order to divide the multiplication into a plurality of steps. a multiplier part indicating means for indicating which part of the bit parts of the multiplicand is being multiplied by the multiplicand, the product of the multiplicand in the multiplicand register and one of the plurality of bit parts of the multiplier in the multiplier register; , the step of storing the result of shifting the sum of the contents of the multiplication result holding register with a portion that has not been determined as the multiplication result in accordance with the instructions of the multiplication multiplier part indicating means in the multiplication result holding register; It is configured to repeat from the lower bit side of the part.

〔産業上の利用分野〕[Industrial application field]

本発明は計算機による乗算方式に係り、さらに詳しくは
整数演算器における乗算を命令の機能を縮小したｔＳＣ
（リデュースドインストラクションセントコンピュータ
）アーキテクチャで実行する乗算器に関する。The present invention relates to a multiplication method by a computer, and more specifically, the present invention relates to a multiplication method by a computer, and more particularly, the present invention relates to a multiplication method using an integer arithmetic unit using a tSC with reduced instruction functions.
(reduced instruction cent computer) architecture.

〔従来の技術及び発明が解決しようとする課題〕近年計
算機の性能向上のために演算器の処理速度の高速化とハ
ードウェアの削減が要求されている。従来の乗算器にお
ける最も単純な乗算方式としては、ｎビットの２進数の
乗算を行うに際して被乗数の各ビットと乗数の各ビット
とを乗算し、合計ｎ２個の１ビツト部分積を発生させ、
これらの部分積を桁が合うようにシフトして加算し、積
を得るという方法を用いていた。しかしながら、この方
法では部分積の数が多く、全ての部分積をシフトして加
算するためには膨大なハードウェア量を必要とし、また
結果を得るまでに大きな遅延時間を生ずるという問題点
があった。[Prior Art and Problems to be Solved by the Invention] In recent years, in order to improve the performance of computers, it has been required to increase the processing speed of arithmetic units and reduce the amount of hardware. The simplest multiplication method in a conventional multiplier is to multiply n-bit binary numbers by multiplying each bit of the multiplicand by each bit of the multiplier to generate a total of n2 1-bit partial products.
The method used was to shift these partial products so that the digits matched and add them to obtain the product. However, this method has the problem that there are a large number of partial products, a huge amount of hardware is required to shift and add all the partial products, and there is a large delay time before obtaining the results. Ta.

従ってこのような単純な形式に対して各種の工夫が施さ
れた回路が用いられているが、その１つの代表的な方法
はＢｏｏｔｈのアルゴリズムとＷａｌｌａｃｅのトリー
とを組み合せた乗算回路である。Ｂｏｏ　ｔｈのアルゴ
リズムは偶数ビットの乗数を２ビツトずつの組に分割し
、部分積の個数を半分にする方法であって、各部分積を
作る倍数は０．±１．±２倍となり、シフト回路とイン
バータだけで回路を構成することができる。また乗数の
符号ビットを含んだ形になっているために部分積の加算
の中に符号ビットの処理が含まれており、補正の計算を
必要としないという特徴がある。Therefore, circuits with various improvements are used for such a simple format, and one typical method is a multiplication circuit that combines Booth's algorithm and Wallace's tree. Booth's algorithm divides the even-bit multiplier into groups of 2 bits each, halving the number of partial products, and the multiple used to create each partial product is 0. ±1. ±2 times, and the circuit can be configured with only a shift circuit and an inverter. Furthermore, since the form includes the sign bit of the multiplier, processing of the sign bit is included in the addition of partial products, and there is no need for correction calculation.

またＷａｌｌａｃｅのトリーは、部分積の加算段数を減
らすために連続的に１個ずつ加算せず、加算回路をトリ
ー状に積み上げるものである。このトリーでは全加算回
路を３人力２出力の加算回路と考え、これを組み合せて
複数項の人力を加算して項数を減らしていくという方法
が取られる。ＢｏｏｔｈのアルゴリズムとＷａｌｌａｃ
ｅのトリーとを組み合せた場合には、最終段に桁上伝播
加算回路が使用される。Furthermore, in Wallace's tree, in order to reduce the number of partial product addition stages, adding circuits are piled up in a tree shape instead of adding them one by one successively. In this tree, the full adder circuit is considered to be an adder circuit with three human inputs and two outputs, and a method is used in which the total adder circuit is combined and the human input of multiple terms is added to reduce the number of terms. Booth's algorithm and Wallac
When combining with a tree of e, a carry propagation adder circuit is used in the final stage.

しかしながら、このようにＢｏｏ　ｔｈのアルゴリズム
とＷａｌｌａｃｅのトリーとを組み合わせた乗算回路で
も、乗数と被乗数のビット数が多くなると積を計算する
ためのハードウェア量は膨大となり、また遅延時間が大
きくなるという問題点の解決には有効ではない。However, even in a multiplication circuit that combines Booth's algorithm and Wallace's tree, as the number of bits in the multiplier and multiplicand increases, the amount of hardware required to calculate the product becomes enormous, and the delay time increases. It is not effective in solving problems.

本発明は、ＲＩＳＣアーキテクチャにより乗算を何ステ
ップかに分割し、それぞれのステップの乗算を同一の回
路で実行することによってハードウェア量を削減し、遅
延時間を小さくすることを目的とする。The present invention aims to reduce the amount of hardware and delay time by dividing multiplication into several steps using the RISC architecture and executing the multiplication in each step in the same circuit.

〔課題を解決するための手段〕[Means to solve the problem]

第１図は本発明の原理ブロック図である。同図において
被乗数レジスタ１ば被乗数を、乗数レジスタ２は乗数を
保持し、また乗算結果保持レジスタ３は乗算結果を保持
するものである。乗算器４は例えばＢｏｏ　ｔｈのデコ
ーダとＩＡａ　１１ａｃｅのトリーとを用いた乗算器で
ある。FIG. 1 is a block diagram of the principle of the present invention. In the figure, multiplicand register 1 holds the multiplicand, multiplier register 2 holds the multiplier, and multiplication result holding register 3 holds the multiplication result. The multiplier 4 is a multiplier using, for example, a Booth decoder and an IAa 11ace tree.

乗算乗数部分指示手段５は、乗算を複数のステップに分
割するために、複数の連続するビット部分に分割された
乗数のうちでどのビット部分と被乗数とを乗算している
かを示すものである。例えば乗数と被乗数とがそれぞれ
４バイトである場合には、乗数が１バイトずつに分割さ
れ、被乗数４バイトと乗数のうちの１バイトとの乗算が
計４回、すなわち４ステップ行わ、れるが、乗算乗数部
分指示手段５は乗数のうちのどのバイトと被乗数とが乗
算されているかを示すことになる。The multiplication multiplier part indicating means 5 indicates which bit part of the multiplier divided into a plurality of consecutive bit parts is to be multiplied by the multiplicand in order to divide the multiplication into a plurality of steps. For example, when the multiplier and the multiplicand are each 4 bytes, the multiplier is divided into 1 byte each, and the 4 bytes of the multiplicand and 1 byte of the multiplier are multiplied 4 times in total, that is, in 4 steps. The multiplication part indicating means 5 indicates which byte of the multiplier is multiplied by the multiplicand.

〔作　　　用〕[For production]

本発明においては乗算が複数のステップに分割されて実
行される。例えば４バイト×４バイトの乗算を実行する
ためには、４バイト×１バイトの乗算が４回繰り返され
る。すなわち、例えば被乗数レジスタ１内の被乗数４バ
イトと乗数レジスタ２内の乗数の１バイトとの積と乗算
結果保持レジスタ３の内容のうちで乗算結果として確定
されていない部分とが加算され、乗算乗数部分指示手段
５の指示に従ってそれがシフトされた結果が再び乗算結
果保持レジスタ３に格納されるが、この動作が乗数の下
位ビット側から、例えば乗数の最下位バイトから開始さ
れて乗数の最上位バイトと被乗数との積が取られるまで
繰り返される。乗数の最下位バイトと被乗数との積が取
られるステップでは乗算結果保持レジスタ３の内容はク
リア状態（０）であり、またその積はシフトされること
なく乗算結果保持レジスタ３に格納され、その最下位１
バイトは乗算結果として確定される。次に乗数の下位側
から２バイト目と被乗数との積は乗算結果保持レジスタ
３の内容のうち確定された最下位１バイトを除く部分と
加算され、乗算乗数部分指示手段５の指示に従って８ビ
ツトシフトされて乗算結果保持レジスタ３に格納される
。In the present invention, multiplication is divided into multiple steps and executed. For example, to perform a 4 byte x 4 byte multiplication, the 4 byte x 1 byte multiplication is repeated four times. That is, for example, the product of 4 bytes of the multiplicand in the multiplicand register 1 and 1 byte of the multiplier in the multiplier register 2 and the part of the contents of the multiplication result holding register 3 that is not determined as the multiplication result are added, and the multiplier is obtained. The result of shifting according to the instruction from the partial instruction means 5 is stored again in the multiplication result holding register 3, but this operation starts from the lower bit side of the multiplier, for example, from the least significant byte of the multiplier, and then from the most significant byte of the multiplier. Iterates until the byte is multiplied by the multiplicand. At the step where the lowest byte of the multiplier and the multiplicand are multiplied, the contents of the multiplication result holding register 3 are in a clear state (0), and the product is stored in the multiplication result holding register 3 without being shifted. bottom 1
The bytes are determined as the result of the multiplication. Next, the product of the second byte from the least significant side of the multiplier and the multiplicand is added to the content of the multiplication result holding register 3 excluding the determined lowest 1 byte, and shifted by 8 bits according to the instruction of the multiplication multiplier portion instruction means 5. and stored in the multiplication result holding register 3.

以上のように、本発明においては乗算が複数のステップ
に分割され同一の回路で実行されることになる。なお本
発明では乗算乗数部分指示手段５は特に独立のハードウ
ェアとしては設けられず、乗数レジスタ２の中でデータ
が不要になったビットを用いてその作用が行われる。As described above, in the present invention, multiplication is divided into multiple steps and executed by the same circuit. Incidentally, in the present invention, the multiplication/multiplier portion instruction means 5 is not particularly provided as independent hardware, and its function is performed using bits whose data is no longer needed in the multiplier register 2.

〔実　　施　　例〕〔Example〕

本発明の実施例として、乗数４ハイドと被乗数４バイト
との整数乗算を４つのステップに分割し、各ステップで
は４ハイド×１バイトの乗算を行って４つのステップに
よって最終的な積を求める場合を考える。被乗数４ハイ
ドをａｂｃｄ、乗数４バイトをｅｆｇｈ（ａ−ｈはそれ
ぞれ１バイトのデータとする）とし、第１ステツプでは
被乗数４バイトと乗数の最下位バイトとの積、すなわち
ａｂｃｄＸｈが実行される。この演算によって生成され
た４０ビツト　（３２ビツト×８ビツト）のＳｕｍとＣ
ａｒｒｙとがＣＰＡ　（桁上伝播加算回路）の人力とさ
れ、４０ビツトの部分積が生成されてその結果は部分積
レジスタ（ＰＤ−ｒｅｇ）に格納される。As an embodiment of the present invention, integer multiplication of 4 hides multiplier and 4 bytes multiplicand is divided into 4 steps, and in each step, multiplication of 4 hides x 1 byte is performed to obtain the final product in 4 steps. think of. Assuming that the multiplicand 4 bytes is abcd and the multiplicand 4 bytes is efgh (each of ah is 1 byte of data), the product of the 4 bytes of the multiplicand and the least significant byte of the multiplier, that is, abcdXh, is executed in the first step. The 40-bit (32 bits x 8 bits) Sum and C generated by this operation
40-bit partial product is generated and the result is stored in the partial product register (PD-reg).

そして次のステップに備えて乗数レジスタの内容が１ハ
イド右シフトされる。The contents of the multiplier register are then right-shifted by one hide in preparation for the next step.

第２図は第１ステツプ終了後の乗数レジスタの内容の実
施例である。同図において、最上位１バイトは不要とな
るので、このハイドは第２ステツプ以後のステップで生
成される部分積を部分積レジスタに格納する際に必要な
シフト数の制御に使用される。FIG. 2 is an example of the contents of the multiplier register after the first step. In the figure, since the most significant byte is unnecessary, this hide is used to control the number of shifts required when storing partial products generated in the second and subsequent steps in the partial product register.

第２ステツプでは、第１ステツプと同様に被乗数４バイ
トと乗数のうち下位側から２バイト目の１バイトとの積
（ａｂｃｄＸｇ）が実行される。In the second step, as in the first step, the product (abcdXg) of the 4 bytes of the multiplicand and the 1 byte of the second byte from the lowest order of the multiplier is executed.

そして桁上伝播加算回路の出力が後述するように乗数レ
ジスタの最上位２ビツトを用いて作られる４進カウンタ
の内容に従って１バイト左シフトされ、部分積レジスタ
に格納された後に乗数レジスタの内容が更に１ハイド右
シフトされる。第３ステツプではａｂｃｄｘｆの乗算が
、また第４ステツプではａｂｃｄＸｅの乗算が第２ステ
ツプと同様にして実行され、桁上伝播加算回路の出力が
１バイト左シフトして部分積レジスタに格納され、また
乗数レジスタの内容が１バイト右シフトされる。Then, the output of the carry propagation adder circuit is shifted to the left by 1 byte according to the contents of a quaternary counter created using the most significant two bits of the multiplier register, as described later, and stored in the partial product register, after which the contents of the multiplier register are It is further shifted to the right by one hide. In the third step, multiplication by abcdxf is performed, and in the fourth step, multiplication by abcdXe is performed in the same manner as in the second step, and the output of the carry propagation adder circuit is shifted to the left by one byte and stored in the partial product register. The contents of the multiplier register are shifted right by one byte.

第３図は本発明の乗算器の全体構成ブロック図である。FIG. 3 is a block diagram of the overall configuration of the multiplier of the present invention.

同図において乗算器は乗数レジスタ１１を含むＢｏｏｔ
ｈのデコーダ１０、被乗数レジスタ１２　、Ｂｏｏｔｈ
のアルゴリズムに従って部分積を求めるために必要なイ
ンバータ１３、および１ビツトシフト回路１４　ａ、　
　１４　ｂ、　ＢｏｏｔｈのデコーダＩＯの出力に応じ
て被乗数レジスタ１２、インバータ１３および１ビツト
シフト回路１４ａ、１４ｂのいずれかの出力を選択する
ためのセレクタ１５ａ　〜１５　ｅ　、　Ｗａｌｌａｃ
ｅのトリー１６、桁上伝播加算回路（ＣＰＡ）１７、お
よび部分積レジスタ（ＰＤ−ｒｅｇ）１３から構成され
ている。In the same figure, the multiplier is Boot which includes multiplier register 11.
h decoder 10, multiplicand register 12, Booth
Inverter 13 and 1-bit shift circuit 14a, which are necessary to obtain partial products according to the algorithm of
Selectors 15a to 15e, Wallac for selecting the output of the multiplicand register 12, inverter 13, and 1-bit shift circuits 14a, 14b according to the output of the decoder IO of Booth 14b.
e tree 16, a carry propagation adder (CPA) 17, and a partial product register (PD-reg) 13.

第４図は複数のステップに分割された乗算の各ステップ
における部分積レジスタの生成の実施例である。本発明
においては、例えば４バイト×１０ハイドの乗算ステップが４回繰り返され、最終的に４ハ
イド×４バイトの積が得られるが、１つのステップにお
ける部分積は４０ビツトとなり、その部分積は部分積レ
ジスタ（ＰＤ＝ｒｅｇ）に格納される。１回のステップ
では部分積４０ビツトのうちの下位８ビツトが確定され
る。本発明では４回のステップのうち最初のステップを
ｍｕｌｓｐ　　（マルチプライステッププロローグ）、
第２から第４のステップをｍｕｌｓ　　（マルチプライ
スチップ）と名付ける第４図において、Ｗａｌｌａｃｅのトリーでは前のステ
ップで生成された部分積レジスタの内容（最初のステッ
プｍｕｌｓｐではクリア状態）のうち確定された８ビツ
トを除＜３２ビツトと、Ｂｏｏｔｈのデコーダにおいて
乗数の１バイトと拡張された符号ビットを含む計ｌＯビ
ットが２ビツトずつの組に区切られ、各２ビツトの組と
下位の組の上位ビットの合計３ビツトのビットパターン
で決定される出力に応じたセレクタ１５ａ〜１５ｅの出
力結果、すなわち各３３ビツトのＰａ−Ｐｄと３２ビツ
トのＰｅの６つのデータが図に示すように左シフトされ
て加算される。生成された４０ビツトの部分積は、前の
ステップにおける部分積レジスタへの格納位置から８ビ
ツト左シフトされた位置に格納され、そのうちの下位８
ビツトは確定されることになる。FIG. 4 is an example of generation of partial product registers in each step of multiplication divided into a plurality of steps. In the present invention, for example, the multiplication step of 4 bytes x 10 hides is repeated 4 times to finally obtain a product of 4 hides x 4 bytes, but the partial product in one step is 40 bits, and the partial product is It is stored in the partial product register (PD=reg). In one step, the lower 8 bits of the 40 bits of the partial product are determined. In the present invention, the first step of the four steps is mulsp (multiply step prologue),
In Figure 4, where the second to fourth steps are named muls (multi-priced chip), Wallace's tree determines the contents of the partial product register (cleared in the first step mulsp) generated in the previous step. In the Booth decoder, a total of 10 bits, including the 8 bits except < 32 bits, the 1 byte of the multiplier and the extended sign bit, are divided into groups of 2 bits, each 2-bit group and the upper half of the lower group. The output results of the selectors 15a to 15e according to the output determined by the bit pattern of a total of 3 bits, that is, 6 data of 33 bits Pa-Pd and 32 bits Pe, are shifted to the left as shown in the figure. will be added. The generated 40-bit partial product is stored in a position shifted to the left by 8 bits from the storage position in the partial product register in the previous step, and the lower 8
The bit will be confirmed.

第５図は部分積レジスタの実施例の構成を示す。FIG. 5 shows the structure of an embodiment of the partial product register.

同図において部分積レジスタ（ＰＤ−ｒｅｇ）は、４バ
イト×４バイトの乗算を行って、その結果を格納するた
めに６４ビツト必要であり、下位３２ビツトのＰＤＬ−
ｒｅｇと上位３２ビツトのＰＤＵ−ｒｅｇ）とから構成
される。In the figure, the partial product register (PD-reg) requires 64 bits to perform multiplication of 4 bytes x 4 bytes and store the result, and the PDL-reg of the lower 32 bits is
reg and the upper 32 bits of PDU-reg).

第６図は部分積レジスタへの部分積の格納方法の実施例
である。同図において、最初のステップｍｕｌｓｐでは
部分積はそのままシフトされることなく部分積レジスタ
に格納され、そのうち下位８ビツトが確定される。第２
のステップｍｕｌｓでは部分積４０ビツトが８ビツト左
シフトされて部分積レジスタに格納され、そのうち最下
位８ビツトが確定される。第３のステップｍｕｌｓでも
同様に４０ビツトの部分積がさらに８ビツト左シフトさ
れて格納さ１２れ、そのうち最下位８ビツトが確定される。最後のステ
ップｍｕｌｓでは部分積４０ビツトがさらに８ビツト左
シフトされて部分積レジスタに格納され、その内容は最
終結果の一部となる。FIG. 6 is an example of a method of storing partial products in a partial product register. In the figure, in the first step mulsp, the partial products are stored as they are in the partial product register without being shifted, and the lower 8 bits of them are determined. Second
In step muls, the 40 bits of the partial product are shifted to the left by 8 bits and stored in the partial product register, of which the least significant 8 bits are determined. Similarly, in the third step muls, the 40-bit partial product is further shifted to the left by 8 bits and stored 1 2 , of which the least significant 8 bits are determined. In the final step muls, the 40-bit partial product is further shifted to the left by 8 bits and stored in the partial product register, the contents of which become part of the final result.

第６図に示すように、各ステップにおける４バイト×１
バイトの部分積は、部分積レジスタに格納される際にそ
のステップに応して必要なビット数だけ左シフトされて
格納される。何ビット（バイト）シフトして格納するか
を制御するために、第３図の乗数レジスタ１１　（実際
には第２図に示したように４バイトの容量がある）の最
上位２ビツトを用いて４進カウンタが作成される。本実
施例においてはｍｕｌｓ命令が３回繰り返されるが、例
えば割り込みが発生した場合でもこれによって何回目の
ｍｕｌｓ命令を実行中であったかを知ることができる。As shown in Figure 6, 4 bytes x 1 in each step
When the partial product of a byte is stored in the partial product register, it is shifted to the left by the number of bits required according to the step and then stored. In order to control how many bits (bytes) to shift and store, the two most significant bits of the multiplier register 11 in Figure 3 (actually has a capacity of 4 bytes as shown in Figure 2) are used. A quaternary counter is created. In this embodiment, the muls instruction is repeated three times, but even if an interrupt occurs, for example, it is possible to know how many times the muls instruction is being executed.

第７図は乗数（ＭＰ）レジスタの最上位２ビツトを用い
て作られる４進カウンタの内容の実施例である。同図に
おいてＭＰ　（０）、ＭＰ　（１）は前のステップにお
けるＭＰレジスタの最上位２ビツトの内容を、またｂｉ
ｔｏ、ｂｉｔｌは現在のステップにおけるＭＰレジスタ
の最上位２ビツトの内容を示す。同図において各ステッ
プにおける４進カウンタの内容ｂｉｔｏとｂｉｔｌは次
式によって決定される。FIG. 7 is an example of the contents of a quaternary counter created using the two most significant bits of the multiplier (MP) register. In the same figure, MP (0) and MP (1) are the contents of the most significant 2 bits of the MP register in the previous step, and
to and bitl indicate the contents of the most significant two bits of the MP register at the current step. In the figure, the contents bito and bitl of the quaternary counter at each step are determined by the following equation.

ｂ　ｉ　ｔ　Ｏ＝　（ＭＰ　（０）→−ＭＰ　（１））
　　・丁丁ｂｉｔｌ＝内丁ＴＴＴ−丁丁ここで、ＳＰとしてｍｕｌｓｐ命令の時に制御信号の１
が人力され、ＳＰはｍｕｌｓｐ命令以外の命令を実行し
ているということを意味している。b it O= (MP (0)→-MP (1))
・Ding bitl=Inner TTT−Ding Here, when the SP is the mulsp command, the control signal 1
This means that the SP is executed manually and the SP executes an instruction other than the mulsp instruction.

第８図は各ステップにおける４進カウンタの内容の設定
法の実施例である。同図において最初のステップ、すな
わちｍｕｌｓｐ命令においてはＳＰの値は１であり、ま
たＭＰ　（０）およびＭＰ　（１）は共にクリア状態の
値Ｏを用いるものとしてｂｉｔｏ、ｂｉｔｌＯ値は共に
Ｏとなり、例えばこの命令の最後にこれらの値が乗数レ
ジスタの最上位２ビツトに格納される。第２ステツプ以
降のｍｕｌｓ命令においてはＳＰの値はＯとされ、それ
ぞれ前のステップにおけるｂｉｔｏ、ｂｉｔｌの値と一
部３４致するＭＰ　（０）、ＭＰ　（１）の値を用いて４進カ
ウンクの内容が設定される。FIG. 8 shows an example of a method for setting the contents of a quaternary counter in each step. In the same figure, in the first step, that is, in the mulsp instruction, the value of SP is 1, and the value O in the clear state is used for both MP (0) and MP (1), and the bito and bitlO values are both O, For example, at the end of this instruction, these values are stored in the two most significant bits of the multiplier register. In the muls instruction after the second step, the value of SP is set to O, and the values of MP (0) and MP (1), which partially match the values of bito and bitl in the previous step, are used to calculate The count contents are set.

〔発明の効果〕以上詳細に説明したように、本発明によれば乗算が複数
のステップに分割されて同一の回路で実行されるために
、乗算器のハードウェアが大幅に削減される。また演算
の実行が小さなりロックサイクルで可能となり、遅延時
間を短くすることができる。さらに新たなハードウェア
を増設することなく、部分積レジスタへの部分積の格納
位置を制御することが可能となる。[Effects of the Invention] As described above in detail, according to the present invention, multiplication is divided into a plurality of steps and executed by the same circuit, so that the hardware of the multiplier can be significantly reduced. In addition, calculations can be executed in a small lock cycle, and the delay time can be shortened. Furthermore, it becomes possible to control the storage position of partial products in the partial product register without adding new hardware.

【図面の簡単な説明】[Brief explanation of drawings]

第１図は本発明の原理ブロック図、第２図は第１ステツプ終了後の乗数レジスタの内容の実
施例を示す図、第３図は乗算器の実施例の全体構成を示すブロック図、第４図は各ステップにおける部分積の生成法の実施例を
示す図、第５図は部分積レジスタの構成の実施例を示す図、第６図は部分積の格納方法の実施例を示す図、第７図ば
４進カウンタの内容の実施例を示す図、第８図ば４進カ
ウンクの内容の設定法の実施例を示す図である。１０−　・・Ｂｏｏｔｈのデコーダ、１１・・・乗数（ＭＰ）レジスタ、１２・・・被乗数レジスタ、ｔ３・・・インバータ、１４ａ、１４ｂ・・・１ビツトシフト回路、１５ａ〜１
５ｅ・・・セレクタ、１６・・・Ｗａｌｌａｃｅのトリー１７・・・桁上伝播加算回路（ＣＰＡ）、１８・・・部
分積レジスタ（ＰＤ−ｒｅｇ）。Fig. 1 is a block diagram of the principle of the present invention; Fig. 2 is a diagram showing an embodiment of the contents of the multiplier register after the first step; Fig. 3 is a block diagram showing the overall configuration of the embodiment of the multiplier; FIG. 4 is a diagram showing an example of a partial product generation method in each step, FIG. 5 is a diagram showing an example of the structure of a partial product register, and FIG. 6 is a diagram showing an example of a partial product storage method. FIG. 7 is a diagram showing an example of the contents of a quaternary counter, and FIG. 8 is a diagram showing an example of a method for setting the contents of a quaternary count. 10-... Booth decoder, 11... Multiplier (MP) register, 12... Multiplicand register, t3... Inverter, 14a, 14b... 1-bit shift circuit, 15a-1
5e... Selector, 16... Wallace tree 17... Carry propagation adder circuit (CPA), 18... Partial product register (PD-reg).

Claims

【特許請求の範囲】被乗数レジスタ（１）、乗数レジスタ（２）、および乗
算結果保持レジスタ（３）を有する乗算器（４）におい
て、乗算を複数のステップに分割するために乗数を複数の連
続するビット部分に区分し、該複数のビット部分のうち
どの部分と被乗数とを乗算しているかを示す乗算乗数部
分指示手段（５）を設け、前記被乗数レジスタ（１）内
の被乗数と前記乗数レジスタ（２）内の乗数の前記複数
のビット部分の１つとの積と、乗算結果保持レジスタ（
３）の内容のうち乗算結果として確定されていない部分
との和を該乗算乗数部分指示手段（５）の指示に従って
シフトした結果を該乗算結果保持レジスタ（３）に格納
するステップを前記乗数の複数のビット部分の下位ビッ
ト側から繰り返すことを特徴とする乗算器。[Claims] In a multiplier (4) having a multiplicand register (1), a multiplier register (2), and a multiplication result holding register (3), the multiplier is divided into a plurality of successive steps in order to divide the multiplication into a plurality of steps. a multiplier part indicating means (5) for indicating which part of the plurality of bit parts is multiplied by the multiplicand; The product of the multiplier in (2) with one of the plurality of bit parts and the multiplication result holding register (
The step of storing the result of shifting the sum of the contents of 3) with the part that is not determined as the multiplication result in accordance with the instruction of the multiplication multiplier part instruction means (5) in the multiplication result holding register (3) A multiplier characterized in that iterates from the lower bit side of a plurality of bit parts.