CN116048455B - Insertion type approximate multiplication accumulator - Google Patents

Insertion type approximate multiplication accumulator Download PDF

Info

Publication number
CN116048455B
CN116048455B CN202310207613.1A CN202310207613A CN116048455B CN 116048455 B CN116048455 B CN 116048455B CN 202310207613 A CN202310207613 A CN 202310207613A CN 116048455 B CN116048455 B CN 116048455B
Authority
CN
China
Prior art keywords
partial product
approximate
circuit
bit
accurate
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN202310207613.1A
Other languages
Chinese (zh)
Other versions
CN116048455A (en
Inventor
刘伟强
尹培培
陈珂
王成华
夏伟杰
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Nanjing University of Aeronautics and Astronautics
Original Assignee
Nanjing University of Aeronautics and Astronautics
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Nanjing University of Aeronautics and Astronautics filed Critical Nanjing University of Aeronautics and Astronautics
Priority to CN202310207613.1A priority Critical patent/CN116048455B/en
Publication of CN116048455A publication Critical patent/CN116048455A/en
Application granted granted Critical
Publication of CN116048455B publication Critical patent/CN116048455B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F7/00Methods or arrangements for processing data by operating upon the order or content of the data handled
    • G06F7/38Methods or arrangements for performing computations using exclusively denominational number representation, e.g. using binary, ternary, decimal representation
    • G06F7/48Methods or arrangements for performing computations using exclusively denominational number representation, e.g. using binary, ternary, decimal representation using non-contact-making devices, e.g. tube, solid state device; using unspecified devices
    • G06F7/52Multiplying; Dividing
    • G06F7/523Multiplying only
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F7/00Methods or arrangements for processing data by operating upon the order or content of the data handled
    • G06F7/38Methods or arrangements for performing computations using exclusively denominational number representation, e.g. using binary, ternary, decimal representation
    • G06F7/48Methods or arrangements for performing computations using exclusively denominational number representation, e.g. using binary, ternary, decimal representation using non-contact-making devices, e.g. tube, solid state device; using unspecified devices
    • G06F7/50Adding; Subtracting
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02DCLIMATE CHANGE MITIGATION TECHNOLOGIES IN INFORMATION AND COMMUNICATION TECHNOLOGIES [ICT], I.E. INFORMATION AND COMMUNICATION TECHNOLOGIES AIMING AT THE REDUCTION OF THEIR OWN ENERGY USE
    • Y02D10/00Energy efficient computing, e.g. low power processors, power management or thermal management

Landscapes

  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Engineering & Computer Science (AREA)
  • Computational Mathematics (AREA)
  • Mathematical Analysis (AREA)
  • Pure & Applied Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • Mathematical Optimization (AREA)
  • Computing Systems (AREA)
  • General Engineering & Computer Science (AREA)
  • Compression, Expansion, Code Conversion, And Decoders (AREA)

Abstract

An insertion type approximate multiplication accumulator comprises an accurate partial product generating circuit, an approximate partial product generating circuit, a sign bit expanding circuit, an accurate partial product compressing circuit, an approximate partial product compressing circuit and a travelling wave carry adder circuit. The exact partial product generating circuit multiplies the high of the input operandMkBit division is used for coding, and an accurate partial product is generated; the approximate partial product generating circuit multiplies the low of the input operandk‑pBit division is used for coding, and an approximate partial product is generated; the sign bit expansion circuit expands the sign bit of the highest bit of the accurate partial product according to the bit width of the cumulative term and adds the highest bit of the cumulative termNkBit combining output to the accurate partial product compression circuit to approximate partial product sum low of the cumulative termk‑pBit merging and outputting to an approximate partial accumulating circuit; and finally, generating a multiplication accumulation structure by the travelling wave carry adder circuit. The invention can realize faster multiply-accumulate calculation and lower energy consumption by approximate calculation.

Description

Insertion type approximate multiplication accumulator
Technical Field
The invention relates to the technical field of approximate arithmetic circuit design, in particular to an insertion type approximate multiplication accumulator.
Background
With the advent of the big data age, artificial intelligence, machine learning, data mining, image recognition and the like have been increasingly applied, the complexity of algorithms and the processing capacity of data have been increased continuously, while such cognitive applications are similar to human perception, usually do not need unique or accurate results, and a sufficiently good result that can be accepted by users can meet the application requirements. The approximate calculation is used as a novel calculation paradigm for improving the processing energy efficiency of the integrated circuit, and the low power consumption and the high performance of the system are replaced at the cost of calculation precision, so that the method is an effective method for reducing the calculation complexity of the fault-tolerant application and improving the calculation speed.
The multiplication accumulator is a typical calculation in cognitive application, and the basic operation unit of calculation such as convolution, dot multiplication and the like is the multiplication accumulator. The approximate multiply-accumulator is less studied, most of which are the application of an approximate multiplier or an approximate adder to the multiply-accumulator. Paper An Approximate Multiply-Accumulate Unit with Low Power and Reduced Area published in IEEE Computer Society Annual Symposium on VLSI in 2019 discloses an approximate multiply-accumulate unit based on multiplication and addition combined approximation processing, but the method is applicable to unsigned multiply-accumulate units, such as extended to signed computation, requiring the input to be converted into an unsigned number in advance by adding a sign bit preprocessing unit, but this processing additionally increases system power consumption.
Disclosure of Invention
The present invention addresses the deficiencies in the prior art by providing an insertion type approximate multiply accumulator.
In order to achieve the above purpose, the present invention adopts the following technical scheme:
an inserted approximate multiplication accumulator is characterized in that the approximate multiplication accumulator comprises an accurate partial product generating circuit, an approximate partial product generating circuit, a sign bit expanding circuit, an accurate partial product compressing circuit, an approximate partial product compressing circuit and a traveling wave carry adder circuit;
the exact partial product generating circuit multiplies the high of the input operandM-kBits are encoded according to a group of three bits, and an accurate partial product is generated; wherein the input bit widths of the multiplication input operands are allMBinary representation of the multiplicand and multiplier, respectively, as
Figure SMS_1
and />
Figure SMS_2
mRepresenting the number of bits>
Figure SMS_3
The cumulative term has an input bit width ofN,/>
Figure SMS_4
kIs of approximate length and is a positive integer;
the approximate partial product generating circuit multiplies the low of the input operandk-pBits are encoded according to a group of three bits, and an approximate partial product is generated; wherein,pis a truncated length and is a positive integer;
the sign bit expansion circuit expands the sign bit of the highest bit of the accurate partial product according to the bit width of the cumulative term, and expands the high bit of the accurate partial product and the cumulative term after expansionN-kBit merging and outputting to an accurate partial product compression circuit; the sign bit expansion circuit reduces the approximate partial product and the accumulation termk-pBit merging and outputting to an approximate partial product compression circuit;
the accurate partial product compression circuit will be high in the accurate partial product and the cumulative termN-kBit compression into two rows, and output to the travelling wave carry adder;
the approximate partial product compression circuit reduces the approximate partial product and the sum termk-pBit compression into two rows, and output to the travelling wave carry adder;
generated by traveling wave carry adder circuit for accurate part of the compression circuit and approximate part of the compression circuitN-pThe partial product of two rows of bits is processed to produce a highN-pOutput result of bit multiplication accumulator, lowpThe bit direct output is 0.
In order to optimize the technical scheme, the specific measures adopted further comprise:
further, in the accurate partial product generating circuit, the output expression of the accurate partial product is:
Figure SMS_5
, in the formula ,/>
Figure SMS_6
Represent the firstiLine 1jPrecise partial product of the column,/->
Figure SMS_7
,/>
Figure SMS_8
,/>
Figure SMS_9
Represents the negation operation, + represents the OR operation, +.>
Figure SMS_10
Representing an exclusive or operation.
Further, in the approximate partial product generating circuit, the output expression of the approximate partial product is:
Figure SMS_11
, in the formula ,
Figure SMS_12
represent the firstiLine 1jAn approximate partial product of the columns.
Further, the sign bit expansion circuit expands the sign bit of the most significant bit of the accurate partial product according to the bit width of the cumulative term, and the calculation process comprises:
the exact partial product of each row is calculated according to the cumulative term bit widthNExpanding, the sign bit of the accurate partial product of the 1 st row is
Figure SMS_13
First, thenThe sign bit of the exact partial product of the row is +.>
Figure SMS_14
Accumulating sign bits of all accurate partial products to form an accumulated sum of sign bitss,/>
Figure SMS_15
,/>
Figure SMS_16
Representing the negation operation;
wherein, the expansion sign bit of the accurate partial product of the 1 st row is simplified toM+1、M+2 and M+3 for these 3 columns, corresponding to
Figure SMS_17
、/>
Figure SMS_18
And
Figure SMS_19
the expansion sign bit of the accurate partial product of the 2 nd row is simplified intoM+3 and M+4 for 2 columns, correspondingto +.>
Figure SMS_20
And 1, th->
Figure SMS_21
Line 2M-1 and 2MThe expansion sign bit of the column-accurate partial product is reduced to +.>
Figure SMS_22
And 1, 2M+1 toNThe column expansion sign bits are all corresponding to 1.
Further, the precision partial backlog circuit is comprised of a precision 4-2 compressor and a precision 3-2 compressor.
Further, the approximate partial product compression circuit is comprised of an approximate 4-2 compressor.
The beneficial effects of the invention are as follows: the invention discloses an approximate multiplication accumulator, which inserts accumulation items into the operation process of a multiplier, wherein the approximate multiplication accumulator is at low levelpBit direct output 0, middlek-pBit use approximation partial product generation circuit and approximation partial product compression circuit, highM-kThe bit uses the accurate partial product generating circuit, the high is passed through the sign bit expanding circuitM-kThe bit-accurate partial product is extended toN-kAfter the bits, the exact partial product compression circuit is used again. Compared with the prior approximate multiplication accumulator, the invention has faster calculation speed and lower power consumption and area. The multiplication accumulation circuit adopts a cutting-off and approximation means, thereby effectively reducing the complexity of the circuit. The sign bit expansion of the invention can effectively process signed accumulation items in advance, thereby accelerating the calculation speed.
Drawings
Fig. 1 is a block diagram of the structure of the approximate multiply accumulator of the present invention.
FIG. 2 is a schematic view ofM=8,N=20,p=3,kAn exemplary 8 x 8+20 bit input approximate multiply accumulator array schematic, where,
Figure SMS_25
representing the approximate partial product>
Figure SMS_26
Representing the exact partial product,/->
Figure SMS_29
Representing the sign bit compensation partial product,/->
Figure SMS_24
The term of accumulation is represented by a term of accumulation,
Figure SMS_28
representing truncated cumulative term,/->
Figure SMS_30
Representing the truncated partial product, & lt & gt>
Figure SMS_31
Representing the truncated sign bit compensation partial product, < >>
Figure SMS_23
Representing approximate compressor>
Figure SMS_27
Representing an accurate compressor.
Fig. 3 is a truth table for approximate partial product generation.
Fig. 4 is a gate level circuit diagram of approximate partial product generation.
Fig. 5 is a truth table for an approximation 4-2 compressor.
Fig. 6 is a gate level circuit diagram of an approximate 4-2 compressor.
Description of the embodiments
The invention will now be described in further detail with reference to the accompanying drawings.
Fig. 1 is a block diagram of the structure of an approximate multiply accumulator of the present invention, which includes an exact partial product generation circuit, an approximate partial product generation circuit, a sign bit expansion circuit, an exact partial product compression circuit, an approximate partial product compression circuit, and a traveling wave carry adder circuit.
Multiplicand with approximate multiply accumulatorAMultiplier and multiplierBThe input bit widths are allMBinary is expressed as
Figure SMS_32
And
Figure SMS_33
mrepresenting the number of bits>
Figure SMS_34
The cumulative term has an input bit width ofNMAndNare all positive integers, and ∈ ->
Figure SMS_35
The exact partial product generating circuit is a high that multiplies the input operands (multiplicand and multiplier)M-kBits are encoded in a three-bit set of partitions, generating an exact partial product,kis of approximate length and is a positive integer.
The approximate partial product generating circuit is to multiply the low of the input operandk-pThe bits are encoded in a set of three bits to generate an approximate partial product.
The sign bit expansion circuit expands the sign bit of the highest bit of the accurate partial product according to the bit width of the cumulative term, and expands the high bit of the accurate partial product and the cumulative term after expansionN-kThe bits are combined and output to the accurate partial product compression circuit. The sign bit expansion circuit also approximates partial product and cumulant lowk-pThe bits are combined and output to an approximate partial product compression circuit,pis a truncated length and is a positive integer.
The accurate partial product compression circuit compresses all accurate partial product and high-order accumulation items into two rows by adopting an accurate 4-2 compressor and an accurate 3-2 compressor, and outputs the two rows to the travelling wave carry adder.
The approximate partial product compression circuit compresses all the approximate partial product and low-order accumulation items into two rows by adopting an approximate 4-2 compressor, and outputs the two rows to the travelling wave carry adder.
The traveling wave carry adder generates the accurate part of the accumulation circuit and the approximate part of the accumulation circuitN-pThe partial product of two rows of bits is processed to produce a highN-pOutput result of bit multiplication accumulator, lowpThe bit is directly output as 0, and the two parts form the structure of the final multiplication accumulator.
The output expression of the accurate partial product generation circuit is:
Figure SMS_36
in the formula ,
Figure SMS_37
represent the firstiLine 1jPrecise partial product of the column,/->
Figure SMS_38
,/>
Figure SMS_39
,/>
Figure SMS_40
Represents the negation operation, + represents the OR operation, +.>
Figure SMS_41
Representing an exclusive or operation.
The output expression of the approximate partial product generation circuit is:
Figure SMS_42
, in the formula ,/>
Figure SMS_43
Represent the firstiLine 1jAn approximate partial product of the columns.
Fig. 3 is a truth table for approximate partial product generation, and fig. 4 is a gate-level circuit diagram for approximate partial product generation.
The sign bit expansion circuit expands the sign bit of the highest bit of the accurate partial product according to the bit width of the accumulation term, and the specific calculation process comprises the following steps:
the exact partial product of each row is calculated according to the cumulative term bit widthNExpansion is performed, wherein the sign bit of the first row of accurate partial product is
Figure SMS_44
First, thenThe sign bit of the exact partial product of the row is +.>
Figure SMS_45
Accumulating sign bits of all accurate partial products to form an accumulated sum of sign bitssSimplified->
Figure SMS_46
Wherein, the expansion sign bit of the accurate partial product of the 1 st row is simplified toM+1、M+2、M+3 for these 3 columns, corresponding to
Figure SMS_47
、/>
Figure SMS_48
And
Figure SMS_49
namely, the expansion sign bit of the 1 st row and the 9 th to 11 th columns in the figure 2; the expansion sign bit of the accurate partial product of the 2 nd row is simplified intoM+3、M+4 for 2 columns, correspondingto +.>
Figure SMS_50
And 1 to->
Figure SMS_51
The expansion sign bit simplification method of the accurate partial product of the rows is similar, namely, expansion sign bits of the 11 th and 12 th columns of the 2 nd row, the 13 rd and 14 th columns of the 3 rd row and the 15 th and 16 th columns of the 4 th row in the figure 2; at the same time at 2M+1 toNThe expansion sign bit is 1, namely the expansion sign bit of the 17 th to 20 th columns of the 4 th row in fig. 2.
The accurate partial backlog circuit is composed of an accurate 4-2 compressor and an accurate 3-2 compressor.
The approximate partial backlog circuit is composed of an approximate 4-2 compressor, and is approximate 4The input to the-2 compressor is
Figure SMS_52
、/>
Figure SMS_53
、/>
Figure SMS_54
Figure SMS_55
The 4 inputs are 4 approximate partial product bits of the same weight, and the output isSumAndCout:/>
Figure SMS_56
Figure SMS_57
. Fig. 5 is a truth table for an approximate 4-2 compressor, and fig. 6 is a gate level circuit diagram for an approximate 4-2 compressor.
Fig. 2 shows a partial product process of an approximate multiply-accumulator of an embodiment of the invention. Referring to fig. 2, the two inputs of the multiplicand and multiplier are passed through a partial product generating circuit to generate a partial product array as shown in the upper half of the first stage, where the solid circles represent the exact partial products, which are generated by the exact partial product generating circuit. The open circles represent approximate partial products, and are generated by an approximate partial product generating circuit. The concentric circles are truncated partial products. The diamonds represent the cumulative entries of the inputs that are inserted into the multiply-generated partial array and combined to form a new partial array for the next stage of processing. In the first stage, the dashed box represents an approximately 4-2 compressor, together forming an approximately partial backlog circuit. The solid line boxes represent precision compressors, which together form a precision partial backlog circuit. The accurate partial product compression circuit and the approximate partial product compression circuit compress all partial products into two rows through the second stage and the third stage step by step, then the traveling wave carry adder accumulates the partial products of the two rows, and the fourth stage is entered to generate a final multiplication accumulation result.
The above is only a preferred embodiment of the present invention, and the protection scope of the present invention is not limited to the above examples, and all technical solutions belonging to the concept of the present invention belong to the protection scope of the present invention. It should be noted that modifications and adaptations to the invention without departing from the principles thereof are intended to be within the scope of the invention as set forth in the following claims.

Claims (6)

1. An inserted approximate multiplication accumulator, characterized in that the approximate multiplication accumulator comprises an accurate partial product generating circuit, an approximate partial product generating circuit, a sign bit expanding circuit, an accurate partial product compressing circuit, an approximate partial product compressing circuit and a travelling wave carry adder circuit;
the precise partial product generation circuit multiplies high of the input operandsM-kBits are encoded according to a group of three bits, and an accurate partial product is generated; wherein the input bit widths of the multiplication input operands are allMBinary representation of the multiplicand and multiplier, respectively, as
Figure QLYQS_1
and />
Figure QLYQS_2
mRepresenting the number of bits>
Figure QLYQS_3
The cumulative term has an input bit width ofN,/>
Figure QLYQS_4
kIs of approximate length and is a positive integer;
the approximate partial product generation circuit multiplies the low of the input operandk-pBits are encoded according to a group of three bits, and an approximate partial product is generated; wherein,pis a truncated length and is a positive integer;
the sign bit expansion circuit expands the sign bit of the highest bit of the accurate partial product according to the bit width of the cumulative term, and expands the high bit of the accurate partial product and the cumulative term after expansionN-kBit combining output to accurate partial product compressionIn the circuit; the sign bit expansion circuit approximates partial product and cumulant lowk-pBit merging and outputting to an approximate partial product compression circuit;
the accurate partial product compression circuit will be accurate and accumulate high of the termN-kBit compression into two rows, and output to the travelling wave carry adder;
the approximate partial product compression circuit sums the approximate partial product with low sum termk-pBit compression into two rows, and output to the travelling wave carry adder;
the traveling wave carry adder circuit is used for generating a precise part of the accumulation circuit and an approximate part of the accumulation circuitN-pThe partial product of two rows of bits is processed to produce a highN-pOutput result of bit multiplication accumulator, lowpThe bit direct output is 0.
2. An inserted approximate multiply accumulator as claimed in claim 1, characterized in that: in the precise partial product generating circuit, the output expression of the precise partial product is as follows:
Figure QLYQS_5
, in the formula ,/>
Figure QLYQS_6
Represent the firstiLine 1jPrecise partial product of the column,/->
Figure QLYQS_7
,/>
Figure QLYQS_8
,/>
Figure QLYQS_9
Represents the negation operation, + represents the OR operation, +.>
Figure QLYQS_10
Representing an exclusive or operation.
3. As claimed inAn inserted approximate multiply accumulator as claimed in 1, characterized in that: in the approximate partial product generating circuit, an output expression of the approximate partial product is:
Figure QLYQS_11
, in the formula ,/>
Figure QLYQS_12
Represent the firstiLine 1jAn approximate partial product of the columns.
4. An inserted approximate multiply accumulator as claimed in claim 1, characterized in that: the sign bit expansion circuit expands the sign bit of the highest bit of the accurate partial product according to the bit width of the accumulation term, and the calculation process comprises the following steps:
the exact partial product of each row is calculated according to the cumulative term bit widthNExpanding, the sign bit of the accurate partial product of the 1 st row is
Figure QLYQS_13
First, thenThe sign bit of the exact partial product of the row is +.>
Figure QLYQS_14
Accumulating sign bits of all accurate partial products to form an accumulated sum of sign bitss
Figure QLYQS_15
,/>
Figure QLYQS_16
Representing the negation operation;
wherein, the expansion sign bit of the accurate partial product of the 1 st row is simplified toM+1、M+2 and M+3 for these 3 columns, corresponding to
Figure QLYQS_17
、/>
Figure QLYQS_18
and />
Figure QLYQS_19
The expansion sign bit of the accurate partial product of the 2 nd row is simplified intoM+3 and M+4 for 2 columns, correspondingto +.>
Figure QLYQS_20
And 1, th->
Figure QLYQS_21
Line 2M-1 and 2MThe expansion sign bit of the column-accurate partial product is reduced to +.>
Figure QLYQS_22
And 1, 2M+1 toNThe column expansion sign bits are all corresponding to 1.
5. An inserted approximate multiply accumulator as claimed in claim 1, characterized in that: the accurate partial product compression circuit is composed of an accurate 4-2 compressor and an accurate 3-2 compressor.
6. An inserted approximate multiply accumulator as claimed in claim 1, characterized in that: the approximate partial-compression circuit is comprised of an approximate 4-2 compressor.
CN202310207613.1A 2023-03-07 2023-03-07 Insertion type approximate multiplication accumulator Active CN116048455B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202310207613.1A CN116048455B (en) 2023-03-07 2023-03-07 Insertion type approximate multiplication accumulator

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202310207613.1A CN116048455B (en) 2023-03-07 2023-03-07 Insertion type approximate multiplication accumulator

Publications (2)

Publication Number Publication Date
CN116048455A CN116048455A (en) 2023-05-02
CN116048455B true CN116048455B (en) 2023-06-02

Family

ID=86127541

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202310207613.1A Active CN116048455B (en) 2023-03-07 2023-03-07 Insertion type approximate multiplication accumulator

Country Status (1)

Country Link
CN (1) CN116048455B (en)

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2001027742A1 (en) * 1999-10-08 2001-04-19 Sony Electronics, Inc. A partitioned multiplier
CN106897046A (en) * 2017-01-24 2017-06-27 青岛朗思信息科技有限公司 A kind of fixed-point multiply-accumulator
CN111488133A (en) * 2020-04-15 2020-08-04 电子科技大学 High-radix approximate Booth coding method and mixed-radix Booth coding approximate multiplier
CN111832719A (en) * 2020-07-28 2020-10-27 电子科技大学 Fixed point quantization convolution neural network accelerator calculation circuit
CN114647399A (en) * 2022-05-19 2022-06-21 南京航空航天大学 Low-energy-consumption high-precision approximate parallel fixed-width multiplication accumulation device
CN115407965A (en) * 2022-11-01 2022-11-29 南京航空航天大学 High-performance approximate divider based on Taylor expansion and error compensation method

Family Cites Families (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7043517B2 (en) * 2003-03-07 2006-05-09 Faraday Technology Corp. Multiply accumulator for two N bit multipliers and an M bit addend
US7958180B2 (en) * 2007-07-05 2011-06-07 International Business Machines Corporation Multiplier engine

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2001027742A1 (en) * 1999-10-08 2001-04-19 Sony Electronics, Inc. A partitioned multiplier
CN106897046A (en) * 2017-01-24 2017-06-27 青岛朗思信息科技有限公司 A kind of fixed-point multiply-accumulator
CN111488133A (en) * 2020-04-15 2020-08-04 电子科技大学 High-radix approximate Booth coding method and mixed-radix Booth coding approximate multiplier
CN111832719A (en) * 2020-07-28 2020-10-27 电子科技大学 Fixed point quantization convolution neural network accelerator calculation circuit
CN114647399A (en) * 2022-05-19 2022-06-21 南京航空航天大学 Low-energy-consumption high-precision approximate parallel fixed-width multiplication accumulation device
CN115407965A (en) * 2022-11-01 2022-11-29 南京航空航天大学 High-performance approximate divider based on Taylor expansion and error compensation method

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
54位×54位冗余二进制乘法器的研究与设计;王晓泾;《中国优秀硕士学位论文全文数据库》;I137-33 *
Design and Analysis of Approximate Redundant Binary Multipliers;Weiqiang Liu .etc;《IEEE Transaction on Computers》;第68卷(第6期);804-819 *

Also Published As

Publication number Publication date
CN116048455A (en) 2023-05-02

Similar Documents

Publication Publication Date Title
CN112540743B (en) Reconfigurable processor-oriented signed multiply accumulator and method
US11816448B2 (en) Compressing like-magnitude partial products in multiply accumulation
CN109542393A (en) A kind of approximation 4-2 compressor and approximate multiplier
CN111221499B (en) Approximate multiplier based on approximate 6-2 and 4-2 compressors and calculation method
Nojehdeh et al. Efficient hardware implementation of artificial neural networks using approximate multiply-accumulate blocks
CN114647399A (en) Low-energy-consumption high-precision approximate parallel fixed-width multiplication accumulation device
CN110825346B (en) Low logic complexity unsigned approximation multiplier
CN116205244B (en) Digital signal processing structure
CN116048455B (en) Insertion type approximate multiplication accumulator
Ahish et al. Design of high performance Multiply-Accumulate Computation unit
CN114115803B (en) Approximate floating-point multiplier based on partial product probability analysis
Murugeswari et al. An area efficient and low power multiplier using modified carry save adder for parallel multipliers
Cao et al. Efficient LUT-based FPGA accelerator design for universal quantized CNN inference
Nezhad et al. High-speed multiplier design using multi-operand multipliers
Jayasanthi Implementation of Power Efficient Multiply Accumulate Unit for DSP Applications
Pathak et al. Low power Dadda multiplier using approximate almost full adder and Majority logic based adder compressors
Sharanya et al. A MODIFIED PARTIAL PRODUCT GENERATOR FOR REDUNDANT BINARY MULTIPLIERS
Rathore et al. Implementation and Design of Xilinx based Booth multiplier
Bhongale et al. Review on Recent Advances in VLSI Multiplier
Gangwar et al. A Design Technique for Delay and Power Efficient Dadda-Multiplier
Mulagaleti et al. Low-Power and Area-Efficient M-Term Binary Polynomial Multiplier for Finite Field
JYOTHI et al. Implementation of Modified Booth Algorithm for Power Critical Applications
Wang et al. The Design of Multipliers Based on Radix-4 Booth Coding
Nagaraju et al. Low Power High Speed Multiplier and Accumulator Based on Radix-4 Booth’s Algorithm
Zahid et al. Energy-Efficient Approximate Booth Multipliers for Convolutional Neural Networks

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant