SG11201910024PA - Convolution acceleration and computing processing methods and apparatuses, electronic device, and storage medium - Google Patents
Convolution acceleration and computing processing methods and apparatuses, electronic device, and storage medium
- Publication number
- SG11201910024PA
- Authority
- SG
- Singapore
- Prior art keywords
- apparatuses
- electronic device
- storage medium
- convolution
- acceleration
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/10—Complex mathematical operations
- G06F17/15—Correlation function computation including computation of convolution operations
- G06F17/153—Multidimensional correlation or convolution
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/06—Physical realisation, i.e. hardware implementation of neural networks, neurons or parts of neurons
- G06N3/063—Physical realisation, i.e. hardware implementation of neural networks, neurons or parts of neurons using electronic means
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/10—Complex mathematical operations
- G06F17/16—Matrix or vector computation, e.g. matrix-matrix or matrix-vector multiplication, matrix factorization
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F9/00—Arrangements for program control, e.g. control units
- G06F9/06—Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
- G06F9/46—Multiprogramming arrangements
- G06F9/50—Allocation of resources, e.g. of the central processing unit [CPU]
- G06F9/5005—Allocation of resources, e.g. of the central processing unit [CPU] to service a request
- G06F9/5027—Allocation of resources, e.g. of the central processing unit [CPU] to service a request the resource being a machine, e.g. CPUs, Servers, Terminals
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F9/00—Arrangements for program control, e.g. control units
- G06F9/06—Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
- G06F9/46—Multiprogramming arrangements
- G06F9/54—Interprogram communication
- G06F9/544—Buffers; Shared memory; Pipes
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
- G06N3/045—Combinations of networks
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Theoretical Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Software Systems (AREA)
- Mathematical Physics (AREA)
- General Engineering & Computer Science (AREA)
- Data Mining & Analysis (AREA)
- Life Sciences & Earth Sciences (AREA)
- Biomedical Technology (AREA)
- Biophysics (AREA)
- Health & Medical Sciences (AREA)
- Computing Systems (AREA)
- Molecular Biology (AREA)
- General Health & Medical Sciences (AREA)
- Evolutionary Computation (AREA)
- Computational Linguistics (AREA)
- Artificial Intelligence (AREA)
- Mathematical Analysis (AREA)
- Computational Mathematics (AREA)
- Mathematical Optimization (AREA)
- Pure & Applied Mathematics (AREA)
- Neurology (AREA)
- Algebra (AREA)
- Databases & Information Systems (AREA)
- Complex Calculations (AREA)
- Memory System Of A Hierarchy Structure (AREA)
Abstract
Convolution acceleration and computing processing methods and apparatuses, an electronic device, and a storage medium. The acceleration method comprises: reading data to be processed of a preset size in a task to be processed from an off-chip memory of an accelerator in sequence by means of a FIFO port, and storing same in at least one input cache region of a first cache region in an on-chip memory of the accelerator; in response to the storage of the data to be processed in the input cache region, reading input window data to be processed from the at least one input cache region in sequence, performing convolution calculation on the input window data to be processed by means of convolution kernel data to obtain output window data, and storing the output window data in a third cache region; and storing the output window data in the third cache region in the off-chip memory in sequence by means of the FIFO port. The acceleration and computing processing methods and apparatuses, the electronic device, and the storage medium can avoid the cases in which the on-chip memory of the accelerator and the bandwidth are insufficient and the processor resources are limited, improve the efficiency of convolution processing, and are appropriate for hardware platforms such as FPGA and ASIC.
FIG. 1
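The data flow in the abstract can be illustrated with a small software sketch. This is not the patented implementation: it assumes a single-channel image, a square kernel, stride 1, no padding, and direct (untransformed) multiply-accumulate. The names `fifo_stream`, `convolve_streamed`, and `rows_per_read` are hypothetical. A Python list stands in for the off-chip memory, a generator stands in for the FIFO port, and a bounded `deque` stands in for the on-chip input region.

```python
from collections import deque


def fifo_stream(rows, rows_per_read):
    """Yield blocks of `rows_per_read` rows in order (stand-in for the FIFO port)."""
    for start in range(0, len(rows), rows_per_read):
        yield rows[start:start + rows_per_read]


def convolve_streamed(off_chip_in, kernel, rows_per_read=2):
    """Stream rows on chip, convolve each full window, write results back in order.

    Assumptions (not taken from the patent): single channel, k x k kernel,
    stride 1, no padding, CNN-style convolution (no kernel flip).
    """
    k = len(kernel)
    width = len(off_chip_in[0])
    input_buffer = deque(maxlen=k)  # on-chip input region: holds the last k rows
    off_chip_out = []               # stand-in for the result area in off-chip memory

    for block in fifo_stream(off_chip_in, rows_per_read):
        output_buffer = []          # on-chip output region for this block
        for row in block:
            input_buffer.append(row)
            if len(input_buffer) < k:
                continue            # not enough rows buffered for a full window
            out_row = []
            for x in range(width - k + 1):
                acc = 0
                for i in range(k):
                    for j in range(k):
                        acc += input_buffer[i][x + j] * kernel[i][j]
                out_row.append(acc)
            output_buffer.append(out_row)
        # Write the finished output rows back to off-chip memory in sequence.
        off_chip_out.extend(output_buffer)
    return off_chip_out


image = [[1, 2, 3, 4],
         [5, 6, 7, 8],
         [9, 10, 11, 12],
         [13, 14, 15, 16]]
ones = [[1, 1, 1], [1, 1, 1], [1, 1, 1]]
print(convolve_streamed(image, ones))  # [[54, 63], [90, 99]]
```

Because the input region only ever holds `k` rows, the on-chip footprint in this sketch is independent of the image height, which is the kind of saving the abstract attributes to streaming data through bounded cache regions.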
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201710300171.XA CN108229645B (en) | 2017-04-28 | 2017-04-28 | Convolution acceleration and calculation processing method and device, electronic equipment and storage medium |
PCT/CN2018/084948 WO2018196863A1 (en) | 2017-04-28 | 2018-04-27 | Convolution acceleration and calculation processing methods and apparatuses, electronic device and storage medium |
Publications (1)
Publication Number | Publication Date |
---|---|
SG11201910024PA (en) | 2019-11-28 |
Family
ID=62658071
Family Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
SG11201910024P SG11201910024PA (en) | 2017-04-28 | 2018-04-27 | Convolution acceleration and computing processing methods and apparatuses, electronic device, and storage medium |
Country Status (4)
Country | Link |
---|---|
US (1) | US11429852B2 (en) |
CN (1) | CN108229645B (en) |
SG (1) | SG11201910024PA (en) |
WO (1) | WO2018196863A1 (en) |
Families Citing this family (43)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2020019174A1 (en) * | 2018-07-24 | 2020-01-30 | 深圳市大疆创新科技有限公司 | Data access method, processor, computer system and movable device |
CN108984426B (en) * | 2018-08-03 | 2021-01-26 | 北京字节跳动网络技术有限公司 | Method and apparatus for processing data |
CN109165723B (en) * | 2018-08-03 | 2021-03-19 | 北京字节跳动网络技术有限公司 | Method and apparatus for processing data |
CN110826707B (en) * | 2018-08-10 | 2023-10-31 | 北京百度网讯科技有限公司 | Acceleration method and hardware accelerator applied to convolutional neural network |
CN109343826B (en) * | 2018-08-14 | 2021-07-13 | 西安交通大学 | Reconfigurable processor operation unit for deep learning |
CN110770763A (en) * | 2018-10-08 | 2020-02-07 | 深圳市大疆创新科技有限公司 | Data storage device, method, processor and removable equipment |
CN111124626A (en) * | 2018-11-01 | 2020-05-08 | 北京灵汐科技有限公司 | Many-core system and data processing method and processing device thereof |
CN111258653B (en) * | 2018-11-30 | 2022-05-24 | 上海寒武纪信息科技有限公司 | Atomic access and storage method, storage medium, computer equipment, device and system |
CN109558329A (en) * | 2018-12-10 | 2019-04-02 | 广东浪潮大数据研究有限公司 | Program detection method, apparatus, device, and readable storage medium |
CN109800867B (en) * | 2018-12-17 | 2020-09-29 | 北京理工大学 | Data calling method based on FPGA off-chip memory |
CN109816093B (en) * | 2018-12-17 | 2020-12-04 | 北京理工大学 | Single-path convolution implementation method |
CN109740732B (en) * | 2018-12-27 | 2021-05-11 | 深圳云天励飞技术有限公司 | Neural network processor, convolutional neural network data multiplexing method and related equipment |
CN109948784B (en) * | 2019-01-03 | 2023-04-18 | 重庆邮电大学 | Convolutional neural network accelerator circuit based on rapid filtering algorithm |
CN109919310B (en) * | 2019-01-15 | 2021-05-18 | 中国科学院信息工程研究所 | GPU memory optimization method and system for deep learning training task |
CN110032538B (en) * | 2019-03-06 | 2020-10-02 | 上海熠知电子科技有限公司 | Data reading system and method |
CN111832713B (en) * | 2019-04-19 | 2024-06-18 | 北京灵汐科技有限公司 | Parallel computing method and computing device based on line buffer Linebuffer |
CN111950718B (en) * | 2019-05-16 | 2021-12-07 | 北京知存科技有限公司 | Method for realizing progressive CNN operation by using storage and computation integrated chip |
CN110276444B (en) * | 2019-06-04 | 2021-05-07 | 北京清微智能科技有限公司 | Image processing method and device based on convolutional neural network |
CN110390384B (en) * | 2019-06-25 | 2021-07-06 | 东南大学 | Configurable general convolutional neural network accelerator |
CN112306555A (en) * | 2019-07-30 | 2021-02-02 | 北京百度网讯科技有限公司 | Method, apparatus, device and computer readable storage medium for extracting image data in multiple convolution windows in parallel |
CN110704019B (en) * | 2019-08-26 | 2020-12-22 | 深圳芯英科技有限公司 | Data buffer and data reading method |
CN110673786B (en) * | 2019-09-03 | 2020-11-10 | 浪潮电子信息产业股份有限公司 | Data caching method and device |
CN110766133B (en) * | 2019-09-18 | 2020-12-25 | 开放智能机器(上海)有限公司 | Data processing method, device, equipment and storage medium in embedded equipment |
US11423644B1 (en) * | 2019-09-19 | 2022-08-23 | Ambarella International Lp | Hardware efficient RoI align |
CN112313673A (en) * | 2019-11-15 | 2021-02-02 | 深圳市大疆创新科技有限公司 | Region-of-interest-pooling layer calculation method and device, and neural network system |
CN110991609B (en) * | 2019-11-27 | 2023-12-26 | 天津大学 | Line buffer for data transmission |
US11372644B2 (en) * | 2019-12-09 | 2022-06-28 | Meta Platforms, Inc. | Matrix processing instruction with optional up/down sampling of matrix |
CN113052291B (en) * | 2019-12-27 | 2024-04-16 | 上海商汤智能科技有限公司 | Data processing method and device |
CN111176582A (en) * | 2019-12-31 | 2020-05-19 | 北京百度网讯科技有限公司 | Matrix storage method, matrix access device and electronic equipment |
US20210357748A1 (en) * | 2020-05-14 | 2021-11-18 | Samsung Electronics Co., Ltd. | Hierarchical weight preprocessing for neural network accelerator |
CN111899147B (en) * | 2020-06-16 | 2022-08-09 | 北京大学 | Convolution kernel calculation accelerator and convolution kernel calculation method |
CN112396165B (en) * | 2020-11-30 | 2024-06-11 | 珠海零边界集成电路有限公司 | Computing device and method for convolutional neural network |
US20220223201A1 (en) * | 2021-01-11 | 2022-07-14 | Micron Technology, Inc. | Caching Techniques for Deep Learning Accelerator |
CN112862091B (en) * | 2021-01-26 | 2022-09-27 | 合肥工业大学 | Resource multiplexing type neural network hardware accelerating circuit based on quick convolution |
CN112966729B (en) * | 2021-02-26 | 2023-01-31 | 成都商汤科技有限公司 | Data processing method and device, computer equipment and storage medium |
CN112949845B (en) * | 2021-03-08 | 2022-08-09 | 内蒙古大学 | Deep convolutional neural network accelerator based on FPGA |
CN112686377B (en) | 2021-03-18 | 2021-07-02 | 北京地平线机器人技术研发有限公司 | Method and device for carrying out deconvolution processing on feature data by utilizing convolution hardware |
CN113255898B (en) * | 2021-06-16 | 2022-08-02 | 合肥工业大学 | Convolution neural network hardware accelerator based on Winograd algorithm and calculation method |
CN113516235A (en) * | 2021-07-13 | 2021-10-19 | 南京大学 | Deformable convolution accelerator and deformable convolution acceleration method |
CN113448624B (en) * | 2021-07-15 | 2023-06-27 | 安徽聆思智能科技有限公司 | Data access method, device, system and AI accelerator |
CN113642724B (en) * | 2021-08-11 | 2023-08-01 | 西安微电子技术研究所 | CNN accelerator for high bandwidth storage |
CN114758209B (en) * | 2022-06-14 | 2022-09-02 | 深圳思谋信息科技有限公司 | Convolution result obtaining method and device, computer equipment and storage medium |
CN116861149B (en) * | 2023-09-05 | 2024-01-09 | 之江实验室 | Convolution operation optimization method, device and processor |
Family Cites Families (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US8363535B2 (en) * | 2003-04-28 | 2013-01-29 | Marvell International Ltd. | Frequency domain echo and next cancellation |
CN102339386B (en) * | 2011-08-16 | 2013-08-28 | 杭州晟元芯片技术有限公司 | Method for quickening extraction of embedded fingerprint features |
CN105869117B (en) * | 2016-03-28 | 2021-04-02 | 上海交通大学 | GPU acceleration method for deep learning super-resolution technology |
CN105956660A (en) * | 2016-05-16 | 2016-09-21 | 浪潮集团有限公司 | Neural network chip realization method used for real-time image identification |
CN106228240B (en) * | 2016-07-30 | 2020-09-01 | 复旦大学 | Deep convolution neural network implementation method based on FPGA |
US10891538B2 (en) * | 2016-08-11 | 2021-01-12 | Nvidia Corporation | Sparse convolutional neural network accelerator |
CN106547885B (en) * | 2016-10-27 | 2020-04-10 | 桂林电子科技大学 | Text classification system and method |
2017
- 2017-04-28: CN application CN201710300171.XA, patent CN108229645B (status: Active)
2018
- 2018-04-27: WO application PCT/CN2018/084948, patent WO2018196863A1 (status: Application Filing)
- 2018-04-27: SG application SG11201910024P, patent SG11201910024PA (status: unknown)
2019
- 2019-10-24: US application US16/662,493, patent US11429852B2 (status: Active)
Also Published As
Publication number | Publication date |
---|---|
US11429852B2 (en) | 2022-08-30 |
US20200057938A1 (en) | 2020-02-20 |
CN108229645B (en) | 2021-08-06 |
CN108229645A (en) | 2018-06-29 |
WO2018196863A1 (en) | 2018-11-01 |
Similar Documents
Publication | Title | Publication Date |
---|---|---|
SG11201910024PA (en) | Convolution acceleration and computing processing methods and apparatuses, electronic device, and storage medium | |
MX2023010407A (en) | Accelerated mathematical engine. | |
MX2023007718A (en) | Encrypted userdata transit and storage. | |
FI3539059T3 (en) | Performing kernel striding in hardware | |
MY190550A (en) | Method and device for processing service request | |
WO2015187771A3 (en) | Apparatuses and methods for performing an exclusive or operation using sensing circuitry | |
MY195952A (en) | Blockchain-Based Data Processing Method and Device | |
KR20180084289A (en) | Compressed neural network system using sparse parameter and design method thereof | |
EP4242924A3 (en) | Low-power ambient computing system with machine learning | |
TWI796286B (en) | A training method and training system for a machine learning system | |
GB2553994A (en) | Modeling personal entities | |
SG11201809890PA (en) | Method of training random forest model, electronic device and storage medium | |
MX2017011793A (en) | Detecting segments of a video program. | |
JP2018512607A5 (en) | ||
EP3979171A4 (en) | Payment processing method, electronic device and computer readable storage medium | |
EP3101598A3 (en) | Augmented neural networks | |
RU2017105059A (en) | IMAGE PROCESSING DEVICE, IMAGE PROCESSING METHOD, RECORDING MEDIA AND PROGRAM | |
EP2784552A3 (en) | Method and device for attenuating random noise in seismic data | |
JP2016515260A5 (en) | ||
EP2979948A3 (en) | Control system and semiconductor device | |
JP2014511526A5 (en) | ||
GB2562897A (en) | Lossless data compression | |
EP4343569A3 (en) | System for data aggregation and analysis of data from a plurality of data sources | |
US20220253668A1 (en) | Data processing method and device, storage medium and electronic device | |
EP3208676A3 (en) | Input/output (i/o) binding with automatic international electromechanical commission (iec) address generation in remote terminal unit (rtu) configuration |