A High-Speed Floating Point Matrix Multiplier Implemented in Reconfigurable Architecture
DOI:
https://doi.org/10.32628/CSEIT2390661Keywords:
Reconfigurable Processor, Matrix Multiplication, Addition, Multiplication, Multiplier-Accumulator CircuitAbstract
Matrix multiplication is a fundamental operation in computational applications across various domains. This paper introduces a novel reconfigurable co-processor that enhances the efficiency of matrix multiplication by concurrently executing addition and multiplication operations upon matrix elements of different sizes. The proposed design aims to reduce computation time and improve efficiency for matrix multiplication equations. Experimental evaluations were conducted on matrices of different sizes to demonstrate the effectiveness of the processor. The results reveal substantial improvements in both time and efficiency when compared to traditional approaches. The reconfigurable transformation processor harnesses parallel processing capabilities, enabling the simultaneous execution of addition and multiplication operations by partitioning input matrices into smaller submatrices and performing parallel computations, thus the processor achieves faster results. Additionally, the design incorporates configurable arithmetic units that dynamically adapt to matrix characteristics, further optimizing performance. The experimental evaluations provide evidence of reduction in computation time and improvement in efficiency. present significant benefits over traditional sequential methods. This makes this co-processor ideally fit for domains that require intensive linear algebra computations such as computer vision, machine learning, and signal processing.
Downloads
References
Advantages of the Virtex-5 FPGA 6-Input LUT Architecture (2007), Xilinx White Paper, https://docs.xilinx.com/v/u/en-US/wp284 accessed 22 October 2023.
Sanaei, A., Heidari, A. A., & Mohammadi, F. (2018). Design of the Motion Estimation Unit on ASIC and FPGA. IJCSI International Journal of Computer Science Issues, 15(1), 209-224.
Chen, C., Xu, J., Yao, W., & Zeng, Z. (2016). Design of a Reconfigurable FFT Processor with an Optimized Memory Hierarchy. Journal of Signal Processing Systems, 84(2), 209-224.
Bagherzadeh, N., & Rissa, T. (2003). Reconfigurable computing: A survey of systems and software. ACM Computing Surveys, 35(1), 1-28.
Jianwen, Luo & Jong, Ching. (2004). Partially reconfigurable matrix multiplication for area and time efficiency on FPGAs. 244- 248. 10.1109/DSD.2004.1333283. DOI: https://doi.org/10.1109/DSD.2004.1333283
Jang, Ju-wook & Choi, Seonil & Prasanna, V.K.K.. (2003). Area and time efficient implementations of matrix multiplication on FPGAs. 93 - 100. 10.1109/FPT.2002.1188669.
Hideharu Amano, “Principles and Structures of FPGA”, Springer Nature, ISBN: 978-981-13-0823-9, September 2018
Fei Wang, “An FPGA Architecture for Two-Dimensional Partial Reconfiguration”, LAP, ISBN: 978-3844328707, June 2011
Atri Sanyal, and Amitabha Sinha. "Trans_Proc: A Reconfigurable Processor to Implement the Linear Transformations." IJSI vol.10, no.1 2022: pp.1-16. DOI: https://doi.org/10.4018/IJSI.303575
Po-Chih Tseng et al, “Reconfigurable discrete cosine transform processor for object-based video signal processing”, in ISCAS '04. Proceedings of the 2004 International Symposium on Circuits and System, 2004.
Po-Chih Tseng, Chao-Tsung Huang, Liang-Gee Chen, “ Reconfigurable Discrete Wavelet Transform Processor for Heterogeneous Reconfigurable Multimedia Systems”, Journal of VLSI signal processing systems for signal, image and video technology, 2005 DOI: https://doi.org/10.1007/s11265-005-6249-z
K. Joe Hass David F. Cox , “ Transform Processing on a Reconfigurable Data Path Processor”, 7th NASA Symposium on VLSI Design 1998
Amitabha Sinha, Mitrava Sarkar, Soumojit Acharyya, Suranjan Chakraborty, “A Novel Reconfigurable Architecture of a DSP Processor for Efficient Mapping of DSP Functions using Field Programmable DSP Arrays”, ACM SIGARCH Computer Architecture News Vol. 41, No. 2, May 2013 DOI: https://doi.org/10.1145/2490302.2490304
P. S. Reddy, S. Mopuri and A. Acharyya, "A Reconfigurable High Speed Architecture Design for Discrete Hilbert Transform," in IEEE Signal Processing Letters, vol. 21, no. 11, pp. 1413-1417, Nov. 2014, doi: 10.1109/LSP.2014.2333745 DOI: https://doi.org/10.1109/LSP.2014.2333745
S. K. Samaddar, A. Sanyal, "A Combined Architecture for FDCT Algorithm," Proc. 2012 Third International Conference on Computer and Communication Technology, Allahabad, 2012, pp. 33-37,doi: 10.1109/ICCCT.2012.16 DOI: https://doi.org/10.1109/ICCCT.2012.16
Atri Sanyal, Swapan Kumar Samaddar, Amitabha Sinha , “ A Generalized Architecture for Linear Transform” , Proc.IEEE International Conference on CNC 2010, Oct 04-05, 2010 , Calicut,Kerala, India , IEEE Computer society , pp.55-60 ,ISBN: 97-0-7695-4209-6
Atri Sanyal, Amitabha Sinha, “A Reconfigurable Architecture to Implement Linear Transforms of Image Processing Applications”, International Conference on Frontiers in Computing and System (COMSYS 2020), Jalpaiguri, West Bengal, India, January 13-15,2020 DOI: https://doi.org/10.1007/978-981-15-7834-2_83
Davide Rossi , Fabio Campi , Simone Spolzino , Stefano Pucillo , Roberto Guerrieri, “A Heterogeneous Digital Signal Processor for Dynamically Reconfigurable Computing”, IEEE Journal of Solid-State Circuits ,Volume: 45, Issue: 8, Aug. 2010 DOI: https://doi.org/10.1109/JSSC.2010.2048149
Bin Zhang, Kuizhi Mei, Nanning Zheng, “Reconfigurable Processor for Binary Image Processing”, IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, VOL. 23, NO. 5, MAY 2013 DOI: https://doi.org/10.1109/TCSVT.2012.2223872