Authors :
Subhadip Ghanta; Sanket Shaw; Abdul Barish Khan; Debjyoti Sarkar; Soumik Das
Volume/Issue :
Volume 11 - 2026, Issue 3 - March
Google Scholar :
https://tinyurl.com/fa6eff2h
Scribd :
https://tinyurl.com/4f6tp945
DOI :
https://doi.org/10.38124/ijisrt/26mar690
Note : A published paper may take 4-5 working days from the publication date to appear in PlumX Metrics, Semantic Scholar, and ResearchGate.
Abstract :
Multiply-Accumulate units are basic components of digital signal processing systems as well as embedded
systems, where computational throughput as well as power constraints are of great importance. In this paper, a Power
Performance Area trade-off of Vedic Multiplier-based Multiply-Accumulate units with five different adder styles, namely
Ripple-Carry Adder, Carry-Skip Adder, Carry-Look-Ahead Adder, Brent-Kung Adder, and Kogge-Stone Adder, is
presented. In this paper, a novel Multiply-Accumulate architecture based on a hierarchical 16×16 Vedic Multiplier with a
32-bit accumulation path is presented, which has been implemented on a Zynq-7000 FPGA platform based on the Zed
Board platform. The architectures are tested with both 16-bit and 32-bit datapath configurations in terms of power
consumption, propagation delay, Power Delay Product, as well as Energy Delay Product, along with hardware utilization.
The experimental results show that the Kogge-Stone Adder-based Multiply Accumulate architecture has the least
propagation delay of 3.2 ns as well as 5.2 ns for 16-bit as well as 32-bit datapath configurations, respectively, achieving the
least PDP of 304 pJ as well as 728 pJ, respectively, compared to other adder styles. In contrast, the Ripple Carry Adder
consumes the least power of 40 mW as well as 83 mW for 16-bit and 32-bit datapath configurations, respectively, utilizing
the least hardware resources, but has the highest propagation delay of 12.5 ns and 21 ns, respectively. Among all
architectures, the Brent-Kung Adder offers the best trade-off among power, delay, and hardware utilization. The adderbased Multiply-Accumulate architectures are suitable for implementing various high-performance as well as powerefficient applications, including digital signal processing, image processing, and machine learning accelerators.
Keywords :
Multiply–Accumulate (MAC), Vedic Multiplier, Adder, Power–Performance–Area (PPA), Zynq-7000 FPGA.
References :
- Ananthakrishnan, A. Ajit, A. P.V., K. Haridas, N. M. Nambiar and D. S., "FPGA Based Performance Comparison of Different Basic Adder Topologies with Parallel Processing Adder," 2019 3rd International conference on Electronics, Communication and Aerospace Technology (ICECA), Coimbatore, India, 2019, pp. 87-92, doi: 10.1109/ICECA.2019.8821925.
- B. Harish, K. Sivani and M. S. S. Rukmini, "Design and Performance Comparison among Various types of Adder Topologies," 2019 3rd International Conference on Computing Methodologies and Communication (ICCMC), Erode, India, 2019, pp. 725-730, doi: 10.1109/ICCMC.2019.8819779.
- S. Gauhar, A. Sharif and N. Alam, "Comparison of Parallel Prefix Adders Based on FPGA & ASIC Implementations," 2020 IEEE Students Conference on Engineering & Systems (SCES), Prayagraj, India, 2020, pp. 1-6, doi: 10.1109/SCES50439.2020.9236737.
- J. Anirudh, A. R. S. Babu, K. M. Gopi, S. Karishma and S. Musala, "FPGA Based Low Power Approximate Hybrid Parallel Prefix Adders with Less Area," 2025 International Conference on Information, Implementation, and Innovation in Technology (I2ITCON), Pune, India, 2025, pp. 1-6, doi: 10.1109/I2ITCON65200.2025.11210724.
- R. Priya and J. S. Kumar, "Implementation and comparison of effective area efficient architectures for CSLA," 2013 IEEE International Conference ON Emerging Trends in Computing, Communication and Nanotechnology (ICECCN), Tirunelveli, India, 2013, pp. 287-292, doi: 10.1109/ICE-CCN.2013.6528510.
- R. Sahu and A. K. Subudhi, "An area optimized Carry Select Adder," 2015 IEEE Power, Communication and Information Technology Conference (PCITC), Bhubaneswar, India, 2015, pp. 589-594, doi: 10.1109/PCITC.2015.7438066.
- P. Balasubramanian and D. Maskell, "Hardware Efficient Approximate Adder Design," TENCON 2018 - 2018 IEEE Region 10 Conference, Jeju, Korea (South), 2018, pp. 0806-0810, doi: 10.1109/TENCON.2018.8650127.
- K. S and H. P. N, "Analysis of Low Power Approximate 128x128 bit Vedic Multiplier for Existing Real World FPGAs," 2024 5th IEEE Global Conference for Advancement in Technology (GCAT), Bangalore, India, 2024, pp. 1-6, doi: 10.1109/GCAT62922.2024.10924019.
- G. Thakur, H. Sohal and S. Jain, "Design and Analysis of High-Speed Parallel Prefix Adder for Digital Circuit Design Applications," 2020 International Conference on Computational Performance Evaluation (ComPE), Shillong, India, 2020, pp. 095-100, doi: 10.1109/ComPE49325.2020.9200064.
- B. R. S. Khater, M. A. M. Alshewimy and M. T. Faheem Saidahmed, "An analysis technique for improving delay factor of carry select adder using FPGA," 2017 12th International Conference on Computer Engineering and Systems (ICCES), Cairo, Egypt, 2017, pp. 14-18, doi: 10.1109/ICCES.2017.8275269.
- M S. N. Gadakh and A. Khade, "Design and optimization of 16×16 Bit multiplier using Vedic mathematics," 2016 International Conference on Automatic Control and Dynamic Optimization Techniques (ICACDOT), Pune, India, 2016, pp. 460-464, doi: 10.1109/ICACDOT.2016.7877628.
- Shing, L. & Hussin, Razaidi & Kamarudin, A. & Mohyar, Shaiful Nizam & Taking, Sanna & Aziz, Hafiz & Ahmad, Norhawati. (2018). 16×16 fast signed multiplier using Booth and Vedic architecture. AIP Conference Proceedings. 2045. 020085. 10.1063/1.5080898.
- A., S., A., S. Modified vedic multiplier architecture using Nikhilam and Karatsuba algorithms with hybrid adders for enhanced performance. Sci Rep 16, 1772 (2026). https://doi.org/10.1038/s41598-025-30966-7
- J. Kuppili, M. Abhiram and N. A. Manga, "Design of Vedic Mathematics based 16 bit MAC unit for Power and Delay Optimization," 2021 4th Biennial International Conference on Nascent Technologies in Engineering (ICNTE), NaviMumbai, India, 2021, pp. 1-4, doi: 10.1109/ICNTE51185.2021.9487570.
- Pradhan, Manoranjan & Panda, Rutuparna & Sahu, Sushanta. (2009). Speed Comparison of 16x16 Vedic Multipliers. International Journal of Computer Applications. 21. 10.5120/2516-3417.
- Gadakh, Sheetal & Khade, Amitkumar. (2016). Design and optimization of 16×16 Bit multiplier using Vedic mathematics. 460-464. 10.1109/ICACDOT.2016.7877628.
- S. Ravi, M. S. N. V. Mohith, K. Yaswanth Simha, L. Alekhya, and M. Maruthi Sriram, "A Novel High Performance Architecture for MAC Unit Using Vedic Multiplier and Brent-Kung Adder,"International Journal of Research and Scientific Innovation (IJRSI), vol. 12, no. 4, Apr. 2025, doi: 10.51244/IJRSI.2025.12040021.
- Gupta, V., & Kumar, M. (2018). Design of high speed 16x16 bit MAC units using Vedic multiplier. International Journal of Computer Applications, 182(17), 45–48. DOI: 10.5120/ijca2018917895. (ISSN: 0975–8887).
- Nitin Krishna, V. (2020). Performance analysis of MAC unit using Booth, Wallace tree, array and Vedic multipliers. International Journal of Engineering Research & Technology (IJERT), 9(09), 497–504. DOI: 10.17577/ijertv9is090337. (ISSN: 2278-0181; Paper ID: IJERTV9IS090337).
- Paldurai, K., Hariharan, K., Karthikeyan, G.C., & Lakshmanan, K. (2014). Implementation of MAC using area efficient and reduced delay Vedic multiplier targeted at FPGA architectures. In 2014 International Conference on Communication and Network Technologies (ICCNT) (pp. 238–242). IEEE. DOI: 10.1109/ICCNT.2014.7062793.
- Shinde and A. O. Mulani, "A Comprehensive Review on Improving 256x256 Vedic Multiplier Design Using Optimized Adder Architectures," International Journal of Advanced Research in Science, Communication and Technology (IJARSCT), vol. 5, no. 4, pp. 1210–1226, Oct. 2025. DOI: 10.48175/IJARSCT-29467.
- G. Rashmi, P. Sharmila Rani, and S. Nagi Reddy, "Majority Logic Implementation of MAC Unit," Journal of Emerging Technologies and Innovative Research (JETIR), vol. 5, no. 7, pp. 850–856, Jul. 2018. ISSN: 2349-5162. Paper ID: JETIR1807851. DOI: 10.1729/Journal.JETIR1807851.
- S. A. Ahmed and M. Salahuddin, "Design of High Speed Architecture of Parallel MAC Based On Radix-2 MBA," International Journal of Engineering Research and Applications (IJERA), vol. 4, no. 5 (Version 7), pp. 56–61, May 2014. ISSN: 2248-9622. DOI: 10.9790/9622-0405075661.
- K. B. Jagannatha, H. S. Lakshmisagar, and G. R. Bhaskar, "FPGA and ASIC Implementation of 16-Bit Vedic Multiplier Using Urdhva Triyakbhyam Sutra," in Emerging Research in Electronics, Computer Science and Technology, V. Sridhar et al. (eds.), Lecture Notes in Electrical Engineering, vol. 248, pp. 31–38. Springer India, 2014. DOI: 10.1007/978-81-322-1157-0_4
- M. Mulkalapally, J. Manning, P. Gatewood, and T. Nikoubin, "High Speed, Area and Power Efficient 32-bit Vedic Multipliers," in Proceedings of the 7th International Conference on Computing Communication and Networking Technologies (ICCCNT '16), Dallas, TX, USA, Jul. 6–8, 2016, pp. 1–8. DOI: 10.1145/2967878.2967890.
Multiply-Accumulate units are basic components of digital signal processing systems as well as embedded
systems, where computational throughput as well as power constraints are of great importance. In this paper, a Power
Performance Area trade-off of Vedic Multiplier-based Multiply-Accumulate units with five different adder styles, namely
Ripple-Carry Adder, Carry-Skip Adder, Carry-Look-Ahead Adder, Brent-Kung Adder, and Kogge-Stone Adder, is
presented. In this paper, a novel Multiply-Accumulate architecture based on a hierarchical 16×16 Vedic Multiplier with a
32-bit accumulation path is presented, which has been implemented on a Zynq-7000 FPGA platform based on the Zed
Board platform. The architectures are tested with both 16-bit and 32-bit datapath configurations in terms of power
consumption, propagation delay, Power Delay Product, as well as Energy Delay Product, along with hardware utilization.
The experimental results show that the Kogge-Stone Adder-based Multiply Accumulate architecture has the least
propagation delay of 3.2 ns as well as 5.2 ns for 16-bit as well as 32-bit datapath configurations, respectively, achieving the
least PDP of 304 pJ as well as 728 pJ, respectively, compared to other adder styles. In contrast, the Ripple Carry Adder
consumes the least power of 40 mW as well as 83 mW for 16-bit and 32-bit datapath configurations, respectively, utilizing
the least hardware resources, but has the highest propagation delay of 12.5 ns and 21 ns, respectively. Among all
architectures, the Brent-Kung Adder offers the best trade-off among power, delay, and hardware utilization. The adderbased Multiply-Accumulate architectures are suitable for implementing various high-performance as well as powerefficient applications, including digital signal processing, image processing, and machine learning accelerators.
Keywords :
Multiply–Accumulate (MAC), Vedic Multiplier, Adder, Power–Performance–Area (PPA), Zynq-7000 FPGA.