Systemlevel ppa tradeoff analysis of adderbased mac architectures using vedic multipliers on zynq7000 fpga| International Journal of Innovative Science and Research Technology

System-Level PPA Trade-Off Analysis of Adder-Based MAC Architectures Using Vedic Multipliers on Zynq-7000 FPGA

Authors : Subhadip Ghanta; Sanket Shaw; Abdul Barish Khan; Debjyoti Sarkar; Soumik Das

Volume/Issue : Volume 11 - 2026, Issue 3 - March

Google Scholar : https://tinyurl.com/fa6eff2h

Scribd : https://tinyurl.com/4f6tp945

DOI : https://doi.org/10.38124/ijisrt/26mar690

PlumX Metrics

Semantic Scholar

ResearchGate

Note : A published paper may take 4-5 working days from the publication date to appear in PlumX Metrics, Semantic Scholar, and ResearchGate.

Abstract : Multiply-Accumulate units are basic components of digital signal processing systems as well as embedded systems, where computational throughput as well as power constraints are of great importance. In this paper, a Power Performance Area trade-off of Vedic Multiplier-based Multiply-Accumulate units with five different adder styles, namely Ripple-Carry Adder, Carry-Skip Adder, Carry-Look-Ahead Adder, Brent-Kung Adder, and Kogge-Stone Adder, is presented. In this paper, a novel Multiply-Accumulate architecture based on a hierarchical 16×16 Vedic Multiplier with a 32-bit accumulation path is presented, which has been implemented on a Zynq-7000 FPGA platform based on the Zed Board platform. The architectures are tested with both 16-bit and 32-bit datapath configurations in terms of power consumption, propagation delay, Power Delay Product, as well as Energy Delay Product, along with hardware utilization. The experimental results show that the Kogge-Stone Adder-based Multiply Accumulate architecture has the least propagation delay of 3.2 ns as well as 5.2 ns for 16-bit as well as 32-bit datapath configurations, respectively, achieving the least PDP of 304 pJ as well as 728 pJ, respectively, compared to other adder styles. In contrast, the Ripple Carry Adder consumes the least power of 40 mW as well as 83 mW for 16-bit and 32-bit datapath configurations, respectively, utilizing the least hardware resources, but has the highest propagation delay of 12.5 ns and 21 ns, respectively. Among all architectures, the Brent-Kung Adder offers the best trade-off among power, delay, and hardware utilization. The adderbased Multiply-Accumulate architectures are suitable for implementing various high-performance as well as powerefficient applications, including digital signal processing, image processing, and machine learning accelerators.

Keywords : Multiply–Accumulate (MAC), Vedic Multiplier, Adder, Power–Performance–Area (PPA), Zynq-7000 FPGA.

References :

Ananthakrishnan, A. Ajit, A. P.V., K. Haridas, N. M. Nambiar and D. S., "FPGA Based Performance Comparison of Different Basic Adder Topologies with Parallel Processing Adder," 2019 3rd International conference on Electronics, Communication and Aerospace Technology (ICECA), Coimbatore, India, 2019, pp. 87-92, doi: 10.1109/ICECA.2019.8821925.
B. Harish, K. Sivani and M. S. S. Rukmini, "Design and Performance Comparison among Various types of Adder Topologies," 2019 3rd International Conference on Computing Methodologies and Communication (ICCMC), Erode, India, 2019, pp. 725-730, doi: 10.1109/ICCMC.2019.8819779.
S. Gauhar, A. Sharif and N. Alam, "Comparison of Parallel Prefix Adders Based on FPGA & ASIC Implementations," 2020 IEEE Students Conference on Engineering & Systems (SCES), Prayagraj, India, 2020, pp. 1-6, doi: 10.1109/SCES50439.2020.9236737.
J. Anirudh, A. R. S. Babu, K. M. Gopi, S. Karishma and S. Musala, "FPGA Based Low Power Approximate Hybrid Parallel Prefix Adders with Less Area," 2025 International Conference on Information, Implementation, and Innovation in Technology (I2ITCON), Pune, India, 2025, pp. 1-6, doi: 10.1109/I2ITCON65200.2025.11210724.
R. Priya and J. S. Kumar, "Implementation and comparison of effective area efficient architectures for CSLA," 2013 IEEE International Conference ON Emerging Trends in Computing, Communication and Nanotechnology (ICECCN), Tirunelveli, India, 2013, pp. 287-292, doi: 10.1109/ICE-CCN.2013.6528510.
R. Sahu and A. K. Subudhi, "An area optimized Carry Select Adder," 2015 IEEE Power, Communication and Information Technology Conference (PCITC), Bhubaneswar, India, 2015, pp. 589-594, doi: 10.1109/PCITC.2015.7438066.
P. Balasubramanian and D. Maskell, "Hardware Efficient Approximate Adder Design," TENCON 2018 - 2018 IEEE Region 10 Conference, Jeju, Korea (South), 2018, pp. 0806-0810, doi: 10.1109/TENCON.2018.8650127.
K. S and H. P. N, "Analysis of Low Power Approximate 128x128 bit Vedic Multiplier for Existing Real World FPGAs," 2024 5th IEEE Global Conference for Advancement in Technology (GCAT), Bangalore, India, 2024, pp. 1-6, doi: 10.1109/GCAT62922.2024.10924019.
G. Thakur, H. Sohal and S. Jain, "Design and Analysis of High-Speed Parallel Prefix Adder for Digital Circuit Design Applications," 2020 International Conference on Computational Performance Evaluation (ComPE), Shillong, India, 2020, pp. 095-100, doi: 10.1109/ComPE49325.2020.9200064.
B. R. S. Khater, M. A. M. Alshewimy and M. T. Faheem Saidahmed, "An analysis technique for improving delay factor of carry select adder using FPGA," 2017 12th International Conference on Computer Engineering and Systems (ICCES), Cairo, Egypt, 2017, pp. 14-18, doi: 10.1109/ICCES.2017.8275269.
M S. N. Gadakh and A. Khade, "Design and optimization of 16×16 Bit multiplier using Vedic mathematics," 2016 International Conference on Automatic Control and Dynamic Optimization Techniques (ICACDOT), Pune, India, 2016, pp. 460-464, doi: 10.1109/ICACDOT.2016.7877628.
Shing, L. & Hussin, Razaidi & Kamarudin, A. & Mohyar, Shaiful Nizam & Taking, Sanna & Aziz, Hafiz & Ahmad, Norhawati. (2018). 16×16 fast signed multiplier using Booth and Vedic architecture. AIP Conference Proceedings. 2045. 020085. 10.1063/1.5080898.
A., S., A., S. Modified vedic multiplier architecture using Nikhilam and Karatsuba algorithms with hybrid adders for enhanced performance. Sci Rep 16, 1772 (2026). https://doi.org/10.1038/s41598-025-30966-7
J. Kuppili, M. Abhiram and N. A. Manga, "Design of Vedic Mathematics based 16 bit MAC unit for Power and Delay Optimization," 2021 4th Biennial International Conference on Nascent Technologies in Engineering (ICNTE), NaviMumbai, India, 2021, pp. 1-4, doi: 10.1109/ICNTE51185.2021.9487570.
Pradhan, Manoranjan & Panda, Rutuparna & Sahu, Sushanta. (2009). Speed Comparison of 16x16 Vedic Multipliers. International Journal of Computer Applications. 21. 10.5120/2516-3417.
Gadakh, Sheetal & Khade, Amitkumar. (2016). Design and optimization of 16×16 Bit multiplier using Vedic mathematics. 460-464. 10.1109/ICACDOT.2016.7877628.
S. Ravi, M. S. N. V. Mohith, K. Yaswanth Simha, L. Alekhya, and M. Maruthi Sriram, "A Novel High Performance Architecture for MAC Unit Using Vedic Multiplier and Brent-Kung Adder,"International Journal of Research and Scientific Innovation (IJRSI), vol. 12, no. 4, Apr. 2025, doi: 10.51244/IJRSI.2025.12040021.
Gupta, V., & Kumar, M. (2018). Design of high speed 16x16 bit MAC units using Vedic multiplier. International Journal of Computer Applications, 182(17), 45–48. DOI: 10.5120/ijca2018917895. (ISSN: 0975–8887).
Nitin Krishna, V. (2020). Performance analysis of MAC unit using Booth, Wallace tree, array and Vedic multipliers. International Journal of Engineering Research & Technology (IJERT), 9(09), 497–504. DOI: 10.17577/ijertv9is090337. (ISSN: 2278-0181; Paper ID: IJERTV9IS090337).
Paldurai, K., Hariharan, K., Karthikeyan, G.C., & Lakshmanan, K. (2014). Implementation of MAC using area efficient and reduced delay Vedic multiplier targeted at FPGA architectures. In 2014 International Conference on Communication and Network Technologies (ICCNT) (pp. 238–242). IEEE. DOI: 10.1109/ICCNT.2014.7062793.
Shinde and A. O. Mulani, "A Comprehensive Review on Improving 256x256 Vedic Multiplier Design Using Optimized Adder Architectures," International Journal of Advanced Research in Science, Communication and Technology (IJARSCT), vol. 5, no. 4, pp. 1210–1226, Oct. 2025. DOI: 10.48175/IJARSCT-29467.
G. Rashmi, P. Sharmila Rani, and S. Nagi Reddy, "Majority Logic Implementation of MAC Unit," Journal of Emerging Technologies and Innovative Research (JETIR), vol. 5, no. 7, pp. 850–856, Jul. 2018. ISSN: 2349-5162. Paper ID: JETIR1807851. DOI: 10.1729/Journal.JETIR1807851.
S. A. Ahmed and M. Salahuddin, "Design of High Speed Architecture of Parallel MAC Based On Radix-2 MBA," International Journal of Engineering Research and Applications (IJERA), vol. 4, no. 5 (Version 7), pp. 56–61, May 2014. ISSN: 2248-9622. DOI: 10.9790/9622-0405075661.
K. B. Jagannatha, H. S. Lakshmisagar, and G. R. Bhaskar, "FPGA and ASIC Implementation of 16-Bit Vedic Multiplier Using Urdhva Triyakbhyam Sutra," in Emerging Research in Electronics, Computer Science and Technology, V. Sridhar et al. (eds.), Lecture Notes in Electrical Engineering, vol. 248, pp. 31–38. Springer India, 2014. DOI: 10.1007/978-81-322-1157-0_4
M. Mulkalapally, J. Manning, P. Gatewood, and T. Nikoubin, "High Speed, Area and Power Efficient 32-bit Vedic Multipliers," in Proceedings of the 7th International Conference on Computing Communication and Networking Technologies (ICCCNT '16), Dallas, TX, USA, Jul. 6–8, 2016, pp. 1–8. DOI: 10.1145/2967878.2967890.

Multiply-Accumulate units are basic components of digital signal processing systems as well as embedded systems, where computational throughput as well as power constraints are of great importance. In this paper, a Power Performance Area trade-off of Vedic Multiplier-based Multiply-Accumulate units with five different adder styles, namely Ripple-Carry Adder, Carry-Skip Adder, Carry-Look-Ahead Adder, Brent-Kung Adder, and Kogge-Stone Adder, is presented. In this paper, a novel Multiply-Accumulate architecture based on a hierarchical 16×16 Vedic Multiplier with a 32-bit accumulation path is presented, which has been implemented on a Zynq-7000 FPGA platform based on the Zed Board platform. The architectures are tested with both 16-bit and 32-bit datapath configurations in terms of power consumption, propagation delay, Power Delay Product, as well as Energy Delay Product, along with hardware utilization. The experimental results show that the Kogge-Stone Adder-based Multiply Accumulate architecture has the least propagation delay of 3.2 ns as well as 5.2 ns for 16-bit as well as 32-bit datapath configurations, respectively, achieving the least PDP of 304 pJ as well as 728 pJ, respectively, compared to other adder styles. In contrast, the Ripple Carry Adder consumes the least power of 40 mW as well as 83 mW for 16-bit and 32-bit datapath configurations, respectively, utilizing the least hardware resources, but has the highest propagation delay of 12.5 ns and 21 ns, respectively. Among all architectures, the Brent-Kung Adder offers the best trade-off among power, delay, and hardware utilization. The adderbased Multiply-Accumulate architectures are suitable for implementing various high-performance as well as powerefficient applications, including digital signal processing, image processing, and machine learning accelerators.

Keywords : Multiply–Accumulate (MAC), Vedic Multiplier, Adder, Power–Performance–Area (PPA), Zynq-7000 FPGA.

Paper Submission Last Date
31 - May - 2026

SUBMIT YOUR PAPER CALL FOR PAPERS

Video Explanation for Published paper

Never miss an update from Papermashup

Get notified about the latest tutorials and downloads.

Subscribe by Email

Get alerts directly into your inbox after each post and stay updated.

Subscribe by RSS

Add our RSS to your feedreader to get regular updates from us.