Optimized FPGA Architecture for Modular Reduction in NTT
| dc.contributor.author | Tosun, Tolun | |
| dc.contributor.author | Kirbiyik, Selim | |
| dc.contributor.author | Kocer, Emre | |
| dc.contributor.author | Alaybeyo?lu, Ersin | |
| dc.date.accessioned | 2026-02-22T11:44:03Z | |
| dc.date.created | 2026 | |
| dc.date.issued | 2026 | |
| dc.department | Bartın Üniversitesi | |
| dc.description | 6th International Workshop on Lightweight Cryptography for Security and Privacy, LightSec 2025 -- 2025-09-01 through 2025-09-02 -- Istanbul -- 347249 | |
| dc.description.abstract | In this paper, we present a comprehensive analysis of various modular multiplication methods for Number Theoretic Transform (NTT) on FPGA. NTT is a critical and time-intensive component of Fully Homomorphic Encryption (FHE) applications while modular multiplication consumes a significant portion of the design resources in an NTT implementation. We study the existing modular reduction approaches from the literature, and implement particular methods on FPGA. Specifically Word-Level Montgomery (WLM) for NTT friendly primes [20] and K2RED [4]. For improvements, we explore the trade-offs between the number of available primes in special forms and hardware cost of the reduction methods. We develop a DSP multiplication-optimized version of WLM, which we call WLM-Mixed. We also introduce a subclass of Proth primes, referred to as Proth-l primes, characterized by a low and fixed signed Hamming Weight. This special class of primes allows us to design multiplication-free shift-add versions of K2RED and naive Montgomery reduction [21], referred to as K2RED-Shift and Montgomery-Shift. We provide in-depth evaluations of these five reduction methods in an NTT architecture on FPGA. Our results indicate that WLM-Mixed is highly resource-efficient, utilizing only 3 DSP multiplications for 64-bit coefficient moduli. On the other hand, K2RED-Shift and Montgomery-Shift offer DSP-free alternatives, which can be beneficial in specific scenarios. © The Author(s), under exclusive license to Springer Nature Switzerland AG 2026. | |
| dc.identifier.doi | 10.1007/978-3-032-15541-2_7 | |
| dc.identifier.endpage | 137 | |
| dc.identifier.isbn | 9789819698936 | |
| dc.identifier.isbn | 9789819698042 | |
| dc.identifier.isbn | 9789819698110 | |
| dc.identifier.isbn | 9789819698905 | |
| dc.identifier.isbn | 9783032004949 | |
| dc.identifier.isbn | 9789819512324 | |
| dc.identifier.isbn | 9783032026019 | |
| dc.identifier.isbn | 9783032008909 | |
| dc.identifier.isbn | 9783031915802 | |
| dc.identifier.isbn | 9789819698141 | |
| dc.identifier.issn | 0302-9743 | |
| dc.identifier.scopus | 2-s2.0-105029428829 | |
| dc.identifier.scopusquality | Q3 | |
| dc.identifier.startpage | 117 | |
| dc.identifier.uri | https://doi.org/10.1007/978-3-032-15541-2_7 | |
| dc.identifier.uri | https://hdl.handle.net/11772/26905 | |
| dc.identifier.volume | 16216 LNCS | |
| dc.identifier.wos | WOS:001739145100007 | |
| dc.identifier.wosquality | N/A | |
| dc.indekslendigikaynak | Scopus | |
| dc.indekslendigikaynak | Web of Science | |
| dc.language.iso | en | |
| dc.publisher | Springer Science and Business Media Deutschland GmbH | |
| dc.relation.ispartof | Lecture Notes in Computer Science | |
| dc.relation.publicationcategory | Konferans Öğesi - Uluslararası - Kurum Öğretim Elemanı | |
| dc.rights | info:eu-repo/semantics/closedAccess | |
| dc.snmz | KA_Scopus_20260218 | |
| dc.subject | DSP | |
| dc.subject | FHE | |
| dc.subject | FPGA | |
| dc.subject | K2RED | |
| dc.subject | Modular Reduction | |
| dc.subject | Montgomery | |
| dc.subject | NTT | |
| dc.title | Optimized FPGA Architecture for Modular Reduction in NTT | |
| dc.type | Conference Object | |
| dspace.entity.type | Publication |










