I don't understand how CPU specifications affect performance. I'm running an application that performs a modular calculation (DH key exchange) on Windows with the following parameters:
Modulus: a 4096-bit prime number
Generator: 2
Exponent: 256 bits
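For reference, the calculation amounts to a single modular exponentiation with these sizes. Below is a minimal timing sketch in Python, not my actual application. It assumes a random odd 4096-bit number as a stand-in for the prime (it is not checked for primality), and the names `modulus`, `exponent`, and `public_value` are made up for illustration:

```python
import secrets
import time

MODULUS_BITS = 4096
EXPONENT_BITS = 256
GENERATOR = 2

# Assumption: a random odd 4096-bit integer stands in for the prime modulus.
# It is NOT a verified prime; it only has the right size for timing purposes.
modulus = secrets.randbits(MODULUS_BITS) | (1 << (MODULUS_BITS - 1)) | 1

# Random 256-bit exponent with the top bit forced so it is exactly 256 bits long.
exponent = secrets.randbits(EXPONENT_BITS) | (1 << (EXPONENT_BITS - 1))

# Time one modular exponentiation: GENERATOR ** exponent mod modulus.
start = time.perf_counter()
public_value = pow(GENERATOR, exponent, modulus)
elapsed = time.perf_counter() - start

print(f"modulus bits: {modulus.bit_length()}, exponent bits: {exponent.bit_length()}")
print(f"modular exponentiation took {elapsed:.6f} s")
```

Running the same script under a 32-bit and a 64-bit interpreter on the same machine would be one way to compare the two builds while the amount of RAM stays fixed.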
When the application runs on 32-bit Windows 7 with a 2.4 GHz processor and 4 GB of RAM, it takes between 3 and 4 seconds. However, when I run the same application on 64-bit Windows 7 with the same processor speed and 8 GB of RAM, it takes between 1 and 2 seconds.
I'm trying to understand this, but I'm confused: is the speed of the modular calculation affected by the RAM size or by the CPU architecture (64-bit vs. 32-bit)?