When testing NEON instructions on ipad1 and ipad2, I notice allmost no speed up in the code on ipad2, where most functions actually run much faster on ipad2 than on ipad1.
This is for instructions like VMUL, VLD1, VADD and VSUB etc using quad word registers like q0 for things like FFT.
In addition I notice that apples own FFT function vdsp_fft_zrip does not speed up for ipad2 either.
So the question is, does ipad2 NEON execute faster than ipad1 NEON engine for the quad word SIMD type instructions?