Mlucas 17.1 http://hogranch.com/mayer/README.html INFO: testing qfloat routines... CPU Family = ARM Embedded ABI, OS = Linux, 32-bit Version, compiled with Gnu C [or other compatible], Version 6.3.0 20170516. INFO: Using inline-macro form of MUL_LOHI64. INFO: MLUCAS_PATH is set to "" INFO: using 53-bit-significand form of floating-double rounding constant for scalar-mode DNINT emulation. Setting DAT_BITS = 10, PAD_BITS = 2 INFO: testing IMUL routines... INFO: System has 4 available processor cores. INFO: testing FFT radix tables... Set affinity for the following 4 cores: 0.1.2.3. Mlucas selftest running..... /****************************************************************************/ NTHREADS = 4 M20000047: using FFT length 1024K = 1048576 8-byte floats. this gives an average 19.073531150817871 bits per digit Using complex FFT radices 1024 16 32 mers_mod_square: Init threadpool of 4 threads Using 4 threads in carry step 100 iterations of M20000047 with FFT length 1048576 = 1024 K Res64: DD61B3E031F1E0BA. AvgMaxErr = 0.237723214. MaxErr = 0.312500000. Program: E17.1 Res mod 2^36 = 837935290 Res mod 2^35 - 1 = 6238131189 Res mod 2^36 - 1 = 41735145962 Clocks = 00:00:45.473 NTHREADS = 4 M20000047: using FFT length 1024K = 1048576 8-byte floats. this gives an average 19.073531150817871 bits per digit Using complex FFT radices 1024 32 16 mers_mod_square: Init threadpool of 4 threads 100 iterations of M20000047 with FFT length 1048576 = 1024 K Res64: DD61B3E031F1E0BA. AvgMaxErr = 0.230580357. MaxErr = 0.281250000. Program: E17.1 Res mod 2^36 = 837935290 Res mod 2^35 - 1 = 6238131189 Res mod 2^36 - 1 = 41735145962 Clocks = 00:00:43.936 NTHREADS = 4 M20000047: using FFT length 1024K = 1048576 8-byte floats. this gives an average 19.073531150817871 bits per digit Using complex FFT radices 256 8 16 16 mers_mod_square: Init threadpool of 4 threads Using 4 threads in carry step 100 iterations of M20000047 with FFT length 1048576 = 1024 K Res64: DD61B3E031F1E0BA. AvgMaxErr = 0.240178571. MaxErr = 0.312500000. Program: E17.1 Res mod 2^36 = 837935290 Res mod 2^35 - 1 = 6238131189 Res mod 2^36 - 1 = 41735145962 Clocks = 00:00:33.576 NTHREADS = 4 M20000047: using FFT length 1024K = 1048576 8-byte floats. this gives an average 19.073531150817871 bits per digit Using complex FFT radices 128 16 16 16 mers_mod_square: Init threadpool of 4 threads Using 4 threads in carry step 100 iterations of M20000047 with FFT length 1048576 = 1024 K Res64: DD61B3E031F1E0BA. AvgMaxErr = 0.237388393. MaxErr = 0.281250000. Program: E17.1 Res mod 2^36 = 837935290 Res mod 2^35 - 1 = 6238131189 Res mod 2^36 - 1 = 41735145962 Clocks = 00:00:33.082 NTHREADS = 4 M20000047: using FFT length 1024K = 1048576 8-byte floats. this gives an average 19.073531150817871 bits per digit Using complex FFT radices 64 16 16 32 mers_mod_square: Init threadpool of 4 threads Using 4 threads in carry step 100 iterations of M20000047 with FFT length 1048576 = 1024 K Res64: DD61B3E031F1E0BA. AvgMaxErr = 0.230580357. MaxErr = 0.250000000. Program: E17.1 Res mod 2^36 = 837935290 Res mod 2^35 - 1 = 6238131189 Res mod 2^36 - 1 = 41735145962 Clocks = 00:00:36.674 NTHREADS = 4 M20000047: using FFT length 1024K = 1048576 8-byte floats. this gives an average 19.073531150817871 bits per digit Using complex FFT radices 64 32 16 16 mers_mod_square: Init threadpool of 4 threads 100 iterations of M20000047 with FFT length 1048576 = 1024 K Res64: DD61B3E031F1E0BA. AvgMaxErr = 0.237276786. MaxErr = 0.281250000. Program: E17.1 Res mod 2^36 = 837935290 Res mod 2^35 - 1 = 6238131189 Res mod 2^36 - 1 = 41735145962 Clocks = 00:00:34.919 NTHREADS = 4 M20000047: using FFT length 1024K = 1048576 8-byte floats. this gives an average 19.073531150817871 bits per digit Using complex FFT radices 64 8 8 8 16 mers_mod_square: Init threadpool of 4 threads 100 iterations of M20000047 with FFT length 1048576 = 1024 K Res64: DD61B3E031F1E0BA. AvgMaxErr = 0.253459821. MaxErr = 0.312500000. Program: E17.1 Res mod 2^36 = 837935290 Res mod 2^35 - 1 = 6238131189 Res mod 2^36 - 1 = 41735145962 Clocks = 00:00:32.885 NTHREADS = 4 M20000047: using FFT length 1024K = 1048576 8-byte floats. this gives an average 19.073531150817871 bits per digit Using complex FFT radices 32 16 32 32 mers_mod_square: Init threadpool of 4 threads Using 4 threads in carry step 100 iterations of M20000047 with FFT length 1048576 = 1024 K Res64: DD61B3E031F1E0BA. AvgMaxErr = 0.227790179. MaxErr = 0.250000000. Program: E17.1 Res mod 2^36 = 837935290 Res mod 2^35 - 1 = 6238131189 Res mod 2^36 - 1 = 41735145962 Clocks = 00:00:39.113 NTHREADS = 4 M20000047: using FFT length 1024K = 1048576 8-byte floats. this gives an average 19.073531150817871 bits per digit Using complex FFT radices 32 32 32 16 mers_mod_square: Init threadpool of 4 threads 100 iterations of M20000047 with FFT length 1048576 = 1024 K Res64: DD61B3E031F1E0BA. AvgMaxErr = 0.225892857. MaxErr = 0.281250000. Program: E17.1 Res mod 2^36 = 837935290 Res mod 2^35 - 1 = 6238131189 Res mod 2^36 - 1 = 41735145962 Clocks = 00:00:38.280 NTHREADS = 4 M20000047: using FFT length 1024K = 1048576 8-byte floats. this gives an average 19.073531150817871 bits per digit Using complex FFT radices 32 8 8 16 16 mers_mod_square: Init threadpool of 4 threads 100 iterations of M20000047 with FFT length 1048576 = 1024 K Res64: DD61B3E031F1E0BA. AvgMaxErr = 0.244642857. MaxErr = 0.281250000. Program: E17.1 Res mod 2^36 = 837935290 Res mod 2^35 - 1 = 6238131189 Res mod 2^36 - 1 = 41735145962 Clocks = 00:00:34.925 NTHREADS = 4 M20000047: using FFT length 1024K = 1048576 8-byte floats. this gives an average 19.073531150817871 bits per digit Using complex FFT radices 16 32 32 32 mers_mod_square: Init threadpool of 4 threads Using 4 threads in carry step 100 iterations of M20000047 with FFT length 1048576 = 1024 K Res64: DD61B3E031F1E0BA. AvgMaxErr = 0.223883929. MaxErr = 0.250000000. Program: E17.1 Res mod 2^36 = 837935290 Res mod 2^35 - 1 = 6238131189 Res mod 2^36 - 1 = 41735145962 Clocks = 00:00:39.905 NTHREADS = 4 M20000047: using FFT length 1024K = 1048576 8-byte floats. this gives an average 19.073531150817871 bits per digit Using complex FFT radices 16 8 16 16 16 mers_mod_square: Init threadpool of 4 threads 100 iterations of M20000047 with FFT length 1048576 = 1024 K Res64: DD61B3E031F1E0BA. AvgMaxErr = 0.238281250. MaxErr = 0.281250000. Program: E17.1 Res mod 2^36 = 837935290 Res mod 2^35 - 1 = 6238131189 Res mod 2^36 - 1 = 41735145962 Clocks = 00:00:36.320 NTHREADS = 4 M20000047: using FFT length 1024K = 1048576 8-byte floats. this gives an average 19.073531150817871 bits per digit Using complex FFT radices 8 16 16 16 16 mers_mod_square: Init threadpool of 4 threads Using 4 threads in carry step 100 iterations of M20000047 with FFT length 1048576 = 1024 K Res64: DD61B3E031F1E0BA. AvgMaxErr = 0.236941964. MaxErr = 0.281250000. Program: E17.1 Res mod 2^36 = 837935290 Res mod 2^35 - 1 = 6238131189 Res mod 2^36 - 1 = 41735145962 Clocks = 00:01:06.301 INFO: 13 of 13 radix-sets at FFT length 1024 K passed - writing cfg-file entry. / **************************************************************************** / NTHREADS = 4 M22442237: using FFT length 1152K = 1179648 8-byte floats. this gives an average 19.024520026312935 bits per digit Using complex FFT radices 288 8 16 16 mers_mod_square: Init threadpool of 4 threads Using 4 threads in carry step 100 iterations of M22442237 with FFT length 1179648 = 1152 K Res64: 62C479B03F3E9DD9. AvgMaxErr = 0.206103516. MaxErr = 0.250000000. Program: E17.1 Res mod 2^36 = 1061068249 Res mod 2^35 - 1 = 16186007845 Res mod 2^36 - 1 = 66602070649 Clocks = 00:00:39.549 NTHREADS = 4 M22442237: using FFT length 1152K = 1179648 8-byte floats. this gives an average 19.024520026312935 bits per digit Using complex FFT radices 144 16 16 16 mers_mod_square: Init threadpool of 4 threads Using 4 threads in carry step 100 iterations of M22442237 with FFT length 1179648 = 1152 K Res64: 62C479B03F3E9DD9. AvgMaxErr = 0.204610770. MaxErr = 0.250000000. Program: E17.1 Res mod 2^36 = 1061068249 Res mod 2^35 - 1 = 16186007845 Res mod 2^36 - 1 = 66602070649 Clocks = 00:00:38.524 NTHREADS = 4 M22442237: using FFT length 1152K = 1179648 8-byte floats. this gives an average 19.024520026312935 bits per digit Using complex FFT radices 36 16 32 32 mers_mod_square: radix0/2 not exactly divisible by NTHREADS - This will hurt performance. mers_mod_square: Init threadpool of 4 threads Using 4 threads in carry step 100 iterations of M22442237 with FFT length 1179648 = 1152 K Res64: 62C479B03F3E9DD9. AvgMaxErr = 0.204241071. MaxErr = 0.250000000. Program: E17.1 Res mod 2^36 = 1061068249 Res mod 2^35 - 1 = 16186007845 Res mod 2^36 - 1 = 66602070649 Clocks = 00:00:44.641 NTHREADS = 4 M22442237: using FFT length 1152K = 1179648 8-byte floats. this gives an average 19.024520026312935 bits per digit Using complex FFT radices 36 32 32 16 mers_mod_square: radix0/2 not exactly divisible by NTHREADS - This will hurt performance. mers_mod_square: Init threadpool of 4 threads 100 iterations of M22442237 with FFT length 1179648 = 1152 K Res64: 62C479B03F3E9DD9. AvgMaxErr = 0.203878348. MaxErr = 0.250000000. Program: E17.1 Res mod 2^36 = 1061068249 Res mod 2^35 - 1 = 16186007845 Res mod 2^36 - 1 = 66602070649 Clocks = 00:00:42.470 NTHREADS = 4 M22442237: using FFT length 1152K = 1179648 8-byte floats. this gives an average 19.024520026312935 bits per digit Using complex FFT radices 18 32 32 32 mers_mod_square: radix0/2 not exactly divisible by NTHREADS - This will hurt performance. mers_mod_square: Init threadpool of 4 threads 100 iterations of M22442237 with FFT length 1179648 = 1152 K Res64: 62C479B03F3E9DD9. AvgMaxErr = 0.203613281. MaxErr = 0.250000000. Program: E17.1 Res mod 2^36 = 1061068249 Res mod 2^35 - 1 = 16186007845 Res mod 2^36 - 1 = 66602070649 Clocks = 00:01:22.885 NTHREADS = 4 M22442237: using FFT length 1152K = 1179648 8-byte floats. this gives an average 19.024520026312935 bits per digit Using complex FFT radices 18 8 16 16 16 mers_mod_square: radix0/2 not exactly divisible by NTHREADS - This will hurt performance. mers_mod_square: Init threadpool of 4 threads 100 iterations of M22442237 with FFT length 1179648 = 1152 K Res64: 62C479B03F3E9DD9. AvgMaxErr = 0.206537737. MaxErr = 0.250000000. Program: E17.1 Res mod 2^36 = 1061068249 Res mod 2^35 - 1 = 16186007845 Res mod 2^36 - 1 = 66602070649 Clocks = 00:01:18.259 INFO: 6 of 6 radix-sets at FFT length 1152 K passed - writing cfg-file entry. / **************************************************************************** / NTHREADS = 4 M24878401: using FFT length 1280K = 1310720 8-byte floats. this gives an average 18.980713653564454 bits per digit Using complex FFT radices 160 16 16 16 mers_mod_square: Init threadpool of 4 threads Using 4 threads in carry step 100 iterations of M24878401 with FFT length 1310720 = 1280 K Res64: 8A8644FC94CB0A8B. AvgMaxErr = 0.234486607. MaxErr = 0.281250000. Program: E17.1 Res mod 2^36 = 54035942027 Res mod 2^35 - 1 = 31904286697 Res mod 2^36 - 1 = 27818942965 Clocks = 00:00:43.889 NTHREADS = 4 M24878401: using FFT length 1280K = 1310720 8-byte floats. this gives an average 18.980713653564454 bits per digit Using complex FFT radices 40 16 32 32 mers_mod_square: Init threadpool of 4 threads Using 4 threads in carry step 100 iterations of M24878401 with FFT length 1310720 = 1280 K Res64: 8A8644FC94CB0A8B. AvgMaxErr = 0.239285714. MaxErr = 0.281250000. Program: E17.1 Res mod 2^36 = 54035942027 Res mod 2^35 - 1 = 31904286697 Res mod 2^36 - 1 = 27818942965 Clocks = 00:00:50.892 NTHREADS = 4 M24878401: using FFT length 1280K = 1310720 8-byte floats. this gives an average 18.980713653564454 bits per digit Using complex FFT radices 40 32 32 16 mers_mod_square: Init threadpool of 4 threads 100 iterations of M24878401 with FFT length 1310720 = 1280 K Res64: 8A8644FC94CB0A8B. AvgMaxErr = 0.241852679. MaxErr = 0.312500000. Program: E17.1 Res mod 2^36 = 54035942027 Res mod 2^35 - 1 = 31904286697 Res mod 2^36 - 1 = 27818942965 Clocks = 00:00:49.985 NTHREADS = 4 M24878401: using FFT length 1280K = 1310720 8-byte floats. this gives an average 18.980713653564454 bits per digit Using complex FFT radices 20 32 32 32 mers_mod_square: radix0/2 not exactly divisible by NTHREADS - This will hurt performance. mers_mod_square: Init threadpool of 4 threads Using 4 threads in carry step 100 iterations of M24878401 with FFT length 1310720 = 1280 K Res64: 8A8644FC94CB0A8B. AvgMaxErr = 0.226227679. MaxErr = 0.281250000. Program: E17.1 Res mod 2^36 = 54035942027 Res mod 2^35 - 1 = 31904286697 Res mod 2^36 - 1 = 27818942965 Clocks = 00:00:50.927 NTHREADS = 4 M24878401: using FFT length 1280K = 1310720 8-byte floats. this gives an average 18.980713653564454 bits per digit Using complex FFT radices 20 8 16 16 16 mers_mod_square: radix0/2 not exactly divisible by NTHREADS - This will hurt performance. mers_mod_square: Init threadpool of 4 threads 100 iterations of M24878401 with FFT length 1310720 = 1280 K Res64: 8A8644FC94CB0A8B. AvgMaxErr = 0.231026786. MaxErr = 0.257812500. Program: E17.1 Res mod 2^36 = 54035942027 Res mod 2^35 - 1 = 31904286697 Res mod 2^36 - 1 = 27818942965 Clocks = 00:00:47.044 NTHREADS = 4 M24878401: using FFT length 1280K = 1310720 8-byte floats. this gives an average 18.980713653564454 bits per digit Using complex FFT radices 10 16 16 16 16 mers_mod_square: radix0/2 not exactly divisible by NTHREADS - This will hurt performance. mers_mod_square: Init threadpool of 4 threads 100 iterations of M24878401 with FFT length 1310720 = 1280 K Res64: 8A8644FC94CB0A8B. AvgMaxErr = 0.232045201. MaxErr = 0.281250000. Program: E17.1 Res mod 2^36 = 54035942027 Res mod 2^35 - 1 = 31904286697 Res mod 2^36 - 1 = 27818942965 Clocks = 00:01:26.540 INFO: 6 of 6 radix-sets at FFT length 1280 K passed - writing cfg-file entry. / **************************************************************************** / NTHREADS = 4 M27309229: using FFT length 1408K = 1441792 8-byte floats. this gives an average 18.941171125932172 bits per digit Using complex FFT radices 176 16 16 16 mers_mod_square: Init threadpool of 4 threads Using 4 threads in carry step 100 iterations of M27309229 with FFT length 1441792 = 1408 K Res64: CCE2DF04E61DC922. AvgMaxErr = 0.215952846. MaxErr = 0.250000000. Program: E17.1 Res mod 2^36 = 21040580898 Res mod 2^35 - 1 = 20296295440 Res mod 2^36 - 1 = 23445145000 Clocks = 00:00:47.623 NTHREADS = 4 M27309229: using FFT length 1408K = 1441792 8-byte floats. this gives an average 18.941171125932172 bits per digit Using complex FFT radices 44 16 32 32 mers_mod_square: radix0/2 not exactly divisible by NTHREADS - This will hurt performance. mers_mod_square: Init threadpool of 4 threads Using 4 threads in carry step 100 iterations of M27309229 with FFT length 1441792 = 1408 K Res64: CCE2DF04E61DC922. AvgMaxErr = 0.210944475. MaxErr = 0.250000000. Program: E17.1 Res mod 2^36 = 21040580898 Res mod 2^35 - 1 = 20296295440 Res mod 2^36 - 1 = 23445145000 Clocks = 00:00:55.964 NTHREADS = 4 M27309229: using FFT length 1408K = 1441792 8-byte floats. this gives an average 18.941171125932172 bits per digit Using complex FFT radices 44 32 32 16 mers_mod_square: radix0/2 not exactly divisible by NTHREADS - This will hurt performance. mers_mod_square: Init threadpool of 4 threads 100 iterations of M27309229 with FFT length 1441792 = 1408 K Res64: CCE2DF04E61DC922. AvgMaxErr = 0.214753069. MaxErr = 0.250000000. Program: E17.1 Res mod 2^36 = 21040580898 Res mod 2^35 - 1 = 20296295440 Res mod 2^36 - 1 = 23445145000 Clocks = 00:00:54.546 NTHREADS = 4 M27309229: using FFT length 1408K = 1441792 8-byte floats. this gives an average 18.941171125932172 bits per digit Using complex FFT radices 22 32 32 32 mers_mod_square: radix0/2 not exactly divisible by NTHREADS - This will hurt performance. mers_mod_square: Init threadpool of 4 threads 100 iterations of M27309229 with FFT length 1441792 = 1408 K Res64: CCE2DF04E61DC922. AvgMaxErr = 0.212904576. MaxErr = 0.250000000. Program: E17.1 Res mod 2^36 = 21040580898 Res mod 2^35 - 1 = 20296295440 Res mod 2^36 - 1 = 23445145000 Clocks = 00:01:47.818 NTHREADS = 4 M27309229: using FFT length 1408K = 1441792 8-byte floats. this gives an average 18.941171125932172 bits per digit Using complex FFT radices 22 8 16 16 16 mers_mod_square: radix0/2 not exactly divisible by NTHREADS - This will hurt performance. mers_mod_square: Init threadpool of 4 threads 100 iterations of M27309229 with FFT length 1441792 = 1408 K Res64: CCE2DF04E61DC922. AvgMaxErr = 0.220452009. MaxErr = 0.250000000. Program: E17.1 Res mod 2^36 = 21040580898 Res mod 2^35 - 1 = 20296295440 Res mod 2^36 - 1 = 23445145000 Clocks = 00:01:39.288 INFO: 5 of 5 radix-sets at FFT length 1408 K passed - writing cfg-file entry. / **************************************************************************** / NTHREADS = 4 M29735137: using FFT length 1536K = 1572864 8-byte floats. this gives an average 18.905090967814129 bits per digit Using complex FFT radices 768 32 32 mers_mod_square: Init threadpool of 4 threads Using 4 threads in carry step 100 iterations of M29735137 with FFT length 1572864 = 1536 K Res64: 2D26046FFAAEBC2B. AvgMaxErr = 0.213560268. MaxErr = 0.250000000. Program: E17.1 Res mod 2^36 = 68630264875 Res mod 2^35 - 1 = 2381591353 Res mod 2^36 - 1 = 48693163035 Clocks = 00:01:02.818 NTHREADS = 4 M29735137: using FFT length 1536K = 1572864 8-byte floats. this gives an average 18.905090967814129 bits per digit Using complex FFT radices 192 16 16 16 mers_mod_square: Init threadpool of 4 threads Using 4 threads in carry step 100 iterations of M29735137 with FFT length 1572864 = 1536 K Res64: 2D26046FFAAEBC2B. AvgMaxErr = 0.221540179. MaxErr = 0.250000000. Program: E17.1 Res mod 2^36 = 68630264875 Res mod 2^35 - 1 = 2381591353 Res mod 2^36 - 1 = 48693163035 Clocks = 00:00:53.555 NTHREADS = 4 M29735137: using FFT length 1536K = 1572864 8-byte floats. this gives an average 18.905090967814129 bits per digit Using complex FFT radices 48 16 32 32 mers_mod_square: Init threadpool of 4 threads Using 4 threads in carry step 100 iterations of M29735137 with FFT length 1572864 = 1536 K Res64: 2D26046FFAAEBC2B. AvgMaxErr = 0.207533482. MaxErr = 0.250000000. Program: E17.1 Res mod 2^36 = 68630264875 Res mod 2^35 - 1 = 2381591353 Res mod 2^36 - 1 = 48693163035 Clocks = 00:01:00.589 NTHREADS = 4 M29735137: using FFT length 1536K = 1572864 8-byte floats. this gives an average 18.905090967814129 bits per digit Using complex FFT radices 48 32 32 16 mers_mod_square: Init threadpool of 4 threads 100 iterations of M29735137 with FFT length 1572864 = 1536 K Res64: 2D26046FFAAEBC2B. AvgMaxErr = 0.207250977. MaxErr = 0.250000000. Program: E17.1 Res mod 2^36 = 68630264875 Res mod 2^35 - 1 = 2381591353 Res mod 2^36 - 1 = 48693163035 Clocks = 00:00:58.561 NTHREADS = 4 M29735137: using FFT length 1536K = 1572864 8-byte floats. this gives an average 18.905090967814129 bits per digit Using complex FFT radices 24 32 32 32 mers_mod_square: Init threadpool of 4 threads Using 4 threads in carry step 100 iterations of M29735137 with FFT length 1572864 = 1536 K Res64: 2D26046FFAAEBC2B. AvgMaxErr = 0.210906110. MaxErr = 0.250000000. Program: E17.1 Res mod 2^36 = 68630264875 Res mod 2^35 - 1 = 2381591353 Res mod 2^36 - 1 = 48693163035 Clocks = 00:01:01.047 NTHREADS = 4 M29735137: using FFT length 1536K = 1572864 8-byte floats. this gives an average 18.905090967814129 bits per digit Using complex FFT radices 12 16 16 16 16 mers_mod_square: radix0/2 not exactly divisible by NTHREADS - This will hurt performance. mers_mod_square: Init threadpool of 4 threads 100 iterations of M29735137 with FFT length 1572864 = 1536 K Res64: 2D26046FFAAEBC2B. AvgMaxErr = 0.209640067. MaxErr = 0.250000000. Program: E17.1 Res mod 2^36 = 68630264875 Res mod 2^35 - 1 = 2381591353 Res mod 2^36 - 1 = 48693163035 Clocks = 00:01:39.647 INFO: 6 of 6 radix-sets at FFT length 1536 K passed - writing cfg-file entry. / **************************************************************************** / NTHREADS = 4 M32156581: using FFT length 1664K = 1703936 8-byte floats. this gives an average 18.871941786545975 bits per digit Using complex FFT radices 208 16 16 16 mers_mod_square: Init threadpool of 4 threads Using 4 threads in carry step 100 iterations of M32156581 with FFT length 1703936 = 1664 K Res64: 77E274E6C29C203E. AvgMaxErr = 0.237590681. MaxErr = 0.312500000. Program: E17.1 Res mod 2^36 = 29034815550 Res mod 2^35 - 1 = 8887595531 Res mod 2^36 - 1 = 5248686295 Clocks = 00:00:54.559 NTHREADS = 4 M32156581: using FFT length 1664K = 1703936 8-byte floats. this gives an average 18.871941786545975 bits per digit Using complex FFT radices 52 16 32 32 mers_mod_square: radix0/2 not exactly divisible by NTHREADS - This will hurt performance. mers_mod_square: Init threadpool of 4 threads Using 4 threads in carry step 100 iterations of M32156581 with FFT length 1703936 = 1664 K Res64: 77E274E6C29C203E. AvgMaxErr = 0.237234933. MaxErr = 0.312500000. Program: E17.1 Res mod 2^36 = 29034815550 Res mod 2^35 - 1 = 8887595531 Res mod 2^36 - 1 = 5248686295 Clocks = 00:01:06.391 NTHREADS = 4 M32156581: using FFT length 1664K = 1703936 8-byte floats. this gives an average 18.871941786545975 bits per digit Using complex FFT radices 52 32 32 16 mers_mod_square: radix0/2 not exactly divisible by NTHREADS - This will hurt performance. mers_mod_square: Init threadpool of 4 threads 100 iterations of M32156581 with FFT length 1703936 = 1664 K Res64: 77E274E6C29C203E. AvgMaxErr = 0.243035889. MaxErr = 0.281250000. Program: E17.1 Res mod 2^36 = 29034815550 Res mod 2^35 - 1 = 8887595531 Res mod 2^36 - 1 = 5248686295 Clocks = 00:01:04.974 NTHREADS = 4 M32156581: using FFT length 1664K = 1703936 8-byte floats. this gives an average 18.871941786545975 bits per digit Using complex FFT radices 26 32 32 32 mers_mod_square: radix0/2 not exactly divisible by NTHREADS - This will hurt performance. mers_mod_square: Init threadpool of 4 threads 100 iterations of M32156581 with FFT length 1703936 = 1664 K Res64: 77E274E6C29C203E. AvgMaxErr = 0.241964286. MaxErr = 0.312500000. Program: E17.1 Res mod 2^36 = 29034815550 Res mod 2^35 - 1 = 8887595531 Res mod 2^36 - 1 = 5248686295 Clocks = 00:02:05.754 NTHREADS = 4 M32156581: using FFT length 1664K = 1703936 8-byte floats. this gives an average 18.871941786545975 bits per digit Using complex FFT radices 26 8 16 16 16 mers_mod_square: radix0/2 not exactly divisible by NTHREADS - This will hurt performance. pthread_create:: Cannot allocate memory pthread_join: : Cannot allocate memory pthread_join: : Cannot allocate memory pthread_join: : Cannot allocate memory ERROR: at line 1473 of file ../src/mers_mod_square.c Assertion failed: threadpool_init failed!