[Sat Apr 29 09:16:19 2017] Compare your results to other computers at http://www.mersenne.org/report_benchmarks Intel(R) Core(TM) i7-5960X CPU @ 3.00GHz CPU speed: 3489.90 MHz, 8 cores CPU features: Prefetch, SSE, SSE2, SSE4, AVX, AVX2, FMA L1 cache size: 32 KB L2 cache size: 256 KB, L3 cache size: 20 MB L1 cache line size: 64 bytes L2 cache line size: 64 bytes TLBS: 64 Machine topology as determined by hwloc library: Machine#0 (total=29212260KB, Backend=Windows, hwlocVersion=1.11.6, ProcessName=prime95.exe) NUMANode#0 (local=29212260KB, total=29212260KB) Package#0 (CPUVendor=GenuineIntel, CPUFamilyNumber=6, CPUModelNumber=63, CPUModel="Intel(R) Core(TM) i7-5960X CPU @ 3.00GHz", CPUStepping=2) L3 (size=20480KB, linesize=64, ways=20, Inclusive=1) L2 (size=256KB, linesize=64, ways=8, Inclusive=0) L1d (size=32KB, linesize=64, ways=8, Inclusive=0) Core (cpuset: 0x00000001) PU#0 (cpuset: 0x00000001) L2 (size=256KB, linesize=64, ways=8, Inclusive=0) L1d (size=32KB, linesize=64, ways=8, Inclusive=0) Core (cpuset: 0x00000002) PU#1 (cpuset: 0x00000002) L2 (size=256KB, linesize=64, ways=8, Inclusive=0) L1d (size=32KB, linesize=64, ways=8, Inclusive=0) Core (cpuset: 0x00000004) PU#2 (cpuset: 0x00000004) L2 (size=256KB, linesize=64, ways=8, Inclusive=0) L1d (size=32KB, linesize=64, ways=8, Inclusive=0) Core (cpuset: 0x00000008) PU#3 (cpuset: 0x00000008) L2 (size=256KB, linesize=64, ways=8, Inclusive=0) L1d (size=32KB, linesize=64, ways=8, Inclusive=0) Core (cpuset: 0x00000010) PU#4 (cpuset: 0x00000010) L2 (size=256KB, linesize=64, ways=8, Inclusive=0) L1d (size=32KB, linesize=64, ways=8, Inclusive=0) Core (cpuset: 0x00000020) PU#5 (cpuset: 0x00000020) L2 (size=256KB, linesize=64, ways=8, Inclusive=0) L1d (size=32KB, linesize=64, ways=8, Inclusive=0) Core (cpuset: 0x00000040) PU#6 (cpuset: 0x00000040) L2 (size=256KB, linesize=64, ways=8, Inclusive=0) L1d (size=32KB, linesize=64, ways=8, Inclusive=0) Core (cpuset: 0x00000080) PU#7 (cpuset: 0x00000080) Prime95 64-bit version 29.2, RdtscTiming=1 FFTlen=16K, Type=1, Arch=4, Pass1=16384, Pass2=0, clm=0 (8 cpus, 1 worker): 0.06 ms. Throughput: 15616.94 iter/sec. FFTlen=16K, Type=1, Arch=4, Pass1=16384, Pass2=0, clm=0 (8 cpus, 8 workers): 0.07, 0.07, 0.07, 0.07, 0.07, 0.07, 0.07, 0.07 ms. Throughput: 119150.93 iter/sec. FFTlen=16K, Type=3, Arch=4, Pass1=256, Pass2=64, clm=4 (8 cpus, 1 worker): 0.06 ms. Throughput: 16777.31 iter/sec. FFTlen=16K, Type=3, Arch=4, Pass1=256, Pass2=64, clm=4 (8 cpus, 8 workers): 0.05, 0.05, 0.05, 0.05, 0.05, 0.05, 0.06, 0.06 ms. Throughput: 147908.24 iter/sec. FFTlen=16K, Type=3, Arch=4, Pass1=256, Pass2=64, clm=2 (8 cpus, 1 worker): 0.09 ms. Throughput: 11546.62 iter/sec. FFTlen=16K, Type=3, Arch=4, Pass1=256, Pass2=64, clm=2 (8 cpus, 8 workers): 0.05, 0.05, 0.05, 0.05, 0.05, 0.05, 0.05, 0.05 ms. Throughput: 161203.54 iter/sec. FFTlen=16K, Type=3, Arch=4, Pass1=256, Pass2=64, clm=1 (8 cpus, 1 worker): 0.09 ms. Throughput: 10813.00 iter/sec. FFTlen=16K, Type=3, Arch=4, Pass1=256, Pass2=64, clm=1 (8 cpus, 8 workers): 0.05, 0.05, 0.05, 0.05, 0.05, 0.05, 0.05, 0.05 ms. Throughput: 158171.11 iter/sec. FFTlen=18K, Type=1, Arch=4, Pass1=18432, Pass2=0, clm=0 (8 cpus, 1 worker): 0.08 ms. Throughput: 12653.44 iter/sec. FFTlen=18K, Type=1, Arch=4, Pass1=18432, Pass2=0, clm=0 (8 cpus, 8 workers): 0.09, 0.08, 0.08, 0.08, 0.08, 0.09, 0.08, 0.08 ms. Throughput: 95677.95 iter/sec. FFTlen=18K, Type=3, Arch=4, Pass1=384, Pass2=48, clm=4 (8 cpus, 1 worker): 0.07 ms. Throughput: 14004.99 iter/sec. FFTlen=18K, Type=3, Arch=4, Pass1=384, Pass2=48, clm=4 (8 cpus, 8 workers): 0.06, 0.06, 0.06, 0.06, 0.06, 0.06, 0.06, 0.06 ms. Throughput: 130730.75 iter/sec. FFTlen=18K, Type=3, Arch=4, Pass1=384, Pass2=48, clm=2 (8 cpus, 1 worker): 0.10 ms. Throughput: 9993.73 iter/sec. FFTlen=18K, Type=3, Arch=4, Pass1=384, Pass2=48, clm=2 (8 cpus, 8 workers): 0.06, 0.06, 0.06, 0.06, 0.06, 0.06, 0.06, 0.06 ms. Throughput: 135822.61 iter/sec. FFTlen=18K, Type=3, Arch=4, Pass1=384, Pass2=48, clm=1 (8 cpus, 1 worker): 0.09 ms. Throughput: 10599.63 iter/sec. FFTlen=18K, Type=3, Arch=4, Pass1=384, Pass2=48, clm=1 (8 cpus, 8 workers): 0.06, 0.06, 0.06, 0.06, 0.06, 0.06, 0.06, 0.06 ms. Throughput: 134476.54 iter/sec. FFTlen=20K, Type=1, Arch=4, Pass1=20480, Pass2=0, clm=0 (8 cpus, 1 worker): 0.08 ms. Throughput: 11987.81 iter/sec. FFTlen=20K, Type=1, Arch=4, Pass1=20480, Pass2=0, clm=0 (8 cpus, 8 workers): 0.09, 0.09, 0.09, 0.08, 0.09, 0.09, 0.08, 0.08 ms. Throughput: 93642.66 iter/sec. FFTlen=20K, Type=3, Arch=4, Pass1=256, Pass2=80, clm=4 (8 cpus, 1 worker): 0.07 ms. Throughput: 14549.09 iter/sec. FFTlen=20K, Type=3, Arch=4, Pass1=256, Pass2=80, clm=4 (8 cpus, 8 workers): 0.07, 0.07, 0.07, 0.07, 0.07, 0.07, 0.07, 0.07 ms. Throughput: 114581.01 iter/sec. FFTlen=20K, Type=3, Arch=4, Pass1=256, Pass2=80, clm=2 (8 cpus, 1 worker): 0.09 ms. Throughput: 10675.81 iter/sec. FFTlen=20K, Type=3, Arch=4, Pass1=256, Pass2=80, clm=2 (8 cpus, 8 workers): 0.07, 0.07, 0.07, 0.07, 0.07, 0.07, 0.07, 0.07 ms. Throughput: 120648.30 iter/sec. FFTlen=20K, Type=3, Arch=4, Pass1=256, Pass2=80, clm=1 (8 cpus, 1 worker): 0.08 ms. Throughput: 12001.27 iter/sec. FFTlen=20K, Type=3, Arch=4, Pass1=256, Pass2=80, clm=1 (8 cpus, 8 workers): 0.07, 0.07, 0.06, 0.07, 0.07, 0.07, 0.06, 0.06 ms. Throughput: 122030.85 iter/sec. FFTlen=20K, Type=3, Arch=4, Pass1=320, Pass2=64, clm=4 (8 cpus, 1 worker): 0.07 ms. Throughput: 13918.11 iter/sec. FFTlen=20K, Type=3, Arch=4, Pass1=320, Pass2=64, clm=4 (8 cpus, 8 workers): 0.07, 0.07, 0.07, 0.07, 0.07, 0.07, 0.07, 0.07 ms. Throughput: 116880.73 iter/sec. FFTlen=20K, Type=3, Arch=4, Pass1=320, Pass2=64, clm=2 (8 cpus, 1 worker): 0.10 ms. Throughput: 9805.49 iter/sec. FFTlen=20K, Type=3, Arch=4, Pass1=320, Pass2=64, clm=2 (8 cpus, 8 workers): 0.07, 0.06, 0.07, 0.06, 0.07, 0.07, 0.06, 0.06 ms. Throughput: 122781.21 iter/sec. FFTlen=20K, Type=3, Arch=4, Pass1=320, Pass2=64, clm=1 (8 cpus, 1 worker): 0.10 ms. Throughput: 10257.61 iter/sec. FFTlen=20K, Type=3, Arch=4, Pass1=320, Pass2=64, clm=1 (8 cpus, 8 workers): 0.07, 0.07, 0.07, 0.06, 0.07, 0.07, 0.07, 0.06 ms. Throughput: 121780.41 iter/sec. FFTlen=21K, Type=3, Arch=4, Pass1=448, Pass2=48, clm=4 (8 cpus, 1 worker): 0.08 ms. Throughput: 12519.21 iter/sec. FFTlen=21K, Type=3, Arch=4, Pass1=448, Pass2=48, clm=4 (8 cpus, 8 workers): 0.08, 0.07, 0.08, 0.07, 0.08, 0.07, 0.07, 0.08 ms. Throughput: 105802.17 iter/sec. FFTlen=21K, Type=3, Arch=4, Pass1=448, Pass2=48, clm=2 (8 cpus, 1 worker): 0.11 ms. Throughput: 8753.17 iter/sec. FFTlen=21K, Type=3, Arch=4, Pass1=448, Pass2=48, clm=2 (8 cpus, 8 workers): 0.07, 0.07, 0.07, 0.07, 0.07, 0.07, 0.07, 0.07 ms. Throughput: 112177.23 iter/sec. FFTlen=21K, Type=3, Arch=4, Pass1=448, Pass2=48, clm=1 (8 cpus, 1 worker): 0.11 ms. Throughput: 9254.01 iter/sec. FFTlen=21K, Type=3, Arch=4, Pass1=448, Pass2=48, clm=1 (8 cpus, 8 workers): 0.07, 0.07, 0.07, 0.07, 0.07, 0.07, 0.07, 0.07 ms. Throughput: 110013.77 iter/sec. FFTlen=24K, Type=1, Arch=4, Pass1=24576, Pass2=0, clm=0 (8 cpus, 1 worker): 0.11 ms. Throughput: 8910.58 iter/sec. [Sat Apr 29 09:21:24 2017] FFTlen=24K, Type=1, Arch=4, Pass1=24576, Pass2=0, clm=0 (8 cpus, 8 workers): 0.12, 0.12, 0.12, 0.11, 0.12, 0.12, 0.11, 0.11 ms. Throughput: 69112.53 iter/sec. FFTlen=24K, Type=3, Arch=4, Pass1=128, Pass2=192, clm=4 (8 cpus, 1 worker): 0.06 ms. Throughput: 17918.86 iter/sec. FFTlen=24K, Type=3, Arch=4, Pass1=128, Pass2=192, clm=4 (8 cpus, 8 workers): 0.08, 0.08, 0.08, 0.08, 0.08, 0.08, 0.08, 0.08 ms. Throughput: 96853.12 iter/sec. FFTlen=24K, Type=3, Arch=4, Pass1=128, Pass2=192, clm=2 (8 cpus, 1 worker): 0.06 ms. Throughput: 17584.42 iter/sec. FFTlen=24K, Type=3, Arch=4, Pass1=128, Pass2=192, clm=2 (8 cpus, 8 workers): 0.08, 0.08, 0.08, 0.08, 0.08, 0.08, 0.08, 0.08 ms. Throughput: 98771.50 iter/sec. FFTlen=24K, Type=3, Arch=4, Pass1=128, Pass2=192, clm=1 (8 cpus, 1 worker): 0.07 ms. Throughput: 15068.98 iter/sec. FFTlen=24K, Type=3, Arch=4, Pass1=128, Pass2=192, clm=1 (8 cpus, 8 workers): 0.08, 0.08, 0.08, 0.08, 0.09, 0.08, 0.08, 0.08 ms. Throughput: 95624.65 iter/sec. FFTlen=24K, Type=3, Arch=4, Pass1=384, Pass2=64, clm=4 (8 cpus, 1 worker): 0.09 ms. Throughput: 11567.40 iter/sec. FFTlen=24K, Type=3, Arch=4, Pass1=384, Pass2=64, clm=4 (8 cpus, 8 workers): 0.08, 0.08, 0.08, 0.08, 0.08, 0.08, 0.08, 0.08 ms. Throughput: 95891.98 iter/sec. FFTlen=24K, Type=3, Arch=4, Pass1=384, Pass2=64, clm=2 (8 cpus, 1 worker): 0.12 ms. Throughput: 8536.07 iter/sec. FFTlen=24K, Type=3, Arch=4, Pass1=384, Pass2=64, clm=2 (8 cpus, 8 workers): 0.08, 0.08, 0.08, 0.08, 0.08, 0.08, 0.08, 0.08 ms. Throughput: 101049.21 iter/sec. FFTlen=24K, Type=3, Arch=4, Pass1=384, Pass2=64, clm=1 (8 cpus, 1 worker): 0.11 ms. Throughput: 9041.27 iter/sec. FFTlen=24K, Type=3, Arch=4, Pass1=384, Pass2=64, clm=1 (8 cpus, 8 workers): 0.08, 0.08, 0.08, 0.08, 0.08, 0.08, 0.08, 0.08 ms. Throughput: 100567.97 iter/sec. FFTlen=24K, Type=3, Arch=4, Pass1=512, Pass2=48, clm=4 (8 cpus, 1 worker): 0.09 ms. Throughput: 10642.07 iter/sec. FFTlen=24K, Type=3, Arch=4, Pass1=512, Pass2=48, clm=4 (8 cpus, 8 workers): 0.09, 0.09, 0.09, 0.09, 0.09, 0.09, 0.09, 0.09 ms. Throughput: 86251.51 iter/sec. FFTlen=24K, Type=3, Arch=4, Pass1=512, Pass2=48, clm=2 (8 cpus, 1 worker): 0.13 ms. Throughput: 7728.48 iter/sec. FFTlen=24K, Type=3, Arch=4, Pass1=512, Pass2=48, clm=2 (8 cpus, 8 workers): 0.08, 0.08, 0.08, 0.08, 0.08, 0.08, 0.08, 0.08 ms. Throughput: 99647.01 iter/sec. FFTlen=24K, Type=3, Arch=4, Pass1=512, Pass2=48, clm=1 (8 cpus, 1 worker): 0.12 ms. Throughput: 8292.12 iter/sec. FFTlen=24K, Type=3, Arch=4, Pass1=512, Pass2=48, clm=1 (8 cpus, 8 workers): 0.08, 0.08, 0.08, 0.08, 0.08, 0.08, 0.08, 0.08 ms. Throughput: 101532.51 iter/sec. FFTlen=25K, Type=3, Arch=4, Pass1=320, Pass2=80, clm=4 (8 cpus, 1 worker): 0.09 ms. Throughput: 10739.39 iter/sec. FFTlen=25K, Type=3, Arch=4, Pass1=320, Pass2=80, clm=4 (8 cpus, 8 workers): 0.09, 0.09, 0.09, 0.09, 0.09, 0.09, 0.09, 0.09 ms. Throughput: 92662.32 iter/sec. FFTlen=25K, Type=3, Arch=4, Pass1=320, Pass2=80, clm=2 (8 cpus, 1 worker): 0.11 ms. Throughput: 9012.02 iter/sec. FFTlen=25K, Type=3, Arch=4, Pass1=320, Pass2=80, clm=2 (8 cpus, 8 workers): 0.08, 0.08, 0.08, 0.08, 0.09, 0.08, 0.08, 0.08 ms. Throughput: 95459.42 iter/sec. FFTlen=25K, Type=3, Arch=4, Pass1=320, Pass2=80, clm=1 (8 cpus, 1 worker): 0.10 ms. Throughput: 10404.97 iter/sec. FFTlen=25K, Type=3, Arch=4, Pass1=320, Pass2=80, clm=1 (8 cpus, 8 workers): 0.09, 0.09, 0.08, 0.08, 0.09, 0.09, 0.08, 0.08 ms. Throughput: 94195.98 iter/sec. FFTlen=28K, Type=3, Arch=4, Pass1=448, Pass2=64, clm=4 (8 cpus, 1 worker): 0.10 ms. Throughput: 10051.31 iter/sec. FFTlen=28K, Type=3, Arch=4, Pass1=448, Pass2=64, clm=4 (8 cpus, 8 workers): 0.10, 0.10, 0.10, 0.10, 0.10, 0.10, 0.10, 0.10 ms. Throughput: 77747.40 iter/sec. FFTlen=28K, Type=3, Arch=4, Pass1=448, Pass2=64, clm=2 (8 cpus, 1 worker): 0.14 ms. Throughput: 7399.35 iter/sec. FFTlen=28K, Type=3, Arch=4, Pass1=448, Pass2=64, clm=2 (8 cpus, 8 workers): 0.10, 0.10, 0.10, 0.10, 0.10, 0.10, 0.10, 0.10 ms. Throughput: 81729.90 iter/sec. FFTlen=28K, Type=3, Arch=4, Pass1=448, Pass2=64, clm=1 (8 cpus, 1 worker): 0.12 ms. Throughput: 8058.07 iter/sec. FFTlen=28K, Type=3, Arch=4, Pass1=448, Pass2=64, clm=1 (8 cpus, 8 workers): 0.10, 0.10, 0.10, 0.10, 0.10, 0.10, 0.10, 0.10 ms. Throughput: 81477.23 iter/sec. FFTlen=30K, Type=3, Arch=4, Pass1=384, Pass2=80, clm=4 (8 cpus, 1 worker): 0.10 ms. Throughput: 10190.44 iter/sec. FFTlen=30K, Type=3, Arch=4, Pass1=384, Pass2=80, clm=4 (8 cpus, 8 workers): 0.11, 0.11, 0.10, 0.10, 0.11, 0.11, 0.10, 0.10 ms. Throughput: 76156.31 iter/sec. FFTlen=30K, Type=3, Arch=4, Pass1=384, Pass2=80, clm=2 (8 cpus, 1 worker): 0.13 ms. Throughput: 7690.50 iter/sec. FFTlen=30K, Type=3, Arch=4, Pass1=384, Pass2=80, clm=2 (8 cpus, 8 workers): 0.10, 0.10, 0.10, 0.10, 0.10, 0.10, 0.10, 0.10 ms. Throughput: 79102.07 iter/sec. FFTlen=30K, Type=3, Arch=4, Pass1=384, Pass2=80, clm=1 (8 cpus, 1 worker): 0.12 ms. Throughput: 8692.96 iter/sec. FFTlen=30K, Type=3, Arch=4, Pass1=384, Pass2=80, clm=1 (8 cpus, 8 workers): 0.10, 0.10, 0.10, 0.10, 0.10, 0.10, 0.10, 0.10 ms. Throughput: 78097.03 iter/sec. FFTlen=30K, Type=3, Arch=4, Pass1=640, Pass2=48, clm=4 (8 cpus, 1 worker): 0.12 ms. Throughput: 8445.70 iter/sec. [Sat Apr 29 09:26:29 2017] FFTlen=30K, Type=3, Arch=4, Pass1=640, Pass2=48, clm=4 (8 cpus, 8 workers): 0.12, 0.12, 0.12, 0.12, 0.12, 0.12, 0.12, 0.12 ms. Throughput: 66358.20 iter/sec. FFTlen=30K, Type=3, Arch=4, Pass1=640, Pass2=48, clm=2 (8 cpus, 1 worker): 0.16 ms. Throughput: 6368.89 iter/sec. FFTlen=30K, Type=3, Arch=4, Pass1=640, Pass2=48, clm=2 (8 cpus, 8 workers): 0.10, 0.10, 0.10, 0.10, 0.10, 0.10, 0.10, 0.10 ms. Throughput: 77575.97 iter/sec. FFTlen=30K, Type=3, Arch=4, Pass1=640, Pass2=48, clm=1 (8 cpus, 1 worker): 0.15 ms. Throughput: 6759.90 iter/sec. FFTlen=30K, Type=3, Arch=4, Pass1=640, Pass2=48, clm=1 (8 cpus, 8 workers): 0.10, 0.10, 0.10, 0.10, 0.10, 0.10, 0.10, 0.10 ms. Throughput: 78654.10 iter/sec. FFTlen=32K, Type=1, Arch=4, Pass1=32768, Pass2=0, clm=0 (8 cpus, 1 worker): 0.17 ms. Throughput: 5794.10 iter/sec. FFTlen=32K, Type=1, Arch=4, Pass1=32768, Pass2=0, clm=0 (8 cpus, 8 workers): 0.18, 0.18, 0.18, 0.18, 0.18, 0.18, 0.19, 0.19 ms. Throughput: 43418.13 iter/sec. FFTlen=32K, Type=3, Arch=4, Pass1=128, Pass2=256, clm=4 (8 cpus, 1 worker): 0.05 ms. Throughput: 19209.54 iter/sec. FFTlen=32K, Type=3, Arch=4, Pass1=128, Pass2=256, clm=4 (8 cpus, 8 workers): 0.11, 0.11, 0.11, 0.11, 0.12, 0.11, 0.11, 0.11 ms. Throughput: 71164.42 iter/sec. FFTlen=32K, Type=3, Arch=4, Pass1=128, Pass2=256, clm=2 (8 cpus, 1 worker): 0.06 ms. Throughput: 18101.19 iter/sec. FFTlen=32K, Type=3, Arch=4, Pass1=128, Pass2=256, clm=2 (8 cpus, 8 workers): 0.11, 0.11, 0.11, 0.10, 0.11, 0.11, 0.11, 0.11 ms. Throughput: 74434.26 iter/sec. FFTlen=32K, Type=3, Arch=4, Pass1=128, Pass2=256, clm=1 (8 cpus, 1 worker): 0.07 ms. Throughput: 15111.34 iter/sec. FFTlen=32K, Type=3, Arch=4, Pass1=128, Pass2=256, clm=1 (8 cpus, 8 workers): 0.11, 0.11, 0.11, 0.11, 0.11, 0.11, 0.11, 0.11 ms. Throughput: 72508.07 iter/sec. FFTlen=32K, Type=3, Arch=4, Pass1=512, Pass2=64, clm=4 (8 cpus, 1 worker): 0.11 ms. Throughput: 8746.07 iter/sec. FFTlen=32K, Type=3, Arch=4, Pass1=512, Pass2=64, clm=4 (8 cpus, 8 workers): 0.13, 0.12, 0.13, 0.12, 0.13, 0.13, 0.12, 0.12 ms. Throughput: 63692.59 iter/sec. FFTlen=32K, Type=3, Arch=4, Pass1=512, Pass2=64, clm=2 (8 cpus, 1 worker): 0.15 ms. Throughput: 6558.05 iter/sec. FFTlen=32K, Type=3, Arch=4, Pass1=512, Pass2=64, clm=2 (8 cpus, 8 workers): 0.11, 0.11, 0.11, 0.11, 0.11, 0.11, 0.11, 0.11 ms. Throughput: 72911.87 iter/sec. FFTlen=32K, Type=3, Arch=4, Pass1=512, Pass2=64, clm=1 (8 cpus, 1 worker): 0.16 ms. Throughput: 6432.46 iter/sec. FFTlen=32K, Type=3, Arch=4, Pass1=512, Pass2=64, clm=1 (8 cpus, 8 workers): 0.11, 0.11, 0.11, 0.11, 0.11, 0.11, 0.11, 0.11 ms. Throughput: 74119.34 iter/sec. FFTlen=35K, Type=3, Arch=4, Pass1=448, Pass2=80, clm=4 (8 cpus, 1 worker): 0.11 ms. Throughput: 8792.30 iter/sec. FFTlen=35K, Type=3, Arch=4, Pass1=448, Pass2=80, clm=4 (8 cpus, 8 workers): 0.13, 0.13, 0.13, 0.13, 0.13, 0.13, 0.13, 0.13 ms. Throughput: 62368.51 iter/sec. FFTlen=35K, Type=3, Arch=4, Pass1=448, Pass2=80, clm=2 (8 cpus, 1 worker): 0.14 ms. Throughput: 7394.57 iter/sec. FFTlen=35K, Type=3, Arch=4, Pass1=448, Pass2=80, clm=2 (8 cpus, 8 workers): 0.12, 0.12, 0.12, 0.12, 0.13, 0.12, 0.12, 0.12 ms. Throughput: 65119.75 iter/sec. FFTlen=35K, Type=3, Arch=4, Pass1=448, Pass2=80, clm=1 (8 cpus, 1 worker): 0.13 ms. Throughput: 7839.70 iter/sec. FFTlen=35K, Type=3, Arch=4, Pass1=448, Pass2=80, clm=1 (8 cpus, 8 workers): 0.13, 0.12, 0.13, 0.12, 0.13, 0.13, 0.12, 0.12 ms. Throughput: 63930.68 iter/sec. FFTlen=36K, Type=3, Arch=4, Pass1=768, Pass2=48, clm=4 (8 cpus, 1 worker): 0.14 ms. Throughput: 7145.30 iter/sec. FFTlen=36K, Type=3, Arch=4, Pass1=768, Pass2=48, clm=4 (8 cpus, 8 workers): 0.15, 0.15, 0.15, 0.15, 0.15, 0.15, 0.15, 0.15 ms. Throughput: 53066.08 iter/sec. FFTlen=36K, Type=3, Arch=4, Pass1=768, Pass2=48, clm=2 (8 cpus, 1 worker): 0.19 ms. Throughput: 5322.32 iter/sec. FFTlen=36K, Type=3, Arch=4, Pass1=768, Pass2=48, clm=2 (8 cpus, 8 workers): 0.13, 0.13, 0.13, 0.12, 0.13, 0.13, 0.13, 0.13 ms. Throughput: 63233.49 iter/sec. FFTlen=36K, Type=3, Arch=4, Pass1=768, Pass2=48, clm=1 (8 cpus, 1 worker): 0.17 ms. Throughput: 5761.07 iter/sec. FFTlen=36K, Type=3, Arch=4, Pass1=768, Pass2=48, clm=1 (8 cpus, 8 workers): 0.12, 0.12, 0.12, 0.12, 0.12, 0.12, 0.12, 0.12 ms. Throughput: 65443.07 iter/sec. FFTlen=40K, Type=3, Arch=4, Pass1=128, Pass2=320, clm=4 (8 cpus, 1 worker): 0.06 ms. Throughput: 18071.61 iter/sec. FFTlen=40K, Type=3, Arch=4, Pass1=128, Pass2=320, clm=4 (8 cpus, 8 workers): 0.15, 0.15, 0.14, 0.14, 0.14, 0.14, 0.14, 0.14 ms. Throughput: 55938.85 iter/sec. FFTlen=40K, Type=3, Arch=4, Pass1=128, Pass2=320, clm=2 (8 cpus, 1 worker): 0.06 ms. Throughput: 17182.30 iter/sec. FFTlen=40K, Type=3, Arch=4, Pass1=128, Pass2=320, clm=2 (8 cpus, 8 workers): 0.14, 0.14, 0.14, 0.14, 0.15, 0.14, 0.14, 0.14 ms. Throughput: 56480.71 iter/sec. FFTlen=40K, Type=3, Arch=4, Pass1=128, Pass2=320, clm=1 (8 cpus, 1 worker): 0.07 ms. Throughput: 14283.42 iter/sec. FFTlen=40K, Type=3, Arch=4, Pass1=128, Pass2=320, clm=1 (8 cpus, 8 workers): 0.15, 0.15, 0.14, 0.14, 0.15, 0.15, 0.14, 0.14 ms. Throughput: 55102.59 iter/sec. FFTlen=40K, Type=3, Arch=4, Pass1=512, Pass2=80, clm=4 (8 cpus, 1 worker): 0.13 ms. Throughput: 7674.08 iter/sec. [Sat Apr 29 09:31:34 2017] FFTlen=40K, Type=3, Arch=4, Pass1=512, Pass2=80, clm=4 (8 cpus, 8 workers): 0.16, 0.16, 0.16, 0.16, 0.16, 0.16, 0.16, 0.16 ms. Throughput: 50375.94 iter/sec. FFTlen=40K, Type=3, Arch=4, Pass1=512, Pass2=80, clm=2 (8 cpus, 1 worker): 0.15 ms. Throughput: 6612.55 iter/sec. FFTlen=40K, Type=3, Arch=4, Pass1=512, Pass2=80, clm=2 (8 cpus, 8 workers): 0.14, 0.14, 0.14, 0.14, 0.14, 0.14, 0.14, 0.14 ms. Throughput: 57873.18 iter/sec. FFTlen=40K, Type=3, Arch=4, Pass1=512, Pass2=80, clm=1 (8 cpus, 1 worker): 0.14 ms. Throughput: 7098.35 iter/sec. FFTlen=40K, Type=3, Arch=4, Pass1=512, Pass2=80, clm=1 (8 cpus, 8 workers): 0.14, 0.14, 0.14, 0.13, 0.14, 0.14, 0.13, 0.13 ms. Throughput: 58983.01 iter/sec. FFTlen=40K, Type=3, Arch=4, Pass1=640, Pass2=64, clm=4 (8 cpus, 1 worker): 0.14 ms. Throughput: 7174.16 iter/sec. FFTlen=40K, Type=3, Arch=4, Pass1=640, Pass2=64, clm=4 (8 cpus, 8 workers): 0.16, 0.16, 0.16, 0.16, 0.17, 0.16, 0.16, 0.16 ms. Throughput: 49339.59 iter/sec. FFTlen=40K, Type=3, Arch=4, Pass1=640, Pass2=64, clm=2 (8 cpus, 1 worker): 0.18 ms. Throughput: 5514.25 iter/sec. FFTlen=40K, Type=3, Arch=4, Pass1=640, Pass2=64, clm=2 (8 cpus, 8 workers): 0.14, 0.14, 0.14, 0.14, 0.15, 0.15, 0.14, 0.14 ms. Throughput: 55810.60 iter/sec. FFTlen=40K, Type=3, Arch=4, Pass1=640, Pass2=64, clm=1 (8 cpus, 1 worker): 0.17 ms. Throughput: 5833.54 iter/sec. FFTlen=40K, Type=3, Arch=4, Pass1=640, Pass2=64, clm=1 (8 cpus, 8 workers): 0.14, 0.14, 0.14, 0.14, 0.14, 0.14, 0.14, 0.14 ms. Throughput: 57272.28 iter/sec. FFTlen=48K, Type=3, Arch=4, Pass1=256, Pass2=192, clm=4 (8 cpus, 1 worker): 0.10 ms. Throughput: 9902.35 iter/sec. FFTlen=48K, Type=3, Arch=4, Pass1=256, Pass2=192, clm=4 (8 cpus, 8 workers): 0.18, 0.17, 0.17, 0.17, 0.18, 0.17, 0.17, 0.17 ms. Throughput: 45960.50 iter/sec. FFTlen=48K, Type=3, Arch=4, Pass1=256, Pass2=192, clm=2 (8 cpus, 1 worker): 0.08 ms. Throughput: 11766.39 iter/sec. FFTlen=48K, Type=3, Arch=4, Pass1=256, Pass2=192, clm=2 (8 cpus, 8 workers): 0.16, 0.16, 0.16, 0.16, 0.16, 0.16, 0.16, 0.16 ms. Throughput: 49102.31 iter/sec. FFTlen=48K, Type=3, Arch=4, Pass1=256, Pass2=192, clm=1 (8 cpus, 1 worker): 0.09 ms. Throughput: 11269.54 iter/sec. FFTlen=48K, Type=3, Arch=4, Pass1=256, Pass2=192, clm=1 (8 cpus, 8 workers): 0.17, 0.17, 0.17, 0.16, 0.17, 0.17, 0.16, 0.16 ms. Throughput: 48078.84 iter/sec. FFTlen=48K, Type=3, Arch=4, Pass1=768, Pass2=64, clm=4 (8 cpus, 1 worker): 0.17 ms. Throughput: 6049.23 iter/sec. FFTlen=48K, Type=3, Arch=4, Pass1=768, Pass2=64, clm=4 (8 cpus, 8 workers): 0.21, 0.21, 0.20, 0.20, 0.20, 0.20, 0.19, 0.20 ms. Throughput: 39759.05 iter/sec. FFTlen=48K, Type=3, Arch=4, Pass1=768, Pass2=64, clm=2 (8 cpus, 1 worker): 0.24 ms. Throughput: 4190.70 iter/sec. FFTlen=48K, Type=3, Arch=4, Pass1=768, Pass2=64, clm=2 (8 cpus, 8 workers): 0.18, 0.17, 0.17, 0.17, 0.18, 0.17, 0.17, 0.17 ms. Throughput: 46110.68 iter/sec. FFTlen=48K, Type=3, Arch=4, Pass1=768, Pass2=64, clm=1 (8 cpus, 1 worker): 0.20 ms. Throughput: 4932.23 iter/sec. FFTlen=48K, Type=3, Arch=4, Pass1=768, Pass2=64, clm=1 (8 cpus, 8 workers): 0.17, 0.17, 0.17, 0.17, 0.17, 0.17, 0.17, 0.17 ms. Throughput: 47507.81 iter/sec. FFTlen=50K, Type=3, Arch=4, Pass1=640, Pass2=80, clm=4 (8 cpus, 1 worker): 0.17 ms. Throughput: 5867.07 iter/sec. FFTlen=50K, Type=3, Arch=4, Pass1=640, Pass2=80, clm=4 (8 cpus, 8 workers): 0.21, 0.20, 0.20, 0.21, 0.21, 0.21, 0.20, 0.20 ms. Throughput: 39174.01 iter/sec. FFTlen=50K, Type=3, Arch=4, Pass1=640, Pass2=80, clm=2 (8 cpus, 1 worker): 0.19 ms. Throughput: 5397.14 iter/sec. FFTlen=50K, Type=3, Arch=4, Pass1=640, Pass2=80, clm=2 (8 cpus, 8 workers): 0.18, 0.18, 0.18, 0.17, 0.18, 0.18, 0.17, 0.17 ms. Throughput: 45395.97 iter/sec. FFTlen=50K, Type=3, Arch=4, Pass1=640, Pass2=80, clm=1 (8 cpus, 1 worker): 0.17 ms. Throughput: 5786.21 iter/sec. FFTlen=50K, Type=3, Arch=4, Pass1=640, Pass2=80, clm=1 (8 cpus, 8 workers): 0.17, 0.17, 0.17, 0.17, 0.18, 0.17, 0.17, 0.17 ms. Throughput: 46033.09 iter/sec. FFTlen=60K, Type=3, Arch=4, Pass1=320, Pass2=192, clm=4 (8 cpus, 1 worker): 0.11 ms. Throughput: 8750.83 iter/sec. FFTlen=60K, Type=3, Arch=4, Pass1=320, Pass2=192, clm=4 (8 cpus, 8 workers): 0.22, 0.22, 0.22, 0.21, 0.22, 0.22, 0.21, 0.21 ms. Throughput: 37057.13 iter/sec. FFTlen=60K, Type=3, Arch=4, Pass1=320, Pass2=192, clm=2 (8 cpus, 1 worker): 0.10 ms. Throughput: 9866.91 iter/sec. FFTlen=60K, Type=3, Arch=4, Pass1=320, Pass2=192, clm=2 (8 cpus, 8 workers): 0.21, 0.21, 0.21, 0.21, 0.21, 0.21, 0.21, 0.21 ms. Throughput: 37909.19 iter/sec. FFTlen=60K, Type=3, Arch=4, Pass1=320, Pass2=192, clm=1 (8 cpus, 1 worker): 0.10 ms. Throughput: 9607.16 iter/sec. FFTlen=60K, Type=3, Arch=4, Pass1=320, Pass2=192, clm=1 (8 cpus, 8 workers): 0.22, 0.22, 0.21, 0.21, 0.22, 0.21, 0.21, 0.21 ms. Throughput: 37297.84 iter/sec. FFTlen=60K, Type=3, Arch=4, Pass1=768, Pass2=80, clm=4 (8 cpus, 1 worker): 0.20 ms. Throughput: 5098.43 iter/sec. FFTlen=60K, Type=3, Arch=4, Pass1=768, Pass2=80, clm=4 (8 cpus, 8 workers): 0.25, 0.25, 0.25, 0.25, 0.25, 0.25, 0.25, 0.25 ms. Throughput: 31869.94 iter/sec. FFTlen=60K, Type=3, Arch=4, Pass1=768, Pass2=80, clm=2 (8 cpus, 1 worker): 0.22 ms. Throughput: 4520.73 iter/sec. [Sat Apr 29 09:36:39 2017] FFTlen=60K, Type=3, Arch=4, Pass1=768, Pass2=80, clm=2 (8 cpus, 8 workers): 0.22, 0.21, 0.21, 0.21, 0.22, 0.22, 0.21, 0.21 ms. Throughput: 37454.80 iter/sec. FFTlen=60K, Type=3, Arch=4, Pass1=768, Pass2=80, clm=1 (8 cpus, 1 worker): 0.21 ms. Throughput: 4854.32 iter/sec. FFTlen=60K, Type=3, Arch=4, Pass1=768, Pass2=80, clm=1 (8 cpus, 8 workers): 0.21, 0.21, 0.21, 0.21, 0.21, 0.21, 0.21, 0.21 ms. Throughput: 38065.58 iter/sec. FFTlen=64K, Type=3, Arch=4, Pass1=256, Pass2=256, clm=4 (8 cpus, 1 worker): 0.08 ms. Throughput: 12037.08 iter/sec. FFTlen=64K, Type=3, Arch=4, Pass1=256, Pass2=256, clm=4 (8 cpus, 8 workers): 0.23, 0.23, 0.23, 0.23, 0.23, 0.23, 0.23, 0.23 ms. Throughput: 34942.60 iter/sec. FFTlen=64K, Type=3, Arch=4, Pass1=256, Pass2=256, clm=2 (8 cpus, 1 worker): 0.08 ms. Throughput: 11965.29 iter/sec. FFTlen=64K, Type=3, Arch=4, Pass1=256, Pass2=256, clm=2 (8 cpus, 8 workers): 0.23, 0.22, 0.22, 0.22, 0.23, 0.23, 0.22, 0.22 ms. Throughput: 35909.21 iter/sec. FFTlen=64K, Type=3, Arch=4, Pass1=256, Pass2=256, clm=1 (8 cpus, 1 worker): 0.08 ms. Throughput: 12031.77 iter/sec. FFTlen=64K, Type=3, Arch=4, Pass1=256, Pass2=256, clm=1 (8 cpus, 8 workers): 0.22, 0.22, 0.22, 0.22, 0.23, 0.22, 0.22, 0.22 ms. Throughput: 36268.78 iter/sec. FFTlen=72K, Type=3, Arch=4, Pass1=384, Pass2=192, clm=4 (8 cpus, 1 worker): 0.13 ms. Throughput: 7479.04 iter/sec. FFTlen=72K, Type=3, Arch=4, Pass1=384, Pass2=192, clm=4 (8 cpus, 8 workers): 0.27, 0.26, 0.26, 0.26, 0.26, 0.26, 0.26, 0.26 ms. Throughput: 30501.95 iter/sec. FFTlen=72K, Type=3, Arch=4, Pass1=384, Pass2=192, clm=2 (8 cpus, 1 worker): 0.13 ms. Throughput: 7981.82 iter/sec. FFTlen=72K, Type=3, Arch=4, Pass1=384, Pass2=192, clm=2 (8 cpus, 8 workers): 0.26, 0.26, 0.25, 0.25, 0.26, 0.26, 0.25, 0.25 ms. Throughput: 31325.20 iter/sec. FFTlen=72K, Type=3, Arch=4, Pass1=384, Pass2=192, clm=1 (8 cpus, 1 worker): 0.12 ms. Throughput: 8325.18 iter/sec. FFTlen=72K, Type=3, Arch=4, Pass1=384, Pass2=192, clm=1 (8 cpus, 8 workers): 0.26, 0.26, 0.26, 0.26, 0.27, 0.26, 0.26, 0.26 ms. Throughput: 30721.12 iter/sec. FFTlen=80K, Type=3, Arch=4, Pass1=256, Pass2=320, clm=4 (8 cpus, 1 worker): 0.09 ms. Throughput: 10553.58 iter/sec. FFTlen=80K, Type=3, Arch=4, Pass1=256, Pass2=320, clm=4 (8 cpus, 8 workers): 0.30, 0.30, 0.30, 0.29, 0.31, 0.30, 0.30, 0.30 ms. Throughput: 26777.70 iter/sec. FFTlen=80K, Type=3, Arch=4, Pass1=256, Pass2=320, clm=2 (8 cpus, 1 worker): 0.09 ms. Throughput: 11492.23 iter/sec. FFTlen=80K, Type=3, Arch=4, Pass1=256, Pass2=320, clm=2 (8 cpus, 8 workers): 0.28, 0.29, 0.29, 0.28, 0.29, 0.28, 0.28, 0.28 ms. Throughput: 28073.99 iter/sec. FFTlen=80K, Type=3, Arch=4, Pass1=256, Pass2=320, clm=1 (8 cpus, 1 worker): 0.09 ms. Throughput: 10922.52 iter/sec. FFTlen=80K, Type=3, Arch=4, Pass1=256, Pass2=320, clm=1 (8 cpus, 8 workers): 0.29, 0.29, 0.29, 0.29, 0.30, 0.29, 0.29, 0.29 ms. Throughput: 27612.95 iter/sec. FFTlen=80K, Type=3, Arch=4, Pass1=320, Pass2=256, clm=4 (8 cpus, 1 worker): 0.10 ms. Throughput: 10270.62 iter/sec. FFTlen=80K, Type=3, Arch=4, Pass1=320, Pass2=256, clm=4 (8 cpus, 8 workers): 0.29, 0.29, 0.29, 0.28, 0.29, 0.29, 0.28, 0.28 ms. Throughput: 27906.68 iter/sec. FFTlen=80K, Type=3, Arch=4, Pass1=320, Pass2=256, clm=2 (8 cpus, 1 worker): 0.10 ms. Throughput: 10480.70 iter/sec. FFTlen=80K, Type=3, Arch=4, Pass1=320, Pass2=256, clm=2 (8 cpus, 8 workers): 0.28, 0.28, 0.28, 0.28, 0.28, 0.28, 0.28, 0.28 ms. Throughput: 28521.41 iter/sec. FFTlen=80K, Type=3, Arch=4, Pass1=320, Pass2=256, clm=1 (8 cpus, 1 worker): 0.10 ms. Throughput: 10324.58 iter/sec. FFTlen=80K, Type=3, Arch=4, Pass1=320, Pass2=256, clm=1 (8 cpus, 8 workers): 0.29, 0.29, 0.28, 0.28, 0.29, 0.29, 0.28, 0.28 ms. Throughput: 28116.47 iter/sec. FFTlen=84K, Type=3, Arch=4, Pass1=448, Pass2=192, clm=4 (8 cpus, 1 worker): 0.16 ms. Throughput: 6450.80 iter/sec. FFTlen=84K, Type=3, Arch=4, Pass1=448, Pass2=192, clm=4 (8 cpus, 8 workers): 0.32, 0.32, 0.32, 0.31, 0.32, 0.32, 0.31, 0.31 ms. Throughput: 25203.49 iter/sec. FFTlen=84K, Type=3, Arch=4, Pass1=448, Pass2=192, clm=2 (8 cpus, 1 worker): 0.14 ms. Throughput: 7200.45 iter/sec. FFTlen=84K, Type=3, Arch=4, Pass1=448, Pass2=192, clm=2 (8 cpus, 8 workers): 0.31, 0.31, 0.31, 0.31, 0.32, 0.32, 0.31, 0.31 ms. Throughput: 25661.52 iter/sec. FFTlen=84K, Type=3, Arch=4, Pass1=448, Pass2=192, clm=1 (8 cpus, 1 worker): 0.14 ms. Throughput: 7236.81 iter/sec. FFTlen=84K, Type=3, Arch=4, Pass1=448, Pass2=192, clm=1 (8 cpus, 8 workers): 0.32, 0.32, 0.32, 0.32, 0.32, 0.32, 0.32, 0.31 ms. Throughput: 25151.70 iter/sec. FFTlen=96K, Type=3, Arch=4, Pass1=128, Pass2=768, clm=4 (8 cpus, 1 worker): 0.09 ms. Throughput: 10594.86 iter/sec. FFTlen=96K, Type=3, Arch=4, Pass1=128, Pass2=768, clm=4 (8 cpus, 8 workers): 0.37, 0.36, 0.36, 0.36, 0.37, 0.36, 0.35, 0.36 ms. Throughput: 22231.19 iter/sec. FFTlen=96K, Type=3, Arch=4, Pass1=128, Pass2=768, clm=2 (8 cpus, 1 worker): 0.10 ms. Throughput: 10484.26 iter/sec. FFTlen=96K, Type=3, Arch=4, Pass1=128, Pass2=768, clm=2 (8 cpus, 8 workers): 0.35, 0.35, 0.35, 0.35, 0.36, 0.35, 0.35, 0.34 ms. Throughput: 22915.92 iter/sec. FFTlen=96K, Type=3, Arch=4, Pass1=128, Pass2=768, clm=1 (8 cpus, 1 worker): 0.12 ms. Throughput: 8688.00 iter/sec. [Sat Apr 29 09:41:45 2017] FFTlen=96K, Type=3, Arch=4, Pass1=128, Pass2=768, clm=1 (8 cpus, 8 workers): 0.36, 0.35, 0.35, 0.35, 0.36, 0.35, 0.35, 0.35 ms. Throughput: 22534.29 iter/sec. FFTlen=96K, Type=3, Arch=4, Pass1=384, Pass2=256, clm=4 (8 cpus, 1 worker): 0.12 ms. Throughput: 8613.73 iter/sec. FFTlen=96K, Type=3, Arch=4, Pass1=384, Pass2=256, clm=4 (8 cpus, 8 workers): 0.35, 0.35, 0.35, 0.35, 0.35, 0.35, 0.34, 0.34 ms. Throughput: 23105.37 iter/sec. FFTlen=96K, Type=3, Arch=4, Pass1=384, Pass2=256, clm=2 (8 cpus, 1 worker): 0.13 ms. Throughput: 7562.17 iter/sec. FFTlen=96K, Type=3, Arch=4, Pass1=384, Pass2=256, clm=2 (8 cpus, 8 workers): 0.34, 0.34, 0.34, 0.34, 0.34, 0.34, 0.34, 0.34 ms. Throughput: 23574.02 iter/sec. FFTlen=96K, Type=3, Arch=4, Pass1=384, Pass2=256, clm=1 (8 cpus, 1 worker): 0.12 ms. Throughput: 8635.44 iter/sec. FFTlen=96K, Type=3, Arch=4, Pass1=384, Pass2=256, clm=1 (8 cpus, 8 workers): 0.35, 0.34, 0.34, 0.34, 0.35, 0.35, 0.34, 0.34 ms. Throughput: 23346.16 iter/sec. FFTlen=96K, Type=3, Arch=4, Pass1=512, Pass2=192, clm=4 (8 cpus, 1 worker): 0.18 ms. Throughput: 5597.68 iter/sec. FFTlen=96K, Type=3, Arch=4, Pass1=512, Pass2=192, clm=4 (8 cpus, 8 workers): 0.39, 0.38, 0.38, 0.38, 0.39, 0.38, 0.38, 0.38 ms. Throughput: 20976.95 iter/sec. FFTlen=96K, Type=3, Arch=4, Pass1=512, Pass2=192, clm=2 (8 cpus, 1 worker): 0.15 ms. Throughput: 6477.31 iter/sec. FFTlen=96K, Type=3, Arch=4, Pass1=512, Pass2=192, clm=2 (8 cpus, 8 workers): 0.35, 0.35, 0.35, 0.34, 0.36, 0.35, 0.35, 0.35 ms. Throughput: 22923.62 iter/sec. FFTlen=96K, Type=3, Arch=4, Pass1=512, Pass2=192, clm=1 (8 cpus, 1 worker): 0.17 ms. Throughput: 5969.45 iter/sec. FFTlen=96K, Type=3, Arch=4, Pass1=512, Pass2=192, clm=1 (8 cpus, 8 workers): 0.35, 0.35, 0.35, 0.34, 0.36, 0.35, 0.35, 0.34 ms. Throughput: 22993.94 iter/sec. FFTlen=100K, Type=3, Arch=4, Pass1=320, Pass2=320, clm=4 (8 cpus, 1 worker): 0.11 ms. Throughput: 8787.22 iter/sec. FFTlen=100K, Type=3, Arch=4, Pass1=320, Pass2=320, clm=4 (8 cpus, 8 workers): 0.38, 0.37, 0.37, 0.37, 0.38, 0.37, 0.37, 0.37 ms. Throughput: 21546.54 iter/sec. FFTlen=100K, Type=3, Arch=4, Pass1=320, Pass2=320, clm=2 (8 cpus, 1 worker): 0.10 ms. Throughput: 9648.39 iter/sec. FFTlen=100K, Type=3, Arch=4, Pass1=320, Pass2=320, clm=2 (8 cpus, 8 workers): 0.37, 0.36, 0.36, 0.36, 0.37, 0.37, 0.36, 0.36 ms. Throughput: 21833.20 iter/sec. FFTlen=100K, Type=3, Arch=4, Pass1=320, Pass2=320, clm=1 (8 cpus, 1 worker): 0.12 ms. Throughput: 8691.36 iter/sec. FFTlen=100K, Type=3, Arch=4, Pass1=320, Pass2=320, clm=1 (8 cpus, 8 workers): 0.37, 0.37, 0.37, 0.37, 0.38, 0.37, 0.37, 0.37 ms. Throughput: 21500.94 iter/sec. FFTlen=112K, Type=3, Arch=4, Pass1=448, Pass2=256, clm=4 (8 cpus, 1 worker): 0.13 ms. Throughput: 7576.17 iter/sec. FFTlen=112K, Type=3, Arch=4, Pass1=448, Pass2=256, clm=4 (8 cpus, 8 workers): 0.42, 0.42, 0.41, 0.41, 0.42, 0.42, 0.42, 0.41 ms. Throughput: 19205.70 iter/sec. FFTlen=112K, Type=3, Arch=4, Pass1=448, Pass2=256, clm=2 (8 cpus, 1 worker): 0.13 ms. Throughput: 7827.40 iter/sec. FFTlen=112K, Type=3, Arch=4, Pass1=448, Pass2=256, clm=2 (8 cpus, 8 workers): 0.41, 0.41, 0.41, 0.41, 0.42, 0.41, 0.41, 0.41 ms. Throughput: 19502.12 iter/sec. FFTlen=112K, Type=3, Arch=4, Pass1=448, Pass2=256, clm=1 (8 cpus, 1 worker): 0.13 ms. Throughput: 7967.81 iter/sec. FFTlen=112K, Type=3, Arch=4, Pass1=448, Pass2=256, clm=1 (8 cpus, 8 workers): 0.42, 0.42, 0.42, 0.42, 0.43, 0.42, 0.41, 0.41 ms. Throughput: 19047.20 iter/sec. FFTlen=120K, Type=3, Arch=4, Pass1=384, Pass2=320, clm=4 (8 cpus, 1 worker): 0.14 ms. Throughput: 7137.44 iter/sec. FFTlen=120K, Type=3, Arch=4, Pass1=384, Pass2=320, clm=4 (8 cpus, 8 workers): 0.45, 0.45, 0.45, 0.44, 0.46, 0.45, 0.45, 0.45 ms. Throughput: 17722.91 iter/sec. FFTlen=120K, Type=3, Arch=4, Pass1=384, Pass2=320, clm=2 (8 cpus, 1 worker): 0.12 ms. Throughput: 8223.84 iter/sec. FFTlen=120K, Type=3, Arch=4, Pass1=384, Pass2=320, clm=2 (8 cpus, 8 workers): 0.45, 0.44, 0.44, 0.44, 0.45, 0.45, 0.44, 0.44 ms. Throughput: 17987.28 iter/sec. FFTlen=120K, Type=3, Arch=4, Pass1=384, Pass2=320, clm=1 (8 cpus, 1 worker): 0.12 ms. Throughput: 8008.84 iter/sec. FFTlen=120K, Type=3, Arch=4, Pass1=384, Pass2=320, clm=1 (8 cpus, 8 workers): 0.46, 0.45, 0.45, 0.45, 0.46, 0.46, 0.45, 0.45 ms. Throughput: 17696.51 iter/sec. FFTlen=120K, Type=3, Arch=4, Pass1=640, Pass2=192, clm=4 (8 cpus, 1 worker): 0.22 ms. Throughput: 4598.19 iter/sec. FFTlen=120K, Type=3, Arch=4, Pass1=640, Pass2=192, clm=4 (8 cpus, 8 workers): 0.49, 0.49, 0.48, 0.48, 0.49, 0.49, 0.48, 0.48 ms. Throughput: 16469.78 iter/sec. FFTlen=120K, Type=3, Arch=4, Pass1=640, Pass2=192, clm=2 (8 cpus, 1 worker): 0.19 ms. Throughput: 5227.65 iter/sec. FFTlen=120K, Type=3, Arch=4, Pass1=640, Pass2=192, clm=2 (8 cpus, 8 workers): 0.45, 0.44, 0.44, 0.44, 0.46, 0.45, 0.44, 0.44 ms. Throughput: 17910.68 iter/sec. FFTlen=120K, Type=3, Arch=4, Pass1=640, Pass2=192, clm=1 (8 cpus, 1 worker): 0.19 ms. Throughput: 5284.78 iter/sec. FFTlen=120K, Type=3, Arch=4, Pass1=640, Pass2=192, clm=1 (8 cpus, 8 workers): 0.45, 0.45, 0.45, 0.44, 0.46, 0.45, 0.44, 0.45 ms. Throughput: 17822.57 iter/sec. FFTlen=128K, Type=3, Arch=4, Pass1=128, Pass2=1024, clm=4 (8 cpus, 1 worker): 0.11 ms. Throughput: 8761.40 iter/sec. [Sat Apr 29 09:46:51 2017] FFTlen=128K, Type=3, Arch=4, Pass1=128, Pass2=1024, clm=4 (8 cpus, 8 workers): 0.49, 0.49, 0.48, 0.48, 0.50, 0.49, 0.48, 0.48 ms. Throughput: 16473.02 iter/sec. FFTlen=128K, Type=3, Arch=4, Pass1=128, Pass2=1024, clm=2 (8 cpus, 1 worker): 0.12 ms. Throughput: 8554.29 iter/sec. FFTlen=128K, Type=3, Arch=4, Pass1=128, Pass2=1024, clm=2 (8 cpus, 8 workers): 0.47, 0.47, 0.47, 0.47, 0.48, 0.48, 0.46, 0.47 ms. Throughput: 17000.49 iter/sec. FFTlen=128K, Type=3, Arch=4, Pass1=128, Pass2=1024, clm=1 (8 cpus, 1 worker): 0.14 ms. Throughput: 7313.34 iter/sec. FFTlen=128K, Type=3, Arch=4, Pass1=128, Pass2=1024, clm=1 (8 cpus, 8 workers): 0.49, 0.48, 0.49, 0.47, 0.50, 0.49, 0.48, 0.48 ms. Throughput: 16565.83 iter/sec. FFTlen=128K, Type=3, Arch=4, Pass1=512, Pass2=256, clm=4 (8 cpus, 1 worker): 0.16 ms. Throughput: 6148.59 iter/sec. FFTlen=128K, Type=3, Arch=4, Pass1=512, Pass2=256, clm=4 (8 cpus, 8 workers): 0.51, 0.51, 0.50, 0.50, 0.51, 0.51, 0.50, 0.51 ms. Throughput: 15772.98 iter/sec. FFTlen=128K, Type=3, Arch=4, Pass1=512, Pass2=256, clm=2 (8 cpus, 1 worker): 0.16 ms. Throughput: 6346.73 iter/sec. FFTlen=128K, Type=3, Arch=4, Pass1=512, Pass2=256, clm=2 (8 cpus, 8 workers): 0.46, 0.46, 0.46, 0.45, 0.46, 0.46, 0.45, 0.45 ms. Throughput: 17467.44 iter/sec. FFTlen=128K, Type=3, Arch=4, Pass1=512, Pass2=256, clm=1 (8 cpus, 1 worker): 0.15 ms. Throughput: 6637.54 iter/sec. FFTlen=128K, Type=3, Arch=4, Pass1=512, Pass2=256, clm=1 (8 cpus, 8 workers): 0.47, 0.46, 0.46, 0.46, 0.47, 0.46, 0.46, 0.46 ms. Throughput: 17367.12 iter/sec. FFTlen=140K, Type=3, Arch=4, Pass1=448, Pass2=320, clm=4 (8 cpus, 1 worker): 0.15 ms. Throughput: 6528.65 iter/sec. FFTlen=140K, Type=3, Arch=4, Pass1=448, Pass2=320, clm=4 (8 cpus, 8 workers): 0.55, 0.55, 0.54, 0.54, 0.56, 0.55, 0.54, 0.54 ms. Throughput: 14643.62 iter/sec. FFTlen=140K, Type=3, Arch=4, Pass1=448, Pass2=320, clm=2 (8 cpus, 1 worker): 0.14 ms. Throughput: 7080.44 iter/sec. FFTlen=140K, Type=3, Arch=4, Pass1=448, Pass2=320, clm=2 (8 cpus, 8 workers): 0.54, 0.54, 0.54, 0.53, 0.55, 0.54, 0.53, 0.53 ms. Throughput: 14820.58 iter/sec. FFTlen=140K, Type=3, Arch=4, Pass1=448, Pass2=320, clm=1 (8 cpus, 1 worker): 0.15 ms. Throughput: 6806.99 iter/sec. FFTlen=140K, Type=3, Arch=4, Pass1=448, Pass2=320, clm=1 (8 cpus, 8 workers): 0.56, 0.56, 0.55, 0.55, 0.56, 0.55, 0.55, 0.55 ms. Throughput: 14494.08 iter/sec. FFTlen=144K, Type=3, Arch=4, Pass1=768, Pass2=192, clm=4 (8 cpus, 1 worker): 0.26 ms. Throughput: 3867.97 iter/sec. FFTlen=144K, Type=3, Arch=4, Pass1=768, Pass2=192, clm=4 (8 cpus, 8 workers): 0.60, 0.60, 0.59, 0.59, 0.61, 0.60, 0.59, 0.59 ms. Throughput: 13422.26 iter/sec. FFTlen=144K, Type=3, Arch=4, Pass1=768, Pass2=192, clm=2 (8 cpus, 1 worker): 0.24 ms. Throughput: 4110.00 iter/sec. FFTlen=144K, Type=3, Arch=4, Pass1=768, Pass2=192, clm=2 (8 cpus, 8 workers): 0.55, 0.54, 0.54, 0.54, 0.55, 0.55, 0.54, 0.54 ms. Throughput: 14741.37 iter/sec. FFTlen=144K, Type=3, Arch=4, Pass1=768, Pass2=192, clm=1 (8 cpus, 1 worker): 0.22 ms. Throughput: 4488.76 iter/sec. FFTlen=144K, Type=3, Arch=4, Pass1=768, Pass2=192, clm=1 (8 cpus, 8 workers): 0.54, 0.54, 0.54, 0.53, 0.55, 0.54, 0.53, 0.53 ms. Throughput: 14849.39 iter/sec. FFTlen=160K, Type=3, Arch=4, Pass1=128, Pass2=1280, clm=4 (8 cpus, 1 worker): 0.13 ms. Throughput: 7545.16 iter/sec. FFTlen=160K, Type=3, Arch=4, Pass1=128, Pass2=1280, clm=4 (8 cpus, 8 workers): 0.62, 0.61, 0.62, 0.61, 0.63, 0.63, 0.61, 0.60 ms. Throughput: 12980.46 iter/sec. FFTlen=160K, Type=3, Arch=4, Pass1=128, Pass2=1280, clm=2 (8 cpus, 1 worker): 0.14 ms. Throughput: 7330.95 iter/sec. FFTlen=160K, Type=3, Arch=4, Pass1=128, Pass2=1280, clm=2 (8 cpus, 8 workers): 0.61, 0.60, 0.59, 0.59, 0.62, 0.61, 0.59, 0.60 ms. Throughput: 13309.88 iter/sec. FFTlen=160K, Type=3, Arch=4, Pass1=128, Pass2=1280, clm=1 (8 cpus, 1 worker): 0.19 ms. Throughput: 5152.54 iter/sec. FFTlen=160K, Type=3, Arch=4, Pass1=128, Pass2=1280, clm=1 (8 cpus, 8 workers): 0.62, 0.61, 0.61, 0.60, 0.86, 0.67, 0.75, 0.76 ms. Throughput: 11872.79 iter/sec. FFTlen=160K, Type=3, Arch=4, Pass1=512, Pass2=320, clm=4 (8 cpus, 1 worker): 0.22 ms. Throughput: 4466.55 iter/sec. FFTlen=160K, Type=3, Arch=4, Pass1=512, Pass2=320, clm=4 (8 cpus, 8 workers): 0.66, 0.65, 0.65, 0.64, 0.67, 0.69, 0.66, 0.65 ms. Throughput: 12138.02 iter/sec. FFTlen=160K, Type=3, Arch=4, Pass1=512, Pass2=320, clm=2 (8 cpus, 1 worker): 0.16 ms. Throughput: 6307.37 iter/sec. FFTlen=160K, Type=3, Arch=4, Pass1=512, Pass2=320, clm=2 (8 cpus, 8 workers): 0.62, 0.61, 0.60, 0.60, 0.62, 0.61, 0.60, 0.60 ms. Throughput: 13194.32 iter/sec. FFTlen=160K, Type=3, Arch=4, Pass1=512, Pass2=320, clm=1 (8 cpus, 1 worker): 0.16 ms. Throughput: 6174.58 iter/sec. FFTlen=160K, Type=3, Arch=4, Pass1=512, Pass2=320, clm=1 (8 cpus, 8 workers): 0.61, 0.61, 0.60, 0.60, 0.61, 0.61, 0.59, 0.60 ms. Throughput: 13234.43 iter/sec. FFTlen=160K, Type=3, Arch=4, Pass1=640, Pass2=256, clm=4 (8 cpus, 1 worker): 0.19 ms. Throughput: 5230.11 iter/sec. FFTlen=160K, Type=3, Arch=4, Pass1=640, Pass2=256, clm=4 (8 cpus, 8 workers): 0.66, 0.65, 0.65, 0.64, 0.66, 0.65, 0.64, 0.64 ms. Throughput: 12330.17 iter/sec. FFTlen=160K, Type=3, Arch=4, Pass1=640, Pass2=256, clm=2 (8 cpus, 1 worker): 0.17 ms. Throughput: 5946.65 iter/sec. [Sat Apr 29 09:51:58 2017] FFTlen=160K, Type=3, Arch=4, Pass1=640, Pass2=256, clm=2 (8 cpus, 8 workers): 0.60, 0.59, 0.59, 0.58, 0.60, 0.59, 0.58, 0.58 ms. Throughput: 13621.46 iter/sec. FFTlen=160K, Type=3, Arch=4, Pass1=640, Pass2=256, clm=1 (8 cpus, 1 worker): 0.22 ms. Throughput: 4611.67 iter/sec. FFTlen=160K, Type=3, Arch=4, Pass1=640, Pass2=256, clm=1 (8 cpus, 8 workers): 0.59, 0.60, 0.59, 0.58, 0.60, 0.59, 0.58, 0.58 ms. Throughput: 13550.45 iter/sec. FFTlen=168K, Type=3, Arch=4, Pass1=896, Pass2=192, clm=4 (8 cpus, 1 worker): 0.30 ms. Throughput: 3342.41 iter/sec. FFTlen=168K, Type=3, Arch=4, Pass1=896, Pass2=192, clm=4 (8 cpus, 8 workers): 0.72, 0.72, 0.71, 0.71, 0.73, 0.72, 0.71, 0.71 ms. Throughput: 11166.78 iter/sec. FFTlen=168K, Type=3, Arch=4, Pass1=896, Pass2=192, clm=2 (8 cpus, 1 worker): 0.26 ms. Throughput: 3840.34 iter/sec. FFTlen=168K, Type=3, Arch=4, Pass1=896, Pass2=192, clm=2 (8 cpus, 8 workers): 0.66, 0.65, 0.65, 0.64, 0.66, 0.66, 0.64, 0.65 ms. Throughput: 12263.74 iter/sec. FFTlen=168K, Type=3, Arch=4, Pass1=896, Pass2=192, clm=1 (8 cpus, 1 worker): 0.26 ms. Throughput: 3855.49 iter/sec. FFTlen=168K, Type=3, Arch=4, Pass1=896, Pass2=192, clm=1 (8 cpus, 8 workers): 0.65, 0.65, 0.65, 0.65, 0.66, 0.65, 0.64, 0.64 ms. Throughput: 12315.33 iter/sec. FFTlen=192K, Type=3, Arch=4, Pass1=128, Pass2=1536, clm=4 (8 cpus, 1 worker): 0.15 ms. Throughput: 6540.36 iter/sec. FFTlen=192K, Type=3, Arch=4, Pass1=128, Pass2=1536, clm=4 (8 cpus, 8 workers): 0.74, 0.74, 0.73, 0.72, 0.75, 0.74, 0.73, 0.73 ms. Throughput: 10913.77 iter/sec. FFTlen=192K, Type=3, Arch=4, Pass1=128, Pass2=1536, clm=2 (8 cpus, 1 worker): 0.16 ms. Throughput: 6299.61 iter/sec. FFTlen=192K, Type=3, Arch=4, Pass1=128, Pass2=1536, clm=2 (8 cpus, 8 workers): 0.72, 0.72, 0.71, 0.70, 0.74, 0.72, 0.71, 0.70 ms. Throughput: 11188.03 iter/sec. FFTlen=192K, Type=3, Arch=4, Pass1=128, Pass2=1536, clm=1 (8 cpus, 1 worker): 0.19 ms. Throughput: 5379.62 iter/sec. FFTlen=192K, Type=3, Arch=4, Pass1=128, Pass2=1536, clm=1 (8 cpus, 8 workers): 0.73, 0.73, 0.72, 0.72, 0.74, 0.73, 0.73, 0.72 ms. Throughput: 10999.37 iter/sec. FFTlen=192K, Type=3, Arch=4, Pass1=256, Pass2=768, clm=4 (8 cpus, 1 worker): 0.17 ms. Throughput: 5730.65 iter/sec. FFTlen=192K, Type=3, Arch=4, Pass1=256, Pass2=768, clm=4 (8 cpus, 8 workers): 0.75, 0.74, 0.74, 0.73, 0.76, 0.75, 0.73, 0.73 ms. Throughput: 10800.08 iter/sec. FFTlen=192K, Type=3, Arch=4, Pass1=256, Pass2=768, clm=2 (8 cpus, 1 worker): 0.16 ms. Throughput: 6323.20 iter/sec. FFTlen=192K, Type=3, Arch=4, Pass1=256, Pass2=768, clm=2 (8 cpus, 8 workers): 0.73, 0.72, 0.72, 0.71, 0.74, 0.73, 0.71, 0.71 ms. Throughput: 11074.65 iter/sec. FFTlen=192K, Type=3, Arch=4, Pass1=256, Pass2=768, clm=1 (8 cpus, 1 worker): 0.17 ms. Throughput: 6059.11 iter/sec. FFTlen=192K, Type=3, Arch=4, Pass1=256, Pass2=768, clm=1 (8 cpus, 8 workers): 0.72, 0.72, 0.71, 0.71, 0.73, 0.72, 0.71, 0.71 ms. Throughput: 11185.04 iter/sec. FFTlen=192K, Type=3, Arch=4, Pass1=1024, Pass2=192, clm=4 (8 cpus, 1 worker): 0.34 ms. Throughput: 2900.51 iter/sec. FFTlen=192K, Type=3, Arch=4, Pass1=1024, Pass2=192, clm=4 (8 cpus, 8 workers): 0.83, 0.83, 0.82, 0.81, 0.84, 0.84, 0.83, 0.82 ms. Throughput: 9642.20 iter/sec. FFTlen=192K, Type=3, Arch=4, Pass1=1024, Pass2=192, clm=2 (8 cpus, 1 worker): 0.29 ms. Throughput: 3405.59 iter/sec. FFTlen=192K, Type=3, Arch=4, Pass1=1024, Pass2=192, clm=2 (8 cpus, 8 workers): 0.76, 0.76, 0.76, 0.75, 0.77, 0.76, 0.75, 0.75 ms. Throughput: 10559.70 iter/sec. FFTlen=192K, Type=3, Arch=4, Pass1=1024, Pass2=192, clm=1 (8 cpus, 1 worker): 0.29 ms. Throughput: 3421.21 iter/sec. FFTlen=192K, Type=3, Arch=4, Pass1=1024, Pass2=192, clm=1 (8 cpus, 8 workers): 0.75, 0.75, 0.75, 0.74, 0.76, 0.76, 0.74, 0.74 ms. Throughput: 10699.56 iter/sec. FFTlen=192K, Type=3, Arch=4, Pass1=768, Pass2=256, clm=4 (8 cpus, 1 worker): 0.22 ms. Throughput: 4478.86 iter/sec. FFTlen=192K, Type=3, Arch=4, Pass1=768, Pass2=256, clm=4 (8 cpus, 8 workers): 0.80, 0.79, 0.82, 0.80, 0.81, 0.80, 0.79, 0.78 ms. Throughput: 10018.47 iter/sec. FFTlen=192K, Type=3, Arch=4, Pass1=768, Pass2=256, clm=2 (8 cpus, 1 worker): 0.21 ms. Throughput: 4848.14 iter/sec. FFTlen=192K, Type=3, Arch=4, Pass1=768, Pass2=256, clm=2 (8 cpus, 8 workers): 0.72, 0.71, 0.71, 0.70, 0.72, 0.72, 0.70, 0.70 ms. Throughput: 11252.49 iter/sec. FFTlen=192K, Type=3, Arch=4, Pass1=768, Pass2=256, clm=1 (8 cpus, 1 worker): 0.22 ms. Throughput: 4622.42 iter/sec. FFTlen=192K, Type=3, Arch=4, Pass1=768, Pass2=256, clm=1 (8 cpus, 8 workers): 0.73, 0.72, 0.71, 0.70, 0.73, 0.72, 0.71, 0.71 ms. Throughput: 11190.17 iter/sec. FFTlen=200K, Type=3, Arch=4, Pass1=640, Pass2=320, clm=4 (8 cpus, 1 worker): 0.21 ms. Throughput: 4663.42 iter/sec. FFTlen=200K, Type=3, Arch=4, Pass1=640, Pass2=320, clm=4 (8 cpus, 8 workers): 0.84, 0.86, 0.83, 0.84, 0.84, 0.84, 0.83, 0.82 ms. Throughput: 9554.86 iter/sec. FFTlen=200K, Type=3, Arch=4, Pass1=640, Pass2=320, clm=2 (8 cpus, 1 worker): 0.19 ms. Throughput: 5249.28 iter/sec. FFTlen=200K, Type=3, Arch=4, Pass1=640, Pass2=320, clm=2 (8 cpus, 8 workers): 0.79, 0.78, 0.78, 0.77, 0.80, 0.79, 0.77, 0.77 ms. Throughput: 10246.04 iter/sec. FFTlen=200K, Type=3, Arch=4, Pass1=640, Pass2=320, clm=1 (8 cpus, 1 worker): 0.19 ms. Throughput: 5269.29 iter/sec. [Sat Apr 29 09:57:05 2017] FFTlen=200K, Type=3, Arch=4, Pass1=640, Pass2=320, clm=1 (8 cpus, 8 workers): 0.79, 0.78, 0.78, 0.77, 0.81, 0.79, 0.77, 0.77 ms. Throughput: 10224.43 iter/sec. FFTlen=224K, Type=3, Arch=4, Pass1=896, Pass2=256, clm=4 (8 cpus, 1 worker): 0.27 ms. Throughput: 3767.32 iter/sec. FFTlen=224K, Type=3, Arch=4, Pass1=896, Pass2=256, clm=4 (8 cpus, 8 workers): 0.98, 0.97, 0.96, 0.96, 0.99, 0.98, 0.95, 0.96 ms. Throughput: 8249.12 iter/sec. FFTlen=224K, Type=3, Arch=4, Pass1=896, Pass2=256, clm=2 (8 cpus, 1 worker): 0.24 ms. Throughput: 4101.31 iter/sec. FFTlen=224K, Type=3, Arch=4, Pass1=896, Pass2=256, clm=2 (8 cpus, 8 workers): 0.86, 0.86, 0.86, 0.85, 0.87, 0.87, 0.85, 0.85 ms. Throughput: 9309.13 iter/sec. FFTlen=224K, Type=3, Arch=4, Pass1=896, Pass2=256, clm=1 (8 cpus, 1 worker): 0.25 ms. Throughput: 4079.99 iter/sec. FFTlen=224K, Type=3, Arch=4, Pass1=896, Pass2=256, clm=1 (8 cpus, 8 workers): 0.87, 0.87, 0.87, 0.86, 0.88, 0.88, 0.85, 0.85 ms. Throughput: 9248.01 iter/sec. FFTlen=240K, Type=3, Arch=4, Pass1=320, Pass2=768, clm=4 (8 cpus, 1 worker): 0.19 ms. Throughput: 5317.58 iter/sec. FFTlen=240K, Type=3, Arch=4, Pass1=320, Pass2=768, clm=4 (8 cpus, 8 workers): 0.93, 0.93, 0.92, 0.91, 0.95, 0.93, 0.91, 0.91 ms. Throughput: 8656.41 iter/sec. FFTlen=240K, Type=3, Arch=4, Pass1=320, Pass2=768, clm=2 (8 cpus, 1 worker): 0.19 ms. Throughput: 5184.58 iter/sec. FFTlen=240K, Type=3, Arch=4, Pass1=320, Pass2=768, clm=2 (8 cpus, 8 workers): 0.92, 0.91, 0.91, 0.90, 0.93, 0.92, 0.91, 0.90 ms. Throughput: 8759.62 iter/sec. FFTlen=240K, Type=3, Arch=4, Pass1=320, Pass2=768, clm=1 (8 cpus, 1 worker): 0.19 ms. Throughput: 5264.49 iter/sec. FFTlen=240K, Type=3, Arch=4, Pass1=320, Pass2=768, clm=1 (8 cpus, 8 workers): 0.94, 0.93, 0.93, 0.91, 0.95, 0.94, 0.92, 0.92 ms. Throughput: 8611.60 iter/sec. FFTlen=240K, Type=3, Arch=4, Pass1=768, Pass2=320, clm=4 (8 cpus, 1 worker): 0.25 ms. Throughput: 3998.70 iter/sec. FFTlen=240K, Type=3, Arch=4, Pass1=768, Pass2=320, clm=4 (8 cpus, 8 workers): 1.03, 1.04, 1.02, 1.03, 1.05, 1.03, 1.01, 1.01 ms. Throughput: 7788.82 iter/sec. FFTlen=240K, Type=3, Arch=4, Pass1=768, Pass2=320, clm=2 (8 cpus, 1 worker): 0.22 ms. Throughput: 4506.52 iter/sec. FFTlen=240K, Type=3, Arch=4, Pass1=768, Pass2=320, clm=2 (8 cpus, 8 workers): 0.95, 0.95, 0.95, 0.94, 0.96, 0.96, 0.94, 0.94 ms. Throughput: 8435.09 iter/sec. FFTlen=240K, Type=3, Arch=4, Pass1=768, Pass2=320, clm=1 (8 cpus, 1 worker): 0.23 ms. Throughput: 4413.80 iter/sec. FFTlen=240K, Type=3, Arch=4, Pass1=768, Pass2=320, clm=1 (8 cpus, 8 workers): 0.96, 0.96, 0.94, 0.93, 0.97, 0.96, 0.93, 0.94 ms. Throughput: 8431.13 iter/sec. FFTlen=256K, Type=3, Arch=4, Pass1=128, Pass2=2048, clm=4 (8 cpus, 1 worker): 0.18 ms. Throughput: 5569.75 iter/sec. FFTlen=256K, Type=3, Arch=4, Pass1=128, Pass2=2048, clm=4 (8 cpus, 8 workers): 1.00, 1.00, 0.99, 0.99, 1.03, 1.01, 0.98, 0.98 ms. Throughput: 8010.38 iter/sec. FFTlen=256K, Type=3, Arch=4, Pass1=128, Pass2=2048, clm=2 (8 cpus, 1 worker): 0.19 ms. Throughput: 5340.77 iter/sec. FFTlen=256K, Type=3, Arch=4, Pass1=128, Pass2=2048, clm=2 (8 cpus, 8 workers): 0.97, 0.96, 0.95, 0.96, 0.98, 0.97, 0.95, 0.95 ms. Throughput: 8314.85 iter/sec. FFTlen=256K, Type=3, Arch=4, Pass1=128, Pass2=2048, clm=1 (8 cpus, 1 worker): 0.25 ms. Throughput: 4046.11 iter/sec. FFTlen=256K, Type=3, Arch=4, Pass1=128, Pass2=2048, clm=1 (8 cpus, 8 workers): 1.02, 0.99, 0.98, 0.97, 1.04, 1.00, 0.98, 0.97 ms. Throughput: 8058.27 iter/sec. FFTlen=256K, Type=3, Arch=4, Pass1=256, Pass2=1024, clm=4 (8 cpus, 1 worker): 0.19 ms. Throughput: 5401.65 iter/sec. FFTlen=256K, Type=3, Arch=4, Pass1=256, Pass2=1024, clm=4 (8 cpus, 8 workers): 1.02, 1.02, 1.01, 1.00, 1.04, 1.03, 1.00, 1.00 ms. Throughput: 7886.97 iter/sec. FFTlen=256K, Type=3, Arch=4, Pass1=256, Pass2=1024, clm=2 (8 cpus, 1 worker): 0.19 ms. Throughput: 5291.89 iter/sec. FFTlen=256K, Type=3, Arch=4, Pass1=256, Pass2=1024, clm=2 (8 cpus, 8 workers): 0.99, 0.98, 0.98, 0.97, 1.00, 0.99, 0.97, 0.97 ms. Throughput: 8145.77 iter/sec. FFTlen=256K, Type=3, Arch=4, Pass1=256, Pass2=1024, clm=1 (8 cpus, 1 worker): 0.19 ms. Throughput: 5147.00 iter/sec. FFTlen=256K, Type=3, Arch=4, Pass1=256, Pass2=1024, clm=1 (8 cpus, 8 workers): 0.98, 0.97, 0.97, 0.96, 1.00, 0.98, 0.96, 0.95 ms. Throughput: 8226.22 iter/sec. FFTlen=256K, Type=3, Arch=4, Pass1=1024, Pass2=256, clm=4 (8 cpus, 1 worker): 0.30 ms. Throughput: 3311.44 iter/sec. FFTlen=256K, Type=3, Arch=4, Pass1=1024, Pass2=256, clm=4 (8 cpus, 8 workers): 1.15, 1.13, 1.15, 1.13, 1.17, 1.16, 1.15, 1.14 ms. Throughput: 6969.61 iter/sec. FFTlen=256K, Type=3, Arch=4, Pass1=1024, Pass2=256, clm=2 (8 cpus, 1 worker): 0.28 ms. Throughput: 3548.81 iter/sec. FFTlen=256K, Type=3, Arch=4, Pass1=1024, Pass2=256, clm=2 (8 cpus, 8 workers): 1.01, 1.01, 0.99, 0.99, 1.02, 1.00, 0.99, 0.99 ms. Throughput: 7990.17 iter/sec. FFTlen=256K, Type=3, Arch=4, Pass1=1024, Pass2=256, clm=1 (8 cpus, 1 worker): 0.29 ms. Throughput: 3503.47 iter/sec. FFTlen=256K, Type=3, Arch=4, Pass1=1024, Pass2=256, clm=1 (8 cpus, 8 workers): 1.00, 0.99, 0.99, 0.98, 1.02, 1.00, 0.97, 0.97 ms. Throughput: 8077.57 iter/sec. FFTlen=280K, Type=3, Arch=4, Pass1=896, Pass2=320, clm=4 (8 cpus, 1 worker): 0.29 ms. Throughput: 3416.64 iter/sec. [Sat Apr 29 10:02:13 2017] FFTlen=280K, Type=3, Arch=4, Pass1=896, Pass2=320, clm=4 (8 cpus, 8 workers): 1.27, 1.25, 1.25, 1.25, 1.27, 1.26, 1.23, 1.24 ms. Throughput: 6385.88 iter/sec. FFTlen=280K, Type=3, Arch=4, Pass1=896, Pass2=320, clm=2 (8 cpus, 1 worker): 0.26 ms. Throughput: 3871.21 iter/sec. FFTlen=280K, Type=3, Arch=4, Pass1=896, Pass2=320, clm=2 (8 cpus, 8 workers): 1.16, 1.16, 1.15, 1.14, 1.17, 1.16, 1.15, 1.14 ms. Throughput: 6926.44 iter/sec. FFTlen=280K, Type=3, Arch=4, Pass1=896, Pass2=320, clm=1 (8 cpus, 1 worker): 0.26 ms. Throughput: 3810.07 iter/sec. FFTlen=280K, Type=3, Arch=4, Pass1=896, Pass2=320, clm=1 (8 cpus, 8 workers): 1.16, 1.16, 1.14, 1.13, 1.19, 1.16, 1.15, 1.13 ms. Throughput: 6940.03 iter/sec. FFTlen=288K, Type=3, Arch=4, Pass1=128, Pass2=2304, clm=4 (8 cpus, 1 worker): 0.21 ms. Throughput: 4832.73 iter/sec. FFTlen=288K, Type=3, Arch=4, Pass1=128, Pass2=2304, clm=4 (8 cpus, 8 workers): 1.25, 1.24, 1.22, 1.21, 1.25, 1.25, 1.21, 1.21 ms. Throughput: 6507.47 iter/sec. FFTlen=288K, Type=3, Arch=4, Pass1=128, Pass2=2304, clm=2 (8 cpus, 1 worker): 0.21 ms. Throughput: 4704.78 iter/sec. FFTlen=288K, Type=3, Arch=4, Pass1=128, Pass2=2304, clm=2 (8 cpus, 8 workers): 1.20, 1.19, 1.19, 1.17, 1.21, 1.20, 1.18, 1.17 ms. Throughput: 6738.14 iter/sec. FFTlen=288K, Type=3, Arch=4, Pass1=128, Pass2=2304, clm=1 (8 cpus, 1 worker): 0.29 ms. Throughput: 3419.87 iter/sec. FFTlen=288K, Type=3, Arch=4, Pass1=128, Pass2=2304, clm=1 (8 cpus, 8 workers): 1.21, 1.21, 1.20, 1.19, 1.23, 1.22, 1.18, 1.20 ms. Throughput: 6643.24 iter/sec. FFTlen=288K, Type=3, Arch=4, Pass1=384, Pass2=768, clm=4 (8 cpus, 1 worker): 0.22 ms. Throughput: 4592.61 iter/sec. FFTlen=288K, Type=3, Arch=4, Pass1=384, Pass2=768, clm=4 (8 cpus, 8 workers): 1.16, 1.16, 1.14, 1.12, 1.18, 1.18, 1.13, 1.11 ms. Throughput: 6982.18 iter/sec. FFTlen=288K, Type=3, Arch=4, Pass1=384, Pass2=768, clm=2 (8 cpus, 1 worker): 0.22 ms. Throughput: 4627.96 iter/sec. FFTlen=288K, Type=3, Arch=4, Pass1=384, Pass2=768, clm=2 (8 cpus, 8 workers): 1.17, 1.21, 1.10, 1.10, 1.13, 1.12, 1.10, 1.09 ms. Throughput: 7101.65 iter/sec. FFTlen=288K, Type=3, Arch=4, Pass1=384, Pass2=768, clm=1 (8 cpus, 1 worker): 0.22 ms. Throughput: 4599.18 iter/sec. FFTlen=288K, Type=3, Arch=4, Pass1=384, Pass2=768, clm=1 (8 cpus, 8 workers): 1.13, 1.13, 1.12, 1.11, 1.15, 1.14, 1.11, 1.11 ms. Throughput: 7104.98 iter/sec. FFTlen=320K, Type=3, Arch=4, Pass1=128, Pass2=2560, clm=4 (8 cpus, 1 worker): 0.22 ms. Throughput: 4604.30 iter/sec. FFTlen=320K, Type=3, Arch=4, Pass1=128, Pass2=2560, clm=4 (8 cpus, 8 workers): 1.35, 1.34, 1.32, 1.32, 1.36, 1.36, 1.31, 1.31 ms. Throughput: 5998.36 iter/sec. FFTlen=320K, Type=3, Arch=4, Pass1=128, Pass2=2560, clm=2 (8 cpus, 1 worker): 0.22 ms. Throughput: 4479.91 iter/sec. FFTlen=320K, Type=3, Arch=4, Pass1=128, Pass2=2560, clm=2 (8 cpus, 8 workers): 1.30, 1.29, 1.28, 1.27, 1.31, 1.31, 1.27, 1.27 ms. Throughput: 6222.31 iter/sec. FFTlen=320K, Type=3, Arch=4, Pass1=128, Pass2=2560, clm=1 (8 cpus, 1 worker): 0.31 ms. Throughput: 3174.82 iter/sec. FFTlen=320K, Type=3, Arch=4, Pass1=128, Pass2=2560, clm=1 (8 cpus, 8 workers): 1.32, 1.32, 1.30, 1.29, 1.34, 1.33, 1.29, 1.29 ms. Throughput: 6107.43 iter/sec. FFTlen=320K, Type=3, Arch=4, Pass1=256, Pass2=1280, clm=4 (8 cpus, 1 worker): 0.22 ms. Throughput: 4517.53 iter/sec. FFTlen=320K, Type=3, Arch=4, Pass1=256, Pass2=1280, clm=4 (8 cpus, 8 workers): 1.33, 1.32, 1.31, 1.29, 1.34, 1.33, 1.29, 1.29 ms. Throughput: 6096.90 iter/sec. FFTlen=320K, Type=3, Arch=4, Pass1=256, Pass2=1280, clm=2 (8 cpus, 1 worker): 0.22 ms. Throughput: 4511.89 iter/sec. FFTlen=320K, Type=3, Arch=4, Pass1=256, Pass2=1280, clm=2 (8 cpus, 8 workers): 1.27, 1.26, 1.25, 1.24, 1.28, 1.27, 1.25, 1.24 ms. Throughput: 6368.39 iter/sec. FFTlen=320K, Type=3, Arch=4, Pass1=256, Pass2=1280, clm=1 (8 cpus, 1 worker): 0.23 ms. Throughput: 4382.49 iter/sec. FFTlen=320K, Type=3, Arch=4, Pass1=256, Pass2=1280, clm=1 (8 cpus, 8 workers): 1.26, 1.26, 1.24, 1.22, 1.27, 1.27, 1.22, 1.22 ms. Throughput: 6429.50 iter/sec. FFTlen=320K, Type=3, Arch=4, Pass1=320, Pass2=1024, clm=4 (8 cpus, 1 worker): 0.22 ms. Throughput: 4606.18 iter/sec. FFTlen=320K, Type=3, Arch=4, Pass1=320, Pass2=1024, clm=4 (8 cpus, 8 workers): 1.30, 1.28, 1.29, 1.26, 1.33, 1.28, 1.26, 1.26 ms. Throughput: 6241.43 iter/sec. FFTlen=320K, Type=3, Arch=4, Pass1=320, Pass2=1024, clm=2 (8 cpus, 1 worker): 0.23 ms. Throughput: 4441.81 iter/sec. FFTlen=320K, Type=3, Arch=4, Pass1=320, Pass2=1024, clm=2 (8 cpus, 8 workers): 1.27, 1.26, 1.25, 1.24, 1.28, 1.26, 1.24, 1.24 ms. Throughput: 6377.38 iter/sec. FFTlen=320K, Type=3, Arch=4, Pass1=320, Pass2=1024, clm=1 (8 cpus, 1 worker): 0.23 ms. Throughput: 4351.83 iter/sec. FFTlen=320K, Type=3, Arch=4, Pass1=320, Pass2=1024, clm=1 (8 cpus, 8 workers): 1.29, 1.28, 1.27, 1.25, 1.31, 1.29, 1.26, 1.25 ms. Throughput: 6281.38 iter/sec. FFTlen=320K, Type=3, Arch=4, Pass1=1024, Pass2=320, clm=4 (8 cpus, 1 worker): 0.34 ms. Throughput: 2972.14 iter/sec. FFTlen=320K, Type=3, Arch=4, Pass1=1024, Pass2=320, clm=4 (8 cpus, 8 workers): 1.48, 1.47, 1.47, 1.45, 1.50, 1.50, 1.45, 1.45 ms. Throughput: 5431.89 iter/sec. FFTlen=320K, Type=3, Arch=4, Pass1=1024, Pass2=320, clm=2 (8 cpus, 1 worker): 0.29 ms. Throughput: 3397.59 iter/sec. [Sat Apr 29 10:07:21 2017] FFTlen=320K, Type=3, Arch=4, Pass1=1024, Pass2=320, clm=2 (8 cpus, 8 workers): 1.36, 1.35, 1.35, 1.33, 1.38, 1.36, 1.33, 1.33 ms. Throughput: 5922.43 iter/sec. FFTlen=320K, Type=3, Arch=4, Pass1=1024, Pass2=320, clm=1 (8 cpus, 1 worker): 0.29 ms. Throughput: 3419.11 iter/sec. FFTlen=320K, Type=3, Arch=4, Pass1=1024, Pass2=320, clm=1 (8 cpus, 8 workers): 1.34, 1.33, 1.32, 1.31, 1.36, 1.35, 1.31, 1.30 ms. Throughput: 6027.67 iter/sec. FFTlen=336K, Type=3, Arch=4, Pass1=448, Pass2=768, clm=4 (8 cpus, 1 worker): 0.25 ms. Throughput: 4011.91 iter/sec. FFTlen=336K, Type=3, Arch=4, Pass1=448, Pass2=768, clm=4 (8 cpus, 8 workers): 1.41, 1.40, 1.40, 1.38, 1.44, 1.42, 1.38, 1.40 ms. Throughput: 5702.87 iter/sec. FFTlen=336K, Type=3, Arch=4, Pass1=448, Pass2=768, clm=2 (8 cpus, 1 worker): 0.25 ms. Throughput: 4059.67 iter/sec. FFTlen=336K, Type=3, Arch=4, Pass1=448, Pass2=768, clm=2 (8 cpus, 8 workers): 1.37, 1.36, 1.35, 1.34, 1.38, 1.37, 1.34, 1.34 ms. Throughput: 5904.60 iter/sec. FFTlen=336K, Type=3, Arch=4, Pass1=448, Pass2=768, clm=1 (8 cpus, 1 worker): 0.25 ms. Throughput: 3976.30 iter/sec. FFTlen=336K, Type=3, Arch=4, Pass1=448, Pass2=768, clm=1 (8 cpus, 8 workers): 1.39, 1.39, 1.37, 1.35, 1.41, 1.40, 1.36, 1.36 ms. Throughput: 5807.54 iter/sec. FFTlen=384K, Type=3, Arch=4, Pass1=128, Pass2=3072, clm=4 (8 cpus, 1 worker): 0.25 ms. Throughput: 3987.31 iter/sec. FFTlen=384K, Type=3, Arch=4, Pass1=128, Pass2=3072, clm=4 (8 cpus, 8 workers): 1.75, 1.74, 1.72, 1.70, 1.76, 1.75, 1.70, 1.70 ms. Throughput: 4630.88 iter/sec. FFTlen=384K, Type=3, Arch=4, Pass1=128, Pass2=3072, clm=2 (8 cpus, 1 worker): 0.26 ms. Throughput: 3860.53 iter/sec. FFTlen=384K, Type=3, Arch=4, Pass1=128, Pass2=3072, clm=2 (8 cpus, 8 workers): 1.71, 1.70, 1.68, 1.67, 1.74, 1.71, 1.67, 1.67 ms. Throughput: 4722.80 iter/sec. FFTlen=384K, Type=3, Arch=4, Pass1=128, Pass2=3072, clm=1 (8 cpus, 1 worker): 0.37 ms. Throughput: 2688.46 iter/sec. FFTlen=384K, Type=3, Arch=4, Pass1=128, Pass2=3072, clm=1 (8 cpus, 8 workers): 1.73, 1.72, 1.70, 1.68, 1.76, 1.73, 1.69, 1.68 ms. Throughput: 4673.95 iter/sec. FFTlen=384K, Type=3, Arch=4, Pass1=256, Pass2=1536, clm=4 (8 cpus, 1 worker): 0.26 ms. Throughput: 3808.50 iter/sec. FFTlen=384K, Type=3, Arch=4, Pass1=256, Pass2=1536, clm=4 (8 cpus, 8 workers): 1.62, 1.61, 1.60, 1.60, 1.64, 1.63, 1.59, 1.61 ms. Throughput: 4961.50 iter/sec. FFTlen=384K, Type=3, Arch=4, Pass1=256, Pass2=1536, clm=2 (8 cpus, 1 worker): 0.27 ms. Throughput: 3650.52 iter/sec. FFTlen=384K, Type=3, Arch=4, Pass1=256, Pass2=1536, clm=2 (8 cpus, 8 workers): 1.60, 1.58, 1.56, 1.54, 1.60, 1.59, 1.54, 1.54 ms. Throughput: 5100.94 iter/sec. FFTlen=384K, Type=3, Arch=4, Pass1=256, Pass2=1536, clm=1 (8 cpus, 1 worker): 0.28 ms. Throughput: 3613.58 iter/sec. FFTlen=384K, Type=3, Arch=4, Pass1=256, Pass2=1536, clm=1 (8 cpus, 8 workers): 1.57, 1.55, 1.52, 1.52, 1.57, 1.56, 1.52, 1.51 ms. Throughput: 5198.45 iter/sec. FFTlen=384K, Type=3, Arch=4, Pass1=384, Pass2=1024, clm=4 (8 cpus, 1 worker): 0.26 ms. Throughput: 3863.90 iter/sec. FFTlen=384K, Type=3, Arch=4, Pass1=384, Pass2=1024, clm=4 (8 cpus, 8 workers): 1.64, 1.60, 1.61, 1.60, 1.63, 1.61, 1.58, 1.58 ms. Throughput: 4973.63 iter/sec. FFTlen=384K, Type=3, Arch=4, Pass1=384, Pass2=1024, clm=2 (8 cpus, 1 worker): 0.26 ms. Throughput: 3815.77 iter/sec. FFTlen=384K, Type=3, Arch=4, Pass1=384, Pass2=1024, clm=2 (8 cpus, 8 workers): 1.60, 1.58, 1.56, 1.55, 1.61, 1.60, 1.55, 1.55 ms. Throughput: 5079.06 iter/sec. FFTlen=384K, Type=3, Arch=4, Pass1=384, Pass2=1024, clm=1 (8 cpus, 1 worker): 0.26 ms. Throughput: 3788.01 iter/sec. FFTlen=384K, Type=3, Arch=4, Pass1=384, Pass2=1024, clm=1 (8 cpus, 8 workers): 1.60, 1.58, 1.56, 1.55, 1.61, 1.59, 1.55, 1.55 ms. Throughput: 5081.00 iter/sec. FFTlen=384K, Type=3, Arch=4, Pass1=512, Pass2=768, clm=4 (8 cpus, 1 worker): 0.30 ms. Throughput: 3350.79 iter/sec. FFTlen=384K, Type=3, Arch=4, Pass1=512, Pass2=768, clm=4 (8 cpus, 8 workers): 1.76, 1.75, 1.73, 1.72, 1.77, 1.77, 1.72, 1.72 ms. Throughput: 4591.31 iter/sec. FFTlen=384K, Type=3, Arch=4, Pass1=512, Pass2=768, clm=2 (8 cpus, 1 worker): 0.29 ms. Throughput: 3470.30 iter/sec. FFTlen=384K, Type=3, Arch=4, Pass1=512, Pass2=768, clm=2 (8 cpus, 8 workers): 1.61, 1.60, 1.58, 1.56, 1.64, 1.61, 1.56, 1.55 ms. Throughput: 5034.87 iter/sec. FFTlen=384K, Type=3, Arch=4, Pass1=512, Pass2=768, clm=1 (8 cpus, 1 worker): 0.29 ms. Throughput: 3507.00 iter/sec. FFTlen=384K, Type=3, Arch=4, Pass1=512, Pass2=768, clm=1 (8 cpus, 8 workers): 1.61, 1.58, 1.57, 1.55, 1.62, 1.60, 1.56, 1.55 ms. Throughput: 5061.51 iter/sec. FFTlen=400K, Type=3, Arch=4, Pass1=320, Pass2=1280, clm=4 (8 cpus, 1 worker): 0.27 ms. Throughput: 3687.28 iter/sec. FFTlen=400K, Type=3, Arch=4, Pass1=320, Pass2=1280, clm=4 (8 cpus, 8 workers): 1.73, 1.72, 1.70, 1.69, 1.74, 1.73, 1.69, 1.69 ms. Throughput: 4675.24 iter/sec. FFTlen=400K, Type=3, Arch=4, Pass1=320, Pass2=1280, clm=2 (8 cpus, 1 worker): 0.27 ms. Throughput: 3708.94 iter/sec. FFTlen=400K, Type=3, Arch=4, Pass1=320, Pass2=1280, clm=2 (8 cpus, 8 workers): 1.71, 1.69, 1.66, 1.65, 1.72, 1.70, 1.65, 1.65 ms. Throughput: 4765.86 iter/sec. [Sat Apr 29 10:12:23 2017] FFTlen=400K, Type=3, Arch=4, Pass1=320, Pass2=1280, clm=1 (8 cpus, 1 worker): 0.27 ms. Throughput: 3703.92 iter/sec. FFTlen=400K, Type=3, Arch=4, Pass1=320, Pass2=1280, clm=1 (8 cpus, 8 workers): 1.69, 1.67, 1.66, 1.64, 1.70, 1.68, 1.65, 1.64 ms. Throughput: 4803.35 iter/sec. FFTlen=448K, Type=3, Arch=4, Pass1=448, Pass2=1024, clm=4 (8 cpus, 1 worker): 0.30 ms. Throughput: 3342.15 iter/sec. FFTlen=448K, Type=3, Arch=4, Pass1=448, Pass2=1024, clm=4 (8 cpus, 8 workers): 2.05, 2.03, 2.01, 1.99, 2.06, 2.03, 1.99, 2.01 ms. Throughput: 3956.55 iter/sec. FFTlen=448K, Type=3, Arch=4, Pass1=448, Pass2=1024, clm=2 (8 cpus, 1 worker): 0.31 ms. Throughput: 3276.67 iter/sec. FFTlen=448K, Type=3, Arch=4, Pass1=448, Pass2=1024, clm=2 (8 cpus, 8 workers): 2.00, 1.98, 1.95, 1.94, 2.01, 1.99, 1.94, 1.94 ms. Throughput: 4065.47 iter/sec. FFTlen=448K, Type=3, Arch=4, Pass1=448, Pass2=1024, clm=1 (8 cpus, 1 worker): 0.31 ms. Throughput: 3183.84 iter/sec. FFTlen=448K, Type=3, Arch=4, Pass1=448, Pass2=1024, clm=1 (8 cpus, 8 workers): 2.01, 1.98, 1.96, 1.95, 2.01, 1.99, 1.95, 1.94 ms. Throughput: 4056.11 iter/sec. FFTlen=480K, Type=3, Arch=4, Pass1=128, Pass2=3840, clm=4 (8 cpus, 1 worker): 0.32 ms. Throughput: 3149.45 iter/sec. FFTlen=480K, Type=3, Arch=4, Pass1=128, Pass2=3840, clm=4 (8 cpus, 8 workers): 2.41, 2.38, 2.34, 2.33, 2.41, 2.39, 2.34, 2.33 ms. Throughput: 3382.65 iter/sec. FFTlen=480K, Type=3, Arch=4, Pass1=128, Pass2=3840, clm=2 (8 cpus, 1 worker): 0.33 ms. Throughput: 3002.55 iter/sec. FFTlen=480K, Type=3, Arch=4, Pass1=128, Pass2=3840, clm=2 (8 cpus, 8 workers): 2.39, 2.37, 2.33, 2.31, 2.42, 2.38, 2.31, 2.31 ms. Throughput: 3400.85 iter/sec. FFTlen=480K, Type=3, Arch=4, Pass1=128, Pass2=3840, clm=1 (8 cpus, 1 worker): 0.52 ms. Throughput: 1935.05 iter/sec. FFTlen=480K, Type=3, Arch=4, Pass1=128, Pass2=3840, clm=1 (8 cpus, 8 workers): 2.37, 2.34, 2.33, 2.31, 2.39, 2.36, 2.30, 2.31 ms. Throughput: 3420.84 iter/sec. FFTlen=480K, Type=3, Arch=4, Pass1=320, Pass2=1536, clm=4 (8 cpus, 1 worker): 0.32 ms. Throughput: 3130.74 iter/sec. FFTlen=480K, Type=3, Arch=4, Pass1=320, Pass2=1536, clm=4 (8 cpus, 8 workers): 2.21, 2.19, 2.15, 2.13, 2.24, 2.20, 2.13, 2.13 ms. Throughput: 3683.03 iter/sec. FFTlen=480K, Type=3, Arch=4, Pass1=320, Pass2=1536, clm=2 (8 cpus, 1 worker): 0.32 ms. Throughput: 3120.51 iter/sec. FFTlen=480K, Type=3, Arch=4, Pass1=320, Pass2=1536, clm=2 (8 cpus, 8 workers): 2.16, 2.14, 2.12, 2.10, 2.18, 2.16, 2.10, 2.10 ms. Throughput: 3753.04 iter/sec. FFTlen=480K, Type=3, Arch=4, Pass1=320, Pass2=1536, clm=1 (8 cpus, 1 worker): 0.34 ms. Throughput: 2945.07 iter/sec. FFTlen=480K, Type=3, Arch=4, Pass1=320, Pass2=1536, clm=1 (8 cpus, 8 workers): 2.15, 2.12, 2.11, 2.08, 2.16, 2.14, 2.09, 2.09 ms. Throughput: 3778.72 iter/sec. FFTlen=480K, Type=3, Arch=4, Pass1=384, Pass2=1280, clm=4 (8 cpus, 1 worker): 0.31 ms. Throughput: 3183.66 iter/sec. FFTlen=480K, Type=3, Arch=4, Pass1=384, Pass2=1280, clm=4 (8 cpus, 8 workers): 2.21, 2.19, 2.15, 2.13, 2.21, 2.21, 2.13, 2.13 ms. Throughput: 3685.30 iter/sec. FFTlen=480K, Type=3, Arch=4, Pass1=384, Pass2=1280, clm=2 (8 cpus, 1 worker): 0.31 ms. Throughput: 3196.39 iter/sec. FFTlen=480K, Type=3, Arch=4, Pass1=384, Pass2=1280, clm=2 (8 cpus, 8 workers): 2.14, 2.12, 2.10, 2.09, 2.15, 2.13, 2.10, 2.10 ms. Throughput: 3783.99 iter/sec. FFTlen=480K, Type=3, Arch=4, Pass1=384, Pass2=1280, clm=1 (8 cpus, 1 worker): 0.32 ms. Throughput: 3135.45 iter/sec. FFTlen=480K, Type=3, Arch=4, Pass1=384, Pass2=1280, clm=1 (8 cpus, 8 workers): 2.17, 2.14, 2.12, 2.09, 2.18, 2.15, 2.10, 2.09 ms. Throughput: 3757.85 iter/sec. FFTlen=480K, Type=3, Arch=4, Pass1=640, Pass2=768, clm=4 (8 cpus, 1 worker): 0.37 ms. Throughput: 2702.21 iter/sec. FFTlen=480K, Type=3, Arch=4, Pass1=640, Pass2=768, clm=4 (8 cpus, 8 workers): 2.40, 2.36, 2.33, 2.31, 2.41, 2.38, 2.31, 2.31 ms. Throughput: 3403.96 iter/sec. FFTlen=480K, Type=3, Arch=4, Pass1=640, Pass2=768, clm=2 (8 cpus, 1 worker): 0.36 ms. Throughput: 2753.10 iter/sec. FFTlen=480K, Type=3, Arch=4, Pass1=640, Pass2=768, clm=2 (8 cpus, 8 workers): 2.18, 2.16, 2.14, 2.12, 2.21, 2.18, 2.13, 2.12 ms. Throughput: 3711.12 iter/sec. FFTlen=480K, Type=3, Arch=4, Pass1=640, Pass2=768, clm=1 (8 cpus, 1 worker): 0.36 ms. Throughput: 2750.67 iter/sec. FFTlen=480K, Type=3, Arch=4, Pass1=640, Pass2=768, clm=1 (8 cpus, 8 workers): 2.16, 2.15, 2.11, 2.09, 2.20, 2.16, 2.10, 2.10 ms. Throughput: 3748.67 iter/sec. FFTlen=512K, Type=3, Arch=4, Pass1=128, Pass2=4096, clm=4 (8 cpus, 1 worker): 0.33 ms. Throughput: 3066.37 iter/sec. FFTlen=512K, Type=3, Arch=4, Pass1=128, Pass2=4096, clm=4 (8 cpus, 8 workers): 2.55, 2.55, 2.50, 2.48, 2.57, 2.55, 2.48, 2.48 ms. Throughput: 3175.80 iter/sec. FFTlen=512K, Type=3, Arch=4, Pass1=128, Pass2=4096, clm=2 (8 cpus, 1 worker): 0.33 ms. Throughput: 3019.17 iter/sec. FFTlen=512K, Type=3, Arch=4, Pass1=128, Pass2=4096, clm=2 (8 cpus, 8 workers): 2.55, 2.53, 2.51, 2.49, 2.57, 2.57, 2.49, 2.49 ms. Throughput: 3168.37 iter/sec. FFTlen=512K, Type=3, Arch=4, Pass1=128, Pass2=4096, clm=1 (8 cpus, 1 worker): 0.52 ms. Throughput: 1939.95 iter/sec. [Sat Apr 29 10:17:26 2017] FFTlen=512K, Type=3, Arch=4, Pass1=128, Pass2=4096, clm=1 (8 cpus, 8 workers): 2.57, 2.55, 2.52, 2.49, 2.60, 2.57, 2.50, 2.49 ms. Throughput: 3154.59 iter/sec. FFTlen=512K, Type=3, Arch=4, Pass1=256, Pass2=2048, clm=4 (8 cpus, 1 worker): 0.32 ms. Throughput: 3087.35 iter/sec. FFTlen=512K, Type=3, Arch=4, Pass1=256, Pass2=2048, clm=4 (8 cpus, 8 workers): 2.39, 2.36, 2.33, 2.32, 2.39, 2.37, 2.32, 2.31 ms. Throughput: 3406.68 iter/sec. FFTlen=512K, Type=3, Arch=4, Pass1=256, Pass2=2048, clm=2 (8 cpus, 1 worker): 0.32 ms. Throughput: 3156.25 iter/sec. FFTlen=512K, Type=3, Arch=4, Pass1=256, Pass2=2048, clm=2 (8 cpus, 8 workers): 2.33, 2.30, 2.28, 2.25, 2.35, 2.33, 2.25, 2.25 ms. Throughput: 3490.60 iter/sec. FFTlen=512K, Type=3, Arch=4, Pass1=256, Pass2=2048, clm=1 (8 cpus, 1 worker): 0.33 ms. Throughput: 3042.37 iter/sec. FFTlen=512K, Type=3, Arch=4, Pass1=256, Pass2=2048, clm=1 (8 cpus, 8 workers): 2.32, 2.31, 2.29, 2.25, 2.35, 2.32, 2.26, 2.25 ms. Throughput: 3490.34 iter/sec. FFTlen=512K, Type=3, Arch=4, Pass1=512, Pass2=1024, clm=4 (8 cpus, 1 worker): 0.37 ms. Throughput: 2719.70 iter/sec. FFTlen=512K, Type=3, Arch=4, Pass1=512, Pass2=1024, clm=4 (8 cpus, 8 workers): 2.56, 2.53, 2.50, 2.48, 2.55, 2.54, 2.49, 2.48 ms. Throughput: 3179.38 iter/sec. FFTlen=512K, Type=3, Arch=4, Pass1=512, Pass2=1024, clm=2 (8 cpus, 1 worker): 0.35 ms. Throughput: 2822.09 iter/sec. FFTlen=512K, Type=3, Arch=4, Pass1=512, Pass2=1024, clm=2 (8 cpus, 8 workers): 2.43, 2.41, 2.37, 2.34, 2.44, 2.41, 2.35, 2.34 ms. Throughput: 3354.57 iter/sec. FFTlen=512K, Type=3, Arch=4, Pass1=512, Pass2=1024, clm=1 (8 cpus, 1 worker): 0.36 ms. Throughput: 2808.86 iter/sec. FFTlen=512K, Type=3, Arch=4, Pass1=512, Pass2=1024, clm=1 (8 cpus, 8 workers): 2.40, 2.36, 2.33, 2.31, 2.40, 2.37, 2.31, 2.31 ms. Throughput: 3404.99 iter/sec. FFTlen=560K, Type=3, Arch=4, Pass1=448, Pass2=1280, clm=4 (8 cpus, 1 worker): 0.37 ms. Throughput: 2712.04 iter/sec. FFTlen=560K, Type=3, Arch=4, Pass1=448, Pass2=1280, clm=4 (8 cpus, 8 workers): 2.75, 2.69, 2.66, 2.63, 2.74, 2.69, 2.64, 2.64 ms. Throughput: 2985.06 iter/sec. FFTlen=560K, Type=3, Arch=4, Pass1=448, Pass2=1280, clm=2 (8 cpus, 1 worker): 0.36 ms. Throughput: 2751.08 iter/sec. FFTlen=560K, Type=3, Arch=4, Pass1=448, Pass2=1280, clm=2 (8 cpus, 8 workers): 2.69, 2.65, 2.61, 2.57, 2.71, 2.65, 2.59, 2.58 ms. Throughput: 3042.07 iter/sec. FFTlen=560K, Type=3, Arch=4, Pass1=448, Pass2=1280, clm=1 (8 cpus, 1 worker): 0.38 ms. Throughput: 2665.93 iter/sec. FFTlen=560K, Type=3, Arch=4, Pass1=448, Pass2=1280, clm=1 (8 cpus, 8 workers): 2.66, 2.63, 2.61, 2.59, 2.68, 2.64, 2.59, 2.59 ms. Throughput: 3050.36 iter/sec. FFTlen=576K, Type=3, Arch=4, Pass1=128, Pass2=4608, clm=4 (8 cpus, 1 worker): 0.37 ms. Throughput: 2689.82 iter/sec. FFTlen=576K, Type=3, Arch=4, Pass1=128, Pass2=4608, clm=4 (8 cpus, 8 workers): 2.97, 2.93, 2.90, 2.87, 2.98, 2.97, 2.87, 2.84 ms. Throughput: 2745.38 iter/sec. FFTlen=576K, Type=3, Arch=4, Pass1=128, Pass2=4608, clm=2 (8 cpus, 1 worker): 0.37 ms. Throughput: 2702.01 iter/sec. FFTlen=576K, Type=3, Arch=4, Pass1=128, Pass2=4608, clm=2 (8 cpus, 8 workers): 2.95, 2.94, 2.91, 2.88, 2.97, 2.95, 2.88, 2.88 ms. Throughput: 2740.69 iter/sec. FFTlen=576K, Type=3, Arch=4, Pass1=128, Pass2=4608, clm=1 (8 cpus, 1 worker): 0.56 ms. Throughput: 1795.46 iter/sec. FFTlen=576K, Type=3, Arch=4, Pass1=128, Pass2=4608, clm=1 (8 cpus, 8 workers): 2.95, 2.90, 2.86, 2.84, 2.95, 2.94, 2.84, 2.84 ms. Throughput: 2768.72 iter/sec. FFTlen=576K, Type=3, Arch=4, Pass1=256, Pass2=2304, clm=4 (8 cpus, 1 worker): 0.38 ms. Throughput: 2642.58 iter/sec. FFTlen=576K, Type=3, Arch=4, Pass1=256, Pass2=2304, clm=4 (8 cpus, 8 workers): 2.91, 2.89, 2.84, 2.81, 2.92, 2.87, 2.81, 2.80 ms. Throughput: 2802.33 iter/sec. FFTlen=576K, Type=3, Arch=4, Pass1=256, Pass2=2304, clm=2 (8 cpus, 1 worker): 0.37 ms. Throughput: 2704.02 iter/sec. FFTlen=576K, Type=3, Arch=4, Pass1=256, Pass2=2304, clm=2 (8 cpus, 8 workers): 2.81, 2.78, 2.77, 2.75, 2.82, 2.80, 2.75, 2.74 ms. Throughput: 2880.48 iter/sec. FFTlen=576K, Type=3, Arch=4, Pass1=256, Pass2=2304, clm=1 (8 cpus, 1 worker): 0.38 ms. Throughput: 2647.22 iter/sec. FFTlen=576K, Type=3, Arch=4, Pass1=256, Pass2=2304, clm=1 (8 cpus, 8 workers): 2.81, 2.80, 2.76, 2.73, 2.84, 2.81, 2.73, 2.73 ms. Throughput: 2880.19 iter/sec. FFTlen=576K, Type=3, Arch=4, Pass1=384, Pass2=1536, clm=4 (8 cpus, 1 worker): 0.37 ms. Throughput: 2717.86 iter/sec. FFTlen=576K, Type=3, Arch=4, Pass1=384, Pass2=1536, clm=4 (8 cpus, 8 workers): 2.76, 2.72, 2.70, 2.66, 2.78, 2.75, 2.67, 2.66 ms. Throughput: 2950.70 iter/sec. FFTlen=576K, Type=3, Arch=4, Pass1=384, Pass2=1536, clm=2 (8 cpus, 1 worker): 0.39 ms. Throughput: 2588.40 iter/sec. FFTlen=576K, Type=3, Arch=4, Pass1=384, Pass2=1536, clm=2 (8 cpus, 8 workers): 2.73, 2.67, 2.65, 2.62, 2.73, 2.70, 2.64, 2.62 ms. Throughput: 2996.33 iter/sec. FFTlen=576K, Type=3, Arch=4, Pass1=384, Pass2=1536, clm=1 (8 cpus, 1 worker): 0.39 ms. Throughput: 2542.75 iter/sec. FFTlen=576K, Type=3, Arch=4, Pass1=384, Pass2=1536, clm=1 (8 cpus, 8 workers): 2.73, 2.67, 2.64, 2.61, 2.72, 2.70, 2.61, 2.61 ms. Throughput: 3008.14 iter/sec. [Sat Apr 29 10:22:30 2017] FFTlen=576K, Type=3, Arch=4, Pass1=768, Pass2=768, clm=4 (8 cpus, 1 worker): 0.46 ms. Throughput: 2153.94 iter/sec. FFTlen=576K, Type=3, Arch=4, Pass1=768, Pass2=768, clm=4 (8 cpus, 8 workers): 3.10, 3.04, 3.02, 2.99, 3.09, 3.06, 3.00, 2.99 ms. Throughput: 2635.11 iter/sec. FFTlen=576K, Type=3, Arch=4, Pass1=768, Pass2=768, clm=2 (8 cpus, 1 worker): 0.45 ms. Throughput: 2235.60 iter/sec. FFTlen=576K, Type=3, Arch=4, Pass1=768, Pass2=768, clm=2 (8 cpus, 8 workers): 2.85, 2.83, 2.78, 2.76, 2.89, 2.85, 2.75, 2.78 ms. Throughput: 2847.35 iter/sec. FFTlen=576K, Type=3, Arch=4, Pass1=768, Pass2=768, clm=1 (8 cpus, 1 worker): 0.46 ms. Throughput: 2188.34 iter/sec. FFTlen=576K, Type=3, Arch=4, Pass1=768, Pass2=768, clm=1 (8 cpus, 8 workers): 2.89, 2.85, 2.80, 2.77, 2.92, 2.84, 2.77, 2.78 ms. Throughput: 2831.18 iter/sec. FFTlen=640K, Type=3, Arch=4, Pass1=128, Pass2=5120, clm=4 (8 cpus, 1 worker): 0.41 ms. Throughput: 2453.56 iter/sec. FFTlen=640K, Type=3, Arch=4, Pass1=128, Pass2=5120, clm=4 (8 cpus, 8 workers): 3.32, 3.28, 3.27, 3.23, 3.31, 3.29, 3.24, 3.23 ms. Throughput: 2444.74 iter/sec. FFTlen=640K, Type=3, Arch=4, Pass1=128, Pass2=5120, clm=2 (8 cpus, 1 worker): 0.41 ms. Throughput: 2467.99 iter/sec. FFTlen=640K, Type=3, Arch=4, Pass1=128, Pass2=5120, clm=2 (8 cpus, 8 workers): 3.31, 3.30, 3.26, 3.21, 3.34, 3.31, 3.22, 3.22 ms. Throughput: 2446.17 iter/sec. FFTlen=640K, Type=3, Arch=4, Pass1=128, Pass2=5120, clm=1 (8 cpus, 1 worker): 0.64 ms. Throughput: 1552.45 iter/sec. FFTlen=640K, Type=3, Arch=4, Pass1=128, Pass2=5120, clm=1 (8 cpus, 8 workers): 3.28, 3.28, 3.24, 3.22, 3.31, 3.19, 3.24, 3.22 ms. Throughput: 2463.17 iter/sec. FFTlen=640K, Type=3, Arch=4, Pass1=256, Pass2=2560, clm=4 (8 cpus, 1 worker): 0.44 ms. Throughput: 2258.95 iter/sec. FFTlen=640K, Type=3, Arch=4, Pass1=256, Pass2=2560, clm=4 (8 cpus, 8 workers): 3.18, 3.15, 3.11, 3.07, 3.21, 3.16, 3.08, 3.07 ms. Throughput: 2557.77 iter/sec. FFTlen=640K, Type=3, Arch=4, Pass1=256, Pass2=2560, clm=2 (8 cpus, 1 worker): 0.40 ms. Throughput: 2522.68 iter/sec. FFTlen=640K, Type=3, Arch=4, Pass1=256, Pass2=2560, clm=2 (8 cpus, 8 workers): 3.08, 3.04, 3.02, 2.99, 3.10, 3.06, 2.99, 2.99 ms. Throughput: 2637.56 iter/sec. FFTlen=640K, Type=3, Arch=4, Pass1=256, Pass2=2560, clm=1 (8 cpus, 1 worker): 0.41 ms. Throughput: 2462.71 iter/sec. FFTlen=640K, Type=3, Arch=4, Pass1=256, Pass2=2560, clm=1 (8 cpus, 8 workers): 3.07, 3.02, 2.99, 2.95, 3.11, 3.05, 2.97, 2.95 ms. Throughput: 2654.72 iter/sec. FFTlen=640K, Type=3, Arch=4, Pass1=320, Pass2=2048, clm=4 (8 cpus, 1 worker): 0.39 ms. Throughput: 2582.07 iter/sec. FFTlen=640K, Type=3, Arch=4, Pass1=320, Pass2=2048, clm=4 (8 cpus, 8 workers): 3.12, 3.06, 3.01, 2.97, 3.13, 3.08, 2.98, 2.97 ms. Throughput: 2632.00 iter/sec. FFTlen=640K, Type=3, Arch=4, Pass1=320, Pass2=2048, clm=2 (8 cpus, 1 worker): 0.39 ms. Throughput: 2563.68 iter/sec. FFTlen=640K, Type=3, Arch=4, Pass1=320, Pass2=2048, clm=2 (8 cpus, 8 workers): 3.06, 3.02, 2.98, 2.95, 3.07, 3.04, 2.99, 2.95 ms. Throughput: 2660.26 iter/sec. FFTlen=640K, Type=3, Arch=4, Pass1=320, Pass2=2048, clm=1 (8 cpus, 1 worker): 0.40 ms. Throughput: 2530.10 iter/sec. FFTlen=640K, Type=3, Arch=4, Pass1=320, Pass2=2048, clm=1 (8 cpus, 8 workers): 3.07, 3.02, 2.97, 2.94, 3.08, 3.04, 2.96, 2.94 ms. Throughput: 2665.83 iter/sec. FFTlen=640K, Type=3, Arch=4, Pass1=512, Pass2=1280, clm=4 (8 cpus, 1 worker): 0.44 ms. Throughput: 2247.55 iter/sec. FFTlen=640K, Type=3, Arch=4, Pass1=512, Pass2=1280, clm=4 (8 cpus, 8 workers): 3.32, 3.31, 3.22, 3.20, 3.30, 3.27, 3.21, 3.19 ms. Throughput: 2460.61 iter/sec. FFTlen=640K, Type=3, Arch=4, Pass1=512, Pass2=1280, clm=2 (8 cpus, 1 worker): 0.42 ms. Throughput: 2374.27 iter/sec. FFTlen=640K, Type=3, Arch=4, Pass1=512, Pass2=1280, clm=2 (8 cpus, 8 workers): 3.20, 3.12, 3.08, 3.03, 3.17, 3.12, 3.05, 3.04 ms. Throughput: 2579.70 iter/sec. FFTlen=640K, Type=3, Arch=4, Pass1=512, Pass2=1280, clm=1 (8 cpus, 1 worker): 0.41 ms. Throughput: 2411.02 iter/sec. FFTlen=640K, Type=3, Arch=4, Pass1=512, Pass2=1280, clm=1 (8 cpus, 8 workers): 3.13, 3.13, 3.06, 3.03, 3.18, 3.12, 3.02, 3.03 ms. Throughput: 2590.77 iter/sec. FFTlen=640K, Type=3, Arch=4, Pass1=640, Pass2=1024, clm=4 (8 cpus, 1 worker): 0.45 ms. Throughput: 2204.09 iter/sec. FFTlen=640K, Type=3, Arch=4, Pass1=640, Pass2=1024, clm=4 (8 cpus, 8 workers): 3.33, 3.30, 3.25, 3.23, 3.38, 3.29, 3.23, 3.23 ms. Throughput: 2440.48 iter/sec. FFTlen=640K, Type=3, Arch=4, Pass1=640, Pass2=1024, clm=2 (8 cpus, 1 worker): 0.44 ms. Throughput: 2264.48 iter/sec. FFTlen=640K, Type=3, Arch=4, Pass1=640, Pass2=1024, clm=2 (8 cpus, 8 workers): 3.19, 3.12, 3.07, 3.03, 3.21, 3.13, 3.04, 3.03 ms. Throughput: 2578.42 iter/sec. FFTlen=640K, Type=3, Arch=4, Pass1=640, Pass2=1024, clm=1 (8 cpus, 1 worker): 0.44 ms. Throughput: 2262.62 iter/sec. FFTlen=640K, Type=3, Arch=4, Pass1=640, Pass2=1024, clm=1 (8 cpus, 8 workers): 3.16, 3.10, 3.05, 3.02, 3.14, 3.11, 3.04, 3.03 ms. Throughput: 2596.69 iter/sec. FFTlen=672K, Type=3, Arch=4, Pass1=448, Pass2=1536, clm=4 (8 cpus, 1 worker): 0.44 ms. Throughput: 2290.38 iter/sec. [Sat Apr 29 10:27:35 2017] FFTlen=672K, Type=3, Arch=4, Pass1=448, Pass2=1536, clm=4 (8 cpus, 8 workers): 3.38, 3.29, 3.24, 3.22, 3.38, 3.32, 3.22, 3.22 ms. Throughput: 2436.40 iter/sec. FFTlen=672K, Type=3, Arch=4, Pass1=448, Pass2=1536, clm=2 (8 cpus, 1 worker): 0.45 ms. Throughput: 2229.82 iter/sec. FFTlen=672K, Type=3, Arch=4, Pass1=448, Pass2=1536, clm=2 (8 cpus, 8 workers): 3.29, 3.27, 3.21, 3.17, 3.34, 3.30, 3.18, 3.19 ms. Throughput: 2466.79 iter/sec. FFTlen=672K, Type=3, Arch=4, Pass1=448, Pass2=1536, clm=1 (8 cpus, 1 worker): 0.47 ms. Throughput: 2121.02 iter/sec. FFTlen=672K, Type=3, Arch=4, Pass1=448, Pass2=1536, clm=1 (8 cpus, 8 workers): 3.29, 3.23, 3.19, 3.16, 3.27, 3.27, 3.18, 3.16 ms. Throughput: 2487.18 iter/sec. FFTlen=672K, Type=3, Arch=4, Pass1=896, Pass2=768, clm=4 (8 cpus, 1 worker): 0.55 ms. Throughput: 1825.09 iter/sec. FFTlen=672K, Type=3, Arch=4, Pass1=896, Pass2=768, clm=4 (8 cpus, 8 workers): 3.85, 3.73, 3.68, 3.66, 3.84, 3.76, 3.64, 3.64 ms. Throughput: 2148.34 iter/sec. FFTlen=672K, Type=3, Arch=4, Pass1=896, Pass2=768, clm=2 (8 cpus, 1 worker): 0.52 ms. Throughput: 1913.35 iter/sec. FFTlen=672K, Type=3, Arch=4, Pass1=896, Pass2=768, clm=2 (8 cpus, 8 workers): 3.50, 3.45, 3.43, 3.35, 3.47, 3.51, 3.37, 3.39 ms. Throughput: 2331.07 iter/sec. FFTlen=672K, Type=3, Arch=4, Pass1=896, Pass2=768, clm=1 (8 cpus, 1 worker): 0.56 ms. Throughput: 1790.51 iter/sec. FFTlen=672K, Type=3, Arch=4, Pass1=896, Pass2=768, clm=1 (8 cpus, 8 workers): 3.63, 3.57, 3.54, 3.51, 3.67, 3.63, 3.52, 3.50 ms. Throughput: 2241.38 iter/sec. FFTlen=720K, Type=3, Arch=4, Pass1=320, Pass2=2304, clm=4 (8 cpus, 1 worker): 0.47 ms. Throughput: 2114.47 iter/sec. FFTlen=720K, Type=3, Arch=4, Pass1=320, Pass2=2304, clm=4 (8 cpus, 8 workers): 3.71, 3.63, 3.60, 3.53, 3.70, 3.68, 3.56, 3.55 ms. Throughput: 2209.77 iter/sec. FFTlen=720K, Type=3, Arch=4, Pass1=320, Pass2=2304, clm=2 (8 cpus, 1 worker): 0.46 ms. Throughput: 2189.81 iter/sec. FFTlen=720K, Type=3, Arch=4, Pass1=320, Pass2=2304, clm=2 (8 cpus, 8 workers): 3.65, 3.62, 3.60, 3.56, 3.67, 3.63, 3.57, 3.56 ms. Throughput: 2218.04 iter/sec. FFTlen=720K, Type=3, Arch=4, Pass1=320, Pass2=2304, clm=1 (8 cpus, 1 worker): 0.47 ms. Throughput: 2141.38 iter/sec. FFTlen=720K, Type=3, Arch=4, Pass1=320, Pass2=2304, clm=1 (8 cpus, 8 workers): 3.66, 3.60, 3.56, 3.51, 3.69, 3.62, 3.54, 3.54 ms. Throughput: 2229.27 iter/sec. FFTlen=768K, Type=3, Arch=4, Pass1=128, Pass2=6144, clm=4 (8 cpus, 1 worker): 0.48 ms. Throughput: 2088.69 iter/sec. FFTlen=768K, Type=3, Arch=4, Pass1=128, Pass2=6144, clm=4 (8 cpus, 8 workers): 4.00, 3.92, 3.89, 3.91, 4.02, 3.94, 3.96, 4.00 ms. Throughput: 2021.96 iter/sec. FFTlen=768K, Type=3, Arch=4, Pass1=128, Pass2=6144, clm=2 (8 cpus, 1 worker): 0.48 ms. Throughput: 2088.79 iter/sec. FFTlen=768K, Type=3, Arch=4, Pass1=128, Pass2=6144, clm=2 (8 cpus, 8 workers): 4.05, 3.98, 3.90, 3.88, 4.07, 4.01, 3.98, 3.97 ms. Throughput: 2010.14 iter/sec. FFTlen=768K, Type=3, Arch=4, Pass1=128, Pass2=6144, clm=1 (8 cpus, 1 worker): 0.73 ms. Throughput: 1367.74 iter/sec. FFTlen=768K, Type=3, Arch=4, Pass1=128, Pass2=6144, clm=1 (8 cpus, 8 workers): 3.99, 3.97, 3.91, 3.91, 4.03, 3.95, 3.92, 3.92 ms. Throughput: 2025.66 iter/sec. FFTlen=768K, Type=3, Arch=4, Pass1=256, Pass2=3072, clm=4 (8 cpus, 1 worker): 0.48 ms. Throughput: 2093.46 iter/sec. FFTlen=768K, Type=3, Arch=4, Pass1=256, Pass2=3072, clm=4 (8 cpus, 8 workers): 3.97, 3.93, 3.87, 3.82, 4.02, 3.92, 3.83, 3.83 ms. Throughput: 2052.60 iter/sec. FFTlen=768K, Type=3, Arch=4, Pass1=256, Pass2=3072, clm=2 (8 cpus, 1 worker): 0.48 ms. Throughput: 2089.08 iter/sec. FFTlen=768K, Type=3, Arch=4, Pass1=256, Pass2=3072, clm=2 (8 cpus, 8 workers): 3.90, 3.83, 3.77, 3.75, 3.91, 3.86, 3.75, 3.74 ms. Throughput: 2097.88 iter/sec. FFTlen=768K, Type=3, Arch=4, Pass1=256, Pass2=3072, clm=1 (8 cpus, 1 worker): 0.50 ms. Throughput: 2013.04 iter/sec. FFTlen=768K, Type=3, Arch=4, Pass1=256, Pass2=3072, clm=1 (8 cpus, 8 workers): 3.86, 3.84, 3.77, 3.72, 3.92, 3.85, 3.72, 3.72 ms. Throughput: 2104.97 iter/sec. FFTlen=768K, Type=3, Arch=4, Pass1=384, Pass2=2048, clm=4 (8 cpus, 1 worker): 0.45 ms. Throughput: 2203.76 iter/sec. FFTlen=768K, Type=3, Arch=4, Pass1=384, Pass2=2048, clm=4 (8 cpus, 8 workers): 3.79, 3.76, 3.69, 3.65, 3.84, 3.77, 3.65, 3.64 ms. Throughput: 2149.18 iter/sec. FFTlen=768K, Type=3, Arch=4, Pass1=384, Pass2=2048, clm=2 (8 cpus, 1 worker): 0.46 ms. Throughput: 2172.56 iter/sec. FFTlen=768K, Type=3, Arch=4, Pass1=384, Pass2=2048, clm=2 (8 cpus, 8 workers): 3.77, 3.69, 3.64, 3.62, 3.74, 3.70, 3.63, 3.62 ms. Throughput: 2175.69 iter/sec. FFTlen=768K, Type=3, Arch=4, Pass1=384, Pass2=2048, clm=1 (8 cpus, 1 worker): 0.46 ms. Throughput: 2151.92 iter/sec. FFTlen=768K, Type=3, Arch=4, Pass1=384, Pass2=2048, clm=1 (8 cpus, 8 workers): 3.79, 3.70, 3.65, 3.61, 3.80, 3.74, 3.61, 3.60 ms. Throughput: 2170.47 iter/sec. FFTlen=768K, Type=3, Arch=4, Pass1=512, Pass2=1536, clm=4 (8 cpus, 1 worker): 0.54 ms. Throughput: 1862.05 iter/sec. FFTlen=768K, Type=3, Arch=4, Pass1=512, Pass2=1536, clm=4 (8 cpus, 8 workers): 4.02, 3.96, 3.92, 3.88, 4.02, 4.00, 3.90, 3.88 ms. Throughput: 2026.67 iter/sec. [Sat Apr 29 10:32:41 2017] FFTlen=768K, Type=3, Arch=4, Pass1=512, Pass2=1536, clm=2 (8 cpus, 1 worker): 0.49 ms. Throughput: 2032.22 iter/sec. FFTlen=768K, Type=3, Arch=4, Pass1=512, Pass2=1536, clm=2 (8 cpus, 8 workers): 3.90, 3.83, 3.75, 3.72, 3.92, 3.85, 3.72, 3.72 ms. Throughput: 2105.58 iter/sec. FFTlen=768K, Type=3, Arch=4, Pass1=512, Pass2=1536, clm=1 (8 cpus, 1 worker): 0.49 ms. Throughput: 2049.42 iter/sec. FFTlen=768K, Type=3, Arch=4, Pass1=512, Pass2=1536, clm=1 (8 cpus, 8 workers): 3.85, 3.79, 3.74, 3.69, 3.86, 3.81, 3.71, 3.70 ms. Throughput: 2124.17 iter/sec. FFTlen=768K, Type=3, Arch=4, Pass1=768, Pass2=1024, clm=4 (8 cpus, 1 worker): 0.54 ms. Throughput: 1840.80 iter/sec. FFTlen=768K, Type=3, Arch=4, Pass1=768, Pass2=1024, clm=4 (8 cpus, 8 workers): 4.17, 4.09, 4.04, 4.00, 4.25, 4.12, 4.00, 3.99 ms. Throughput: 1961.19 iter/sec. FFTlen=768K, Type=3, Arch=4, Pass1=768, Pass2=1024, clm=2 (8 cpus, 1 worker): 0.52 ms. Throughput: 1937.65 iter/sec. FFTlen=768K, Type=3, Arch=4, Pass1=768, Pass2=1024, clm=2 (8 cpus, 8 workers): 3.93, 3.82, 3.77, 3.73, 3.95, 3.85, 3.73, 3.73 ms. Throughput: 2099.20 iter/sec. FFTlen=768K, Type=3, Arch=4, Pass1=768, Pass2=1024, clm=1 (8 cpus, 1 worker): 0.55 ms. Throughput: 1828.75 iter/sec. FFTlen=768K, Type=3, Arch=4, Pass1=768, Pass2=1024, clm=1 (8 cpus, 8 workers): 3.83, 3.79, 3.75, 3.71, 3.87, 3.80, 3.72, 3.72 ms. Throughput: 2119.63 iter/sec. FFTlen=768K, Type=3, Arch=4, Pass1=1024, Pass2=768, clm=4 (8 cpus, 1 worker): 0.70 ms. Throughput: 1426.74 iter/sec. FFTlen=768K, Type=3, Arch=4, Pass1=1024, Pass2=768, clm=4 (8 cpus, 8 workers): 4.52, 4.44, 4.39, 4.33, 4.58, 4.51, 4.34, 4.33 ms. Throughput: 1806.98 iter/sec. FFTlen=768K, Type=3, Arch=4, Pass1=1024, Pass2=768, clm=2 (8 cpus, 1 worker): 0.61 ms. Throughput: 1634.99 iter/sec. FFTlen=768K, Type=3, Arch=4, Pass1=1024, Pass2=768, clm=2 (8 cpus, 8 workers): 4.18, 4.06, 4.02, 3.99, 4.28, 4.07, 4.00, 3.99 ms. Throughput: 1964.45 iter/sec. FFTlen=768K, Type=3, Arch=4, Pass1=1024, Pass2=768, clm=1 (8 cpus, 1 worker): 0.66 ms. Throughput: 1523.41 iter/sec. FFTlen=768K, Type=3, Arch=4, Pass1=1024, Pass2=768, clm=1 (8 cpus, 8 workers): 4.28, 4.22, 4.20, 4.14, 4.36, 4.28, 4.14, 4.10 ms. Throughput: 1898.09 iter/sec. FFTlen=800K, Type=3, Arch=4, Pass1=128, Pass2=6400, clm=4 (8 cpus, 1 worker): 0.51 ms. Throughput: 1953.52 iter/sec. FFTlen=800K, Type=3, Arch=4, Pass1=128, Pass2=6400, clm=4 (8 cpus, 8 workers): 4.34, 4.31, 4.27, 4.25, 4.34, 4.33, 4.26, 4.24 ms. Throughput: 1863.90 iter/sec. FFTlen=800K, Type=3, Arch=4, Pass1=128, Pass2=6400, clm=2 (8 cpus, 1 worker): 0.52 ms. Throughput: 1927.57 iter/sec. FFTlen=800K, Type=3, Arch=4, Pass1=128, Pass2=6400, clm=2 (8 cpus, 8 workers): 4.36, 4.29, 4.26, 4.22, 4.40, 4.36, 4.22, 4.19 ms. Throughput: 1866.53 iter/sec. FFTlen=800K, Type=3, Arch=4, Pass1=128, Pass2=6400, clm=1 (8 cpus, 1 worker): 0.77 ms. Throughput: 1293.71 iter/sec. FFTlen=800K, Type=3, Arch=4, Pass1=128, Pass2=6400, clm=1 (8 cpus, 8 workers): 4.30, 4.28, 4.23, 4.22, 4.31, 4.30, 4.21, 4.22 ms. Throughput: 1879.22 iter/sec. FFTlen=800K, Type=3, Arch=4, Pass1=320, Pass2=2560, clm=4 (8 cpus, 1 worker): 0.49 ms. Throughput: 2033.04 iter/sec. FFTlen=800K, Type=3, Arch=4, Pass1=320, Pass2=2560, clm=4 (8 cpus, 8 workers): 4.03, 3.96, 3.90, 3.90, 4.05, 4.03, 3.88, 3.87 ms. Throughput: 2025.14 iter/sec. FFTlen=800K, Type=3, Arch=4, Pass1=320, Pass2=2560, clm=2 (8 cpus, 1 worker): 0.48 ms. Throughput: 2081.59 iter/sec. FFTlen=800K, Type=3, Arch=4, Pass1=320, Pass2=2560, clm=2 (8 cpus, 8 workers): 4.01, 3.94, 3.89, 3.86, 4.00, 3.98, 3.87, 3.85 ms. Throughput: 2038.06 iter/sec. FFTlen=800K, Type=3, Arch=4, Pass1=320, Pass2=2560, clm=1 (8 cpus, 1 worker): 0.50 ms. Throughput: 1989.46 iter/sec. FFTlen=800K, Type=3, Arch=4, Pass1=320, Pass2=2560, clm=1 (8 cpus, 8 workers): 3.97, 3.92, 3.88, 3.86, 4.01, 3.95, 3.87, 3.86 ms. Throughput: 2044.05 iter/sec. FFTlen=800K, Type=3, Arch=4, Pass1=640, Pass2=1280, clm=4 (8 cpus, 1 worker): 0.55 ms. Throughput: 1823.80 iter/sec. FFTlen=800K, Type=3, Arch=4, Pass1=640, Pass2=1280, clm=4 (8 cpus, 8 workers): 4.34, 4.21, 4.19, 4.14, 4.33, 4.24, 4.15, 4.14 ms. Throughput: 1897.53 iter/sec. FFTlen=800K, Type=3, Arch=4, Pass1=640, Pass2=1280, clm=2 (8 cpus, 1 worker): 0.52 ms. Throughput: 1908.91 iter/sec. FFTlen=800K, Type=3, Arch=4, Pass1=640, Pass2=1280, clm=2 (8 cpus, 8 workers): 4.03, 3.94, 3.92, 3.89, 3.99, 3.98, 3.90, 3.88 ms. Throughput: 2029.16 iter/sec. FFTlen=800K, Type=3, Arch=4, Pass1=640, Pass2=1280, clm=1 (8 cpus, 1 worker): 0.53 ms. Throughput: 1887.21 iter/sec. FFTlen=800K, Type=3, Arch=4, Pass1=640, Pass2=1280, clm=1 (8 cpus, 8 workers): 4.03, 3.94, 3.88, 3.84, 4.06, 3.96, 3.83, 3.83 ms. Throughput: 2040.61 iter/sec. FFTlen=864K, Type=3, Arch=4, Pass1=384, Pass2=2304, clm=4 (8 cpus, 1 worker): 0.55 ms. Throughput: 1829.28 iter/sec. FFTlen=864K, Type=3, Arch=4, Pass1=384, Pass2=2304, clm=4 (8 cpus, 8 workers): 4.42, 4.39, 4.37, 4.30, 4.46, 4.44, 4.37, 4.36 ms. Throughput: 1823.33 iter/sec. FFTlen=864K, Type=3, Arch=4, Pass1=384, Pass2=2304, clm=2 (8 cpus, 1 worker): 0.55 ms. Throughput: 1830.86 iter/sec. [Sat Apr 29 10:37:48 2017] FFTlen=864K, Type=3, Arch=4, Pass1=384, Pass2=2304, clm=2 (8 cpus, 8 workers): 4.46, 4.37, 4.32, 4.27, 4.53, 4.41, 4.29, 4.27 ms. Throughput: 1832.82 iter/sec. FFTlen=864K, Type=3, Arch=4, Pass1=384, Pass2=2304, clm=1 (8 cpus, 1 worker): 0.54 ms. Throughput: 1835.92 iter/sec. FFTlen=864K, Type=3, Arch=4, Pass1=384, Pass2=2304, clm=1 (8 cpus, 8 workers): 4.45, 4.38, 4.33, 4.28, 4.47, 4.38, 4.30, 4.28 ms. Throughput: 1835.59 iter/sec. FFTlen=896K, Type=3, Arch=4, Pass1=448, Pass2=2048, clm=4 (8 cpus, 1 worker): 0.54 ms. Throughput: 1861.24 iter/sec. FFTlen=896K, Type=3, Arch=4, Pass1=448, Pass2=2048, clm=4 (8 cpus, 8 workers): 4.58, 4.45, 4.40, 4.37, 4.57, 4.56, 4.39, 4.36 ms. Throughput: 1794.38 iter/sec. FFTlen=896K, Type=3, Arch=4, Pass1=448, Pass2=2048, clm=2 (8 cpus, 1 worker): 0.54 ms. Throughput: 1855.95 iter/sec. FFTlen=896K, Type=3, Arch=4, Pass1=448, Pass2=2048, clm=2 (8 cpus, 8 workers): 4.54, 4.43, 4.37, 4.33, 4.55, 4.50, 4.33, 4.31 ms. Throughput: 1810.32 iter/sec. FFTlen=896K, Type=3, Arch=4, Pass1=448, Pass2=2048, clm=1 (8 cpus, 1 worker): 0.55 ms. Throughput: 1806.52 iter/sec. FFTlen=896K, Type=3, Arch=4, Pass1=448, Pass2=2048, clm=1 (8 cpus, 8 workers): 4.50, 4.38, 4.37, 4.33, 4.52, 4.44, 4.34, 4.33 ms. Throughput: 1818.88 iter/sec. FFTlen=896K, Type=3, Arch=4, Pass1=896, Pass2=1024, clm=4 (8 cpus, 1 worker): 0.64 ms. Throughput: 1569.92 iter/sec. FFTlen=896K, Type=3, Arch=4, Pass1=896, Pass2=1024, clm=4 (8 cpus, 8 workers): 5.06, 4.88, 4.81, 4.77, 5.01, 4.94, 4.79, 4.76 ms. Throughput: 1640.99 iter/sec. FFTlen=896K, Type=3, Arch=4, Pass1=896, Pass2=1024, clm=2 (8 cpus, 1 worker): 0.60 ms. Throughput: 1654.19 iter/sec. FFTlen=896K, Type=3, Arch=4, Pass1=896, Pass2=1024, clm=2 (8 cpus, 8 workers): 4.67, 4.61, 4.51, 4.47, 4.65, 4.57, 4.48, 4.49 ms. Throughput: 1756.31 iter/sec. FFTlen=896K, Type=3, Arch=4, Pass1=896, Pass2=1024, clm=1 (8 cpus, 1 worker): 0.63 ms. Throughput: 1593.70 iter/sec. FFTlen=896K, Type=3, Arch=4, Pass1=896, Pass2=1024, clm=1 (8 cpus, 8 workers): 4.63, 4.54, 4.48, 4.45, 4.69, 4.55, 4.45, 4.43 ms. Throughput: 1767.81 iter/sec. FFTlen=960K, Type=3, Arch=4, Pass1=128, Pass2=7680, clm=4 (8 cpus, 1 worker): 0.60 ms. Throughput: 1661.87 iter/sec. FFTlen=960K, Type=3, Arch=4, Pass1=128, Pass2=7680, clm=4 (8 cpus, 8 workers): 5.18, 5.16, 5.08, 5.07, 5.23, 5.15, 5.08, 5.06 ms. Throughput: 1560.67 iter/sec. FFTlen=960K, Type=3, Arch=4, Pass1=128, Pass2=7680, clm=2 (8 cpus, 1 worker): 0.60 ms. Throughput: 1653.96 iter/sec. FFTlen=960K, Type=3, Arch=4, Pass1=128, Pass2=7680, clm=2 (8 cpus, 8 workers): 5.21, 5.20, 5.12, 5.09, 5.38, 5.22, 5.07, 5.07 ms. Throughput: 1547.85 iter/sec. FFTlen=960K, Type=3, Arch=4, Pass1=128, Pass2=7680, clm=1 (8 cpus, 1 worker): 0.92 ms. Throughput: 1082.41 iter/sec. FFTlen=960K, Type=3, Arch=4, Pass1=128, Pass2=7680, clm=1 (8 cpus, 8 workers): 5.19, 5.15, 5.08, 5.04, 5.22, 5.08, 5.05, 5.05 ms. Throughput: 1566.30 iter/sec. FFTlen=960K, Type=3, Arch=4, Pass1=256, Pass2=3840, clm=4 (8 cpus, 1 worker): 0.60 ms. Throughput: 1662.43 iter/sec. FFTlen=960K, Type=3, Arch=4, Pass1=256, Pass2=3840, clm=4 (8 cpus, 8 workers): 5.12, 5.09, 4.96, 4.94, 5.12, 5.04, 4.95, 4.94 ms. Throughput: 1593.94 iter/sec. FFTlen=960K, Type=3, Arch=4, Pass1=256, Pass2=3840, clm=2 (8 cpus, 1 worker): 0.59 ms. Throughput: 1699.54 iter/sec. FFTlen=960K, Type=3, Arch=4, Pass1=256, Pass2=3840, clm=2 (8 cpus, 8 workers): 4.94, 4.91, 4.88, 4.85, 4.99, 4.97, 4.87, 4.84 ms. Throughput: 1630.73 iter/sec. FFTlen=960K, Type=3, Arch=4, Pass1=256, Pass2=3840, clm=1 (8 cpus, 1 worker): 0.60 ms. Throughput: 1678.59 iter/sec. FFTlen=960K, Type=3, Arch=4, Pass1=256, Pass2=3840, clm=1 (8 cpus, 8 workers): 4.96, 4.96, 4.90, 4.85, 5.07, 4.97, 4.85, 4.84 ms. Throughput: 1624.58 iter/sec. FFTlen=960K, Type=3, Arch=4, Pass1=320, Pass2=3072, clm=4 (8 cpus, 1 worker): 0.58 ms. Throughput: 1725.82 iter/sec. FFTlen=960K, Type=3, Arch=4, Pass1=320, Pass2=3072, clm=4 (8 cpus, 8 workers): 4.90, 4.88, 4.83, 4.79, 5.01, 4.90, 4.84, 4.79 ms. Throughput: 1643.70 iter/sec. FFTlen=960K, Type=3, Arch=4, Pass1=320, Pass2=3072, clm=2 (8 cpus, 1 worker): 0.59 ms. Throughput: 1695.13 iter/sec. FFTlen=960K, Type=3, Arch=4, Pass1=320, Pass2=3072, clm=2 (8 cpus, 8 workers): 4.97, 4.93, 4.84, 4.79, 5.00, 4.92, 4.83, 4.80 ms. Throughput: 1638.42 iter/sec. FFTlen=960K, Type=3, Arch=4, Pass1=320, Pass2=3072, clm=1 (8 cpus, 1 worker): 0.62 ms. Throughput: 1620.03 iter/sec. FFTlen=960K, Type=3, Arch=4, Pass1=320, Pass2=3072, clm=1 (8 cpus, 8 workers): 4.88, 4.86, 4.81, 4.78, 4.92, 4.88, 4.80, 4.79 ms. Throughput: 1652.55 iter/sec. FFTlen=960K, Type=3, Arch=4, Pass1=384, Pass2=2560, clm=4 (8 cpus, 1 worker): 0.58 ms. Throughput: 1713.73 iter/sec. FFTlen=960K, Type=3, Arch=4, Pass1=384, Pass2=2560, clm=4 (8 cpus, 8 workers): 4.91, 4.83, 4.75, 4.75, 4.92, 4.86, 4.70, 4.67 ms. Throughput: 1668.18 iter/sec. FFTlen=960K, Type=3, Arch=4, Pass1=384, Pass2=2560, clm=2 (8 cpus, 1 worker): 0.57 ms. Throughput: 1741.61 iter/sec. [Sat Apr 29 10:42:49 2017] FFTlen=960K, Type=3, Arch=4, Pass1=384, Pass2=2560, clm=2 (8 cpus, 8 workers): 4.77, 4.75, 4.69, 4.67, 4.84, 4.77, 4.69, 4.64 ms. Throughput: 1692.26 iter/sec. FFTlen=960K, Type=3, Arch=4, Pass1=384, Pass2=2560, clm=1 (8 cpus, 1 worker): 0.59 ms. Throughput: 1685.80 iter/sec. FFTlen=960K, Type=3, Arch=4, Pass1=384, Pass2=2560, clm=1 (8 cpus, 8 workers): 4.81, 4.80, 4.74, 4.67, 4.90, 4.87, 4.69, 4.67 ms. Throughput: 1678.57 iter/sec. FFTlen=960K, Type=3, Arch=4, Pass1=640, Pass2=1536, clm=4 (8 cpus, 1 worker): 0.66 ms. Throughput: 1509.99 iter/sec. FFTlen=960K, Type=3, Arch=4, Pass1=640, Pass2=1536, clm=4 (8 cpus, 8 workers): 5.18, 5.11, 5.05, 4.99, 5.30, 5.17, 4.99, 4.99 ms. Throughput: 1569.59 iter/sec. FFTlen=960K, Type=3, Arch=4, Pass1=640, Pass2=1536, clm=2 (8 cpus, 1 worker): 0.61 ms. Throughput: 1638.21 iter/sec. FFTlen=960K, Type=3, Arch=4, Pass1=640, Pass2=1536, clm=2 (8 cpus, 8 workers): 4.87, 4.85, 4.77, 4.74, 4.97, 4.89, 4.76, 4.74 ms. Throughput: 1659.41 iter/sec. FFTlen=960K, Type=3, Arch=4, Pass1=640, Pass2=1536, clm=1 (8 cpus, 1 worker): 0.61 ms. Throughput: 1649.08 iter/sec. FFTlen=960K, Type=3, Arch=4, Pass1=640, Pass2=1536, clm=1 (8 cpus, 8 workers): 4.87, 4.83, 4.74, 4.69, 4.92, 4.92, 4.69, 4.69 ms. Throughput: 1669.00 iter/sec. FFTlen=960K, Type=3, Arch=4, Pass1=768, Pass2=1280, clm=4 (8 cpus, 1 worker): 0.67 ms. Throughput: 1497.34 iter/sec. FFTlen=960K, Type=3, Arch=4, Pass1=768, Pass2=1280, clm=4 (8 cpus, 8 workers): 5.30, 5.31, 5.47, 5.24, 5.25, 5.17, 5.02, 5.02 ms. Throughput: 1532.51 iter/sec. FFTlen=960K, Type=3, Arch=4, Pass1=768, Pass2=1280, clm=2 (8 cpus, 1 worker): 0.62 ms. Throughput: 1621.10 iter/sec. FFTlen=960K, Type=3, Arch=4, Pass1=768, Pass2=1280, clm=2 (8 cpus, 8 workers): 4.89, 4.85, 4.75, 4.74, 4.93, 4.89, 4.71, 4.71 ms. Throughput: 1663.68 iter/sec. FFTlen=960K, Type=3, Arch=4, Pass1=768, Pass2=1280, clm=1 (8 cpus, 1 worker): 0.62 ms. Throughput: 1616.30 iter/sec. FFTlen=960K, Type=3, Arch=4, Pass1=768, Pass2=1280, clm=1 (8 cpus, 8 workers): 4.85, 4.79, 4.72, 4.71, 4.87, 4.84, 4.71, 4.70 ms. Throughput: 1676.05 iter/sec. FFTlen=960K, Type=3, Arch=4, Pass1=1280, Pass2=768, clm=4 (8 cpus, 1 worker): 0.77 ms. Throughput: 1290.97 iter/sec. FFTlen=960K, Type=3, Arch=4, Pass1=1280, Pass2=768, clm=4 (8 cpus, 8 workers): 5.60, 5.52, 5.43, 5.42, 5.64, 5.69, 5.39, 5.61 ms. Throughput: 1445.09 iter/sec. FFTlen=960K, Type=3, Arch=4, Pass1=1280, Pass2=768, clm=2 (8 cpus, 1 worker): 0.72 ms. Throughput: 1385.17 iter/sec. FFTlen=960K, Type=3, Arch=4, Pass1=1280, Pass2=768, clm=2 (8 cpus, 8 workers): 5.22, 5.19, 5.15, 5.08, 5.45, 5.36, 5.11, 5.16 ms. Throughput: 1534.85 iter/sec. FFTlen=960K, Type=3, Arch=4, Pass1=1280, Pass2=768, clm=1 (8 cpus, 1 worker): 0.80 ms. Throughput: 1253.57 iter/sec. FFTlen=960K, Type=3, Arch=4, Pass1=1280, Pass2=768, clm=1 (8 cpus, 8 workers): 5.46, 5.46, 5.38, 5.43, 5.54, 5.51, 5.35, 5.37 ms. Throughput: 1471.25 iter/sec. FFTlen=1008K, Type=3, Arch=4, Pass1=448, Pass2=2304, clm=4 (8 cpus, 1 worker): 0.65 ms. Throughput: 1549.13 iter/sec. FFTlen=1008K, Type=3, Arch=4, Pass1=448, Pass2=2304, clm=4 (8 cpus, 8 workers): 5.33, 5.32, 5.23, 5.17, 5.43, 5.34, 5.19, 5.15 ms. Throughput: 1519.10 iter/sec. FFTlen=1008K, Type=3, Arch=4, Pass1=448, Pass2=2304, clm=2 (8 cpus, 1 worker): 0.64 ms. Throughput: 1566.24 iter/sec. FFTlen=1008K, Type=3, Arch=4, Pass1=448, Pass2=2304, clm=2 (8 cpus, 8 workers): 5.24, 5.24, 5.18, 5.11, 5.32, 5.29, 5.15, 5.08 ms. Throughput: 1538.54 iter/sec. FFTlen=1008K, Type=3, Arch=4, Pass1=448, Pass2=2304, clm=1 (8 cpus, 1 worker): 0.64 ms. Throughput: 1554.28 iter/sec. FFTlen=1008K, Type=3, Arch=4, Pass1=448, Pass2=2304, clm=1 (8 cpus, 8 workers): 5.30, 5.25, 5.17, 5.10, 5.37, 5.25, 5.13, 5.10 ms. Throughput: 1535.82 iter/sec. FFTlen=1024K, Type=3, Arch=4, Pass1=128, Pass2=8192, clm=4 (8 cpus, 1 worker): 0.64 ms. Throughput: 1570.69 iter/sec. FFTlen=1024K, Type=3, Arch=4, Pass1=128, Pass2=8192, clm=4 (8 cpus, 8 workers): 5.63, 5.60, 5.49, 5.45, 5.65, 5.53, 5.49, 5.44 ms. Throughput: 1445.86 iter/sec. FFTlen=1024K, Type=3, Arch=4, Pass1=128, Pass2=8192, clm=2 (8 cpus, 1 worker): 0.63 ms. Throughput: 1586.04 iter/sec. FFTlen=1024K, Type=3, Arch=4, Pass1=128, Pass2=8192, clm=2 (8 cpus, 8 workers): 5.58, 5.50, 5.51, 5.48, 5.59, 5.56, 5.46, 5.41 ms. Throughput: 1452.08 iter/sec. FFTlen=1024K, Type=3, Arch=4, Pass1=128, Pass2=8192, clm=1 (8 cpus, 1 worker): 0.99 ms. Throughput: 1011.52 iter/sec. FFTlen=1024K, Type=3, Arch=4, Pass1=128, Pass2=8192, clm=1 (8 cpus, 8 workers): 5.64, 5.61, 5.55, 5.50, 5.69, 5.65, 5.50, 5.50 ms. Throughput: 1434.09 iter/sec. FFTlen=1024K, Type=3, Arch=4, Pass1=256, Pass2=4096, clm=4 (8 cpus, 1 worker): 0.62 ms. Throughput: 1610.93 iter/sec. FFTlen=1024K, Type=3, Arch=4, Pass1=256, Pass2=4096, clm=4 (8 cpus, 8 workers): 5.31, 5.26, 5.27, 5.25, 5.39, 5.27, 5.28, 5.22 ms. Throughput: 1514.46 iter/sec. FFTlen=1024K, Type=3, Arch=4, Pass1=256, Pass2=4096, clm=2 (8 cpus, 1 worker): 0.61 ms. Throughput: 1644.23 iter/sec. [Sat Apr 29 10:47:50 2017] FFTlen=1024K, Type=3, Arch=4, Pass1=256, Pass2=4096, clm=2 (8 cpus, 8 workers): 5.28, 5.24, 5.16, 5.15, 5.37, 5.30, 5.12, 5.11 ms. Throughput: 1533.94 iter/sec. FFTlen=1024K, Type=3, Arch=4, Pass1=256, Pass2=4096, clm=1 (8 cpus, 1 worker): 0.62 ms. Throughput: 1618.45 iter/sec. FFTlen=1024K, Type=3, Arch=4, Pass1=256, Pass2=4096, clm=1 (8 cpus, 8 workers): 5.33, 5.27, 5.14, 5.10, 5.30, 5.22, 5.10, 5.09 ms. Throughput: 1540.47 iter/sec. FFTlen=1024K, Type=3, Arch=4, Pass1=512, Pass2=2048, clm=4 (8 cpus, 1 worker): 0.67 ms. Throughput: 1503.69 iter/sec. FFTlen=1024K, Type=3, Arch=4, Pass1=512, Pass2=2048, clm=4 (8 cpus, 8 workers): 5.53, 5.39, 5.29, 5.22, 5.50, 5.38, 5.26, 5.23 ms. Throughput: 1496.26 iter/sec. FFTlen=1024K, Type=3, Arch=4, Pass1=512, Pass2=2048, clm=2 (8 cpus, 1 worker): 0.62 ms. Throughput: 1606.11 iter/sec. FFTlen=1024K, Type=3, Arch=4, Pass1=512, Pass2=2048, clm=2 (8 cpus, 8 workers): 5.26, 5.14, 5.15, 5.04, 5.21, 5.18, 5.05, 5.07 ms. Throughput: 1557.39 iter/sec. FFTlen=1024K, Type=3, Arch=4, Pass1=512, Pass2=2048, clm=1 (8 cpus, 1 worker): 0.64 ms. Throughput: 1554.00 iter/sec. FFTlen=1024K, Type=3, Arch=4, Pass1=512, Pass2=2048, clm=1 (8 cpus, 8 workers): 5.25, 5.17, 5.09, 5.05, 5.31, 5.23, 5.06, 4.99 ms. Throughput: 1556.30 iter/sec. FFTlen=1024K, Type=3, Arch=4, Pass1=1024, Pass2=1024, clm=4 (8 cpus, 1 worker): 0.74 ms. Throughput: 1352.83 iter/sec. FFTlen=1024K, Type=3, Arch=4, Pass1=1024, Pass2=1024, clm=4 (8 cpus, 8 workers): 5.88, 5.68, 5.65, 5.59, 5.77, 5.71, 5.58, 5.95 ms. Throughput: 1397.43 iter/sec. FFTlen=1024K, Type=3, Arch=4, Pass1=1024, Pass2=1024, clm=2 (8 cpus, 1 worker): 0.70 ms. Throughput: 1420.32 iter/sec. FFTlen=1024K, Type=3, Arch=4, Pass1=1024, Pass2=1024, clm=2 (8 cpus, 8 workers): 5.44, 5.37, 5.26, 5.21, 5.54, 5.41, 5.24, 5.22 ms. Throughput: 1499.85 iter/sec. FFTlen=1024K, Type=3, Arch=4, Pass1=1024, Pass2=1024, clm=1 (8 cpus, 1 worker): 0.71 ms. Throughput: 1412.75 iter/sec. FFTlen=1024K, Type=3, Arch=4, Pass1=1024, Pass2=1024, clm=1 (8 cpus, 8 workers): 5.36, 5.25, 5.16, 5.16, 5.39, 5.29, 5.10, 5.09 ms. Throughput: 1532.21 iter/sec. FFTlen=1120K, Type=3, Arch=4, Pass1=448, Pass2=2560, clm=4 (8 cpus, 1 worker): 0.69 ms. Throughput: 1441.55 iter/sec. FFTlen=1120K, Type=3, Arch=4, Pass1=448, Pass2=2560, clm=4 (8 cpus, 8 workers): 5.82, 5.73, 5.66, 5.60, 5.85, 5.78, 5.64, 5.58 ms. Throughput: 1402.13 iter/sec. FFTlen=1120K, Type=3, Arch=4, Pass1=448, Pass2=2560, clm=2 (8 cpus, 1 worker): 0.68 ms. Throughput: 1473.99 iter/sec. FFTlen=1120K, Type=3, Arch=4, Pass1=448, Pass2=2560, clm=2 (8 cpus, 8 workers): 5.73, 5.66, 5.58, 5.54, 5.93, 5.71, 5.57, 5.53 ms. Throughput: 1415.32 iter/sec. FFTlen=1120K, Type=3, Arch=4, Pass1=448, Pass2=2560, clm=1 (8 cpus, 1 worker): 0.72 ms. Throughput: 1386.67 iter/sec. FFTlen=1120K, Type=3, Arch=4, Pass1=448, Pass2=2560, clm=1 (8 cpus, 8 workers): 5.74, 5.68, 5.63, 5.60, 5.81, 5.72, 5.60, 5.60 ms. Throughput: 1411.31 iter/sec. FFTlen=1120K, Type=3, Arch=4, Pass1=896, Pass2=1280, clm=4 (8 cpus, 1 worker): 0.78 ms. Throughput: 1274.87 iter/sec. FFTlen=1120K, Type=3, Arch=4, Pass1=896, Pass2=1280, clm=4 (8 cpus, 8 workers): 6.25, 6.18, 6.09, 6.06, 6.32, 6.27, 6.18, 6.17 ms. Throughput: 1292.60 iter/sec. FFTlen=1120K, Type=3, Arch=4, Pass1=896, Pass2=1280, clm=2 (8 cpus, 1 worker): 0.73 ms. Throughput: 1372.08 iter/sec. FFTlen=1120K, Type=3, Arch=4, Pass1=896, Pass2=1280, clm=2 (8 cpus, 8 workers): 5.80, 5.75, 5.68, 5.68, 5.84, 5.82, 5.67, 5.66 ms. Throughput: 1395.10 iter/sec. FFTlen=1120K, Type=3, Arch=4, Pass1=896, Pass2=1280, clm=1 (8 cpus, 1 worker): 0.81 ms. Throughput: 1229.11 iter/sec. FFTlen=1120K, Type=3, Arch=4, Pass1=896, Pass2=1280, clm=1 (8 cpus, 8 workers): 5.71, 5.63, 5.51, 5.51, 7.15, 6.23, 6.05, 5.57 ms. Throughput: 1360.30 iter/sec. FFTlen=1152K, Type=3, Arch=4, Pass1=128, Pass2=9216, clm=4 (8 cpus, 1 worker): 1.81 ms. Throughput: 551.14 iter/sec. FFTlen=1152K, Type=3, Arch=4, Pass1=128, Pass2=9216, clm=4 (8 cpus, 8 workers): 6.84, 6.56, 6.63, 8.09, 6.58, 6.52, 6.23, 6.25 ms. Throughput: 1199.17 iter/sec. FFTlen=1152K, Type=3, Arch=4, Pass1=128, Pass2=9216, clm=2 (8 cpus, 1 worker): 0.76 ms. Throughput: 1313.94 iter/sec. FFTlen=1152K, Type=3, Arch=4, Pass1=128, Pass2=9216, clm=2 (8 cpus, 8 workers): 6.56, 6.42, 6.48, 6.33, 6.62, 6.46, 6.34, 6.41 ms. Throughput: 1240.00 iter/sec. FFTlen=1152K, Type=3, Arch=4, Pass1=128, Pass2=9216, clm=1 (8 cpus, 1 worker): 1.13 ms. Throughput: 883.65 iter/sec. FFTlen=1152K, Type=3, Arch=4, Pass1=128, Pass2=9216, clm=1 (8 cpus, 8 workers): 6.61, 6.68, 6.48, 6.45, 6.59, 6.59, 6.43, 6.43 ms. Throughput: 1225.17 iter/sec. FFTlen=1152K, Type=3, Arch=4, Pass1=256, Pass2=4608, clm=4 (8 cpus, 1 worker): 0.71 ms. Throughput: 1414.26 iter/sec. FFTlen=1152K, Type=3, Arch=4, Pass1=256, Pass2=4608, clm=4 (8 cpus, 8 workers): 6.13, 6.06, 6.00, 5.97, 6.18, 6.08, 5.98, 5.96 ms. Throughput: 1323.63 iter/sec. FFTlen=1152K, Type=3, Arch=4, Pass1=256, Pass2=4608, clm=2 (8 cpus, 1 worker): 0.69 ms. Throughput: 1449.91 iter/sec. [Sat Apr 29 10:52:52 2017] FFTlen=1152K, Type=3, Arch=4, Pass1=256, Pass2=4608, clm=2 (8 cpus, 8 workers): 6.04, 5.95, 5.89, 5.84, 6.08, 6.03, 5.85, 5.81 ms. Throughput: 1347.98 iter/sec. FFTlen=1152K, Type=3, Arch=4, Pass1=256, Pass2=4608, clm=1 (8 cpus, 1 worker): 0.70 ms. Throughput: 1426.88 iter/sec. FFTlen=1152K, Type=3, Arch=4, Pass1=256, Pass2=4608, clm=1 (8 cpus, 8 workers): 5.99, 5.94, 5.91, 5.87, 6.03, 5.97, 5.86, 5.85 ms. Throughput: 1349.31 iter/sec. FFTlen=1152K, Type=3, Arch=4, Pass1=384, Pass2=3072, clm=4 (8 cpus, 1 worker): 0.69 ms. Throughput: 1450.90 iter/sec. FFTlen=1152K, Type=3, Arch=4, Pass1=384, Pass2=3072, clm=4 (8 cpus, 8 workers): 6.00, 5.98, 5.86, 5.81, 6.05, 6.00, 5.82, 5.76 ms. Throughput: 1354.49 iter/sec. FFTlen=1152K, Type=3, Arch=4, Pass1=384, Pass2=3072, clm=2 (8 cpus, 1 worker): 0.70 ms. Throughput: 1422.97 iter/sec. FFTlen=1152K, Type=3, Arch=4, Pass1=384, Pass2=3072, clm=2 (8 cpus, 8 workers): 5.92, 5.88, 5.81, 5.78, 6.01, 5.94, 5.77, 5.77 ms. Throughput: 1365.58 iter/sec. FFTlen=1152K, Type=3, Arch=4, Pass1=384, Pass2=3072, clm=1 (8 cpus, 1 worker): 0.72 ms. Throughput: 1391.70 iter/sec. FFTlen=1152K, Type=3, Arch=4, Pass1=384, Pass2=3072, clm=1 (8 cpus, 8 workers): 5.95, 5.90, 5.83, 5.75, 6.04, 5.96, 5.76, 5.73 ms. Throughput: 1364.40 iter/sec. FFTlen=1152K, Type=3, Arch=4, Pass1=512, Pass2=2304, clm=4 (8 cpus, 1 worker): 0.85 ms. Throughput: 1176.82 iter/sec. FFTlen=1152K, Type=3, Arch=4, Pass1=512, Pass2=2304, clm=4 (8 cpus, 8 workers): 6.38, 6.29, 6.26, 6.20, 6.36, 6.30, 6.14, 6.19 ms. Throughput: 1277.09 iter/sec. FFTlen=1152K, Type=3, Arch=4, Pass1=512, Pass2=2304, clm=2 (8 cpus, 1 worker): 0.71 ms. Throughput: 1409.24 iter/sec. FFTlen=1152K, Type=3, Arch=4, Pass1=512, Pass2=2304, clm=2 (8 cpus, 8 workers): 6.13, 6.08, 5.97, 5.92, 6.21, 6.06, 5.96, 5.93 ms. Throughput: 1327.03 iter/sec. FFTlen=1152K, Type=3, Arch=4, Pass1=512, Pass2=2304, clm=1 (8 cpus, 1 worker): 0.72 ms. Throughput: 1380.30 iter/sec. FFTlen=1152K, Type=3, Arch=4, Pass1=512, Pass2=2304, clm=1 (8 cpus, 8 workers): 6.17, 6.05, 5.99, 5.93, 6.14, 6.07, 5.96, 5.90 ms. Throughput: 1327.81 iter/sec. FFTlen=1152K, Type=3, Arch=4, Pass1=768, Pass2=1536, clm=4 (8 cpus, 1 worker): 0.80 ms. Throughput: 1248.53 iter/sec. FFTlen=1152K, Type=3, Arch=4, Pass1=768, Pass2=1536, clm=4 (8 cpus, 8 workers): 6.27, 6.21, 6.16, 6.05, 6.32, 6.26, 6.12, 6.16 ms. Throughput: 1291.88 iter/sec. FFTlen=1152K, Type=3, Arch=4, Pass1=768, Pass2=1536, clm=2 (8 cpus, 1 worker): 0.73 ms. Throughput: 1370.92 iter/sec. FFTlen=1152K, Type=3, Arch=4, Pass1=768, Pass2=1536, clm=2 (8 cpus, 8 workers): 5.90, 5.84, 5.75, 5.73, 6.00, 5.89, 5.72, 5.80 ms. Throughput: 1372.71 iter/sec. FFTlen=1152K, Type=3, Arch=4, Pass1=768, Pass2=1536, clm=1 (8 cpus, 1 worker): 0.73 ms. Throughput: 1370.14 iter/sec. FFTlen=1152K, Type=3, Arch=4, Pass1=768, Pass2=1536, clm=1 (8 cpus, 8 workers): 5.96, 5.88, 5.75, 5.70, 6.02, 5.87, 5.72, 5.69 ms. Throughput: 1374.00 iter/sec. FFTlen=1200K, Type=3, Arch=4, Pass1=320, Pass2=3840, clm=4 (8 cpus, 1 worker): 0.75 ms. Throughput: 1340.46 iter/sec. FFTlen=1200K, Type=3, Arch=4, Pass1=320, Pass2=3840, clm=4 (8 cpus, 8 workers): 6.32, 6.26, 6.19, 6.13, 6.37, 6.28, 6.16, 6.16 ms. Throughput: 1283.55 iter/sec. FFTlen=1200K, Type=3, Arch=4, Pass1=320, Pass2=3840, clm=2 (8 cpus, 1 worker): 0.73 ms. Throughput: 1379.03 iter/sec. FFTlen=1200K, Type=3, Arch=4, Pass1=320, Pass2=3840, clm=2 (8 cpus, 8 workers): 6.34, 6.29, 6.23, 6.15, 6.42, 6.34, 6.16, 6.15 ms. Throughput: 1278.03 iter/sec. FFTlen=1200K, Type=3, Arch=4, Pass1=320, Pass2=3840, clm=1 (8 cpus, 1 worker): 0.76 ms. Throughput: 1320.86 iter/sec. FFTlen=1200K, Type=3, Arch=4, Pass1=320, Pass2=3840, clm=1 (8 cpus, 8 workers): 6.32, 6.32, 6.22, 6.17, 6.39, 6.33, 6.17, 6.16 ms. Throughput: 1278.68 iter/sec. FFTlen=1280K, Type=3, Arch=4, Pass1=128, Pass2=10240, clm=4 (8 cpus, 1 worker): 0.86 ms. Throughput: 1162.62 iter/sec. FFTlen=1280K, Type=3, Arch=4, Pass1=128, Pass2=10240, clm=4 (8 cpus, 8 workers): 7.22, 7.16, 7.16, 7.03, 7.22, 7.06, 7.03, 7.09 ms. Throughput: 1123.51 iter/sec. FFTlen=1280K, Type=3, Arch=4, Pass1=128, Pass2=10240, clm=2 (8 cpus, 1 worker): 0.82 ms. Throughput: 1213.77 iter/sec. FFTlen=1280K, Type=3, Arch=4, Pass1=128, Pass2=10240, clm=2 (8 cpus, 8 workers): 7.27, 7.13, 6.99, 6.95, 7.32, 7.16, 6.96, 6.93 ms. Throughput: 1129.00 iter/sec. FFTlen=1280K, Type=3, Arch=4, Pass1=128, Pass2=10240, clm=1 (8 cpus, 1 worker): 1.23 ms. Throughput: 812.21 iter/sec. FFTlen=1280K, Type=3, Arch=4, Pass1=128, Pass2=10240, clm=1 (8 cpus, 8 workers): 7.22, 7.16, 7.04, 6.99, 7.30, 7.18, 7.01, 7.01 ms. Throughput: 1124.79 iter/sec. FFTlen=1280K, Type=3, Arch=4, Pass1=256, Pass2=5120, clm=4 (8 cpus, 1 worker): 0.77 ms. Throughput: 1296.82 iter/sec. FFTlen=1280K, Type=3, Arch=4, Pass1=256, Pass2=5120, clm=4 (8 cpus, 8 workers): 6.87, 6.82, 6.77, 6.67, 6.98, 6.81, 6.70, 6.68 ms. Throughput: 1179.31 iter/sec. FFTlen=1280K, Type=3, Arch=4, Pass1=256, Pass2=5120, clm=2 (8 cpus, 1 worker): 0.77 ms. Throughput: 1292.37 iter/sec. [Sat Apr 29 10:57:57 2017] FFTlen=1280K, Type=3, Arch=4, Pass1=256, Pass2=5120, clm=2 (8 cpus, 8 workers): 6.74, 6.68, 6.61, 6.52, 6.81, 6.70, 6.52, 6.52 ms. Throughput: 1205.41 iter/sec. FFTlen=1280K, Type=3, Arch=4, Pass1=256, Pass2=5120, clm=1 (8 cpus, 1 worker): 0.77 ms. Throughput: 1294.52 iter/sec. FFTlen=1280K, Type=3, Arch=4, Pass1=256, Pass2=5120, clm=1 (8 cpus, 8 workers): 6.61, 6.61, 6.54, 6.45, 6.61, 6.59, 6.47, 6.44 ms. Throughput: 1223.23 iter/sec. FFTlen=1280K, Type=3, Arch=4, Pass1=320, Pass2=4096, clm=4 (8 cpus, 1 worker): 0.79 ms. Throughput: 1265.27 iter/sec. FFTlen=1280K, Type=3, Arch=4, Pass1=320, Pass2=4096, clm=4 (8 cpus, 8 workers): 6.71, 6.67, 6.54, 6.47, 6.88, 6.80, 6.45, 6.44 ms. Throughput: 1209.35 iter/sec. FFTlen=1280K, Type=3, Arch=4, Pass1=320, Pass2=4096, clm=2 (8 cpus, 1 worker): 0.76 ms. Throughput: 1318.12 iter/sec. FFTlen=1280K, Type=3, Arch=4, Pass1=320, Pass2=4096, clm=2 (8 cpus, 8 workers): 6.75, 6.72, 6.51, 6.49, 6.81, 6.60, 6.52, 6.55 ms. Throughput: 1209.08 iter/sec. FFTlen=1280K, Type=3, Arch=4, Pass1=320, Pass2=4096, clm=1 (8 cpus, 1 worker): 0.79 ms. Throughput: 1272.98 iter/sec. FFTlen=1280K, Type=3, Arch=4, Pass1=320, Pass2=4096, clm=1 (8 cpus, 8 workers): 6.69, 6.66, 6.54, 6.44, 6.74, 6.69, 6.50, 6.48 ms. Throughput: 1213.59 iter/sec. FFTlen=1280K, Type=3, Arch=4, Pass1=512, Pass2=2560, clm=4 (8 cpus, 1 worker): 0.85 ms. Throughput: 1173.63 iter/sec. FFTlen=1280K, Type=3, Arch=4, Pass1=512, Pass2=2560, clm=4 (8 cpus, 8 workers): 6.90, 6.80, 6.72, 6.70, 6.93, 6.89, 6.71, 6.70 ms. Throughput: 1177.88 iter/sec. FFTlen=1280K, Type=3, Arch=4, Pass1=512, Pass2=2560, clm=2 (8 cpus, 1 worker): 0.76 ms. Throughput: 1308.25 iter/sec. FFTlen=1280K, Type=3, Arch=4, Pass1=512, Pass2=2560, clm=2 (8 cpus, 8 workers): 6.65, 6.61, 6.46, 6.44, 6.73, 6.65, 6.53, 6.48 ms. Throughput: 1217.88 iter/sec. FFTlen=1280K, Type=3, Arch=4, Pass1=512, Pass2=2560, clm=1 (8 cpus, 1 worker): 0.78 ms. Throughput: 1283.99 iter/sec. FFTlen=1280K, Type=3, Arch=4, Pass1=512, Pass2=2560, clm=1 (8 cpus, 8 workers): 6.66, 6.58, 6.49, 6.49, 6.63, 6.54, 6.46, 6.42 ms. Throughput: 1224.47 iter/sec. FFTlen=1280K, Type=3, Arch=4, Pass1=640, Pass2=2048, clm=4 (8 cpus, 1 worker): 0.84 ms. Throughput: 1195.89 iter/sec. FFTlen=1280K, Type=3, Arch=4, Pass1=640, Pass2=2048, clm=4 (8 cpus, 8 workers): 6.96, 6.88, 6.77, 6.70, 7.02, 6.93, 6.68, 6.69 ms. Throughput: 1171.87 iter/sec. FFTlen=1280K, Type=3, Arch=4, Pass1=640, Pass2=2048, clm=2 (8 cpus, 1 worker): 0.79 ms. Throughput: 1259.74 iter/sec. FFTlen=1280K, Type=3, Arch=4, Pass1=640, Pass2=2048, clm=2 (8 cpus, 8 workers): 6.66, 6.54, 6.46, 6.42, 6.73, 6.58, 6.43, 6.45 ms. Throughput: 1224.95 iter/sec. FFTlen=1280K, Type=3, Arch=4, Pass1=640, Pass2=2048, clm=1 (8 cpus, 1 worker): 0.79 ms. Throughput: 1266.93 iter/sec. FFTlen=1280K, Type=3, Arch=4, Pass1=640, Pass2=2048, clm=1 (8 cpus, 8 workers): 6.62, 6.56, 6.41, 6.36, 6.73, 6.63, 6.36, 6.45 ms. Throughput: 1228.18 iter/sec. FFTlen=1280K, Type=3, Arch=4, Pass1=1024, Pass2=1280, clm=4 (8 cpus, 1 worker): 0.92 ms. Throughput: 1081.98 iter/sec. FFTlen=1280K, Type=3, Arch=4, Pass1=1024, Pass2=1280, clm=4 (8 cpus, 8 workers): 7.28, 7.22, 7.23, 7.05, 7.34, 7.24, 7.04, 7.08 ms. Throughput: 1113.66 iter/sec. FFTlen=1280K, Type=3, Arch=4, Pass1=1024, Pass2=1280, clm=2 (8 cpus, 1 worker): 0.84 ms. Throughput: 1196.65 iter/sec. FFTlen=1280K, Type=3, Arch=4, Pass1=1024, Pass2=1280, clm=2 (8 cpus, 8 workers): 6.77, 6.67, 6.65, 6.49, 6.80, 6.65, 6.54, 6.55 ms. Throughput: 1205.45 iter/sec. FFTlen=1280K, Type=3, Arch=4, Pass1=1024, Pass2=1280, clm=1 (8 cpus, 1 worker): 0.84 ms. Throughput: 1190.69 iter/sec. FFTlen=1280K, Type=3, Arch=4, Pass1=1024, Pass2=1280, clm=1 (8 cpus, 8 workers): 6.59, 6.53, 6.46, 6.45, 6.69, 6.56, 6.42, 6.40 ms. Throughput: 1229.13 iter/sec. FFTlen=1280K, Type=3, Arch=4, Pass1=1280, Pass2=1024, clm=4 (8 cpus, 1 worker): 0.90 ms. Throughput: 1115.20 iter/sec. FFTlen=1280K, Type=3, Arch=4, Pass1=1280, Pass2=1024, clm=4 (8 cpus, 8 workers): 7.29, 7.14, 7.05, 6.94, 7.24, 7.21, 7.34, 7.17 ms. Throughput: 1115.90 iter/sec. FFTlen=1280K, Type=3, Arch=4, Pass1=1280, Pass2=1024, clm=2 (8 cpus, 1 worker): 0.85 ms. Throughput: 1171.69 iter/sec. FFTlen=1280K, Type=3, Arch=4, Pass1=1280, Pass2=1024, clm=2 (8 cpus, 8 workers): 6.76, 6.78, 6.72, 6.59, 6.87, 6.70, 6.68, 6.61 ms. Throughput: 1191.98 iter/sec. FFTlen=1280K, Type=3, Arch=4, Pass1=1280, Pass2=1024, clm=1 (8 cpus, 1 worker): 0.84 ms. Throughput: 1184.93 iter/sec. FFTlen=1280K, Type=3, Arch=4, Pass1=1280, Pass2=1024, clm=1 (8 cpus, 8 workers): 6.73, 6.66, 6.51, 6.51, 6.88, 6.68, 6.57, 6.52 ms. Throughput: 1206.78 iter/sec. FFTlen=1344K, Type=3, Arch=4, Pass1=448, Pass2=3072, clm=4 (8 cpus, 1 worker): 0.82 ms. Throughput: 1215.47 iter/sec. FFTlen=1344K, Type=3, Arch=4, Pass1=448, Pass2=3072, clm=4 (8 cpus, 8 workers): 7.09, 7.05, 6.95, 6.92, 7.19, 7.06, 6.91, 6.90 ms. Throughput: 1141.66 iter/sec. FFTlen=1344K, Type=3, Arch=4, Pass1=448, Pass2=3072, clm=2 (8 cpus, 1 worker): 0.84 ms. Throughput: 1193.09 iter/sec. [Sat Apr 29 11:03:00 2017] FFTlen=1344K, Type=3, Arch=4, Pass1=448, Pass2=3072, clm=2 (8 cpus, 8 workers): 7.10, 7.02, 6.92, 6.90, 7.16, 7.08, 6.85, 6.83 ms. Throughput: 1145.99 iter/sec. FFTlen=1344K, Type=3, Arch=4, Pass1=448, Pass2=3072, clm=1 (8 cpus, 1 worker): 0.87 ms. Throughput: 1144.46 iter/sec. FFTlen=1344K, Type=3, Arch=4, Pass1=448, Pass2=3072, clm=1 (8 cpus, 8 workers): 7.07, 7.01, 6.94, 6.87, 7.09, 7.11, 6.90, 6.86 ms. Throughput: 1146.16 iter/sec. FFTlen=1344K, Type=3, Arch=4, Pass1=896, Pass2=1536, clm=4 (8 cpus, 1 worker): 0.94 ms. Throughput: 1066.76 iter/sec. FFTlen=1344K, Type=3, Arch=4, Pass1=896, Pass2=1536, clm=4 (8 cpus, 8 workers): 7.55, 7.49, 7.27, 7.19, 7.60, 7.39, 7.30, 7.24 ms. Throughput: 1084.62 iter/sec. FFTlen=1344K, Type=3, Arch=4, Pass1=896, Pass2=1536, clm=2 (8 cpus, 1 worker): 0.87 ms. Throughput: 1150.02 iter/sec. FFTlen=1344K, Type=3, Arch=4, Pass1=896, Pass2=1536, clm=2 (8 cpus, 8 workers): 7.21, 7.04, 6.93, 6.83, 7.29, 7.01, 6.83, 6.81 ms. Throughput: 1144.42 iter/sec. FFTlen=1344K, Type=3, Arch=4, Pass1=896, Pass2=1536, clm=1 (8 cpus, 1 worker): 0.86 ms. Throughput: 1168.24 iter/sec. FFTlen=1344K, Type=3, Arch=4, Pass1=896, Pass2=1536, clm=1 (8 cpus, 8 workers): 6.98, 7.01, 6.85, 6.80, 7.10, 7.00, 6.79, 6.82 ms. Throughput: 1156.70 iter/sec. FFTlen=1440K, Type=3, Arch=4, Pass1=320, Pass2=4608, clm=4 (8 cpus, 1 worker): 0.95 ms. Throughput: 1050.40 iter/sec. FFTlen=1440K, Type=3, Arch=4, Pass1=320, Pass2=4608, clm=4 (8 cpus, 8 workers): 7.67, 7.65, 7.47, 7.45, 7.72, 7.71, 7.39, 7.43 ms. Throughput: 1058.47 iter/sec. FFTlen=1440K, Type=3, Arch=4, Pass1=320, Pass2=4608, clm=2 (8 cpus, 1 worker): 0.86 ms. Throughput: 1156.93 iter/sec. FFTlen=1440K, Type=3, Arch=4, Pass1=320, Pass2=4608, clm=2 (8 cpus, 8 workers): 7.58, 7.53, 7.46, 7.57, 7.71, 7.61, 7.58, 7.43 ms. Throughput: 1058.34 iter/sec. FFTlen=1440K, Type=3, Arch=4, Pass1=320, Pass2=4608, clm=1 (8 cpus, 1 worker): 0.90 ms. Throughput: 1117.28 iter/sec. FFTlen=1440K, Type=3, Arch=4, Pass1=320, Pass2=4608, clm=1 (8 cpus, 8 workers): 7.65, 7.62, 7.51, 7.43, 7.77, 7.56, 7.42, 7.42 ms. Throughput: 1060.19 iter/sec. FFTlen=1440K, Type=3, Arch=4, Pass1=384, Pass2=3840, clm=4 (8 cpus, 1 worker): 0.89 ms. Throughput: 1119.84 iter/sec. FFTlen=1440K, Type=3, Arch=4, Pass1=384, Pass2=3840, clm=4 (8 cpus, 8 workers): 7.62, 7.54, 7.47, 7.44, 7.66, 7.60, 7.44, 7.42 ms. Throughput: 1063.58 iter/sec. FFTlen=1440K, Type=3, Arch=4, Pass1=384, Pass2=3840, clm=2 (8 cpus, 1 worker): 0.87 ms. Throughput: 1148.73 iter/sec. FFTlen=1440K, Type=3, Arch=4, Pass1=384, Pass2=3840, clm=2 (8 cpus, 8 workers): 7.61, 7.56, 7.47, 7.39, 7.70, 7.63, 7.40, 7.39 ms. Throughput: 1064.30 iter/sec. FFTlen=1440K, Type=3, Arch=4, Pass1=384, Pass2=3840, clm=1 (8 cpus, 1 worker): 0.90 ms. Throughput: 1112.67 iter/sec. FFTlen=1440K, Type=3, Arch=4, Pass1=384, Pass2=3840, clm=1 (8 cpus, 8 workers): 7.60, 7.56, 7.46, 7.41, 7.68, 7.61, 7.40, 7.35 ms. Throughput: 1065.76 iter/sec. FFTlen=1440K, Type=3, Arch=4, Pass1=640, Pass2=2304, clm=4 (8 cpus, 1 worker): 1.00 ms. Throughput: 1003.35 iter/sec. FFTlen=1440K, Type=3, Arch=4, Pass1=640, Pass2=2304, clm=4 (8 cpus, 8 workers): 8.14, 8.06, 7.88, 7.83, 8.19, 8.13, 7.87, 7.81 ms. Throughput: 1001.85 iter/sec. FFTlen=1440K, Type=3, Arch=4, Pass1=640, Pass2=2304, clm=2 (8 cpus, 1 worker): 0.91 ms. Throughput: 1102.81 iter/sec. FFTlen=1440K, Type=3, Arch=4, Pass1=640, Pass2=2304, clm=2 (8 cpus, 8 workers): 7.76, 7.63, 7.61, 7.48, 7.84, 7.71, 7.53, 7.50 ms. Throughput: 1048.57 iter/sec. FFTlen=1440K, Type=3, Arch=4, Pass1=640, Pass2=2304, clm=1 (8 cpus, 1 worker): 0.90 ms. Throughput: 1112.88 iter/sec. FFTlen=1440K, Type=3, Arch=4, Pass1=640, Pass2=2304, clm=1 (8 cpus, 8 workers): 7.68, 7.62, 7.57, 7.50, 7.72, 7.64, 7.53, 7.56 ms. Throughput: 1052.34 iter/sec. FFTlen=1536K, Type=3, Arch=4, Pass1=128, Pass2=12288, clm=4 (8 cpus, 1 worker): 1.09 ms. Throughput: 917.36 iter/sec. FFTlen=1536K, Type=3, Arch=4, Pass1=128, Pass2=12288, clm=4 (8 cpus, 8 workers): 9.12, 9.08, 8.89, 8.83, 9.23, 9.05, 8.80, 8.84 ms. Throughput: 891.03 iter/sec. FFTlen=1536K, Type=3, Arch=4, Pass1=128, Pass2=12288, clm=2 (8 cpus, 1 worker): 1.07 ms. Throughput: 936.42 iter/sec. FFTlen=1536K, Type=3, Arch=4, Pass1=128, Pass2=12288, clm=2 (8 cpus, 8 workers): 9.06, 8.91, 8.80, 8.91, 9.20, 8.99, 8.76, 8.75 ms. Throughput: 896.75 iter/sec. FFTlen=1536K, Type=3, Arch=4, Pass1=128, Pass2=12288, clm=1 (8 cpus, 1 worker): 1.53 ms. Throughput: 653.31 iter/sec. FFTlen=1536K, Type=3, Arch=4, Pass1=128, Pass2=12288, clm=1 (8 cpus, 8 workers): 9.08, 9.02, 8.92, 8.86, 9.14, 9.08, 9.05, 8.75 ms. Throughput: 890.42 iter/sec. FFTlen=1536K, Type=3, Arch=4, Pass1=256, Pass2=6144, clm=4 (8 cpus, 1 worker): 0.92 ms. Throughput: 1086.76 iter/sec. FFTlen=1536K, Type=3, Arch=4, Pass1=256, Pass2=6144, clm=4 (8 cpus, 8 workers): 8.28, 8.21, 8.08, 8.02, 8.43, 8.14, 8.01, 8.00 ms. Throughput: 982.37 iter/sec. FFTlen=1536K, Type=3, Arch=4, Pass1=256, Pass2=6144, clm=2 (8 cpus, 1 worker): 0.91 ms. Throughput: 1103.49 iter/sec. [Sat Apr 29 11:08:07 2017] FFTlen=1536K, Type=3, Arch=4, Pass1=256, Pass2=6144, clm=2 (8 cpus, 8 workers): 8.13, 8.04, 7.91, 7.82, 8.19, 8.02, 7.91, 7.84 ms. Throughput: 1002.26 iter/sec. FFTlen=1536K, Type=3, Arch=4, Pass1=256, Pass2=6144, clm=1 (8 cpus, 1 worker): 0.91 ms. Throughput: 1095.94 iter/sec. FFTlen=1536K, Type=3, Arch=4, Pass1=256, Pass2=6144, clm=1 (8 cpus, 8 workers): 8.22, 8.17, 7.95, 7.72, 8.28, 8.08, 8.08, 7.71 ms. Throughput: 997.51 iter/sec. FFTlen=1536K, Type=3, Arch=4, Pass1=384, Pass2=4096, clm=4 (8 cpus, 1 worker): 0.91 ms. Throughput: 1097.07 iter/sec. FFTlen=1536K, Type=3, Arch=4, Pass1=384, Pass2=4096, clm=4 (8 cpus, 8 workers): 8.07, 8.00, 7.93, 7.85, 8.17, 8.08, 7.84, 7.82 ms. Throughput: 1003.88 iter/sec. FFTlen=1536K, Type=3, Arch=4, Pass1=384, Pass2=4096, clm=2 (8 cpus, 1 worker): 0.91 ms. Throughput: 1096.82 iter/sec. FFTlen=1536K, Type=3, Arch=4, Pass1=384, Pass2=4096, clm=2 (8 cpus, 8 workers): 7.97, 7.93, 7.83, 7.77, 8.16, 7.99, 7.95, 7.79 ms. Throughput: 1010.01 iter/sec. FFTlen=1536K, Type=3, Arch=4, Pass1=384, Pass2=4096, clm=1 (8 cpus, 1 worker): 0.93 ms. Throughput: 1080.42 iter/sec. FFTlen=1536K, Type=3, Arch=4, Pass1=384, Pass2=4096, clm=1 (8 cpus, 8 workers): 8.07, 7.99, 7.92, 7.82, 8.12, 8.06, 7.82, 7.81 ms. Throughput: 1006.42 iter/sec. FFTlen=1536K, Type=3, Arch=4, Pass1=512, Pass2=3072, clm=4 (8 cpus, 1 worker): 1.01 ms. Throughput: 993.70 iter/sec. FFTlen=1536K, Type=3, Arch=4, Pass1=512, Pass2=3072, clm=4 (8 cpus, 8 workers): 8.56, 8.40, 8.29, 8.26, 8.62, 8.38, 8.22, 8.17 ms. Throughput: 956.88 iter/sec. FFTlen=1536K, Type=3, Arch=4, Pass1=512, Pass2=3072, clm=2 (8 cpus, 1 worker): 0.92 ms. Throughput: 1084.79 iter/sec. FFTlen=1536K, Type=3, Arch=4, Pass1=512, Pass2=3072, clm=2 (8 cpus, 8 workers): 8.15, 8.04, 7.99, 7.91, 8.21, 8.13, 7.93, 7.91 ms. Throughput: 995.87 iter/sec. FFTlen=1536K, Type=3, Arch=4, Pass1=512, Pass2=3072, clm=1 (8 cpus, 1 worker): 0.92 ms. Throughput: 1083.25 iter/sec. FFTlen=1536K, Type=3, Arch=4, Pass1=512, Pass2=3072, clm=1 (8 cpus, 8 workers): 8.20, 8.11, 8.06, 7.96, 8.21, 8.19, 7.97, 7.94 ms. Throughput: 990.58 iter/sec. FFTlen=1536K, Type=3, Arch=4, Pass1=768, Pass2=2048, clm=4 (8 cpus, 1 worker): 1.01 ms. Throughput: 994.97 iter/sec. FFTlen=1536K, Type=3, Arch=4, Pass1=768, Pass2=2048, clm=4 (8 cpus, 8 workers): 8.37, 8.28, 8.11, 8.10, 8.44, 8.31, 8.19, 8.18 ms. Throughput: 970.31 iter/sec. FFTlen=1536K, Type=3, Arch=4, Pass1=768, Pass2=2048, clm=2 (8 cpus, 1 worker): 0.94 ms. Throughput: 1061.33 iter/sec. FFTlen=1536K, Type=3, Arch=4, Pass1=768, Pass2=2048, clm=2 (8 cpus, 8 workers): 8.01, 7.98, 7.85, 7.75, 8.14, 8.02, 7.76, 7.73 ms. Throughput: 1012.57 iter/sec. FFTlen=1536K, Type=3, Arch=4, Pass1=768, Pass2=2048, clm=1 (8 cpus, 1 worker): 0.95 ms. Throughput: 1048.68 iter/sec. FFTlen=1536K, Type=3, Arch=4, Pass1=768, Pass2=2048, clm=1 (8 cpus, 8 workers): 7.98, 7.86, 7.75, 7.74, 8.13, 7.84, 7.82, 7.69 ms. Throughput: 1019.28 iter/sec. FFTlen=1536K, Type=3, Arch=4, Pass1=1024, Pass2=1536, clm=4 (8 cpus, 1 worker): 1.10 ms. Throughput: 910.38 iter/sec. FFTlen=1536K, Type=3, Arch=4, Pass1=1024, Pass2=1536, clm=4 (8 cpus, 8 workers): 8.69, 8.68, 8.45, 8.52, 8.85, 8.57, 8.39, 8.68 ms. Throughput: 930.01 iter/sec. FFTlen=1536K, Type=3, Arch=4, Pass1=1024, Pass2=1536, clm=2 (8 cpus, 1 worker): 1.00 ms. Throughput: 997.57 iter/sec. FFTlen=1536K, Type=3, Arch=4, Pass1=1024, Pass2=1536, clm=2 (8 cpus, 8 workers): 8.16, 8.05, 7.96, 7.88, 8.21, 8.09, 7.97, 7.86 ms. Throughput: 997.60 iter/sec. FFTlen=1536K, Type=3, Arch=4, Pass1=1024, Pass2=1536, clm=1 (8 cpus, 1 worker): 0.97 ms. Throughput: 1027.72 iter/sec. FFTlen=1536K, Type=3, Arch=4, Pass1=1024, Pass2=1536, clm=1 (8 cpus, 8 workers): 8.12, 7.92, 7.86, 7.82, 8.26, 8.06, 7.75, 7.79 ms. Throughput: 1007.26 iter/sec. FFTlen=1536K, Type=3, Arch=4, Pass1=1536, Pass2=1024, clm=4 (8 cpus, 1 worker): 1.10 ms. Throughput: 904.99 iter/sec. FFTlen=1536K, Type=3, Arch=4, Pass1=1536, Pass2=1024, clm=4 (8 cpus, 8 workers): 9.02, 8.80, 8.76, 8.71, 9.07, 8.86, 8.70, 8.72 ms. Throughput: 906.22 iter/sec. FFTlen=1536K, Type=3, Arch=4, Pass1=1536, Pass2=1024, clm=2 (8 cpus, 1 worker): 1.02 ms. Throughput: 978.18 iter/sec. FFTlen=1536K, Type=3, Arch=4, Pass1=1536, Pass2=1024, clm=2 (8 cpus, 8 workers): 8.44, 8.28, 8.20, 8.26, 8.54, 8.29, 8.33, 8.42 ms. Throughput: 958.79 iter/sec. FFTlen=1536K, Type=3, Arch=4, Pass1=1536, Pass2=1024, clm=1 (8 cpus, 1 worker): 1.03 ms. Throughput: 971.18 iter/sec. FFTlen=1536K, Type=3, Arch=4, Pass1=1536, Pass2=1024, clm=1 (8 cpus, 8 workers): 8.20, 8.23, 8.01, 7.95, 8.26, 8.24, 7.94, 8.01 ms. Throughput: 987.28 iter/sec. FFTlen=1600K, Type=3, Arch=4, Pass1=128, Pass2=12800, clm=4 (8 cpus, 1 worker): 1.06 ms. Throughput: 944.69 iter/sec. FFTlen=1600K, Type=3, Arch=4, Pass1=128, Pass2=12800, clm=4 (8 cpus, 8 workers): 9.14, 9.04, 8.83, 8.87, 9.16, 9.06, 8.91, 8.87 ms. Throughput: 890.38 iter/sec. FFTlen=1600K, Type=3, Arch=4, Pass1=128, Pass2=12800, clm=2 (8 cpus, 1 worker): 1.05 ms. Throughput: 948.03 iter/sec. [Sat Apr 29 11:13:13 2017] FFTlen=1600K, Type=3, Arch=4, Pass1=128, Pass2=12800, clm=2 (8 cpus, 8 workers): 9.09, 8.99, 8.83, 8.81, 9.13, 9.00, 8.82, 8.79 ms. Throughput: 895.58 iter/sec. FFTlen=1600K, Type=3, Arch=4, Pass1=128, Pass2=12800, clm=1 (8 cpus, 1 worker): 1.55 ms. Throughput: 645.02 iter/sec. FFTlen=1600K, Type=3, Arch=4, Pass1=128, Pass2=12800, clm=1 (8 cpus, 8 workers): 9.12, 9.09, 8.92, 8.91, 9.30, 9.11, 8.82, 8.84 ms. Throughput: 887.92 iter/sec. FFTlen=1600K, Type=3, Arch=4, Pass1=256, Pass2=6400, clm=4 (8 cpus, 1 worker): 1.01 ms. Throughput: 994.58 iter/sec. FFTlen=1600K, Type=3, Arch=4, Pass1=256, Pass2=6400, clm=4 (8 cpus, 8 workers): 8.80, 8.76, 8.64, 8.62, 8.87, 8.81, 8.60, 8.59 ms. Throughput: 918.44 iter/sec. FFTlen=1600K, Type=3, Arch=4, Pass1=256, Pass2=6400, clm=2 (8 cpus, 1 worker): 0.98 ms. Throughput: 1022.71 iter/sec. FFTlen=1600K, Type=3, Arch=4, Pass1=256, Pass2=6400, clm=2 (8 cpus, 8 workers): 8.66, 8.58, 8.48, 8.40, 8.75, 8.65, 8.43, 8.36 ms. Throughput: 937.09 iter/sec. FFTlen=1600K, Type=3, Arch=4, Pass1=256, Pass2=6400, clm=1 (8 cpus, 1 worker): 1.07 ms. Throughput: 930.58 iter/sec. FFTlen=1600K, Type=3, Arch=4, Pass1=256, Pass2=6400, clm=1 (8 cpus, 8 workers): 8.63, 8.60, 8.49, 8.41, 8.72, 8.66, 8.45, 8.40 ms. Throughput: 936.46 iter/sec. FFTlen=1600K, Type=3, Arch=4, Pass1=320, Pass2=5120, clm=4 (8 cpus, 1 worker): 0.95 ms. Throughput: 1049.68 iter/sec. FFTlen=1600K, Type=3, Arch=4, Pass1=320, Pass2=5120, clm=4 (8 cpus, 8 workers): 8.62, 8.44, 8.32, 8.30, 8.57, 8.44, 8.34, 8.30 ms. Throughput: 950.70 iter/sec. FFTlen=1600K, Type=3, Arch=4, Pass1=320, Pass2=5120, clm=2 (8 cpus, 1 worker): 0.96 ms. Throughput: 1047.05 iter/sec. FFTlen=1600K, Type=3, Arch=4, Pass1=320, Pass2=5120, clm=2 (8 cpus, 8 workers): 8.49, 8.48, 8.34, 8.24, 8.61, 8.48, 8.25, 8.24 ms. Throughput: 953.63 iter/sec. FFTlen=1600K, Type=3, Arch=4, Pass1=320, Pass2=5120, clm=1 (8 cpus, 1 worker): 1.00 ms. Throughput: 1002.91 iter/sec. FFTlen=1600K, Type=3, Arch=4, Pass1=320, Pass2=5120, clm=1 (8 cpus, 8 workers): 8.45, 8.44, 8.32, 8.29, 8.48, 8.43, 8.33, 8.31 ms. Throughput: 954.45 iter/sec. FFTlen=1600K, Type=3, Arch=4, Pass1=640, Pass2=2560, clm=4 (8 cpus, 1 worker): 1.07 ms. Throughput: 937.94 iter/sec. FFTlen=1600K, Type=3, Arch=4, Pass1=640, Pass2=2560, clm=4 (8 cpus, 8 workers): 8.80, 8.72, 8.61, 8.55, 8.87, 8.79, 8.55, 8.50 ms. Throughput: 922.67 iter/sec. FFTlen=1600K, Type=3, Arch=4, Pass1=640, Pass2=2560, clm=2 (8 cpus, 1 worker): 0.97 ms. Throughput: 1031.81 iter/sec. FFTlen=1600K, Type=3, Arch=4, Pass1=640, Pass2=2560, clm=2 (8 cpus, 8 workers): 8.58, 8.37, 8.23, 8.13, 8.49, 8.40, 8.21, 8.12 ms. Throughput: 962.33 iter/sec. FFTlen=1600K, Type=3, Arch=4, Pass1=640, Pass2=2560, clm=1 (8 cpus, 1 worker): 0.97 ms. Throughput: 1035.27 iter/sec. FFTlen=1600K, Type=3, Arch=4, Pass1=640, Pass2=2560, clm=1 (8 cpus, 8 workers): 8.36, 8.30, 8.20, 8.18, 8.38, 8.36, 8.20, 8.16 ms. Throughput: 967.69 iter/sec. FFTlen=1600K, Type=3, Arch=4, Pass1=1280, Pass2=1280, clm=4 (8 cpus, 1 worker): 1.13 ms. Throughput: 887.01 iter/sec. FFTlen=1600K, Type=3, Arch=4, Pass1=1280, Pass2=1280, clm=4 (8 cpus, 8 workers): 9.14, 9.08, 8.89, 8.82, 9.29, 9.04, 8.87, 8.93 ms. Throughput: 888.34 iter/sec. FFTlen=1600K, Type=3, Arch=4, Pass1=1280, Pass2=1280, clm=2 (8 cpus, 1 worker): 1.04 ms. Throughput: 961.22 iter/sec. FFTlen=1600K, Type=3, Arch=4, Pass1=1280, Pass2=1280, clm=2 (8 cpus, 8 workers): 8.78, 8.49, 8.36, 8.38, 8.58, 8.50, 8.34, 8.31 ms. Throughput: 945.13 iter/sec. FFTlen=1600K, Type=3, Arch=4, Pass1=1280, Pass2=1280, clm=1 (8 cpus, 1 worker): 1.03 ms. Throughput: 972.34 iter/sec. FFTlen=1600K, Type=3, Arch=4, Pass1=1280, Pass2=1280, clm=1 (8 cpus, 8 workers): 8.48, 8.36, 8.24, 8.20, 8.51, 8.42, 8.22, 8.09 ms. Throughput: 962.24 iter/sec. FFTlen=1680K, Type=3, Arch=4, Pass1=448, Pass2=3840, clm=4 (8 cpus, 1 worker): 1.05 ms. Throughput: 950.82 iter/sec. FFTlen=1680K, Type=3, Arch=4, Pass1=448, Pass2=3840, clm=4 (8 cpus, 8 workers): 9.06, 8.96, 8.91, 8.84, 9.18, 9.06, 8.86, 8.92 ms. Throughput: 891.50 iter/sec. FFTlen=1680K, Type=3, Arch=4, Pass1=448, Pass2=3840, clm=2 (8 cpus, 1 worker): 1.03 ms. Throughput: 970.89 iter/sec. FFTlen=1680K, Type=3, Arch=4, Pass1=448, Pass2=3840, clm=2 (8 cpus, 8 workers): 9.01, 8.95, 8.87, 8.78, 9.10, 9.03, 8.79, 8.75 ms. Throughput: 897.97 iter/sec. FFTlen=1680K, Type=3, Arch=4, Pass1=448, Pass2=3840, clm=1 (8 cpus, 1 worker): 1.07 ms. Throughput: 935.38 iter/sec. FFTlen=1680K, Type=3, Arch=4, Pass1=448, Pass2=3840, clm=1 (8 cpus, 8 workers): 9.07, 8.97, 8.87, 8.77, 9.13, 9.05, 8.78, 8.77 ms. Throughput: 896.38 iter/sec. FFTlen=1728K, Type=3, Arch=4, Pass1=384, Pass2=4608, clm=4 (8 cpus, 1 worker): 1.06 ms. Throughput: 947.32 iter/sec. FFTlen=1728K, Type=3, Arch=4, Pass1=384, Pass2=4608, clm=4 (8 cpus, 8 workers): 9.06, 9.41, 9.31, 9.15, 9.13, 9.05, 8.83, 9.15 ms. Throughput: 875.78 iter/sec. FFTlen=1728K, Type=3, Arch=4, Pass1=384, Pass2=4608, clm=2 (8 cpus, 1 worker): 1.04 ms. Throughput: 963.82 iter/sec. [Sat Apr 29 11:18:21 2017] FFTlen=1728K, Type=3, Arch=4, Pass1=384, Pass2=4608, clm=2 (8 cpus, 8 workers): 9.18, 9.08, 8.98, 8.90, 9.32, 9.24, 8.88, 9.03 ms. Throughput: 881.71 iter/sec. FFTlen=1728K, Type=3, Arch=4, Pass1=384, Pass2=4608, clm=1 (8 cpus, 1 worker): 1.06 ms. Throughput: 939.14 iter/sec. FFTlen=1728K, Type=3, Arch=4, Pass1=384, Pass2=4608, clm=1 (8 cpus, 8 workers): 9.17, 9.10, 9.01, 8.93, 9.18, 9.18, 8.92, 8.94 ms. Throughput: 883.84 iter/sec. FFTlen=1728K, Type=3, Arch=4, Pass1=768, Pass2=2304, clm=4 (8 cpus, 1 worker): 1.22 ms. Throughput: 816.93 iter/sec. FFTlen=1728K, Type=3, Arch=4, Pass1=768, Pass2=2304, clm=4 (8 cpus, 8 workers): 9.84, 9.80, 9.66, 9.55, 9.94, 9.83, 9.60, 9.55 ms. Throughput: 823.16 iter/sec. FFTlen=1728K, Type=3, Arch=4, Pass1=768, Pass2=2304, clm=2 (8 cpus, 1 worker): 1.09 ms. Throughput: 919.43 iter/sec. FFTlen=1728K, Type=3, Arch=4, Pass1=768, Pass2=2304, clm=2 (8 cpus, 8 workers): 9.41, 9.31, 9.14, 9.09, 9.39, 9.39, 9.08, 9.08 ms. Throughput: 866.28 iter/sec. FFTlen=1728K, Type=3, Arch=4, Pass1=768, Pass2=2304, clm=1 (8 cpus, 1 worker): 1.08 ms. Throughput: 927.69 iter/sec. FFTlen=1728K, Type=3, Arch=4, Pass1=768, Pass2=2304, clm=1 (8 cpus, 8 workers): 9.22, 9.20, 9.08, 9.07, 9.33, 9.16, 9.08, 9.05 ms. Throughput: 874.53 iter/sec. FFTlen=1792K, Type=3, Arch=4, Pass1=448, Pass2=4096, clm=4 (8 cpus, 1 worker): 1.08 ms. Throughput: 922.86 iter/sec. FFTlen=1792K, Type=3, Arch=4, Pass1=448, Pass2=4096, clm=4 (8 cpus, 8 workers): 9.64, 9.51, 9.37, 9.32, 9.74, 9.62, 9.32, 9.21 ms. Throughput: 845.38 iter/sec. FFTlen=1792K, Type=3, Arch=4, Pass1=448, Pass2=4096, clm=2 (8 cpus, 1 worker): 1.09 ms. Throughput: 918.67 iter/sec. FFTlen=1792K, Type=3, Arch=4, Pass1=448, Pass2=4096, clm=2 (8 cpus, 8 workers): 9.65, 9.47, 9.32, 9.20, 9.91, 9.53, 9.42, 9.24 ms. Throughput: 845.60 iter/sec. FFTlen=1792K, Type=3, Arch=4, Pass1=448, Pass2=4096, clm=1 (8 cpus, 1 worker): 1.13 ms. Throughput: 888.20 iter/sec. FFTlen=1792K, Type=3, Arch=4, Pass1=448, Pass2=4096, clm=1 (8 cpus, 8 workers): 9.71, 9.49, 9.33, 9.26, 9.74, 9.42, 9.29, 9.27 ms. Throughput: 847.92 iter/sec. FFTlen=1792K, Type=3, Arch=4, Pass1=896, Pass2=2048, clm=4 (8 cpus, 1 worker): 1.20 ms. Throughput: 834.11 iter/sec. FFTlen=1792K, Type=3, Arch=4, Pass1=896, Pass2=2048, clm=4 (8 cpus, 8 workers): 10.09, 9.99, 9.75, 9.70, 10.13, 9.89, 9.77, 9.81 ms. Throughput: 809.00 iter/sec. FFTlen=1792K, Type=3, Arch=4, Pass1=896, Pass2=2048, clm=2 (8 cpus, 1 worker): 1.12 ms. Throughput: 890.89 iter/sec. FFTlen=1792K, Type=3, Arch=4, Pass1=896, Pass2=2048, clm=2 (8 cpus, 8 workers): 9.44, 9.40, 9.32, 9.29, 9.54, 9.36, 9.27, 9.26 ms. Throughput: 854.74 iter/sec. FFTlen=1792K, Type=3, Arch=4, Pass1=896, Pass2=2048, clm=1 (8 cpus, 1 worker): 1.11 ms. Throughput: 897.72 iter/sec. FFTlen=1792K, Type=3, Arch=4, Pass1=896, Pass2=2048, clm=1 (8 cpus, 8 workers): 9.44, 9.38, 9.30, 9.19, 9.46, 9.49, 9.26, 9.18 ms. Throughput: 856.84 iter/sec. FFTlen=1792K, Type=3, Arch=4, Pass1=1792, Pass2=1024, clm=4 (8 cpus, 1 worker): 1.37 ms. Throughput: 732.02 iter/sec. FFTlen=1792K, Type=3, Arch=4, Pass1=1792, Pass2=1024, clm=4 (8 cpus, 8 workers): 10.78, 10.71, 10.47, 10.47, 11.07, 10.88, 10.44, 10.52 ms. Throughput: 750.32 iter/sec. FFTlen=1792K, Type=3, Arch=4, Pass1=1792, Pass2=1024, clm=2 (8 cpus, 1 worker): 1.27 ms. Throughput: 785.76 iter/sec. FFTlen=1792K, Type=3, Arch=4, Pass1=1792, Pass2=1024, clm=2 (8 cpus, 8 workers): 10.34, 10.13, 9.93, 9.93, 10.34, 10.21, 9.87, 9.94 ms. Throughput: 793.49 iter/sec. FFTlen=1792K, Type=3, Arch=4, Pass1=1792, Pass2=1024, clm=1 (8 cpus, 1 worker): 1.26 ms. Throughput: 794.30 iter/sec. FFTlen=1792K, Type=3, Arch=4, Pass1=1792, Pass2=1024, clm=1 (8 cpus, 8 workers): 9.89, 9.80, 9.73, 9.68, 9.92, 9.90, 9.60, 9.68 ms. Throughput: 818.46 iter/sec. FFTlen=1920K, Type=3, Arch=4, Pass1=128, Pass2=15360, clm=4 (8 cpus, 1 worker): 1.37 ms. Throughput: 727.66 iter/sec. FFTlen=1920K, Type=3, Arch=4, Pass1=128, Pass2=15360, clm=4 (8 cpus, 8 workers): 11.41, 11.28, 11.15, 11.11, 11.54, 11.19, 11.09, 11.12 ms. Throughput: 712.13 iter/sec. FFTlen=1920K, Type=3, Arch=4, Pass1=128, Pass2=15360, clm=2 (8 cpus, 1 worker): 1.37 ms. Throughput: 732.31 iter/sec. FFTlen=1920K, Type=3, Arch=4, Pass1=128, Pass2=15360, clm=2 (8 cpus, 8 workers): 11.27, 11.14, 11.09, 11.03, 11.33, 11.17, 11.13, 11.05 ms. Throughput: 717.39 iter/sec. FFTlen=1920K, Type=3, Arch=4, Pass1=128, Pass2=15360, clm=1 (8 cpus, 1 worker): 1.92 ms. Throughput: 522.14 iter/sec. FFTlen=1920K, Type=3, Arch=4, Pass1=128, Pass2=15360, clm=1 (8 cpus, 8 workers): 11.34, 11.25, 11.12, 11.10, 11.42, 11.17, 11.04, 11.11 ms. Throughput: 714.74 iter/sec. FFTlen=1920K, Type=3, Arch=4, Pass1=256, Pass2=7680, clm=4 (8 cpus, 1 worker): 1.18 ms. Throughput: 850.11 iter/sec. FFTlen=1920K, Type=3, Arch=4, Pass1=256, Pass2=7680, clm=4 (8 cpus, 8 workers): 10.50, 10.45, 10.34, 10.19, 10.73, 10.47, 10.28, 10.20 ms. Throughput: 769.65 iter/sec. [Sat Apr 29 11:23:23 2017] FFTlen=1920K, Type=3, Arch=4, Pass1=256, Pass2=7680, clm=2 (8 cpus, 1 worker): 1.17 ms. Throughput: 857.43 iter/sec. FFTlen=1920K, Type=3, Arch=4, Pass1=256, Pass2=7680, clm=2 (8 cpus, 8 workers): 10.29, 10.25, 10.08, 9.97, 10.47, 10.30, 9.98, 9.92 ms. Throughput: 787.82 iter/sec. FFTlen=1920K, Type=3, Arch=4, Pass1=256, Pass2=7680, clm=1 (8 cpus, 1 worker): 1.18 ms. Throughput: 849.43 iter/sec. FFTlen=1920K, Type=3, Arch=4, Pass1=256, Pass2=7680, clm=1 (8 cpus, 8 workers): 10.28, 10.19, 10.08, 10.03, 10.32, 10.30, 10.00, 10.00 ms. Throughput: 788.46 iter/sec. FFTlen=1920K, Type=3, Arch=4, Pass1=320, Pass2=6144, clm=4 (8 cpus, 1 worker): 1.13 ms. Throughput: 885.04 iter/sec. FFTlen=1920K, Type=3, Arch=4, Pass1=320, Pass2=6144, clm=4 (8 cpus, 8 workers): 10.27, 10.18, 10.10, 10.00, 10.48, 10.25, 9.91, 9.95 ms. Throughput: 788.98 iter/sec. FFTlen=1920K, Type=3, Arch=4, Pass1=320, Pass2=6144, clm=2 (8 cpus, 1 worker): 1.23 ms. Throughput: 814.23 iter/sec. FFTlen=1920K, Type=3, Arch=4, Pass1=320, Pass2=6144, clm=2 (8 cpus, 8 workers): 10.30, 10.20, 10.08, 9.98, 10.38, 10.27, 9.99, 9.88 ms. Throughput: 789.51 iter/sec. FFTlen=1920K, Type=3, Arch=4, Pass1=320, Pass2=6144, clm=1 (8 cpus, 1 worker): 1.16 ms. Throughput: 858.49 iter/sec. FFTlen=1920K, Type=3, Arch=4, Pass1=320, Pass2=6144, clm=1 (8 cpus, 8 workers): 10.16, 10.14, 10.04, 9.97, 10.29, 10.18, 10.07, 10.14 ms. Throughput: 790.47 iter/sec. FFTlen=1920K, Type=3, Arch=4, Pass1=384, Pass2=5120, clm=4 (8 cpus, 1 worker): 1.15 ms. Throughput: 872.19 iter/sec. FFTlen=1920K, Type=3, Arch=4, Pass1=384, Pass2=5120, clm=4 (8 cpus, 8 workers): 10.27, 10.23, 10.08, 9.97, 10.45, 10.30, 10.03, 9.96 ms. Throughput: 787.52 iter/sec. FFTlen=1920K, Type=3, Arch=4, Pass1=384, Pass2=5120, clm=2 (8 cpus, 1 worker): 1.15 ms. Throughput: 872.05 iter/sec. FFTlen=1920K, Type=3, Arch=4, Pass1=384, Pass2=5120, clm=2 (8 cpus, 8 workers): 10.17, 10.11, 10.00, 9.90, 10.33, 10.16, 9.92, 9.88 ms. Throughput: 795.60 iter/sec. FFTlen=1920K, Type=3, Arch=4, Pass1=384, Pass2=5120, clm=1 (8 cpus, 1 worker): 1.17 ms. Throughput: 853.58 iter/sec. FFTlen=1920K, Type=3, Arch=4, Pass1=384, Pass2=5120, clm=1 (8 cpus, 8 workers): 10.22, 10.14, 10.02, 9.94, 10.28, 10.07, 10.00, 9.94 ms. Throughput: 794.09 iter/sec. FFTlen=1920K, Type=3, Arch=4, Pass1=512, Pass2=3840, clm=4 (8 cpus, 1 worker): 1.29 ms. Throughput: 774.01 iter/sec. FFTlen=1920K, Type=3, Arch=4, Pass1=512, Pass2=3840, clm=4 (8 cpus, 8 workers): 10.86, 10.78, 10.58, 10.44, 11.00, 10.83, 10.48, 10.45 ms. Throughput: 749.53 iter/sec. FFTlen=1920K, Type=3, Arch=4, Pass1=512, Pass2=3840, clm=2 (8 cpus, 1 worker): 1.19 ms. Throughput: 843.80 iter/sec. FFTlen=1920K, Type=3, Arch=4, Pass1=512, Pass2=3840, clm=2 (8 cpus, 8 workers): 10.48, 10.37, 10.24, 10.15, 10.54, 10.48, 10.15, 10.13 ms. Throughput: 775.44 iter/sec. FFTlen=1920K, Type=3, Arch=4, Pass1=512, Pass2=3840, clm=1 (8 cpus, 1 worker): 1.19 ms. Throughput: 840.69 iter/sec. FFTlen=1920K, Type=3, Arch=4, Pass1=512, Pass2=3840, clm=1 (8 cpus, 8 workers): 10.36, 10.33, 10.22, 10.14, 10.41, 10.34, 10.18, 10.16 ms. Throughput: 779.15 iter/sec. FFTlen=1920K, Type=3, Arch=4, Pass1=640, Pass2=3072, clm=4 (8 cpus, 1 worker): 1.28 ms. Throughput: 780.58 iter/sec. FFTlen=1920K, Type=3, Arch=4, Pass1=640, Pass2=3072, clm=4 (8 cpus, 8 workers): 10.81, 10.64, 10.50, 10.43, 10.83, 10.85, 10.50, 10.41 ms. Throughput: 753.40 iter/sec. FFTlen=1920K, Type=3, Arch=4, Pass1=640, Pass2=3072, clm=2 (8 cpus, 1 worker): 1.18 ms. Throughput: 844.81 iter/sec. FFTlen=1920K, Type=3, Arch=4, Pass1=640, Pass2=3072, clm=2 (8 cpus, 8 workers): 10.28, 10.27, 10.07, 10.04, 10.46, 10.23, 10.05, 9.97 ms. Throughput: 786.64 iter/sec. FFTlen=1920K, Type=3, Arch=4, Pass1=640, Pass2=3072, clm=1 (8 cpus, 1 worker): 1.17 ms. Throughput: 857.07 iter/sec. FFTlen=1920K, Type=3, Arch=4, Pass1=640, Pass2=3072, clm=1 (8 cpus, 8 workers): 10.31, 10.23, 10.06, 10.01, 10.31, 10.22, 10.11, 10.05 ms. Throughput: 787.31 iter/sec. FFTlen=1920K, Type=3, Arch=4, Pass1=768, Pass2=2560, clm=4 (8 cpus, 1 worker): 1.31 ms. Throughput: 762.17 iter/sec. FFTlen=1920K, Type=3, Arch=4, Pass1=768, Pass2=2560, clm=4 (8 cpus, 8 workers): 10.75, 10.57, 10.42, 10.39, 10.79, 10.56, 10.37, 10.54 ms. Throughput: 758.53 iter/sec. FFTlen=1920K, Type=3, Arch=4, Pass1=768, Pass2=2560, clm=2 (8 cpus, 1 worker): 1.17 ms. Throughput: 856.12 iter/sec. FFTlen=1920K, Type=3, Arch=4, Pass1=768, Pass2=2560, clm=2 (8 cpus, 8 workers): 10.14, 10.04, 9.94, 9.86, 10.27, 10.09, 9.97, 9.84 ms. Throughput: 798.67 iter/sec. FFTlen=1920K, Type=3, Arch=4, Pass1=768, Pass2=2560, clm=1 (8 cpus, 1 worker): 1.15 ms. Throughput: 867.97 iter/sec. FFTlen=1920K, Type=3, Arch=4, Pass1=768, Pass2=2560, clm=1 (8 cpus, 8 workers): 10.18, 10.08, 9.93, 9.84, 10.29, 10.07, 9.83, 9.81 ms. Throughput: 799.85 iter/sec. FFTlen=1920K, Type=3, Arch=4, Pass1=1280, Pass2=1536, clm=4 (8 cpus, 1 worker): 1.36 ms. Throughput: 733.63 iter/sec. [Sat Apr 29 11:28:25 2017] FFTlen=1920K, Type=3, Arch=4, Pass1=1280, Pass2=1536, clm=4 (8 cpus, 8 workers): 11.17, 10.87, 10.72, 10.61, 11.11, 11.01, 10.61, 10.53 ms. Throughput: 739.28 iter/sec. FFTlen=1920K, Type=3, Arch=4, Pass1=1280, Pass2=1536, clm=2 (8 cpus, 1 worker): 1.32 ms. Throughput: 758.98 iter/sec. FFTlen=1920K, Type=3, Arch=4, Pass1=1280, Pass2=1536, clm=2 (8 cpus, 8 workers): 10.41, 10.21, 10.18, 10.22, 10.41, 10.26, 10.02, 10.01 ms. Throughput: 783.40 iter/sec. FFTlen=1920K, Type=3, Arch=4, Pass1=1280, Pass2=1536, clm=1 (8 cpus, 1 worker): 1.31 ms. Throughput: 760.96 iter/sec. FFTlen=1920K, Type=3, Arch=4, Pass1=1280, Pass2=1536, clm=1 (8 cpus, 8 workers): 10.23, 10.04, 9.93, 9.89, 10.35, 10.25, 9.95, 9.76 ms. Throughput: 796.23 iter/sec. FFTlen=2000K, Type=3, Arch=4, Pass1=320, Pass2=6400, clm=4 (8 cpus, 1 worker): 1.26 ms. Throughput: 796.34 iter/sec. FFTlen=2000K, Type=3, Arch=4, Pass1=320, Pass2=6400, clm=4 (8 cpus, 8 workers): 11.02, 10.96, 10.74, 10.69, 11.15, 11.02, 10.73, 10.67 ms. Throughput: 736.03 iter/sec. FFTlen=2000K, Type=3, Arch=4, Pass1=320, Pass2=6400, clm=2 (8 cpus, 1 worker): 1.24 ms. Throughput: 808.67 iter/sec. FFTlen=2000K, Type=3, Arch=4, Pass1=320, Pass2=6400, clm=2 (8 cpus, 8 workers): 10.92, 10.80, 10.75, 10.73, 10.99, 10.92, 10.73, 10.69 ms. Throughput: 739.69 iter/sec. FFTlen=2000K, Type=3, Arch=4, Pass1=320, Pass2=6400, clm=1 (8 cpus, 1 worker): 1.26 ms. Throughput: 792.05 iter/sec. FFTlen=2000K, Type=3, Arch=4, Pass1=320, Pass2=6400, clm=1 (8 cpus, 8 workers): 11.01, 10.90, 10.75, 10.64, 11.19, 11.05, 10.69, 10.53 ms. Throughput: 738.01 iter/sec. FFTlen=2016K, Type=3, Arch=4, Pass1=448, Pass2=4608, clm=4 (8 cpus, 1 worker): 1.27 ms. Throughput: 788.69 iter/sec. FFTlen=2016K, Type=3, Arch=4, Pass1=448, Pass2=4608, clm=4 (8 cpus, 8 workers): 10.98, 10.89, 10.69, 10.65, 10.94, 10.99, 10.68, 10.65 ms. Throughput: 740.33 iter/sec. FFTlen=2016K, Type=3, Arch=4, Pass1=448, Pass2=4608, clm=2 (8 cpus, 1 worker): 1.23 ms. Throughput: 810.85 iter/sec. FFTlen=2016K, Type=3, Arch=4, Pass1=448, Pass2=4608, clm=2 (8 cpus, 8 workers): 10.79, 10.72, 10.62, 10.59, 10.90, 10.76, 10.55, 10.39 ms. Throughput: 750.29 iter/sec. FFTlen=2016K, Type=3, Arch=4, Pass1=448, Pass2=4608, clm=1 (8 cpus, 1 worker): 1.26 ms. Throughput: 793.13 iter/sec. FFTlen=2016K, Type=3, Arch=4, Pass1=448, Pass2=4608, clm=1 (8 cpus, 8 workers): 10.88, 10.80, 10.70, 10.62, 10.90, 10.87, 10.63, 10.57 ms. Throughput: 744.65 iter/sec. FFTlen=2016K, Type=3, Arch=4, Pass1=896, Pass2=2304, clm=4 (8 cpus, 1 worker): 1.46 ms. Throughput: 685.56 iter/sec. FFTlen=2016K, Type=3, Arch=4, Pass1=896, Pass2=2304, clm=4 (8 cpus, 8 workers): 11.83, 11.66, 11.54, 11.46, 11.91, 11.65, 11.46, 11.43 ms. Throughput: 688.69 iter/sec. FFTlen=2016K, Type=3, Arch=4, Pass1=896, Pass2=2304, clm=2 (8 cpus, 1 worker): 1.30 ms. Throughput: 767.67 iter/sec. FFTlen=2016K, Type=3, Arch=4, Pass1=896, Pass2=2304, clm=2 (8 cpus, 8 workers): 11.15, 10.92, 10.88, 10.81, 11.14, 10.92, 10.98, 10.94 ms. Throughput: 729.53 iter/sec. FFTlen=2016K, Type=3, Arch=4, Pass1=896, Pass2=2304, clm=1 (8 cpus, 1 worker): 1.28 ms. Throughput: 780.32 iter/sec. FFTlen=2016K, Type=3, Arch=4, Pass1=896, Pass2=2304, clm=1 (8 cpus, 8 workers): 11.01, 10.92, 10.83, 10.76, 11.07, 11.00, 10.77, 10.71 ms. Throughput: 734.98 iter/sec. FFTlen=2048K, Type=3, Arch=4, Pass1=128, Pass2=16384, clm=4 (8 cpus, 1 worker): 1.52 ms. Throughput: 657.84 iter/sec. FFTlen=2048K, Type=3, Arch=4, Pass1=128, Pass2=16384, clm=4 (8 cpus, 8 workers): 12.55, 12.46, 12.24, 12.22, 12.63, 12.47, 12.26, 12.19 ms. Throughput: 646.43 iter/sec. FFTlen=2048K, Type=3, Arch=4, Pass1=128, Pass2=16384, clm=2 (8 cpus, 1 worker): 1.51 ms. Throughput: 660.55 iter/sec. FFTlen=2048K, Type=3, Arch=4, Pass1=128, Pass2=16384, clm=2 (8 cpus, 8 workers): 12.43, 12.31, 12.24, 12.09, 12.67, 12.40, 12.24, 12.02 ms. Throughput: 650.52 iter/sec. FFTlen=2048K, Type=3, Arch=4, Pass1=128, Pass2=16384, clm=1 (8 cpus, 1 worker): 2.07 ms. Throughput: 484.00 iter/sec. FFTlen=2048K, Type=3, Arch=4, Pass1=128, Pass2=16384, clm=1 (8 cpus, 8 workers): 12.49, 12.41, 12.30, 12.17, 12.71, 12.57, 12.00, 12.18 ms. Throughput: 647.83 iter/sec. FFTlen=2048K, Type=3, Arch=4, Pass1=256, Pass2=8192, clm=4 (8 cpus, 1 worker): 1.28 ms. Throughput: 782.59 iter/sec. FFTlen=2048K, Type=3, Arch=4, Pass1=256, Pass2=8192, clm=4 (8 cpus, 8 workers): 11.35, 11.26, 11.17, 11.12, 11.41, 11.35, 11.04, 11.06 ms. Throughput: 713.09 iter/sec. FFTlen=2048K, Type=3, Arch=4, Pass1=256, Pass2=8192, clm=2 (8 cpus, 1 worker): 1.25 ms. Throughput: 797.51 iter/sec. FFTlen=2048K, Type=3, Arch=4, Pass1=256, Pass2=8192, clm=2 (8 cpus, 8 workers): 11.12, 11.05, 10.90, 10.80, 11.60, 11.10, 10.97, 10.82 ms. Throughput: 724.59 iter/sec. FFTlen=2048K, Type=3, Arch=4, Pass1=256, Pass2=8192, clm=1 (8 cpus, 1 worker): 1.28 ms. Throughput: 778.67 iter/sec. FFTlen=2048K, Type=3, Arch=4, Pass1=256, Pass2=8192, clm=1 (8 cpus, 8 workers): 11.19, 11.19, 10.98, 10.98, 11.29, 11.20, 10.90, 10.84 ms. Throughput: 722.68 iter/sec. [Sat Apr 29 11:33:31 2017] FFTlen=2048K, Type=3, Arch=4, Pass1=512, Pass2=4096, clm=4 (8 cpus, 1 worker): 1.34 ms. Throughput: 743.71 iter/sec. FFTlen=2048K, Type=3, Arch=4, Pass1=512, Pass2=4096, clm=4 (8 cpus, 8 workers): 11.38, 11.27, 11.16, 11.11, 11.47, 11.37, 11.11, 11.10 ms. Throughput: 711.42 iter/sec. FFTlen=2048K, Type=3, Arch=4, Pass1=512, Pass2=4096, clm=2 (8 cpus, 1 worker): 1.25 ms. Throughput: 800.58 iter/sec. FFTlen=2048K, Type=3, Arch=4, Pass1=512, Pass2=4096, clm=2 (8 cpus, 8 workers): 10.92, 10.85, 10.93, 10.89, 10.93, 10.83, 10.71, 10.58 ms. Throughput: 738.71 iter/sec. FFTlen=2048K, Type=3, Arch=4, Pass1=512, Pass2=4096, clm=1 (8 cpus, 1 worker): 1.23 ms. Throughput: 813.65 iter/sec. FFTlen=2048K, Type=3, Arch=4, Pass1=512, Pass2=4096, clm=1 (8 cpus, 8 workers): 11.03, 11.00, 10.78, 10.59, 11.06, 10.96, 10.76, 10.67 ms. Throughput: 737.16 iter/sec. FFTlen=2048K, Type=3, Arch=4, Pass1=1024, Pass2=2048, clm=4 (8 cpus, 1 worker): 1.42 ms. Throughput: 703.22 iter/sec. FFTlen=2048K, Type=3, Arch=4, Pass1=1024, Pass2=2048, clm=4 (8 cpus, 8 workers): 11.67, 11.62, 11.34, 11.28, 11.70, 11.52, 11.32, 11.42 ms. Throughput: 696.77 iter/sec. FFTlen=2048K, Type=3, Arch=4, Pass1=1024, Pass2=2048, clm=2 (8 cpus, 1 worker): 1.31 ms. Throughput: 764.35 iter/sec. FFTlen=2048K, Type=3, Arch=4, Pass1=1024, Pass2=2048, clm=2 (8 cpus, 8 workers): 11.00, 10.88, 10.78, 10.60, 11.04, 10.86, 10.70, 10.74 ms. Throughput: 739.02 iter/sec. FFTlen=2048K, Type=3, Arch=4, Pass1=1024, Pass2=2048, clm=1 (8 cpus, 1 worker): 1.29 ms. Throughput: 774.77 iter/sec. FFTlen=2048K, Type=3, Arch=4, Pass1=1024, Pass2=2048, clm=1 (8 cpus, 8 workers): 10.86, 10.80, 10.64, 10.52, 10.97, 10.79, 10.57, 10.49 ms. Throughput: 747.48 iter/sec. FFTlen=2048K, Type=3, Arch=4, Pass1=2048, Pass2=1024, clm=4 (8 cpus, 1 worker): 1.66 ms. Throughput: 600.94 iter/sec. FFTlen=2048K, Type=3, Arch=4, Pass1=2048, Pass2=1024, clm=4 (8 cpus, 8 workers): 12.85, 12.59, 12.57, 12.35, 12.96, 12.72, 12.36, 12.49 ms. Throughput: 634.56 iter/sec. FFTlen=2048K, Type=3, Arch=4, Pass1=2048, Pass2=1024, clm=2 (8 cpus, 1 worker): 1.57 ms. Throughput: 638.33 iter/sec. FFTlen=2048K, Type=3, Arch=4, Pass1=2048, Pass2=1024, clm=2 (8 cpus, 8 workers): 12.21, 12.23, 12.08, 11.86, 12.17, 12.27, 11.88, 11.93 ms. Throughput: 662.49 iter/sec. FFTlen=2048K, Type=3, Arch=4, Pass1=2048, Pass2=1024, clm=1 (8 cpus, 1 worker): 1.53 ms. Throughput: 653.11 iter/sec. FFTlen=2048K, Type=3, Arch=4, Pass1=2048, Pass2=1024, clm=1 (8 cpus, 8 workers): 11.83, 11.82, 11.71, 11.69, 11.90, 11.76, 11.52, 11.54 ms. Throughput: 682.66 iter/sec. FFTlen=2240K, Type=3, Arch=4, Pass1=448, Pass2=5120, clm=4 (8 cpus, 1 worker): 1.41 ms. Throughput: 710.99 iter/sec. FFTlen=2240K, Type=3, Arch=4, Pass1=448, Pass2=5120, clm=4 (8 cpus, 8 workers): 12.17, 12.10, 11.95, 11.90, 12.34, 12.11, 12.11, 12.05 ms. Throughput: 661.67 iter/sec. FFTlen=2240K, Type=3, Arch=4, Pass1=448, Pass2=5120, clm=2 (8 cpus, 1 worker): 1.36 ms. Throughput: 732.84 iter/sec. FFTlen=2240K, Type=3, Arch=4, Pass1=448, Pass2=5120, clm=2 (8 cpus, 8 workers): 11.95, 11.86, 11.81, 11.72, 12.10, 11.90, 11.71, 11.70 ms. Throughput: 675.51 iter/sec. FFTlen=2240K, Type=3, Arch=4, Pass1=448, Pass2=5120, clm=1 (8 cpus, 1 worker): 1.41 ms. Throughput: 708.96 iter/sec. FFTlen=2240K, Type=3, Arch=4, Pass1=448, Pass2=5120, clm=1 (8 cpus, 8 workers): 12.18, 12.07, 11.91, 11.83, 12.37, 12.22, 11.75, 11.75 ms. Throughput: 666.33 iter/sec. FFTlen=2240K, Type=3, Arch=4, Pass1=896, Pass2=2560, clm=4 (8 cpus, 1 worker): 1.60 ms. Throughput: 626.24 iter/sec. FFTlen=2240K, Type=3, Arch=4, Pass1=896, Pass2=2560, clm=4 (8 cpus, 8 workers): 12.82, 12.76, 12.47, 12.34, 12.98, 12.71, 12.42, 12.35 ms. Throughput: 634.92 iter/sec. FFTlen=2240K, Type=3, Arch=4, Pass1=896, Pass2=2560, clm=2 (8 cpus, 1 worker): 1.40 ms. Throughput: 716.31 iter/sec. FFTlen=2240K, Type=3, Arch=4, Pass1=896, Pass2=2560, clm=2 (8 cpus, 8 workers): 11.97, 11.96, 11.83, 11.72, 12.06, 11.87, 11.80, 11.70 ms. Throughput: 674.43 iter/sec. FFTlen=2240K, Type=3, Arch=4, Pass1=896, Pass2=2560, clm=1 (8 cpus, 1 worker): 1.38 ms. Throughput: 723.67 iter/sec. FFTlen=2240K, Type=3, Arch=4, Pass1=896, Pass2=2560, clm=1 (8 cpus, 8 workers): 11.99, 11.90, 11.76, 11.62, 12.13, 12.00, 11.62, 11.50 ms. Throughput: 677.43 iter/sec. FFTlen=2304K, Type=3, Arch=4, Pass1=256, Pass2=9216, clm=4 (8 cpus, 1 worker): 1.53 ms. Throughput: 654.73 iter/sec. FFTlen=2304K, Type=3, Arch=4, Pass1=256, Pass2=9216, clm=4 (8 cpus, 8 workers): 13.23, 13.19, 12.93, 12.84, 13.24, 13.25, 12.88, 12.84 ms. Throughput: 613.11 iter/sec. FFTlen=2304K, Type=3, Arch=4, Pass1=256, Pass2=9216, clm=2 (8 cpus, 1 worker): 1.48 ms. Throughput: 677.52 iter/sec. FFTlen=2304K, Type=3, Arch=4, Pass1=256, Pass2=9216, clm=2 (8 cpus, 8 workers): 12.95, 12.84, 12.67, 12.60, 13.01, 12.91, 12.65, 12.55 ms. Throughput: 626.46 iter/sec. FFTlen=2304K, Type=3, Arch=4, Pass1=256, Pass2=9216, clm=1 (8 cpus, 1 worker): 1.50 ms. Throughput: 665.59 iter/sec. [Sat Apr 29 11:38:35 2017] FFTlen=2304K, Type=3, Arch=4, Pass1=256, Pass2=9216, clm=1 (8 cpus, 8 workers): 12.85, 13.21, 12.96, 12.69, 12.91, 12.76, 12.74, 12.63 ms. Throughput: 623.01 iter/sec. FFTlen=2304K, Type=3, Arch=4, Pass1=384, Pass2=6144, clm=4 (8 cpus, 1 worker): 1.38 ms. Throughput: 723.36 iter/sec. FFTlen=2304K, Type=3, Arch=4, Pass1=384, Pass2=6144, clm=4 (8 cpus, 8 workers): 12.41, 12.29, 12.10, 11.96, 12.25, 12.20, 12.06, 11.95 ms. Throughput: 658.45 iter/sec. FFTlen=2304K, Type=3, Arch=4, Pass1=384, Pass2=6144, clm=2 (8 cpus, 1 worker): 1.36 ms. Throughput: 736.53 iter/sec. FFTlen=2304K, Type=3, Arch=4, Pass1=384, Pass2=6144, clm=2 (8 cpus, 8 workers): 12.32, 12.24, 12.01, 11.91, 12.30, 12.29, 11.98, 11.91 ms. Throughput: 660.28 iter/sec. FFTlen=2304K, Type=3, Arch=4, Pass1=384, Pass2=6144, clm=1 (8 cpus, 1 worker): 1.40 ms. Throughput: 712.81 iter/sec. FFTlen=2304K, Type=3, Arch=4, Pass1=384, Pass2=6144, clm=1 (8 cpus, 8 workers): 12.39, 12.29, 12.06, 11.96, 12.46, 12.29, 12.05, 11.81 ms. Throughput: 657.89 iter/sec. FFTlen=2304K, Type=3, Arch=4, Pass1=512, Pass2=4608, clm=4 (8 cpus, 1 worker): 1.57 ms. Throughput: 637.47 iter/sec. FFTlen=2304K, Type=3, Arch=4, Pass1=512, Pass2=4608, clm=4 (8 cpus, 8 workers): 13.25, 13.12, 12.81, 12.59, 12.96, 12.74, 12.57, 12.65 ms. Throughput: 623.46 iter/sec. FFTlen=2304K, Type=3, Arch=4, Pass1=512, Pass2=4608, clm=2 (8 cpus, 1 worker): 1.42 ms. Throughput: 705.40 iter/sec. FFTlen=2304K, Type=3, Arch=4, Pass1=512, Pass2=4608, clm=2 (8 cpus, 8 workers): 12.55, 12.48, 12.30, 12.20, 12.44, 12.47, 12.18, 12.14 ms. Throughput: 648.09 iter/sec. FFTlen=2304K, Type=3, Arch=4, Pass1=512, Pass2=4608, clm=1 (8 cpus, 1 worker): 1.43 ms. Throughput: 697.77 iter/sec. FFTlen=2304K, Type=3, Arch=4, Pass1=512, Pass2=4608, clm=1 (8 cpus, 8 workers): 12.62, 12.44, 12.38, 12.21, 12.67, 12.59, 12.24, 12.18 ms. Throughput: 644.44 iter/sec. FFTlen=2304K, Type=3, Arch=4, Pass1=768, Pass2=3072, clm=4 (8 cpus, 1 worker): 1.61 ms. Throughput: 619.41 iter/sec. FFTlen=2304K, Type=3, Arch=4, Pass1=768, Pass2=3072, clm=4 (8 cpus, 8 workers): 13.13, 12.86, 12.73, 12.68, 13.20, 13.18, 12.66, 12.62 ms. Throughput: 621.22 iter/sec. FFTlen=2304K, Type=3, Arch=4, Pass1=768, Pass2=3072, clm=2 (8 cpus, 1 worker): 1.41 ms. Throughput: 709.53 iter/sec. FFTlen=2304K, Type=3, Arch=4, Pass1=768, Pass2=3072, clm=2 (8 cpus, 8 workers): 12.39, 12.24, 12.13, 12.10, 12.45, 12.33, 12.11, 12.10 ms. Throughput: 654.17 iter/sec. FFTlen=2304K, Type=3, Arch=4, Pass1=768, Pass2=3072, clm=1 (8 cpus, 1 worker): 1.40 ms. Throughput: 713.66 iter/sec. FFTlen=2304K, Type=3, Arch=4, Pass1=768, Pass2=3072, clm=1 (8 cpus, 8 workers): 12.50, 12.38, 12.23, 12.03, 12.55, 12.36, 12.06, 12.05 ms. Throughput: 652.12 iter/sec. FFTlen=2304K, Type=3, Arch=4, Pass1=1024, Pass2=2304, clm=4 (8 cpus, 1 worker): 1.72 ms. Throughput: 580.45 iter/sec. FFTlen=2304K, Type=3, Arch=4, Pass1=1024, Pass2=2304, clm=4 (8 cpus, 8 workers): 13.72, 13.71, 13.46, 13.33, 14.14, 13.67, 13.38, 13.30 ms. Throughput: 588.98 iter/sec. FFTlen=2304K, Type=3, Arch=4, Pass1=1024, Pass2=2304, clm=2 (8 cpus, 1 worker): 1.51 ms. Throughput: 661.85 iter/sec. FFTlen=2304K, Type=3, Arch=4, Pass1=1024, Pass2=2304, clm=2 (8 cpus, 8 workers): 12.83, 12.69, 12.60, 12.45, 12.75, 12.68, 12.44, 12.35 ms. Throughput: 635.03 iter/sec. FFTlen=2304K, Type=3, Arch=4, Pass1=1024, Pass2=2304, clm=1 (8 cpus, 1 worker): 1.47 ms. Throughput: 682.58 iter/sec. FFTlen=2304K, Type=3, Arch=4, Pass1=1024, Pass2=2304, clm=1 (8 cpus, 8 workers): 12.65, 12.55, 12.47, 12.30, 12.83, 12.63, 12.38, 12.36 ms. Throughput: 639.00 iter/sec. FFTlen=2400K, Type=3, Arch=4, Pass1=320, Pass2=7680, clm=4 (8 cpus, 1 worker): 1.50 ms. Throughput: 668.71 iter/sec. FFTlen=2400K, Type=3, Arch=4, Pass1=320, Pass2=7680, clm=4 (8 cpus, 8 workers): 13.05, 12.89, 12.74, 12.63, 13.22, 13.01, 12.69, 12.64 ms. Throughput: 622.21 iter/sec. FFTlen=2400K, Type=3, Arch=4, Pass1=320, Pass2=7680, clm=2 (8 cpus, 1 worker): 1.47 ms. Throughput: 679.85 iter/sec. FFTlen=2400K, Type=3, Arch=4, Pass1=320, Pass2=7680, clm=2 (8 cpus, 8 workers): 13.03, 12.93, 12.84, 12.71, 13.21, 13.00, 12.68, 12.68 ms. Throughput: 621.03 iter/sec. FFTlen=2400K, Type=3, Arch=4, Pass1=320, Pass2=7680, clm=1 (8 cpus, 1 worker): 1.51 ms. Throughput: 662.59 iter/sec. FFTlen=2400K, Type=3, Arch=4, Pass1=320, Pass2=7680, clm=1 (8 cpus, 8 workers): 13.02, 12.79, 12.80, 13.03, 12.84, 13.14, 12.63, 12.98 ms. Throughput: 620.07 iter/sec. FFTlen=2400K, Type=3, Arch=4, Pass1=384, Pass2=6400, clm=4 (8 cpus, 1 worker): 1.53 ms. Throughput: 654.81 iter/sec. FFTlen=2400K, Type=3, Arch=4, Pass1=384, Pass2=6400, clm=4 (8 cpus, 8 workers): 13.21, 13.20, 12.95, 12.81, 13.31, 13.23, 12.90, 12.86 ms. Throughput: 612.64 iter/sec. FFTlen=2400K, Type=3, Arch=4, Pass1=384, Pass2=6400, clm=2 (8 cpus, 1 worker): 1.50 ms. Throughput: 666.83 iter/sec. FFTlen=2400K, Type=3, Arch=4, Pass1=384, Pass2=6400, clm=2 (8 cpus, 8 workers): 13.15, 13.08, 12.89, 12.78, 13.32, 13.13, 12.81, 12.71 ms. Throughput: 616.33 iter/sec. [Sat Apr 29 11:43:42 2017] FFTlen=2400K, Type=3, Arch=4, Pass1=384, Pass2=6400, clm=1 (8 cpus, 1 worker): 1.52 ms. Throughput: 656.38 iter/sec. FFTlen=2400K, Type=3, Arch=4, Pass1=384, Pass2=6400, clm=1 (8 cpus, 8 workers): 13.24, 13.14, 12.94, 12.80, 13.27, 13.14, 12.88, 12.82 ms. Throughput: 614.10 iter/sec. FFTlen=2400K, Type=3, Arch=4, Pass1=640, Pass2=3840, clm=4 (8 cpus, 1 worker): 1.68 ms. Throughput: 596.60 iter/sec. FFTlen=2400K, Type=3, Arch=4, Pass1=640, Pass2=3840, clm=4 (8 cpus, 8 workers): 13.71, 13.64, 13.54, 13.43, 13.80, 13.74, 13.55, 13.39 ms. Throughput: 588.27 iter/sec. FFTlen=2400K, Type=3, Arch=4, Pass1=640, Pass2=3840, clm=2 (8 cpus, 1 worker): 1.62 ms. Throughput: 617.15 iter/sec. FFTlen=2400K, Type=3, Arch=4, Pass1=640, Pass2=3840, clm=2 (8 cpus, 8 workers): 13.11, 13.02, 12.93, 12.75, 13.21, 13.03, 12.74, 12.71 ms. Throughput: 618.49 iter/sec. FFTlen=2400K, Type=3, Arch=4, Pass1=640, Pass2=3840, clm=1 (8 cpus, 1 worker): 1.52 ms. Throughput: 659.38 iter/sec. FFTlen=2400K, Type=3, Arch=4, Pass1=640, Pass2=3840, clm=1 (8 cpus, 8 workers): 13.21, 13.07, 12.95, 12.87, 13.32, 13.17, 12.86, 12.80 ms. Throughput: 614.10 iter/sec. FFTlen=2560K, Type=3, Arch=4, Pass1=128, Pass2=20480, clm=4 (8 cpus, 1 worker): 2.04 ms. Throughput: 489.66 iter/sec. FFTlen=2560K, Type=3, Arch=4, Pass1=128, Pass2=20480, clm=4 (8 cpus, 8 workers): 16.01, 15.86, 15.65, 15.53, 16.01, 15.97, 15.59, 15.54 ms. Throughput: 507.32 iter/sec. FFTlen=2560K, Type=3, Arch=4, Pass1=128, Pass2=20480, clm=2 (8 cpus, 1 worker): 2.03 ms. Throughput: 493.04 iter/sec. FFTlen=2560K, Type=3, Arch=4, Pass1=128, Pass2=20480, clm=2 (8 cpus, 8 workers): 15.91, 15.78, 15.54, 15.45, 16.04, 15.82, 15.25, 15.52 ms. Throughput: 510.88 iter/sec. FFTlen=2560K, Type=3, Arch=4, Pass1=128, Pass2=20480, clm=1 (8 cpus, 1 worker): 3.06 ms. Throughput: 327.27 iter/sec. FFTlen=2560K, Type=3, Arch=4, Pass1=128, Pass2=20480, clm=1 (8 cpus, 8 workers): 16.05, 15.92, 15.73, 15.60, 16.06, 15.93, 15.62, 15.64 ms. Throughput: 505.84 iter/sec. FFTlen=2560K, Type=3, Arch=4, Pass1=256, Pass2=10240, clm=4 (8 cpus, 1 worker): 1.70 ms. Throughput: 588.87 iter/sec. FFTlen=2560K, Type=3, Arch=4, Pass1=256, Pass2=10240, clm=4 (8 cpus, 8 workers): 14.49, 14.31, 14.17, 14.03, 14.68, 14.37, 14.03, 14.09 ms. Throughput: 560.71 iter/sec. FFTlen=2560K, Type=3, Arch=4, Pass1=256, Pass2=10240, clm=2 (8 cpus, 1 worker): 1.63 ms. Throughput: 611.98 iter/sec. FFTlen=2560K, Type=3, Arch=4, Pass1=256, Pass2=10240, clm=2 (8 cpus, 8 workers): 14.17, 14.02, 13.84, 13.78, 14.28, 14.09, 13.82, 13.75 ms. Throughput: 572.79 iter/sec. FFTlen=2560K, Type=3, Arch=4, Pass1=256, Pass2=10240, clm=1 (8 cpus, 1 worker): 1.67 ms. Throughput: 598.75 iter/sec. FFTlen=2560K, Type=3, Arch=4, Pass1=256, Pass2=10240, clm=1 (8 cpus, 8 workers): 14.17, 14.00, 13.92, 13.80, 14.31, 14.11, 13.80, 13.78 ms. Throughput: 572.10 iter/sec. FFTlen=2560K, Type=3, Arch=4, Pass1=320, Pass2=8192, clm=4 (8 cpus, 1 worker): 1.62 ms. Throughput: 618.32 iter/sec. FFTlen=2560K, Type=3, Arch=4, Pass1=320, Pass2=8192, clm=4 (8 cpus, 8 workers): 14.12, 13.94, 13.76, 13.80, 14.24, 13.93, 13.77, 13.75 ms. Throughput: 575.12 iter/sec. FFTlen=2560K, Type=3, Arch=4, Pass1=320, Pass2=8192, clm=2 (8 cpus, 1 worker): 1.57 ms. Throughput: 637.20 iter/sec. FFTlen=2560K, Type=3, Arch=4, Pass1=320, Pass2=8192, clm=2 (8 cpus, 8 workers): 14.04, 13.98, 13.70, 14.10, 14.02, 13.93, 13.68, 14.14 ms. Throughput: 573.59 iter/sec. FFTlen=2560K, Type=3, Arch=4, Pass1=320, Pass2=8192, clm=1 (8 cpus, 1 worker): 1.64 ms. Throughput: 609.35 iter/sec. FFTlen=2560K, Type=3, Arch=4, Pass1=320, Pass2=8192, clm=1 (8 cpus, 8 workers): 14.06, 13.94, 13.87, 13.74, 14.29, 13.88, 13.80, 13.69 ms. Throughput: 575.28 iter/sec. FFTlen=2560K, Type=3, Arch=4, Pass1=512, Pass2=5120, clm=4 (8 cpus, 1 worker): 1.75 ms. Throughput: 571.19 iter/sec. FFTlen=2560K, Type=3, Arch=4, Pass1=512, Pass2=5120, clm=4 (8 cpus, 8 workers): 14.57, 14.41, 14.23, 14.08, 14.71, 14.53, 14.16, 14.09 ms. Throughput: 557.74 iter/sec. FFTlen=2560K, Type=3, Arch=4, Pass1=512, Pass2=5120, clm=2 (8 cpus, 1 worker): 1.60 ms. Throughput: 626.13 iter/sec. FFTlen=2560K, Type=3, Arch=4, Pass1=512, Pass2=5120, clm=2 (8 cpus, 8 workers): 14.02, 13.90, 13.70, 13.58, 14.19, 13.95, 13.65, 13.44 ms. Throughput: 579.71 iter/sec. FFTlen=2560K, Type=3, Arch=4, Pass1=512, Pass2=5120, clm=1 (8 cpus, 1 worker): 1.59 ms. Throughput: 628.85 iter/sec. FFTlen=2560K, Type=3, Arch=4, Pass1=512, Pass2=5120, clm=1 (8 cpus, 8 workers): 13.96, 13.84, 13.71, 13.62, 14.27, 14.07, 13.46, 13.60 ms. Throughput: 579.24 iter/sec. FFTlen=2560K, Type=3, Arch=4, Pass1=640, Pass2=4096, clm=4 (8 cpus, 1 worker): 1.75 ms. Throughput: 570.76 iter/sec. FFTlen=2560K, Type=3, Arch=4, Pass1=640, Pass2=4096, clm=4 (8 cpus, 8 workers): 14.41, 14.34, 14.23, 13.90, 14.89, 14.63, 14.01, 14.15 ms. Throughput: 558.94 iter/sec. [Sat Apr 29 11:48:45 2017] FFTlen=2560K, Type=3, Arch=4, Pass1=640, Pass2=4096, clm=2 (8 cpus, 1 worker): 1.60 ms. Throughput: 626.73 iter/sec. FFTlen=2560K, Type=3, Arch=4, Pass1=640, Pass2=4096, clm=2 (8 cpus, 8 workers): 13.91, 13.68, 13.73, 13.36, 14.05, 13.72, 13.39, 13.56 ms. Throughput: 585.12 iter/sec. FFTlen=2560K, Type=3, Arch=4, Pass1=640, Pass2=4096, clm=1 (8 cpus, 1 worker): 1.58 ms. Throughput: 632.06 iter/sec. FFTlen=2560K, Type=3, Arch=4, Pass1=640, Pass2=4096, clm=1 (8 cpus, 8 workers): 13.94, 13.84, 13.59, 13.48, 13.99, 13.90, 13.49, 13.46 ms. Throughput: 583.57 iter/sec. FFTlen=2560K, Type=3, Arch=4, Pass1=1024, Pass2=2560, clm=4 (8 cpus, 1 worker): 1.88 ms. Throughput: 532.62 iter/sec. FFTlen=2560K, Type=3, Arch=4, Pass1=1024, Pass2=2560, clm=4 (8 cpus, 8 workers): 14.92, 14.78, 14.52, 14.35, 15.04, 14.77, 14.42, 14.43 ms. Throughput: 546.14 iter/sec. FFTlen=2560K, Type=3, Arch=4, Pass1=1024, Pass2=2560, clm=2 (8 cpus, 1 worker): 1.63 ms. Throughput: 612.07 iter/sec. FFTlen=2560K, Type=3, Arch=4, Pass1=1024, Pass2=2560, clm=2 (8 cpus, 8 workers): 13.91, 13.65, 13.42, 13.67, 13.85, 13.69, 13.42, 13.50 ms. Throughput: 586.63 iter/sec. FFTlen=2560K, Type=3, Arch=4, Pass1=1024, Pass2=2560, clm=1 (8 cpus, 1 worker): 1.60 ms. Throughput: 624.60 iter/sec. FFTlen=2560K, Type=3, Arch=4, Pass1=1024, Pass2=2560, clm=1 (8 cpus, 8 workers): 13.75, 13.66, 13.47, 13.35, 13.86, 13.72, 13.37, 13.35 ms. Throughput: 589.75 iter/sec. FFTlen=2560K, Type=3, Arch=4, Pass1=1280, Pass2=2048, clm=4 (8 cpus, 1 worker): 1.79 ms. Throughput: 558.42 iter/sec. FFTlen=2560K, Type=3, Arch=4, Pass1=1280, Pass2=2048, clm=4 (8 cpus, 8 workers): 14.61, 14.43, 14.17, 14.09, 14.84, 14.56, 14.09, 14.14 ms. Throughput: 557.09 iter/sec. FFTlen=2560K, Type=3, Arch=4, Pass1=1280, Pass2=2048, clm=2 (8 cpus, 1 worker): 2.24 ms. Throughput: 447.11 iter/sec. FFTlen=2560K, Type=3, Arch=4, Pass1=1280, Pass2=2048, clm=2 (8 cpus, 8 workers): 13.50, 13.27, 13.16, 13.01, 16.67, 15.69, 16.11, 18.74 ms. Throughput: 541.49 iter/sec. FFTlen=2560K, Type=3, Arch=4, Pass1=1280, Pass2=2048, clm=1 (8 cpus, 1 worker): 1.63 ms. Throughput: 612.37 iter/sec. FFTlen=2560K, Type=3, Arch=4, Pass1=1280, Pass2=2048, clm=1 (8 cpus, 8 workers): 13.50, 13.34, 13.32, 13.25, 13.61, 13.43, 13.10, 13.29 ms. Throughput: 599.09 iter/sec. FFTlen=2688K, Type=3, Arch=4, Pass1=448, Pass2=6144, clm=4 (8 cpus, 1 worker): 1.68 ms. Throughput: 596.37 iter/sec. FFTlen=2688K, Type=3, Arch=4, Pass1=448, Pass2=6144, clm=4 (8 cpus, 8 workers): 14.67, 14.45, 14.27, 14.15, 14.51, 14.27, 14.19, 14.25 ms. Throughput: 557.74 iter/sec. FFTlen=2688K, Type=3, Arch=4, Pass1=448, Pass2=6144, clm=2 (8 cpus, 1 worker): 1.63 ms. Throughput: 613.66 iter/sec. FFTlen=2688K, Type=3, Arch=4, Pass1=448, Pass2=6144, clm=2 (8 cpus, 8 workers): 14.47, 14.41, 14.11, 14.03, 14.82, 14.48, 14.29, 14.02 ms. Throughput: 558.54 iter/sec. FFTlen=2688K, Type=3, Arch=4, Pass1=448, Pass2=6144, clm=1 (8 cpus, 1 worker): 1.69 ms. Throughput: 593.20 iter/sec. FFTlen=2688K, Type=3, Arch=4, Pass1=448, Pass2=6144, clm=1 (8 cpus, 8 workers): 14.60, 14.44, 14.23, 14.15, 14.54, 14.49, 14.13, 14.14 ms. Throughput: 557.94 iter/sec. FFTlen=2688K, Type=3, Arch=4, Pass1=896, Pass2=3072, clm=4 (8 cpus, 1 worker): 1.94 ms. Throughput: 514.63 iter/sec. FFTlen=2688K, Type=3, Arch=4, Pass1=896, Pass2=3072, clm=4 (8 cpus, 8 workers): 15.69, 15.53, 15.37, 15.10, 15.63, 15.47, 15.24, 15.17 ms. Throughput: 519.53 iter/sec. FFTlen=2688K, Type=3, Arch=4, Pass1=896, Pass2=3072, clm=2 (8 cpus, 1 worker): 1.75 ms. Throughput: 571.84 iter/sec. FFTlen=2688K, Type=3, Arch=4, Pass1=896, Pass2=3072, clm=2 (8 cpus, 8 workers): 14.68, 14.56, 14.26, 14.31, 14.81, 14.44, 14.40, 14.31 ms. Throughput: 552.92 iter/sec. FFTlen=2688K, Type=3, Arch=4, Pass1=896, Pass2=3072, clm=1 (8 cpus, 1 worker): 1.70 ms. Throughput: 589.36 iter/sec. FFTlen=2688K, Type=3, Arch=4, Pass1=896, Pass2=3072, clm=1 (8 cpus, 8 workers): 14.74, 14.71, 14.44, 14.32, 14.77, 14.56, 14.33, 14.31 ms. Throughput: 551.00 iter/sec. FFTlen=2800K, Type=3, Arch=4, Pass1=448, Pass2=6400, clm=4 (8 cpus, 1 worker): 1.88 ms. Throughput: 531.27 iter/sec. FFTlen=2800K, Type=3, Arch=4, Pass1=448, Pass2=6400, clm=4 (8 cpus, 8 workers): 15.68, 15.61, 15.32, 15.25, 15.82, 15.97, 15.63, 15.23 ms. Throughput: 514.18 iter/sec. FFTlen=2800K, Type=3, Arch=4, Pass1=448, Pass2=6400, clm=2 (8 cpus, 1 worker): 1.81 ms. Throughput: 551.41 iter/sec. FFTlen=2800K, Type=3, Arch=4, Pass1=448, Pass2=6400, clm=2 (8 cpus, 8 workers): 15.51, 15.37, 15.28, 15.06, 15.79, 15.63, 15.09, 15.04 ms. Throughput: 521.46 iter/sec. FFTlen=2800K, Type=3, Arch=4, Pass1=448, Pass2=6400, clm=1 (8 cpus, 1 worker): 1.88 ms. Throughput: 533.25 iter/sec. FFTlen=2800K, Type=3, Arch=4, Pass1=448, Pass2=6400, clm=1 (8 cpus, 8 workers): 15.57, 15.54, 15.31, 15.17, 15.74, 15.64, 15.17, 15.17 ms. Throughput: 519.18 iter/sec. [Sat Apr 29 11:53:46 2017] FFTlen=2880K, Type=3, Arch=4, Pass1=320, Pass2=9216, clm=4 (8 cpus, 1 worker): 1.96 ms. Throughput: 509.59 iter/sec. FFTlen=2880K, Type=3, Arch=4, Pass1=320, Pass2=9216, clm=4 (8 cpus, 8 workers): 16.50, 16.21, 16.20, 15.89, 16.72, 16.20, 15.97, 15.97 ms. Throughput: 493.73 iter/sec. FFTlen=2880K, Type=3, Arch=4, Pass1=320, Pass2=9216, clm=2 (8 cpus, 1 worker): 1.93 ms. Throughput: 517.68 iter/sec. FFTlen=2880K, Type=3, Arch=4, Pass1=320, Pass2=9216, clm=2 (8 cpus, 8 workers): 16.21, 16.16, 15.99, 15.96, 16.55, 16.11, 15.92, 15.98 ms. Throughput: 496.66 iter/sec. FFTlen=2880K, Type=3, Arch=4, Pass1=320, Pass2=9216, clm=1 (8 cpus, 1 worker): 2.12 ms. Throughput: 471.44 iter/sec. FFTlen=2880K, Type=3, Arch=4, Pass1=320, Pass2=9216, clm=1 (8 cpus, 8 workers): 16.45, 16.12, 16.06, 15.83, 16.81, 16.40, 15.91, 15.98 ms. Throughput: 494.17 iter/sec. FFTlen=2880K, Type=3, Arch=4, Pass1=384, Pass2=7680, clm=4 (8 cpus, 1 worker): 1.83 ms. Throughput: 545.29 iter/sec. FFTlen=2880K, Type=3, Arch=4, Pass1=384, Pass2=7680, clm=4 (8 cpus, 8 workers): 15.74, 15.45, 15.39, 15.61, 15.59, 15.45, 15.32, 15.75 ms. Throughput: 514.95 iter/sec. FFTlen=2880K, Type=3, Arch=4, Pass1=384, Pass2=7680, clm=2 (8 cpus, 1 worker): 1.79 ms. Throughput: 557.34 iter/sec. FFTlen=2880K, Type=3, Arch=4, Pass1=384, Pass2=7680, clm=2 (8 cpus, 8 workers): 15.62, 15.44, 15.25, 15.13, 15.66, 15.50, 15.15, 15.11 ms. Throughput: 521.02 iter/sec. FFTlen=2880K, Type=3, Arch=4, Pass1=384, Pass2=7680, clm=1 (8 cpus, 1 worker): 1.81 ms. Throughput: 552.45 iter/sec. FFTlen=2880K, Type=3, Arch=4, Pass1=384, Pass2=7680, clm=1 (8 cpus, 8 workers): 15.67, 15.60, 15.31, 15.20, 15.84, 15.53, 15.23, 15.19 ms. Throughput: 518.13 iter/sec. FFTlen=2880K, Type=3, Arch=4, Pass1=640, Pass2=4608, clm=4 (8 cpus, 1 worker): 2.05 ms. Throughput: 487.96 iter/sec. FFTlen=2880K, Type=3, Arch=4, Pass1=640, Pass2=4608, clm=4 (8 cpus, 8 workers): 16.55, 16.46, 16.17, 16.05, 16.63, 16.52, 16.07, 15.96 ms. Throughput: 490.86 iter/sec. FFTlen=2880K, Type=3, Arch=4, Pass1=640, Pass2=4608, clm=2 (8 cpus, 1 worker): 1.85 ms. Throughput: 539.20 iter/sec. FFTlen=2880K, Type=3, Arch=4, Pass1=640, Pass2=4608, clm=2 (8 cpus, 8 workers): 15.86, 15.68, 15.40, 15.35, 15.83, 15.79, 15.34, 15.33 ms. Throughput: 513.84 iter/sec. FFTlen=2880K, Type=3, Arch=4, Pass1=640, Pass2=4608, clm=1 (8 cpus, 1 worker): 1.84 ms. Throughput: 543.23 iter/sec. FFTlen=2880K, Type=3, Arch=4, Pass1=640, Pass2=4608, clm=1 (8 cpus, 8 workers): 15.77, 15.66, 15.43, 15.31, 15.79, 15.90, 15.45, 15.30 ms. Throughput: 513.64 iter/sec. FFTlen=2880K, Type=3, Arch=4, Pass1=768, Pass2=3840, clm=4 (8 cpus, 1 worker): 2.08 ms. Throughput: 480.11 iter/sec. FFTlen=2880K, Type=3, Arch=4, Pass1=768, Pass2=3840, clm=4 (8 cpus, 8 workers): 16.69, 16.56, 16.23, 16.31, 16.62, 16.51, 16.18, 16.24 ms. Throughput: 487.32 iter/sec. FFTlen=2880K, Type=3, Arch=4, Pass1=768, Pass2=3840, clm=2 (8 cpus, 1 worker): 1.86 ms. Throughput: 537.60 iter/sec. FFTlen=2880K, Type=3, Arch=4, Pass1=768, Pass2=3840, clm=2 (8 cpus, 8 workers): 15.75, 15.65, 15.55, 15.41, 15.97, 15.81, 15.48, 15.45 ms. Throughput: 511.78 iter/sec. FFTlen=2880K, Type=3, Arch=4, Pass1=768, Pass2=3840, clm=1 (8 cpus, 1 worker): 1.82 ms. Throughput: 549.45 iter/sec. FFTlen=2880K, Type=3, Arch=4, Pass1=768, Pass2=3840, clm=1 (8 cpus, 8 workers): 15.86, 15.74, 15.55, 15.40, 16.03, 15.90, 15.39, 15.39 ms. Throughput: 511.02 iter/sec. FFTlen=2880K, Type=3, Arch=4, Pass1=1280, Pass2=2304, clm=4 (8 cpus, 1 worker): 2.20 ms. Throughput: 455.22 iter/sec. FFTlen=2880K, Type=3, Arch=4, Pass1=1280, Pass2=2304, clm=4 (8 cpus, 8 workers): 17.21, 17.06, 16.89, 16.69, 17.53, 17.15, 16.72, 16.78 ms. Throughput: 470.58 iter/sec. FFTlen=2880K, Type=3, Arch=4, Pass1=1280, Pass2=2304, clm=2 (8 cpus, 1 worker): 2.00 ms. Throughput: 500.69 iter/sec. FFTlen=2880K, Type=3, Arch=4, Pass1=1280, Pass2=2304, clm=2 (8 cpus, 8 workers): 16.35, 16.11, 15.87, 15.77, 16.39, 16.50, 15.75, 15.84 ms. Throughput: 497.86 iter/sec. FFTlen=2880K, Type=3, Arch=4, Pass1=1280, Pass2=2304, clm=1 (8 cpus, 1 worker): 1.90 ms. Throughput: 525.44 iter/sec. FFTlen=2880K, Type=3, Arch=4, Pass1=1280, Pass2=2304, clm=1 (8 cpus, 8 workers): 15.87, 15.68, 15.54, 15.44, 15.82, 15.72, 15.56, 15.41 ms. Throughput: 511.85 iter/sec. FFTlen=3072K, Type=3, Arch=4, Pass1=256, Pass2=12288, clm=4 (8 cpus, 1 worker): 2.25 ms. Throughput: 444.37 iter/sec. FFTlen=3072K, Type=3, Arch=4, Pass1=256, Pass2=12288, clm=4 (8 cpus, 8 workers): 18.11, 17.95, 17.78, 17.79, 18.32, 18.03, 17.69, 17.78 ms. Throughput: 446.17 iter/sec. FFTlen=3072K, Type=3, Arch=4, Pass1=256, Pass2=12288, clm=2 (8 cpus, 1 worker): 2.27 ms. Throughput: 440.61 iter/sec. FFTlen=3072K, Type=3, Arch=4, Pass1=256, Pass2=12288, clm=2 (8 cpus, 8 workers): 17.85, 17.50, 17.53, 17.34, 17.88, 17.71, 17.30, 17.60 ms. Throughput: 454.89 iter/sec. [Sat Apr 29 11:58:50 2017] FFTlen=3072K, Type=3, Arch=4, Pass1=256, Pass2=12288, clm=1 (8 cpus, 1 worker): 2.19 ms. Throughput: 457.19 iter/sec. FFTlen=3072K, Type=3, Arch=4, Pass1=256, Pass2=12288, clm=1 (8 cpus, 8 workers): 17.77, 17.76, 17.51, 17.65, 18.07, 17.67, 17.36, 17.39 ms. Throughput: 453.38 iter/sec. FFTlen=3072K, Type=3, Arch=4, Pass1=384, Pass2=8192, clm=4 (8 cpus, 1 worker): 2.00 ms. Throughput: 500.19 iter/sec. FFTlen=3072K, Type=3, Arch=4, Pass1=384, Pass2=8192, clm=4 (8 cpus, 8 workers): 17.12, 16.88, 16.39, 16.87, 16.81, 16.65, 16.74, 16.24 ms. Throughput: 478.83 iter/sec. FFTlen=3072K, Type=3, Arch=4, Pass1=384, Pass2=8192, clm=2 (8 cpus, 1 worker): 1.94 ms. Throughput: 515.44 iter/sec. FFTlen=3072K, Type=3, Arch=4, Pass1=384, Pass2=8192, clm=2 (8 cpus, 8 workers): 16.78, 16.54, 16.48, 16.32, 16.84, 16.58, 16.45, 16.30 ms. Throughput: 483.86 iter/sec. FFTlen=3072K, Type=3, Arch=4, Pass1=384, Pass2=8192, clm=1 (8 cpus, 1 worker): 1.97 ms. Throughput: 507.49 iter/sec. FFTlen=3072K, Type=3, Arch=4, Pass1=384, Pass2=8192, clm=1 (8 cpus, 8 workers): 16.79, 16.61, 16.46, 16.42, 16.93, 16.69, 16.47, 16.36 ms. Throughput: 482.23 iter/sec. FFTlen=3072K, Type=3, Arch=4, Pass1=512, Pass2=6144, clm=4 (8 cpus, 1 worker): 2.12 ms. Throughput: 471.02 iter/sec. FFTlen=3072K, Type=3, Arch=4, Pass1=512, Pass2=6144, clm=4 (8 cpus, 8 workers): 17.42, 17.27, 17.03, 16.88, 17.49, 17.43, 16.95, 16.94 ms. Throughput: 465.85 iter/sec. FFTlen=3072K, Type=3, Arch=4, Pass1=512, Pass2=6144, clm=2 (8 cpus, 1 worker): 1.96 ms. Throughput: 510.85 iter/sec. FFTlen=3072K, Type=3, Arch=4, Pass1=512, Pass2=6144, clm=2 (8 cpus, 8 workers): 16.85, 16.70, 16.38, 16.29, 17.02, 16.53, 16.28, 16.25 ms. Throughput: 483.87 iter/sec. FFTlen=3072K, Type=3, Arch=4, Pass1=512, Pass2=6144, clm=1 (8 cpus, 1 worker): 1.94 ms. Throughput: 516.64 iter/sec. FFTlen=3072K, Type=3, Arch=4, Pass1=512, Pass2=6144, clm=1 (8 cpus, 8 workers): 16.83, 16.77, 16.51, 16.32, 16.85, 16.90, 16.28, 16.32 ms. Throughput: 482.13 iter/sec. FFTlen=3072K, Type=3, Arch=4, Pass1=768, Pass2=4096, clm=4 (8 cpus, 1 worker): 2.22 ms. Throughput: 450.52 iter/sec. FFTlen=3072K, Type=3, Arch=4, Pass1=768, Pass2=4096, clm=4 (8 cpus, 8 workers): 17.62, 17.42, 17.20, 17.08, 17.76, 17.51, 17.15, 17.07 ms. Throughput: 461.16 iter/sec. FFTlen=3072K, Type=3, Arch=4, Pass1=768, Pass2=4096, clm=2 (8 cpus, 1 worker): 1.98 ms. Throughput: 503.93 iter/sec. FFTlen=3072K, Type=3, Arch=4, Pass1=768, Pass2=4096, clm=2 (8 cpus, 8 workers): 16.82, 16.67, 16.41, 16.29, 16.95, 16.65, 16.34, 16.30 ms. Throughput: 483.36 iter/sec. FFTlen=3072K, Type=3, Arch=4, Pass1=768, Pass2=4096, clm=1 (8 cpus, 1 worker): 1.94 ms. Throughput: 515.92 iter/sec. FFTlen=3072K, Type=3, Arch=4, Pass1=768, Pass2=4096, clm=1 (8 cpus, 8 workers): 16.61, 16.45, 16.33, 16.23, 16.77, 16.41, 16.24, 16.23 ms. Throughput: 487.57 iter/sec. FFTlen=3072K, Type=3, Arch=4, Pass1=1024, Pass2=3072, clm=4 (8 cpus, 1 worker): 2.30 ms. Throughput: 434.62 iter/sec. FFTlen=3072K, Type=3, Arch=4, Pass1=1024, Pass2=3072, clm=4 (8 cpus, 8 workers): 18.08, 17.96, 17.80, 17.53, 18.16, 17.97, 17.67, 17.63 ms. Throughput: 448.23 iter/sec. FFTlen=3072K, Type=3, Arch=4, Pass1=1024, Pass2=3072, clm=2 (8 cpus, 1 worker): 2.03 ms. Throughput: 493.15 iter/sec. FFTlen=3072K, Type=3, Arch=4, Pass1=1024, Pass2=3072, clm=2 (8 cpus, 8 workers): 17.10, 17.01, 16.67, 16.60, 17.02, 16.97, 16.55, 16.47 ms. Throughput: 476.37 iter/sec. FFTlen=3072K, Type=3, Arch=4, Pass1=1024, Pass2=3072, clm=1 (8 cpus, 1 worker): 1.99 ms. Throughput: 502.77 iter/sec. FFTlen=3072K, Type=3, Arch=4, Pass1=1024, Pass2=3072, clm=1 (8 cpus, 8 workers): 17.09, 16.85, 16.53, 16.35, 17.11, 16.88, 16.40, 16.38 ms. Throughput: 479.17 iter/sec. FFTlen=3072K, Type=3, Arch=4, Pass1=1536, Pass2=2048, clm=4 (8 cpus, 1 worker): 2.31 ms. Throughput: 432.09 iter/sec. FFTlen=3072K, Type=3, Arch=4, Pass1=1536, Pass2=2048, clm=4 (8 cpus, 8 workers): 17.92, 17.73, 17.43, 17.27, 18.13, 18.36, 17.22, 17.40 ms. Throughput: 452.60 iter/sec. FFTlen=3072K, Type=3, Arch=4, Pass1=1536, Pass2=2048, clm=2 (8 cpus, 1 worker): 2.08 ms. Throughput: 481.06 iter/sec. FFTlen=3072K, Type=3, Arch=4, Pass1=1536, Pass2=2048, clm=2 (8 cpus, 8 workers): 16.76, 16.66, 16.49, 16.38, 17.10, 17.52, 16.32, 16.95 ms. Throughput: 477.29 iter/sec. FFTlen=3072K, Type=3, Arch=4, Pass1=1536, Pass2=2048, clm=1 (8 cpus, 1 worker): 1.96 ms. Throughput: 508.96 iter/sec. FFTlen=3072K, Type=3, Arch=4, Pass1=1536, Pass2=2048, clm=1 (8 cpus, 8 workers): 16.33, 16.22, 16.18, 16.09, 16.64, 16.30, 16.01, 16.22 ms. Throughput: 492.45 iter/sec. FFTlen=3200K, Type=3, Arch=4, Pass1=128, Pass2=25600, clm=4 (8 cpus, 1 worker): 2.59 ms. Throughput: 386.61 iter/sec. FFTlen=3200K, Type=3, Arch=4, Pass1=128, Pass2=25600, clm=4 (8 cpus, 8 workers): 20.07, 19.77, 19.16, 19.00, 20.40, 19.85, 19.21, 18.95 ms. Throughput: 409.47 iter/sec. [Sat Apr 29 12:03:57 2017] FFTlen=3200K, Type=3, Arch=4, Pass1=128, Pass2=25600, clm=2 (8 cpus, 1 worker): 2.58 ms. Throughput: 388.05 iter/sec. FFTlen=3200K, Type=3, Arch=4, Pass1=128, Pass2=25600, clm=2 (8 cpus, 8 workers): 19.52, 19.38, 19.14, 18.99, 19.68, 19.46, 19.00, 19.05 ms. Throughput: 415.04 iter/sec. FFTlen=3200K, Type=3, Arch=4, Pass1=128, Pass2=25600, clm=1 (8 cpus, 1 worker): 3.40 ms. Throughput: 294.21 iter/sec. FFTlen=3200K, Type=3, Arch=4, Pass1=128, Pass2=25600, clm=1 (8 cpus, 8 workers): 20.04, 19.42, 19.47, 21.16, 19.67, 19.09, 19.02, 19.09 ms. Throughput: 408.20 iter/sec. FFTlen=3200K, Type=3, Arch=4, Pass1=256, Pass2=12800, clm=4 (8 cpus, 1 worker): 2.23 ms. Throughput: 448.31 iter/sec. FFTlen=3200K, Type=3, Arch=4, Pass1=256, Pass2=12800, clm=4 (8 cpus, 8 workers): 18.10, 17.98, 17.82, 17.69, 18.29, 18.06, 17.71, 17.73 ms. Throughput: 446.46 iter/sec. FFTlen=3200K, Type=3, Arch=4, Pass1=256, Pass2=12800, clm=2 (8 cpus, 1 worker): 2.18 ms. Throughput: 459.67 iter/sec. FFTlen=3200K, Type=3, Arch=4, Pass1=256, Pass2=12800, clm=2 (8 cpus, 8 workers): 17.77, 17.59, 17.46, 17.31, 17.84, 17.76, 17.33, 17.35 ms. Throughput: 455.89 iter/sec. FFTlen=3200K, Type=3, Arch=4, Pass1=256, Pass2=12800, clm=1 (8 cpus, 1 worker): 2.20 ms. Throughput: 454.33 iter/sec. FFTlen=3200K, Type=3, Arch=4, Pass1=256, Pass2=12800, clm=1 (8 cpus, 8 workers): 17.87, 17.71, 17.49, 17.43, 17.96, 17.76, 17.41, 17.44 ms. Throughput: 453.76 iter/sec. FFTlen=3200K, Type=3, Arch=4, Pass1=320, Pass2=10240, clm=4 (8 cpus, 1 worker): 2.19 ms. Throughput: 456.76 iter/sec. FFTlen=3200K, Type=3, Arch=4, Pass1=320, Pass2=10240, clm=4 (8 cpus, 8 workers): 17.96, 17.69, 17.65, 17.48, 18.11, 17.73, 17.45, 17.50 ms. Throughput: 452.14 iter/sec. FFTlen=3200K, Type=3, Arch=4, Pass1=320, Pass2=10240, clm=2 (8 cpus, 1 worker): 2.13 ms. Throughput: 470.05 iter/sec. FFTlen=3200K, Type=3, Arch=4, Pass1=320, Pass2=10240, clm=2 (8 cpus, 8 workers): 17.82, 17.64, 17.53, 17.39, 17.95, 17.72, 17.41, 17.39 ms. Throughput: 454.43 iter/sec. FFTlen=3200K, Type=3, Arch=4, Pass1=320, Pass2=10240, clm=1 (8 cpus, 1 worker): 2.14 ms. Throughput: 466.51 iter/sec. FFTlen=3200K, Type=3, Arch=4, Pass1=320, Pass2=10240, clm=1 (8 cpus, 8 workers): 17.85, 17.71, 17.53, 17.45, 18.00, 17.59, 17.39, 17.44 ms. Throughput: 454.06 iter/sec. FFTlen=3200K, Type=3, Arch=4, Pass1=512, Pass2=6400, clm=4 (8 cpus, 1 worker): 2.33 ms. Throughput: 428.78 iter/sec. FFTlen=3200K, Type=3, Arch=4, Pass1=512, Pass2=6400, clm=4 (8 cpus, 8 workers): 18.51, 18.46, 18.24, 18.14, 18.63, 18.50, 18.20, 18.13 ms. Throughput: 436.01 iter/sec. FFTlen=3200K, Type=3, Arch=4, Pass1=512, Pass2=6400, clm=2 (8 cpus, 1 worker): 2.13 ms. Throughput: 469.13 iter/sec. FFTlen=3200K, Type=3, Arch=4, Pass1=512, Pass2=6400, clm=2 (8 cpus, 8 workers): 17.84, 17.69, 17.59, 17.45, 17.92, 17.84, 17.50, 17.43 ms. Throughput: 453.14 iter/sec. FFTlen=3200K, Type=3, Arch=4, Pass1=512, Pass2=6400, clm=1 (8 cpus, 1 worker): 2.11 ms. Throughput: 473.92 iter/sec. FFTlen=3200K, Type=3, Arch=4, Pass1=512, Pass2=6400, clm=1 (8 cpus, 8 workers): 18.02, 17.82, 17.67, 17.49, 18.14, 17.72, 17.54, 17.49 ms. Throughput: 451.18 iter/sec. FFTlen=3200K, Type=3, Arch=4, Pass1=640, Pass2=5120, clm=4 (8 cpus, 1 worker): 2.28 ms. Throughput: 439.20 iter/sec. FFTlen=3200K, Type=3, Arch=4, Pass1=640, Pass2=5120, clm=4 (8 cpus, 8 workers): 18.58, 18.34, 18.07, 17.92, 18.67, 18.29, 17.96, 18.06 ms. Throughput: 438.76 iter/sec. FFTlen=3200K, Type=3, Arch=4, Pass1=640, Pass2=5120, clm=2 (8 cpus, 1 worker): 2.09 ms. Throughput: 479.45 iter/sec. FFTlen=3200K, Type=3, Arch=4, Pass1=640, Pass2=5120, clm=2 (8 cpus, 8 workers): 17.74, 17.54, 17.31, 17.16, 17.77, 17.63, 17.23, 17.13 ms. Throughput: 458.85 iter/sec. FFTlen=3200K, Type=3, Arch=4, Pass1=640, Pass2=5120, clm=1 (8 cpus, 1 worker): 2.07 ms. Throughput: 483.28 iter/sec. FFTlen=3200K, Type=3, Arch=4, Pass1=640, Pass2=5120, clm=1 (8 cpus, 8 workers): 17.64, 17.53, 17.31, 17.18, 17.87, 17.45, 17.25, 17.12 ms. Throughput: 459.33 iter/sec. FFTlen=3200K, Type=3, Arch=4, Pass1=1280, Pass2=2560, clm=4 (8 cpus, 1 worker): 2.41 ms. Throughput: 415.77 iter/sec. FFTlen=3200K, Type=3, Arch=4, Pass1=1280, Pass2=2560, clm=4 (8 cpus, 8 workers): 18.54, 18.36, 18.04, 17.94, 18.75, 19.11, 17.97, 17.95 ms. Throughput: 436.60 iter/sec. FFTlen=3200K, Type=3, Arch=4, Pass1=1280, Pass2=2560, clm=2 (8 cpus, 1 worker): 2.20 ms. Throughput: 455.28 iter/sec. FFTlen=3200K, Type=3, Arch=4, Pass1=1280, Pass2=2560, clm=2 (8 cpus, 8 workers): 17.42, 17.26, 17.11, 17.10, 17.79, 17.42, 17.05, 16.98 ms. Throughput: 463.49 iter/sec. FFTlen=3200K, Type=3, Arch=4, Pass1=1280, Pass2=2560, clm=1 (8 cpus, 1 worker): 2.10 ms. Throughput: 476.26 iter/sec. FFTlen=3200K, Type=3, Arch=4, Pass1=1280, Pass2=2560, clm=1 (8 cpus, 8 workers): 17.23, 17.14, 16.94, 16.90, 17.18, 17.22, 16.96, 17.36 ms. Throughput: 467.46 iter/sec. [Sat Apr 29 12:09:06 2017] FFTlen=3360K, Type=3, Arch=4, Pass1=448, Pass2=7680, clm=4 (8 cpus, 1 worker): 2.26 ms. Throughput: 441.70 iter/sec. FFTlen=3360K, Type=3, Arch=4, Pass1=448, Pass2=7680, clm=4 (8 cpus, 8 workers): 18.36, 18.23, 18.09, 18.42, 18.50, 18.29, 18.00, 18.63 ms. Throughput: 436.87 iter/sec. FFTlen=3360K, Type=3, Arch=4, Pass1=448, Pass2=7680, clm=2 (8 cpus, 1 worker): 2.20 ms. Throughput: 454.89 iter/sec. FFTlen=3360K, Type=3, Arch=4, Pass1=448, Pass2=7680, clm=2 (8 cpus, 8 workers): 18.25, 18.65, 18.10, 17.88, 18.53, 18.52, 17.91, 17.85 ms. Throughput: 439.40 iter/sec. FFTlen=3360K, Type=3, Arch=4, Pass1=448, Pass2=7680, clm=1 (8 cpus, 1 worker): 2.23 ms. Throughput: 448.76 iter/sec. FFTlen=3360K, Type=3, Arch=4, Pass1=448, Pass2=7680, clm=1 (8 cpus, 8 workers): 18.41, 18.36, 18.28, 17.95, 18.45, 18.26, 17.95, 17.81 ms. Throughput: 440.03 iter/sec. FFTlen=3360K, Type=3, Arch=4, Pass1=896, Pass2=3840, clm=4 (8 cpus, 1 worker): 2.56 ms. Throughput: 389.92 iter/sec. FFTlen=3360K, Type=3, Arch=4, Pass1=896, Pass2=3840, clm=4 (8 cpus, 8 workers): 19.93, 19.75, 19.65, 19.50, 20.08, 19.88, 19.43, 19.47 ms. Throughput: 405.89 iter/sec. FFTlen=3360K, Type=3, Arch=4, Pass1=896, Pass2=3840, clm=2 (8 cpus, 1 worker): 2.28 ms. Throughput: 438.28 iter/sec. FFTlen=3360K, Type=3, Arch=4, Pass1=896, Pass2=3840, clm=2 (8 cpus, 8 workers): 18.84, 18.69, 18.52, 18.34, 19.27, 18.75, 18.40, 18.32 ms. Throughput: 429.24 iter/sec. FFTlen=3360K, Type=3, Arch=4, Pass1=896, Pass2=3840, clm=1 (8 cpus, 1 worker): 2.25 ms. Throughput: 444.30 iter/sec. FFTlen=3360K, Type=3, Arch=4, Pass1=896, Pass2=3840, clm=1 (8 cpus, 8 workers): 18.72, 18.62, 18.40, 18.15, 18.94, 18.66, 18.28, 18.22 ms. Throughput: 432.57 iter/sec. FFTlen=3456K, Type=3, Arch=4, Pass1=384, Pass2=9216, clm=4 (8 cpus, 1 worker): 2.43 ms. Throughput: 411.26 iter/sec. FFTlen=3456K, Type=3, Arch=4, Pass1=384, Pass2=9216, clm=4 (8 cpus, 8 workers): 19.80, 19.54, 19.38, 19.40, 19.95, 19.36, 19.01, 19.58 ms. Throughput: 410.29 iter/sec. FFTlen=3456K, Type=3, Arch=4, Pass1=384, Pass2=9216, clm=2 (8 cpus, 1 worker): 2.36 ms. Throughput: 423.36 iter/sec. FFTlen=3456K, Type=3, Arch=4, Pass1=384, Pass2=9216, clm=2 (8 cpus, 8 workers): 19.50, 19.44, 19.10, 19.00, 19.68, 19.31, 19.99, 19.40 ms. Throughput: 411.86 iter/sec. FFTlen=3456K, Type=3, Arch=4, Pass1=384, Pass2=9216, clm=1 (8 cpus, 1 worker): 2.38 ms. Throughput: 419.86 iter/sec. FFTlen=3456K, Type=3, Arch=4, Pass1=384, Pass2=9216, clm=1 (8 cpus, 8 workers): 19.67, 19.48, 19.12, 19.07, 19.84, 19.33, 19.14, 19.08 ms. Throughput: 413.71 iter/sec. FFTlen=3456K, Type=3, Arch=4, Pass1=768, Pass2=4608, clm=4 (8 cpus, 1 worker): 2.58 ms. Throughput: 387.59 iter/sec. FFTlen=3456K, Type=3, Arch=4, Pass1=768, Pass2=4608, clm=4 (8 cpus, 8 workers): 20.14, 19.82, 19.75, 19.49, 20.24, 19.99, 19.46, 19.44 ms. Throughput: 404.29 iter/sec. FFTlen=3456K, Type=3, Arch=4, Pass1=768, Pass2=4608, clm=2 (8 cpus, 1 worker): 2.33 ms. Throughput: 429.64 iter/sec. FFTlen=3456K, Type=3, Arch=4, Pass1=768, Pass2=4608, clm=2 (8 cpus, 8 workers): 19.12, 18.89, 18.66, 18.46, 19.08, 19.04, 18.44, 18.46 ms. Throughput: 426.34 iter/sec. FFTlen=3456K, Type=3, Arch=4, Pass1=768, Pass2=4608, clm=1 (8 cpus, 1 worker): 2.28 ms. Throughput: 438.59 iter/sec. FFTlen=3456K, Type=3, Arch=4, Pass1=768, Pass2=4608, clm=1 (8 cpus, 8 workers): 18.96, 18.85, 18.53, 18.48, 19.37, 18.70, 18.28, 18.37 ms. Throughput: 428.13 iter/sec. FFTlen=3456K, Type=3, Arch=4, Pass1=1536, Pass2=2304, clm=4 (8 cpus, 1 worker): 2.80 ms. Throughput: 357.28 iter/sec. FFTlen=3456K, Type=3, Arch=4, Pass1=1536, Pass2=2304, clm=4 (8 cpus, 8 workers): 21.30, 21.07, 20.59, 20.52, 22.14, 21.63, 20.40, 20.54 ms. Throughput: 380.83 iter/sec. FFTlen=3456K, Type=3, Arch=4, Pass1=1536, Pass2=2304, clm=2 (8 cpus, 1 worker): 2.52 ms. Throughput: 396.57 iter/sec. FFTlen=3456K, Type=3, Arch=4, Pass1=1536, Pass2=2304, clm=2 (8 cpus, 8 workers): 20.02, 19.55, 19.34, 19.46, 19.89, 19.70, 19.35, 19.25 ms. Throughput: 408.87 iter/sec. FFTlen=3456K, Type=3, Arch=4, Pass1=1536, Pass2=2304, clm=1 (8 cpus, 1 worker): 2.37 ms. Throughput: 422.45 iter/sec. FFTlen=3456K, Type=3, Arch=4, Pass1=1536, Pass2=2304, clm=1 (8 cpus, 8 workers): 19.47, 18.99, 18.94, 18.66, 19.41, 19.20, 18.89, 18.76 ms. Throughput: 420.25 iter/sec. FFTlen=3584K, Type=3, Arch=4, Pass1=448, Pass2=8192, clm=4 (8 cpus, 1 worker): 2.48 ms. Throughput: 403.03 iter/sec. FFTlen=3584K, Type=3, Arch=4, Pass1=448, Pass2=8192, clm=4 (8 cpus, 8 workers): 19.92, 21.47, 19.57, 19.50, 20.16, 19.84, 19.91, 19.87 ms. Throughput: 399.72 iter/sec. FFTlen=3584K, Type=3, Arch=4, Pass1=448, Pass2=8192, clm=2 (8 cpus, 1 worker): 2.41 ms. Throughput: 414.46 iter/sec. FFTlen=3584K, Type=3, Arch=4, Pass1=448, Pass2=8192, clm=2 (8 cpus, 8 workers): 19.98, 20.29, 19.94, 19.68, 19.76, 19.72, 19.30, 19.85 ms. Throughput: 403.82 iter/sec. [Sat Apr 29 12:14:15 2017] FFTlen=3584K, Type=3, Arch=4, Pass1=448, Pass2=8192, clm=1 (8 cpus, 1 worker): 2.43 ms. Throughput: 410.73 iter/sec. FFTlen=3584K, Type=3, Arch=4, Pass1=448, Pass2=8192, clm=1 (8 cpus, 8 workers): 21.25, 19.99, 19.94, 20.92, 19.87, 19.63, 19.27, 19.30 ms. Throughput: 400.01 iter/sec. FFTlen=3584K, Type=3, Arch=4, Pass1=896, Pass2=4096, clm=4 (8 cpus, 1 worker): 2.67 ms. Throughput: 374.88 iter/sec. FFTlen=3584K, Type=3, Arch=4, Pass1=896, Pass2=4096, clm=4 (8 cpus, 8 workers): 20.92, 20.88, 20.45, 20.39, 21.00, 20.71, 20.60, 20.36 ms. Throughput: 387.18 iter/sec. FFTlen=3584K, Type=3, Arch=4, Pass1=896, Pass2=4096, clm=2 (8 cpus, 1 worker): 2.40 ms. Throughput: 417.03 iter/sec. FFTlen=3584K, Type=3, Arch=4, Pass1=896, Pass2=4096, clm=2 (8 cpus, 8 workers): 19.79, 19.52, 19.40, 19.25, 19.70, 19.81, 19.94, 19.30 ms. Throughput: 408.42 iter/sec. FFTlen=3584K, Type=3, Arch=4, Pass1=896, Pass2=4096, clm=1 (8 cpus, 1 worker): 2.36 ms. Throughput: 423.34 iter/sec. FFTlen=3584K, Type=3, Arch=4, Pass1=896, Pass2=4096, clm=1 (8 cpus, 8 workers): 19.68, 19.56, 19.38, 19.18, 20.01, 19.75, 19.20, 19.31 ms. Throughput: 410.16 iter/sec. FFTlen=3584K, Type=3, Arch=4, Pass1=1792, Pass2=2048, clm=4 (8 cpus, 1 worker): 2.88 ms. Throughput: 346.82 iter/sec. FFTlen=3584K, Type=3, Arch=4, Pass1=1792, Pass2=2048, clm=4 (8 cpus, 8 workers): 21.83, 21.56, 21.19, 21.16, 22.14, 21.74, 21.08, 21.18 ms. Throughput: 372.49 iter/sec. FFTlen=3584K, Type=3, Arch=4, Pass1=1792, Pass2=2048, clm=2 (8 cpus, 1 worker): 2.60 ms. Throughput: 384.98 iter/sec. FFTlen=3584K, Type=3, Arch=4, Pass1=1792, Pass2=2048, clm=2 (8 cpus, 8 workers): 20.53, 20.28, 19.86, 19.76, 20.57, 20.48, 19.95, 19.76 ms. Throughput: 397.13 iter/sec. FFTlen=3584K, Type=3, Arch=4, Pass1=1792, Pass2=2048, clm=1 (8 cpus, 1 worker): 2.46 ms. Throughput: 406.54 iter/sec. FFTlen=3584K, Type=3, Arch=4, Pass1=1792, Pass2=2048, clm=1 (8 cpus, 8 workers): 19.72, 19.59, 19.32, 19.26, 19.82, 19.99, 19.21, 19.19 ms. Throughput: 410.09 iter/sec. FFTlen=3840K, Type=3, Arch=4, Pass1=256, Pass2=15360, clm=4 (8 cpus, 1 worker): 2.93 ms. Throughput: 341.79 iter/sec. FFTlen=3840K, Type=3, Arch=4, Pass1=256, Pass2=15360, clm=4 (8 cpus, 8 workers): 22.82, 22.66, 22.36, 22.23, 22.99, 22.60, 22.19, 22.20 ms. Throughput: 355.52 iter/sec. FFTlen=3840K, Type=3, Arch=4, Pass1=256, Pass2=15360, clm=2 (8 cpus, 1 worker): 2.81 ms. Throughput: 355.26 iter/sec. FFTlen=3840K, Type=3, Arch=4, Pass1=256, Pass2=15360, clm=2 (8 cpus, 8 workers): 22.29, 22.08, 21.95, 21.90, 22.39, 22.22, 21.76, 21.85 ms. Throughput: 362.75 iter/sec. FFTlen=3840K, Type=3, Arch=4, Pass1=256, Pass2=15360, clm=1 (8 cpus, 1 worker): 2.88 ms. Throughput: 347.04 iter/sec. FFTlen=3840K, Type=3, Arch=4, Pass1=256, Pass2=15360, clm=1 (8 cpus, 8 workers): 22.21, 22.07, 21.88, 21.59, 22.56, 22.10, 21.82, 21.82 ms. Throughput: 363.62 iter/sec. FFTlen=3840K, Type=3, Arch=4, Pass1=320, Pass2=12288, clm=4 (8 cpus, 1 worker): 2.89 ms. Throughput: 345.74 iter/sec. FFTlen=3840K, Type=3, Arch=4, Pass1=320, Pass2=12288, clm=4 (8 cpus, 8 workers): 22.64, 22.46, 22.09, 21.96, 22.84, 22.69, 22.05, 21.97 ms. Throughput: 358.21 iter/sec. FFTlen=3840K, Type=3, Arch=4, Pass1=320, Pass2=12288, clm=2 (8 cpus, 1 worker): 2.85 ms. Throughput: 351.45 iter/sec. FFTlen=3840K, Type=3, Arch=4, Pass1=320, Pass2=12288, clm=2 (8 cpus, 8 workers): 22.59, 22.43, 22.09, 21.95, 22.82, 22.58, 21.90, 21.93 ms. Throughput: 359.04 iter/sec. FFTlen=3840K, Type=3, Arch=4, Pass1=320, Pass2=12288, clm=1 (8 cpus, 1 worker): 2.88 ms. Throughput: 347.30 iter/sec. FFTlen=3840K, Type=3, Arch=4, Pass1=320, Pass2=12288, clm=1 (8 cpus, 8 workers): 22.67, 22.43, 22.17, 21.96, 22.84, 22.89, 21.90, 21.80 ms. Throughput: 358.34 iter/sec. FFTlen=3840K, Type=3, Arch=4, Pass1=384, Pass2=10240, clm=4 (8 cpus, 1 worker): 2.72 ms. Throughput: 367.49 iter/sec. FFTlen=3840K, Type=3, Arch=4, Pass1=384, Pass2=10240, clm=4 (8 cpus, 8 workers): 21.56, 21.44, 21.11, 21.06, 21.89, 21.47, 21.00, 20.99 ms. Throughput: 375.42 iter/sec. FFTlen=3840K, Type=3, Arch=4, Pass1=384, Pass2=10240, clm=2 (8 cpus, 1 worker): 2.66 ms. Throughput: 376.12 iter/sec. FFTlen=3840K, Type=3, Arch=4, Pass1=384, Pass2=10240, clm=2 (8 cpus, 8 workers): 21.54, 21.25, 20.99, 20.83, 21.72, 21.37, 20.84, 20.81 ms. Throughput: 378.00 iter/sec. FFTlen=3840K, Type=3, Arch=4, Pass1=384, Pass2=10240, clm=1 (8 cpus, 1 worker): 2.74 ms. Throughput: 365.58 iter/sec. FFTlen=3840K, Type=3, Arch=4, Pass1=384, Pass2=10240, clm=1 (8 cpus, 8 workers): 21.60, 21.35, 21.05, 20.93, 21.89, 21.32, 21.00, 20.92 ms. Throughput: 376.45 iter/sec. FFTlen=3840K, Type=3, Arch=4, Pass1=512, Pass2=7680, clm=4 (8 cpus, 1 worker): 2.82 ms. Throughput: 354.46 iter/sec. [Sat Apr 29 12:19:19 2017] FFTlen=3840K, Type=3, Arch=4, Pass1=512, Pass2=7680, clm=4 (8 cpus, 8 workers): 21.93, 21.78, 21.49, 21.38, 22.54, 22.07, 21.37, 21.43 ms. Throughput: 367.96 iter/sec. FFTlen=3840K, Type=3, Arch=4, Pass1=512, Pass2=7680, clm=2 (8 cpus, 1 worker): 2.63 ms. Throughput: 380.65 iter/sec. FFTlen=3840K, Type=3, Arch=4, Pass1=512, Pass2=7680, clm=2 (8 cpus, 8 workers): 21.06, 20.82, 20.73, 20.54, 21.20, 20.93, 20.68, 20.56 ms. Throughput: 384.36 iter/sec. FFTlen=3840K, Type=3, Arch=4, Pass1=512, Pass2=7680, clm=1 (8 cpus, 1 worker): 2.60 ms. Throughput: 385.12 iter/sec. FFTlen=3840K, Type=3, Arch=4, Pass1=512, Pass2=7680, clm=1 (8 cpus, 8 workers): 21.38, 21.10, 20.98, 20.66, 21.15, 21.09, 20.69, 20.71 ms. Throughput: 381.55 iter/sec. FFTlen=3840K, Type=3, Arch=4, Pass1=640, Pass2=6144, clm=4 (8 cpus, 1 worker): 2.76 ms. Throughput: 362.65 iter/sec. FFTlen=3840K, Type=3, Arch=4, Pass1=640, Pass2=6144, clm=4 (8 cpus, 8 workers): 22.13, 21.98, 21.27, 21.82, 21.82, 21.51, 21.34, 21.78 ms. Throughput: 368.64 iter/sec. FFTlen=3840K, Type=3, Arch=4, Pass1=640, Pass2=6144, clm=2 (8 cpus, 1 worker): 2.55 ms. Throughput: 392.29 iter/sec. FFTlen=3840K, Type=3, Arch=4, Pass1=640, Pass2=6144, clm=2 (8 cpus, 8 workers): 20.69, 20.82, 20.72, 20.58, 20.93, 20.77, 20.65, 20.44 ms. Throughput: 386.48 iter/sec. FFTlen=3840K, Type=3, Arch=4, Pass1=640, Pass2=6144, clm=1 (8 cpus, 1 worker): 2.51 ms. Throughput: 397.98 iter/sec. FFTlen=3840K, Type=3, Arch=4, Pass1=640, Pass2=6144, clm=1 (8 cpus, 8 workers): 20.83, 21.22, 20.93, 20.65, 20.95, 20.62, 20.64, 20.39 ms. Throughput: 385.03 iter/sec. FFTlen=3840K, Type=3, Arch=4, Pass1=768, Pass2=5120, clm=4 (8 cpus, 1 worker): 2.88 ms. Throughput: 347.53 iter/sec. FFTlen=3840K, Type=3, Arch=4, Pass1=768, Pass2=5120, clm=4 (8 cpus, 8 workers): 22.32, 22.15, 21.90, 21.77, 22.50, 22.63, 21.90, 21.84 ms. Throughput: 361.63 iter/sec. FFTlen=3840K, Type=3, Arch=4, Pass1=768, Pass2=5120, clm=2 (8 cpus, 1 worker): 2.58 ms. Throughput: 387.03 iter/sec. FFTlen=3840K, Type=3, Arch=4, Pass1=768, Pass2=5120, clm=2 (8 cpus, 8 workers): 21.11, 21.09, 20.77, 20.59, 21.26, 20.94, 20.75, 20.84 ms. Throughput: 382.46 iter/sec. FFTlen=3840K, Type=3, Arch=4, Pass1=768, Pass2=5120, clm=1 (8 cpus, 1 worker): 2.56 ms. Throughput: 390.00 iter/sec. FFTlen=3840K, Type=3, Arch=4, Pass1=768, Pass2=5120, clm=1 (8 cpus, 8 workers): 21.03, 20.86, 21.14, 20.78, 21.30, 21.30, 20.46, 20.49 ms. Throughput: 382.54 iter/sec. FFTlen=3840K, Type=3, Arch=4, Pass1=1024, Pass2=3840, clm=4 (8 cpus, 1 worker): 3.04 ms. Throughput: 329.24 iter/sec. FFTlen=3840K, Type=3, Arch=4, Pass1=1024, Pass2=3840, clm=4 (8 cpus, 8 workers): 23.11, 22.90, 22.60, 22.58, 23.77, 22.93, 22.50, 22.31 ms. Throughput: 350.45 iter/sec. FFTlen=3840K, Type=3, Arch=4, Pass1=1024, Pass2=3840, clm=2 (8 cpus, 1 worker): 2.69 ms. Throughput: 372.05 iter/sec. FFTlen=3840K, Type=3, Arch=4, Pass1=1024, Pass2=3840, clm=2 (8 cpus, 8 workers): 21.65, 21.41, 21.28, 21.06, 21.85, 21.43, 21.10, 21.03 ms. Throughput: 374.72 iter/sec. FFTlen=3840K, Type=3, Arch=4, Pass1=1024, Pass2=3840, clm=1 (8 cpus, 1 worker): 2.59 ms. Throughput: 385.44 iter/sec. FFTlen=3840K, Type=3, Arch=4, Pass1=1024, Pass2=3840, clm=1 (8 cpus, 8 workers): 21.46, 21.18, 20.97, 21.03, 21.48, 21.34, 21.02, 20.94 ms. Throughput: 377.74 iter/sec. FFTlen=3840K, Type=3, Arch=4, Pass1=1280, Pass2=3072, clm=4 (8 cpus, 1 worker): 2.92 ms. Throughput: 342.72 iter/sec. FFTlen=3840K, Type=3, Arch=4, Pass1=1280, Pass2=3072, clm=4 (8 cpus, 8 workers): 22.50, 22.27, 22.16, 22.19, 22.86, 22.48, 21.68, 22.05 ms. Throughput: 359.24 iter/sec. FFTlen=3840K, Type=3, Arch=4, Pass1=1280, Pass2=3072, clm=2 (8 cpus, 1 worker): 2.69 ms. Throughput: 372.24 iter/sec. FFTlen=3840K, Type=3, Arch=4, Pass1=1280, Pass2=3072, clm=2 (8 cpus, 8 workers): 21.53, 21.33, 21.29, 21.09, 21.66, 21.42, 21.13, 21.04 ms. Throughput: 375.46 iter/sec. FFTlen=3840K, Type=3, Arch=4, Pass1=1280, Pass2=3072, clm=1 (8 cpus, 1 worker): 2.57 ms. Throughput: 388.79 iter/sec. FFTlen=3840K, Type=3, Arch=4, Pass1=1280, Pass2=3072, clm=1 (8 cpus, 8 workers): 21.22, 20.93, 20.84, 20.70, 21.24, 20.95, 20.73, 20.73 ms. Throughput: 382.48 iter/sec. FFTlen=4000K, Type=3, Arch=4, Pass1=320, Pass2=12800, clm=4 (8 cpus, 1 worker): 2.91 ms. Throughput: 343.86 iter/sec. FFTlen=4000K, Type=3, Arch=4, Pass1=320, Pass2=12800, clm=4 (8 cpus, 8 workers): 22.65, 22.22, 22.27, 22.10, 22.84, 22.48, 21.91, 22.05 ms. Throughput: 358.59 iter/sec. FFTlen=4000K, Type=3, Arch=4, Pass1=320, Pass2=12800, clm=2 (8 cpus, 1 worker): 2.86 ms. Throughput: 349.35 iter/sec. FFTlen=4000K, Type=3, Arch=4, Pass1=320, Pass2=12800, clm=2 (8 cpus, 8 workers): 22.61, 22.44, 22.06, 21.92, 22.85, 22.63, 22.06, 21.97 ms. Throughput: 358.57 iter/sec. [Sat Apr 29 12:24:23 2017] FFTlen=4000K, Type=3, Arch=4, Pass1=320, Pass2=12800, clm=1 (8 cpus, 1 worker): 3.13 ms. Throughput: 319.56 iter/sec. FFTlen=4000K, Type=3, Arch=4, Pass1=320, Pass2=12800, clm=1 (8 cpus, 8 workers): 22.60, 22.41, 22.24, 21.99, 22.99, 22.37, 21.91, 21.90 ms. Throughput: 358.81 iter/sec. FFTlen=4000K, Type=3, Arch=4, Pass1=640, Pass2=6400, clm=4 (8 cpus, 1 worker): 3.04 ms. Throughput: 329.29 iter/sec. FFTlen=4000K, Type=3, Arch=4, Pass1=640, Pass2=6400, clm=4 (8 cpus, 8 workers): 23.78, 23.58, 23.26, 23.17, 24.06, 23.72, 23.05, 22.98 ms. Throughput: 341.23 iter/sec. FFTlen=4000K, Type=3, Arch=4, Pass1=640, Pass2=6400, clm=2 (8 cpus, 1 worker): 2.81 ms. Throughput: 355.31 iter/sec. FFTlen=4000K, Type=3, Arch=4, Pass1=640, Pass2=6400, clm=2 (8 cpus, 8 workers): 22.67, 22.56, 22.35, 21.95, 22.98, 22.72, 21.99, 21.91 ms. Throughput: 357.38 iter/sec. FFTlen=4000K, Type=3, Arch=4, Pass1=640, Pass2=6400, clm=1 (8 cpus, 1 worker): 2.78 ms. Throughput: 359.65 iter/sec. FFTlen=4000K, Type=3, Arch=4, Pass1=640, Pass2=6400, clm=1 (8 cpus, 8 workers): 22.86, 22.59, 22.29, 22.14, 23.23, 22.90, 22.25, 22.11 ms. Throughput: 354.92 iter/sec. FFTlen=4032K, Type=3, Arch=4, Pass1=448, Pass2=9216, clm=4 (8 cpus, 1 worker): 2.94 ms. Throughput: 340.59 iter/sec. FFTlen=4032K, Type=3, Arch=4, Pass1=448, Pass2=9216, clm=4 (8 cpus, 8 workers): 23.51, 23.24, 23.02, 22.84, 23.75, 23.11, 22.80, 22.84 ms. Throughput: 345.80 iter/sec. FFTlen=4032K, Type=3, Arch=4, Pass1=448, Pass2=9216, clm=2 (8 cpus, 1 worker): 2.86 ms. Throughput: 349.79 iter/sec. FFTlen=4032K, Type=3, Arch=4, Pass1=448, Pass2=9216, clm=2 (8 cpus, 8 workers): 22.98, 22.97, 22.65, 22.55, 22.99, 23.03, 22.52, 22.52 ms. Throughput: 351.27 iter/sec. FFTlen=4032K, Type=3, Arch=4, Pass1=448, Pass2=9216, clm=1 (8 cpus, 1 worker): 2.86 ms. Throughput: 349.65 iter/sec. FFTlen=4032K, Type=3, Arch=4, Pass1=448, Pass2=9216, clm=1 (8 cpus, 8 workers): 23.09, 22.90, 22.72, 22.66, 23.16, 22.99, 22.66, 22.61 ms. Throughput: 350.13 iter/sec. FFTlen=4032K, Type=3, Arch=4, Pass1=896, Pass2=4608, clm=4 (8 cpus, 1 worker): 3.10 ms. Throughput: 322.38 iter/sec. FFTlen=4032K, Type=3, Arch=4, Pass1=896, Pass2=4608, clm=4 (8 cpus, 8 workers): 23.95, 23.50, 23.33, 23.25, 23.83, 23.55, 23.09, 23.16 ms. Throughput: 341.11 iter/sec. FFTlen=4032K, Type=3, Arch=4, Pass1=896, Pass2=4608, clm=2 (8 cpus, 1 worker): 2.80 ms. Throughput: 357.09 iter/sec. FFTlen=4032K, Type=3, Arch=4, Pass1=896, Pass2=4608, clm=2 (8 cpus, 8 workers): 22.41, 22.33, 21.86, 22.16, 22.30, 22.29, 22.18, 21.91 ms. Throughput: 360.69 iter/sec. FFTlen=4032K, Type=3, Arch=4, Pass1=896, Pass2=4608, clm=1 (8 cpus, 1 worker): 2.75 ms. Throughput: 363.52 iter/sec. FFTlen=4032K, Type=3, Arch=4, Pass1=896, Pass2=4608, clm=1 (8 cpus, 8 workers): 22.56, 22.43, 21.75, 21.82, 22.24, 22.43, 21.84, 21.78 ms. Throughput: 361.98 iter/sec. FFTlen=4032K, Type=3, Arch=4, Pass1=1792, Pass2=2304, clm=4 (8 cpus, 1 worker): 3.44 ms. Throughput: 290.53 iter/sec. FFTlen=4032K, Type=3, Arch=4, Pass1=1792, Pass2=2304, clm=4 (8 cpus, 8 workers): 25.73, 25.38, 25.12, 25.01, 25.81, 25.45, 24.92, 25.08 ms. Throughput: 316.08 iter/sec. FFTlen=4032K, Type=3, Arch=4, Pass1=1792, Pass2=2304, clm=2 (8 cpus, 1 worker): 3.12 ms. Throughput: 320.93 iter/sec. FFTlen=4032K, Type=3, Arch=4, Pass1=1792, Pass2=2304, clm=2 (8 cpus, 8 workers): 24.25, 24.08, 23.73, 23.62, 24.09, 25.16, 23.56, 23.94 ms. Throughput: 332.72 iter/sec. FFTlen=4032K, Type=3, Arch=4, Pass1=1792, Pass2=2304, clm=1 (8 cpus, 1 worker): 2.97 ms. Throughput: 336.40 iter/sec. FFTlen=4032K, Type=3, Arch=4, Pass1=1792, Pass2=2304, clm=1 (8 cpus, 8 workers): 23.14, 22.98, 22.84, 22.55, 23.37, 23.10, 22.67, 22.67 ms. Throughput: 349.15 iter/sec. FFTlen=4096K, Type=3, Arch=4, Pass1=256, Pass2=16384, clm=4 (8 cpus, 1 worker): 3.26 ms. Throughput: 306.61 iter/sec. FFTlen=4096K, Type=3, Arch=4, Pass1=256, Pass2=16384, clm=4 (8 cpus, 8 workers): 25.21, 24.90, 24.63, 24.51, 25.37, 25.13, 24.54, 24.58 ms. Throughput: 321.87 iter/sec. FFTlen=4096K, Type=3, Arch=4, Pass1=256, Pass2=16384, clm=2 (8 cpus, 1 worker): 3.18 ms. Throughput: 314.78 iter/sec. FFTlen=4096K, Type=3, Arch=4, Pass1=256, Pass2=16384, clm=2 (8 cpus, 8 workers): 24.69, 24.37, 23.97, 23.86, 24.74, 24.56, 23.92, 23.92 ms. Throughput: 329.92 iter/sec. FFTlen=4096K, Type=3, Arch=4, Pass1=256, Pass2=16384, clm=1 (8 cpus, 1 worker): 3.20 ms. Throughput: 312.07 iter/sec. FFTlen=4096K, Type=3, Arch=4, Pass1=256, Pass2=16384, clm=1 (8 cpus, 8 workers): 24.62, 24.36, 24.00, 23.82, 24.90, 24.56, 23.87, 23.85 ms. Throughput: 330.01 iter/sec. FFTlen=4096K, Type=3, Arch=4, Pass1=512, Pass2=8192, clm=4 (8 cpus, 1 worker): 3.07 ms. Throughput: 325.44 iter/sec. [Sat Apr 29 12:29:29 2017] FFTlen=4096K, Type=3, Arch=4, Pass1=512, Pass2=8192, clm=4 (8 cpus, 8 workers): 23.73, 23.47, 23.23, 23.32, 23.93, 23.68, 23.21, 23.18 ms. Throughput: 340.94 iter/sec. FFTlen=4096K, Type=3, Arch=4, Pass1=512, Pass2=8192, clm=2 (8 cpus, 1 worker): 2.86 ms. Throughput: 349.49 iter/sec. FFTlen=4096K, Type=3, Arch=4, Pass1=512, Pass2=8192, clm=2 (8 cpus, 8 workers): 22.99, 23.40, 22.46, 22.34, 23.17, 22.78, 22.45, 22.36 ms. Throughput: 351.86 iter/sec. FFTlen=4096K, Type=3, Arch=4, Pass1=512, Pass2=8192, clm=1 (8 cpus, 1 worker): 2.83 ms. Throughput: 353.30 iter/sec. FFTlen=4096K, Type=3, Arch=4, Pass1=512, Pass2=8192, clm=1 (8 cpus, 8 workers): 22.96, 22.92, 22.64, 22.45, 23.31, 22.81, 22.47, 22.85 ms. Throughput: 350.91 iter/sec. FFTlen=4096K, Type=3, Arch=4, Pass1=1024, Pass2=4096, clm=4 (8 cpus, 1 worker): 3.15 ms. Throughput: 317.49 iter/sec. FFTlen=4096K, Type=3, Arch=4, Pass1=1024, Pass2=4096, clm=4 (8 cpus, 8 workers): 24.37, 24.19, 23.83, 24.07, 24.78, 24.21, 23.68, 23.72 ms. Throughput: 331.94 iter/sec. FFTlen=4096K, Type=3, Arch=4, Pass1=1024, Pass2=4096, clm=2 (8 cpus, 1 worker): 2.85 ms. Throughput: 350.85 iter/sec. FFTlen=4096K, Type=3, Arch=4, Pass1=1024, Pass2=4096, clm=2 (8 cpus, 8 workers): 22.93, 22.78, 22.46, 22.26, 23.18, 22.85, 22.17, 22.21 ms. Throughput: 353.99 iter/sec. FFTlen=4096K, Type=3, Arch=4, Pass1=1024, Pass2=4096, clm=1 (8 cpus, 1 worker): 2.76 ms. Throughput: 362.83 iter/sec. FFTlen=4096K, Type=3, Arch=4, Pass1=1024, Pass2=4096, clm=1 (8 cpus, 8 workers): 22.47, 22.29, 22.02, 21.82, 22.73, 23.13, 21.78, 22.36 ms. Throughput: 358.46 iter/sec. FFTlen=4096K, Type=3, Arch=4, Pass1=2048, Pass2=2048, clm=4 (8 cpus, 1 worker): 3.51 ms. Throughput: 285.04 iter/sec. FFTlen=4096K, Type=3, Arch=4, Pass1=2048, Pass2=2048, clm=4 (8 cpus, 8 workers): 25.71, 25.75, 25.56, 24.97, 25.95, 25.72, 24.94, 25.11 ms. Throughput: 314.23 iter/sec. FFTlen=4096K, Type=3, Arch=4, Pass1=2048, Pass2=2048, clm=2 (8 cpus, 1 worker): 3.25 ms. Throughput: 308.05 iter/sec. FFTlen=4096K, Type=3, Arch=4, Pass1=2048, Pass2=2048, clm=2 (8 cpus, 8 workers): 24.73, 24.45, 24.29, 23.81, 24.73, 24.52, 24.08, 23.98 ms. Throughput: 328.96 iter/sec. FFTlen=4096K, Type=3, Arch=4, Pass1=2048, Pass2=2048, clm=1 (8 cpus, 1 worker): 3.02 ms. Throughput: 330.99 iter/sec. FFTlen=4096K, Type=3, Arch=4, Pass1=2048, Pass2=2048, clm=1 (8 cpus, 8 workers): 23.60, 23.34, 23.24, 23.02, 23.78, 23.46, 22.75, 23.13 ms. Throughput: 343.56 iter/sec. FFTlen=4480K, Type=3, Arch=4, Pass1=448, Pass2=10240, clm=4 (8 cpus, 1 worker): 3.24 ms. Throughput: 308.85 iter/sec. FFTlen=4480K, Type=3, Arch=4, Pass1=448, Pass2=10240, clm=4 (8 cpus, 8 workers): 25.73, 25.22, 25.05, 24.91, 25.85, 25.44, 24.86, 24.75 ms. Throughput: 317.20 iter/sec. FFTlen=4480K, Type=3, Arch=4, Pass1=448, Pass2=10240, clm=2 (8 cpus, 1 worker): 3.17 ms. Throughput: 315.61 iter/sec. FFTlen=4480K, Type=3, Arch=4, Pass1=448, Pass2=10240, clm=2 (8 cpus, 8 workers): 25.20, 24.99, 24.71, 24.47, 25.60, 25.22, 24.31, 24.42 ms. Throughput: 321.84 iter/sec. FFTlen=4480K, Type=3, Arch=4, Pass1=448, Pass2=10240, clm=1 (8 cpus, 1 worker): 3.19 ms. Throughput: 313.53 iter/sec. FFTlen=4480K, Type=3, Arch=4, Pass1=448, Pass2=10240, clm=1 (8 cpus, 8 workers): 25.32, 25.14, 24.75, 24.67, 25.65, 25.28, 24.64, 24.58 ms. Throughput: 320.02 iter/sec. FFTlen=4480K, Type=3, Arch=4, Pass1=896, Pass2=5120, clm=4 (8 cpus, 1 worker): 3.44 ms. Throughput: 291.03 iter/sec. FFTlen=4480K, Type=3, Arch=4, Pass1=896, Pass2=5120, clm=4 (8 cpus, 8 workers): 26.68, 26.39, 26.33, 25.81, 27.60, 26.69, 26.14, 25.84 ms. Throughput: 302.75 iter/sec. FFTlen=4480K, Type=3, Arch=4, Pass1=896, Pass2=5120, clm=2 (8 cpus, 1 worker): 3.13 ms. Throughput: 319.45 iter/sec. FFTlen=4480K, Type=3, Arch=4, Pass1=896, Pass2=5120, clm=2 (8 cpus, 8 workers): 25.26, 24.89, 24.67, 24.64, 25.14, 25.08, 24.60, 24.54 ms. Throughput: 321.92 iter/sec. FFTlen=4480K, Type=3, Arch=4, Pass1=896, Pass2=5120, clm=1 (8 cpus, 1 worker): 3.09 ms. Throughput: 323.94 iter/sec. FFTlen=4480K, Type=3, Arch=4, Pass1=896, Pass2=5120, clm=1 (8 cpus, 8 workers): 25.48, 25.01, 24.52, 24.50, 25.72, 24.99, 24.67, 24.49 ms. Throughput: 321.10 iter/sec. FFTlen=4608K, Type=3, Arch=4, Pass1=384, Pass2=12288, clm=4 (8 cpus, 1 worker): 3.52 ms. Throughput: 284.42 iter/sec. FFTlen=4608K, Type=3, Arch=4, Pass1=384, Pass2=12288, clm=4 (8 cpus, 8 workers): 27.28, 26.94, 26.58, 26.49, 27.73, 27.21, 26.47, 26.51 ms. Throughput: 297.45 iter/sec. FFTlen=4608K, Type=3, Arch=4, Pass1=384, Pass2=12288, clm=2 (8 cpus, 1 worker): 3.47 ms. Throughput: 288.12 iter/sec. FFTlen=4608K, Type=3, Arch=4, Pass1=384, Pass2=12288, clm=2 (8 cpus, 8 workers): 28.33, 27.32, 28.01, 26.63, 27.23, 26.92, 26.26, 26.21 ms. Throughput: 295.26 iter/sec. [Sat Apr 29 12:34:37 2017] FFTlen=4608K, Type=3, Arch=4, Pass1=384, Pass2=12288, clm=1 (8 cpus, 1 worker): 3.46 ms. Throughput: 289.22 iter/sec. FFTlen=4608K, Type=3, Arch=4, Pass1=384, Pass2=12288, clm=1 (8 cpus, 8 workers): 26.98, 26.70, 26.44, 26.24, 27.14, 26.78, 26.29, 26.33 ms. Throughput: 300.63 iter/sec. FFTlen=4608K, Type=3, Arch=4, Pass1=512, Pass2=9216, clm=4 (8 cpus, 1 worker): 3.56 ms. Throughput: 281.21 iter/sec. FFTlen=4608K, Type=3, Arch=4, Pass1=512, Pass2=9216, clm=4 (8 cpus, 8 workers): 27.40, 27.77, 27.17, 27.79, 27.84, 27.97, 26.95, 28.51 ms. Throughput: 289.15 iter/sec. FFTlen=4608K, Type=3, Arch=4, Pass1=512, Pass2=9216, clm=2 (8 cpus, 1 worker): 3.39 ms. Throughput: 294.83 iter/sec. FFTlen=4608K, Type=3, Arch=4, Pass1=512, Pass2=9216, clm=2 (8 cpus, 8 workers): 26.87, 26.60, 26.34, 26.11, 27.34, 26.89, 26.39, 26.17 ms. Throughput: 300.93 iter/sec. FFTlen=4608K, Type=3, Arch=4, Pass1=512, Pass2=9216, clm=1 (8 cpus, 1 worker): 3.35 ms. Throughput: 298.95 iter/sec. FFTlen=4608K, Type=3, Arch=4, Pass1=512, Pass2=9216, clm=1 (8 cpus, 8 workers): 26.74, 26.56, 26.26, 25.99, 26.98, 26.90, 26.01, 25.66 ms. Throughput: 303.28 iter/sec. FFTlen=4608K, Type=3, Arch=4, Pass1=768, Pass2=6144, clm=4 (8 cpus, 1 worker): 3.44 ms. Throughput: 290.61 iter/sec. FFTlen=4608K, Type=3, Arch=4, Pass1=768, Pass2=6144, clm=4 (8 cpus, 8 workers): 26.64, 26.36, 25.98, 25.79, 27.09, 26.66, 25.88, 25.84 ms. Throughput: 304.51 iter/sec. FFTlen=4608K, Type=3, Arch=4, Pass1=768, Pass2=6144, clm=2 (8 cpus, 1 worker): 3.14 ms. Throughput: 318.87 iter/sec. FFTlen=4608K, Type=3, Arch=4, Pass1=768, Pass2=6144, clm=2 (8 cpus, 8 workers): 25.29, 25.08, 24.75, 24.59, 25.15, 25.35, 24.63, 24.59 ms. Throughput: 320.96 iter/sec. FFTlen=4608K, Type=3, Arch=4, Pass1=768, Pass2=6144, clm=1 (8 cpus, 1 worker): 3.10 ms. Throughput: 322.90 iter/sec. FFTlen=4608K, Type=3, Arch=4, Pass1=768, Pass2=6144, clm=1 (8 cpus, 8 workers): 25.22, 25.12, 24.72, 24.59, 25.20, 24.77, 24.51, 24.58 ms. Throughput: 322.13 iter/sec. FFTlen=4608K, Type=3, Arch=4, Pass1=1024, Pass2=4608, clm=4 (8 cpus, 1 worker): 3.64 ms. Throughput: 274.51 iter/sec. FFTlen=4608K, Type=3, Arch=4, Pass1=1024, Pass2=4608, clm=4 (8 cpus, 8 workers): 27.49, 27.13, 27.23, 27.21, 27.82, 27.66, 26.81, 27.86 ms. Throughput: 292.01 iter/sec. FFTlen=4608K, Type=3, Arch=4, Pass1=1024, Pass2=4608, clm=2 (8 cpus, 1 worker): 3.28 ms. Throughput: 305.14 iter/sec. FFTlen=4608K, Type=3, Arch=4, Pass1=1024, Pass2=4608, clm=2 (8 cpus, 8 workers): 25.81, 25.65, 25.35, 25.25, 26.42, 25.75, 25.16, 25.13 ms. Throughput: 313.00 iter/sec. FFTlen=4608K, Type=3, Arch=4, Pass1=1024, Pass2=4608, clm=1 (8 cpus, 1 worker): 3.20 ms. Throughput: 312.32 iter/sec. FFTlen=4608K, Type=3, Arch=4, Pass1=1024, Pass2=4608, clm=1 (8 cpus, 8 workers): 25.69, 25.50, 25.24, 25.05, 26.17, 25.74, 24.85, 25.20 ms. Throughput: 314.66 iter/sec. FFTlen=4608K, Type=3, Arch=4, Pass1=1536, Pass2=3072, clm=4 (8 cpus, 1 worker): 3.72 ms. Throughput: 269.17 iter/sec. FFTlen=4608K, Type=3, Arch=4, Pass1=1536, Pass2=3072, clm=4 (8 cpus, 8 workers): 28.36, 28.11, 27.77, 27.49, 28.86, 28.29, 27.67, 27.68 ms. Throughput: 285.49 iter/sec. FFTlen=4608K, Type=3, Arch=4, Pass1=1536, Pass2=3072, clm=2 (8 cpus, 1 worker): 3.44 ms. Throughput: 290.52 iter/sec. FFTlen=4608K, Type=3, Arch=4, Pass1=1536, Pass2=3072, clm=2 (8 cpus, 8 workers): 26.90, 26.69, 26.51, 26.11, 27.12, 26.87, 26.25, 26.20 ms. Throughput: 301.00 iter/sec. FFTlen=4608K, Type=3, Arch=4, Pass1=1536, Pass2=3072, clm=1 (8 cpus, 1 worker): 3.29 ms. Throughput: 303.52 iter/sec. FFTlen=4608K, Type=3, Arch=4, Pass1=1536, Pass2=3072, clm=1 (8 cpus, 8 workers): 26.59, 26.14, 25.92, 25.62, 26.81, 26.64, 25.68, 25.54 ms. Throughput: 306.39 iter/sec. FFTlen=4608K, Type=3, Arch=4, Pass1=2048, Pass2=2304, clm=4 (8 cpus, 1 worker): 4.14 ms. Throughput: 241.46 iter/sec. FFTlen=4608K, Type=3, Arch=4, Pass1=2048, Pass2=2304, clm=4 (8 cpus, 8 workers): 30.71, 30.33, 29.69, 29.68, 30.89, 30.50, 29.54, 29.81 ms. Throughput: 265.46 iter/sec. FFTlen=4608K, Type=3, Arch=4, Pass1=2048, Pass2=2304, clm=2 (8 cpus, 1 worker): 3.92 ms. Throughput: 255.20 iter/sec. FFTlen=4608K, Type=3, Arch=4, Pass1=2048, Pass2=2304, clm=2 (8 cpus, 8 workers): 29.48, 29.28, 28.84, 28.56, 29.96, 29.48, 28.45, 28.67 ms. Throughput: 275.09 iter/sec. FFTlen=4608K, Type=3, Arch=4, Pass1=2048, Pass2=2304, clm=1 (8 cpus, 1 worker): 3.72 ms. Throughput: 268.77 iter/sec. FFTlen=4608K, Type=3, Arch=4, Pass1=2048, Pass2=2304, clm=1 (8 cpus, 8 workers): 28.58, 28.64, 28.21, 28.02, 28.97, 28.82, 27.94, 28.11 ms. Throughput: 281.63 iter/sec. FFTlen=4800K, Type=3, Arch=4, Pass1=320, Pass2=15360, clm=4 (8 cpus, 1 worker): 3.69 ms. Throughput: 271.10 iter/sec. [Sat Apr 29 12:39:46 2017] FFTlen=4800K, Type=3, Arch=4, Pass1=320, Pass2=15360, clm=4 (8 cpus, 8 workers): 28.09, 28.12, 27.80, 27.57, 28.61, 28.48, 27.88, 27.81 ms. Throughput: 285.30 iter/sec. FFTlen=4800K, Type=3, Arch=4, Pass1=320, Pass2=15360, clm=2 (8 cpus, 1 worker): 3.63 ms. Throughput: 275.46 iter/sec. FFTlen=4800K, Type=3, Arch=4, Pass1=320, Pass2=15360, clm=2 (8 cpus, 8 workers): 28.26, 28.00, 27.76, 27.53, 28.48, 28.35, 27.55, 27.50 ms. Throughput: 286.51 iter/sec. FFTlen=4800K, Type=3, Arch=4, Pass1=320, Pass2=15360, clm=1 (8 cpus, 1 worker): 3.63 ms. Throughput: 275.19 iter/sec. FFTlen=4800K, Type=3, Arch=4, Pass1=320, Pass2=15360, clm=1 (8 cpus, 8 workers): 28.20, 27.96, 27.78, 27.57, 28.64, 28.00, 27.48, 27.24 ms. Throughput: 287.23 iter/sec. FFTlen=4800K, Type=3, Arch=4, Pass1=384, Pass2=12800, clm=4 (8 cpus, 1 worker): 3.51 ms. Throughput: 285.19 iter/sec. FFTlen=4800K, Type=3, Arch=4, Pass1=384, Pass2=12800, clm=4 (8 cpus, 8 workers): 27.12, 27.02, 26.53, 26.42, 27.39, 27.09, 26.47, 26.56 ms. Throughput: 298.29 iter/sec. FFTlen=4800K, Type=3, Arch=4, Pass1=384, Pass2=12800, clm=2 (8 cpus, 1 worker): 3.49 ms. Throughput: 286.40 iter/sec. FFTlen=4800K, Type=3, Arch=4, Pass1=384, Pass2=12800, clm=2 (8 cpus, 8 workers): 27.05, 26.80, 26.51, 26.20, 27.47, 27.05, 26.14, 26.24 ms. Throughput: 299.89 iter/sec. FFTlen=4800K, Type=3, Arch=4, Pass1=384, Pass2=12800, clm=1 (8 cpus, 1 worker): 3.49 ms. Throughput: 286.29 iter/sec. FFTlen=4800K, Type=3, Arch=4, Pass1=384, Pass2=12800, clm=1 (8 cpus, 8 workers): 27.17, 26.83, 26.60, 26.29, 27.42, 27.06, 26.12, 26.33 ms. Throughput: 299.39 iter/sec. FFTlen=4800K, Type=3, Arch=4, Pass1=640, Pass2=7680, clm=4 (8 cpus, 1 worker): 3.58 ms. Throughput: 279.63 iter/sec. FFTlen=4800K, Type=3, Arch=4, Pass1=640, Pass2=7680, clm=4 (8 cpus, 8 workers): 27.97, 27.69, 27.25, 27.19, 28.18, 27.65, 27.29, 27.25 ms. Throughput: 290.32 iter/sec. FFTlen=4800K, Type=3, Arch=4, Pass1=640, Pass2=7680, clm=2 (8 cpus, 1 worker): 3.34 ms. Throughput: 299.79 iter/sec. FFTlen=4800K, Type=3, Arch=4, Pass1=640, Pass2=7680, clm=2 (8 cpus, 8 workers): 26.66, 26.40, 26.09, 25.70, 26.90, 26.30, 25.82, 25.85 ms. Throughput: 305.24 iter/sec. FFTlen=4800K, Type=3, Arch=4, Pass1=640, Pass2=7680, clm=1 (8 cpus, 1 worker): 3.34 ms. Throughput: 299.41 iter/sec. FFTlen=4800K, Type=3, Arch=4, Pass1=640, Pass2=7680, clm=1 (8 cpus, 8 workers): 26.59, 26.37, 26.36, 25.93, 27.30, 26.66, 25.78, 25.97 ms. Throughput: 303.45 iter/sec. FFTlen=4800K, Type=3, Arch=4, Pass1=768, Pass2=6400, clm=4 (8 cpus, 1 worker): 3.75 ms. Throughput: 266.94 iter/sec. FFTlen=4800K, Type=3, Arch=4, Pass1=768, Pass2=6400, clm=4 (8 cpus, 8 workers): 28.87, 28.66, 28.44, 28.13, 29.29, 28.82, 28.08, 27.98 ms. Throughput: 280.43 iter/sec. FFTlen=4800K, Type=3, Arch=4, Pass1=768, Pass2=6400, clm=2 (8 cpus, 1 worker): 3.46 ms. Throughput: 288.79 iter/sec. FFTlen=4800K, Type=3, Arch=4, Pass1=768, Pass2=6400, clm=2 (8 cpus, 8 workers): 27.25, 27.06, 26.92, 26.57, 27.81, 27.31, 26.50, 26.51 ms. Throughput: 296.47 iter/sec. FFTlen=4800K, Type=3, Arch=4, Pass1=768, Pass2=6400, clm=1 (8 cpus, 1 worker): 3.37 ms. Throughput: 296.59 iter/sec. FFTlen=4800K, Type=3, Arch=4, Pass1=768, Pass2=6400, clm=1 (8 cpus, 8 workers): 27.16, 27.05, 26.81, 26.62, 27.28, 27.33, 26.83, 26.53 ms. Throughput: 296.87 iter/sec. FFTlen=4800K, Type=3, Arch=4, Pass1=1280, Pass2=3840, clm=4 (8 cpus, 1 worker): 3.78 ms. Throughput: 264.46 iter/sec. FFTlen=4800K, Type=3, Arch=4, Pass1=1280, Pass2=3840, clm=4 (8 cpus, 8 workers): 28.52, 28.36, 28.30, 28.03, 29.08, 28.73, 28.14, 28.27 ms. Throughput: 281.45 iter/sec. FFTlen=4800K, Type=3, Arch=4, Pass1=1280, Pass2=3840, clm=2 (8 cpus, 1 worker): 3.51 ms. Throughput: 284.75 iter/sec. FFTlen=4800K, Type=3, Arch=4, Pass1=1280, Pass2=3840, clm=2 (8 cpus, 8 workers): 27.72, 27.21, 27.09, 26.64, 27.79, 27.57, 26.99, 26.68 ms. Throughput: 294.06 iter/sec. FFTlen=4800K, Type=3, Arch=4, Pass1=1280, Pass2=3840, clm=1 (8 cpus, 1 worker): 3.34 ms. Throughput: 299.08 iter/sec. FFTlen=4800K, Type=3, Arch=4, Pass1=1280, Pass2=3840, clm=1 (8 cpus, 8 workers): 27.05, 26.73, 26.42, 26.30, 27.50, 26.93, 26.34, 26.20 ms. Throughput: 299.88 iter/sec. FFTlen=5120K, Type=3, Arch=4, Pass1=256, Pass2=20480, clm=4 (8 cpus, 1 worker): 4.21 ms. Throughput: 237.47 iter/sec. FFTlen=5120K, Type=3, Arch=4, Pass1=256, Pass2=20480, clm=4 (8 cpus, 8 workers): 32.32, 31.53, 31.94, 31.24, 32.13, 31.90, 31.18, 31.22 ms. Throughput: 252.56 iter/sec. FFTlen=5120K, Type=3, Arch=4, Pass1=256, Pass2=20480, clm=2 (8 cpus, 1 worker): 4.18 ms. Throughput: 239.03 iter/sec. [Sat Apr 29 12:44:53 2017] FFTlen=5120K, Type=3, Arch=4, Pass1=256, Pass2=20480, clm=2 (8 cpus, 8 workers): 31.46, 31.16, 30.72, 30.50, 31.79, 31.22, 30.51, 30.41 ms. Throughput: 258.36 iter/sec. FFTlen=5120K, Type=3, Arch=4, Pass1=256, Pass2=20480, clm=1 (8 cpus, 1 worker): 4.19 ms. Throughput: 238.95 iter/sec. FFTlen=5120K, Type=3, Arch=4, Pass1=256, Pass2=20480, clm=1 (8 cpus, 8 workers): 31.20, 30.92, 30.58, 30.22, 31.49, 31.24, 30.54, 30.17 ms. Throughput: 259.84 iter/sec. FFTlen=5120K, Type=3, Arch=4, Pass1=320, Pass2=16384, clm=4 (8 cpus, 1 worker): 4.11 ms. Throughput: 243.52 iter/sec. FFTlen=5120K, Type=3, Arch=4, Pass1=320, Pass2=16384, clm=4 (8 cpus, 8 workers): 31.33, 30.78, 30.63, 30.29, 31.54, 31.21, 30.20, 30.37 ms. Throughput: 259.87 iter/sec. FFTlen=5120K, Type=3, Arch=4, Pass1=320, Pass2=16384, clm=2 (8 cpus, 1 worker): 4.05 ms. Throughput: 246.77 iter/sec. FFTlen=5120K, Type=3, Arch=4, Pass1=320, Pass2=16384, clm=2 (8 cpus, 8 workers): 30.88, 30.61, 30.35, 30.27, 31.07, 30.90, 30.14, 30.23 ms. Throughput: 261.86 iter/sec. FFTlen=5120K, Type=3, Arch=4, Pass1=320, Pass2=16384, clm=1 (8 cpus, 1 worker): 4.04 ms. Throughput: 247.80 iter/sec. FFTlen=5120K, Type=3, Arch=4, Pass1=320, Pass2=16384, clm=1 (8 cpus, 8 workers): 31.03, 30.70, 30.51, 30.27, 31.50, 30.75, 30.28, 30.28 ms. Throughput: 260.93 iter/sec. FFTlen=5120K, Type=3, Arch=4, Pass1=512, Pass2=10240, clm=4 (8 cpus, 1 worker): 3.96 ms. Throughput: 252.70 iter/sec. FFTlen=5120K, Type=3, Arch=4, Pass1=512, Pass2=10240, clm=4 (8 cpus, 8 workers): 30.30, 30.05, 29.69, 29.42, 30.59, 30.30, 29.48, 29.26 ms. Throughput: 267.74 iter/sec. FFTlen=5120K, Type=3, Arch=4, Pass1=512, Pass2=10240, clm=2 (8 cpus, 1 worker): 3.70 ms. Throughput: 270.37 iter/sec. FFTlen=5120K, Type=3, Arch=4, Pass1=512, Pass2=10240, clm=2 (8 cpus, 8 workers): 29.02, 28.67, 28.52, 28.32, 29.22, 28.98, 28.37, 28.33 ms. Throughput: 278.99 iter/sec. FFTlen=5120K, Type=3, Arch=4, Pass1=512, Pass2=10240, clm=1 (8 cpus, 1 worker): 3.67 ms. Throughput: 272.55 iter/sec. FFTlen=5120K, Type=3, Arch=4, Pass1=512, Pass2=10240, clm=1 (8 cpus, 8 workers): 29.29, 29.15, 28.64, 28.35, 29.52, 29.30, 28.45, 28.44 ms. Throughput: 276.95 iter/sec. FFTlen=5120K, Type=3, Arch=4, Pass1=640, Pass2=8192, clm=4 (8 cpus, 1 worker): 3.93 ms. Throughput: 254.53 iter/sec. FFTlen=5120K, Type=3, Arch=4, Pass1=640, Pass2=8192, clm=4 (8 cpus, 8 workers): 30.14, 29.86, 29.44, 29.34, 30.54, 29.99, 29.30, 29.25 ms. Throughput: 269.14 iter/sec. FFTlen=5120K, Type=3, Arch=4, Pass1=640, Pass2=8192, clm=2 (8 cpus, 1 worker): 3.60 ms. Throughput: 278.05 iter/sec. FFTlen=5120K, Type=3, Arch=4, Pass1=640, Pass2=8192, clm=2 (8 cpus, 8 workers): 28.56, 28.32, 28.16, 27.88, 28.86, 28.50, 28.03, 27.98 ms. Throughput: 282.87 iter/sec. FFTlen=5120K, Type=3, Arch=4, Pass1=640, Pass2=8192, clm=1 (8 cpus, 1 worker): 3.59 ms. Throughput: 278.26 iter/sec. FFTlen=5120K, Type=3, Arch=4, Pass1=640, Pass2=8192, clm=1 (8 cpus, 8 workers): 28.75, 28.49, 28.00, 28.85, 28.98, 28.79, 27.75, 29.10 ms. Throughput: 279.91 iter/sec. FFTlen=5120K, Type=3, Arch=4, Pass1=1024, Pass2=5120, clm=4 (8 cpus, 1 worker): 4.07 ms. Throughput: 245.48 iter/sec. FFTlen=5120K, Type=3, Arch=4, Pass1=1024, Pass2=5120, clm=4 (8 cpus, 8 workers): 31.21, 30.80, 30.46, 30.49, 31.48, 30.95, 30.48, 30.41 ms. Throughput: 259.91 iter/sec. FFTlen=5120K, Type=3, Arch=4, Pass1=1024, Pass2=5120, clm=2 (8 cpus, 1 worker): 3.69 ms. Throughput: 270.80 iter/sec. FFTlen=5120K, Type=3, Arch=4, Pass1=1024, Pass2=5120, clm=2 (8 cpus, 8 workers): 29.27, 28.89, 28.47, 28.25, 29.57, 29.16, 28.34, 28.22 ms. Throughput: 278.14 iter/sec. FFTlen=5120K, Type=3, Arch=4, Pass1=1024, Pass2=5120, clm=1 (8 cpus, 1 worker): 3.63 ms. Throughput: 275.81 iter/sec. FFTlen=5120K, Type=3, Arch=4, Pass1=1024, Pass2=5120, clm=1 (8 cpus, 8 workers): 28.82, 28.66, 28.08, 28.00, 29.04, 28.70, 28.04, 28.04 ms. Throughput: 281.53 iter/sec. FFTlen=5120K, Type=3, Arch=4, Pass1=1280, Pass2=4096, clm=4 (8 cpus, 1 worker): 3.97 ms. Throughput: 251.99 iter/sec. FFTlen=5120K, Type=3, Arch=4, Pass1=1280, Pass2=4096, clm=4 (8 cpus, 8 workers): 30.33, 29.84, 29.71, 29.49, 30.39, 29.99, 29.70, 29.59 ms. Throughput: 267.76 iter/sec. FFTlen=5120K, Type=3, Arch=4, Pass1=1280, Pass2=4096, clm=2 (8 cpus, 1 worker): 3.66 ms. Throughput: 273.31 iter/sec. FFTlen=5120K, Type=3, Arch=4, Pass1=1280, Pass2=4096, clm=2 (8 cpus, 8 workers): 28.76, 28.51, 28.36, 27.99, 29.47, 28.88, 28.23, 28.07 ms. Throughput: 280.45 iter/sec. FFTlen=5120K, Type=3, Arch=4, Pass1=1280, Pass2=4096, clm=1 (8 cpus, 1 worker): 3.56 ms. Throughput: 281.09 iter/sec. [Sat Apr 29 12:50:00 2017] FFTlen=5120K, Type=3, Arch=4, Pass1=1280, Pass2=4096, clm=1 (8 cpus, 8 workers): 28.48, 28.20, 28.06, 27.67, 28.63, 28.72, 27.69, 27.69 ms. Throughput: 284.32 iter/sec. FFTlen=5376K, Type=3, Arch=4, Pass1=448, Pass2=12288, clm=4 (8 cpus, 1 worker): 4.20 ms. Throughput: 238.20 iter/sec. FFTlen=5376K, Type=3, Arch=4, Pass1=448, Pass2=12288, clm=4 (8 cpus, 8 workers): 32.13, 31.96, 31.52, 31.30, 32.53, 31.88, 31.37, 31.44 ms. Throughput: 251.87 iter/sec. FFTlen=5376K, Type=3, Arch=4, Pass1=448, Pass2=12288, clm=2 (8 cpus, 1 worker): 5.37 ms. Throughput: 186.36 iter/sec. FFTlen=5376K, Type=3, Arch=4, Pass1=448, Pass2=12288, clm=2 (8 cpus, 8 workers): 32.56, 31.22, 30.63, 32.62, 39.50, 36.38, 35.16, 38.07 ms. Throughput: 233.56 iter/sec. FFTlen=5376K, Type=3, Arch=4, Pass1=448, Pass2=12288, clm=1 (8 cpus, 1 worker): 4.12 ms. Throughput: 242.98 iter/sec. FFTlen=5376K, Type=3, Arch=4, Pass1=448, Pass2=12288, clm=1 (8 cpus, 8 workers): 31.72, 31.51, 31.04, 31.16, 32.13, 31.78, 30.93, 31.12 ms. Throughput: 254.62 iter/sec. FFTlen=5376K, Type=3, Arch=4, Pass1=896, Pass2=6144, clm=4 (8 cpus, 1 worker): 4.11 ms. Throughput: 243.24 iter/sec. FFTlen=5376K, Type=3, Arch=4, Pass1=896, Pass2=6144, clm=4 (8 cpus, 8 workers): 31.66, 31.67, 30.93, 30.85, 31.87, 31.93, 30.98, 30.89 ms. Throughput: 255.25 iter/sec. FFTlen=5376K, Type=3, Arch=4, Pass1=896, Pass2=6144, clm=2 (8 cpus, 1 worker): 3.79 ms. Throughput: 263.85 iter/sec. FFTlen=5376K, Type=3, Arch=4, Pass1=896, Pass2=6144, clm=2 (8 cpus, 8 workers): 30.01, 29.60, 29.48, 29.05, 30.49, 31.15, 28.97, 29.77 ms. Throughput: 268.45 iter/sec. FFTlen=5376K, Type=3, Arch=4, Pass1=896, Pass2=6144, clm=1 (8 cpus, 1 worker): 3.71 ms. Throughput: 269.81 iter/sec. FFTlen=5376K, Type=3, Arch=4, Pass1=896, Pass2=6144, clm=1 (8 cpus, 8 workers): 29.92, 29.78, 29.50, 29.05, 30.47, 29.59, 29.01, 29.05 ms. Throughput: 270.82 iter/sec. FFTlen=5376K, Type=3, Arch=4, Pass1=1792, Pass2=3072, clm=4 (8 cpus, 1 worker): 4.52 ms. Throughput: 221.30 iter/sec. FFTlen=5376K, Type=3, Arch=4, Pass1=1792, Pass2=3072, clm=4 (8 cpus, 8 workers): 34.31, 34.72, 33.61, 33.48, 34.53, 34.13, 33.47, 33.72 ms. Throughput: 235.36 iter/sec. FFTlen=5376K, Type=3, Arch=4, Pass1=1792, Pass2=3072, clm=2 (8 cpus, 1 worker): 4.27 ms. Throughput: 234.11 iter/sec. FFTlen=5376K, Type=3, Arch=4, Pass1=1792, Pass2=3072, clm=2 (8 cpus, 8 workers): 33.34, 32.61, 32.24, 31.97, 33.31, 33.03, 32.16, 32.21 ms. Throughput: 245.39 iter/sec. FFTlen=5376K, Type=3, Arch=4, Pass1=1792, Pass2=3072, clm=1 (8 cpus, 1 worker): 4.17 ms. Throughput: 239.77 iter/sec. FFTlen=5376K, Type=3, Arch=4, Pass1=1792, Pass2=3072, clm=1 (8 cpus, 8 workers): 32.69, 32.34, 32.17, 31.97, 33.25, 32.70, 31.44, 31.69 ms. Throughput: 247.89 iter/sec. FFTlen=5600K, Type=3, Arch=4, Pass1=448, Pass2=12800, clm=4 (8 cpus, 1 worker): 4.23 ms. Throughput: 236.45 iter/sec. FFTlen=5600K, Type=3, Arch=4, Pass1=448, Pass2=12800, clm=4 (8 cpus, 8 workers): 32.22, 31.88, 31.56, 31.42, 32.49, 32.16, 31.48, 31.43 ms. Throughput: 251.37 iter/sec. FFTlen=5600K, Type=3, Arch=4, Pass1=448, Pass2=12800, clm=2 (8 cpus, 1 worker): 4.14 ms. Throughput: 241.37 iter/sec. FFTlen=5600K, Type=3, Arch=4, Pass1=448, Pass2=12800, clm=2 (8 cpus, 8 workers): 32.02, 31.61, 31.21, 30.93, 32.38, 31.97, 30.88, 31.07 ms. Throughput: 253.96 iter/sec. FFTlen=5600K, Type=3, Arch=4, Pass1=448, Pass2=12800, clm=1 (8 cpus, 1 worker): 4.18 ms. Throughput: 239.20 iter/sec. FFTlen=5600K, Type=3, Arch=4, Pass1=448, Pass2=12800, clm=1 (8 cpus, 8 workers): 32.06, 31.70, 31.40, 31.08, 32.32, 31.69, 30.87, 31.15 ms. Throughput: 253.75 iter/sec. FFTlen=5600K, Type=3, Arch=4, Pass1=896, Pass2=6400, clm=4 (8 cpus, 1 worker): 4.47 ms. Throughput: 223.90 iter/sec. FFTlen=5600K, Type=3, Arch=4, Pass1=896, Pass2=6400, clm=4 (8 cpus, 8 workers): 34.18, 33.88, 33.54, 33.41, 34.41, 33.78, 33.25, 33.84 ms. Throughput: 236.82 iter/sec. FFTlen=5600K, Type=3, Arch=4, Pass1=896, Pass2=6400, clm=2 (8 cpus, 1 worker): 4.13 ms. Throughput: 242.40 iter/sec. FFTlen=5600K, Type=3, Arch=4, Pass1=896, Pass2=6400, clm=2 (8 cpus, 8 workers): 35.47, 32.17, 31.84, 34.04, 33.02, 32.35, 31.66, 31.40 ms. Throughput: 244.70 iter/sec. FFTlen=5600K, Type=3, Arch=4, Pass1=896, Pass2=6400, clm=1 (8 cpus, 1 worker): 4.07 ms. Throughput: 245.61 iter/sec. FFTlen=5600K, Type=3, Arch=4, Pass1=896, Pass2=6400, clm=1 (8 cpus, 8 workers): 32.43, 32.20, 31.77, 31.42, 32.60, 32.58, 31.50, 31.40 ms. Throughput: 250.15 iter/sec. [Sat Apr 29 12:55:01 2017] FFTlen=5760K, Type=3, Arch=4, Pass1=384, Pass2=15360, clm=4 (8 cpus, 1 worker): 4.44 ms. Throughput: 225.39 iter/sec. FFTlen=5760K, Type=3, Arch=4, Pass1=384, Pass2=15360, clm=4 (8 cpus, 8 workers): 34.20, 33.79, 33.50, 33.05, 34.37, 34.30, 32.94, 33.02 ms. Throughput: 237.84 iter/sec. FFTlen=5760K, Type=3, Arch=4, Pass1=384, Pass2=15360, clm=2 (8 cpus, 1 worker): 4.37 ms. Throughput: 228.83 iter/sec. FFTlen=5760K, Type=3, Arch=4, Pass1=384, Pass2=15360, clm=2 (8 cpus, 8 workers): 33.78, 33.51, 33.11, 32.81, 34.00, 33.62, 32.86, 32.87 ms. Throughput: 240.14 iter/sec. FFTlen=5760K, Type=3, Arch=4, Pass1=384, Pass2=15360, clm=1 (8 cpus, 1 worker): 4.34 ms. Throughput: 230.36 iter/sec. FFTlen=5760K, Type=3, Arch=4, Pass1=384, Pass2=15360, clm=1 (8 cpus, 8 workers): 33.84, 33.47, 33.17, 32.98, 33.89, 33.72, 32.91, 32.97 ms. Throughput: 239.78 iter/sec. FFTlen=5760K, Type=3, Arch=4, Pass1=640, Pass2=9216, clm=4 (8 cpus, 1 worker): 4.58 ms. Throughput: 218.26 iter/sec. FFTlen=5760K, Type=3, Arch=4, Pass1=640, Pass2=9216, clm=4 (8 cpus, 8 workers): 35.21, 34.99, 34.58, 34.41, 35.84, 35.37, 34.41, 34.55 ms. Throughput: 229.13 iter/sec. FFTlen=5760K, Type=3, Arch=4, Pass1=640, Pass2=9216, clm=2 (8 cpus, 1 worker): 4.28 ms. Throughput: 233.53 iter/sec. FFTlen=5760K, Type=3, Arch=4, Pass1=640, Pass2=9216, clm=2 (8 cpus, 8 workers): 33.71, 33.66, 33.44, 33.02, 33.68, 33.30, 33.11, 33.04 ms. Throughput: 239.74 iter/sec. FFTlen=5760K, Type=3, Arch=4, Pass1=640, Pass2=9216, clm=1 (8 cpus, 1 worker): 4.36 ms. Throughput: 229.44 iter/sec. FFTlen=5760K, Type=3, Arch=4, Pass1=640, Pass2=9216, clm=1 (8 cpus, 8 workers): 33.76, 33.48, 33.06, 32.66, 33.85, 33.73, 32.81, 32.71 ms. Throughput: 240.58 iter/sec. FFTlen=5760K, Type=3, Arch=4, Pass1=768, Pass2=7680, clm=4 (8 cpus, 1 worker): 4.41 ms. Throughput: 226.81 iter/sec. FFTlen=5760K, Type=3, Arch=4, Pass1=768, Pass2=7680, clm=4 (8 cpus, 8 workers): 33.89, 33.46, 33.28, 33.07, 34.31, 33.65, 32.94, 33.17 ms. Throughput: 239.05 iter/sec. FFTlen=5760K, Type=3, Arch=4, Pass1=768, Pass2=7680, clm=2 (8 cpus, 1 worker): 4.06 ms. Throughput: 246.23 iter/sec. FFTlen=5760K, Type=3, Arch=4, Pass1=768, Pass2=7680, clm=2 (8 cpus, 8 workers): 32.19, 31.88, 31.35, 31.88, 32.39, 32.44, 31.76, 31.19 ms. Throughput: 250.96 iter/sec. FFTlen=5760K, Type=3, Arch=4, Pass1=768, Pass2=7680, clm=1 (8 cpus, 1 worker): 4.02 ms. Throughput: 248.46 iter/sec. FFTlen=5760K, Type=3, Arch=4, Pass1=768, Pass2=7680, clm=1 (8 cpus, 8 workers): 32.55, 31.95, 31.61, 31.34, 32.56, 32.25, 30.98, 31.28 ms. Throughput: 251.55 iter/sec. FFTlen=5760K, Type=3, Arch=4, Pass1=1280, Pass2=4608, clm=4 (8 cpus, 1 worker): 4.58 ms. Throughput: 218.52 iter/sec. FFTlen=5760K, Type=3, Arch=4, Pass1=1280, Pass2=4608, clm=4 (8 cpus, 8 workers): 34.64, 34.09, 33.88, 33.57, 34.87, 34.35, 33.66, 33.83 ms. Throughput: 234.56 iter/sec. FFTlen=5760K, Type=3, Arch=4, Pass1=1280, Pass2=4608, clm=2 (8 cpus, 1 worker): 4.19 ms. Throughput: 238.67 iter/sec. FFTlen=5760K, Type=3, Arch=4, Pass1=1280, Pass2=4608, clm=2 (8 cpus, 8 workers): 33.06, 32.57, 32.31, 31.80, 33.26, 33.01, 32.12, 32.21 ms. Throughput: 245.90 iter/sec. FFTlen=5760K, Type=3, Arch=4, Pass1=1280, Pass2=4608, clm=1 (8 cpus, 1 worker): 4.22 ms. Throughput: 236.79 iter/sec. FFTlen=5760K, Type=3, Arch=4, Pass1=1280, Pass2=4608, clm=1 (8 cpus, 8 workers): 32.41, 32.13, 31.99, 31.41, 32.83, 32.34, 31.46, 31.53 ms. Throughput: 249.95 iter/sec. FFTlen=5760K, Type=3, Arch=4, Pass1=1536, Pass2=3840, clm=4 (8 cpus, 1 worker): 4.67 ms. Throughput: 214.00 iter/sec. FFTlen=5760K, Type=3, Arch=4, Pass1=1536, Pass2=3840, clm=4 (8 cpus, 8 workers): 35.54, 35.25, 34.78, 34.67, 36.10, 35.33, 34.52, 34.76 ms. Throughput: 227.85 iter/sec. FFTlen=5760K, Type=3, Arch=4, Pass1=1536, Pass2=3840, clm=2 (8 cpus, 1 worker): 4.26 ms. Throughput: 234.67 iter/sec. FFTlen=5760K, Type=3, Arch=4, Pass1=1536, Pass2=3840, clm=2 (8 cpus, 8 workers): 34.01, 32.89, 33.33, 32.86, 33.70, 33.20, 32.72, 32.67 ms. Throughput: 241.22 iter/sec. FFTlen=5760K, Type=3, Arch=4, Pass1=1536, Pass2=3840, clm=1 (8 cpus, 1 worker): 4.10 ms. Throughput: 244.15 iter/sec. FFTlen=5760K, Type=3, Arch=4, Pass1=1536, Pass2=3840, clm=1 (8 cpus, 8 workers): 32.83, 32.63, 32.37, 31.89, 33.10, 32.85, 32.04, 31.92 ms. Throughput: 246.55 iter/sec. FFTlen=6144K, Type=3, Arch=4, Pass1=384, Pass2=16384, clm=4 (8 cpus, 1 worker): 4.92 ms. Throughput: 203.28 iter/sec. [Sat Apr 29 13:00:04 2017] FFTlen=6144K, Type=3, Arch=4, Pass1=384, Pass2=16384, clm=4 (8 cpus, 8 workers): 37.48, 37.30, 36.75, 36.76, 37.64, 37.05, 36.57, 36.74 ms. Throughput: 216.02 iter/sec. FFTlen=6144K, Type=3, Arch=4, Pass1=384, Pass2=16384, clm=2 (8 cpus, 1 worker): 4.87 ms. Throughput: 205.44 iter/sec. FFTlen=6144K, Type=3, Arch=4, Pass1=384, Pass2=16384, clm=2 (8 cpus, 8 workers): 37.14, 36.77, 36.33, 36.13, 37.95, 37.14, 36.14, 36.15 ms. Throughput: 217.93 iter/sec. FFTlen=6144K, Type=3, Arch=4, Pass1=384, Pass2=16384, clm=1 (8 cpus, 1 worker): 4.83 ms. Throughput: 207.11 iter/sec. FFTlen=6144K, Type=3, Arch=4, Pass1=384, Pass2=16384, clm=1 (8 cpus, 8 workers): 37.28, 36.89, 36.51, 35.98, 37.43, 37.21, 36.23, 36.34 ms. Throughput: 217.82 iter/sec. FFTlen=6144K, Type=3, Arch=4, Pass1=512, Pass2=12288, clm=4 (8 cpus, 1 worker): 5.05 ms. Throughput: 198.15 iter/sec. FFTlen=6144K, Type=3, Arch=4, Pass1=512, Pass2=12288, clm=4 (8 cpus, 8 workers): 38.14, 37.83, 37.34, 37.29, 38.43, 37.99, 37.35, 37.32 ms. Throughput: 212.16 iter/sec. FFTlen=6144K, Type=3, Arch=4, Pass1=512, Pass2=12288, clm=2 (8 cpus, 1 worker): 4.78 ms. Throughput: 209.29 iter/sec. FFTlen=6144K, Type=3, Arch=4, Pass1=512, Pass2=12288, clm=2 (8 cpus, 8 workers): 36.69, 36.39, 36.02, 35.88, 37.31, 36.71, 35.78, 35.77 ms. Throughput: 220.31 iter/sec. FFTlen=6144K, Type=3, Arch=4, Pass1=512, Pass2=12288, clm=1 (8 cpus, 1 worker): 4.72 ms. Throughput: 211.66 iter/sec. FFTlen=6144K, Type=3, Arch=4, Pass1=512, Pass2=12288, clm=1 (8 cpus, 8 workers): 36.83, 36.46, 36.00, 35.95, 37.16, 36.21, 35.95, 36.04 ms. Throughput: 220.26 iter/sec. FFTlen=6144K, Type=3, Arch=4, Pass1=768, Pass2=8192, clm=4 (8 cpus, 1 worker): 4.76 ms. Throughput: 209.99 iter/sec. FFTlen=6144K, Type=3, Arch=4, Pass1=768, Pass2=8192, clm=4 (8 cpus, 8 workers): 36.67, 36.26, 35.76, 35.39, 37.50, 36.36, 35.52, 35.40 ms. Throughput: 221.63 iter/sec. FFTlen=6144K, Type=3, Arch=4, Pass1=768, Pass2=8192, clm=2 (8 cpus, 1 worker): 4.37 ms. Throughput: 229.03 iter/sec. FFTlen=6144K, Type=3, Arch=4, Pass1=768, Pass2=8192, clm=2 (8 cpus, 8 workers): 34.33, 34.20, 33.82, 33.72, 34.84, 34.35, 33.61, 33.73 ms. Throughput: 234.81 iter/sec. FFTlen=6144K, Type=3, Arch=4, Pass1=768, Pass2=8192, clm=1 (8 cpus, 1 worker): 4.35 ms. Throughput: 230.14 iter/sec. FFTlen=6144K, Type=3, Arch=4, Pass1=768, Pass2=8192, clm=1 (8 cpus, 8 workers): 34.54, 34.18, 33.79, 33.58, 34.77, 34.46, 33.69, 33.70 ms. Throughput: 234.71 iter/sec. FFTlen=6144K, Type=3, Arch=4, Pass1=1024, Pass2=6144, clm=4 (8 cpus, 1 worker): 4.84 ms. Throughput: 206.67 iter/sec. FFTlen=6144K, Type=3, Arch=4, Pass1=1024, Pass2=6144, clm=4 (8 cpus, 8 workers): 36.96, 36.85, 36.03, 35.92, 36.96, 36.99, 35.83, 36.06 ms. Throughput: 219.53 iter/sec. FFTlen=6144K, Type=3, Arch=4, Pass1=1024, Pass2=6144, clm=2 (8 cpus, 1 worker): 4.36 ms. Throughput: 229.29 iter/sec. FFTlen=6144K, Type=3, Arch=4, Pass1=1024, Pass2=6144, clm=2 (8 cpus, 8 workers): 34.88, 34.48, 33.64, 33.55, 34.58, 34.31, 33.57, 33.59 ms. Throughput: 234.82 iter/sec. FFTlen=6144K, Type=3, Arch=4, Pass1=1024, Pass2=6144, clm=1 (8 cpus, 1 worker): 4.27 ms. Throughput: 233.97 iter/sec. FFTlen=6144K, Type=3, Arch=4, Pass1=1024, Pass2=6144, clm=1 (8 cpus, 8 workers): 34.20, 34.11, 33.56, 33.19, 34.57, 34.47, 33.31, 33.29 ms. Throughput: 236.48 iter/sec. FFTlen=6144K, Type=3, Arch=4, Pass1=1536, Pass2=4096, clm=4 (8 cpus, 1 worker): 5.04 ms. Throughput: 198.26 iter/sec. FFTlen=6144K, Type=3, Arch=4, Pass1=1536, Pass2=4096, clm=4 (8 cpus, 8 workers): 37.52, 37.07, 36.61, 36.73, 38.05, 37.08, 36.50, 36.61 ms. Throughput: 216.13 iter/sec. FFTlen=6144K, Type=3, Arch=4, Pass1=1536, Pass2=4096, clm=2 (8 cpus, 1 worker): 4.55 ms. Throughput: 219.87 iter/sec. FFTlen=6144K, Type=3, Arch=4, Pass1=1536, Pass2=4096, clm=2 (8 cpus, 8 workers): 35.80, 34.99, 35.11, 34.64, 35.71, 36.49, 34.70, 34.69 ms. Throughput: 226.92 iter/sec. FFTlen=6144K, Type=3, Arch=4, Pass1=1536, Pass2=4096, clm=1 (8 cpus, 1 worker): 4.89 ms. Throughput: 204.53 iter/sec. FFTlen=6144K, Type=3, Arch=4, Pass1=1536, Pass2=4096, clm=1 (8 cpus, 8 workers): 34.72, 34.43, 34.03, 33.63, 35.16, 34.89, 33.74, 33.94 ms. Throughput: 233.19 iter/sec. FFTlen=6144K, Type=3, Arch=4, Pass1=2048, Pass2=3072, clm=4 (8 cpus, 1 worker): 5.42 ms. Throughput: 184.46 iter/sec. FFTlen=6144K, Type=3, Arch=4, Pass1=2048, Pass2=3072, clm=4 (8 cpus, 8 workers): 40.06, 39.62, 39.28, 39.24, 40.35, 39.61, 39.09, 39.17 ms. Throughput: 202.29 iter/sec. [Sat Apr 29 13:05:09 2017] FFTlen=6144K, Type=3, Arch=4, Pass1=2048, Pass2=3072, clm=2 (8 cpus, 1 worker): 5.29 ms. Throughput: 189.12 iter/sec. FFTlen=6144K, Type=3, Arch=4, Pass1=2048, Pass2=3072, clm=2 (8 cpus, 8 workers): 40.51, 39.97, 39.49, 39.21, 40.79, 40.17, 39.12, 39.14 ms. Throughput: 201.05 iter/sec. FFTlen=6144K, Type=3, Arch=4, Pass1=2048, Pass2=3072, clm=1 (8 cpus, 1 worker): 5.18 ms. Throughput: 193.04 iter/sec. FFTlen=6144K, Type=3, Arch=4, Pass1=2048, Pass2=3072, clm=1 (8 cpus, 8 workers): 40.02, 39.85, 39.21, 38.95, 40.14, 39.92, 38.87, 38.78 ms. Throughput: 202.72 iter/sec. FFTlen=6400K, Type=3, Arch=4, Pass1=256, Pass2=25600, clm=4 (8 cpus, 1 worker): 5.23 ms. Throughput: 191.09 iter/sec. FFTlen=6400K, Type=3, Arch=4, Pass1=256, Pass2=25600, clm=4 (8 cpus, 8 workers): 39.32, 38.98, 38.75, 38.25, 40.05, 39.51, 38.39, 38.16 ms. Throughput: 205.57 iter/sec. FFTlen=6400K, Type=3, Arch=4, Pass1=256, Pass2=25600, clm=2 (8 cpus, 1 worker): 5.11 ms. Throughput: 195.65 iter/sec. FFTlen=6400K, Type=3, Arch=4, Pass1=256, Pass2=25600, clm=2 (8 cpus, 8 workers): 38.37, 37.67, 37.33, 37.67, 38.70, 38.24, 37.44, 37.32 ms. Throughput: 211.42 iter/sec. FFTlen=6400K, Type=3, Arch=4, Pass1=256, Pass2=25600, clm=1 (8 cpus, 1 worker): 5.14 ms. Throughput: 194.42 iter/sec. FFTlen=6400K, Type=3, Arch=4, Pass1=256, Pass2=25600, clm=1 (8 cpus, 8 workers): 38.16, 37.95, 37.59, 37.20, 38.57, 38.06, 37.34, 37.39 ms. Throughput: 211.78 iter/sec. FFTlen=6400K, Type=3, Arch=4, Pass1=320, Pass2=20480, clm=4 (8 cpus, 1 worker): 5.28 ms. Throughput: 189.40 iter/sec. FFTlen=6400K, Type=3, Arch=4, Pass1=320, Pass2=20480, clm=4 (8 cpus, 8 workers): 39.43, 39.04, 38.96, 38.57, 40.21, 39.35, 38.61, 38.68 ms. Throughput: 204.60 iter/sec. FFTlen=6400K, Type=3, Arch=4, Pass1=320, Pass2=20480, clm=2 (8 cpus, 1 worker): 5.26 ms. Throughput: 190.24 iter/sec. FFTlen=6400K, Type=3, Arch=4, Pass1=320, Pass2=20480, clm=2 (8 cpus, 8 workers): 39.89, 39.16, 39.07, 38.70, 40.00, 39.53, 38.46, 38.71 ms. Throughput: 204.17 iter/sec. FFTlen=6400K, Type=3, Arch=4, Pass1=320, Pass2=20480, clm=1 (8 cpus, 1 worker): 5.23 ms. Throughput: 191.05 iter/sec. FFTlen=6400K, Type=3, Arch=4, Pass1=320, Pass2=20480, clm=1 (8 cpus, 8 workers): 39.36, 39.00, 38.56, 38.51, 39.82, 39.57, 38.41, 38.72 ms. Throughput: 205.20 iter/sec. FFTlen=6400K, Type=3, Arch=4, Pass1=512, Pass2=12800, clm=4 (8 cpus, 1 worker): 5.06 ms. Throughput: 197.65 iter/sec. FFTlen=6400K, Type=3, Arch=4, Pass1=512, Pass2=12800, clm=4 (8 cpus, 8 workers): 38.37, 37.99, 37.44, 37.31, 38.40, 38.22, 37.28, 37.33 ms. Throughput: 211.72 iter/sec. FFTlen=6400K, Type=3, Arch=4, Pass1=512, Pass2=12800, clm=2 (8 cpus, 1 worker): 4.82 ms. Throughput: 207.57 iter/sec. FFTlen=6400K, Type=3, Arch=4, Pass1=512, Pass2=12800, clm=2 (8 cpus, 8 workers): 36.90, 36.58, 36.13, 35.78, 37.33, 36.76, 35.80, 35.85 ms. Throughput: 219.88 iter/sec. FFTlen=6400K, Type=3, Arch=4, Pass1=512, Pass2=12800, clm=1 (8 cpus, 1 worker): 4.77 ms. Throughput: 209.68 iter/sec. FFTlen=6400K, Type=3, Arch=4, Pass1=512, Pass2=12800, clm=1 (8 cpus, 8 workers): 36.78, 36.51, 36.07, 35.90, 36.93, 36.82, 36.00, 35.89 ms. Throughput: 220.03 iter/sec. FFTlen=6400K, Type=3, Arch=4, Pass1=640, Pass2=10240, clm=4 (8 cpus, 1 worker): 5.01 ms. Throughput: 199.66 iter/sec. FFTlen=6400K, Type=3, Arch=4, Pass1=640, Pass2=10240, clm=4 (8 cpus, 8 workers): 38.55, 38.29, 37.92, 37.50, 39.01, 38.33, 37.29, 37.29 ms. Throughput: 210.46 iter/sec. FFTlen=6400K, Type=3, Arch=4, Pass1=640, Pass2=10240, clm=2 (8 cpus, 1 worker): 4.65 ms. Throughput: 215.03 iter/sec. FFTlen=6400K, Type=3, Arch=4, Pass1=640, Pass2=10240, clm=2 (8 cpus, 8 workers): 37.26, 37.01, 35.69, 35.53, 37.49, 36.53, 35.86, 35.88 ms. Throughput: 219.82 iter/sec. FFTlen=6400K, Type=3, Arch=4, Pass1=640, Pass2=10240, clm=1 (8 cpus, 1 worker): 4.66 ms. Throughput: 214.69 iter/sec. FFTlen=6400K, Type=3, Arch=4, Pass1=640, Pass2=10240, clm=1 (8 cpus, 8 workers): 36.71, 36.52, 35.94, 35.66, 36.90, 36.89, 35.65, 35.46 ms. Throughput: 220.94 iter/sec. FFTlen=6400K, Type=3, Arch=4, Pass1=1024, Pass2=6400, clm=4 (8 cpus, 1 worker): 5.21 ms. Throughput: 192.10 iter/sec. FFTlen=6400K, Type=3, Arch=4, Pass1=1024, Pass2=6400, clm=4 (8 cpus, 8 workers): 40.21, 39.95, 39.02, 38.75, 40.26, 40.29, 38.63, 38.83 ms. Throughput: 202.64 iter/sec. [Sat Apr 29 13:10:12 2017] FFTlen=6400K, Type=3, Arch=4, Pass1=1024, Pass2=6400, clm=2 (8 cpus, 1 worker): 4.78 ms. Throughput: 209.11 iter/sec. FFTlen=6400K, Type=3, Arch=4, Pass1=1024, Pass2=6400, clm=2 (8 cpus, 8 workers): 37.21, 36.99, 36.61, 36.19, 37.81, 37.54, 36.39, 36.29 ms. Throughput: 216.99 iter/sec. FFTlen=6400K, Type=3, Arch=4, Pass1=1024, Pass2=6400, clm=1 (8 cpus, 1 worker): 4.69 ms. Throughput: 213.23 iter/sec. FFTlen=6400K, Type=3, Arch=4, Pass1=1024, Pass2=6400, clm=1 (8 cpus, 8 workers): 36.83, 36.56, 36.20, 35.91, 37.21, 36.81, 35.94, 36.00 ms. Throughput: 219.62 iter/sec. FFTlen=6400K, Type=3, Arch=4, Pass1=1280, Pass2=5120, clm=4 (8 cpus, 1 worker): 5.10 ms. Throughput: 195.97 iter/sec. FFTlen=6400K, Type=3, Arch=4, Pass1=1280, Pass2=5120, clm=4 (8 cpus, 8 workers): 38.96, 38.36, 38.04, 37.88, 39.32, 38.62, 37.54, 37.73 ms. Throughput: 208.90 iter/sec. FFTlen=6400K, Type=3, Arch=4, Pass1=1280, Pass2=5120, clm=2 (8 cpus, 1 worker): 4.69 ms. Throughput: 213.31 iter/sec. FFTlen=6400K, Type=3, Arch=4, Pass1=1280, Pass2=5120, clm=2 (8 cpus, 8 workers): 36.84, 36.53, 36.34, 35.79, 37.06, 36.54, 35.70, 35.92 ms. Throughput: 220.18 iter/sec. FFTlen=6400K, Type=3, Arch=4, Pass1=1280, Pass2=5120, clm=1 (8 cpus, 1 worker): 4.58 ms. Throughput: 218.51 iter/sec. FFTlen=6400K, Type=3, Arch=4, Pass1=1280, Pass2=5120, clm=1 (8 cpus, 8 workers): 36.35, 35.92, 35.55, 35.42, 36.84, 36.59, 35.22, 35.26 ms. Throughput: 222.94 iter/sec. FFTlen=6720K, Type=3, Arch=4, Pass1=448, Pass2=15360, clm=4 (8 cpus, 1 worker): 5.25 ms. Throughput: 190.62 iter/sec. FFTlen=6720K, Type=3, Arch=4, Pass1=448, Pass2=15360, clm=4 (8 cpus, 8 workers): 40.15, 39.74, 39.54, 39.60, 40.58, 40.23, 39.32, 39.43 ms. Throughput: 200.91 iter/sec. FFTlen=6720K, Type=3, Arch=4, Pass1=448, Pass2=15360, clm=2 (8 cpus, 1 worker): 5.17 ms. Throughput: 193.30 iter/sec. FFTlen=6720K, Type=3, Arch=4, Pass1=448, Pass2=15360, clm=2 (8 cpus, 8 workers): 39.85, 39.46, 39.01, 38.77, 40.49, 39.60, 38.73, 38.84 ms. Throughput: 203.38 iter/sec. FFTlen=6720K, Type=3, Arch=4, Pass1=448, Pass2=15360, clm=1 (8 cpus, 1 worker): 5.16 ms. Throughput: 193.66 iter/sec. FFTlen=6720K, Type=3, Arch=4, Pass1=448, Pass2=15360, clm=1 (8 cpus, 8 workers): 39.95, 39.69, 39.22, 39.01, 40.38, 40.11, 38.87, 38.97 ms. Throughput: 202.45 iter/sec. FFTlen=6720K, Type=3, Arch=4, Pass1=896, Pass2=7680, clm=4 (8 cpus, 1 worker): 5.25 ms. Throughput: 190.64 iter/sec. FFTlen=6720K, Type=3, Arch=4, Pass1=896, Pass2=7680, clm=4 (8 cpus, 8 workers): 40.47, 40.04, 39.51, 39.37, 40.77, 40.27, 39.22, 39.73 ms. Throughput: 200.42 iter/sec. FFTlen=6720K, Type=3, Arch=4, Pass1=896, Pass2=7680, clm=2 (8 cpus, 1 worker): 4.84 ms. Throughput: 206.45 iter/sec. FFTlen=6720K, Type=3, Arch=4, Pass1=896, Pass2=7680, clm=2 (8 cpus, 8 workers): 37.82, 37.76, 37.11, 36.74, 38.45, 38.16, 37.10, 36.81 ms. Throughput: 213.42 iter/sec. FFTlen=6720K, Type=3, Arch=4, Pass1=896, Pass2=7680, clm=1 (8 cpus, 1 worker): 4.77 ms. Throughput: 209.53 iter/sec. FFTlen=6720K, Type=3, Arch=4, Pass1=896, Pass2=7680, clm=1 (8 cpus, 8 workers): 38.01, 37.72, 37.36, 37.22, 38.47, 38.20, 36.97, 37.05 ms. Throughput: 212.66 iter/sec. FFTlen=6720K, Type=3, Arch=4, Pass1=1792, Pass2=3840, clm=4 (8 cpus, 1 worker): 5.75 ms. Throughput: 173.93 iter/sec. FFTlen=6720K, Type=3, Arch=4, Pass1=1792, Pass2=3840, clm=4 (8 cpus, 8 workers): 43.58, 43.12, 42.55, 42.28, 44.08, 43.51, 42.12, 42.24 ms. Throughput: 186.38 iter/sec. FFTlen=6720K, Type=3, Arch=4, Pass1=1792, Pass2=3840, clm=2 (8 cpus, 1 worker): 5.27 ms. Throughput: 189.86 iter/sec. FFTlen=6720K, Type=3, Arch=4, Pass1=1792, Pass2=3840, clm=2 (8 cpus, 8 workers): 41.06, 40.05, 40.50, 39.79, 40.86, 40.41, 39.84, 40.32 ms. Throughput: 198.27 iter/sec. FFTlen=6720K, Type=3, Arch=4, Pass1=1792, Pass2=3840, clm=1 (8 cpus, 1 worker): 5.39 ms. Throughput: 185.59 iter/sec. FFTlen=6720K, Type=3, Arch=4, Pass1=1792, Pass2=3840, clm=1 (8 cpus, 8 workers): 40.11, 39.61, 39.27, 38.87, 40.31, 39.81, 39.13, 38.99 ms. Throughput: 202.49 iter/sec. FFTlen=6912K, Type=3, Arch=4, Pass1=768, Pass2=9216, clm=4 (8 cpus, 1 worker): 5.53 ms. Throughput: 180.89 iter/sec. FFTlen=6912K, Type=3, Arch=4, Pass1=768, Pass2=9216, clm=4 (8 cpus, 8 workers): 43.02, 42.64, 42.10, 41.87, 43.02, 42.75, 41.95, 41.75 ms. Throughput: 188.76 iter/sec. FFTlen=6912K, Type=3, Arch=4, Pass1=768, Pass2=9216, clm=2 (8 cpus, 1 worker): 5.24 ms. Throughput: 190.94 iter/sec. [Sat Apr 29 13:15:21 2017] FFTlen=6912K, Type=3, Arch=4, Pass1=768, Pass2=9216, clm=2 (8 cpus, 8 workers): 41.08, 40.79, 40.27, 40.07, 41.69, 40.87, 39.99, 39.85 ms. Throughput: 197.21 iter/sec. FFTlen=6912K, Type=3, Arch=4, Pass1=768, Pass2=9216, clm=1 (8 cpus, 1 worker): 5.14 ms. Throughput: 194.66 iter/sec. FFTlen=6912K, Type=3, Arch=4, Pass1=768, Pass2=9216, clm=1 (8 cpus, 8 workers): 41.16, 40.98, 40.64, 40.31, 40.51, 41.09, 39.52, 40.32 ms. Throughput: 197.24 iter/sec. FFTlen=6912K, Type=3, Arch=4, Pass1=1536, Pass2=4608, clm=4 (8 cpus, 1 worker): 5.71 ms. Throughput: 175.18 iter/sec. FFTlen=6912K, Type=3, Arch=4, Pass1=1536, Pass2=4608, clm=4 (8 cpus, 8 workers): 42.58, 42.06, 41.66, 41.50, 42.95, 43.72, 42.64, 41.39 ms. Throughput: 189.12 iter/sec. FFTlen=6912K, Type=3, Arch=4, Pass1=1536, Pass2=4608, clm=2 (8 cpus, 1 worker): 5.16 ms. Throughput: 193.62 iter/sec. FFTlen=6912K, Type=3, Arch=4, Pass1=1536, Pass2=4608, clm=2 (8 cpus, 8 workers): 39.97, 40.19, 39.29, 39.39, 41.24, 40.76, 39.68, 38.62 ms. Throughput: 200.61 iter/sec. FFTlen=6912K, Type=3, Arch=4, Pass1=1536, Pass2=4608, clm=1 (8 cpus, 1 worker): 4.97 ms. Throughput: 201.09 iter/sec. FFTlen=6912K, Type=3, Arch=4, Pass1=1536, Pass2=4608, clm=1 (8 cpus, 8 workers): 39.23, 38.98, 38.38, 38.28, 39.27, 39.26, 38.19, 38.39 ms. Throughput: 206.50 iter/sec. FFTlen=7168K, Type=3, Arch=4, Pass1=448, Pass2=16384, clm=4 (8 cpus, 1 worker): 5.83 ms. Throughput: 171.44 iter/sec. FFTlen=7168K, Type=3, Arch=4, Pass1=448, Pass2=16384, clm=4 (8 cpus, 8 workers): 44.55, 43.94, 43.71, 43.51, 45.20, 44.31, 43.15, 43.37 ms. Throughput: 181.99 iter/sec. FFTlen=7168K, Type=3, Arch=4, Pass1=448, Pass2=16384, clm=2 (8 cpus, 1 worker): 5.70 ms. Throughput: 175.30 iter/sec. FFTlen=7168K, Type=3, Arch=4, Pass1=448, Pass2=16384, clm=2 (8 cpus, 8 workers): 43.89, 43.35, 42.87, 42.68, 44.37, 43.67, 42.61, 42.77 ms. Throughput: 184.90 iter/sec. FFTlen=7168K, Type=3, Arch=4, Pass1=448, Pass2=16384, clm=1 (8 cpus, 1 worker): 5.69 ms. Throughput: 175.68 iter/sec. FFTlen=7168K, Type=3, Arch=4, Pass1=448, Pass2=16384, clm=1 (8 cpus, 8 workers): 43.79, 43.44, 42.99, 42.84, 44.18, 43.28, 42.86, 42.80 ms. Throughput: 184.90 iter/sec. FFTlen=7168K, Type=3, Arch=4, Pass1=896, Pass2=8192, clm=4 (8 cpus, 1 worker): 5.65 ms. Throughput: 176.97 iter/sec. FFTlen=7168K, Type=3, Arch=4, Pass1=896, Pass2=8192, clm=4 (8 cpus, 8 workers): 43.29, 42.90, 42.74, 42.58, 44.15, 43.39, 42.24, 42.16 ms. Throughput: 186.39 iter/sec. FFTlen=7168K, Type=3, Arch=4, Pass1=896, Pass2=8192, clm=2 (8 cpus, 1 worker): 5.25 ms. Throughput: 190.42 iter/sec. FFTlen=7168K, Type=3, Arch=4, Pass1=896, Pass2=8192, clm=2 (8 cpus, 8 workers): 41.25, 40.61, 40.35, 40.06, 41.45, 41.33, 39.97, 39.97 ms. Throughput: 196.97 iter/sec. FFTlen=7168K, Type=3, Arch=4, Pass1=896, Pass2=8192, clm=1 (8 cpus, 1 worker): 5.17 ms. Throughput: 193.48 iter/sec. FFTlen=7168K, Type=3, Arch=4, Pass1=896, Pass2=8192, clm=1 (8 cpus, 8 workers): 40.56, 40.28, 40.05, 39.65, 41.59, 40.77, 39.68, 39.67 ms. Throughput: 198.65 iter/sec. FFTlen=7168K, Type=3, Arch=4, Pass1=1792, Pass2=4096, clm=4 (8 cpus, 1 worker): 6.22 ms. Throughput: 160.75 iter/sec. FFTlen=7168K, Type=3, Arch=4, Pass1=1792, Pass2=4096, clm=4 (8 cpus, 8 workers): 46.24, 45.59, 44.97, 44.59, 47.18, 46.75, 44.68, 44.86 ms. Throughput: 175.48 iter/sec. FFTlen=7168K, Type=3, Arch=4, Pass1=1792, Pass2=4096, clm=2 (8 cpus, 1 worker): 5.70 ms. Throughput: 175.41 iter/sec. FFTlen=7168K, Type=3, Arch=4, Pass1=1792, Pass2=4096, clm=2 (8 cpus, 8 workers): 44.04, 43.52, 42.92, 42.88, 44.26, 43.64, 42.81, 42.83 ms. Throughput: 184.51 iter/sec. FFTlen=7168K, Type=3, Arch=4, Pass1=1792, Pass2=4096, clm=1 (8 cpus, 1 worker): 5.59 ms. Throughput: 178.86 iter/sec. FFTlen=7168K, Type=3, Arch=4, Pass1=1792, Pass2=4096, clm=1 (8 cpus, 8 workers): 43.43, 43.34, 42.79, 42.60, 44.23, 43.50, 42.55, 42.38 ms. Throughput: 185.63 iter/sec. FFTlen=7680K, Type=3, Arch=4, Pass1=384, Pass2=20480, clm=4 (8 cpus, 1 worker): 6.31 ms. Throughput: 158.50 iter/sec. FFTlen=7680K, Type=3, Arch=4, Pass1=384, Pass2=20480, clm=4 (8 cpus, 8 workers): 48.40, 47.66, 46.88, 46.47, 49.84, 48.66, 46.23, 46.48 ms. Throughput: 168.25 iter/sec. FFTlen=7680K, Type=3, Arch=4, Pass1=384, Pass2=20480, clm=2 (8 cpus, 1 worker): 6.41 ms. Throughput: 156.12 iter/sec. [Sat Apr 29 13:20:25 2017] FFTlen=7680K, Type=3, Arch=4, Pass1=384, Pass2=20480, clm=2 (8 cpus, 8 workers): 47.58, 46.99, 46.46, 46.24, 48.57, 47.49, 46.12, 46.06 ms. Throughput: 170.49 iter/sec. FFTlen=7680K, Type=3, Arch=4, Pass1=384, Pass2=20480, clm=1 (8 cpus, 1 worker): 6.24 ms. Throughput: 160.14 iter/sec. FFTlen=7680K, Type=3, Arch=4, Pass1=384, Pass2=20480, clm=1 (8 cpus, 8 workers): 47.68, 47.17, 46.47, 46.33, 48.37, 47.31, 46.18, 46.47 ms. Throughput: 170.27 iter/sec. FFTlen=7680K, Type=3, Arch=4, Pass1=512, Pass2=15360, clm=4 (8 cpus, 1 worker): 6.24 ms. Throughput: 160.31 iter/sec. FFTlen=7680K, Type=3, Arch=4, Pass1=512, Pass2=15360, clm=4 (8 cpus, 8 workers): 48.22, 47.66, 46.90, 46.55, 48.94, 48.29, 46.48, 46.62 ms. Throughput: 168.63 iter/sec. FFTlen=7680K, Type=3, Arch=4, Pass1=512, Pass2=15360, clm=2 (8 cpus, 1 worker): 5.93 ms. Throughput: 168.62 iter/sec. FFTlen=7680K, Type=3, Arch=4, Pass1=512, Pass2=15360, clm=2 (8 cpus, 8 workers): 45.72, 45.15, 44.86, 44.38, 46.32, 45.75, 44.60, 44.50 ms. Throughput: 177.18 iter/sec. FFTlen=7680K, Type=3, Arch=4, Pass1=512, Pass2=15360, clm=1 (8 cpus, 1 worker): 5.90 ms. Throughput: 169.35 iter/sec. FFTlen=7680K, Type=3, Arch=4, Pass1=512, Pass2=15360, clm=1 (8 cpus, 8 workers): 45.79, 45.52, 45.04, 44.90, 45.34, 45.73, 44.85, 44.66 ms. Throughput: 176.89 iter/sec. FFTlen=7680K, Type=3, Arch=4, Pass1=640, Pass2=12288, clm=4 (8 cpus, 1 worker): 6.34 ms. Throughput: 157.62 iter/sec. FFTlen=7680K, Type=3, Arch=4, Pass1=640, Pass2=12288, clm=4 (8 cpus, 8 workers): 48.45, 47.80, 47.47, 47.31, 48.68, 48.22, 48.00, 48.00 ms. Throughput: 166.71 iter/sec. FFTlen=7680K, Type=3, Arch=4, Pass1=640, Pass2=12288, clm=2 (8 cpus, 1 worker): 5.99 ms. Throughput: 167.00 iter/sec. FFTlen=7680K, Type=3, Arch=4, Pass1=640, Pass2=12288, clm=2 (8 cpus, 8 workers): 46.12, 45.77, 45.17, 45.11, 46.66, 46.01, 44.90, 45.15 ms. Throughput: 175.42 iter/sec. FFTlen=7680K, Type=3, Arch=4, Pass1=640, Pass2=12288, clm=1 (8 cpus, 1 worker): 5.97 ms. Throughput: 167.43 iter/sec. FFTlen=7680K, Type=3, Arch=4, Pass1=640, Pass2=12288, clm=1 (8 cpus, 8 workers): 46.32, 46.19, 45.36, 45.12, 47.06, 46.37, 44.81, 44.97 ms. Throughput: 174.82 iter/sec. FFTlen=7680K, Type=3, Arch=4, Pass1=768, Pass2=10240, clm=4 (8 cpus, 1 worker): 6.00 ms. Throughput: 166.69 iter/sec. FFTlen=7680K, Type=3, Arch=4, Pass1=768, Pass2=10240, clm=4 (8 cpus, 8 workers): 46.65, 46.47, 45.97, 45.34, 46.99, 46.47, 45.40, 45.78 ms. Throughput: 173.44 iter/sec. FFTlen=7680K, Type=3, Arch=4, Pass1=768, Pass2=10240, clm=2 (8 cpus, 1 worker): 5.60 ms. Throughput: 178.63 iter/sec. FFTlen=7680K, Type=3, Arch=4, Pass1=768, Pass2=10240, clm=2 (8 cpus, 8 workers): 43.90, 43.52, 43.13, 43.12, 43.99, 43.70, 43.00, 43.13 ms. Throughput: 184.19 iter/sec. FFTlen=7680K, Type=3, Arch=4, Pass1=768, Pass2=10240, clm=1 (8 cpus, 1 worker): 5.60 ms. Throughput: 178.67 iter/sec. FFTlen=7680K, Type=3, Arch=4, Pass1=768, Pass2=10240, clm=1 (8 cpus, 8 workers): 43.74, 43.48, 43.27, 42.91, 44.27, 43.79, 42.88, 42.69 ms. Throughput: 184.44 iter/sec. FFTlen=7680K, Type=3, Arch=4, Pass1=1024, Pass2=7680, clm=4 (8 cpus, 1 worker): 6.11 ms. Throughput: 163.59 iter/sec. FFTlen=7680K, Type=3, Arch=4, Pass1=1024, Pass2=7680, clm=4 (8 cpus, 8 workers): 46.83, 46.31, 46.04, 45.73, 47.28, 47.10, 47.47, 47.36 ms. Throughput: 171.09 iter/sec. FFTlen=7680K, Type=3, Arch=4, Pass1=1024, Pass2=7680, clm=2 (8 cpus, 1 worker): 5.57 ms. Throughput: 179.42 iter/sec. FFTlen=7680K, Type=3, Arch=4, Pass1=1024, Pass2=7680, clm=2 (8 cpus, 8 workers): 46.27, 44.51, 44.02, 43.86, 43.78, 43.45, 42.40, 42.23 ms. Throughput: 182.72 iter/sec. FFTlen=7680K, Type=3, Arch=4, Pass1=1024, Pass2=7680, clm=1 (8 cpus, 1 worker): 5.47 ms. Throughput: 182.93 iter/sec. FFTlen=7680K, Type=3, Arch=4, Pass1=1024, Pass2=7680, clm=1 (8 cpus, 8 workers): 43.17, 42.67, 42.62, 42.07, 43.75, 43.13, 42.23, 42.29 ms. Throughput: 187.20 iter/sec. FFTlen=7680K, Type=3, Arch=4, Pass1=1280, Pass2=6144, clm=4 (8 cpus, 1 worker): 6.01 ms. Throughput: 166.36 iter/sec. FFTlen=7680K, Type=3, Arch=4, Pass1=1280, Pass2=6144, clm=4 (8 cpus, 8 workers): 46.17, 45.83, 44.99, 44.77, 46.97, 46.05, 44.83, 44.71 ms. Throughput: 175.72 iter/sec. [Sat Apr 29 13:25:26 2017] FFTlen=7680K, Type=3, Arch=4, Pass1=1280, Pass2=6144, clm=2 (8 cpus, 1 worker): 5.56 ms. Throughput: 179.92 iter/sec. FFTlen=7680K, Type=3, Arch=4, Pass1=1280, Pass2=6144, clm=2 (8 cpus, 8 workers): 43.58, 42.88, 43.87, 43.57, 43.89, 43.28, 43.12, 43.39 ms. Throughput: 184.14 iter/sec. FFTlen=7680K, Type=3, Arch=4, Pass1=1280, Pass2=6144, clm=1 (8 cpus, 1 worker): 5.39 ms. Throughput: 185.44 iter/sec. FFTlen=7680K, Type=3, Arch=4, Pass1=1280, Pass2=6144, clm=1 (8 cpus, 8 workers): 43.05, 42.54, 42.08, 41.87, 43.72, 42.95, 42.04, 41.93 ms. Throughput: 188.18 iter/sec. FFTlen=7680K, Type=3, Arch=4, Pass1=1536, Pass2=5120, clm=4 (8 cpus, 1 worker): 6.30 ms. Throughput: 158.67 iter/sec. FFTlen=7680K, Type=3, Arch=4, Pass1=1536, Pass2=5120, clm=4 (8 cpus, 8 workers): 48.21, 47.30, 46.87, 46.63, 48.55, 47.96, 46.50, 46.45 ms. Throughput: 169.15 iter/sec. FFTlen=7680K, Type=3, Arch=4, Pass1=1536, Pass2=5120, clm=2 (8 cpus, 1 worker): 5.77 ms. Throughput: 173.38 iter/sec. FFTlen=7680K, Type=3, Arch=4, Pass1=1536, Pass2=5120, clm=2 (8 cpus, 8 workers): 45.66, 44.72, 44.48, 43.65, 46.18, 44.96, 43.85, 44.24 ms. Throughput: 178.96 iter/sec. FFTlen=7680K, Type=3, Arch=4, Pass1=1536, Pass2=5120, clm=1 (8 cpus, 1 worker): 5.57 ms. Throughput: 179.60 iter/sec. FFTlen=7680K, Type=3, Arch=4, Pass1=1536, Pass2=5120, clm=1 (8 cpus, 8 workers): 44.31, 43.97, 43.39, 43.40, 45.04, 44.26, 43.23, 43.48 ms. Throughput: 182.33 iter/sec. FFTlen=7680K, Type=3, Arch=4, Pass1=2048, Pass2=3840, clm=4 (8 cpus, 1 worker): 6.89 ms. Throughput: 145.15 iter/sec. FFTlen=7680K, Type=3, Arch=4, Pass1=2048, Pass2=3840, clm=4 (8 cpus, 8 workers): 51.10, 50.47, 50.31, 49.81, 51.82, 51.09, 49.50, 50.00 ms. Throughput: 158.41 iter/sec. FFTlen=7680K, Type=3, Arch=4, Pass1=2048, Pass2=3840, clm=2 (8 cpus, 1 worker): 6.55 ms. Throughput: 152.56 iter/sec. FFTlen=7680K, Type=3, Arch=4, Pass1=2048, Pass2=3840, clm=2 (8 cpus, 8 workers): 50.36, 49.25, 49.21, 49.14, 50.69, 49.99, 48.71, 48.84 ms. Throughput: 161.57 iter/sec. FFTlen=7680K, Type=3, Arch=4, Pass1=2048, Pass2=3840, clm=1 (8 cpus, 1 worker): 6.41 ms. Throughput: 155.90 iter/sec. FFTlen=7680K, Type=3, Arch=4, Pass1=2048, Pass2=3840, clm=1 (8 cpus, 8 workers): 50.42, 49.24, 49.22, 49.00, 50.81, 50.16, 49.21, 48.95 ms. Throughput: 161.24 iter/sec. FFTlen=8000K, Type=3, Arch=4, Pass1=320, Pass2=25600, clm=4 (8 cpus, 1 worker): 6.50 ms. Throughput: 153.77 iter/sec. FFTlen=8000K, Type=3, Arch=4, Pass1=320, Pass2=25600, clm=4 (8 cpus, 8 workers): 48.71, 48.18, 47.91, 47.65, 48.95, 48.51, 47.45, 47.22 ms. Throughput: 166.44 iter/sec. FFTlen=8000K, Type=3, Arch=4, Pass1=320, Pass2=25600, clm=2 (8 cpus, 1 worker): 6.47 ms. Throughput: 154.48 iter/sec. FFTlen=8000K, Type=3, Arch=4, Pass1=320, Pass2=25600, clm=2 (8 cpus, 8 workers): 48.73, 48.00, 47.78, 47.25, 48.57, 48.33, 47.68, 47.54 ms. Throughput: 166.74 iter/sec. FFTlen=8000K, Type=3, Arch=4, Pass1=320, Pass2=25600, clm=1 (8 cpus, 1 worker): 6.48 ms. Throughput: 154.32 iter/sec. FFTlen=8000K, Type=3, Arch=4, Pass1=320, Pass2=25600, clm=1 (8 cpus, 8 workers): 48.41, 48.26, 47.20, 47.47, 48.64, 48.36, 47.60, 47.63 ms. Throughput: 166.87 iter/sec. FFTlen=8000K, Type=3, Arch=4, Pass1=640, Pass2=12800, clm=4 (8 cpus, 1 worker): 6.44 ms. Throughput: 155.33 iter/sec. FFTlen=8000K, Type=3, Arch=4, Pass1=640, Pass2=12800, clm=4 (8 cpus, 8 workers): 48.58, 47.96, 47.58, 47.25, 48.77, 48.07, 47.51, 47.38 ms. Throughput: 167.08 iter/sec. FFTlen=8000K, Type=3, Arch=4, Pass1=640, Pass2=12800, clm=2 (8 cpus, 1 worker): 6.05 ms. Throughput: 165.28 iter/sec. FFTlen=8000K, Type=3, Arch=4, Pass1=640, Pass2=12800, clm=2 (8 cpus, 8 workers): 46.30, 45.79, 45.45, 45.02, 46.90, 45.97, 44.95, 44.90 ms. Throughput: 175.24 iter/sec. FFTlen=8000K, Type=3, Arch=4, Pass1=640, Pass2=12800, clm=1 (8 cpus, 1 worker): 6.04 ms. Throughput: 165.58 iter/sec. FFTlen=8000K, Type=3, Arch=4, Pass1=640, Pass2=12800, clm=1 (8 cpus, 8 workers): 46.56, 46.12, 45.42, 45.03, 46.73, 46.48, 45.08, 44.80 ms. Throughput: 174.80 iter/sec. FFTlen=8000K, Type=3, Arch=4, Pass1=1280, Pass2=6400, clm=4 (8 cpus, 1 worker): 6.57 ms. Throughput: 152.28 iter/sec. FFTlen=8000K, Type=3, Arch=4, Pass1=1280, Pass2=6400, clm=4 (8 cpus, 8 workers): 50.08, 49.60, 50.04, 49.26, 50.48, 49.99, 49.09, 48.94 ms. Throughput: 161.03 iter/sec. [Sat Apr 29 13:30:37 2017] FFTlen=8000K, Type=3, Arch=4, Pass1=1280, Pass2=6400, clm=2 (8 cpus, 1 worker): 6.04 ms. Throughput: 165.43 iter/sec. FFTlen=8000K, Type=3, Arch=4, Pass1=1280, Pass2=6400, clm=2 (8 cpus, 8 workers): 47.02, 47.36, 46.36, 45.51, 48.14, 47.26, 47.18, 46.02 ms. Throughput: 170.78 iter/sec. FFTlen=8000K, Type=3, Arch=4, Pass1=1280, Pass2=6400, clm=1 (8 cpus, 1 worker): 5.90 ms. Throughput: 169.54 iter/sec. FFTlen=8000K, Type=3, Arch=4, Pass1=1280, Pass2=6400, clm=1 (8 cpus, 8 workers): 46.23, 46.32, 45.71, 45.47, 46.90, 46.29, 45.51, 45.33 ms. Throughput: 174.05 iter/sec. FFTlen=8064K, Type=3, Arch=4, Pass1=896, Pass2=9216, clm=4 (8 cpus, 1 worker): 6.60 ms. Throughput: 151.47 iter/sec. FFTlen=8064K, Type=3, Arch=4, Pass1=896, Pass2=9216, clm=4 (8 cpus, 8 workers): 50.93, 50.40, 49.90, 49.66, 51.28, 50.61, 49.55, 49.59 ms. Throughput: 159.26 iter/sec. FFTlen=8064K, Type=3, Arch=4, Pass1=896, Pass2=9216, clm=2 (8 cpus, 1 worker): 6.20 ms. Throughput: 161.22 iter/sec. FFTlen=8064K, Type=3, Arch=4, Pass1=896, Pass2=9216, clm=2 (8 cpus, 8 workers): 48.72, 48.28, 47.62, 47.29, 49.63, 48.37, 47.28, 47.41 ms. Throughput: 166.44 iter/sec. FFTlen=8064K, Type=3, Arch=4, Pass1=896, Pass2=9216, clm=1 (8 cpus, 1 worker): 6.07 ms. Throughput: 164.77 iter/sec. FFTlen=8064K, Type=3, Arch=4, Pass1=896, Pass2=9216, clm=1 (8 cpus, 8 workers): 48.55, 48.03, 47.15, 46.74, 49.69, 49.59, 47.39, 47.51 ms. Throughput: 166.46 iter/sec. FFTlen=8064K, Type=3, Arch=4, Pass1=1792, Pass2=4608, clm=4 (8 cpus, 1 worker): 6.88 ms. Throughput: 145.42 iter/sec. FFTlen=8064K, Type=3, Arch=4, Pass1=1792, Pass2=4608, clm=4 (8 cpus, 8 workers): 51.62, 51.00, 50.53, 50.05, 51.74, 51.00, 50.07, 50.69 ms. Throughput: 157.39 iter/sec. FFTlen=8064K, Type=3, Arch=4, Pass1=1792, Pass2=4608, clm=2 (8 cpus, 1 worker): 6.29 ms. Throughput: 159.11 iter/sec. FFTlen=8064K, Type=3, Arch=4, Pass1=1792, Pass2=4608, clm=2 (8 cpus, 8 workers): 48.77, 48.37, 47.62, 47.40, 50.79, 48.74, 47.59, 47.05 ms. Throughput: 165.75 iter/sec. FFTlen=8064K, Type=3, Arch=4, Pass1=1792, Pass2=4608, clm=1 (8 cpus, 1 worker): 6.02 ms. Throughput: 165.98 iter/sec. FFTlen=8064K, Type=3, Arch=4, Pass1=1792, Pass2=4608, clm=1 (8 cpus, 8 workers): 47.14, 46.42, 45.99, 45.70, 47.88, 46.88, 45.71, 45.51 ms. Throughput: 172.45 iter/sec. FFTlen=8192K, Type=3, Arch=4, Pass1=512, Pass2=16384, clm=4 (8 cpus, 1 worker): 6.92 ms. Throughput: 144.57 iter/sec. FFTlen=8192K, Type=3, Arch=4, Pass1=512, Pass2=16384, clm=4 (8 cpus, 8 workers): 52.83, 52.06, 51.53, 51.28, 53.51, 52.44, 51.14, 51.21 ms. Throughput: 153.88 iter/sec. FFTlen=8192K, Type=3, Arch=4, Pass1=512, Pass2=16384, clm=2 (8 cpus, 1 worker): 6.63 ms. Throughput: 150.92 iter/sec. FFTlen=8192K, Type=3, Arch=4, Pass1=512, Pass2=16384, clm=2 (8 cpus, 8 workers): 50.68, 49.95, 49.52, 49.29, 51.65, 50.43, 49.03, 49.18 ms. Throughput: 160.15 iter/sec. FFTlen=8192K, Type=3, Arch=4, Pass1=512, Pass2=16384, clm=1 (8 cpus, 1 worker): 6.58 ms. Throughput: 151.98 iter/sec. FFTlen=8192K, Type=3, Arch=4, Pass1=512, Pass2=16384, clm=1 (8 cpus, 8 workers): 50.38, 49.97, 49.56, 49.21, 50.88, 50.57, 49.32, 49.08 ms. Throughput: 160.43 iter/sec. FFTlen=8192K, Type=3, Arch=4, Pass1=1024, Pass2=8192, clm=4 (8 cpus, 1 worker): 6.57 ms. Throughput: 152.29 iter/sec. FFTlen=8192K, Type=3, Arch=4, Pass1=1024, Pass2=8192, clm=4 (8 cpus, 8 workers): 53.15, 52.19, 51.06, 50.68, 50.90, 49.88, 48.88, 48.91 ms. Throughput: 157.89 iter/sec. FFTlen=8192K, Type=3, Arch=4, Pass1=1024, Pass2=8192, clm=2 (8 cpus, 1 worker): 6.02 ms. Throughput: 166.05 iter/sec. FFTlen=8192K, Type=3, Arch=4, Pass1=1024, Pass2=8192, clm=2 (8 cpus, 8 workers): 47.36, 46.79, 46.48, 46.14, 47.74, 47.09, 46.36, 46.08 ms. Throughput: 171.13 iter/sec. FFTlen=8192K, Type=3, Arch=4, Pass1=1024, Pass2=8192, clm=1 (8 cpus, 1 worker): 5.92 ms. Throughput: 168.98 iter/sec. FFTlen=8192K, Type=3, Arch=4, Pass1=1024, Pass2=8192, clm=1 (8 cpus, 8 workers): 46.97, 46.42, 46.12, 45.76, 47.35, 46.76, 45.83, 45.64 ms. Throughput: 172.60 iter/sec. FFTlen=8192K, Type=3, Arch=4, Pass1=2048, Pass2=4096, clm=4 (8 cpus, 1 worker): 7.37 ms. Throughput: 135.61 iter/sec. [Sat Apr 29 13:35:38 2017] FFTlen=8192K, Type=3, Arch=4, Pass1=2048, Pass2=4096, clm=4 (8 cpus, 8 workers): 55.07, 54.53, 53.80, 53.13, 55.38, 54.65, 53.12, 53.62 ms. Throughput: 147.74 iter/sec. FFTlen=8192K, Type=3, Arch=4, Pass1=2048, Pass2=4096, clm=2 (8 cpus, 1 worker): 7.05 ms. Throughput: 141.93 iter/sec. FFTlen=8192K, Type=3, Arch=4, Pass1=2048, Pass2=4096, clm=2 (8 cpus, 8 workers): 54.03, 53.31, 53.05, 52.70, 55.04, 53.57, 53.27, 53.46 ms. Throughput: 149.41 iter/sec. FFTlen=8192K, Type=3, Arch=4, Pass1=2048, Pass2=4096, clm=1 (8 cpus, 1 worker): 7.01 ms. Throughput: 142.57 iter/sec. FFTlen=8192K, Type=3, Arch=4, Pass1=2048, Pass2=4096, clm=1 (8 cpus, 8 workers): 53.67, 54.27, 53.15, 53.21, 55.03, 54.13, 52.96, 53.41 ms. Throughput: 148.92 iter/sec. FFTlen=8960K, Type=3, Arch=4, Pass1=448, Pass2=20480, clm=4 (8 cpus, 1 worker): 7.43 ms. Throughput: 134.64 iter/sec. FFTlen=8960K, Type=3, Arch=4, Pass1=448, Pass2=20480, clm=4 (8 cpus, 8 workers): 56.69, 55.97, 55.61, 55.54, 57.61, 56.67, 55.34, 55.59 ms. Throughput: 142.56 iter/sec. FFTlen=8960K, Type=3, Arch=4, Pass1=448, Pass2=20480, clm=2 (8 cpus, 1 worker): 7.34 ms. Throughput: 136.29 iter/sec. FFTlen=8960K, Type=3, Arch=4, Pass1=448, Pass2=20480, clm=2 (8 cpus, 8 workers): 55.92, 55.04, 54.24, 54.32, 56.72, 55.66, 54.56, 54.43 ms. Throughput: 145.19 iter/sec. FFTlen=8960K, Type=3, Arch=4, Pass1=448, Pass2=20480, clm=1 (8 cpus, 1 worker): 7.32 ms. Throughput: 136.61 iter/sec. FFTlen=8960K, Type=3, Arch=4, Pass1=448, Pass2=20480, clm=1 (8 cpus, 8 workers): 56.47, 55.76, 54.65, 54.83, 56.62, 56.40, 54.38, 54.68 ms. Throughput: 144.25 iter/sec. FFTlen=8960K, Type=3, Arch=4, Pass1=896, Pass2=10240, clm=4 (8 cpus, 1 worker): 7.14 ms. Throughput: 140.15 iter/sec. FFTlen=8960K, Type=3, Arch=4, Pass1=896, Pass2=10240, clm=4 (8 cpus, 8 workers): 55.83, 54.86, 54.29, 54.00, 56.54, 55.33, 53.64, 53.85 ms. Throughput: 146.05 iter/sec. FFTlen=8960K, Type=3, Arch=4, Pass1=896, Pass2=10240, clm=2 (8 cpus, 1 worker): 6.63 ms. Throughput: 150.76 iter/sec. FFTlen=8960K, Type=3, Arch=4, Pass1=896, Pass2=10240, clm=2 (8 cpus, 8 workers): 52.38, 51.97, 51.51, 50.99, 52.56, 52.15, 51.15, 50.87 ms. Throughput: 154.77 iter/sec. FFTlen=8960K, Type=3, Arch=4, Pass1=896, Pass2=10240, clm=1 (8 cpus, 1 worker): 6.54 ms. Throughput: 153.01 iter/sec. FFTlen=8960K, Type=3, Arch=4, Pass1=896, Pass2=10240, clm=1 (8 cpus, 8 workers): 52.27, 51.99, 51.21, 50.48, 52.32, 52.25, 50.72, 50.81 ms. Throughput: 155.35 iter/sec. FFTlen=8960K, Type=3, Arch=4, Pass1=1792, Pass2=5120, clm=4 (8 cpus, 1 worker): 7.65 ms. Throughput: 130.64 iter/sec. FFTlen=8960K, Type=3, Arch=4, Pass1=1792, Pass2=5120, clm=4 (8 cpus, 8 workers): 58.70, 57.83, 57.28, 56.58, 59.28, 57.75, 57.23, 56.98 ms. Throughput: 138.67 iter/sec. FFTlen=8960K, Type=3, Arch=4, Pass1=1792, Pass2=5120, clm=2 (8 cpus, 1 worker): 7.06 ms. Throughput: 141.61 iter/sec. FFTlen=8960K, Type=3, Arch=4, Pass1=1792, Pass2=5120, clm=2 (8 cpus, 8 workers): 54.44, 53.74, 53.22, 52.76, 54.61, 54.40, 53.34, 52.99 ms. Throughput: 149.03 iter/sec. FFTlen=8960K, Type=3, Arch=4, Pass1=1792, Pass2=5120, clm=1 (8 cpus, 1 worker): 6.78 ms. Throughput: 147.45 iter/sec. FFTlen=8960K, Type=3, Arch=4, Pass1=1792, Pass2=5120, clm=1 (8 cpus, 8 workers): 52.94, 52.43, 51.91, 51.60, 53.42, 52.74, 51.44, 51.66 ms. Throughput: 153.08 iter/sec. FFTlen=9216K, Type=3, Arch=4, Pass1=768, Pass2=12288, clm=4 (8 cpus, 1 worker): 7.68 ms. Throughput: 130.29 iter/sec. FFTlen=9216K, Type=3, Arch=4, Pass1=768, Pass2=12288, clm=4 (8 cpus, 8 workers): 58.62, 58.25, 57.85, 57.46, 59.18, 58.30, 57.46, 57.72 ms. Throughput: 137.69 iter/sec. FFTlen=9216K, Type=3, Arch=4, Pass1=768, Pass2=12288, clm=2 (8 cpus, 1 worker): 7.20 ms. Throughput: 138.85 iter/sec. FFTlen=9216K, Type=3, Arch=4, Pass1=768, Pass2=12288, clm=2 (8 cpus, 8 workers): 55.62, 55.26, 54.67, 54.41, 56.64, 55.59, 54.31, 54.23 ms. Throughput: 145.24 iter/sec. FFTlen=9216K, Type=3, Arch=4, Pass1=768, Pass2=12288, clm=1 (8 cpus, 1 worker): 7.13 ms. Throughput: 140.19 iter/sec. FFTlen=9216K, Type=3, Arch=4, Pass1=768, Pass2=12288, clm=1 (8 cpus, 8 workers): 55.49, 54.88, 54.55, 54.22, 56.35, 55.21, 54.41, 54.42 ms. Throughput: 145.63 iter/sec. [Sat Apr 29 13:40:46 2017] FFTlen=9216K, Type=3, Arch=4, Pass1=1024, Pass2=9216, clm=4 (8 cpus, 1 worker): 7.66 ms. Throughput: 130.57 iter/sec. FFTlen=9216K, Type=3, Arch=4, Pass1=1024, Pass2=9216, clm=4 (8 cpus, 8 workers): 58.73, 58.37, 58.15, 57.91, 59.04, 58.43, 57.57, 58.07 ms. Throughput: 137.27 iter/sec. FFTlen=9216K, Type=3, Arch=4, Pass1=1024, Pass2=9216, clm=2 (8 cpus, 1 worker): 7.16 ms. Throughput: 139.65 iter/sec. FFTlen=9216K, Type=3, Arch=4, Pass1=1024, Pass2=9216, clm=2 (8 cpus, 8 workers): 55.77, 55.19, 54.84, 54.57, 56.35, 55.27, 54.36, 54.44 ms. Throughput: 145.22 iter/sec. FFTlen=9216K, Type=3, Arch=4, Pass1=1024, Pass2=9216, clm=1 (8 cpus, 1 worker): 7.03 ms. Throughput: 142.21 iter/sec. FFTlen=9216K, Type=3, Arch=4, Pass1=1024, Pass2=9216, clm=1 (8 cpus, 8 workers): 55.49, 54.70, 54.21, 53.96, 55.59, 55.17, 54.14, 54.04 ms. Throughput: 146.38 iter/sec. FFTlen=9216K, Type=3, Arch=4, Pass1=1536, Pass2=6144, clm=4 (8 cpus, 1 worker): 7.40 ms. Throughput: 135.12 iter/sec. FFTlen=9216K, Type=3, Arch=4, Pass1=1536, Pass2=6144, clm=4 (8 cpus, 8 workers): 56.84, 55.76, 55.28, 54.93, 57.34, 56.16, 55.24, 55.42 ms. Throughput: 143.22 iter/sec. FFTlen=9216K, Type=3, Arch=4, Pass1=1536, Pass2=6144, clm=2 (8 cpus, 1 worker): 6.83 ms. Throughput: 146.52 iter/sec. FFTlen=9216K, Type=3, Arch=4, Pass1=1536, Pass2=6144, clm=2 (8 cpus, 8 workers): 53.28, 52.86, 52.46, 52.24, 53.78, 52.72, 52.41, 52.66 ms. Throughput: 151.52 iter/sec. FFTlen=9216K, Type=3, Arch=4, Pass1=1536, Pass2=6144, clm=1 (8 cpus, 1 worker): 6.57 ms. Throughput: 152.28 iter/sec. FFTlen=9216K, Type=3, Arch=4, Pass1=1536, Pass2=6144, clm=1 (8 cpus, 8 workers): 52.09, 52.09, 51.27, 50.92, 52.60, 52.14, 50.97, 51.21 ms. Throughput: 154.88 iter/sec. FFTlen=9216K, Type=3, Arch=4, Pass1=2048, Pass2=4608, clm=4 (8 cpus, 1 worker): 8.27 ms. Throughput: 120.93 iter/sec. FFTlen=9216K, Type=3, Arch=4, Pass1=2048, Pass2=4608, clm=4 (8 cpus, 8 workers): 61.95, 60.99, 60.56, 59.77, 62.48, 61.40, 59.78, 60.48 ms. Throughput: 131.34 iter/sec. FFTlen=9216K, Type=3, Arch=4, Pass1=2048, Pass2=4608, clm=2 (8 cpus, 1 worker): 7.82 ms. Throughput: 127.83 iter/sec. FFTlen=9216K, Type=3, Arch=4, Pass1=2048, Pass2=4608, clm=2 (8 cpus, 8 workers): 59.89, 59.76, 58.69, 58.22, 60.21, 61.11, 57.98, 58.31 ms. Throughput: 135.02 iter/sec. FFTlen=9216K, Type=3, Arch=4, Pass1=2048, Pass2=4608, clm=1 (8 cpus, 1 worker): 7.50 ms. Throughput: 133.38 iter/sec. FFTlen=9216K, Type=3, Arch=4, Pass1=2048, Pass2=4608, clm=1 (8 cpus, 8 workers): 58.43, 57.79, 57.42, 56.61, 59.18, 58.22, 56.82, 56.51 ms. Throughput: 138.87 iter/sec. FFTlen=9600K, Type=3, Arch=4, Pass1=384, Pass2=25600, clm=4 (8 cpus, 1 worker): 7.76 ms. Throughput: 128.83 iter/sec. FFTlen=9600K, Type=3, Arch=4, Pass1=384, Pass2=25600, clm=4 (8 cpus, 8 workers): 58.51, 58.01, 57.34, 57.50, 59.14, 57.96, 57.11, 57.48 ms. Throughput: 138.23 iter/sec. FFTlen=9600K, Type=3, Arch=4, Pass1=384, Pass2=25600, clm=2 (8 cpus, 1 worker): 7.66 ms. Throughput: 130.56 iter/sec. FFTlen=9600K, Type=3, Arch=4, Pass1=384, Pass2=25600, clm=2 (8 cpus, 8 workers): 58.65, 58.25, 56.93, 56.44, 58.62, 58.18, 56.66, 56.53 ms. Throughput: 139.09 iter/sec. FFTlen=9600K, Type=3, Arch=4, Pass1=384, Pass2=25600, clm=1 (8 cpus, 1 worker): 7.70 ms. Throughput: 129.88 iter/sec. FFTlen=9600K, Type=3, Arch=4, Pass1=384, Pass2=25600, clm=1 (8 cpus, 8 workers): 58.62, 57.82, 57.37, 56.26, 59.48, 58.65, 56.00, 56.58 ms. Throughput: 138.95 iter/sec. FFTlen=9600K, Type=3, Arch=4, Pass1=640, Pass2=15360, clm=4 (8 cpus, 1 worker): 7.90 ms. Throughput: 126.55 iter/sec. FFTlen=9600K, Type=3, Arch=4, Pass1=640, Pass2=15360, clm=4 (8 cpus, 8 workers): 60.71, 59.96, 59.16, 59.18, 60.85, 60.74, 59.56, 59.19 ms. Throughput: 133.53 iter/sec. FFTlen=9600K, Type=3, Arch=4, Pass1=640, Pass2=15360, clm=2 (8 cpus, 1 worker): 7.51 ms. Throughput: 133.23 iter/sec. FFTlen=9600K, Type=3, Arch=4, Pass1=640, Pass2=15360, clm=2 (8 cpus, 8 workers): 57.10, 56.86, 56.17, 55.96, 57.68, 57.13, 56.09, 56.08 ms. Throughput: 141.27 iter/sec. [Sat Apr 29 13:45:49 2017] FFTlen=9600K, Type=3, Arch=4, Pass1=640, Pass2=15360, clm=1 (8 cpus, 1 worker): 7.35 ms. Throughput: 136.11 iter/sec. FFTlen=9600K, Type=3, Arch=4, Pass1=640, Pass2=15360, clm=1 (8 cpus, 8 workers): 57.90, 57.02, 56.41, 56.22, 58.10, 56.64, 55.79, 55.97 ms. Throughput: 140.98 iter/sec. FFTlen=9600K, Type=3, Arch=4, Pass1=768, Pass2=12800, clm=4 (8 cpus, 1 worker): 7.73 ms. Throughput: 129.36 iter/sec. FFTlen=9600K, Type=3, Arch=4, Pass1=768, Pass2=12800, clm=4 (8 cpus, 8 workers): 59.14, 58.64, 57.69, 57.31, 60.03, 59.21, 57.27, 57.47 ms. Throughput: 137.16 iter/sec. FFTlen=9600K, Type=3, Arch=4, Pass1=768, Pass2=12800, clm=2 (8 cpus, 1 worker): 7.23 ms. Throughput: 138.30 iter/sec. FFTlen=9600K, Type=3, Arch=4, Pass1=768, Pass2=12800, clm=2 (8 cpus, 8 workers): 56.10, 55.57, 54.90, 54.10, 56.28, 55.92, 54.21, 54.29 ms. Throughput: 145.04 iter/sec. FFTlen=9600K, Type=3, Arch=4, Pass1=768, Pass2=12800, clm=1 (8 cpus, 1 worker): 7.20 ms. Throughput: 138.89 iter/sec. FFTlen=9600K, Type=3, Arch=4, Pass1=768, Pass2=12800, clm=1 (8 cpus, 8 workers): 55.84, 55.44, 54.73, 54.15, 56.69, 55.14, 54.39, 54.01 ms. Throughput: 145.36 iter/sec. FFTlen=9600K, Type=3, Arch=4, Pass1=1280, Pass2=7680, clm=4 (8 cpus, 1 worker): 7.65 ms. Throughput: 130.68 iter/sec. FFTlen=9600K, Type=3, Arch=4, Pass1=1280, Pass2=7680, clm=4 (8 cpus, 8 workers): 58.98, 58.48, 57.60, 57.00, 60.04, 59.08, 56.81, 56.87 ms. Throughput: 137.73 iter/sec. FFTlen=9600K, Type=3, Arch=4, Pass1=1280, Pass2=7680, clm=2 (8 cpus, 1 worker): 7.08 ms. Throughput: 141.24 iter/sec. FFTlen=9600K, Type=3, Arch=4, Pass1=1280, Pass2=7680, clm=2 (8 cpus, 8 workers): 55.88, 55.48, 54.58, 53.90, 56.62, 55.92, 53.86, 54.25 ms. Throughput: 145.34 iter/sec. FFTlen=9600K, Type=3, Arch=4, Pass1=1280, Pass2=7680, clm=1 (8 cpus, 1 worker): 6.90 ms. Throughput: 144.96 iter/sec. FFTlen=9600K, Type=3, Arch=4, Pass1=1280, Pass2=7680, clm=1 (8 cpus, 8 workers): 54.64, 54.33, 53.56, 53.00, 55.84, 54.62, 52.84, 53.39 ms. Throughput: 148.12 iter/sec. FFTlen=9600K, Type=3, Arch=4, Pass1=1536, Pass2=6400, clm=4 (8 cpus, 1 worker): 8.08 ms. Throughput: 123.77 iter/sec. FFTlen=9600K, Type=3, Arch=4, Pass1=1536, Pass2=6400, clm=4 (8 cpus, 8 workers): 61.71, 61.31, 60.55, 59.88, 62.29, 61.51, 59.82, 59.94 ms. Throughput: 131.44 iter/sec. FFTlen=9600K, Type=3, Arch=4, Pass1=1536, Pass2=6400, clm=2 (8 cpus, 1 worker): 7.45 ms. Throughput: 134.26 iter/sec. FFTlen=9600K, Type=3, Arch=4, Pass1=1536, Pass2=6400, clm=2 (8 cpus, 8 workers): 58.72, 58.32, 57.14, 56.17, 59.56, 58.11, 56.66, 56.28 ms. Throughput: 138.89 iter/sec. FFTlen=9600K, Type=3, Arch=4, Pass1=1536, Pass2=6400, clm=1 (8 cpus, 1 worker): 7.21 ms. Throughput: 138.78 iter/sec. FFTlen=9600K, Type=3, Arch=4, Pass1=1536, Pass2=6400, clm=1 (8 cpus, 8 workers): 56.79, 56.20, 55.77, 55.32, 57.35, 56.76, 55.54, 55.17 ms. Throughput: 142.59 iter/sec. FFTlen=10240K, Type=3, Arch=4, Pass1=512, Pass2=20480, clm=4 (8 cpus, 1 worker): 8.90 ms. Throughput: 112.34 iter/sec. FFTlen=10240K, Type=3, Arch=4, Pass1=512, Pass2=20480, clm=4 (8 cpus, 8 workers): 67.49, 66.58, 65.86, 65.35, 68.16, 67.33, 65.27, 65.39 ms. Throughput: 120.46 iter/sec. FFTlen=10240K, Type=3, Arch=4, Pass1=512, Pass2=20480, clm=2 (8 cpus, 1 worker): 8.50 ms. Throughput: 117.60 iter/sec. FFTlen=10240K, Type=3, Arch=4, Pass1=512, Pass2=20480, clm=2 (8 cpus, 8 workers): 64.20, 63.73, 63.20, 62.50, 64.31, 64.06, 62.80, 62.84 ms. Throughput: 126.09 iter/sec. FFTlen=10240K, Type=3, Arch=4, Pass1=512, Pass2=20480, clm=1 (8 cpus, 1 worker): 8.47 ms. Throughput: 118.12 iter/sec. FFTlen=10240K, Type=3, Arch=4, Pass1=512, Pass2=20480, clm=1 (8 cpus, 8 workers): 64.97, 64.52, 63.03, 62.30, 65.29, 64.69, 62.78, 62.73 ms. Throughput: 125.45 iter/sec. FFTlen=10240K, Type=3, Arch=4, Pass1=640, Pass2=16384, clm=4 (8 cpus, 1 worker): 13.50 ms. Throughput: 74.10 iter/sec. FFTlen=10240K, Type=3, Arch=4, Pass1=640, Pass2=16384, clm=4 (8 cpus, 8 workers): 74.86, 77.05, 70.50, 66.65, 67.01, 66.40, 64.68, 65.28 ms. Throughput: 116.29 iter/sec. [Sat Apr 29 13:50:57 2017] FFTlen=10240K, Type=3, Arch=4, Pass1=640, Pass2=16384, clm=2 (8 cpus, 1 worker): 8.27 ms. Throughput: 120.94 iter/sec. FFTlen=10240K, Type=3, Arch=4, Pass1=640, Pass2=16384, clm=2 (8 cpus, 8 workers): 64.06, 63.24, 62.50, 62.12, 64.53, 63.80, 61.91, 62.15 ms. Throughput: 126.94 iter/sec. FFTlen=10240K, Type=3, Arch=4, Pass1=640, Pass2=16384, clm=1 (8 cpus, 1 worker): 8.24 ms. Throughput: 121.38 iter/sec. FFTlen=10240K, Type=3, Arch=4, Pass1=640, Pass2=16384, clm=1 (8 cpus, 8 workers): 63.36, 62.94, 62.29, 62.13, 63.83, 63.08, 62.06, 61.71 ms. Throughput: 127.66 iter/sec. FFTlen=10240K, Type=3, Arch=4, Pass1=1024, Pass2=10240, clm=4 (8 cpus, 1 worker): 8.34 ms. Throughput: 119.96 iter/sec. FFTlen=10240K, Type=3, Arch=4, Pass1=1024, Pass2=10240, clm=4 (8 cpus, 8 workers): 63.82, 63.72, 63.07, 62.92, 64.47, 63.53, 63.11, 62.89 ms. Throughput: 126.11 iter/sec. FFTlen=10240K, Type=3, Arch=4, Pass1=1024, Pass2=10240, clm=2 (8 cpus, 1 worker): 7.64 ms. Throughput: 130.94 iter/sec. FFTlen=10240K, Type=3, Arch=4, Pass1=1024, Pass2=10240, clm=2 (8 cpus, 8 workers): 60.03, 59.32, 58.82, 58.64, 60.91, 59.75, 58.65, 58.72 ms. Throughput: 134.80 iter/sec. FFTlen=10240K, Type=3, Arch=4, Pass1=1024, Pass2=10240, clm=1 (8 cpus, 1 worker): 7.46 ms. Throughput: 134.06 iter/sec. FFTlen=10240K, Type=3, Arch=4, Pass1=1024, Pass2=10240, clm=1 (8 cpus, 8 workers): 59.73, 59.42, 58.80, 58.21, 61.11, 60.67, 57.95, 58.13 ms. Throughput: 135.07 iter/sec. FFTlen=10240K, Type=3, Arch=4, Pass1=1280, Pass2=8192, clm=4 (8 cpus, 1 worker): 8.21 ms. Throughput: 121.80 iter/sec. FFTlen=10240K, Type=3, Arch=4, Pass1=1280, Pass2=8192, clm=4 (8 cpus, 8 workers): 63.51, 62.43, 62.15, 61.05, 65.10, 62.64, 61.70, 61.02 ms. Throughput: 128.15 iter/sec. FFTlen=10240K, Type=3, Arch=4, Pass1=1280, Pass2=8192, clm=2 (8 cpus, 1 worker): 7.61 ms. Throughput: 131.34 iter/sec. FFTlen=10240K, Type=3, Arch=4, Pass1=1280, Pass2=8192, clm=2 (8 cpus, 8 workers): 59.73, 59.79, 58.83, 57.87, 61.05, 62.32, 58.72, 58.24 ms. Throughput: 134.37 iter/sec. FFTlen=10240K, Type=3, Arch=4, Pass1=1280, Pass2=8192, clm=1 (8 cpus, 1 worker): 7.45 ms. Throughput: 134.15 iter/sec. FFTlen=10240K, Type=3, Arch=4, Pass1=1280, Pass2=8192, clm=1 (8 cpus, 8 workers): 58.67, 58.14, 57.78, 57.16, 59.88, 59.47, 57.55, 57.28 ms. Throughput: 137.40 iter/sec. FFTlen=10240K, Type=3, Arch=4, Pass1=2048, Pass2=5120, clm=4 (8 cpus, 1 worker): 9.22 ms. Throughput: 108.48 iter/sec. FFTlen=10240K, Type=3, Arch=4, Pass1=2048, Pass2=5120, clm=4 (8 cpus, 8 workers): 69.52, 68.61, 68.08, 67.44, 70.62, 69.64, 67.50, 67.81 ms. Throughput: 116.56 iter/sec. FFTlen=10240K, Type=3, Arch=4, Pass1=2048, Pass2=5120, clm=2 (8 cpus, 1 worker): 8.70 ms. Throughput: 114.95 iter/sec. FFTlen=10240K, Type=3, Arch=4, Pass1=2048, Pass2=5120, clm=2 (8 cpus, 8 workers): 66.82, 66.10, 65.26, 64.90, 67.01, 66.31, 64.66, 64.82 ms. Throughput: 121.72 iter/sec. FFTlen=10240K, Type=3, Arch=4, Pass1=2048, Pass2=5120, clm=1 (8 cpus, 1 worker): 9.32 ms. Throughput: 107.24 iter/sec. FFTlen=10240K, Type=3, Arch=4, Pass1=2048, Pass2=5120, clm=1 (8 cpus, 8 workers): 64.14, 63.78, 63.37, 63.44, 65.07, 64.02, 62.81, 62.97 ms. Throughput: 125.60 iter/sec. FFTlen=10752K, Type=3, Arch=4, Pass1=896, Pass2=12288, clm=4 (8 cpus, 1 worker): 9.10 ms. Throughput: 109.85 iter/sec. FFTlen=10752K, Type=3, Arch=4, Pass1=896, Pass2=12288, clm=4 (8 cpus, 8 workers): 70.20, 69.51, 68.79, 68.32, 70.80, 69.47, 68.46, 68.78 ms. Throughput: 115.47 iter/sec. FFTlen=10752K, Type=3, Arch=4, Pass1=896, Pass2=12288, clm=2 (8 cpus, 1 worker): 8.46 ms. Throughput: 118.26 iter/sec. FFTlen=10752K, Type=3, Arch=4, Pass1=896, Pass2=12288, clm=2 (8 cpus, 8 workers): 66.71, 65.76, 64.85, 64.55, 67.58, 65.94, 64.56, 64.63 ms. Throughput: 122.03 iter/sec. FFTlen=10752K, Type=3, Arch=4, Pass1=896, Pass2=12288, clm=1 (8 cpus, 1 worker): 8.40 ms. Throughput: 119.12 iter/sec. FFTlen=10752K, Type=3, Arch=4, Pass1=896, Pass2=12288, clm=1 (8 cpus, 8 workers): 65.98, 65.42, 64.69, 64.05, 66.90, 65.93, 63.88, 64.28 ms. Throughput: 122.84 iter/sec. [Sat Apr 29 13:56:04 2017] FFTlen=10752K, Type=3, Arch=4, Pass1=1792, Pass2=6144, clm=4 (8 cpus, 1 worker): 9.03 ms. Throughput: 110.77 iter/sec. FFTlen=10752K, Type=3, Arch=4, Pass1=1792, Pass2=6144, clm=4 (8 cpus, 8 workers): 69.12, 68.26, 67.85, 67.59, 69.59, 69.17, 67.49, 67.77 ms. Throughput: 117.05 iter/sec. FFTlen=10752K, Type=3, Arch=4, Pass1=1792, Pass2=6144, clm=2 (8 cpus, 1 worker): 8.39 ms. Throughput: 119.12 iter/sec. FFTlen=10752K, Type=3, Arch=4, Pass1=1792, Pass2=6144, clm=2 (8 cpus, 8 workers): 65.58, 65.60, 63.98, 64.05, 65.87, 64.93, 63.62, 64.37 ms. Throughput: 123.57 iter/sec. FFTlen=10752K, Type=3, Arch=4, Pass1=1792, Pass2=6144, clm=1 (8 cpus, 1 worker): 7.97 ms. Throughput: 125.42 iter/sec. FFTlen=10752K, Type=3, Arch=4, Pass1=1792, Pass2=6144, clm=1 (8 cpus, 8 workers): 64.35, 62.81, 62.69, 62.31, 63.61, 63.19, 61.89, 62.27 ms. Throughput: 127.22 iter/sec. FFTlen=11200K, Type=3, Arch=4, Pass1=448, Pass2=25600, clm=4 (8 cpus, 1 worker): 9.14 ms. Throughput: 109.45 iter/sec. FFTlen=11200K, Type=3, Arch=4, Pass1=448, Pass2=25600, clm=4 (8 cpus, 8 workers): 69.60, 68.69, 67.75, 67.51, 69.74, 68.99, 68.08, 67.88 ms. Throughput: 116.75 iter/sec. FFTlen=11200K, Type=3, Arch=4, Pass1=448, Pass2=25600, clm=2 (8 cpus, 1 worker): 8.98 ms. Throughput: 111.34 iter/sec. FFTlen=11200K, Type=3, Arch=4, Pass1=448, Pass2=25600, clm=2 (8 cpus, 8 workers): 69.10, 67.89, 67.08, 66.12, 69.32, 68.75, 66.11, 66.86 ms. Throughput: 118.28 iter/sec. FFTlen=11200K, Type=3, Arch=4, Pass1=448, Pass2=25600, clm=1 (8 cpus, 1 worker): 9.18 ms. Throughput: 108.98 iter/sec. FFTlen=11200K, Type=3, Arch=4, Pass1=448, Pass2=25600, clm=1 (8 cpus, 8 workers): 69.44, 68.79, 67.92, 67.62, 69.87, 68.38, 67.38, 67.58 ms. Throughput: 117.02 iter/sec. FFTlen=11200K, Type=3, Arch=4, Pass1=896, Pass2=12800, clm=4 (8 cpus, 1 worker): 9.16 ms. Throughput: 109.15 iter/sec. FFTlen=11200K, Type=3, Arch=4, Pass1=896, Pass2=12800, clm=4 (8 cpus, 8 workers): 70.57, 69.89, 69.37, 68.54, 73.09, 72.00, 68.65, 68.20 ms. Throughput: 114.28 iter/sec. FFTlen=11200K, Type=3, Arch=4, Pass1=896, Pass2=12800, clm=2 (8 cpus, 1 worker): 8.73 ms. Throughput: 114.55 iter/sec. FFTlen=11200K, Type=3, Arch=4, Pass1=896, Pass2=12800, clm=2 (8 cpus, 8 workers): 66.15, 65.27, 64.76, 64.63, 66.51, 65.47, 64.72, 64.73 ms. Throughput: 122.56 iter/sec. FFTlen=11200K, Type=3, Arch=4, Pass1=896, Pass2=12800, clm=1 (8 cpus, 1 worker): 8.45 ms. Throughput: 118.32 iter/sec. FFTlen=11200K, Type=3, Arch=4, Pass1=896, Pass2=12800, clm=1 (8 cpus, 8 workers): 66.31, 65.62, 65.00, 64.58, 67.15, 66.40, 64.38, 64.73 ms. Throughput: 122.12 iter/sec. FFTlen=11200K, Type=3, Arch=4, Pass1=1792, Pass2=6400, clm=4 (8 cpus, 1 worker): 9.77 ms. Throughput: 102.32 iter/sec. FFTlen=11200K, Type=3, Arch=4, Pass1=1792, Pass2=6400, clm=4 (8 cpus, 8 workers): 74.56, 74.14, 73.18, 72.67, 75.40, 74.18, 73.08, 72.94 ms. Throughput: 108.46 iter/sec. FFTlen=11200K, Type=3, Arch=4, Pass1=1792, Pass2=6400, clm=2 (8 cpus, 1 worker): 9.06 ms. Throughput: 110.40 iter/sec. FFTlen=11200K, Type=3, Arch=4, Pass1=1792, Pass2=6400, clm=2 (8 cpus, 8 workers): 69.72, 69.31, 68.54, 68.46, 70.40, 70.74, 68.00, 69.70 ms. Throughput: 115.36 iter/sec. FFTlen=11200K, Type=3, Arch=4, Pass1=1792, Pass2=6400, clm=1 (8 cpus, 1 worker): 8.66 ms. Throughput: 115.51 iter/sec. FFTlen=11200K, Type=3, Arch=4, Pass1=1792, Pass2=6400, clm=1 (8 cpus, 8 workers): 67.87, 67.21, 66.88, 66.01, 69.17, 68.91, 66.81, 66.46 ms. Throughput: 118.69 iter/sec. FFTlen=11520K, Type=3, Arch=4, Pass1=768, Pass2=15360, clm=4 (8 cpus, 1 worker): 9.53 ms. Throughput: 104.94 iter/sec. FFTlen=11520K, Type=3, Arch=4, Pass1=768, Pass2=15360, clm=4 (8 cpus, 8 workers): 73.39, 72.70, 72.16, 71.92, 74.23, 72.90, 71.52, 72.02 ms. Throughput: 110.20 iter/sec. FFTlen=11520K, Type=3, Arch=4, Pass1=768, Pass2=15360, clm=2 (8 cpus, 1 worker): 8.94 ms. Throughput: 111.89 iter/sec. [Sat Apr 29 14:01:08 2017] FFTlen=11520K, Type=3, Arch=4, Pass1=768, Pass2=15360, clm=2 (8 cpus, 8 workers): 69.52, 69.00, 68.01, 67.47, 70.11, 69.50, 67.46, 67.78 ms. Throughput: 116.63 iter/sec. FFTlen=11520K, Type=3, Arch=4, Pass1=768, Pass2=15360, clm=1 (8 cpus, 1 worker): 8.85 ms. Throughput: 112.98 iter/sec. FFTlen=11520K, Type=3, Arch=4, Pass1=768, Pass2=15360, clm=1 (8 cpus, 8 workers): 69.14, 68.38, 67.69, 67.47, 70.51, 68.72, 66.87, 67.53 ms. Throughput: 117.18 iter/sec. FFTlen=11520K, Type=3, Arch=4, Pass1=1280, Pass2=9216, clm=4 (8 cpus, 1 worker): 9.49 ms. Throughput: 105.35 iter/sec. FFTlen=11520K, Type=3, Arch=4, Pass1=1280, Pass2=9216, clm=4 (8 cpus, 8 workers): 73.49, 73.70, 72.95, 72.01, 75.20, 73.15, 71.57, 71.87 ms. Throughput: 109.63 iter/sec. FFTlen=11520K, Type=3, Arch=4, Pass1=1280, Pass2=9216, clm=2 (8 cpus, 1 worker): 8.98 ms. Throughput: 111.40 iter/sec. FFTlen=11520K, Type=3, Arch=4, Pass1=1280, Pass2=9216, clm=2 (8 cpus, 8 workers): 70.68, 70.12, 69.51, 69.03, 71.68, 70.82, 69.13, 69.50 ms. Throughput: 114.21 iter/sec. FFTlen=11520K, Type=3, Arch=4, Pass1=1280, Pass2=9216, clm=1 (8 cpus, 1 worker): 8.93 ms. Throughput: 112.01 iter/sec. FFTlen=11520K, Type=3, Arch=4, Pass1=1280, Pass2=9216, clm=1 (8 cpus, 8 workers): 70.47, 69.81, 69.07, 68.66, 70.82, 70.03, 68.96, 68.83 ms. Throughput: 114.99 iter/sec. FFTlen=11520K, Type=3, Arch=4, Pass1=1536, Pass2=7680, clm=4 (8 cpus, 1 worker): 9.40 ms. Throughput: 106.38 iter/sec. FFTlen=11520K, Type=3, Arch=4, Pass1=1536, Pass2=7680, clm=4 (8 cpus, 8 workers): 73.34, 72.45, 71.30, 72.05, 74.04, 73.01, 70.61, 72.34 ms. Throughput: 110.53 iter/sec. FFTlen=11520K, Type=3, Arch=4, Pass1=1536, Pass2=7680, clm=2 (8 cpus, 1 worker): 8.70 ms. Throughput: 115.01 iter/sec. FFTlen=11520K, Type=3, Arch=4, Pass1=1536, Pass2=7680, clm=2 (8 cpus, 8 workers): 68.16, 67.66, 67.33, 67.10, 68.94, 67.90, 66.14, 66.66 ms. Throughput: 118.56 iter/sec. FFTlen=11520K, Type=3, Arch=4, Pass1=1536, Pass2=7680, clm=1 (8 cpus, 1 worker): 8.39 ms. Throughput: 119.16 iter/sec. FFTlen=11520K, Type=3, Arch=4, Pass1=1536, Pass2=7680, clm=1 (8 cpus, 8 workers): 67.87, 66.95, 66.15, 65.14, 68.22, 67.37, 65.18, 65.12 ms. Throughput: 120.34 iter/sec. FFTlen=12288K, Type=3, Arch=4, Pass1=768, Pass2=16384, clm=4 (8 cpus, 1 worker): 10.47 ms. Throughput: 95.54 iter/sec. FFTlen=12288K, Type=3, Arch=4, Pass1=768, Pass2=16384, clm=4 (8 cpus, 8 workers): 80.58, 79.91, 78.87, 78.59, 81.08, 80.12, 78.75, 79.02 ms. Throughput: 100.49 iter/sec. FFTlen=12288K, Type=3, Arch=4, Pass1=768, Pass2=16384, clm=2 (8 cpus, 1 worker): 9.88 ms. Throughput: 101.18 iter/sec. FFTlen=12288K, Type=3, Arch=4, Pass1=768, Pass2=16384, clm=2 (8 cpus, 8 workers): 77.36, 76.13, 75.26, 74.84, 78.17, 76.61, 74.92, 75.08 ms. Throughput: 105.23 iter/sec. FFTlen=12288K, Type=3, Arch=4, Pass1=768, Pass2=16384, clm=1 (8 cpus, 1 worker): 9.85 ms. Throughput: 101.48 iter/sec. FFTlen=12288K, Type=3, Arch=4, Pass1=768, Pass2=16384, clm=1 (8 cpus, 8 workers): 76.92, 76.07, 75.20, 74.49, 77.00, 76.81, 74.60, 74.79 ms. Throughput: 105.65 iter/sec. FFTlen=12288K, Type=3, Arch=4, Pass1=1024, Pass2=12288, clm=4 (8 cpus, 1 worker): 10.52 ms. Throughput: 95.02 iter/sec. FFTlen=12288K, Type=3, Arch=4, Pass1=1024, Pass2=12288, clm=4 (8 cpus, 8 workers): 81.70, 83.63, 79.70, 79.92, 82.54, 81.21, 79.69, 79.85 ms. Throughput: 98.76 iter/sec. FFTlen=12288K, Type=3, Arch=4, Pass1=1024, Pass2=12288, clm=2 (8 cpus, 1 worker): 9.80 ms. Throughput: 102.08 iter/sec. FFTlen=12288K, Type=3, Arch=4, Pass1=1024, Pass2=12288, clm=2 (8 cpus, 8 workers): 76.45, 75.65, 74.73, 74.09, 77.25, 76.10, 74.43, 74.37 ms. Throughput: 106.15 iter/sec. FFTlen=12288K, Type=3, Arch=4, Pass1=1024, Pass2=12288, clm=1 (8 cpus, 1 worker): 9.59 ms. Throughput: 104.29 iter/sec. FFTlen=12288K, Type=3, Arch=4, Pass1=1024, Pass2=12288, clm=1 (8 cpus, 8 workers): 75.69, 75.07, 74.34, 73.98, 76.92, 75.81, 73.54, 73.75 ms. Throughput: 106.85 iter/sec. [Sat Apr 29 14:06:18 2017] FFTlen=12288K, Type=3, Arch=4, Pass1=1536, Pass2=8192, clm=4 (8 cpus, 1 worker): 10.18 ms. Throughput: 98.28 iter/sec. FFTlen=12288K, Type=3, Arch=4, Pass1=1536, Pass2=8192, clm=4 (8 cpus, 8 workers): 78.21, 77.02, 75.88, 75.41, 79.46, 78.15, 75.18, 75.39 ms. Throughput: 104.16 iter/sec. FFTlen=12288K, Type=3, Arch=4, Pass1=1536, Pass2=8192, clm=2 (8 cpus, 1 worker): 9.34 ms. Throughput: 107.10 iter/sec. FFTlen=12288K, Type=3, Arch=4, Pass1=1536, Pass2=8192, clm=2 (8 cpus, 8 workers): 72.45, 72.25, 71.38, 70.94, 73.58, 72.08, 70.72, 71.94 ms. Throughput: 111.25 iter/sec. FFTlen=12288K, Type=3, Arch=4, Pass1=1536, Pass2=8192, clm=1 (8 cpus, 1 worker): 9.02 ms. Throughput: 110.83 iter/sec. FFTlen=12288K, Type=3, Arch=4, Pass1=1536, Pass2=8192, clm=1 (8 cpus, 8 workers): 71.60, 71.69, 70.08, 69.27, 72.60, 72.16, 69.35, 69.31 ms. Throughput: 113.10 iter/sec. FFTlen=12288K, Type=3, Arch=4, Pass1=2048, Pass2=6144, clm=4 (8 cpus, 1 worker): 10.96 ms. Throughput: 91.25 iter/sec. FFTlen=12288K, Type=3, Arch=4, Pass1=2048, Pass2=6144, clm=4 (8 cpus, 8 workers): 83.85, 82.70, 81.80, 81.14, 83.98, 82.64, 80.56, 80.76 ms. Throughput: 97.37 iter/sec. FFTlen=12288K, Type=3, Arch=4, Pass1=2048, Pass2=6144, clm=2 (8 cpus, 1 worker): 10.59 ms. Throughput: 94.40 iter/sec. FFTlen=12288K, Type=3, Arch=4, Pass1=2048, Pass2=6144, clm=2 (8 cpus, 8 workers): 81.86, 80.76, 80.38, 80.08, 84.21, 83.19, 80.77, 80.50 ms. Throughput: 98.22 iter/sec. FFTlen=12288K, Type=3, Arch=4, Pass1=2048, Pass2=6144, clm=1 (8 cpus, 1 worker): 10.21 ms. Throughput: 97.92 iter/sec. FFTlen=12288K, Type=3, Arch=4, Pass1=2048, Pass2=6144, clm=1 (8 cpus, 8 workers): 80.21, 79.88, 79.84, 77.83, 81.61, 81.21, 78.55, 78.95 ms. Throughput: 100.32 iter/sec. FFTlen=12800K, Type=3, Arch=4, Pass1=512, Pass2=25600, clm=4 (8 cpus, 1 worker): 10.70 ms. Throughput: 93.45 iter/sec. FFTlen=12800K, Type=3, Arch=4, Pass1=512, Pass2=25600, clm=4 (8 cpus, 8 workers): 83.04, 82.05, 80.89, 80.65, 82.62, 82.42, 80.52, 80.14 ms. Throughput: 98.13 iter/sec. FFTlen=12800K, Type=3, Arch=4, Pass1=512, Pass2=25600, clm=2 (8 cpus, 1 worker): 10.35 ms. Throughput: 96.64 iter/sec. FFTlen=12800K, Type=3, Arch=4, Pass1=512, Pass2=25600, clm=2 (8 cpus, 8 workers): 79.08, 78.25, 77.13, 76.61, 80.20, 78.12, 77.07, 76.43 ms. Throughput: 102.77 iter/sec. FFTlen=12800K, Type=3, Arch=4, Pass1=512, Pass2=25600, clm=1 (8 cpus, 1 worker): 10.31 ms. Throughput: 97.04 iter/sec. FFTlen=12800K, Type=3, Arch=4, Pass1=512, Pass2=25600, clm=1 (8 cpus, 8 workers): 78.95, 78.54, 77.34, 76.57, 80.14, 78.16, 77.52, 76.48 ms. Throughput: 102.64 iter/sec. FFTlen=12800K, Type=3, Arch=4, Pass1=640, Pass2=20480, clm=4 (8 cpus, 1 worker): 11.14 ms. Throughput: 89.77 iter/sec. FFTlen=12800K, Type=3, Arch=4, Pass1=640, Pass2=20480, clm=4 (8 cpus, 8 workers): 86.03, 84.65, 83.13, 83.29, 86.41, 85.02, 82.44, 82.78 ms. Throughput: 95.02 iter/sec. FFTlen=12800K, Type=3, Arch=4, Pass1=640, Pass2=20480, clm=2 (8 cpus, 1 worker): 10.58 ms. Throughput: 94.50 iter/sec. FFTlen=12800K, Type=3, Arch=4, Pass1=640, Pass2=20480, clm=2 (8 cpus, 8 workers): 80.99, 80.77, 79.64, 79.13, 82.35, 80.71, 79.14, 79.26 ms. Throughput: 99.71 iter/sec. FFTlen=12800K, Type=3, Arch=4, Pass1=640, Pass2=20480, clm=1 (8 cpus, 1 worker): 10.57 ms. Throughput: 94.59 iter/sec. FFTlen=12800K, Type=3, Arch=4, Pass1=640, Pass2=20480, clm=1 (8 cpus, 8 workers): 81.44, 80.28, 79.82, 78.80, 82.23, 80.49, 78.71, 79.28 ms. Throughput: 99.86 iter/sec. FFTlen=12800K, Type=3, Arch=4, Pass1=1024, Pass2=12800, clm=4 (8 cpus, 1 worker): 10.67 ms. Throughput: 93.76 iter/sec. FFTlen=12800K, Type=3, Arch=4, Pass1=1024, Pass2=12800, clm=4 (8 cpus, 8 workers): 81.09, 81.41, 80.29, 79.85, 82.40, 81.08, 80.41, 81.25 ms. Throughput: 98.81 iter/sec. [Sat Apr 29 14:11:23 2017] FFTlen=12800K, Type=3, Arch=4, Pass1=1024, Pass2=12800, clm=2 (8 cpus, 1 worker): 9.81 ms. Throughput: 101.94 iter/sec. FFTlen=12800K, Type=3, Arch=4, Pass1=1024, Pass2=12800, clm=2 (8 cpus, 8 workers): 76.64, 75.83, 75.20, 74.15, 77.30, 76.18, 74.13, 74.49 ms. Throughput: 106.00 iter/sec. FFTlen=12800K, Type=3, Arch=4, Pass1=1024, Pass2=12800, clm=1 (8 cpus, 1 worker): 9.73 ms. Throughput: 102.77 iter/sec. FFTlen=12800K, Type=3, Arch=4, Pass1=1024, Pass2=12800, clm=1 (8 cpus, 8 workers): 75.54, 74.91, 74.13, 73.53, 76.15, 75.60, 73.51, 73.54 ms. Throughput: 107.24 iter/sec. FFTlen=12800K, Type=3, Arch=4, Pass1=1280, Pass2=10240, clm=4 (8 cpus, 1 worker): 10.35 ms. Throughput: 96.64 iter/sec. FFTlen=12800K, Type=3, Arch=4, Pass1=1280, Pass2=10240, clm=4 (8 cpus, 8 workers): 80.18, 79.41, 79.24, 79.19, 81.76, 80.65, 78.06, 81.70 ms. Throughput: 99.99 iter/sec. FFTlen=12800K, Type=3, Arch=4, Pass1=1280, Pass2=10240, clm=2 (8 cpus, 1 worker): 9.63 ms. Throughput: 103.82 iter/sec. FFTlen=12800K, Type=3, Arch=4, Pass1=1280, Pass2=10240, clm=2 (8 cpus, 8 workers): 76.28, 75.94, 75.39, 74.28, 76.94, 76.54, 74.48, 74.80 ms. Throughput: 105.86 iter/sec. FFTlen=12800K, Type=3, Arch=4, Pass1=1280, Pass2=10240, clm=1 (8 cpus, 1 worker): 9.53 ms. Throughput: 104.91 iter/sec. FFTlen=12800K, Type=3, Arch=4, Pass1=1280, Pass2=10240, clm=1 (8 cpus, 8 workers): 74.62, 73.92, 73.07, 72.80, 75.22, 73.94, 73.04, 73.30 ms. Throughput: 108.50 iter/sec. FFTlen=12800K, Type=3, Arch=4, Pass1=2048, Pass2=6400, clm=4 (8 cpus, 1 worker): 11.79 ms. Throughput: 84.81 iter/sec. FFTlen=12800K, Type=3, Arch=4, Pass1=2048, Pass2=6400, clm=4 (8 cpus, 8 workers): 89.61, 88.84, 87.47, 87.74, 90.57, 88.91, 86.78, 87.75 ms. Throughput: 90.45 iter/sec. FFTlen=12800K, Type=3, Arch=4, Pass1=2048, Pass2=6400, clm=2 (8 cpus, 1 worker): 11.26 ms. Throughput: 88.81 iter/sec. FFTlen=12800K, Type=3, Arch=4, Pass1=2048, Pass2=6400, clm=2 (8 cpus, 8 workers): 86.86, 85.54, 84.64, 84.21, 87.00, 86.61, 84.13, 84.70 ms. Throughput: 93.63 iter/sec. FFTlen=12800K, Type=3, Arch=4, Pass1=2048, Pass2=6400, clm=1 (8 cpus, 1 worker): 10.86 ms. Throughput: 92.06 iter/sec. FFTlen=12800K, Type=3, Arch=4, Pass1=2048, Pass2=6400, clm=1 (8 cpus, 8 workers): 86.10, 85.87, 84.64, 82.25, 86.88, 85.04, 83.58, 83.21 ms. Throughput: 94.48 iter/sec. FFTlen=13440K, Type=3, Arch=4, Pass1=896, Pass2=15360, clm=4 (8 cpus, 1 worker): 11.22 ms. Throughput: 89.13 iter/sec. FFTlen=13440K, Type=3, Arch=4, Pass1=896, Pass2=15360, clm=4 (8 cpus, 8 workers): 94.95, 89.41, 88.67, 87.04, 88.27, 87.63, 85.46, 85.67 ms. Throughput: 90.60 iter/sec. FFTlen=13440K, Type=3, Arch=4, Pass1=896, Pass2=15360, clm=2 (8 cpus, 1 worker): 10.48 ms. Throughput: 95.42 iter/sec. FFTlen=13440K, Type=3, Arch=4, Pass1=896, Pass2=15360, clm=2 (8 cpus, 8 workers): 82.33, 81.41, 80.64, 80.44, 83.14, 82.03, 80.30, 80.45 ms. Throughput: 98.36 iter/sec. FFTlen=13440K, Type=3, Arch=4, Pass1=896, Pass2=15360, clm=1 (8 cpus, 1 worker): 10.45 ms. Throughput: 95.74 iter/sec. FFTlen=13440K, Type=3, Arch=4, Pass1=896, Pass2=15360, clm=1 (8 cpus, 8 workers): 82.03, 81.03, 80.47, 79.59, 82.98, 81.49, 79.48, 80.32 ms. Throughput: 98.88 iter/sec. FFTlen=13440K, Type=3, Arch=4, Pass1=1792, Pass2=7680, clm=4 (8 cpus, 1 worker): 11.44 ms. Throughput: 87.43 iter/sec. FFTlen=13440K, Type=3, Arch=4, Pass1=1792, Pass2=7680, clm=4 (8 cpus, 8 workers): 88.67, 87.66, 86.53, 86.63, 89.67, 88.69, 86.42, 86.83 ms. Throughput: 91.30 iter/sec. FFTlen=13440K, Type=3, Arch=4, Pass1=1792, Pass2=7680, clm=2 (8 cpus, 1 worker): 10.57 ms. Throughput: 94.63 iter/sec. FFTlen=13440K, Type=3, Arch=4, Pass1=1792, Pass2=7680, clm=2 (8 cpus, 8 workers): 84.94, 82.48, 81.53, 81.88, 85.45, 84.25, 81.02, 82.43 ms. Throughput: 96.42 iter/sec. [Sat Apr 29 14:16:27 2017] FFTlen=13440K, Type=3, Arch=4, Pass1=1792, Pass2=7680, clm=1 (8 cpus, 1 worker): 10.18 ms. Throughput: 98.22 iter/sec. FFTlen=13440K, Type=3, Arch=4, Pass1=1792, Pass2=7680, clm=1 (8 cpus, 8 workers): 81.44, 79.97, 78.76, 78.83, 82.15, 80.34, 79.29, 79.19 ms. Throughput: 100.03 iter/sec. FFTlen=13824K, Type=3, Arch=4, Pass1=1536, Pass2=9216, clm=4 (8 cpus, 1 worker): 11.78 ms. Throughput: 84.92 iter/sec. FFTlen=13824K, Type=3, Arch=4, Pass1=1536, Pass2=9216, clm=4 (8 cpus, 8 workers): 91.95, 90.52, 89.30, 89.33, 93.58, 90.86, 89.07, 89.75 ms. Throughput: 88.38 iter/sec. FFTlen=13824K, Type=3, Arch=4, Pass1=1536, Pass2=9216, clm=2 (8 cpus, 1 worker): 11.07 ms. Throughput: 90.36 iter/sec. FFTlen=13824K, Type=3, Arch=4, Pass1=1536, Pass2=9216, clm=2 (8 cpus, 8 workers): 86.79, 85.75, 84.97, 85.23, 87.89, 85.75, 84.98, 84.86 ms. Throughput: 93.28 iter/sec. FFTlen=13824K, Type=3, Arch=4, Pass1=1536, Pass2=9216, clm=1 (8 cpus, 1 worker): 10.74 ms. Throughput: 93.10 iter/sec. FFTlen=13824K, Type=3, Arch=4, Pass1=1536, Pass2=9216, clm=1 (8 cpus, 8 workers): 85.88, 85.07, 84.76, 84.27, 86.39, 85.05, 85.65, 83.77 ms. Throughput: 94.01 iter/sec. FFTlen=14336K, Type=3, Arch=4, Pass1=896, Pass2=16384, clm=4 (8 cpus, 1 worker): 12.49 ms. Throughput: 80.07 iter/sec. FFTlen=14336K, Type=3, Arch=4, Pass1=896, Pass2=16384, clm=4 (8 cpus, 8 workers): 96.35, 95.54, 94.07, 93.75, 98.34, 97.36, 93.50, 94.32 ms. Throughput: 83.88 iter/sec. FFTlen=14336K, Type=3, Arch=4, Pass1=896, Pass2=16384, clm=2 (8 cpus, 1 worker): 11.66 ms. Throughput: 85.76 iter/sec. FFTlen=14336K, Type=3, Arch=4, Pass1=896, Pass2=16384, clm=2 (8 cpus, 8 workers): 90.69, 89.58, 89.10, 89.07, 91.13, 90.44, 88.97, 89.14 ms. Throughput: 89.13 iter/sec. FFTlen=14336K, Type=3, Arch=4, Pass1=896, Pass2=16384, clm=1 (8 cpus, 1 worker): 11.79 ms. Throughput: 84.80 iter/sec. FFTlen=14336K, Type=3, Arch=4, Pass1=896, Pass2=16384, clm=1 (8 cpus, 8 workers): 90.75, 89.54, 88.43, 88.01, 90.86, 90.01, 88.78, 88.71 ms. Throughput: 89.51 iter/sec. FFTlen=14336K, Type=3, Arch=4, Pass1=1792, Pass2=8192, clm=4 (8 cpus, 1 worker): 12.31 ms. Throughput: 81.23 iter/sec. FFTlen=14336K, Type=3, Arch=4, Pass1=1792, Pass2=8192, clm=4 (8 cpus, 8 workers): 94.03, 92.89, 91.91, 91.33, 96.25, 94.10, 93.35, 93.77 ms. Throughput: 85.62 iter/sec. FFTlen=14336K, Type=3, Arch=4, Pass1=1792, Pass2=8192, clm=2 (8 cpus, 1 worker): 11.30 ms. Throughput: 88.50 iter/sec. FFTlen=14336K, Type=3, Arch=4, Pass1=1792, Pass2=8192, clm=2 (8 cpus, 8 workers): 89.50, 87.77, 85.90, 86.07, 89.59, 89.10, 85.70, 85.73 ms. Throughput: 91.54 iter/sec. FFTlen=14336K, Type=3, Arch=4, Pass1=1792, Pass2=8192, clm=1 (8 cpus, 1 worker): 10.86 ms. Throughput: 92.07 iter/sec. FFTlen=14336K, Type=3, Arch=4, Pass1=1792, Pass2=8192, clm=1 (8 cpus, 8 workers): 85.77, 84.88, 83.99, 83.12, 86.33, 85.91, 83.07, 84.08 ms. Throughput: 94.53 iter/sec. FFTlen=15360K, Type=3, Arch=4, Pass1=768, Pass2=20480, clm=4 (8 cpus, 1 worker): 13.47 ms. Throughput: 74.25 iter/sec. FFTlen=15360K, Type=3, Arch=4, Pass1=768, Pass2=20480, clm=4 (8 cpus, 8 workers): 103.64, 102.59, 101.35, 100.94, 105.24, 102.75, 100.67, 100.73 ms. Throughput: 78.26 iter/sec. FFTlen=15360K, Type=3, Arch=4, Pass1=768, Pass2=20480, clm=2 (8 cpus, 1 worker): 12.64 ms. Throughput: 79.09 iter/sec. FFTlen=15360K, Type=3, Arch=4, Pass1=768, Pass2=20480, clm=2 (8 cpus, 8 workers): 97.89, 96.69, 95.68, 95.04, 98.36, 97.07, 95.72, 95.35 ms. Throughput: 82.93 iter/sec. FFTlen=15360K, Type=3, Arch=4, Pass1=768, Pass2=20480, clm=1 (8 cpus, 1 worker): 12.65 ms. Throughput: 79.08 iter/sec. [Sat Apr 29 14:21:31 2017] FFTlen=15360K, Type=3, Arch=4, Pass1=768, Pass2=20480, clm=1 (8 cpus, 8 workers): 97.79, 96.76, 95.60, 95.17, 99.33, 96.99, 94.26, 95.09 ms. Throughput: 83.03 iter/sec. FFTlen=15360K, Type=3, Arch=4, Pass1=1024, Pass2=15360, clm=4 (8 cpus, 1 worker): 13.07 ms. Throughput: 76.50 iter/sec. FFTlen=15360K, Type=3, Arch=4, Pass1=1024, Pass2=15360, clm=4 (8 cpus, 8 workers): 102.78, 101.62, 100.40, 99.61, 104.39, 101.31, 99.70, 99.58 ms. Throughput: 79.09 iter/sec. FFTlen=15360K, Type=3, Arch=4, Pass1=1024, Pass2=15360, clm=2 (8 cpus, 1 worker): 12.09 ms. Throughput: 82.73 iter/sec. FFTlen=15360K, Type=3, Arch=4, Pass1=1024, Pass2=15360, clm=2 (8 cpus, 8 workers): 95.24, 94.14, 93.36, 92.23, 96.29, 94.79, 92.67, 92.60 ms. Throughput: 85.20 iter/sec. FFTlen=15360K, Type=3, Arch=4, Pass1=1024, Pass2=15360, clm=1 (8 cpus, 1 worker): 11.93 ms. Throughput: 83.80 iter/sec. FFTlen=15360K, Type=3, Arch=4, Pass1=1024, Pass2=15360, clm=1 (8 cpus, 8 workers): 93.96, 93.33, 91.65, 91.07, 96.23, 93.32, 91.62, 91.56 ms. Throughput: 86.19 iter/sec. FFTlen=15360K, Type=3, Arch=4, Pass1=1280, Pass2=12288, clm=4 (8 cpus, 1 worker): 13.22 ms. Throughput: 75.65 iter/sec. FFTlen=15360K, Type=3, Arch=4, Pass1=1280, Pass2=12288, clm=4 (8 cpus, 8 workers): 102.72, 101.27, 100.32, 99.51, 103.25, 102.17, 99.55, 100.01 ms. Throughput: 79.14 iter/sec. FFTlen=15360K, Type=3, Arch=4, Pass1=1280, Pass2=12288, clm=2 (8 cpus, 1 worker): 12.23 ms. Throughput: 81.76 iter/sec. FFTlen=15360K, Type=3, Arch=4, Pass1=1280, Pass2=12288, clm=2 (8 cpus, 8 workers): 97.75, 96.02, 95.06, 94.40, 98.71, 96.53, 94.02, 94.07 ms. Throughput: 83.51 iter/sec. FFTlen=15360K, Type=3, Arch=4, Pass1=1280, Pass2=12288, clm=1 (8 cpus, 1 worker): 11.97 ms. Throughput: 83.55 iter/sec. FFTlen=15360K, Type=3, Arch=4, Pass1=1280, Pass2=12288, clm=1 (8 cpus, 8 workers): 95.27, 94.24, 92.89, 93.20, 95.32, 94.76, 92.97, 92.96 ms. Throughput: 85.16 iter/sec. FFTlen=15360K, Type=3, Arch=4, Pass1=1536, Pass2=10240, clm=4 (8 cpus, 1 worker): 12.68 ms. Throughput: 78.87 iter/sec. FFTlen=15360K, Type=3, Arch=4, Pass1=1536, Pass2=10240, clm=4 (8 cpus, 8 workers): 98.53, 97.40, 95.98, 96.10, 100.75, 98.63, 96.17, 96.19 ms. Throughput: 82.10 iter/sec. FFTlen=15360K, Type=3, Arch=4, Pass1=1536, Pass2=10240, clm=2 (8 cpus, 1 worker): 11.72 ms. Throughput: 85.34 iter/sec. FFTlen=15360K, Type=3, Arch=4, Pass1=1536, Pass2=10240, clm=2 (8 cpus, 8 workers): 93.00, 92.27, 90.68, 90.55, 94.12, 93.26, 90.09, 90.22 ms. Throughput: 87.19 iter/sec. FFTlen=15360K, Type=3, Arch=4, Pass1=1536, Pass2=10240, clm=1 (8 cpus, 1 worker): 11.39 ms. Throughput: 87.79 iter/sec. FFTlen=15360K, Type=3, Arch=4, Pass1=1536, Pass2=10240, clm=1 (8 cpus, 8 workers): 90.96, 90.30, 89.61, 89.37, 92.65, 90.90, 88.77, 88.52 ms. Throughput: 88.77 iter/sec. FFTlen=15360K, Type=3, Arch=4, Pass1=2048, Pass2=7680, clm=4 (8 cpus, 1 worker): 13.82 ms. Throughput: 72.38 iter/sec. FFTlen=15360K, Type=3, Arch=4, Pass1=2048, Pass2=7680, clm=4 (8 cpus, 8 workers): 105.47, 104.36, 102.30, 102.02, 106.97, 104.61, 101.90, 102.44 ms. Throughput: 77.12 iter/sec. FFTlen=15360K, Type=3, Arch=4, Pass1=2048, Pass2=7680, clm=2 (8 cpus, 1 worker): 13.21 ms. Throughput: 75.68 iter/sec. FFTlen=15360K, Type=3, Arch=4, Pass1=2048, Pass2=7680, clm=2 (8 cpus, 8 workers): 102.51, 101.38, 100.58, 99.52, 104.36, 102.25, 99.52, 100.12 ms. Throughput: 79.01 iter/sec. FFTlen=15360K, Type=3, Arch=4, Pass1=2048, Pass2=7680, clm=1 (8 cpus, 1 worker): 12.76 ms. Throughput: 78.36 iter/sec. FFTlen=15360K, Type=3, Arch=4, Pass1=2048, Pass2=7680, clm=1 (8 cpus, 8 workers): 100.54, 99.66, 98.84, 97.49, 101.27, 100.00, 97.92, 98.07 ms. Throughput: 80.64 iter/sec. [Sat Apr 29 14:26:41 2017] FFTlen=16000K, Type=3, Arch=4, Pass1=640, Pass2=25600, clm=4 (8 cpus, 1 worker): 13.53 ms. Throughput: 73.92 iter/sec. FFTlen=16000K, Type=3, Arch=4, Pass1=640, Pass2=25600, clm=4 (8 cpus, 8 workers): 104.38, 103.46, 102.70, 101.20, 104.83, 104.17, 102.52, 101.74 ms. Throughput: 77.59 iter/sec. FFTlen=16000K, Type=3, Arch=4, Pass1=640, Pass2=25600, clm=2 (8 cpus, 1 worker): 12.97 ms. Throughput: 77.10 iter/sec. FFTlen=16000K, Type=3, Arch=4, Pass1=640, Pass2=25600, clm=2 (8 cpus, 8 workers): 99.78, 98.93, 97.90, 97.27, 101.61, 99.98, 97.38, 97.47 ms. Throughput: 81.00 iter/sec. FFTlen=16000K, Type=3, Arch=4, Pass1=640, Pass2=25600, clm=1 (8 cpus, 1 worker): 12.86 ms. Throughput: 77.79 iter/sec. FFTlen=16000K, Type=3, Arch=4, Pass1=640, Pass2=25600, clm=1 (8 cpus, 8 workers): 100.16, 98.50, 98.28, 98.07, 100.28, 99.63, 97.64, 98.14 ms. Throughput: 80.95 iter/sec. FFTlen=16000K, Type=3, Arch=4, Pass1=1280, Pass2=12800, clm=4 (8 cpus, 1 worker): 13.34 ms. Throughput: 74.96 iter/sec. FFTlen=16000K, Type=3, Arch=4, Pass1=1280, Pass2=12800, clm=4 (8 cpus, 8 workers): 102.25, 101.53, 99.27, 99.11, 103.43, 101.79, 99.08, 99.57 ms. Throughput: 79.42 iter/sec. FFTlen=16000K, Type=3, Arch=4, Pass1=1280, Pass2=12800, clm=2 (8 cpus, 1 worker): 12.44 ms. Throughput: 80.35 iter/sec. FFTlen=16000K, Type=3, Arch=4, Pass1=1280, Pass2=12800, clm=2 (8 cpus, 8 workers): 97.38, 96.54, 94.87, 94.83, 97.41, 97.07, 93.82, 94.10 ms. Throughput: 83.57 iter/sec. FFTlen=16000K, Type=3, Arch=4, Pass1=1280, Pass2=12800, clm=1 (8 cpus, 1 worker): 12.12 ms. Throughput: 82.51 iter/sec. FFTlen=16000K, Type=3, Arch=4, Pass1=1280, Pass2=12800, clm=1 (8 cpus, 8 workers): 95.17, 94.34, 93.84, 92.41, 95.92, 95.24, 93.21, 92.66 ms. Throughput: 85.03 iter/sec. FFTlen=16128K, Type=3, Arch=4, Pass1=1792, Pass2=9216, clm=4 (8 cpus, 1 worker): 14.33 ms. Throughput: 69.76 iter/sec. FFTlen=16128K, Type=3, Arch=4, Pass1=1792, Pass2=9216, clm=4 (8 cpus, 8 workers): 111.43, 110.59, 109.38, 108.94, 112.21, 110.97, 108.93, 110.16 ms. Throughput: 72.52 iter/sec. FFTlen=16128K, Type=3, Arch=4, Pass1=1792, Pass2=9216, clm=2 (8 cpus, 1 worker): 13.33 ms. Throughput: 75.00 iter/sec. FFTlen=16128K, Type=3, Arch=4, Pass1=1792, Pass2=9216, clm=2 (8 cpus, 8 workers): 105.62, 104.71, 103.49, 102.63, 107.57, 104.99, 102.34, 103.95 ms. Throughput: 76.64 iter/sec. FFTlen=16128K, Type=3, Arch=4, Pass1=1792, Pass2=9216, clm=1 (8 cpus, 1 worker): 13.02 ms. Throughput: 76.81 iter/sec. FFTlen=16128K, Type=3, Arch=4, Pass1=1792, Pass2=9216, clm=1 (8 cpus, 8 workers): 102.88, 101.73, 101.34, 100.37, 104.12, 103.33, 99.86, 100.40 ms. Throughput: 78.64 iter/sec. FFTlen=16384K, Type=3, Arch=4, Pass1=1024, Pass2=16384, clm=4 (8 cpus, 1 worker): 14.73 ms. Throughput: 67.91 iter/sec. FFTlen=16384K, Type=3, Arch=4, Pass1=1024, Pass2=16384, clm=4 (8 cpus, 8 workers): 113.85, 112.81, 111.47, 111.78, 114.82, 113.24, 111.93, 111.62 ms. Throughput: 71.00 iter/sec. FFTlen=16384K, Type=3, Arch=4, Pass1=1024, Pass2=16384, clm=2 (8 cpus, 1 worker): 13.61 ms. Throughput: 73.49 iter/sec. FFTlen=16384K, Type=3, Arch=4, Pass1=1024, Pass2=16384, clm=2 (8 cpus, 8 workers): 106.07, 104.96, 104.37, 104.13, 107.18, 105.42, 104.26, 103.74 ms. Throughput: 76.19 iter/sec. FFTlen=16384K, Type=3, Arch=4, Pass1=1024, Pass2=16384, clm=1 (8 cpus, 1 worker): 13.56 ms. Throughput: 73.76 iter/sec. FFTlen=16384K, Type=3, Arch=4, Pass1=1024, Pass2=16384, clm=1 (8 cpus, 8 workers): 104.97, 104.46, 102.68, 102.26, 105.77, 104.62, 101.61, 102.33 ms. Throughput: 77.24 iter/sec. [Sat Apr 29 14:31:46 2017] FFTlen=16384K, Type=3, Arch=4, Pass1=2048, Pass2=8192, clm=4 (8 cpus, 1 worker): 14.80 ms. Throughput: 67.58 iter/sec. FFTlen=16384K, Type=3, Arch=4, Pass1=2048, Pass2=8192, clm=4 (8 cpus, 8 workers): 112.86, 111.73, 110.26, 109.36, 113.68, 111.98, 109.29, 109.53 ms. Throughput: 72.03 iter/sec. FFTlen=16384K, Type=3, Arch=4, Pass1=2048, Pass2=8192, clm=2 (8 cpus, 1 worker): 14.04 ms. Throughput: 71.25 iter/sec. FFTlen=16384K, Type=3, Arch=4, Pass1=2048, Pass2=8192, clm=2 (8 cpus, 8 workers): 108.03, 107.12, 106.15, 105.42, 109.38, 108.78, 105.40, 104.85 ms. Throughput: 74.86 iter/sec. FFTlen=16384K, Type=3, Arch=4, Pass1=2048, Pass2=8192, clm=1 (8 cpus, 1 worker): 13.41 ms. Throughput: 74.57 iter/sec. FFTlen=16384K, Type=3, Arch=4, Pass1=2048, Pass2=8192, clm=1 (8 cpus, 8 workers): 105.18, 104.37, 103.65, 102.49, 105.92, 104.78, 103.55, 103.00 ms. Throughput: 76.85 iter/sec. FFTlen=17920K, Type=3, Arch=4, Pass1=896, Pass2=20480, clm=4 (8 cpus, 1 worker): 16.21 ms. Throughput: 61.71 iter/sec. FFTlen=17920K, Type=3, Arch=4, Pass1=896, Pass2=20480, clm=4 (8 cpus, 8 workers): 122.80, 121.94, 120.63, 120.09, 124.51, 122.90, 119.55, 119.72 ms. Throughput: 65.85 iter/sec. FFTlen=17920K, Type=3, Arch=4, Pass1=896, Pass2=20480, clm=2 (8 cpus, 1 worker): 14.93 ms. Throughput: 66.98 iter/sec. FFTlen=17920K, Type=3, Arch=4, Pass1=896, Pass2=20480, clm=2 (8 cpus, 8 workers): 116.00, 114.75, 113.80, 112.44, 117.51, 115.35, 113.12, 112.31 ms. Throughput: 69.94 iter/sec. FFTlen=17920K, Type=3, Arch=4, Pass1=896, Pass2=20480, clm=1 (8 cpus, 1 worker): 14.89 ms. Throughput: 67.17 iter/sec. FFTlen=17920K, Type=3, Arch=4, Pass1=896, Pass2=20480, clm=1 (8 cpus, 8 workers): 115.11, 113.33, 113.21, 111.77, 115.94, 114.25, 111.76, 112.29 ms. Throughput: 70.52 iter/sec. FFTlen=17920K, Type=3, Arch=4, Pass1=1792, Pass2=10240, clm=4 (8 cpus, 1 worker): 15.40 ms. Throughput: 64.93 iter/sec. FFTlen=17920K, Type=3, Arch=4, Pass1=1792, Pass2=10240, clm=4 (8 cpus, 8 workers): 121.39, 119.36, 118.11, 116.87, 122.43, 119.90, 117.06, 117.19 ms. Throughput: 67.22 iter/sec. FFTlen=17920K, Type=3, Arch=4, Pass1=1792, Pass2=10240, clm=2 (8 cpus, 1 worker): 15.31 ms. Throughput: 65.33 iter/sec. FFTlen=17920K, Type=3, Arch=4, Pass1=1792, Pass2=10240, clm=2 (8 cpus, 8 workers): 112.23, 113.46, 110.94, 110.52, 112.93, 110.80, 109.06, 108.74 ms. Throughput: 72.03 iter/sec. FFTlen=17920K, Type=3, Arch=4, Pass1=1792, Pass2=10240, clm=1 (8 cpus, 1 worker): 13.60 ms. Throughput: 73.51 iter/sec. FFTlen=17920K, Type=3, Arch=4, Pass1=1792, Pass2=10240, clm=1 (8 cpus, 8 workers): 109.20, 108.38, 106.82, 105.80, 110.43, 109.24, 106.04, 105.98 ms. Throughput: 74.27 iter/sec. FFTlen=18432K, Type=3, Arch=4, Pass1=1536, Pass2=12288, clm=4 (8 cpus, 1 worker): 16.14 ms. Throughput: 61.94 iter/sec. FFTlen=18432K, Type=3, Arch=4, Pass1=1536, Pass2=12288, clm=4 (8 cpus, 8 workers): 126.29, 125.00, 123.37, 122.34, 127.33, 125.33, 122.42, 123.40 ms. Throughput: 64.30 iter/sec. FFTlen=18432K, Type=3, Arch=4, Pass1=1536, Pass2=12288, clm=2 (8 cpus, 1 worker): 14.96 ms. Throughput: 66.87 iter/sec. FFTlen=18432K, Type=3, Arch=4, Pass1=1536, Pass2=12288, clm=2 (8 cpus, 8 workers): 118.50, 117.31, 116.08, 115.32, 118.65, 117.52, 115.31, 115.58 ms. Throughput: 68.51 iter/sec. FFTlen=18432K, Type=3, Arch=4, Pass1=1536, Pass2=12288, clm=1 (8 cpus, 1 worker): 14.55 ms. Throughput: 68.74 iter/sec. FFTlen=18432K, Type=3, Arch=4, Pass1=1536, Pass2=12288, clm=1 (8 cpus, 8 workers): 115.79, 115.40, 113.02, 112.78, 117.78, 115.82, 113.03, 112.68 ms. Throughput: 69.86 iter/sec. [Sat Apr 29 14:36:58 2017] FFTlen=18432K, Type=3, Arch=4, Pass1=2048, Pass2=9216, clm=4 (8 cpus, 1 worker): 16.95 ms. Throughput: 58.99 iter/sec. FFTlen=18432K, Type=3, Arch=4, Pass1=2048, Pass2=9216, clm=4 (8 cpus, 8 workers): 131.47, 130.14, 128.89, 128.69, 132.40, 130.47, 128.94, 129.90 ms. Throughput: 61.49 iter/sec. FFTlen=18432K, Type=3, Arch=4, Pass1=2048, Pass2=9216, clm=2 (8 cpus, 1 worker): 15.82 ms. Throughput: 63.20 iter/sec. FFTlen=18432K, Type=3, Arch=4, Pass1=2048, Pass2=9216, clm=2 (8 cpus, 8 workers): 124.13, 123.01, 122.67, 120.95, 125.67, 123.01, 121.51, 122.03 ms. Throughput: 65.12 iter/sec. FFTlen=18432K, Type=3, Arch=4, Pass1=2048, Pass2=9216, clm=1 (8 cpus, 1 worker): 14.97 ms. Throughput: 66.78 iter/sec. FFTlen=18432K, Type=3, Arch=4, Pass1=2048, Pass2=9216, clm=1 (8 cpus, 8 workers): 118.72, 117.88, 118.00, 117.69, 121.11, 117.43, 115.88, 117.05 ms. Throughput: 67.82 iter/sec. FFTlen=19200K, Type=3, Arch=4, Pass1=768, Pass2=25600, clm=4 (8 cpus, 1 worker): 16.28 ms. Throughput: 61.42 iter/sec. FFTlen=19200K, Type=3, Arch=4, Pass1=768, Pass2=25600, clm=4 (8 cpus, 8 workers): 127.72, 126.13, 124.78, 123.06, 128.73, 126.62, 123.52, 123.31 ms. Throughput: 63.77 iter/sec. FFTlen=19200K, Type=3, Arch=4, Pass1=768, Pass2=25600, clm=2 (8 cpus, 1 worker): 15.40 ms. Throughput: 64.93 iter/sec. FFTlen=19200K, Type=3, Arch=4, Pass1=768, Pass2=25600, clm=2 (8 cpus, 8 workers): 120.61, 118.70, 118.24, 117.05, 121.54, 119.55, 117.59, 117.16 ms. Throughput: 67.35 iter/sec. FFTlen=19200K, Type=3, Arch=4, Pass1=768, Pass2=25600, clm=1 (8 cpus, 1 worker): 15.39 ms. Throughput: 64.98 iter/sec. FFTlen=19200K, Type=3, Arch=4, Pass1=768, Pass2=25600, clm=1 (8 cpus, 8 workers): 119.63, 118.72, 117.94, 117.13, 120.72, 120.06, 117.48, 116.96 ms. Throughput: 67.47 iter/sec. FFTlen=19200K, Type=3, Arch=4, Pass1=1280, Pass2=15360, clm=4 (8 cpus, 1 worker): 16.15 ms. Throughput: 61.91 iter/sec. FFTlen=19200K, Type=3, Arch=4, Pass1=1280, Pass2=15360, clm=4 (8 cpus, 8 workers): 128.69, 126.53, 125.18, 124.04, 128.82, 131.55, 124.06, 129.59 ms. Throughput: 62.87 iter/sec. FFTlen=19200K, Type=3, Arch=4, Pass1=1280, Pass2=15360, clm=2 (8 cpus, 1 worker): 15.15 ms. Throughput: 66.03 iter/sec. FFTlen=19200K, Type=3, Arch=4, Pass1=1280, Pass2=15360, clm=2 (8 cpus, 8 workers): 120.34, 119.35, 117.84, 118.18, 121.58, 120.31, 117.98, 118.89 ms. Throughput: 67.06 iter/sec. FFTlen=19200K, Type=3, Arch=4, Pass1=1280, Pass2=15360, clm=1 (8 cpus, 1 worker): 15.07 ms. Throughput: 66.35 iter/sec. FFTlen=19200K, Type=3, Arch=4, Pass1=1280, Pass2=15360, clm=1 (8 cpus, 8 workers): 118.27, 117.60, 116.26, 115.71, 119.99, 117.95, 116.59, 117.05 ms. Throughput: 68.13 iter/sec. FFTlen=19200K, Type=3, Arch=4, Pass1=1536, Pass2=12800, clm=4 (8 cpus, 1 worker): 16.33 ms. Throughput: 61.22 iter/sec. FFTlen=19200K, Type=3, Arch=4, Pass1=1536, Pass2=12800, clm=4 (8 cpus, 8 workers): 127.24, 125.85, 122.49, 122.77, 128.46, 130.08, 122.39, 127.27 ms. Throughput: 63.61 iter/sec. FFTlen=19200K, Type=3, Arch=4, Pass1=1536, Pass2=12800, clm=2 (8 cpus, 1 worker): 15.18 ms. Throughput: 65.88 iter/sec. FFTlen=19200K, Type=3, Arch=4, Pass1=1536, Pass2=12800, clm=2 (8 cpus, 8 workers): 119.34, 118.25, 116.75, 115.47, 120.63, 120.45, 114.53, 114.60 ms. Throughput: 68.11 iter/sec. FFTlen=19200K, Type=3, Arch=4, Pass1=1536, Pass2=12800, clm=1 (8 cpus, 1 worker): 14.71 ms. Throughput: 67.99 iter/sec. [Sat Apr 29 14:42:07 2017] FFTlen=19200K, Type=3, Arch=4, Pass1=1536, Pass2=12800, clm=1 (8 cpus, 8 workers): 115.91, 114.99, 113.08, 112.70, 117.85, 115.75, 113.72, 112.63 ms. Throughput: 69.84 iter/sec. FFTlen=20480K, Type=3, Arch=4, Pass1=1024, Pass2=20480, clm=4 (8 cpus, 1 worker): 18.46 ms. Throughput: 54.17 iter/sec. FFTlen=20480K, Type=3, Arch=4, Pass1=1024, Pass2=20480, clm=4 (8 cpus, 8 workers): 143.61, 142.54, 140.21, 139.65, 145.00, 142.65, 139.31, 139.51 ms. Throughput: 56.53 iter/sec. FFTlen=20480K, Type=3, Arch=4, Pass1=1024, Pass2=20480, clm=2 (8 cpus, 1 worker): 17.11 ms. Throughput: 58.46 iter/sec. FFTlen=20480K, Type=3, Arch=4, Pass1=1024, Pass2=20480, clm=2 (8 cpus, 8 workers): 134.04, 132.80, 131.32, 130.55, 136.74, 133.74, 130.87, 130.36 ms. Throughput: 60.37 iter/sec. FFTlen=20480K, Type=3, Arch=4, Pass1=1024, Pass2=20480, clm=1 (8 cpus, 1 worker): 17.00 ms. Throughput: 58.82 iter/sec. FFTlen=20480K, Type=3, Arch=4, Pass1=1024, Pass2=20480, clm=1 (8 cpus, 8 workers): 132.73, 131.15, 130.08, 129.25, 134.00, 132.32, 129.43, 129.45 ms. Throughput: 61.05 iter/sec. FFTlen=20480K, Type=3, Arch=4, Pass1=1280, Pass2=16384, clm=4 (8 cpus, 1 worker): 19.60 ms. Throughput: 51.02 iter/sec. FFTlen=20480K, Type=3, Arch=4, Pass1=1280, Pass2=16384, clm=4 (8 cpus, 8 workers): 154.39, 152.61, 150.89, 150.18, 156.86, 153.19, 149.97, 151.48 ms. Throughput: 52.49 iter/sec. FFTlen=20480K, Type=3, Arch=4, Pass1=1280, Pass2=16384, clm=2 (8 cpus, 1 worker): 19.68 ms. Throughput: 50.82 iter/sec. FFTlen=20480K, Type=3, Arch=4, Pass1=1280, Pass2=16384, clm=2 (8 cpus, 8 workers): 152.90, 151.28, 150.34, 148.12, 153.27, 150.85, 150.86, 150.37 ms. Throughput: 52.99 iter/sec. FFTlen=20480K, Type=3, Arch=4, Pass1=1280, Pass2=16384, clm=1 (8 cpus, 1 worker): 20.60 ms. Throughput: 48.55 iter/sec. FFTlen=20480K, Type=3, Arch=4, Pass1=1280, Pass2=16384, clm=1 (8 cpus, 8 workers): 160.40, 158.31, 159.16, 161.13, 160.33, 156.96, 157.02, 160.13 ms. Throughput: 50.26 iter/sec. FFTlen=20480K, Type=3, Arch=4, Pass1=2048, Pass2=10240, clm=4 (8 cpus, 1 worker): 18.50 ms. Throughput: 54.05 iter/sec. FFTlen=20480K, Type=3, Arch=4, Pass1=2048, Pass2=10240, clm=4 (8 cpus, 8 workers): 143.00, 141.54, 140.00, 139.74, 143.91, 142.22, 140.10, 140.61 ms. Throughput: 56.59 iter/sec. FFTlen=20480K, Type=3, Arch=4, Pass1=2048, Pass2=10240, clm=2 (8 cpus, 1 worker): 17.47 ms. Throughput: 57.24 iter/sec. FFTlen=20480K, Type=3, Arch=4, Pass1=2048, Pass2=10240, clm=2 (8 cpus, 8 workers): 137.22, 135.82, 133.38, 133.62, 137.32, 135.66, 133.49, 134.48 ms. Throughput: 59.21 iter/sec. FFTlen=20480K, Type=3, Arch=4, Pass1=2048, Pass2=10240, clm=1 (8 cpus, 1 worker): 16.55 ms. Throughput: 60.42 iter/sec. FFTlen=20480K, Type=3, Arch=4, Pass1=2048, Pass2=10240, clm=1 (8 cpus, 8 workers): 131.07, 129.42, 128.52, 126.78, 131.50, 129.32, 126.71, 126.94 ms. Throughput: 62.13 iter/sec. FFTlen=21504K, Type=3, Arch=4, Pass1=1792, Pass2=12288, clm=4 (8 cpus, 1 worker): 19.37 ms. Throughput: 51.62 iter/sec. FFTlen=21504K, Type=3, Arch=4, Pass1=1792, Pass2=12288, clm=4 (8 cpus, 8 workers): 152.44, 150.42, 148.56, 148.26, 153.18, 150.79, 148.68, 149.49 ms. Throughput: 53.26 iter/sec. FFTlen=21504K, Type=3, Arch=4, Pass1=1792, Pass2=12288, clm=2 (8 cpus, 1 worker): 18.11 ms. Throughput: 55.22 iter/sec. [Sat Apr 29 14:47:12 2017] FFTlen=21504K, Type=3, Arch=4, Pass1=1792, Pass2=12288, clm=2 (8 cpus, 8 workers): 144.24, 141.87, 140.61, 140.10, 145.14, 144.92, 140.40, 141.03 ms. Throughput: 56.23 iter/sec. FFTlen=21504K, Type=3, Arch=4, Pass1=1792, Pass2=12288, clm=1 (8 cpus, 1 worker): 17.57 ms. Throughput: 56.91 iter/sec. FFTlen=21504K, Type=3, Arch=4, Pass1=1792, Pass2=12288, clm=1 (8 cpus, 8 workers): 139.82, 138.40, 137.36, 135.68, 140.56, 139.21, 136.32, 135.84 ms. Throughput: 58.02 iter/sec. FFTlen=22400K, Type=3, Arch=4, Pass1=896, Pass2=25600, clm=4 (8 cpus, 1 worker): 19.24 ms. Throughput: 51.99 iter/sec. FFTlen=22400K, Type=3, Arch=4, Pass1=896, Pass2=25600, clm=4 (8 cpus, 8 workers): 152.16, 150.19, 149.30, 147.42, 151.86, 151.20, 147.07, 146.95 ms. Throughput: 53.51 iter/sec. FFTlen=22400K, Type=3, Arch=4, Pass1=896, Pass2=25600, clm=2 (8 cpus, 1 worker): 18.22 ms. Throughput: 54.89 iter/sec. FFTlen=22400K, Type=3, Arch=4, Pass1=896, Pass2=25600, clm=2 (8 cpus, 8 workers): 142.88, 141.82, 139.96, 139.27, 144.48, 142.75, 138.14, 139.23 ms. Throughput: 56.72 iter/sec. FFTlen=22400K, Type=3, Arch=4, Pass1=896, Pass2=25600, clm=1 (8 cpus, 1 worker): 18.10 ms. Throughput: 55.25 iter/sec. FFTlen=22400K, Type=3, Arch=4, Pass1=896, Pass2=25600, clm=1 (8 cpus, 8 workers): 140.97, 140.41, 139.48, 138.84, 141.96, 141.56, 138.11, 138.59 ms. Throughput: 57.15 iter/sec. FFTlen=22400K, Type=3, Arch=4, Pass1=1792, Pass2=12800, clm=4 (8 cpus, 1 worker): 19.70 ms. Throughput: 50.77 iter/sec. FFTlen=22400K, Type=3, Arch=4, Pass1=1792, Pass2=12800, clm=4 (8 cpus, 8 workers): 152.97, 151.46, 150.14, 149.03, 155.32, 151.97, 151.99, 152.76 ms. Throughput: 52.65 iter/sec. FFTlen=22400K, Type=3, Arch=4, Pass1=1792, Pass2=12800, clm=2 (8 cpus, 1 worker): 18.42 ms. Throughput: 54.28 iter/sec. FFTlen=22400K, Type=3, Arch=4, Pass1=1792, Pass2=12800, clm=2 (8 cpus, 8 workers): 142.59, 141.85, 140.52, 139.74, 144.90, 148.17, 139.17, 145.67 ms. Throughput: 56.04 iter/sec. FFTlen=22400K, Type=3, Arch=4, Pass1=1792, Pass2=12800, clm=1 (8 cpus, 1 worker): 17.57 ms. Throughput: 56.91 iter/sec. FFTlen=22400K, Type=3, Arch=4, Pass1=1792, Pass2=12800, clm=1 (8 cpus, 8 workers): 138.33, 136.22, 135.18, 133.78, 196.53, 145.37, 150.17, 149.96 ms. Throughput: 54.74 iter/sec. FFTlen=23040K, Type=3, Arch=4, Pass1=1536, Pass2=15360, clm=4 (8 cpus, 1 worker): 37.60 ms. Throughput: 26.59 iter/sec. FFTlen=23040K, Type=3, Arch=4, Pass1=1536, Pass2=15360, clm=4 (8 cpus, 8 workers): 156.67, 154.90, 152.88, 152.56, 157.76, 159.27, 152.25, 157.35 ms. Throughput: 51.48 iter/sec. FFTlen=23040K, Type=3, Arch=4, Pass1=1536, Pass2=15360, clm=2 (8 cpus, 1 worker): 18.64 ms. Throughput: 53.65 iter/sec. FFTlen=23040K, Type=3, Arch=4, Pass1=1536, Pass2=15360, clm=2 (8 cpus, 8 workers): 148.41, 147.44, 145.45, 144.55, 149.78, 147.56, 143.97, 145.00 ms. Throughput: 54.61 iter/sec. FFTlen=23040K, Type=3, Arch=4, Pass1=1536, Pass2=15360, clm=1 (8 cpus, 1 worker): 18.08 ms. Throughput: 55.32 iter/sec. FFTlen=23040K, Type=3, Arch=4, Pass1=1536, Pass2=15360, clm=1 (8 cpus, 8 workers): 143.44, 142.70, 141.64, 141.23, 145.98, 144.96, 141.09, 141.46 ms. Throughput: 56.03 iter/sec. [Sat Apr 29 14:52:17 2017] FFTlen=24576K, Type=3, Arch=4, Pass1=1536, Pass2=16384, clm=4 (8 cpus, 1 worker): 24.67 ms. Throughput: 40.53 iter/sec. FFTlen=24576K, Type=3, Arch=4, Pass1=1536, Pass2=16384, clm=4 (8 cpus, 8 workers): 189.04, 187.27, 185.89, 185.07, 189.94, 189.52, 188.71, 186.49 ms. Throughput: 42.62 iter/sec. FFTlen=24576K, Type=3, Arch=4, Pass1=1536, Pass2=16384, clm=2 (8 cpus, 1 worker): 25.05 ms. Throughput: 39.92 iter/sec. FFTlen=24576K, Type=3, Arch=4, Pass1=1536, Pass2=16384, clm=2 (8 cpus, 8 workers): 195.84, 190.61, 188.54, 188.41, 194.09, 192.51, 192.09, 189.07 ms. Throughput: 41.81 iter/sec. FFTlen=24576K, Type=3, Arch=4, Pass1=1536, Pass2=16384, clm=1 (8 cpus, 1 worker): 26.79 ms. Throughput: 37.33 iter/sec. FFTlen=24576K, Type=3, Arch=4, Pass1=1536, Pass2=16384, clm=1 (8 cpus, 8 workers): 202.52, 204.55, 204.16, 201.13, 202.91, 203.49, 209.50, 202.71 ms. Throughput: 39.25 iter/sec. FFTlen=24576K, Type=3, Arch=4, Pass1=2048, Pass2=12288, clm=4 (8 cpus, 1 worker): 23.54 ms. Throughput: 42.47 iter/sec. FFTlen=24576K, Type=3, Arch=4, Pass1=2048, Pass2=12288, clm=4 (8 cpus, 8 workers): 182.48, 180.50, 177.79, 177.75, 185.25, 182.01, 179.46, 178.07 ms. Throughput: 44.35 iter/sec. FFTlen=24576K, Type=3, Arch=4, Pass1=2048, Pass2=12288, clm=2 (8 cpus, 1 worker): 22.72 ms. Throughput: 44.01 iter/sec. FFTlen=24576K, Type=3, Arch=4, Pass1=2048, Pass2=12288, clm=2 (8 cpus, 8 workers): 179.95, 175.96, 173.58, 173.90, 180.04, 177.10, 173.44, 174.81 ms. Throughput: 45.44 iter/sec. FFTlen=24576K, Type=3, Arch=4, Pass1=2048, Pass2=12288, clm=1 (8 cpus, 1 worker): 23.68 ms. Throughput: 42.23 iter/sec. FFTlen=24576K, Type=3, Arch=4, Pass1=2048, Pass2=12288, clm=1 (8 cpus, 8 workers): 174.50, 172.27, 170.65, 170.29, 177.43, 172.76, 170.50, 170.67 ms. Throughput: 46.42 iter/sec. FFTlen=25600K, Type=3, Arch=4, Pass1=1024, Pass2=25600, clm=4 (8 cpus, 1 worker): 22.48 ms. Throughput: 44.49 iter/sec. FFTlen=25600K, Type=3, Arch=4, Pass1=1024, Pass2=25600, clm=4 (8 cpus, 8 workers): 176.67, 173.71, 171.84, 171.31, 177.38, 174.33, 171.48, 171.38 ms. Throughput: 46.11 iter/sec. FFTlen=25600K, Type=3, Arch=4, Pass1=1024, Pass2=25600, clm=2 (8 cpus, 1 worker): 20.89 ms. Throughput: 47.88 iter/sec. FFTlen=25600K, Type=3, Arch=4, Pass1=1024, Pass2=25600, clm=2 (8 cpus, 8 workers): 164.10, 162.57, 161.47, 159.94, 165.55, 163.87, 159.66, 161.38 ms. Throughput: 49.29 iter/sec. FFTlen=25600K, Type=3, Arch=4, Pass1=1024, Pass2=25600, clm=1 (8 cpus, 1 worker): 20.68 ms. Throughput: 48.36 iter/sec. FFTlen=25600K, Type=3, Arch=4, Pass1=1024, Pass2=25600, clm=1 (8 cpus, 8 workers): 161.84, 160.62, 160.02, 159.33, 163.37, 161.63, 158.89, 158.69 ms. Throughput: 49.83 iter/sec. FFTlen=25600K, Type=3, Arch=4, Pass1=1280, Pass2=20480, clm=4 (8 cpus, 1 worker): 23.82 ms. Throughput: 41.98 iter/sec. FFTlen=25600K, Type=3, Arch=4, Pass1=1280, Pass2=20480, clm=4 (8 cpus, 8 workers): 179.13, 175.89, 173.99, 174.33, 185.67, 183.57, 173.55, 174.25 ms. Throughput: 45.09 iter/sec. [Sat Apr 29 14:57:19 2017] FFTlen=25600K, Type=3, Arch=4, Pass1=1280, Pass2=20480, clm=2 (8 cpus, 1 worker): 22.39 ms. Throughput: 44.66 iter/sec. FFTlen=25600K, Type=3, Arch=4, Pass1=1280, Pass2=20480, clm=2 (8 cpus, 8 workers): 180.85, 172.46, 167.18, 165.47, 180.50, 172.29, 166.52, 165.56 ms. Throughput: 46.74 iter/sec. FFTlen=25600K, Type=3, Arch=4, Pass1=1280, Pass2=20480, clm=1 (8 cpus, 1 worker): 21.49 ms. Throughput: 46.52 iter/sec. FFTlen=25600K, Type=3, Arch=4, Pass1=1280, Pass2=20480, clm=1 (8 cpus, 8 workers): 169.40, 165.96, 164.44, 162.95, 171.29, 166.42, 163.32, 163.39 ms. Throughput: 48.24 iter/sec. FFTlen=25600K, Type=3, Arch=4, Pass1=2048, Pass2=12800, clm=4 (8 cpus, 1 worker): 23.75 ms. Throughput: 42.11 iter/sec. FFTlen=25600K, Type=3, Arch=4, Pass1=2048, Pass2=12800, clm=4 (8 cpus, 8 workers): 190.96, 183.52, 179.52, 179.92, 190.63, 183.39, 177.84, 178.05 ms. Throughput: 43.75 iter/sec. FFTlen=25600K, Type=3, Arch=4, Pass1=2048, Pass2=12800, clm=2 (8 cpus, 1 worker): 23.62 ms. Throughput: 42.33 iter/sec. FFTlen=25600K, Type=3, Arch=4, Pass1=2048, Pass2=12800, clm=2 (8 cpus, 8 workers): 176.85, 173.11, 172.09, 169.52, 175.97, 172.80, 169.54, 169.37 ms. Throughput: 46.41 iter/sec. FFTlen=25600K, Type=3, Arch=4, Pass1=2048, Pass2=12800, clm=1 (8 cpus, 1 worker): 21.36 ms. Throughput: 46.83 iter/sec. FFTlen=25600K, Type=3, Arch=4, Pass1=2048, Pass2=12800, clm=1 (8 cpus, 8 workers): 176.76, 167.37, 165.26, 161.44, 173.02, 167.77, 161.49, 161.68 ms. Throughput: 48.00 iter/sec. FFTlen=26880K, Type=3, Arch=4, Pass1=1792, Pass2=15360, clm=4 (8 cpus, 1 worker): 24.49 ms. Throughput: 40.83 iter/sec. FFTlen=26880K, Type=3, Arch=4, Pass1=1792, Pass2=15360, clm=4 (8 cpus, 8 workers): 194.71, 190.23, 186.79, 186.01, 196.59, 190.42, 186.47, 187.90 ms. Throughput: 42.15 iter/sec. FFTlen=26880K, Type=3, Arch=4, Pass1=1792, Pass2=15360, clm=2 (8 cpus, 1 worker): 22.62 ms. Throughput: 44.21 iter/sec. FFTlen=26880K, Type=3, Arch=4, Pass1=1792, Pass2=15360, clm=2 (8 cpus, 8 workers): 181.67, 179.54, 176.21, 174.24, 182.90, 179.91, 174.53, 175.12 ms. Throughput: 44.95 iter/sec. FFTlen=26880K, Type=3, Arch=4, Pass1=1792, Pass2=15360, clm=1 (8 cpus, 1 worker): 21.99 ms. Throughput: 45.48 iter/sec. FFTlen=26880K, Type=3, Arch=4, Pass1=1792, Pass2=15360, clm=1 (8 cpus, 8 workers): 177.51, 173.62, 170.26, 169.13, 180.13, 175.57, 170.18, 169.45 ms. Throughput: 46.20 iter/sec. FFTlen=28672K, Type=3, Arch=4, Pass1=1792, Pass2=16384, clm=4 (8 cpus, 1 worker): 28.63 ms. Throughput: 34.93 iter/sec. FFTlen=28672K, Type=3, Arch=4, Pass1=1792, Pass2=16384, clm=4 (8 cpus, 8 workers): 228.42, 223.94, 220.96, 219.72, 228.97, 225.51, 219.68, 221.00 ms. Throughput: 35.80 iter/sec. FFTlen=28672K, Type=3, Arch=4, Pass1=1792, Pass2=16384, clm=2 (8 cpus, 1 worker): 30.09 ms. Throughput: 33.24 iter/sec. FFTlen=28672K, Type=3, Arch=4, Pass1=1792, Pass2=16384, clm=2 (8 cpus, 8 workers): 235.75, 232.82, 230.04, 228.62, 236.70, 233.53, 231.66, 229.22 ms. Throughput: 34.44 iter/sec. [Sat Apr 29 15:02:30 2017] FFTlen=28672K, Type=3, Arch=4, Pass1=1792, Pass2=16384, clm=1 (8 cpus, 1 worker): 32.91 ms. Throughput: 30.39 iter/sec. FFTlen=28672K, Type=3, Arch=4, Pass1=1792, Pass2=16384, clm=1 (8 cpus, 8 workers): 252.35, 254.03, 256.12, 246.81, 256.55, 254.41, 253.75, 261.09 ms. Throughput: 31.46 iter/sec. FFTlen=30720K, Type=3, Arch=4, Pass1=1536, Pass2=20480, clm=4 (8 cpus, 1 worker): 28.31 ms. Throughput: 35.32 iter/sec. FFTlen=30720K, Type=3, Arch=4, Pass1=1536, Pass2=20480, clm=4 (8 cpus, 8 workers): 222.24, 218.20, 215.36, 213.75, 223.69, 220.52, 213.92, 214.45 ms. Throughput: 36.75 iter/sec. FFTlen=30720K, Type=3, Arch=4, Pass1=1536, Pass2=20480, clm=2 (8 cpus, 1 worker): 26.30 ms. Throughput: 38.02 iter/sec. FFTlen=30720K, Type=3, Arch=4, Pass1=1536, Pass2=20480, clm=2 (8 cpus, 8 workers): 211.37, 207.53, 204.70, 202.24, 213.43, 208.48, 201.50, 202.33 ms. Throughput: 38.77 iter/sec. FFTlen=30720K, Type=3, Arch=4, Pass1=1536, Pass2=20480, clm=1 (8 cpus, 1 worker): 25.68 ms. Throughput: 38.95 iter/sec. FFTlen=30720K, Type=3, Arch=4, Pass1=1536, Pass2=20480, clm=1 (8 cpus, 8 workers): 205.14, 202.47, 199.58, 197.59, 207.60, 202.85, 198.40, 198.37 ms. Throughput: 39.71 iter/sec. FFTlen=30720K, Type=3, Arch=4, Pass1=2048, Pass2=15360, clm=4 (8 cpus, 1 worker): 28.82 ms. Throughput: 34.70 iter/sec. FFTlen=30720K, Type=3, Arch=4, Pass1=2048, Pass2=15360, clm=4 (8 cpus, 8 workers): 228.49, 225.73, 221.16, 220.60, 230.52, 227.01, 220.35, 222.42 ms. Throughput: 35.64 iter/sec. FFTlen=30720K, Type=3, Arch=4, Pass1=2048, Pass2=15360, clm=2 (8 cpus, 1 worker): 27.55 ms. Throughput: 36.30 iter/sec. FFTlen=30720K, Type=3, Arch=4, Pass1=2048, Pass2=15360, clm=2 (8 cpus, 8 workers): 222.18, 216.45, 213.04, 212.77, 221.10, 217.25, 214.60, 213.88 ms. Throughput: 36.98 iter/sec. FFTlen=30720K, Type=3, Arch=4, Pass1=2048, Pass2=15360, clm=1 (8 cpus, 1 worker): 26.15 ms. Throughput: 38.25 iter/sec. FFTlen=30720K, Type=3, Arch=4, Pass1=2048, Pass2=15360, clm=1 (8 cpus, 8 workers): 212.39, 209.07, 205.91, 208.30, 212.26, 210.31, 205.43, 205.51 ms. Throughput: 38.35 iter/sec. FFTlen=32000K, Type=3, Arch=4, Pass1=1280, Pass2=25600, clm=4 (8 cpus, 1 worker): 28.41 ms. Throughput: 35.20 iter/sec. FFTlen=32000K, Type=3, Arch=4, Pass1=1280, Pass2=25600, clm=4 (8 cpus, 8 workers): 223.85, 219.93, 216.77, 216.43, 227.07, 222.29, 222.80, 217.82 ms. Throughput: 36.23 iter/sec. FFTlen=32000K, Type=3, Arch=4, Pass1=1280, Pass2=25600, clm=2 (8 cpus, 1 worker): 26.47 ms. Throughput: 37.78 iter/sec. FFTlen=32000K, Type=3, Arch=4, Pass1=1280, Pass2=25600, clm=2 (8 cpus, 8 workers): 213.00, 208.97, 207.57, 203.86, 213.86, 209.37, 205.47, 205.44 ms. Throughput: 38.39 iter/sec. [Sat Apr 29 15:07:32 2017] FFTlen=32000K, Type=3, Arch=4, Pass1=1280, Pass2=25600, clm=1 (8 cpus, 1 worker): 25.92 ms. Throughput: 38.58 iter/sec. FFTlen=32000K, Type=3, Arch=4, Pass1=1280, Pass2=25600, clm=1 (8 cpus, 8 workers): 206.53, 202.61, 201.06, 201.00, 208.16, 203.84, 201.86, 200.32 ms. Throughput: 39.38 iter/sec. FFTlen=32768K, Type=3, Arch=4, Pass1=2048, Pass2=16384, clm=4 (8 cpus, 1 worker): 32.89 ms. Throughput: 30.40 iter/sec. FFTlen=32768K, Type=3, Arch=4, Pass1=2048, Pass2=16384, clm=4 (8 cpus, 8 workers): 260.56, 257.09, 252.64, 251.76, 263.23, 258.60, 251.44, 252.74 ms. Throughput: 31.26 iter/sec. FFTlen=32768K, Type=3, Arch=4, Pass1=2048, Pass2=16384, clm=2 (8 cpus, 1 worker): 34.94 ms. Throughput: 28.62 iter/sec. FFTlen=32768K, Type=3, Arch=4, Pass1=2048, Pass2=16384, clm=2 (8 cpus, 8 workers): 276.81, 271.01, 269.00, 266.93, 278.81, 274.18, 267.54, 270.79 ms. Throughput: 29.43 iter/sec. FFTlen=32768K, Type=3, Arch=4, Pass1=2048, Pass2=16384, clm=1 (8 cpus, 1 worker): 39.42 ms. Throughput: 25.37 iter/sec. FFTlen=32768K, Type=3, Arch=4, Pass1=2048, Pass2=16384, clm=1 (8 cpus, 8 workers): 304.98, 298.97, 295.85, 306.42, 305.02, 298.79, 292.21, 296.42 ms. Throughput: 26.69 iter/sec.