Advertisement
Not a member of Pastebin yet?
Sign Up,
it unlocks many cool features!
- 1) OS X 10.11.1 (15B42) 2.6 GHz Intel Core i7
- 2) ok
- 3) many warnings:
- g++ -O3 -march=native -std=c++11 -Wall best_chunk.cpp -o best_chunk fht.o
- g++: warning: couldn’t understand kern.osversion ‘15.0.0
- ld: warning: object file (/var/folders/bh/g93rjzn51b74cgmc6f75mlyw0000gn/T//ccBwysOD.o) was built for newer OSX version (10.11) than being linked (10.4)
- ld: warning: could not create compact unwind for measureTime(int, int, int, bool): does not use RBP or RSP based frame
- ld: warning: object file (fht.o) was built for newer OSX version (10.11) than being linked (10.4)
- ld: warning: could not create compact unwind for _FHTFloatIterativeHelper: does not use RBP or RSP based frame
- ld: warning: could not create compact unwind for _FHTDoubleIterativeHelper: register 3 saved somewhere other than in frame
- ld: warning: could not create compact unwind for _FHTFloatCombinedHelper: does not use RBP or RSP based frame
- ld: warning: could not create compact unwind for _FHTDoubleCombinedHelper: does not use RBP or RSP based frame
- ld: warning: could not create compact unwind for _FHTFloatIterative8HelperAVX: register 6 saved somewhere other than in frame
- ld: warning: could not create compact unwind for _FHTFloatIterative16HelperAVX: register 6 saved somewhere other than in frame
- ld: warning: could not create compact unwind for _FHTFloatIterative32HelperAVX: register 6 saved somewhere other than in frame
- ld: warning: could not create compact unwind for _FHTDoubleIterativeHelperAVX: does not use RBP or RSP based frame
- ld: warning: could not create compact unwind for _FHTDoubleCombinedHelperAVX: does not use RBP or RSP based frame
- ld: warning: could not create compact unwind for _FHTDouble: does not use RBP or RSP based frame
- ld: warning: could not create compact unwind for _FHTFloatNormalizeAVX: does not use RBP or RSP based frame
- ld: warning: could not create compact unwind for _FHTDoubleNormalizeAVX: does not use RBP or RSP based frame
- ld: warning: could not create compact unwind for _FHTDoubleCombinedAVX: does not use RBP or RSP based frame
- ld: warning: could not create compact unwind for _FHTFloatCombined: does not use RBP or RSP based frame
- ld: warning: could not create compact unwind for _FHTDoubleCombined: does not use RBP or RSP based frame
- ld: warning: could not create compact unwind for _FHTFloatIterativeLongHelperAVX: register 3 saved somewhere other than in frame
- ld: warning: could not create compact unwind for _FHTFloatIterativeHelperAVX: does not use RBP or RSP based frame
- ld: warning: could not create compact unwind for _FHTFloatCombinedHelperAVX: does not use RBP or RSP based frame
- ld: warning: could not create compact unwind for _FHTFloatCombinedAVX: does not use RBP or RSP based frame
- ld: warning: could not create compact unwind for _FHTFloat: does not use RBP or RSP based frame
- g++ -O3 -march=native -std=c++11 -Wall test.cpp -o test fht.o
- g++: warning: couldn’t understand kern.osversion ‘15.0.0
- ld: warning: object file (/var/folders/bh/g93rjzn51b74cgmc6f75mlyw0000gn/T//cczJdsf7.o) was built for newer OSX version (10.11) than being linked (10.4)
- ld: warning: object file (fht.o) was built for newer OSX version (10.11) than being linked (10.4)
- ld: warning: could not create compact unwind for void referenceFHTHelper<double>(double*, int) [clone .part.11]: does not use RBP or RSP based frame
- ld: warning: ld: warning: could not create compact unwind for void referenceFHTHelper<float>(float*, int) [clone .part.10]: does not use RBP or RSP based framecould not create compact unwind for _FHTFloatIterativeHelper: does not use RBP or RSP based frame
- ld: warning: ld: warning: could not create compact unwind for void referenceFHT<float>(float*, int): does not use RBP or RSP based framecould not create compact unwind for _FHTDoubleIterativeHelper: register 3 saved somewhere other than in frame
- ld: warning: ld: warning: could not create compact unwind for void referenceFHT<double>(double*, int): does not use RBP or RSP based framecould not create compact unwind for _FHTFloatCombinedHelper: does not use RBP or RSP based frame
- ld: warning: could not create compact unwind for _FHTDoubleCombinedHelper: does not use RBP or RSP based frame
- ld: warning: could not create compact unwind for _FHTFloatIterative8HelperAVX: register 6 saved somewhere other than in frame
- ld: warning: could not create compact unwind for _FHTFloatIterative16HelperAVX: register 6 saved somewhere other than in frame
- ld: warning: could not create compact unwind for _FHTFloatIterative32HelperAVX: register 6 saved somewhere other than in frame
- ld: warning: could not create compact unwind for _FHTDoubleIterativeHelperAVX: does not use RBP or RSP based frame
- ld: warning: could not create compact unwind for _FHTDoubleCombinedHelperAVX: does not use RBP or RSP based frame
- ld: warning: could not create compact unwind for _FHTDouble: does not use RBP or RSP based frame
- ld: warning: could not create compact unwind for _FHTFloatNormalizeAVX: does not use RBP or RSP based frame
- ld: warning: could not create compact unwind for _FHTDoubleNormalizeAVX: does not use RBP or RSP based frame
- ld: warning: could not create compact unwind for _FHTDoubleCombinedAVX: does not use RBP or RSP based frame
- ld: warning: could not create compact unwind for _FHTFloatCombined: does not use RBP or RSP based frame
- ld: warning: could not create compact unwind for _FHTDoubleCombined: does not use RBP or RSP based frame
- ld: warning: could not create compact unwind for _FHTFloatIterativeLongHelperAVX: register 3 saved somewhere other than in frame
- ld: warning: could not create compact unwind for _FHTFloatIterativeHelperAVX: does not use RBP or RSP based frame
- ld: warning: could not create compact unwind for _FHTFloatCombinedHelperAVX: does not use RBP or RSP based frame
- ld: warning: could not create compact unwind for _FHTFloatCombinedAVX: does not use RBP or RSP based frame
- ld: warning: could not create compact unwind for _FHTFloat: does not use RBP or RSP based frame
- 4) tests ok
- 5)
- n = 1024
- determining the best chunk size for float
- chunk size 8: 1.58373e-06
- chunk size 16: 1.2211e-06
- chunk size 32: 1.01729e-06
- chunk size 64: 8.80433e-07
- chunk size 128: 8.69654e-07
- chunk size 256: 7.92288e-07
- chunk size 512: 7.73384e-07
- chunk size 1024: 7.57544e-07
- best chunk: 1024
- best time: 7.57544e-07
- determining the best chunk size for double
- chunk size 8: 2.78142e-06
- chunk size 16: 1.90129e-06
- chunk size 32: 1.63022e-06
- chunk size 64: 1.44125e-06
- chunk size 128: 1.31193e-06
- chunk size 256: 1.3095e-06
- chunk size 512: 1.30715e-06
- chunk size 1024: 1.32411e-06
- best chunk: 512
- best time: 1.30715e-06
- n = 1048576
- determining the best chunk size for float
- chunk size 8: 0.00286267
- chunk size 16: 0.00255652
- chunk size 32: 0.00240716
- chunk size 64: 0.00247964
- chunk size 128: 0.00236718
- chunk size 256: 0.00231752
- chunk size 512: 0.00229667
- chunk size 1024: 0.00229837
- chunk size 2048: 0.00232201
- chunk size 4096: 0.00228597
- chunk size 8192: 0.00205678
- chunk size 16384: 0.00276382
- chunk size 32768: 0.00283525
- chunk size 65536: 0.00278609
- chunk size 131072: 0.0032337
- chunk size 262144: 0.00362588
- chunk size 524288: 0.00355522
- chunk size 1048576: 0.00355652
- best chunk: 8192
- best time: 0.00205678
- determining the best chunk size for double
- chunk size 8: 0.00625507
- chunk size 16: 0.0051595
- chunk size 32: 0.00476814
- chunk size 64: 0.00519182
- chunk size 128: 0.00490852
- chunk size 256: 0.00483421
- chunk size 512: 0.00521217
- chunk size 1024: 0.0050567
- chunk size 2048: 0.00514843
- chunk size 4096: 0.00498361
- chunk size 8192: 0.00577783
- chunk size 16384: 0.0057993
- chunk size 32768: 0.00681016
- chunk size 65536: 0.0073452
- chunk size 131072: 0.00727893
- chunk size 262144: 0.00694342
- chunk size 524288: 0.0083042
- chunk size 1048576: 0.00787831
- best chunk: 32
- best time: 0.00476814
- n = 134217728
- determining the best chunk size for float
- chunk size 8: 0.755426
- chunk size 16: 0.757058
- chunk size 32: 0.733649
- chunk size 64: 0.705
- chunk size 128: 0.678706
- chunk size 256: 0.668905
- chunk size 512: 0.699621
- chunk size 1024: 0.675661
- chunk size 2048: 0.717759
- chunk size 4096: 0.714223
- chunk size 8192: 0.712501
- chunk size 16384: 0.856925
- chunk size 32768: 0.846723
- chunk size 65536: 0.769211
- chunk size 131072: 0.92012
- chunk size 262144: 0.82591
- chunk size 524288: 0.807856
- chunk size 1048576: 0.874469
- chunk size 2097152: 0.938643
- chunk size 4194304: 0.989538
- chunk size 8388608: 1.0535
- chunk size 16777216: 1.0475
- chunk size 33554432: 1.33383
- chunk size 67108864: 1.84095
- chunk size 134217728: 1.65796
- best chunk: 256
- best time: 0.668905
- determining the best chunk size for double
- chunk size 8: 1.61663
- chunk size 16: 1.59505
- chunk size 32: 1.55713
- chunk size 64: 1.41474
- chunk size 128: 1.46516
- chunk size 256: 1.62923
- chunk size 512: 1.45572
- chunk size 1024: 1.47333
- chunk size 2048: 1.48651
- chunk size 4096: 1.51168
- chunk size 8192: 1.62347
- chunk size 16384: 1.60903
- chunk size 32768: 1.67903
- chunk size 65536: 1.76872
- chunk size 131072: 1.76537
- chunk size 262144: 1.775
- chunk size 524288: 1.81351
- chunk size 1048576: 2.03947
- chunk size 2097152: 2.24668
- chunk size 4194304: 2.34219
- chunk size 8388608: 2.22652
- chunk size 16777216: 2.66225
- chunk size 33554432: 3.44465
- chunk size 67108864: 3.43289
- chunk size 134217728: 3.4003
- best chunk: 64
- best time: 1.41474
Advertisement
Add Comment
Please, Sign In to add comment
Advertisement