- WARNING: Logging before InitGoogleLogging() is written to STDERR
- I0622 04:49:41.355347 15050 SpatialConvolutionCuFFTTuner.cpp:160] START exploring FFT perf for pass = updateOutput (b x p x f) = 128x64x192 (input rows x cols) = 31x31 (filter rows x cols) = 5x5 (common rows x cols) = 31x31
- I0622 04:49:42.396143 15050 SpatialConvolutionCuFFTTuner.cpp:115] Found best cufft result Buffer=375.39M strategy = (FBMM cuFFT) (b x p x f) = 128x64x192 (input rows x cols) = 31x31 (filter rows x cols) = 5x5 (common rows x cols) = 32x32 GReductions(virtual fmas)/s = 3922.87 time = 9.63ms
- I0622 04:49:42.396190 15050 SpatialConvolutionCuFFTTuner.cpp:119] Found best fbfft result Buffer=291.50M strategy = (FBMM FBFFT) (b x p x f) = 128x64x192 (input rows x cols) = 31x31 (filter rows x cols) = 5x5 (common rows x cols) = 32x32 GReductions(virtual fmas)/s = 7055.91 time = 5.36ms
- I0622 04:49:42.396198 15050 SpatialConvolutionCuFFTTuner.cpp:166] Found best result Buffer=291.50M strategy = (FBMM FBFFT) (b x p x f) = 128x64x192 (input rows x cols) = 31x31 (filter rows x cols) = 5x5 (common rows x cols) = 32x32 GReductions(virtual fmas)/s = 7055.91 time = 5.36ms
- I0622 04:49:42.396206 15050 SpatialConvolutionCuFFTTuner.cpp:160] START exploring FFT perf for pass = updateGradInput (b x p x f) = 128x64x192 (input rows x cols) = 31x31 (filter rows x cols) = 5x5 (common rows x cols) = 31x31
- I0622 04:49:43.253260 15050 SpatialConvolutionCuFFTTuner.cpp:115] Found best cufft result Buffer=380.63M strategy = (FBMM cuFFT) (b x p x f) = 128x64x192 (input rows x cols) = 31x31 (filter rows x cols) = 5x5 (common rows x cols) = 32x32 GReductions(virtual fmas)/s = 4029.53 time = 9.38ms
- I0622 04:49:43.253279 15050 SpatialConvolutionCuFFTTuner.cpp:119] Found best fbfft result Buffer=229.64M strategy = (FBMM FBFFT) (b x p x f) = 128x64x192 (input rows x cols) = 31x31 (filter rows x cols) = 5x5 (common rows x cols) = 32x32 GReductions(virtual fmas)/s = 7818.31 time = 4.83ms
- I0622 04:49:43.253286 15050 SpatialConvolutionCuFFTTuner.cpp:166] Found best result Buffer=229.64M strategy = (FBMM FBFFT) (b x p x f) = 128x64x192 (input rows x cols) = 31x31 (filter rows x cols) = 5x5 (common rows x cols) = 32x32 GReductions(virtual fmas)/s = 7818.31 time = 4.83ms
- I0622 04:49:43.253293 15050 SpatialConvolutionCuFFTTuner.cpp:160] START exploring FFT perf for pass = accGradParameters(b x p x f) = 128x64x192 (input rows x cols) = 31x31 (filter rows x cols) = 5x5 (common rows x cols) = 31x31
- I0622 04:49:44.110939 15050 SpatialConvolutionCuFFTTuner.cpp:115] Found best cufft result Buffer=378.54M strategy = (FBMM cuFFT) (b x p x f) = 128x64x192 (input rows x cols) = 31x31 (filter rows x cols) = 5x5 (common rows x cols) = 32x32 GReductions(virtual fmas)/s = 3798.11 time = 9.95ms
- I0622 04:49:44.110967 15050 SpatialConvolutionCuFFTTuner.cpp:119] Found best fbfft result Buffer=244.32M strategy = (FBMM FBFFT) (b x p x f) = 128x64x192 (input rows x cols) = 31x31 (filter rows x cols) = 5x5 (common rows x cols) = 32x32 GReductions(virtual fmas)/s = 7023.68 time = 5.38ms
- I0622 04:49:44.110973 15050 SpatialConvolutionCuFFTTuner.cpp:166] Found best result Buffer=244.32M strategy = (FBMM FBFFT) (b x p x f) = 128x64x192 (input rows x cols) = 31x31 (filter rows x cols) = 5x5 (common rows x cols) = 32x32 GReductions(virtual fmas)/s = 7023.68 time = 5.38ms
- I0622 04:49:44.122201 15050 SpatialConvolutionCuFFTTuner.cpp:160] START exploring FFT perf for pass = updateOutput (b x p x f) = 128x192x384 (input rows x cols) = 15x15 (filter rows x cols) = 3x3 (common rows x cols) = 15x15
- I0622 04:49:44.674590 15050 SpatialConvolutionCuFFTTuner.cpp:115] Found best cufft result Buffer=297.14M strategy = (Many cuFFT) (b x p x f) = 128x192x384 (input rows x cols) = 15x15 (filter rows x cols) = 3x3 (common rows x cols) = 15x16 GReductions(virtual fmas)/s = 2129.20 time = 8.98ms
- I0622 04:49:44.674612 15050 SpatialConvolutionCuFFTTuner.cpp:119] Found best fbfft result Buffer=214.96M strategy = (Many FBFFT) (b x p x f) = 128x192x384 (input rows x cols) = 15x15 (filter rows x cols) = 3x3 (common rows x cols) = 16x16 GReductions(virtual fmas)/s = 3435.64 time = 5.56ms
- I0622 04:49:44.674635 15050 SpatialConvolutionCuFFTTuner.cpp:166] Found best result Buffer=214.96M strategy = (Many FBFFT) (b x p x f) = 128x192x384 (input rows x cols) = 15x15 (filter rows x cols) = 3x3 (common rows x cols) = 16x16 GReductions(virtual fmas)/s = 3435.64 time = 5.56ms
- I0622 04:49:44.674643 15050 SpatialConvolutionCuFFTTuner.cpp:160] START exploring FFT perf for pass = updateGradInput (b x p x f) = 128x192x384 (input rows x cols) = 15x15 (filter rows x cols) = 3x3 (common rows x cols) = 15x15
- I0622 04:49:45.223516 15050 SpatialConvolutionCuFFTTuner.cpp:115] Found best cufft result Buffer=298.98M strategy = (Many cuFFT) (b x p x f) = 128x192x384 (input rows x cols) = 15x15 (filter rows x cols) = 3x3 (common rows x cols) = 15x16 GReductions(virtual fmas)/s = 2229.91 time = 8.57ms
- I0622 04:49:45.223539 15050 SpatialConvolutionCuFFTTuner.cpp:119] Found best fbfft result Buffer=193.99M strategy = (Many FBFFT) (b x p x f) = 128x192x384 (input rows x cols) = 15x15 (filter rows x cols) = 3x3 (common rows x cols) = 16x16 GReductions(virtual fmas)/s = 3793.94 time = 5.04ms
- I0622 04:49:45.223546 15050 SpatialConvolutionCuFFTTuner.cpp:166] Found best result Buffer=193.99M strategy = (Many FBFFT) (b x p x f) = 128x192x384 (input rows x cols) = 15x15 (filter rows x cols) = 3x3 (common rows x cols) = 16x16 GReductions(virtual fmas)/s = 3793.94 time = 5.04ms
- I0622 04:49:45.223553 15050 SpatialConvolutionCuFFTTuner.cpp:160] START exploring FFT perf for pass = accGradParameters(b x p x f) = 128x192x384 (input rows x cols) = 15x15 (filter rows x cols) = 3x3 (common rows x cols) = 15x15
- I0622 04:49:45.742804 15050 SpatialConvolutionCuFFTTuner.cpp:115] Found best cufft result Buffer=294.13M strategy = (Many cuFFT) (b x p x f) = 128x192x384 (input rows x cols) = 15x15 (filter rows x cols) = 3x3 (common rows x cols) = 15x16 GReductions(virtual fmas)/s = 2229.14 time = 8.57ms
- I0622 04:49:45.742831 15050 SpatialConvolutionCuFFTTuner.cpp:119] Found best fbfft result Buffer=236.98M strategy = (Many FBFFT) (b x p x f) = 128x192x384 (input rows x cols) = 15x15 (filter rows x cols) = 3x3 (common rows x cols) = 16x16 GReductions(virtual fmas)/s = 3602.78 time = 5.30ms
- I0622 04:49:45.742838 15050 SpatialConvolutionCuFFTTuner.cpp:166] Found best result Buffer=236.98M strategy = (Many FBFFT) (b x p x f) = 128x192x384 (input rows x cols) = 15x15 (filter rows x cols) = 3x3 (common rows x cols) = 16x16 GReductions(virtual fmas)/s = 3602.78 time = 5.30ms
- I0622 04:49:45.755329 15050 SpatialConvolutionCuFFTTuner.cpp:160] START exploring FFT perf for pass = updateOutput (b x p x f) = 128x384x256 (input rows x cols) = 15x15 (filter rows x cols) = 3x3 (common rows x cols) = 15x15
- I0622 04:49:46.436781 15050 SpatialConvolutionCuFFTTuner.cpp:115] Found best cufft result Buffer=364.77M strategy = (Many cuFFT) (b x p x f) = 128x384x256 (input rows x cols) = 15x15 (filter rows x cols) = 3x3 (common rows x cols) = 15x16 GReductions(virtual fmas)/s = 2356.80 time = 10.81ms
- I0622 04:49:46.436800 15050 SpatialConvolutionCuFFTTuner.cpp:119] Found best fbfft result Buffer=239.08M strategy = (Many FBFFT) (b x p x f) = 128x384x256 (input rows x cols) = 15x15 (filter rows x cols) = 3x3 (common rows x cols) = 16x16 GReductions(virtual fmas)/s = 3888.00 time = 6.55ms
- I0622 04:49:46.436807 15050 SpatialConvolutionCuFFTTuner.cpp:166] Found best result Buffer=239.08M strategy = (Many FBFFT) (b x p x f) = 128x384x256 (input rows x cols) = 15x15 (filter rows x cols) = 3x3 (common rows x cols) = 16x16 GReductions(virtual fmas)/s = 3888.00 time = 6.55ms
- I0622 04:49:46.436815 15050 SpatialConvolutionCuFFTTuner.cpp:160] START exploring FFT perf for pass = updateGradInput (b x p x f) = 128x384x256 (input rows x cols) = 15x15 (filter rows x cols) = 3x3 (common rows x cols) = 15x15
- I0622 04:49:47.104396 15050 SpatialConvolutionCuFFTTuner.cpp:115] Found best cufft result Buffer=363.86M strategy = (Many cuFFT) (b x p x f) = 128x384x256 (input rows x cols) = 15x15 (filter rows x cols) = 3x3 (common rows x cols) = 15x16 GReductions(virtual fmas)/s = 2417.25 time = 10.54ms
- I0622 04:49:47.104432 15050 SpatialConvolutionCuFFTTuner.cpp:119] Found best fbfft result Buffer=252.71M strategy = (Many FBFFT) (b x p x f) = 128x384x256 (input rows x cols) = 15x15 (filter rows x cols) = 3x3 (common rows x cols) = 16x16 GReductions(virtual fmas)/s = 4025.10 time = 6.33ms
- I0622 04:49:47.104439 15050 SpatialConvolutionCuFFTTuner.cpp:166] Found best result Buffer=252.71M strategy = (Many FBFFT) (b x p x f) = 128x384x256 (input rows x cols) = 15x15 (filter rows x cols) = 3x3 (common rows x cols) = 16x16 GReductions(virtual fmas)/s = 4025.10 time = 6.33ms
- I0622 04:49:47.104447 15050 SpatialConvolutionCuFFTTuner.cpp:160] START exploring FFT perf for pass = accGradParameters(b x p x f) = 128x384x256 (input rows x cols) = 15x15 (filter rows x cols) = 3x3 (common rows x cols) = 15x15
- I0622 04:49:47.732851 15050 SpatialConvolutionCuFFTTuner.cpp:115] Found best cufft result Buffer=356.91M strategy = (Many cuFFT) (b x p x f) = 128x384x256 (input rows x cols) = 15x15 (filter rows x cols) = 3x3 (common rows x cols) = 15x16 GReductions(virtual fmas)/s = 2458.81 time = 10.36ms
- I0622 04:49:47.732870 15050 SpatialConvolutionCuFFTTuner.cpp:119] Found best fbfft result Buffer=297.80M strategy = (Many FBFFT) (b x p x f) = 128x384x256 (input rows x cols) = 15x15 (filter rows x cols) = 3x3 (common rows x cols) = 16x16 GReductions(virtual fmas)/s = 3869.26 time = 6.59ms
- I0622 04:49:47.732877 15050 SpatialConvolutionCuFFTTuner.cpp:166] Found best result Buffer=297.80M strategy = (Many FBFFT) (b x p x f) = 128x384x256 (input rows x cols) = 15x15 (filter rows x cols) = 3x3 (common rows x cols) = 16x16 GReductions(virtual fmas)/s = 3869.26 time = 6.59ms
- I0622 04:49:47.740394 15050 SpatialConvolutionCuFFTTuner.cpp:160] START exploring FFT perf for pass = updateOutput (b x p x f) = 128x256x256 (input rows x cols) = 15x15 (filter rows x cols) = 3x3 (common rows x cols) = 15x15
- I0622 04:49:48.245391 15050 SpatialConvolutionCuFFTTuner.cpp:115] Found best cufft result Buffer=265.55M strategy = (Many cuFFT) (b x p x f) = 128x256x256 (input rows x cols) = 15x15 (filter rows x cols) = 3x3 (common rows x cols) = 15x16 GReductions(virtual fmas)/s = 2149.08 time = 7.90ms
- I0622 04:49:48.245411 15050 SpatialConvolutionCuFFTTuner.cpp:119] Found best fbfft result Buffer=181.40M strategy = (Many FBFFT) (b x p x f) = 128x256x256 (input rows x cols) = 15x15 (filter rows x cols) = 3x3 (common rows x cols) = 16x16 GReductions(virtual fmas)/s = 3532.56 time = 4.81ms
- I0622 04:49:48.245419 15050 SpatialConvolutionCuFFTTuner.cpp:166] Found best result Buffer=181.40M strategy = (Many FBFFT) (b x p x f) = 128x256x256 (input rows x cols) = 15x15 (filter rows x cols) = 3x3 (common rows x cols) = 16x16 GReductions(virtual fmas)/s = 3532.56 time = 4.81ms
- I0622 04:49:48.245425 15050 SpatialConvolutionCuFFTTuner.cpp:160] START exploring FFT perf for pass = updateGradInput (b x p x f) = 128x256x256 (input rows x cols) = 15x15 (filter rows x cols) = 3x3 (common rows x cols) = 15x15
- I0622 04:49:48.742132 15050 SpatialConvolutionCuFFTTuner.cpp:115] Found best cufft result Buffer=264.50M strategy = (Many cuFFT) (b x p x f) = 128x256x256 (input rows x cols) = 15x15 (filter rows x cols) = 3x3 (common rows x cols) = 15x16 GReductions(virtual fmas)/s = 2263.76 time = 7.50ms
- I0622 04:49:48.742156 15050 SpatialConvolutionCuFFTTuner.cpp:119] Found best fbfft result Buffer=182.45M strategy = (Many FBFFT) (b x p x f) = 128x256x256 (input rows x cols) = 15x15 (filter rows x cols) = 3x3 (common rows x cols) = 16x16 GReductions(virtual fmas)/s = 3870.46 time = 4.39ms
- I0622 04:49:48.742163 15050 SpatialConvolutionCuFFTTuner.cpp:166] Found best result Buffer=182.45M strategy = (Many FBFFT) (b x p x f) = 128x256x256 (input rows x cols) = 15x15 (filter rows x cols) = 3x3 (common rows x cols) = 16x16 GReductions(virtual fmas)/s = 3870.46 time = 4.39ms
- I0622 04:49:48.742197 15050 SpatialConvolutionCuFFTTuner.cpp:160] START exploring FFT perf for pass = accGradParameters(b x p x f) = 128x256x256 (input rows x cols) = 15x15 (filter rows x cols) = 3x3 (common rows x cols) = 15x15
- I0622 04:49:49.215654 15050 SpatialConvolutionCuFFTTuner.cpp:115] Found best cufft result Buffer=261.62M strategy = (Many cuFFT) (b x p x f) = 128x256x256 (input rows x cols) = 15x15 (filter rows x cols) = 3x3 (common rows x cols) = 15x16 GReductions(virtual fmas)/s = 2293.49 time = 7.41ms
- I0622 04:49:49.215674 15050 SpatialConvolutionCuFFTTuner.cpp:119] Found best fbfft result Buffer=210.76M strategy = (Many FBFFT) (b x p x f) = 128x256x256 (input rows x cols) = 15x15 (filter rows x cols) = 3x3 (common rows x cols) = 16x16 GReductions(virtual fmas)/s = 3709.48 time = 4.58ms
- I0622 04:49:49.215682 15050 SpatialConvolutionCuFFTTuner.cpp:166] Found best result Buffer=210.76M strategy = (Many FBFFT) (b x p x f) = 128x256x256 (input rows x cols) = 15x15 (filter rows x cols) = 3x3 (common rows x cols) = 16x16 GReductions(virtual fmas)/s = 3709.48 time = 4.58ms
- I0622 04:50:25.343171 15050 SpatialConvolutionCuFFTTuner.cpp:160] START exploring FFT perf for pass = updateOutput (b x p x f) = 64x3x64 (input rows x cols) = 226x226 (filter rows x cols) = 3x3 (common rows x cols) = 226x226
- I0622 04:50:25.945965 15050 SpatialConvolutionCuFFTTuner.cpp:119] Found best fbfft result Buffer=1952.87M strategy = (Batch FBFFT) (b x p x f) = 64x3x64 (input rows x cols) = 226x226 (filter rows x cols) = 3x3 (common rows x cols) = 256x256 GReductions(virtual fmas)/s = 210.08 time = 26.89ms
- I0622 04:50:25.945983 15050 SpatialConvolutionCuFFTTuner.cpp:166] Found best result Buffer=1952.87M strategy = (Batch FBFFT) (b x p x f) = 64x3x64 (input rows x cols) = 226x226 (filter rows x cols) = 3x3 (common rows x cols) = 256x256 GReductions(virtual fmas)/s = 210.08 time = 26.89ms
- I0622 04:50:25.945996 15050 SpatialConvolutionCuFFTTuner.cpp:160] START exploring FFT perf for pass = updateGradInput (b x p x f) = 64x3x64 (input rows x cols) = 226x226 (filter rows x cols) = 3x3 (common rows x cols) = 226x226
- I0622 04:50:26.491912 15050 SpatialConvolutionCuFFTTuner.cpp:119] Found best fbfft result Buffer=1243.35M strategy = (Batch FBFFT) (b x p x f) = 64x3x64 (input rows x cols) = 226x226 (filter rows x cols) = 3x3 (common rows x cols) = 256x256 GReductions(virtual fmas)/s = 334.48 time = 16.89ms
- I0622 04:50:26.491930 15050 SpatialConvolutionCuFFTTuner.cpp:166] Found best result Buffer=1243.35M strategy = (Batch FBFFT) (b x p x f) = 64x3x64 (input rows x cols) = 226x226 (filter rows x cols) = 3x3 (common rows x cols) = 256x256 GReductions(virtual fmas)/s = 334.48 time = 16.89ms
- I0622 04:50:26.491941 15050 SpatialConvolutionCuFFTTuner.cpp:160] START exploring FFT perf for pass = accGradParameters(b x p x f) = 64x3x64 (input rows x cols) = 226x226 (filter rows x cols) = 3x3 (common rows x cols) = 226x226
- I0622 04:50:27.077749 15050 SpatialConvolutionCuFFTTuner.cpp:119] Found best fbfft result Buffer=1241.25M strategy = (Batch FBFFT) (b x p x f) = 64x3x64 (input rows x cols) = 226x226 (filter rows x cols) = 3x3 (common rows x cols) = 256x256 GReductions(virtual fmas)/s = 237.71 time = 23.76ms
- I0622 04:50:27.077766 15050 SpatialConvolutionCuFFTTuner.cpp:166] Found best result Buffer=1241.25M strategy = (Batch FBFFT) (b x p x f) = 64x3x64 (input rows x cols) = 226x226 (filter rows x cols) = 3x3 (common rows x cols) = 256x256 GReductions(virtual fmas)/s = 237.71 time = 23.76ms
- /home/uname/torch/install/bin/luajit: ...ra/torch/install/share/lua/5.1/nn/SpatialZeroPadding.lua:78: /home/uname/torch/extra/cutorch/lib/THC/THCStorage.cu(30) : cuda runtime error (2) : out of memory at /home/uname/torch/extra/cutorch/lib/THC/THCGeneral.c:241
- stack traceback:
- [C]: in function 'resizeAs'
- ...ra/torch/install/share/lua/5.1/nn/SpatialZeroPadding.lua:78: in function 'updateGradInput'
- /home/uname/torch/install/share/lua/5.1/nn/Sequential.lua:40: in function 'updateGradInput'
- /home/uname/torch/install/share/lua/5.1/nn/Sequential.lua:43: in function 'updateGradInput'
- benchmark.lua:49: in main chunk
- [C]: in function 'dofile'
- ..uname/torch/install/lib/luarocks/rocks/trepl/scm-1/bin/th:131: in main chunk
- [C]: at 0x00406670