Advertisement
Guest User

Untitled

a guest
May 2nd, 2024
40
0
Never
Not a member of Pastebin yet? Sign Up, it unlocks many cool features!
text 27.39 KB | None | 0 0
  1. 2024-05-01 13:24:06.597101: E external/local_xla/xla/stream_executor/cuda/cuda_dnn.cc:9261] Unable to register cuDNN factory: Attempting to register factory for plugin cuDNN when one has already been registered
  2. 2024-05-01 13:24:06.597152: E external/local_xla/xla/stream_executor/cuda/cuda_fft.cc:607] Unable to register cuFFT factory: Attempting to register factory for plugin cuFFT when one has already been registered
  3. 2024-05-01 13:24:06.598624: E external/local_xla/xla/stream_executor/cuda/cuda_blas.cc:1515] Unable to register cuBLAS factory: Attempting to register factory for plugin cuBLAS when one has already been registered
  4. 2024-05-01 13:24:06.606296: I tensorflow/core/platform/cpu_feature_guard.cc:182] This TensorFlow binary is optimized to use available CPU instructions in performance-critical operations.
  5. To enable the following instructions: AVX2 FMA, in other operations, rebuild TensorFlow with the appropriate compiler flags.
  6. 2024-05-01 13:24:07.720774: W tensorflow/compiler/tf2tensorrt/utils/py_utils.cc:38] TF-TRT Warning: Could not find TensorRT
  7. 2024-05-01 13:24:09.758749: I external/local_xla/xla/stream_executor/cuda/cuda_executor.cc:901] successful NUMA node read from SysFS had negative value (-1), but there must be at least one NUMA node, so returning NUMA node zero. See more at https://github.com/torvalds/linux/blob/v6.0/Documentation/ABI/testing/sysfs-bus-pci#L344-L355
  8. 2024-05-01 13:24:09.812354: I external/local_xla/xla/stream_executor/cuda/cuda_executor.cc:901] successful NUMA node read from SysFS had negative value (-1), but there must be at least one NUMA node, so returning NUMA node zero. See more at https://github.com/torvalds/linux/blob/v6.0/Documentation/ABI/testing/sysfs-bus-pci#L344-L355
  9. 2024-05-01 13:24:09.812669: I external/local_xla/xla/stream_executor/cuda/cuda_executor.cc:901] successful NUMA node read from SysFS had negative value (-1), but there must be at least one NUMA node, so returning NUMA node zero. See more at https://github.com/torvalds/linux/blob/v6.0/Documentation/ABI/testing/sysfs-bus-pci#L344-L355
  10. GPUs available: 1
  11. CPUs available: 1
  12. Loading dataset...
  13. Loaded: 2900 samples (112.971s)
  14. Defining model...
  15. 2024-05-01 13:26:02.826919: I external/local_xla/xla/stream_executor/cuda/cuda_executor.cc:901] successful NUMA node read from SysFS had negative value (-1), but there must be at least one NUMA node, so returning NUMA node zero. See more at https://github.com/torvalds/linux/blob/v6.0/Documentation/ABI/testing/sysfs-bus-pci#L344-L355
  16. 2024-05-01 13:26:02.827341: I external/local_xla/xla/stream_executor/cuda/cuda_executor.cc:901] successful NUMA node read from SysFS had negative value (-1), but there must be at least one NUMA node, so returning NUMA node zero. See more at https://github.com/torvalds/linux/blob/v6.0/Documentation/ABI/testing/sysfs-bus-pci#L344-L355
  17. 2024-05-01 13:26:02.827619: I external/local_xla/xla/stream_executor/cuda/cuda_executor.cc:901] successful NUMA node read from SysFS had negative value (-1), but there must be at least one NUMA node, so returning NUMA node zero. See more at https://github.com/torvalds/linux/blob/v6.0/Documentation/ABI/testing/sysfs-bus-pci#L344-L355
  18. 2024-05-01 13:26:03.225394: I external/local_xla/xla/stream_executor/cuda/cuda_executor.cc:901] successful NUMA node read from SysFS had negative value (-1), but there must be at least one NUMA node, so returning NUMA node zero. See more at https://github.com/torvalds/linux/blob/v6.0/Documentation/ABI/testing/sysfs-bus-pci#L344-L355
  19. 2024-05-01 13:26:03.225736: I external/local_xla/xla/stream_executor/cuda/cuda_executor.cc:901] successful NUMA node read from SysFS had negative value (-1), but there must be at least one NUMA node, so returning NUMA node zero. See more at https://github.com/torvalds/linux/blob/v6.0/Documentation/ABI/testing/sysfs-bus-pci#L344-L355
  20. 2024-05-01 13:26:03.225950: W tensorflow/core/common_runtime/gpu/gpu_bfc_allocator.cc:47] Overriding orig_value setting because the TF_FORCE_GPU_ALLOW_GROWTH environment variable is set. Original config value was 0.
  21. 2024-05-01 13:26:03.226042: I external/local_xla/xla/stream_executor/cuda/cuda_executor.cc:901] successful NUMA node read from SysFS had negative value (-1), but there must be at least one NUMA node, so returning NUMA node zero. See more at https://github.com/torvalds/linux/blob/v6.0/Documentation/ABI/testing/sysfs-bus-pci#L344-L355
  22. 2024-05-01 13:26:03.226246: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1929] Created device /job:localhost/replica:0/task:0/device:GPU:0 with 13949 MB memory: -> device: 0, name: Tesla T4, pci bus id: 0000:00:04.0, compute capability: 7.5
  23. Model: "SegNet"
  24. ┏━━━━━━━━━━━━━━━━━━━━━━━━━━━┳━━━━━━━━━━━━━━━━━━━━━━━━┳━━━━━━━━━━━━━━━━┳━━━━━━━━━━━━━━━━━━━━━━━━┓
  25. ┃ Layer (type) ┃ Output Shape ┃ Param # ┃ Connected to ┃
  26. ┡━━━━━━━━━━━━━━━━━━━━━━━━━━━╇━━━━━━━━━━━━━━━━━━━━━━━━╇━━━━━━━━━━━━━━━━╇━━━━━━━━━━━━━━━━━━━━━━━━┩
  27. │ input (InputLayer) │ (None, None, None, 3) │ 0 │ - │
  28. ├───────────────────────────┼────────────────────────┼────────────────┼────────────────────────┤
  29. │ en_conv1_1 (Conv2DWithBN) │ (None, None, None, 64) │ 2,048 │ input[0][0] │
  30. ├───────────────────────────┼────────────────────────┼────────────────┼────────────────────────┤
  31. │ en_conv1_2 (Conv2DWithBN) │ (None, None, None, 64) │ 37,184 │ en_conv1_1[0][0] │
  32. ├───────────────────────────┼────────────────────────┼────────────────┼────────────────────────┤
  33. │ max_pool_with_argmax │ [(None, None, None, │ 0 │ en_conv1_2[0][0] │
  34. │ (MaxPoolWithArgmax) │ 64), (None, None, │ │ │
  35. │ │ None, 64)] │ │ │
  36. ├───────────────────────────┼────────────────────────┼────────────────┼────────────────────────┤
  37. │ en_conv2_1 (Conv2DWithBN) │ (None, None, None, │ 74,368 │ max_pool_with_argmax[… │
  38. │ │ 128) │ │ │
  39. ├───────────────────────────┼────────────────────────┼────────────────┼────────────────────────┤
  40. │ en_conv2_2 (Conv2DWithBN) │ (None, None, None, │ 148,096 │ en_conv2_1[0][0] │
  41. │ │ 128) │ │ │
  42. ├───────────────────────────┼────────────────────────┼────────────────┼────────────────────────┤
  43. │ max_pool_with_argmax_1 │ [(None, None, None, │ 0 │ en_conv2_2[0][0] │
  44. │ (MaxPoolWithArgmax) │ 128), (None, None, │ │ │
  45. │ │ None, 128)] │ │ │
  46. ├───────────────────────────┼────────────────────────┼────────────────┼────────────────────────┤
  47. │ en_conv3_1 (Conv2DWithBN) │ (None, None, None, │ 296,192 │ max_pool_with_argmax_… │
  48. │ │ 256) │ │ │
  49. ├───────────────────────────┼────────────────────────┼────────────────┼────────────────────────┤
  50. │ en_conv3_2 (Conv2DWithBN) │ (None, None, None, │ 591,104 │ en_conv3_1[0][0] │
  51. │ │ 256) │ │ │
  52. ├───────────────────────────┼────────────────────────┼────────────────┼────────────────────────┤
  53. │ en_conv3_3 (Conv2DWithBN) │ (None, None, None, │ 591,104 │ en_conv3_2[0][0] │
  54. │ │ 256) │ │ │
  55. ├───────────────────────────┼────────────────────────┼────────────────┼────────────────────────┤
  56. │ max_pool_with_argmax_2 │ [(None, None, None, │ 0 │ en_conv3_3[0][0] │
  57. │ (MaxPoolWithArgmax) │ 256), (None, None, │ │ │
  58. │ │ None, 256)] │ │ │
  59. ├───────────────────────────┼────────────────────────┼────────────────┼────────────────────────┤
  60. │ en_conv4_1 (Conv2DWithBN) │ (None, None, None, │ 1,182,208 │ max_pool_with_argmax_… │
  61. │ │ 512) │ │ │
  62. ├───────────────────────────┼────────────────────────┼────────────────┼────────────────────────┤
  63. │ en_conv4_2 (Conv2DWithBN) │ (None, None, None, │ 2,361,856 │ en_conv4_1[0][0] │
  64. │ │ 512) │ │ │
  65. ├───────────────────────────┼────────────────────────┼────────────────┼────────────────────────┤
  66. │ en_conv4_3 (Conv2DWithBN) │ (None, None, None, │ 2,361,856 │ en_conv4_2[0][0] │
  67. │ │ 512) │ │ │
  68. ├───────────────────────────┼────────────────────────┼────────────────┼────────────────────────┤
  69. │ max_pool_with_argmax_3 │ [(None, None, None, │ 0 │ en_conv4_3[0][0] │
  70. │ (MaxPoolWithArgmax) │ 512), (None, None, │ │ │
  71. │ │ None, 512)] │ │ │
  72. ├───────────────────────────┼────────────────────────┼────────────────┼────────────────────────┤
  73. │ de_deconv1_1 │ (None, None, None, │ 2,361,856 │ max_pool_with_argmax_… │
  74. │ (Conv2DTransposeWithBN) │ 512) │ │ │
  75. ├───────────────────────────┼────────────────────────┼────────────────┼────────────────────────┤
  76. │ max_unpooling2d │ (None, None, None, │ 0 │ de_deconv1_1[0][0], │
  77. │ (MaxUnpooling2D) │ 512) │ │ max_pool_with_argmax_… │
  78. ├───────────────────────────┼────────────────────────┼────────────────┼────────────────────────┤
  79. │ de_deconv1_2 │ (None, None, None, │ 2,361,856 │ max_unpooling2d[0][0] │
  80. │ (Conv2DWithBN) │ 512) │ │ │
  81. ├───────────────────────────┼────────────────────────┼────────────────┼────────────────────────┤
  82. │ de_deconv1_3 │ (None, None, None, │ 2,361,856 │ de_deconv1_2[0][0] │
  83. │ (Conv2DWithBN) │ 512) │ │ │
  84. ├───────────────────────────┼────────────────────────┼────────────────┼────────────────────────┤
  85. │ de_deconv2_1 │ (None, None, None, │ 1,180,928 │ de_deconv1_3[0][0] │
  86. │ (Conv2DTransposeWithBN) │ 256) │ │ │
  87. ├───────────────────────────┼────────────────────────┼────────────────┼────────────────────────┤
  88. │ max_unpooling2d_1 │ (None, None, None, │ 0 │ de_deconv2_1[0][0], │
  89. │ (MaxUnpooling2D) │ 256) │ │ max_pool_with_argmax_… │
  90. ├───────────────────────────┼────────────────────────┼────────────────┼────────────────────────┤
  91. │ de_deconv2_2 │ (None, None, None, │ 591,104 │ max_unpooling2d_1[0][… │
  92. │ (Conv2DWithBN) │ 256) │ │ │
  93. ├───────────────────────────┼────────────────────────┼────────────────┼────────────────────────┤
  94. │ de_deconv2_3 │ (None, None, None, │ 591,104 │ de_deconv2_2[0][0] │
  95. │ (Conv2DWithBN) │ 256) │ │ │
  96. ├───────────────────────────┼────────────────────────┼────────────────┼────────────────────────┤
  97. │ de_deconv3_1 │ (None, None, None, │ 295,552 │ de_deconv2_3[0][0] │
  98. │ (Conv2DTransposeWithBN) │ 128) │ │ │
  99. ├───────────────────────────┼────────────────────────┼────────────────┼────────────────────────┤
  100. │ max_unpooling2d_2 │ (None, None, None, │ 0 │ de_deconv3_1[0][0], │
  101. │ (MaxUnpooling2D) │ 128) │ │ max_pool_with_argmax_… │
  102. ├───────────────────────────┼────────────────────────┼────────────────┼────────────────────────┤
  103. │ de_deconv3_2 │ (None, None, None, │ 148,096 │ max_unpooling2d_2[0][… │
  104. │ (Conv2DWithBN) │ 128) │ │ │
  105. ├───────────────────────────┼────────────────────────┼────────────────┼────────────────────────┤
  106. │ de_deconv4_1 │ (None, None, None, 64) │ 74,048 │ de_deconv3_2[0][0] │
  107. │ (Conv2DTransposeWithBN) │ │ │ │
  108. ├───────────────────────────┼────────────────────────┼────────────────┼────────────────────────┤
  109. │ max_unpooling2d_3 │ (None, None, None, 64) │ 0 │ de_deconv4_1[0][0], │
  110. │ (MaxUnpooling2D) │ │ │ max_pool_with_argmax[… │
  111. ├───────────────────────────┼────────────────────────┼────────────────┼────────────────────────┤
  112. │ de_deconv4_2 │ (None, None, None, 64) │ 37,184 │ max_unpooling2d_3[0][… │
  113. │ (Conv2DWithBN) │ │ │ │
  114. ├───────────────────────────┼────────────────────────┼────────────────┼────────────────────────┤
  115. │ output (Conv2DTranspose) │ (None, None, None, 10) │ 5,770 │ de_deconv4_2[0][0] │
  116. └───────────────────────────┴────────────────────────┴────────────────┴────────────────────────┘
  117. Total params: 17,655,370 (67.35 MB)
  118. Trainable params: 17,644,618 (67.31 MB)
  119. Non-trainable params: 10,752 (42.00 KB)
  120. ----------------------------------------------------------------------------------------------------
  121. Start training...
  122. Epoch 1/30
  123. 2024-05-01 13:26:19.327948: W external/local_tsl/tsl/framework/cpu_allocator_impl.cc:83] Allocation of 31457280 exceeds 10% of free system memory.
  124. 2024-05-01 13:26:19.502327: I external/local_xla/xla/service/service.cc:168] XLA service 0x7dac5c038470 initialized for platform CUDA (this does not guarantee that XLA will be used). Devices:
  125. 2024-05-01 13:26:19.502399: I external/local_xla/xla/service/service.cc:176] StreamExecutor device (0): Tesla T4, Compute Capability 7.5
  126. 2024-05-01 13:26:20.095428: W tensorflow/core/framework/op_kernel.cc:1839] OP_REQUIRES failed at xla_ops.cc:574 : INVALID_ARGUMENT: Detected unsupported operations when trying to compile graph __inference_one_step_on_data_18280[] on XLA_GPU_JIT: MaxPoolWithArgmax (No registered 'MaxPoolWithArgmax' OpKernel for XLA_GPU_JIT devices compatible with node {{node SegNet_1/max_pool_with_argmax_1/MaxPoolWithArgmax}}){{node SegNet_1/max_pool_with_argmax_1/MaxPoolWithArgmax}}
  127. The op is created at:
  128. File "content/drive/MyDrive/SegNet_SceneParse150/train.py", line 42, in <module>
  129. File "root/.local/lib/python3.10/site-packages/keras/src/utils/traceback_utils.py", line 117, in error_handler
  130. File "root/.local/lib/python3.10/site-packages/keras/src/backend/tensorflow/trainer.py", line 314, in fit
  131. File "root/.local/lib/python3.10/site-packages/keras/src/backend/tensorflow/trainer.py", line 117, in one_step_on_iterator
  132. File "root/.local/lib/python3.10/site-packages/keras/src/backend/tensorflow/trainer.py", line 104, in one_step_on_data
  133. File "root/.local/lib/python3.10/site-packages/keras/src/backend/tensorflow/trainer.py", line 51, in train_step
  134. File "root/.local/lib/python3.10/site-packages/keras/src/utils/traceback_utils.py", line 117, in error_handler
  135. File "root/.local/lib/python3.10/site-packages/keras/src/layers/layer.py", line 842, in __call__
  136. File "root/.local/lib/python3.10/site-packages/keras/src/utils/traceback_utils.py", line 117, in error_handler
  137. File "root/.local/lib/python3.10/site-packages/keras/src/ops/operation.py", line 48, in __call__
  138. File "root/.local/lib/python3.10/site-packages/keras/src/utils/traceback_utils.py", line 156, in error_handler
  139. File "root/.local/lib/python3.10/site-packages/keras/src/models/functional.py", line 199, in call
  140. File "root/.local/lib/python3.10/site-packages/keras/src/ops/function.py", line 151, in _run_through_graph
  141. File "root/.local/lib/python3.10/site-packages/keras/src/models/functional.py", line 589, in call
  142. File "root/.local/lib/python3.10/site-packages/keras/src/utils/traceback_utils.py", line 117, in error_handler
  143. File "root/.local/lib/python3.10/site-packages/keras/src/layers/layer.py", line 842, in __call__
  144. File "root/.local/lib/python3.10/site-packages/keras/src/utils/traceback_utils.py", line 117, in error_handler
  145. File "root/.local/lib/python3.10/site-packages/keras/src/ops/operation.py", line 48, in __call__
  146. File "root/.local/lib/python3.10/site-packages/keras/src/utils/traceback_utils.py", line 156, in error_handler
  147. File "content/drive/MyDrive/SegNet_SceneParse150/Layers/max_pool_with_argmax.py", line 32, in call
  148. tf2xla conversion failed while converting __inference_one_step_on_data_18280[]. Run with TF_DUMP_GRAPH_PREFIX=/path/to/dump/dir and --vmodule=xla_compiler=2 to obtain a dump of the compiled functions.
  149. Traceback (most recent call last):
  150. File "/content/drive/MyDrive/SegNet_SceneParse150/train.py", line 42, in <module>
  151. train_history = model.fit(
  152. File "/root/.local/lib/python3.10/site-packages/keras/src/utils/traceback_utils.py", line 122, in error_handler
  153. raise e.with_traceback(filtered_tb) from None
  154. File "/usr/local/lib/python3.10/dist-packages/tensorflow/python/eager/execute.py", line 53, in quick_execute
  155. tensors = pywrap_tfe.TFE_Py_Execute(ctx._handle, device_name, op_name,
  156. tensorflow.python.framework.errors_impl.InvalidArgumentError: Graph execution error:
  157.  
  158. Detected at node SegNet_1/max_pool_with_argmax_1/MaxPoolWithArgmax defined at (most recent call last):
  159. <stack traces unavailable>
  160. Detected at node SegNet_1/max_pool_with_argmax_1/MaxPoolWithArgmax defined at (most recent call last):
  161. <stack traces unavailable>
  162. Detected unsupported operations when trying to compile graph __inference_one_step_on_data_18280[] on XLA_GPU_JIT: MaxPoolWithArgmax (No registered 'MaxPoolWithArgmax' OpKernel for XLA_GPU_JIT devices compatible with node {{node SegNet_1/max_pool_with_argmax_1/MaxPoolWithArgmax}}){{node SegNet_1/max_pool_with_argmax_1/MaxPoolWithArgmax}}
  163. The op is created at:
  164. File "content/drive/MyDrive/SegNet_SceneParse150/train.py", line 42, in <module>
  165. File "root/.local/lib/python3.10/site-packages/keras/src/utils/traceback_utils.py", line 117, in error_handler
  166. File "root/.local/lib/python3.10/site-packages/keras/src/backend/tensorflow/trainer.py", line 314, in fit
  167. File "root/.local/lib/python3.10/site-packages/keras/src/backend/tensorflow/trainer.py", line 117, in one_step_on_iterator
  168. File "root/.local/lib/python3.10/site-packages/keras/src/backend/tensorflow/trainer.py", line 104, in one_step_on_data
  169. File "root/.local/lib/python3.10/site-packages/keras/src/backend/tensorflow/trainer.py", line 51, in train_step
  170. File "root/.local/lib/python3.10/site-packages/keras/src/utils/traceback_utils.py", line 117, in error_handler
  171. File "root/.local/lib/python3.10/site-packages/keras/src/layers/layer.py", line 842, in __call__
  172. File "root/.local/lib/python3.10/site-packages/keras/src/utils/traceback_utils.py", line 117, in error_handler
  173. File "root/.local/lib/python3.10/site-packages/keras/src/ops/operation.py", line 48, in __call__
  174. File "root/.local/lib/python3.10/site-packages/keras/src/utils/traceback_utils.py", line 156, in error_handler
  175. File "root/.local/lib/python3.10/site-packages/keras/src/models/functional.py", line 199, in call
  176. File "root/.local/lib/python3.10/site-packages/keras/src/ops/function.py", line 151, in _run_through_graph
  177. File "root/.local/lib/python3.10/site-packages/keras/src/models/functional.py", line 589, in call
  178. File "root/.local/lib/python3.10/site-packages/keras/src/utils/traceback_utils.py", line 117, in error_handler
  179. File "root/.local/lib/python3.10/site-packages/keras/src/layers/layer.py", line 842, in __call__
  180. File "root/.local/lib/python3.10/site-packages/keras/src/utils/traceback_utils.py", line 117, in error_handler
  181. File "root/.local/lib/python3.10/site-packages/keras/src/ops/operation.py", line 48, in __call__
  182. File "root/.local/lib/python3.10/site-packages/keras/src/utils/traceback_utils.py", line 156, in error_handler
  183. File "content/drive/MyDrive/SegNet_SceneParse150/Layers/max_pool_with_argmax.py", line 32, in call
  184. tf2xla conversion failed while converting __inference_one_step_on_data_18280[]. Run with TF_DUMP_GRAPH_PREFIX=/path/to/dump/dir and --vmodule=xla_compiler=2 to obtain a dump of the compiled functions.
  185. [[StatefulPartitionedCall]] [Op:__inference_one_step_on_iterator_18873]
  186. 2024-05-01 13:26:20.405371: W tensorflow/core/kernels/data/generator_dataset_op.cc:108] Error occurred when finalizing GeneratorDataset iterator: FAILED_PRECONDITION: Python interpreter state is not initialized. The process may be terminated.
  187. [[{{node PyFunc}}]]
Advertisement
Add Comment
Please, Sign In to add comment
Advertisement