Guest User

Untitled

a guest
Jun 8th, 2023
46
0
Never
Not a member of Pastebin yet? Sign Up, it unlocks many cool features!
text 3.27 KB | None | 0 0
  1. GTL_DEBUG: [2] hsa_amd_ipc_memory_attach: HSA_STATUS_ERROR_INVALID_ARGUMENT: One of the actual arguments does not meet a precondition stated in the documentation of the corresponding formal argument.
  2. GTL_DEBUG: [3] hsa_amd_ipc_memory_attach: HSA_STATUS_ERROR_INVALID_ARGUMENT: One of the actual arguments does not meet a precondition stated in the documentation of the corresponding formal argument.
  3. MPICH ERROR [Rank 2] [job id 3654896.0] [Thu Jun 8 19:05:54 2023] [nid007563] - Abort(742508546) (rank 2 in comm 0): Fatal error in PMPI_Recv: Invalid count, error stack:
  4. PMPI_Recv(177)............................: MPI_Recv(buf=0x149dae200000, count=921600, MPI_DOUBLE, src=0, tag=4, MPI_COMM_WORLD, status=0x1) failed
  5. MPIR_Wait_impl(41)........................:
  6. MPID_Progress_wait(193)...................:
  7. MPIDI_Progress_test(97)...................:
  8. MPIDI_SHMI_progress(118)..................:
  9. MPIDI_POSIX_progress(422).................:
  10. MPIDI_CRAY_Common_lmt_ctrl_send_rts_cb(64):
  11. MPIDI_CRAY_Common_lmt_handle_recv(44).....:
  12. MPIDI_CRAY_Common_lmt_import_mem(218).....:
  13. (unknown)(): Invalid count
  14.  
  15. aborting job:
  16. Fatal error in PMPI_Recv: Invalid count, error stack:
  17. PMPI_Recv(177)............................: MPI_Recv(buf=0x149dae200000, count=921600, MPI_DOUBLE, src=0, tag=4, MPI_COMM_WORLD, status=0x1) failed
  18. MPIR_Wait_impl(41)........................:
  19. MPID_Progress_wait(193)...................:
  20. MPIDI_Progress_test(97)...................:
  21. MPIDI_SHMI_progress(118)..................:
  22. MPIDI_POSIX_progress(422).................:
  23. MPIDI_CRAY_Common_lmt_ctrl_send_rts_cb(64):
  24. MPIDI_CRAY_Common_lmt_handle_recv(44).....:
  25. MPIDI_CRAY_Common_lmt_import_mem(218).....:
  26. (unknown)(): Invalid count
  27. MPICH ERROR [Rank 3] [job id 3654896.0] [Thu Jun 8 19:05:54 2023] [nid007563] - Abort(1010944002) (rank 3 in comm 0): Fatal error in PMPI_Recv: Invalid count, error stack:
  28. PMPI_Recv(177)............................: MPI_Recv(buf=0x14f482c00000, count=921600, MPI_DOUBLE, src=1, tag=5, MPI_COMM_WORLD, status=0x1) failed
  29. MPIR_Wait_impl(41)........................:
  30. MPID_Progress_wait(193)...................:
  31. MPIDI_Progress_test(97)...................:
  32. MPIDI_SHMI_progress(118)..................:
  33. MPIDI_POSIX_progress(422).................:
  34. MPIDI_CRAY_Common_lmt_ctrl_send_rts_cb(64):
  35. MPIDI_CRAY_Common_lmt_handle_recv(44).....:
  36. MPIDI_CRAY_Common_lmt_import_mem(218).....:
  37. (unknown)(): Invalid count
  38.  
  39. aborting job:
  40. Fatal error in PMPI_Recv: Invalid count, error stack:
  41. PMPI_Recv(177)............................: MPI_Recv(buf=0x14f482c00000, count=921600, MPI_DOUBLE, src=1, tag=5, MPI_COMM_WORLD, status=0x1) failed
  42. MPIR_Wait_impl(41)........................:
  43. MPID_Progress_wait(193)...................:
  44. MPIDI_Progress_test(97)...................:
  45. MPIDI_SHMI_progress(118)..................:
  46. MPIDI_POSIX_progress(422).................:
  47. MPIDI_CRAY_Common_lmt_ctrl_send_rts_cb(64):
  48. MPIDI_CRAY_Common_lmt_handle_recv(44).....:
  49. MPIDI_CRAY_Common_lmt_import_mem(218).....:
  50. (unknown)(): Invalid count
  51. srun: error: nid007563: tasks 2-3: Exited with exit code 255
  52. srun: launch/slurm: _step_signal: Terminating StepId=3654896.0
  53. slurmstepd: error: *** STEP 3654896.0 ON nid007563 CANCELLED AT 2023-06-08T19:05:55 ***
  54. srun: error: nid007563: tasks 0-1,4-7: Terminated
  55. srun: Force Terminated StepId=3654896.0
Advertisement
Add Comment
Please, Sign In to add comment