Advertisement
Not a member of Pastebin yet?
Sign Up,
it unlocks many cool features!
- [2024-07-26T13:13:49.562] debug: slurmctld log levels: stderr=quiet logfile=debug2 syslog=fatal
- [2024-07-26T13:13:49.562] debug: Log file re-opened
- [2024-07-26T13:13:49.564] debug: sched: slurmctld starting
- [2024-07-26T13:13:49.565] slurmscriptd: debug: slurmscriptd: Got ack from slurmctld, initialization successful
- [2024-07-26T13:13:49.565] slurmscriptd: debug: _slurmscriptd_mainloop: started
- [2024-07-26T13:13:49.565] debug: slurmctld: slurmscriptd fork()'d and initialized.
- [2024-07-26T13:13:49.565] debug: _slurmctld_listener_thread: started listening to slurmscriptd
- [2024-07-26T13:13:49.565] slurmctld version 22.05.8 started on cluster dlabcluster
- [2024-07-26T13:13:49.566] cred/munge: init: Munge credential signature plugin loaded
- [2024-07-26T13:13:49.567] debug: auth/munge: init: Munge authentication plugin loaded
- [2024-07-26T13:13:49.567] select/cons_res: common_init: select/cons_res loaded
- [2024-07-26T13:13:49.568] select/cons_tres: common_init: select/cons_tres loaded
- [2024-07-26T13:13:49.568] select/cray_aries: init: Cray/Aries node selection plugin loaded
- [2024-07-26T13:13:49.568] preempt/none: init: preempt/none loaded
- [2024-07-26T13:13:49.568] debug: acct_gather_energy/none: init: AcctGatherEnergy NONE plugin loaded
- [2024-07-26T13:13:49.568] debug: acct_gather_profile/none: init: AcctGatherProfile NONE plugin loaded
- [2024-07-26T13:13:49.568] debug: acct_gather_interconnect/none: init: AcctGatherInterconnect NONE plugin loaded
- [2024-07-26T13:13:49.568] debug: acct_gather_filesystem/none: init: AcctGatherFilesystem NONE plugin loaded
- [2024-07-26T13:13:49.568] debug2: No acct_gather.conf file (/etc/slurm/acct_gather.conf)
- [2024-07-26T13:13:49.568] debug: jobacct_gather/none: init: Job accounting gather NOT_INVOKED plugin loaded
- [2024-07-26T13:13:49.569] ext_sensors/none: init: ExtSensors NONE plugin loaded
- [2024-07-26T13:13:49.569] debug: MPI: Loading all types
- [2024-07-26T13:13:49.570] debug: mpi/pmix_v4: init: PMIx plugin loaded
- [2024-07-26T13:13:49.570] debug2: No mpi.conf file (/etc/slurm/mpi.conf)
- [2024-07-26T13:13:49.573] accounting_storage/none: init: Accounting storage NOT INVOKED plugin loaded
- [2024-07-26T13:13:49.573] debug: switch Cray/Aries plugin loaded.
- [2024-07-26T13:13:49.573] debug: switch/none: init: switch NONE plugin loaded
- [2024-07-26T13:13:49.573] debug: Reading slurm.conf file: /etc/slurm/slurm.conf
- [2024-07-26T13:13:49.574] No memory enforcing mechanism configured.
- [2024-07-26T13:13:49.574] topology/none: init: topology NONE plugin loaded
- [2024-07-26T13:13:49.574] debug: No DownNodes
- [2024-07-26T13:13:49.577] debug: slurmctld log levels: stderr=quiet logfile=debug2 syslog=fatal
- [2024-07-26T13:13:49.577] debug: Log file re-opened
- [2024-07-26T13:13:49.578] sched: Backfill scheduler plugin loaded
- [2024-07-26T13:13:49.578] route/default: init: route default plugin loaded
- [2024-07-26T13:13:49.578] Recovered state of 3 nodes
- [2024-07-26T13:13:49.578] Down nodes: server[2-3]
- [2024-07-26T13:13:49.579] Recovered JobId=314 Assoc=0
- [2024-07-26T13:13:49.579] debug: starting JobId=314 in accounting
- [2024-07-26T13:13:49.579] Recovered information about 1 jobs
- [2024-07-26T13:13:49.579] select/cons_tres: select_p_node_init: select/cons_tres SelectTypeParameters not specified, using default value: CR_Core_Memory
- [2024-07-26T13:13:49.579] select/cons_tres: part_data_create_array: select/cons_tres: preparing for 1 partitions
- [2024-07-26T13:13:49.579] debug: Updating partition uid access list
- [2024-07-26T13:13:49.579] Recovered state of 0 reservations
- [2024-07-26T13:13:49.579] State of 0 triggers recovered
- [2024-07-26T13:13:49.579] read_slurm_conf: backup_controller not specified
- [2024-07-26T13:13:49.579] select/cons_tres: select_p_reconfigure: select/cons_tres: reconfigure
- [2024-07-26T13:13:49.579] select/cons_tres: part_data_create_array: select/cons_tres: preparing for 1 partitions
- [2024-07-26T13:13:49.580] debug: power_save module disabled, SuspendTime < 0
- [2024-07-26T13:13:49.580] Running as primary controller
- [2024-07-26T13:13:49.580] debug: No backup controllers, not launching heartbeat.
- [2024-07-26T13:13:49.580] debug: priority/basic: init: Priority BASIC plugin loaded
- [2024-07-26T13:13:49.580] No parameter for mcs plugin, default values set
- [2024-07-26T13:13:49.580] mcs: MCSParameters = (null). ondemand set.
- [2024-07-26T13:13:49.580] debug: mcs/none: init: mcs none plugin loaded
- [2024-07-26T13:13:49.580] debug2: slurmctld listening on 0.0.0.0:6817
- [2024-07-26T13:13:52.662] debug: hash/k12: init: init: KangarooTwelve hash plugin loaded
- [2024-07-26T13:13:52.662] debug2: Processing RPC: MESSAGE_NODE_REGISTRATION_STATUS from UID=0
- [2024-07-26T13:13:52.662] debug: gres/gpu: init: loaded
- [2024-07-26T13:13:52.662] debug: validate_node_specs: node server1 registered with 0 jobs
- [2024-07-26T13:13:52.662] debug2: _slurm_rpc_node_registration complete for server1 usec=229
- [2024-07-26T13:13:53.586] debug: Spawning registration agent for server[2-3] 2 hosts
- [2024-07-26T13:13:53.586] SchedulerParameters=default_queue_depth=100,max_rpc_cnt=0,max_sched_time=2,partition_job_depth=0,sched_max_job_start=0,sched_min_interval=2
- [2024-07-26T13:13:53.586] debug: sched: Running job scheduler for default depth.
- [2024-07-26T13:13:53.586] debug2: Spawning RPC agent for msg_type REQUEST_NODE_REGISTRATION_STATUS
- [2024-07-26T13:13:53.587] debug2: Tree head got back 0 looking for 2
- [2024-07-26T13:13:53.588] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:13:53.588] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:13:53.588] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:13:53.588] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:13:54.588] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:13:54.588] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:13:54.589] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:13:54.589] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:13:55.589] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:13:55.589] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:13:55.590] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:13:55.590] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:13:56.590] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:13:56.590] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:13:56.591] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:13:56.591] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:13:57.591] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:13:57.591] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:13:57.592] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:13:57.592] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:13:58.592] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:13:58.592] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:13:58.593] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:13:58.593] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:13:59.593] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:13:59.593] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:13:59.594] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:13:59.594] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:14:00.594] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:14:00.594] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:14:00.595] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:14:00.595] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:14:01.595] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:14:01.595] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:14:01.596] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:14:01.596] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:14:02.596] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:14:02.596] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:14:02.597] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:14:02.597] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:14:03.597] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:14:03.597] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:14:03.598] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:14:03.598] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:14:04.597] debug2: Tree head got back 1
- [2024-07-26T13:14:04.598] debug2: Tree head got back 2
- [2024-07-26T13:14:04.598] agent/is_node_resp: node:server2 RPC:REQUEST_NODE_REGISTRATION_STATUS : Communication connection failure
- [2024-07-26T13:14:04.598] agent/is_node_resp: node:server3 RPC:REQUEST_NODE_REGISTRATION_STATUS : Communication connection failure
- [2024-07-26T13:14:19.578] debug: sched/backfill: _attempt_backfill: beginning
- [2024-07-26T13:14:19.578] debug: sched/backfill: _attempt_backfill: 1 jobs to backfill
- [2024-07-26T13:14:19.626] debug2: Testing job time limits and checkpoints
- [2024-07-26T13:14:49.669] debug2: Testing job time limits and checkpoints
- [2024-07-26T13:14:49.669] debug2: Performing purge of old job records
- [2024-07-26T13:14:49.670] debug: sched: Running job scheduler for full queue.
- [2024-07-26T13:14:50.579] debug: sched/backfill: _attempt_backfill: beginning
- [2024-07-26T13:14:50.579] debug: sched/backfill: _attempt_backfill: 1 jobs to backfill
- [2024-07-26T13:15:19.715] debug2: Testing job time limits and checkpoints
- [2024-07-26T13:15:33.737] debug: Spawning ping agent for server1
- [2024-07-26T13:15:33.737] debug: Spawning registration agent for server[2-3] 2 hosts
- [2024-07-26T13:15:33.737] debug2: Spawning RPC agent for msg_type REQUEST_PING
- [2024-07-26T13:15:33.737] debug2: Spawning RPC agent for msg_type REQUEST_NODE_REGISTRATION_STATUS
- [2024-07-26T13:15:33.737] debug2: Tree head got back 0 looking for 1
- [2024-07-26T13:15:33.738] debug2: Tree head got back 0 looking for 2
- [2024-07-26T13:15:33.738] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:15:33.739] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:15:33.739] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:15:33.739] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:15:33.740] debug2: Tree head got back 1
- [2024-07-26T13:15:33.742] debug2: node_did_resp server1
- [2024-07-26T13:15:34.581] debug: sched/backfill: _attempt_backfill: beginning
- [2024-07-26T13:15:34.582] debug: sched/backfill: _attempt_backfill: 1 jobs to backfill
- [2024-07-26T13:15:34.740] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:15:34.740] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:15:34.740] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:15:34.740] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:15:35.741] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:15:35.741] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:15:35.741] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:15:35.741] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:15:36.741] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:15:36.741] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:15:36.742] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:15:36.742] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:15:37.742] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:15:37.742] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:15:37.743] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:15:37.743] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:15:38.743] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:15:38.743] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:15:38.744] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:15:38.744] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:15:39.744] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:15:39.744] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:15:39.745] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:15:39.745] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:15:40.745] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:15:40.745] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:15:40.746] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:15:40.746] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:15:41.746] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:15:41.746] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:15:41.746] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:15:41.747] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:15:42.747] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:15:42.747] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:15:42.748] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:15:42.748] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:15:43.748] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:15:43.748] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:15:43.749] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:15:43.749] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:15:44.748] debug2: Tree head got back 1
- [2024-07-26T13:15:44.749] debug2: Tree head got back 2
- [2024-07-26T13:15:49.759] debug2: Testing job time limits and checkpoints
- [2024-07-26T13:15:49.759] debug2: Performing purge of old job records
- [2024-07-26T13:15:49.759] debug: sched: Running job scheduler for full queue.
- [2024-07-26T13:16:04.582] debug: sched/backfill: _attempt_backfill: beginning
- [2024-07-26T13:16:04.582] debug: sched/backfill: _attempt_backfill: 1 jobs to backfill
- [2024-07-26T13:16:19.802] debug2: Testing job time limits and checkpoints
- [2024-07-26T13:16:49.845] debug2: Testing job time limits and checkpoints
- [2024-07-26T13:16:49.845] debug2: Performing purge of old job records
- [2024-07-26T13:16:49.845] debug: sched: Running job scheduler for full queue.
- [2024-07-26T13:16:50.585] debug: sched/backfill: _attempt_backfill: beginning
- [2024-07-26T13:16:50.585] debug: sched/backfill: _attempt_backfill: 1 jobs to backfill
- [2024-07-26T13:17:13.881] debug: Spawning registration agent for server[2-3] 2 hosts
- [2024-07-26T13:17:13.881] debug2: Spawning RPC agent for msg_type REQUEST_NODE_REGISTRATION_STATUS
- [2024-07-26T13:17:13.882] debug2: Tree head got back 0 looking for 2
- [2024-07-26T13:17:13.883] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:17:13.883] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:17:13.883] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:17:13.883] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:17:14.884] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:17:14.884] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:17:14.884] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:17:14.884] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:17:15.885] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:17:15.885] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:17:15.885] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:17:15.885] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:17:16.886] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:17:16.886] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:17:16.886] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:17:16.886] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:17:17.887] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:17:17.887] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:17:17.887] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:17:17.887] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:17:18.888] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:17:18.888] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:17:18.888] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:17:18.888] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:17:19.889] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:17:19.889] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:17:19.889] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:17:19.889] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:17:19.890] debug2: Testing job time limits and checkpoints
- [2024-07-26T13:17:20.890] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:17:20.890] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:17:20.890] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:17:20.890] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:17:21.891] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:17:21.891] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:17:21.891] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:17:21.891] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:17:22.891] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:17:22.892] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:17:22.892] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:17:22.892] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:17:23.892] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:17:23.892] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:17:23.893] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:17:23.893] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:17:24.893] debug2: Tree head got back 1
- [2024-07-26T13:17:24.893] debug2: Tree head got back 2
- [2024-07-26T13:17:49.937] debug2: Testing job time limits and checkpoints
- [2024-07-26T13:17:49.937] debug2: Performing purge of old job records
- [2024-07-26T13:17:49.937] debug: sched: Running job scheduler for full queue.
- [2024-07-26T13:17:50.590] debug: sched/backfill: _attempt_backfill: beginning
- [2024-07-26T13:17:50.590] debug: sched/backfill: _attempt_backfill: 1 jobs to backfill
- [2024-07-26T13:18:19.982] debug2: Testing job time limits and checkpoints
- [2024-07-26T13:18:49.026] debug2: Testing job time limits and checkpoints
- [2024-07-26T13:18:49.026] debug2: Performing purge of old job records
- [2024-07-26T13:18:49.026] debug2: Performing full system state save
- [2024-07-26T13:18:49.026] debug: sched: Running job scheduler for full queue.
- [2024-07-26T13:18:49.595] debug: sched/backfill: _attempt_backfill: beginning
- [2024-07-26T13:18:49.595] debug: sched/backfill: _attempt_backfill: 1 jobs to backfill
- [2024-07-26T13:18:53.049] debug: Spawning ping agent for server1
- [2024-07-26T13:18:53.049] debug: Spawning registration agent for server[2-3] 2 hosts
- [2024-07-26T13:18:53.049] debug2: Spawning RPC agent for msg_type REQUEST_PING
- [2024-07-26T13:18:53.049] debug2: Spawning RPC agent for msg_type REQUEST_NODE_REGISTRATION_STATUS
- [2024-07-26T13:18:53.050] debug2: Tree head got back 0 looking for 1
- [2024-07-26T13:18:53.050] debug2: Tree head got back 0 looking for 2
- [2024-07-26T13:18:53.050] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:18:53.050] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:18:53.050] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:18:53.051] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:18:53.051] debug2: Tree head got back 1
- [2024-07-26T13:18:53.055] debug2: node_did_resp server1
- [2024-07-26T13:18:54.051] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:18:54.051] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:18:54.051] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:18:54.051] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:18:55.052] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:18:55.052] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:18:55.052] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:18:55.052] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:18:56.053] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:18:56.053] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:18:56.053] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:18:56.053] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:18:57.054] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:18:57.054] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:18:57.054] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:18:57.054] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:18:58.055] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:18:58.055] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:18:58.055] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:18:58.055] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:18:59.056] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:18:59.056] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:18:59.056] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:18:59.056] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:19:00.057] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:19:00.057] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:19:00.057] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:19:00.057] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:19:01.057] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:19:01.057] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:19:01.057] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:19:01.057] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:19:02.059] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:19:02.059] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:19:02.059] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:19:02.059] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:19:03.059] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:19:03.060] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:19:03.060] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:19:03.060] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:19:04.060] debug2: Tree head got back 1
- [2024-07-26T13:19:04.060] debug2: Tree head got back 2
- [2024-07-26T13:19:19.089] debug2: Testing job time limits and checkpoints
- [2024-07-26T13:19:19.595] debug: sched/backfill: _attempt_backfill: beginning
- [2024-07-26T13:19:19.595] debug: sched/backfill: _attempt_backfill: 1 jobs to backfill
- [2024-07-26T13:19:49.135] debug2: Testing job time limits and checkpoints
- [2024-07-26T13:19:49.136] debug2: Performing purge of old job records
- [2024-07-26T13:19:49.136] debug: sched: Running job scheduler for full queue.
- [2024-07-26T13:19:49.596] debug: sched/backfill: _attempt_backfill: beginning
- [2024-07-26T13:19:49.596] debug: sched/backfill: _attempt_backfill: 1 jobs to backfill
- [2024-07-26T13:20:19.181] debug2: Testing job time limits and checkpoints
- [2024-07-26T13:20:19.596] debug: sched/backfill: _attempt_backfill: beginning
- [2024-07-26T13:20:19.596] debug: sched/backfill: _attempt_backfill: 1 jobs to backfill
- [2024-07-26T13:20:33.204] debug: Spawning registration agent for server[2-3] 2 hosts
- [2024-07-26T13:20:33.204] debug2: Spawning RPC agent for msg_type REQUEST_NODE_REGISTRATION_STATUS
- [2024-07-26T13:20:33.204] debug2: Tree head got back 0 looking for 2
- [2024-07-26T13:20:33.205] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:20:33.205] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:20:33.205] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:20:33.205] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:20:34.206] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:20:34.206] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:20:34.206] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:20:34.206] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:20:35.207] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:20:35.207] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:20:35.207] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:20:35.207] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:20:36.208] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:20:36.208] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:20:36.208] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:20:36.208] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:20:37.209] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:20:37.209] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:20:37.209] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:20:37.209] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:20:38.210] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:20:38.210] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:20:38.210] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:20:38.210] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:20:39.211] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:20:39.211] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:20:39.211] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:20:39.211] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:20:40.212] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:20:40.212] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:20:40.212] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:20:40.212] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:20:41.213] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:20:41.213] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:20:41.213] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:20:41.213] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:20:42.213] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:20:42.213] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:20:42.214] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:20:42.214] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:20:43.215] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:20:43.215] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:20:43.215] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:20:43.215] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:20:44.215] debug2: Tree head got back 1
- [2024-07-26T13:20:44.215] debug2: Tree head got back 2
- [2024-07-26T13:20:49.229] debug2: Testing job time limits and checkpoints
- [2024-07-26T13:20:49.229] debug2: Performing purge of old job records
- [2024-07-26T13:20:49.230] debug: sched: Running job scheduler for full queue.
- [2024-07-26T13:20:49.596] debug: sched/backfill: _attempt_backfill: beginning
- [2024-07-26T13:20:49.597] debug: sched/backfill: _attempt_backfill: 1 jobs to backfill
- [2024-07-26T13:21:19.274] debug2: Testing job time limits and checkpoints
- [2024-07-26T13:21:19.597] debug: sched/backfill: _attempt_backfill: beginning
- [2024-07-26T13:21:19.597] debug: sched/backfill: _attempt_backfill: 1 jobs to backfill
- [2024-07-26T13:21:49.321] debug2: Testing job time limits and checkpoints
- [2024-07-26T13:21:49.322] debug2: Performing purge of old job records
- [2024-07-26T13:21:49.322] debug: sched: Running job scheduler for full queue.
- [2024-07-26T13:21:49.597] debug: sched/backfill: _attempt_backfill: beginning
- [2024-07-26T13:21:49.597] debug: sched/backfill: _attempt_backfill: 1 jobs to backfill
- [2024-07-26T13:22:13.358] debug: Spawning ping agent for server1
- [2024-07-26T13:22:13.359] debug: Spawning registration agent for server[2-3] 2 hosts
- [2024-07-26T13:22:13.359] debug2: Spawning RPC agent for msg_type REQUEST_PING
- [2024-07-26T13:22:13.359] debug2: Spawning RPC agent for msg_type REQUEST_NODE_REGISTRATION_STATUS
- [2024-07-26T13:22:13.359] debug2: Tree head got back 0 looking for 1
- [2024-07-26T13:22:13.359] debug2: Tree head got back 0 looking for 2
- [2024-07-26T13:22:13.360] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:22:13.360] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:22:13.360] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:22:13.360] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:22:13.361] debug2: Tree head got back 1
- [2024-07-26T13:22:13.364] debug2: node_did_resp server1
- [2024-07-26T13:22:14.361] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:22:14.361] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:22:14.361] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:22:14.361] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:22:15.362] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:22:15.362] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:22:15.362] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:22:15.362] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:22:16.363] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:22:16.363] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:22:16.363] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:22:16.363] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:22:17.364] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:22:17.364] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:22:17.364] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:22:17.364] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:22:18.365] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:22:18.365] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:22:18.365] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:22:18.365] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:22:19.366] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:22:19.366] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:22:19.366] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:22:19.366] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:22:19.368] debug2: Testing job time limits and checkpoints
- [2024-07-26T13:22:19.597] debug: sched/backfill: _attempt_backfill: beginning
- [2024-07-26T13:22:19.598] debug: sched/backfill: _attempt_backfill: 1 jobs to backfill
- [2024-07-26T13:22:20.367] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:22:20.367] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:22:20.367] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:22:20.367] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:22:21.367] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:22:21.367] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:22:21.368] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:22:21.368] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:22:22.368] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:22:22.368] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:22:22.369] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:22:22.369] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:22:23.369] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:22:23.369] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:22:23.369] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:22:23.369] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:22:24.370] debug2: Tree head got back 2
- [2024-07-26T13:22:49.417] debug2: Testing job time limits and checkpoints
- [2024-07-26T13:22:49.417] debug2: Performing purge of old job records
- [2024-07-26T13:22:49.417] debug: sched: Running job scheduler for full queue.
- [2024-07-26T13:22:49.598] debug: sched/backfill: _attempt_backfill: beginning
- [2024-07-26T13:22:49.598] debug: sched/backfill: _attempt_backfill: 1 jobs to backfill
- [2024-07-26T13:23:19.462] debug2: Testing job time limits and checkpoints
- [2024-07-26T13:23:19.598] debug: sched/backfill: _attempt_backfill: beginning
- [2024-07-26T13:23:19.598] debug: sched/backfill: _attempt_backfill: 1 jobs to backfill
- [2024-07-26T13:23:49.510] debug2: Testing job time limits and checkpoints
- [2024-07-26T13:23:49.510] debug: Updating partition uid access list
- [2024-07-26T13:23:49.510] debug2: Updating reservations group's uid access lists
- [2024-07-26T13:23:49.510] debug2: Performing purge of old job records
- [2024-07-26T13:23:49.510] debug2: Performing full system state save
- [2024-07-26T13:23:49.510] debug: sched: Running job scheduler for full queue.
- [2024-07-26T13:23:49.598] debug: sched/backfill: _attempt_backfill: beginning
- [2024-07-26T13:23:49.599] debug: sched/backfill: _attempt_backfill: 1 jobs to backfill
- [2024-07-26T13:23:53.533] debug: Spawning registration agent for server[2-3] 2 hosts
- [2024-07-26T13:23:53.533] debug2: Spawning RPC agent for msg_type REQUEST_NODE_REGISTRATION_STATUS
- [2024-07-26T13:23:53.534] debug2: Tree head got back 0 looking for 2
- [2024-07-26T13:23:53.534] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:23:53.534] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:23:53.534] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:23:53.534] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:23:54.535] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:23:54.535] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:23:54.535] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:23:54.535] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:23:55.536] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:23:55.536] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:23:55.536] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:23:55.536] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:23:56.537] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:23:56.537] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:23:56.537] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:23:56.537] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:23:57.538] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:23:57.538] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:23:57.538] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:23:57.538] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:23:58.538] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:23:58.538] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:23:58.539] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:23:58.539] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:23:59.539] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:23:59.540] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:23:59.540] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:23:59.540] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:24:00.540] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:24:00.540] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:24:00.541] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:24:00.541] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:24:01.541] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:24:01.541] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:24:01.542] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:24:01.542] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:24:02.542] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:24:02.542] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:24:02.543] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:24:02.543] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:24:03.543] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:24:03.543] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:24:03.544] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:24:03.544] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:24:04.544] debug2: Tree head got back 1
- [2024-07-26T13:24:04.545] debug2: Tree head got back 2
- [2024-07-26T13:24:19.576] debug2: Testing job time limits and checkpoints
- [2024-07-26T13:24:19.599] debug: sched/backfill: _attempt_backfill: beginning
- [2024-07-26T13:24:19.599] debug: sched/backfill: _attempt_backfill: 1 jobs to backfill
- [2024-07-26T13:24:49.625] debug2: Testing job time limits and checkpoints
- [2024-07-26T13:24:49.625] debug2: Performing purge of old job records
- [2024-07-26T13:24:49.625] debug: sched: Running job scheduler for full queue.
- [2024-07-26T13:24:50.599] debug: sched/backfill: _attempt_backfill: beginning
- [2024-07-26T13:24:50.600] debug: sched/backfill: _attempt_backfill: 1 jobs to backfill
- [2024-07-26T13:25:19.674] debug2: Testing job time limits and checkpoints
- [2024-07-26T13:25:33.697] debug: Spawning ping agent for server1
- [2024-07-26T13:25:33.697] debug: Spawning registration agent for server[2-3] 2 hosts
- [2024-07-26T13:25:33.697] debug2: Spawning RPC agent for msg_type REQUEST_PING
- [2024-07-26T13:25:33.698] debug2: Spawning RPC agent for msg_type REQUEST_NODE_REGISTRATION_STATUS
- [2024-07-26T13:25:33.698] debug2: Tree head got back 0 looking for 1
- [2024-07-26T13:25:33.698] debug2: Tree head got back 0 looking for 2
- [2024-07-26T13:25:33.699] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:25:33.699] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:25:33.699] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:25:33.699] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:25:33.700] debug2: Tree head got back 1
- [2024-07-26T13:25:33.703] debug2: node_did_resp server1
- [2024-07-26T13:25:34.602] debug: sched/backfill: _attempt_backfill: beginning
- [2024-07-26T13:25:34.602] debug: sched/backfill: _attempt_backfill: 1 jobs to backfill
- [2024-07-26T13:25:34.700] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:25:34.700] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:25:34.700] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:25:34.700] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:25:35.701] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:25:35.701] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:25:35.701] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:25:35.701] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:25:36.701] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:25:36.702] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:25:36.702] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:25:36.702] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:25:37.702] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:25:37.702] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:25:37.703] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:25:37.703] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:25:38.703] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:25:38.704] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:25:38.704] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:25:38.704] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:25:39.704] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:25:39.704] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:25:39.705] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:25:39.705] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:25:40.705] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:25:40.705] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:25:40.705] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:25:40.706] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:25:41.706] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:25:41.706] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:25:41.706] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:25:41.706] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:25:42.707] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:25:42.707] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:25:42.707] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:25:42.707] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:25:43.708] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:25:43.708] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:25:43.708] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:25:43.708] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:25:44.708] debug2: Tree head got back 1
- [2024-07-26T13:25:44.708] debug2: Tree head got back 2
- [2024-07-26T13:25:49.722] debug2: Testing job time limits and checkpoints
- [2024-07-26T13:25:49.722] debug2: Performing purge of old job records
- [2024-07-26T13:25:49.722] debug: sched: Running job scheduler for full queue.
- [2024-07-26T13:26:04.603] debug: sched/backfill: _attempt_backfill: beginning
- [2024-07-26T13:26:04.603] debug: sched/backfill: _attempt_backfill: 1 jobs to backfill
- [2024-07-26T13:26:19.768] debug2: Testing job time limits and checkpoints
- [2024-07-26T13:26:49.814] debug2: Testing job time limits and checkpoints
- [2024-07-26T13:26:49.814] debug2: Performing purge of old job records
- [2024-07-26T13:26:49.815] debug: sched: Running job scheduler for full queue.
- [2024-07-26T13:26:50.606] debug: sched/backfill: _attempt_backfill: beginning
- [2024-07-26T13:26:50.606] debug: sched/backfill: _attempt_backfill: 1 jobs to backfill
- [2024-07-26T13:27:13.850] debug: Spawning registration agent for server[2-3] 2 hosts
- [2024-07-26T13:27:13.850] debug2: Spawning RPC agent for msg_type REQUEST_NODE_REGISTRATION_STATUS
- [2024-07-26T13:27:13.851] debug2: Tree head got back 0 looking for 2
- [2024-07-26T13:27:13.852] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:27:13.852] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:27:13.852] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:27:13.852] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:27:14.853] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:27:14.853] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:27:14.853] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:27:14.853] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:27:15.854] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:27:15.854] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:27:15.854] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:27:15.854] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:27:16.854] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:27:16.854] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:27:16.854] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:27:16.855] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:27:17.855] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:27:17.855] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:27:17.855] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:27:17.855] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:27:18.856] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:27:18.856] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:27:18.856] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:27:18.856] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:27:19.857] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:27:19.857] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:27:19.857] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:27:19.857] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:27:19.860] debug2: Testing job time limits and checkpoints
- [2024-07-26T13:27:20.858] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:27:20.858] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:27:20.858] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:27:20.858] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:27:21.859] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:27:21.859] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:27:21.859] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:27:21.859] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:27:22.860] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:27:22.860] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:27:22.860] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:27:22.860] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:27:23.861] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:27:23.861] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:27:23.861] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:27:23.861] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:27:24.861] debug2: Tree head got back 1
- [2024-07-26T13:27:24.861] debug2: Tree head got back 2
- [2024-07-26T13:27:49.907] debug2: Testing job time limits and checkpoints
- [2024-07-26T13:27:49.907] debug2: Performing purge of old job records
- [2024-07-26T13:27:49.907] debug: sched: Running job scheduler for full queue.
- [2024-07-26T13:27:50.611] debug: sched/backfill: _attempt_backfill: beginning
- [2024-07-26T13:27:50.611] debug: sched/backfill: _attempt_backfill: 1 jobs to backfill
- [2024-07-26T13:28:19.954] debug2: Testing job time limits and checkpoints
- [2024-07-26T13:28:50.001] debug2: Testing job time limits and checkpoints
- [2024-07-26T13:28:50.001] debug2: Performing purge of old job records
- [2024-07-26T13:28:50.001] debug2: Performing full system state save
- [2024-07-26T13:28:50.001] debug: sched: Running job scheduler for full queue.
- [2024-07-26T13:28:50.616] debug: sched/backfill: _attempt_backfill: beginning
- [2024-07-26T13:28:50.616] debug: sched/backfill: _attempt_backfill: 1 jobs to backfill
- [2024-07-26T13:28:53.023] debug: Spawning ping agent for server1
- [2024-07-26T13:28:53.023] debug: Spawning registration agent for server[2-3] 2 hosts
- [2024-07-26T13:28:53.023] debug2: Spawning RPC agent for msg_type REQUEST_PING
- [2024-07-26T13:28:53.023] debug2: Spawning RPC agent for msg_type REQUEST_NODE_REGISTRATION_STATUS
- [2024-07-26T13:28:53.024] debug2: Tree head got back 0 looking for 1
- [2024-07-26T13:28:53.024] debug2: Tree head got back 0 looking for 2
- [2024-07-26T13:28:53.024] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:28:53.025] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:28:53.025] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:28:53.025] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:28:53.025] debug2: Tree head got back 1
- [2024-07-26T13:28:53.029] debug2: node_did_resp server1
- [2024-07-26T13:28:54.025] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:28:54.025] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:28:54.026] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:28:54.026] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:28:55.026] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:28:55.026] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:28:55.027] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:28:55.027] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:28:56.027] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:28:56.027] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:28:56.027] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:28:56.027] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:28:57.028] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:28:57.028] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:28:57.028] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:28:57.028] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:28:58.029] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:28:58.029] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:28:58.029] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:28:58.029] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:28:59.030] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:28:59.030] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:28:59.030] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:28:59.030] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:29:00.031] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:29:00.031] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:29:00.031] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:29:00.031] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:29:01.032] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:29:01.032] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:29:01.032] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:29:01.032] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:29:02.033] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:29:02.033] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:29:02.033] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:29:02.033] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:29:03.034] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:29:03.034] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:29:03.034] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:29:03.034] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:29:04.034] debug2: Tree head got back 1
- [2024-07-26T13:29:04.034] debug2: Tree head got back 2
- [2024-07-26T13:29:19.064] debug2: Testing job time limits and checkpoints
- [2024-07-26T13:29:20.616] debug: sched/backfill: _attempt_backfill: beginning
- [2024-07-26T13:29:20.616] debug: sched/backfill: _attempt_backfill: 1 jobs to backfill
- [2024-07-26T13:29:49.109] debug2: Testing job time limits and checkpoints
- [2024-07-26T13:29:49.109] debug2: Performing purge of old job records
- [2024-07-26T13:29:49.109] debug: sched: Running job scheduler for full queue.
- [2024-07-26T13:29:50.617] debug: sched/backfill: _attempt_backfill: beginning
- [2024-07-26T13:29:50.617] debug: sched/backfill: _attempt_backfill: 1 jobs to backfill
- [2024-07-26T13:30:19.151] debug2: Testing job time limits and checkpoints
- [2024-07-26T13:30:33.170] debug: Spawning registration agent for server[2-3] 2 hosts
- [2024-07-26T13:30:33.170] debug2: Spawning RPC agent for msg_type REQUEST_NODE_REGISTRATION_STATUS
- [2024-07-26T13:30:33.171] debug2: Tree head got back 0 looking for 2
- [2024-07-26T13:30:33.171] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:30:33.172] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:30:33.172] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:30:33.172] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:30:34.173] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:30:34.173] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:30:34.173] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:30:34.173] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:30:35.173] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:30:35.173] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:30:35.174] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:30:35.174] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:30:36.175] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:30:36.175] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:30:36.175] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:30:36.175] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:30:37.176] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:30:37.176] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:30:37.176] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:30:37.176] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:30:38.177] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:30:38.177] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:30:38.177] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:30:38.177] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:30:39.177] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:30:39.178] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:30:39.178] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:30:39.178] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:30:40.179] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:30:40.179] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:30:40.179] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:30:40.179] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:30:41.180] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:30:41.180] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:30:41.180] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:30:41.180] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:30:42.181] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:30:42.181] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:30:42.181] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:30:42.181] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:30:43.181] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:30:43.181] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:30:43.182] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:30:43.182] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:30:44.182] debug2: Tree head got back 1
- [2024-07-26T13:30:44.182] debug2: Tree head got back 2
- [2024-07-26T13:30:49.195] debug2: Testing job time limits and checkpoints
- [2024-07-26T13:30:49.195] debug2: Performing purge of old job records
- [2024-07-26T13:30:49.195] debug: sched: Running job scheduler for full queue.
- [2024-07-26T13:30:49.622] debug: sched/backfill: _attempt_backfill: beginning
- [2024-07-26T13:30:49.622] debug: sched/backfill: _attempt_backfill: 1 jobs to backfill
- [2024-07-26T13:31:19.241] debug2: Testing job time limits and checkpoints
- [2024-07-26T13:31:19.622] debug: sched/backfill: _attempt_backfill: beginning
- [2024-07-26T13:31:19.622] debug: sched/backfill: _attempt_backfill: 1 jobs to backfill
- [2024-07-26T13:31:49.286] debug2: Testing job time limits and checkpoints
- [2024-07-26T13:31:49.286] debug2: Performing purge of old job records
- [2024-07-26T13:31:49.286] debug: sched: Running job scheduler for full queue.
- [2024-07-26T13:31:49.622] debug: sched/backfill: _attempt_backfill: beginning
- [2024-07-26T13:31:49.622] debug: sched/backfill: _attempt_backfill: 1 jobs to backfill
- [2024-07-26T13:32:13.323] debug: Spawning ping agent for server1
- [2024-07-26T13:32:13.323] debug: Spawning registration agent for server[2-3] 2 hosts
- [2024-07-26T13:32:13.323] debug2: Spawning RPC agent for msg_type REQUEST_PING
- [2024-07-26T13:32:13.323] debug2: Spawning RPC agent for msg_type REQUEST_NODE_REGISTRATION_STATUS
- [2024-07-26T13:32:13.323] debug2: Tree head got back 0 looking for 1
- [2024-07-26T13:32:13.324] debug2: Tree head got back 0 looking for 2
- [2024-07-26T13:32:13.324] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:32:13.324] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:32:13.324] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:32:13.325] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:32:13.325] debug2: Tree head got back 1
- [2024-07-26T13:32:13.328] debug2: node_did_resp server1
- [2024-07-26T13:32:14.325] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:32:14.325] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:32:14.325] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:32:14.325] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:32:15.326] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:32:15.326] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:32:15.326] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:32:15.326] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:32:16.327] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:32:16.327] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:32:16.327] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:32:16.327] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:32:17.328] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:32:17.328] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:32:17.328] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:32:17.328] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:32:18.329] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:32:18.329] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:32:18.329] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:32:18.329] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:32:19.330] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:32:19.330] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:32:19.330] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:32:19.330] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:32:19.332] debug2: Testing job time limits and checkpoints
- [2024-07-26T13:32:19.623] debug: sched/backfill: _attempt_backfill: beginning
- [2024-07-26T13:32:19.623] debug: sched/backfill: _attempt_backfill: 1 jobs to backfill
- [2024-07-26T13:32:20.331] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:32:20.331] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:32:20.331] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:32:20.331] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:32:21.332] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:32:21.332] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:32:21.332] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:32:21.332] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:32:22.332] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:32:22.333] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:32:22.333] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:32:22.333] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:32:23.334] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:32:23.334] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:32:23.334] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:32:23.334] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:32:24.334] debug2: Tree head got back 1
- [2024-07-26T13:32:24.334] debug2: Tree head got back 2
- [2024-07-26T13:32:49.378] debug2: Testing job time limits and checkpoints
- [2024-07-26T13:32:49.378] debug2: Performing purge of old job records
- [2024-07-26T13:32:49.378] debug: sched: Running job scheduler for full queue.
- [2024-07-26T13:32:49.623] debug: sched/backfill: _attempt_backfill: beginning
- [2024-07-26T13:32:49.623] debug: sched/backfill: _attempt_backfill: 1 jobs to backfill
- [2024-07-26T13:33:19.425] debug2: Testing job time limits and checkpoints
- [2024-07-26T13:33:19.623] debug: sched/backfill: _attempt_backfill: beginning
- [2024-07-26T13:33:19.624] debug: sched/backfill: _attempt_backfill: 1 jobs to backfill
- [2024-07-26T13:33:49.473] debug2: Testing job time limits and checkpoints
- [2024-07-26T13:33:49.474] debug: Updating partition uid access list
- [2024-07-26T13:33:49.474] debug2: Updating reservations group's uid access lists
- [2024-07-26T13:33:49.474] debug2: Performing purge of old job records
- [2024-07-26T13:33:49.474] debug2: Performing full system state save
- [2024-07-26T13:33:49.474] debug: sched: Running job scheduler for full queue.
- [2024-07-26T13:33:49.624] debug: sched/backfill: _attempt_backfill: beginning
- [2024-07-26T13:33:49.624] debug: sched/backfill: _attempt_backfill: 1 jobs to backfill
- [2024-07-26T13:33:53.496] debug: Spawning registration agent for server[2-3] 2 hosts
- [2024-07-26T13:33:53.496] debug2: Spawning RPC agent for msg_type REQUEST_NODE_REGISTRATION_STATUS
- [2024-07-26T13:33:53.497] debug2: Tree head got back 0 looking for 2
- [2024-07-26T13:33:53.498] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:33:53.498] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:33:53.498] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:33:53.498] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:33:54.498] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:33:54.498] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:33:54.499] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:33:54.499] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:33:55.499] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:33:55.499] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:33:55.500] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:33:55.500] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:33:56.500] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:33:56.500] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:33:56.501] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:33:56.501] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:33:57.501] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:33:57.501] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:33:57.502] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:33:57.502] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:33:58.502] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:33:58.502] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:33:58.502] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:33:58.502] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:33:59.503] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:33:59.503] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:33:59.504] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:33:59.504] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:34:00.504] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:34:00.504] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:34:00.504] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:34:00.504] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:34:01.505] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:34:01.505] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:34:01.505] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:34:01.505] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:34:02.506] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:34:02.506] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:34:02.506] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:34:02.506] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:34:03.507] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:34:03.507] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:34:03.507] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:34:03.507] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:34:04.507] debug2: Tree head got back 2
- [2024-07-26T13:34:04.507] debug2: Tree head got back 2
- [2024-07-26T13:34:19.537] debug2: Testing job time limits and checkpoints
- [2024-07-26T13:34:19.624] debug: sched/backfill: _attempt_backfill: beginning
- [2024-07-26T13:34:19.624] debug: sched/backfill: _attempt_backfill: 1 jobs to backfill
- [2024-07-26T13:34:49.582] debug2: Testing job time limits and checkpoints
- [2024-07-26T13:34:49.582] debug2: Performing purge of old job records
- [2024-07-26T13:34:49.582] debug: sched: Running job scheduler for full queue.
- [2024-07-26T13:34:49.625] debug: sched/backfill: _attempt_backfill: beginning
- [2024-07-26T13:34:49.625] debug: sched/backfill: _attempt_backfill: 1 jobs to backfill
- [2024-07-26T13:35:19.624] debug2: Testing job time limits and checkpoints
- [2024-07-26T13:35:19.625] debug: sched/backfill: _attempt_backfill: beginning
- [2024-07-26T13:35:19.625] debug: sched/backfill: _attempt_backfill: 1 jobs to backfill
- [2024-07-26T13:35:33.645] debug: Spawning ping agent for server1
- [2024-07-26T13:35:33.645] debug: Spawning registration agent for server[2-3] 2 hosts
- [2024-07-26T13:35:33.645] debug2: Spawning RPC agent for msg_type REQUEST_PING
- [2024-07-26T13:35:33.645] debug2: Spawning RPC agent for msg_type REQUEST_NODE_REGISTRATION_STATUS
- [2024-07-26T13:35:33.645] debug2: Tree head got back 0 looking for 1
- [2024-07-26T13:35:33.645] debug2: Tree head got back 0 looking for 2
- [2024-07-26T13:35:33.646] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:35:33.646] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:35:33.646] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:35:33.646] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:35:33.647] debug2: Tree head got back 1
- [2024-07-26T13:35:33.650] debug2: node_did_resp server1
- [2024-07-26T13:35:34.647] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:35:34.647] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:35:34.647] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:35:34.647] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:35:35.648] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:35:35.648] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:35:35.648] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:35:35.648] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:35:36.649] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:35:36.649] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:35:36.649] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:35:36.649] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:35:37.650] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:35:37.650] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:35:37.650] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:35:37.650] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:35:38.651] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:35:38.651] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:35:38.651] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:35:38.651] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:35:39.652] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:35:39.652] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:35:39.652] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:35:39.652] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:35:40.653] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:35:40.653] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:35:40.653] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:35:40.653] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:35:41.654] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:35:41.654] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:35:41.654] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:35:41.654] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:35:42.654] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:35:42.654] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:35:42.655] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:35:42.655] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:35:43.655] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:35:43.655] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:35:43.655] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:35:43.655] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:35:44.655] debug2: Tree head got back 1
- [2024-07-26T13:35:44.656] debug2: Tree head got back 2
- [2024-07-26T13:35:49.625] debug: sched/backfill: _attempt_backfill: beginning
- [2024-07-26T13:35:49.625] debug: sched/backfill: _attempt_backfill: 1 jobs to backfill
- [2024-07-26T13:35:49.671] debug2: Testing job time limits and checkpoints
- [2024-07-26T13:35:49.671] debug2: Performing purge of old job records
- [2024-07-26T13:35:49.671] debug: sched: Running job scheduler for full queue.
- [2024-07-26T13:36:19.626] debug: sched/backfill: _attempt_backfill: beginning
- [2024-07-26T13:36:19.626] debug: sched/backfill: _attempt_backfill: 1 jobs to backfill
- [2024-07-26T13:36:19.719] debug2: Testing job time limits and checkpoints
- [2024-07-26T13:36:49.766] debug2: Testing job time limits and checkpoints
- [2024-07-26T13:36:49.766] debug2: Performing purge of old job records
- [2024-07-26T13:36:49.766] debug: sched: Running job scheduler for full queue.
- [2024-07-26T13:36:50.626] debug: sched/backfill: _attempt_backfill: beginning
- [2024-07-26T13:36:50.626] debug: sched/backfill: _attempt_backfill: 1 jobs to backfill
- [2024-07-26T13:37:13.803] debug: Spawning registration agent for server[2-3] 2 hosts
- [2024-07-26T13:37:13.803] debug2: Spawning RPC agent for msg_type REQUEST_NODE_REGISTRATION_STATUS
- [2024-07-26T13:37:13.804] debug2: Tree head got back 0 looking for 2
- [2024-07-26T13:37:13.804] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:37:13.804] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:37:13.804] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:37:13.804] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:37:14.805] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:37:14.805] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:37:14.805] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:37:14.805] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:37:15.806] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:37:15.806] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:37:15.806] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:37:15.806] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:37:16.807] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:37:16.807] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:37:16.807] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:37:16.807] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:37:17.808] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:37:17.808] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:37:17.808] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:37:17.808] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:37:18.809] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:37:18.809] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:37:18.809] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:37:18.809] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:37:19.810] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:37:19.810] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:37:19.810] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:37:19.811] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:37:19.813] debug2: Testing job time limits and checkpoints
- [2024-07-26T13:37:20.811] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:37:20.812] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:37:20.812] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:37:20.812] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:37:21.812] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:37:21.813] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:37:21.813] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:37:21.813] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:37:22.814] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:37:22.814] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:37:22.814] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:37:22.814] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:37:23.814] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:37:23.814] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:37:23.815] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:37:23.815] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:37:24.815] debug2: Tree head got back 1
- [2024-07-26T13:37:24.815] debug2: Tree head got back 2
- [2024-07-26T13:37:49.861] debug2: Testing job time limits and checkpoints
- [2024-07-26T13:37:49.861] debug2: Performing purge of old job records
- [2024-07-26T13:37:49.861] debug: sched: Running job scheduler for full queue.
- [2024-07-26T13:37:50.631] debug: sched/backfill: _attempt_backfill: beginning
- [2024-07-26T13:37:50.631] debug: sched/backfill: _attempt_backfill: 1 jobs to backfill
- [2024-07-26T13:38:19.907] debug2: Testing job time limits and checkpoints
- [2024-07-26T13:38:49.955] debug2: Testing job time limits and checkpoints
- [2024-07-26T13:38:49.955] debug2: Performing purge of old job records
- [2024-07-26T13:38:49.955] debug2: Performing full system state save
- [2024-07-26T13:38:49.955] debug: sched: Running job scheduler for full queue.
- [2024-07-26T13:38:50.636] debug: sched/backfill: _attempt_backfill: beginning
- [2024-07-26T13:38:50.636] debug: sched/backfill: _attempt_backfill: 1 jobs to backfill
- [2024-07-26T13:38:53.978] debug: Spawning ping agent for server1
- [2024-07-26T13:38:53.978] debug: Spawning registration agent for server[2-3] 2 hosts
- [2024-07-26T13:38:53.978] debug2: Spawning RPC agent for msg_type REQUEST_PING
- [2024-07-26T13:38:53.978] debug2: Spawning RPC agent for msg_type REQUEST_NODE_REGISTRATION_STATUS
- [2024-07-26T13:38:53.979] debug2: Tree head got back 0 looking for 1
- [2024-07-26T13:38:53.979] debug2: Tree head got back 0 looking for 2
- [2024-07-26T13:38:53.979] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:38:53.980] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:38:53.980] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:38:53.980] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:38:53.980] debug2: Tree head got back 1
- [2024-07-26T13:38:53.984] debug2: node_did_resp server1
- [2024-07-26T13:38:54.981] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:38:54.981] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:38:54.981] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:38:54.981] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:38:55.982] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:38:55.982] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:38:55.982] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:38:55.982] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:38:56.983] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:38:56.983] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:38:56.983] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:38:56.983] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:38:57.984] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:38:57.984] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:38:57.984] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:38:57.984] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:38:58.985] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:38:58.985] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:38:58.985] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:38:58.985] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:38:59.986] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:38:59.986] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:38:59.986] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:38:59.986] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:39:00.986] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:39:00.986] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:39:00.987] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:39:00.987] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:39:01.987] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:39:01.987] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:39:01.987] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:39:01.988] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:39:02.988] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:39:02.988] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:39:02.989] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:39:02.989] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:39:03.989] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:39:03.989] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:39:03.989] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:39:03.989] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:39:04.990] debug2: Tree head got back 1
- [2024-07-26T13:39:04.990] debug2: Tree head got back 2
- [2024-07-26T13:39:19.017] debug2: Testing job time limits and checkpoints
- [2024-07-26T13:39:20.636] debug: sched/backfill: _attempt_backfill: beginning
- [2024-07-26T13:39:20.636] debug: sched/backfill: _attempt_backfill: 1 jobs to backfill
- [2024-07-26T13:39:49.064] debug2: Testing job time limits and checkpoints
- [2024-07-26T13:39:49.064] debug2: Performing purge of old job records
- [2024-07-26T13:39:49.064] debug: sched: Running job scheduler for full queue.
- [2024-07-26T13:39:50.637] debug: sched/backfill: _attempt_backfill: beginning
- [2024-07-26T13:39:50.637] debug: sched/backfill: _attempt_backfill: 1 jobs to backfill
- [2024-07-26T13:40:19.109] debug2: Testing job time limits and checkpoints
- [2024-07-26T13:40:33.130] debug: Spawning registration agent for server[2-3] 2 hosts
- [2024-07-26T13:40:33.130] debug2: Spawning RPC agent for msg_type REQUEST_NODE_REGISTRATION_STATUS
- [2024-07-26T13:40:33.131] debug2: Tree head got back 0 looking for 2
- [2024-07-26T13:40:33.132] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:40:33.132] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:40:33.132] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:40:33.132] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:40:34.133] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:40:34.133] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:40:34.133] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:40:34.133] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:40:35.133] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:40:35.134] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:40:35.134] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:40:35.134] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:40:36.134] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:40:36.134] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:40:36.135] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:40:36.135] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:40:37.135] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:40:37.135] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:40:37.135] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:40:37.135] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:40:38.136] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:40:38.136] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:40:38.136] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:40:38.136] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:40:39.137] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:40:39.137] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:40:39.137] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:40:39.137] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:40:40.138] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:40:40.138] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:40:40.138] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:40:40.138] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:40:41.139] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:40:41.139] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:40:41.139] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:40:41.139] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:40:42.140] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:40:42.140] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:40:42.140] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:40:42.140] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:40:43.141] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:40:43.141] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:40:43.141] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:40:43.141] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:40:44.141] debug2: Tree head got back 1
- [2024-07-26T13:40:44.141] debug2: Tree head got back 2
- [2024-07-26T13:40:49.153] debug2: Testing job time limits and checkpoints
- [2024-07-26T13:40:49.153] debug2: Performing purge of old job records
- [2024-07-26T13:40:49.153] debug: sched: Running job scheduler for full queue.
- [2024-07-26T13:40:49.642] debug: sched/backfill: _attempt_backfill: beginning
- [2024-07-26T13:40:49.642] debug: sched/backfill: _attempt_backfill: 1 jobs to backfill
- [2024-07-26T13:41:19.196] debug2: Testing job time limits and checkpoints
- [2024-07-26T13:41:19.642] debug: sched/backfill: _attempt_backfill: beginning
- [2024-07-26T13:41:19.642] debug: sched/backfill: _attempt_backfill: 1 jobs to backfill
- [2024-07-26T13:41:49.244] debug2: Testing job time limits and checkpoints
- [2024-07-26T13:41:49.244] debug2: Performing purge of old job records
- [2024-07-26T13:41:49.244] debug: sched: Running job scheduler for full queue.
- [2024-07-26T13:41:49.642] debug: sched/backfill: _attempt_backfill: beginning
- [2024-07-26T13:41:49.642] debug: sched/backfill: _attempt_backfill: 1 jobs to backfill
- [2024-07-26T13:42:13.279] debug: Spawning ping agent for server1
- [2024-07-26T13:42:13.279] debug: Spawning registration agent for server[2-3] 2 hosts
- [2024-07-26T13:42:13.279] debug2: Spawning RPC agent for msg_type REQUEST_PING
- [2024-07-26T13:42:13.279] debug2: Spawning RPC agent for msg_type REQUEST_NODE_REGISTRATION_STATUS
- [2024-07-26T13:42:13.279] debug2: Tree head got back 0 looking for 1
- [2024-07-26T13:42:13.280] debug2: Tree head got back 0 looking for 2
- [2024-07-26T13:42:13.280] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:42:13.280] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:42:13.280] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:42:13.280] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:42:13.281] debug2: Tree head got back 1
- [2024-07-26T13:42:13.284] debug2: node_did_resp server1
- [2024-07-26T13:42:14.281] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:42:14.281] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:42:14.281] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:42:14.281] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:42:15.282] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:42:15.282] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:42:15.282] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:42:15.282] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:42:16.283] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:42:16.283] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:42:16.283] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:42:16.283] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:42:17.284] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:42:17.284] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:42:17.284] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:42:17.284] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:42:18.285] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:42:18.285] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:42:18.285] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:42:18.285] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:42:19.286] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:42:19.286] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:42:19.286] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:42:19.286] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:42:19.289] debug2: Testing job time limits and checkpoints
- [2024-07-26T13:42:19.643] debug: sched/backfill: _attempt_backfill: beginning
- [2024-07-26T13:42:19.643] debug: sched/backfill: _attempt_backfill: 1 jobs to backfill
- [2024-07-26T13:42:20.287] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:42:20.287] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:42:20.287] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:42:20.287] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:42:21.288] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:42:21.288] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:42:21.288] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:42:21.288] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:42:22.288] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:42:22.288] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:42:22.289] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:42:22.289] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:42:23.289] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:42:23.290] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:42:23.290] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:42:23.290] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:42:24.290] debug2: Tree head got back 1
- [2024-07-26T13:42:24.290] debug2: Tree head got back 2
- [2024-07-26T13:42:49.336] debug2: Testing job time limits and checkpoints
- [2024-07-26T13:42:49.336] debug2: Performing purge of old job records
- [2024-07-26T13:42:49.336] debug: sched: Running job scheduler for full queue.
- [2024-07-26T13:42:49.643] debug: sched/backfill: _attempt_backfill: beginning
- [2024-07-26T13:42:49.643] debug: sched/backfill: _attempt_backfill: 1 jobs to backfill
- [2024-07-26T13:43:19.382] debug2: Testing job time limits and checkpoints
- [2024-07-26T13:43:19.643] debug: sched/backfill: _attempt_backfill: beginning
- [2024-07-26T13:43:19.644] debug: sched/backfill: _attempt_backfill: 1 jobs to backfill
- [2024-07-26T13:43:49.430] debug2: Testing job time limits and checkpoints
- [2024-07-26T13:43:49.430] debug: Updating partition uid access list
- [2024-07-26T13:43:49.430] debug2: Updating reservations group's uid access lists
- [2024-07-26T13:43:49.430] debug2: Performing purge of old job records
- [2024-07-26T13:43:49.430] debug2: Performing full system state save
- [2024-07-26T13:43:49.430] debug: sched: Running job scheduler for full queue.
- [2024-07-26T13:43:49.644] debug: sched/backfill: _attempt_backfill: beginning
- [2024-07-26T13:43:49.644] debug: sched/backfill: _attempt_backfill: 1 jobs to backfill
- [2024-07-26T13:43:53.452] debug: Spawning registration agent for server[2-3] 2 hosts
- [2024-07-26T13:43:53.452] debug2: Spawning RPC agent for msg_type REQUEST_NODE_REGISTRATION_STATUS
- [2024-07-26T13:43:53.453] debug2: Tree head got back 0 looking for 2
- [2024-07-26T13:43:53.453] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:43:53.453] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:43:53.453] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:43:53.453] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:43:54.454] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:43:54.454] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:43:54.454] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:43:54.454] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:43:55.455] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:43:55.455] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:43:55.455] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:43:55.455] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:43:56.456] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:43:56.456] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:43:56.456] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:43:56.456] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:43:57.457] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:43:57.457] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:43:57.457] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:43:57.457] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:43:58.458] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:43:58.458] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:43:58.458] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:43:58.458] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:43:59.459] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:43:59.459] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:43:59.459] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:43:59.459] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:44:00.460] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:44:00.460] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:44:00.460] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:44:00.460] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:44:01.461] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:44:01.461] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:44:01.461] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:44:01.461] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:44:02.461] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:44:02.461] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:44:02.462] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:44:02.462] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:44:03.462] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:44:03.463] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:44:03.463] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:44:03.463] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:44:04.463] debug2: Tree head got back 1
- [2024-07-26T13:44:04.464] debug2: Tree head got back 2
- [2024-07-26T13:44:19.493] debug2: Testing job time limits and checkpoints
- [2024-07-26T13:44:19.644] debug: sched/backfill: _attempt_backfill: beginning
- [2024-07-26T13:44:19.644] debug: sched/backfill: _attempt_backfill: 1 jobs to backfill
- [2024-07-26T13:44:49.535] debug2: Testing job time limits and checkpoints
- [2024-07-26T13:44:49.536] debug2: Performing purge of old job records
- [2024-07-26T13:44:49.536] debug: sched: Running job scheduler for full queue.
- [2024-07-26T13:44:49.645] debug: sched/backfill: _attempt_backfill: beginning
- [2024-07-26T13:44:49.645] debug: sched/backfill: _attempt_backfill: 1 jobs to backfill
- [2024-07-26T13:45:19.579] debug2: Testing job time limits and checkpoints
- [2024-07-26T13:45:19.645] debug: sched/backfill: _attempt_backfill: beginning
- [2024-07-26T13:45:19.645] debug: sched/backfill: _attempt_backfill: 1 jobs to backfill
- [2024-07-26T13:45:33.601] debug: Spawning registration agent for server[1-3] 3 hosts
- [2024-07-26T13:45:33.601] debug2: Spawning RPC agent for msg_type REQUEST_NODE_REGISTRATION_STATUS
- [2024-07-26T13:45:33.602] debug2: Tree head got back 0 looking for 3
- [2024-07-26T13:45:33.602] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:45:33.603] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:45:33.603] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:45:33.603] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:45:33.604] debug2: Tree head got back 1
- [2024-07-26T13:45:33.604] debug2: Processing RPC: MESSAGE_NODE_REGISTRATION_STATUS from UID=0
- [2024-07-26T13:45:33.604] debug2: _slurm_rpc_node_registration complete for server1 usec=14
- [2024-07-26T13:45:34.604] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:45:34.604] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:45:34.604] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:45:34.604] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:45:35.605] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:45:35.605] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:45:35.605] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:45:35.605] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:45:36.606] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:45:36.606] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:45:36.606] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:45:36.606] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:45:37.607] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:45:37.607] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:45:37.607] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:45:37.607] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:45:38.608] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:45:38.608] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:45:38.608] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:45:38.608] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:45:39.609] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:45:39.609] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:45:39.609] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:45:39.609] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:45:40.610] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:45:40.610] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:45:40.610] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:45:40.610] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:45:41.610] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:45:41.611] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:45:41.611] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:45:41.611] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:45:42.612] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:45:42.612] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:45:42.612] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:45:42.612] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:45:43.613] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:45:43.613] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:45:43.613] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:45:43.613] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:45:44.613] debug2: Tree head got back 2
- [2024-07-26T13:45:44.613] debug2: Tree head got back 3
- [2024-07-26T13:45:44.879] debug2: node_did_resp server1
- [2024-07-26T13:45:49.625] debug2: Testing job time limits and checkpoints
- [2024-07-26T13:45:49.625] debug2: Performing purge of old job records
- [2024-07-26T13:45:49.626] debug: sched: Running job scheduler for full queue.
- [2024-07-26T13:45:49.645] debug: sched/backfill: _attempt_backfill: beginning
- [2024-07-26T13:45:49.645] debug: sched/backfill: _attempt_backfill: 1 jobs to backfill
- [2024-07-26T13:46:19.646] debug: sched/backfill: _attempt_backfill: beginning
- [2024-07-26T13:46:19.646] debug: sched/backfill: _attempt_backfill: 1 jobs to backfill
- [2024-07-26T13:46:19.673] debug2: Testing job time limits and checkpoints
- [2024-07-26T13:46:49.716] debug2: Testing job time limits and checkpoints
- [2024-07-26T13:46:49.716] debug2: Performing purge of old job records
- [2024-07-26T13:46:49.716] debug: sched: Running job scheduler for full queue.
- [2024-07-26T13:46:50.646] debug: sched/backfill: _attempt_backfill: beginning
- [2024-07-26T13:46:50.646] debug: sched/backfill: _attempt_backfill: 1 jobs to backfill
- [2024-07-26T13:47:13.752] debug: Spawning registration agent for server[2-3] 2 hosts
- [2024-07-26T13:47:13.752] debug2: Spawning RPC agent for msg_type REQUEST_NODE_REGISTRATION_STATUS
- [2024-07-26T13:47:13.753] debug2: Tree head got back 0 looking for 2
- [2024-07-26T13:47:13.754] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:47:13.754] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:47:13.754] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:47:13.754] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:47:14.755] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:47:14.755] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:47:14.755] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:47:14.755] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:47:15.756] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:47:15.756] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:47:15.756] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:47:15.756] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:47:16.756] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:47:16.757] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:47:16.757] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:47:16.757] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:47:17.757] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:47:17.757] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:47:17.758] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:47:17.758] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:47:18.758] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:47:18.758] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:47:18.759] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:47:18.759] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:47:19.759] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:47:19.760] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:47:19.760] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:47:19.760] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:47:19.762] debug2: Testing job time limits and checkpoints
- [2024-07-26T13:47:20.760] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:47:20.760] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:47:20.761] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:47:20.761] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:47:21.761] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:47:21.761] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:47:21.762] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:47:21.762] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:47:22.762] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:47:22.762] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:47:22.763] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:47:22.763] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:47:23.763] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:47:23.763] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:47:23.764] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:47:23.764] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:47:24.763] debug2: Tree head got back 1
- [2024-07-26T13:47:24.764] debug2: Tree head got back 2
- [2024-07-26T13:47:49.810] debug2: Testing job time limits and checkpoints
- [2024-07-26T13:47:49.810] debug2: Performing purge of old job records
- [2024-07-26T13:47:49.810] debug: sched: Running job scheduler for full queue.
- [2024-07-26T13:47:50.651] debug: sched/backfill: _attempt_backfill: beginning
- [2024-07-26T13:47:50.652] debug: sched/backfill: _attempt_backfill: 1 jobs to backfill
- [2024-07-26T13:48:19.853] debug2: Testing job time limits and checkpoints
- [2024-07-26T13:48:49.899] debug2: Testing job time limits and checkpoints
- [2024-07-26T13:48:49.899] debug2: Performing purge of old job records
- [2024-07-26T13:48:49.899] debug2: Performing full system state save
- [2024-07-26T13:48:49.899] debug: sched: Running job scheduler for full queue.
- [2024-07-26T13:48:50.656] debug: sched/backfill: _attempt_backfill: beginning
- [2024-07-26T13:48:50.657] debug: sched/backfill: _attempt_backfill: 1 jobs to backfill
- [2024-07-26T13:48:53.922] debug: Spawning ping agent for server1
- [2024-07-26T13:48:53.922] debug: Spawning registration agent for server[2-3] 2 hosts
- [2024-07-26T13:48:53.922] debug2: Spawning RPC agent for msg_type REQUEST_PING
- [2024-07-26T13:48:53.922] debug2: Spawning RPC agent for msg_type REQUEST_NODE_REGISTRATION_STATUS
- [2024-07-26T13:48:53.922] debug2: Tree head got back 0 looking for 1
- [2024-07-26T13:48:53.922] debug2: Tree head got back 0 looking for 2
- [2024-07-26T13:48:53.923] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:48:53.923] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:48:53.923] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:48:53.923] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:48:53.924] debug2: Tree head got back 1
- [2024-07-26T13:48:53.927] debug2: node_did_resp server1
- [2024-07-26T13:48:54.924] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:48:54.924] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:48:54.924] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:48:54.924] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:48:55.925] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:48:55.925] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:48:55.925] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:48:55.925] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:48:56.926] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:48:56.926] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:48:56.926] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:48:56.926] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:48:57.927] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:48:57.927] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:48:57.927] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:48:57.927] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:48:58.928] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:48:58.928] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:48:58.928] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:48:58.928] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:48:59.928] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:48:59.929] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:48:59.929] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:48:59.929] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:49:00.929] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:49:00.929] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:49:00.930] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:49:00.930] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:49:01.930] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:49:01.930] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:49:01.931] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:49:01.931] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:49:02.931] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:49:02.931] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:49:02.932] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:49:02.932] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:49:03.932] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:49:03.932] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:49:03.933] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:49:03.933] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:49:04.933] debug2: Tree head got back 1
- [2024-07-26T13:49:04.933] debug2: Tree head got back 2
- [2024-07-26T13:49:19.960] debug2: Testing job time limits and checkpoints
- [2024-07-26T13:49:20.657] debug: sched/backfill: _attempt_backfill: beginning
- [2024-07-26T13:49:20.657] debug: sched/backfill: _attempt_backfill: 1 jobs to backfill
- [2024-07-26T13:49:49.005] debug2: Testing job time limits and checkpoints
- [2024-07-26T13:49:49.005] debug2: Performing purge of old job records
- [2024-07-26T13:49:49.005] debug: sched: Running job scheduler for full queue.
- [2024-07-26T13:49:50.657] debug: sched/backfill: _attempt_backfill: beginning
- [2024-07-26T13:49:50.657] debug: sched/backfill: _attempt_backfill: 1 jobs to backfill
- [2024-07-26T13:50:19.050] debug2: Testing job time limits and checkpoints
- [2024-07-26T13:50:33.073] debug: Spawning registration agent for server[2-3] 2 hosts
- [2024-07-26T13:50:33.073] debug2: Spawning RPC agent for msg_type REQUEST_NODE_REGISTRATION_STATUS
- [2024-07-26T13:50:33.073] debug2: Tree head got back 0 looking for 2
- [2024-07-26T13:50:33.074] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:50:33.074] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:50:33.074] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:50:33.074] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:50:34.075] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:50:34.075] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:50:34.075] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:50:34.075] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:50:35.076] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:50:35.076] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:50:35.076] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:50:35.076] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:50:36.077] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:50:36.077] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:50:36.077] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:50:36.077] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:50:37.078] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:50:37.078] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:50:37.078] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:50:37.078] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:50:38.079] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:50:38.079] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:50:38.079] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:50:38.079] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:50:39.080] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:50:39.080] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:50:39.080] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:50:39.080] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:50:40.081] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:50:40.081] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:50:40.081] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:50:40.081] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:50:41.082] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:50:41.082] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:50:41.082] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:50:41.082] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:50:42.082] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:50:42.083] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:50:42.083] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:50:42.083] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:50:43.083] debug2: _slurm_connect: failed to connect to 10.36.17.132:6818: Connection refused
- [2024-07-26T13:50:43.084] debug2: Error connecting slurm stream socket at 10.36.17.132:6818: Connection refused
- [2024-07-26T13:50:43.084] debug2: _slurm_connect: failed to connect to 10.36.17.166:6818: Connection refused
- [2024-07-26T13:50:43.084] debug2: Error connecting slurm stream socket at 10.36.17.166:6818: Connection refused
- [2024-07-26T13:50:44.084] debug2: Tree head got back 1
- [2024-07-26T13:50:44.084] debug2: Tree head got back 2
- [2024-07-26T13:50:49.095] debug2: Testing job time limits and checkpoints
- [2024-07-26T13:50:49.095] debug2: Performing purge of old job records
- [2024-07-26T13:50:49.096] debug: sched: Running job scheduler for full queue.
- [2024-07-26T13:50:49.662] debug: sched/backfill: _attempt_backfill: beginning
- [2024-07-26T13:50:49.662] debug: sched/backfill: _attempt_backfill: 1 jobs to backfill
- [2024-07-26T13:51:19.140] debug2: Testing job time limits and checkpoints
- [2024-07-26T13:51:19.662] debug: sched/backfill: _attempt_backfill: beginning
- [2024-07-26T13:51:19.662] debug: sched/backfill: _attempt_backfill: 1 jobs to backfill
- [2024-07-26T13:51:49.187] debug2: Testing job time limits and checkpoints
- [2024-07-26T13:51:49.188] debug2: Performing purge of old job records
- [2024-07-26T13:51:49.188] debug: sched: Running job scheduler for full queue.
- [2024-07-26T13:51:49.663] debug: sched/backfill: _attempt_backfill: beginning
- [2024-07-26T13:51:49.663] debug: sched/backfill: _attempt_backfill: 1 jobs to backfill
Advertisement
Add Comment
Please, Sign In to add comment
Advertisement