Advertisement
Not a member of Pastebin yet?
Sign Up,
it unlocks many cool features!
- # At this time, the fence handler was set to 644 and then both switches were powered off.
- Oct 23 14:01:55 an-c05n02 kernel: igb: eth4 NIC Link is Down
- Oct 23 14:01:55 an-c05n02 kernel: e1000e: eth3 NIC Link is Down
- Oct 23 14:01:55 an-c05n02 kernel: igb: eth5 NIC Link is Down
- Oct 23 14:01:55 an-c05n02 kernel: e1000e: eth2 NIC Link is Down
- Oct 23 14:01:55 an-c05n02 kernel: igb: eth0 NIC Link is Down
- Oct 23 14:01:55 an-c05n02 kernel: igb: eth1 NIC Link is Down
- Oct 23 14:01:55 an-c05n02 kernel: bonding: bond2: link status definitely down for interface eth2, disabling it
- Oct 23 14:01:55 an-c05n02 kernel: device eth2 left promiscuous mode
- Oct 23 14:01:55 an-c05n02 kernel: bonding: bond2: now running without any active interface !
- Oct 23 14:01:55 an-c05n02 kernel: bonding: bond2: link status definitely down for interface eth5, disabling it
- Oct 23 14:01:55 an-c05n02 kernel: bonding: bond1: link status definitely down for interface eth1, disabling it
- Oct 23 14:01:55 an-c05n02 kernel: bonding: bond1: now running without any active interface !
- Oct 23 14:01:55 an-c05n02 kernel: bonding: bond1: link status definitely down for interface eth4, disabling it
- Oct 23 14:01:55 an-c05n02 kernel: bonding: bond0: link status definitely down for interface eth0, disabling it
- Oct 23 14:01:55 an-c05n02 kernel: bonding: bond0: now running without any active interface !
- Oct 23 14:01:55 an-c05n02 kernel: bonding: bond0: link status definitely down for interface eth3, disabling it
- Oct 23 14:01:56 an-c05n02 kernel: vbr2: port 1(bond2) entering disabled state
- Oct 23 14:01:56 an-c05n02 kernel: block drbd1: PingAck did not arrive in time.
- Oct 23 14:01:56 an-c05n02 kernel: block drbd1: peer( Primary -> Unknown ) conn( Connected -> NetworkFailure ) pdsk( UpToDate -> DUnknown ) susp( 0 -> 1 )
- Oct 23 14:01:56 an-c05n02 kernel: block drbd1: asender terminated
- Oct 23 14:01:56 an-c05n02 kernel: block drbd1: Terminating drbd1_asender
- Oct 23 14:01:56 an-c05n02 kernel: block drbd1: Connection closed
- Oct 23 14:01:56 an-c05n02 kernel: block drbd1: conn( NetworkFailure -> Unconnected )
- Oct 23 14:01:56 an-c05n02 kernel: block drbd1: receiver terminated
- Oct 23 14:01:56 an-c05n02 kernel: block drbd1: Restarting drbd1_receiver
- Oct 23 14:01:56 an-c05n02 kernel: block drbd1: receiver (re)started
- Oct 23 14:01:56 an-c05n02 kernel: block drbd1: conn( Unconnected -> WFConnection )
- Oct 23 14:01:56 an-c05n02 kernel: block drbd1: helper command: /sbin/drbdadm fence-peer minor-1
- Oct 23 14:01:56 an-c05n02 kernel: block drbd1: helper command: /sbin/drbdadm fence-peer minor-1 exit code 10 (0xa00)
- # This is the "error 10"
- Oct 23 14:01:56 an-c05n02 kernel: block drbd1: fence-peer helper broken, returned 10
- Oct 23 14:02:04 an-c05n02 corosync[6754]: [TOTEM ] A processor failed, forming new configuration.
- Oct 23 14:02:06 an-c05n02 corosync[6754]: [QUORUM] Members[1]: 2
- Oct 23 14:02:06 an-c05n02 corosync[6754]: [TOTEM ] A processor joined or left the membership and a new membership was formed.
- Oct 23 14:02:06 an-c05n02 kernel: dlm: closing connection to node 1
- Oct 23 14:02:06 an-c05n02 corosync[6754]: [CPG ] chosen downlist: sender r(0) ip(10.20.50.2) ; members(old:2 left:1)
- Oct 23 14:02:06 an-c05n02 corosync[6754]: [MAIN ] Completed service synchronization, ready to provide service.
- Oct 23 14:02:06 an-c05n02 fenced[6807]: fencing node an-c05n01.alteeve.ca
- Oct 23 14:02:06 an-c05n02 kernel: GFS2: fsid=an-cluster-A:shared.1: jid=0: Trying to acquire journal lock...
- Oct 23 14:02:13 an-c05n02 kernel: block drbd0: PingAck did not arrive in time.
- Oct 23 14:02:13 an-c05n02 kernel: block drbd0: peer( Primary -> Unknown ) conn( Connected -> NetworkFailure ) pdsk( UpToDate -> DUnknown ) susp( 0 -> 1 )
- Oct 23 14:02:13 an-c05n02 kernel: block drbd0: asender terminated
- Oct 23 14:02:13 an-c05n02 kernel: block drbd0: Terminating drbd0_asender
- Oct 23 14:02:13 an-c05n02 kernel: block drbd0: Connection closed
- Oct 23 14:02:13 an-c05n02 kernel: block drbd0: conn( NetworkFailure -> Unconnected )
- Oct 23 14:02:13 an-c05n02 kernel: block drbd0: receiver terminated
- Oct 23 14:02:13 an-c05n02 kernel: block drbd0: Restarting drbd0_receiver
- Oct 23 14:02:13 an-c05n02 kernel: block drbd0: receiver (re)started
- Oct 23 14:02:13 an-c05n02 kernel: block drbd0: conn( Unconnected -> WFConnection )
- Oct 23 14:02:13 an-c05n02 kernel: block drbd0: helper command: /sbin/drbdadm fence-peer minor-0
- Oct 23 14:02:13 an-c05n02 kernel: block drbd0: helper command: /sbin/drbdadm fence-peer minor-0 exit code 10 (0xa00)
- Oct 23 14:02:13 an-c05n02 kernel: block drbd0: fence-peer helper broken, returned 10
- Oct 23 14:02:31 an-c05n02 fenced[6807]: fence an-c05n01.alteeve.ca dev 0.0 agent fence_ipmilan result: error from agent
- Oct 23 14:02:31 an-c05n02 fenced[6807]: fence an-c05n01.alteeve.ca dev 1.0 agent fence_apc_snmp result: error from agent
- Oct 23 14:02:31 an-c05n02 fenced[6807]: fence an-c05n01.alteeve.ca failed
- Oct 23 14:02:35 an-c05n02 fenced[6807]: fencing node an-c05n01.alteeve.ca
- Oct 23 14:03:00 an-c05n02 fenced[6807]: fence an-c05n01.alteeve.ca dev 0.0 agent fence_ipmilan result: error from agent
- Oct 23 14:03:00 an-c05n02 fenced[6807]: fence an-c05n01.alteeve.ca dev 1.0 agent fence_apc_snmp result: error from agent
- Oct 23 14:03:00 an-c05n02 fenced[6807]: fence an-c05n01.alteeve.ca failed
- Oct 23 14:03:03 an-c05n02 fenced[6807]: fencing node an-c05n01.alteeve.ca
- Oct 23 14:03:28 an-c05n02 fenced[6807]: fence an-c05n01.alteeve.ca dev 0.0 agent fence_ipmilan result: error from agent
- Oct 23 14:03:28 an-c05n02 fenced[6807]: fence an-c05n01.alteeve.ca dev 1.0 agent fence_apc_snmp result: error from agent
- Oct 23 14:03:28 an-c05n02 fenced[6807]: fence an-c05n01.alteeve.ca failed
- Oct 23 14:03:54 an-c05n02 kernel: e1000e: eth3 NIC Link is Up 1000 Mbps Full Duplex, Flow Control: Rx/Tx
- Oct 23 14:03:54 an-c05n02 kernel: bonding: bond0: link status up for interface eth3, enabling it in 0 ms.
- Oct 23 14:03:54 an-c05n02 kernel: bond0: link status definitely up for interface eth3, 1000 Mbps full duplex.
- Oct 23 14:03:54 an-c05n02 kernel: bonding: bond0: making interface eth3 the new active one.
- Oct 23 14:03:54 an-c05n02 kernel: bonding: bond0: first active interface up!
- Oct 23 14:03:54 an-c05n02 kernel: e1000e: eth2 NIC Link is Up 1000 Mbps Full Duplex, Flow Control: Rx/Tx
- Oct 23 14:03:54 an-c05n02 kernel: bonding: bond2: link status up for interface eth2, enabling it in 0 ms.
- Oct 23 14:03:54 an-c05n02 kernel: bond2: link status definitely up for interface eth2, 1000 Mbps full duplex.
- Oct 23 14:03:54 an-c05n02 kernel: bonding: bond2: making interface eth2 the new active one.
- Oct 23 14:03:54 an-c05n02 kernel: device eth2 entered promiscuous mode
- Oct 23 14:03:54 an-c05n02 kernel: bonding: bond2: first active interface up!
- Oct 23 14:03:54 an-c05n02 kernel: vbr2: port 1(bond2) entering forwarding state
- Oct 23 14:03:54 an-c05n02 corosync[6754]: [TOTEM ] A processor joined or left the membership and a new membership was formed.
- Oct 23 14:03:54 an-c05n02 corosync[6754]: [QUORUM] Members[2]: 1 2
- Oct 23 14:03:54 an-c05n02 corosync[6754]: [QUORUM] Members[2]: 1 2
- Oct 23 14:03:54 an-c05n02 corosync[6754]: [CPG ] chosen downlist: sender r(0) ip(10.20.50.1) ; members(old:1 left:0)
- Oct 23 14:03:54 an-c05n02 corosync[6754]: [MAIN ] Completed service synchronization, ready to provide service.
- Oct 23 14:03:54 an-c05n02 gfs_controld[6882]: receive_start 1:4 add node with started_count 3
- Oct 23 14:03:56 an-c05n02 kernel: igb: eth1 NIC Link is Up 1000 Mbps Full Duplex, Flow Control: RX/TX
- Oct 23 14:03:56 an-c05n02 kernel: bonding: bond1: link status up for interface eth1, enabling it in 0 ms.
- Oct 23 14:03:56 an-c05n02 kernel: bond1: link status definitely up for interface eth1, 1000 Mbps full duplex.
- Oct 23 14:03:56 an-c05n02 kernel: bonding: bond1: making interface eth1 the new active one.
- Oct 23 14:03:56 an-c05n02 kernel: bonding: bond1: first active interface up!
- Oct 23 14:03:56 an-c05n02 kernel: igb: eth0 NIC Link is Up 1000 Mbps Full Duplex, Flow Control: RX/TX
- Oct 23 14:03:56 an-c05n02 kernel: bonding: bond0: link status up for interface eth0, enabling it in 12000 ms.
- Oct 23 14:03:56 an-c05n02 fenced[6807]: fence an-c05n01.alteeve.ca dev 0.0 agent fence_ipmilan result: error from agent
- # Switches recover, cman finally fences node 1
- Oct 23 14:03:56 an-c05n02 fenced[6807]: fence an-c05n01.alteeve.ca success
- Oct 23 14:03:57 an-c05n02 kernel: igb: eth4 NIC Link is Up 1000 Mbps Full Duplex, Flow Control: RX/TX
- Oct 23 14:03:57 an-c05n02 kernel: bonding: bond1: link status up for interface eth4, enabling it in 12000 ms.
- Oct 23 14:03:57 an-c05n02 kernel: igb: eth5 NIC Link is Up 1000 Mbps Full Duplex, Flow Control: RX/TX
- Oct 23 14:03:57 an-c05n02 kernel: bonding: bond2: link status up for interface eth5, enabling it in 12000 ms.
- Oct 23 14:04:05 an-c05n02 corosync[6754]: [TOTEM ] A processor failed, forming new configuration.
- Oct 23 14:04:07 an-c05n02 corosync[6754]: [QUORUM] Members[1]: 2
- Oct 23 14:04:07 an-c05n02 corosync[6754]: [TOTEM ] A processor joined or left the membership and a new membership was formed.
- Oct 23 14:04:07 an-c05n02 kernel: dlm: closing connection to node 1
- Oct 23 14:04:07 an-c05n02 corosync[6754]: [CPG ] chosen downlist: sender r(0) ip(10.20.50.2) ; members(old:2 left:1)
- Oct 23 14:04:07 an-c05n02 corosync[6754]: [MAIN ] Completed service synchronization, ready to provide service.
- Oct 23 14:04:07 an-c05n02 rgmanager[7487]: Marking service:storage_an01 as stopped: Restricted domain unavailable
- Oct 23 14:04:07 an-c05n02 rgmanager[7487]: Taking over service vm:vm01-dev from down member an-c05n01.alteeve.ca
- Oct 23 14:04:07 an-c05n02 rgmanager[7487]: Marking service:storage_an01 as stopped: Restricted domain unavailable
- Oct 23 14:04:08 an-c05n02 kernel: bond0: link status definitely up for interface eth0, 1000 Mbps full duplex.
- Oct 23 14:04:08 an-c05n02 kernel: bonding: bond0: making interface eth0 the new active one.
- Oct 23 14:04:09 an-c05n02 kernel: bond1: link status definitely up for interface eth4, 1000 Mbps full duplex.
- Oct 23 14:04:09 an-c05n02 kernel: bond2: link status definitely up for interface eth5, 1000 Mbps full duplex.
- Oct 23 14:04:09 an-c05n02 kernel: vbr2: port 1(bond2) entering forwarding state
- # At this point, DRBD still hasn't fenced because the handler is still broken.
- Oct 23 14:04:21 an-c05n02 kernel: INFO: task gfs2_quotad:9195 blocked for more than 120 seconds.
- Oct 23 14:04:21 an-c05n02 kernel: "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
- <snip gfs2 traces>
- Oct 23 14:08:21 an-c05n02 kernel: [<ffffffff8100c0c0>] ? child_rip+0x0/0x20
- Oct 23 14:20:54 an-c05n02 libvirtd: Could not find keytab file: /etc/libvirt/krb5.tab: No such file or directory
- Oct 23 14:20:54 an-c05n02 kernel: lo: Disabled Privacy Extensions
- Oct 23 14:28:58 an-c05n02 kernel: block drbd0: Handshake successful: Agreed network protocol version 97
- Oct 23 14:28:58 an-c05n02 kernel: block drbd0: conn( WFConnection -> WFReportParams )
- Oct 23 14:28:58 an-c05n02 kernel: block drbd0: Starting asender thread (from drbd0_receiver [8973])
- Oct 23 14:28:58 an-c05n02 kernel: block drbd0: data-integrity-alg: <not-used>
- Oct 23 14:28:58 an-c05n02 kernel: block drbd0: drbd_sync_handshake:
- Oct 23 14:28:58 an-c05n02 kernel: block drbd0: self C9F77959FD4A71D3:0000000000000000:282E11A241B8A05A:282D11A241B8A05B bits:0 flags:0
- Oct 23 14:28:58 an-c05n02 kernel: block drbd0: peer C9F77959FD4A71D2:0000000000000000:282E11A241B8A05B:282D11A241B8A05B bits:263168 flags:2
- Oct 23 14:28:58 an-c05n02 kernel: block drbd0: uuid_compare()=-1 by rule 40
- Oct 23 14:28:58 an-c05n02 kernel: block drbd0: I shall become SyncTarget, but I am primary!
- Oct 23 14:28:58 an-c05n02 kernel: block drbd0: conn( WFReportParams -> Disconnecting )
- Oct 23 14:28:58 an-c05n02 kernel: block drbd0: error receiving ReportState, l: 4!
- Oct 23 14:28:58 an-c05n02 kernel: block drbd0: asender terminated
- Oct 23 14:28:58 an-c05n02 kernel: block drbd0: Terminating drbd0_asender
- Oct 23 14:28:58 an-c05n02 kernel: block drbd0: Connection closed
- Oct 23 14:28:58 an-c05n02 kernel: block drbd0: conn( Disconnecting -> StandAlone )
- Oct 23 14:28:58 an-c05n02 kernel: block drbd0: receiver terminated
- Oct 23 14:28:58 an-c05n02 kernel: block drbd0: Terminating drbd0_receiver
- Oct 23 14:28:58 an-c05n02 kernel: block drbd0: helper command: /sbin/drbdadm fence-peer minor-0
- Oct 23 14:28:58 an-c05n02 kernel: block drbd0: helper command: /sbin/drbdadm fence-peer minor-0 exit code 126 (0x7e00)
- Oct 23 14:28:58 an-c05n02 kernel: block drbd0: fence-peer helper broken, returned 126
- Oct 23 14:34:46 an-c05n02 kernel: block drbd0: conn( StandAlone -> Unconnected )
- Oct 23 14:34:46 an-c05n02 kernel: block drbd0: Starting receiver thread (from drbd0_worker [8965])
- Oct 23 14:34:46 an-c05n02 kernel: block drbd0: receiver (re)started
- Oct 23 14:34:46 an-c05n02 kernel: block drbd0: conn( Unconnected -> WFConnection )
- Oct 23 14:34:47 an-c05n02 kernel: block drbd0: Handshake successful: Agreed network protocol version 97
- Oct 23 14:34:47 an-c05n02 kernel: block drbd0: conn( WFConnection -> WFReportParams )
- Oct 23 14:34:47 an-c05n02 kernel: block drbd0: Starting asender thread (from drbd0_receiver [26267])
- Oct 23 14:34:47 an-c05n02 kernel: block drbd0: data-integrity-alg: <not-used>
- Oct 23 14:34:47 an-c05n02 kernel: block drbd0: drbd_sync_handshake:
- Oct 23 14:34:47 an-c05n02 kernel: block drbd0: self C9F77959FD4A71D3:0000000000000000:282E11A241B8A05A:282D11A241B8A05B bits:0 flags:0
- Oct 23 14:34:47 an-c05n02 kernel: block drbd0: peer C9F77959FD4A71D2:0000000000000000:282E11A241B8A05B:282D11A241B8A05B bits:263168 flags:2
- Oct 23 14:34:47 an-c05n02 kernel: block drbd0: uuid_compare()=-1 by rule 40
- Oct 23 14:34:47 an-c05n02 kernel: block drbd0: I shall become SyncTarget, but I am primary!
- Oct 23 14:34:47 an-c05n02 kernel: block drbd0: conn( WFReportParams -> Disconnecting )
- Oct 23 14:34:47 an-c05n02 kernel: block drbd0: error receiving ReportState, l: 4!
- Oct 23 14:34:47 an-c05n02 kernel: block drbd0: asender terminated
- Oct 23 14:34:47 an-c05n02 kernel: block drbd0: Terminating drbd0_asender
- Oct 23 14:34:47 an-c05n02 kernel: block drbd0: Connection closed
- Oct 23 14:34:47 an-c05n02 kernel: block drbd0: conn( Disconnecting -> StandAlone )
- Oct 23 14:34:47 an-c05n02 kernel: block drbd0: helper command: /sbin/drbdadm fence-peer minor-0
- Oct 23 14:34:47 an-c05n02 kernel: block drbd0: receiver terminated
- Oct 23 14:34:47 an-c05n02 kernel: block drbd0: Terminating drbd0_receiver
- Oct 23 14:34:47 an-c05n02 kernel: block drbd0: helper command: /sbin/drbdadm fence-peer minor-0 exit code 126 (0x7e00)
- Oct 23 14:34:47 an-c05n02 kernel: block drbd0: fence-peer helper broken, returned 126
- Oct 23 14:37:46 an-c05n02 udevd[865]: worker [9349] unexpectedly returned with status 0x0100
- Oct 23 14:37:46 an-c05n02 udevd[865]: worker [9349] failed while handling '/devices/virtual/block/drbd0'
- Oct 23 14:37:48 an-c05n02 kernel: block drbd1: Handshake successful: Agreed network protocol version 97
- Oct 23 14:37:48 an-c05n02 kernel: block drbd1: conn( WFConnection -> WFReportParams )
- Oct 23 14:37:48 an-c05n02 kernel: block drbd1: Starting asender thread (from drbd1_receiver [7153])
- Oct 23 14:37:48 an-c05n02 kernel: block drbd1: data-integrity-alg: <not-used>
- Oct 23 14:37:48 an-c05n02 kernel: block drbd1: drbd_sync_handshake:
- Oct 23 14:37:48 an-c05n02 kernel: block drbd1: self 8314C38D1738144B:0000000000000000:97927DD1274D9799:97917DD1274D9799 bits:0 flags:0
- Oct 23 14:37:48 an-c05n02 kernel: block drbd1: peer 8314C38D1738144A:0000000000000000:97927DD1274D9799:97917DD1274D9799 bits:263168 flags:2
- Oct 23 14:37:48 an-c05n02 kernel: block drbd1: uuid_compare()=-1 by rule 40
- Oct 23 14:37:48 an-c05n02 kernel: block drbd1: I shall become SyncTarget, but I am primary!
- Oct 23 14:37:48 an-c05n02 kernel: block drbd1: conn( WFReportParams -> Disconnecting )
- Oct 23 14:37:48 an-c05n02 kernel: block drbd1: error receiving ReportState, l: 4!
- Oct 23 14:37:48 an-c05n02 kernel: block drbd1: asender terminated
- Oct 23 14:37:48 an-c05n02 kernel: block drbd1: Terminating drbd1_asender
- Oct 23 14:37:48 an-c05n02 kernel: block drbd1: Connection closed
- Oct 23 14:37:48 an-c05n02 kernel: block drbd1: helper command: /sbin/drbdadm fence-peer minor-1
- Oct 23 14:37:48 an-c05n02 kernel: block drbd1: conn( Disconnecting -> StandAlone )
- Oct 23 14:37:48 an-c05n02 kernel: block drbd1: receiver terminated
- Oct 23 14:37:48 an-c05n02 kernel: block drbd1: Terminating drbd1_receiver
- Oct 23 14:37:48 an-c05n02 kernel: block drbd1: helper command: /sbin/drbdadm fence-peer minor-1 exit code 126 (0x7e00)
- Oct 23 14:37:48 an-c05n02 kernel: block drbd1: fence-peer helper broken, returned 126
- Oct 23 14:44:01 an-c05n02 kernel: block drbd0: conn( StandAlone -> Unconnected )
- Oct 23 14:44:01 an-c05n02 kernel: block drbd0: Starting receiver thread (from drbd0_worker [8965])
- Oct 23 14:44:01 an-c05n02 kernel: block drbd0: receiver (re)started
- Oct 23 14:44:01 an-c05n02 kernel: block drbd0: conn( Unconnected -> WFConnection )
- Oct 23 14:44:02 an-c05n02 kernel: block drbd0: Handshake successful: Agreed network protocol version 97
- Oct 23 14:44:02 an-c05n02 kernel: block drbd0: conn( WFConnection -> WFReportParams )
- Oct 23 14:44:02 an-c05n02 kernel: block drbd0: Starting asender thread (from drbd0_receiver [4278])
- Oct 23 14:44:02 an-c05n02 kernel: block drbd0: data-integrity-alg: <not-used>
- Oct 23 14:44:02 an-c05n02 kernel: block drbd0: drbd_sync_handshake:
- Oct 23 14:44:02 an-c05n02 kernel: block drbd0: self C9F77959FD4A71D3:0000000000000000:282E11A241B8A05A:282D11A241B8A05B bits:0 flags:0
- Oct 23 14:44:02 an-c05n02 kernel: block drbd0: peer C9F77959FD4A71D2:0000000000000000:282E11A241B8A05B:282D11A241B8A05B bits:263168 flags:2
- Oct 23 14:44:02 an-c05n02 kernel: block drbd0: uuid_compare()=-1 by rule 40
- Oct 23 14:44:02 an-c05n02 kernel: block drbd0: I shall become SyncTarget, but I am primary!
- Oct 23 14:44:02 an-c05n02 kernel: block drbd0: conn( WFReportParams -> Disconnecting )
- Oct 23 14:44:02 an-c05n02 kernel: block drbd0: error receiving ReportState, l: 4!
- Oct 23 14:44:02 an-c05n02 kernel: block drbd0: asender terminated
- Oct 23 14:44:02 an-c05n02 kernel: block drbd0: Terminating drbd0_asender
- Oct 23 14:44:02 an-c05n02 kernel: block drbd0: Connection closed
- # At this point, the mode of rhcs_fence was set back to 755. This was triggered when 'drbdadm connect r0' was run from this node.
- Oct 23 14:44:02 an-c05n02 kernel: block drbd0: helper command: /sbin/drbdadm fence-peer minor-0
- Oct 23 14:44:02 an-c05n02 kernel: block drbd0: conn( Disconnecting -> StandAlone )
- Oct 23 14:44:02 an-c05n02 kernel: block drbd0: receiver terminated
- Oct 23 14:44:02 an-c05n02 kernel: block drbd0: Terminating drbd0_receiver
- Oct 23 14:44:02 an-c05n02 rhcs_fence: Attempting to fence peer using RHCS from DRBD...
- Oct 23 14:44:21 an-c05n02 corosync[6754]: [TOTEM ] A processor failed, forming new configuration.
- Oct 23 14:44:23 an-c05n02 corosync[6754]: [QUORUM] Members[1]: 2
- Oct 23 14:44:23 an-c05n02 corosync[6754]: [TOTEM ] A processor joined or left the membership and a new membership was formed.
- Oct 23 14:44:23 an-c05n02 kernel: dlm: closing connection to node 1
- Oct 23 14:44:23 an-c05n02 corosync[6754]: [CPG ] chosen downlist: sender r(0) ip(10.20.50.2) ; members(old:2 left:1)
- Oct 23 14:44:23 an-c05n02 corosync[6754]: [MAIN ] Completed service synchronization, ready to provide service.
- Oct 23 14:44:23 an-c05n02 fenced[6807]: fencing node an-c05n01.alteeve.ca
- Oct 23 14:44:27 an-c05n02 fenced[6807]: fence an-c05n01.alteeve.ca success
- Oct 23 14:44:27 an-c05n02 fence_node[4546]: fence an-c05n01.alteeve.ca success
- Oct 23 14:44:27 an-c05n02 kernel: block drbd0: helper command: /sbin/drbdadm fence-peer minor-0 exit code 7 (0x700)
- Oct 23 14:44:27 an-c05n02 kernel: block drbd0: fence-peer helper returned 7 (peer was stonithed)
- Oct 23 14:44:27 an-c05n02 kernel: block drbd0: pdsk( DUnknown -> Outdated )
- Oct 23 14:44:27 an-c05n02 kernel: block drbd0: new current UUID BDB851199DCABE3F:C9F77959FD4A71D3:282E11A241B8A05A:282D11A241B8A05B
- Oct 23 14:44:27 an-c05n02 kernel: block drbd0: susp( 1 -> 0 )
- Oct 23 14:44:27 an-c05n02 kernel: GFS2: fsid=an-cluster-A:shared.1: jid=0: Looking at journal...
- Oct 23 14:44:27 an-c05n02 rgmanager[7487]: start on vm "vm01-dev" returned 1 (generic error)
- Oct 23 14:44:27 an-c05n02 kernel: GFS2: fsid=an-cluster-A:shared.1: jid=0: Done
- Oct 23 14:44:27 an-c05n02 rgmanager[5023]: [vm] Could not determine Hypervisor
- Oct 23 14:44:27 an-c05n02 rgmanager[7487]: status on vm "vm02-cthulhu" returned 2 (invalid argument(s))
- # vm02-cthulhu is on r1 which is still blocked at this point, hence the failures. vm01-dev was on r0 which is now working and the VM recovers.
- Oct 23 14:44:27 an-c05n02 rgmanager[7487]: Stopping service vm:vm02-cthulhu
- Oct 23 14:44:27 an-c05n02 rgmanager[7487]: #68: Failed to start vm:vm01-dev; return value: 1
- Oct 23 14:44:27 an-c05n02 rgmanager[7487]: Stopping service vm:vm01-dev
- Oct 23 14:44:28 an-c05n02 rgmanager[7487]: Service vm:vm01-dev is recovering
- Oct 23 14:44:28 an-c05n02 rgmanager[7487]: #71: Relocating failed service vm:vm01-dev
- Oct 23 14:44:28 an-c05n02 rgmanager[7487]: Service vm:vm01-dev is stopped
- Oct 23 14:44:28 an-c05n02 rgmanager[7487]: Starting stopped service vm:vm01-dev
- Oct 23 14:44:28 an-c05n02 kernel: device vnet1 entered promiscuous mode
- Oct 23 14:44:28 an-c05n02 kernel: vbr2: port 3(vnet1) entering forwarding state
- Oct 23 14:44:28 an-c05n02 qemu-kvm: Could not find keytab file: /etc/qemu/krb5.tab: No such file or directory
- Oct 23 14:44:29 an-c05n02 rgmanager[7487]: Service vm:vm01-dev started
- Oct 23 14:44:31 an-c05n02 ntpd[7102]: Listening on interface #11 vnet1, fe80::fc54:ff:fed4:2230#123 Enabled
- Oct 23 14:44:38 an-c05n02 kernel: kvm: 5354: cpu0 disabled perfctr wrmsr: 0xc1 data 0xabcd
- Oct 23 14:44:43 an-c05n02 kernel: vbr2: port 3(vnet1) entering forwarding state
- Oct 23 14:46:29 an-c05n02 corosync[6754]: [TOTEM ] A processor joined or left the membership and a new membership was formed.
- Oct 23 14:46:29 an-c05n02 corosync[6754]: [QUORUM] Members[2]: 1 2
- Oct 23 14:46:29 an-c05n02 corosync[6754]: [QUORUM] Members[2]: 1 2
- Oct 23 14:46:29 an-c05n02 corosync[6754]: [CPG ] chosen downlist: sender r(0) ip(10.20.50.1) ; members(old:1 left:0)
- Oct 23 14:46:29 an-c05n02 corosync[6754]: [MAIN ] Completed service synchronization, ready to provide service.
- Oct 23 14:49:15 an-c05n02 rgmanager[7487]: stop on vm "vm02-cthulhu" returned 1 (generic error)
- Oct 23 14:49:15 an-c05n02 rgmanager[7487]: #12: RG vm:vm02-cthulhu failed to stop; intervention required
- Oct 23 14:49:15 an-c05n02 rgmanager[7487]: Service vm:vm02-cthulhu is failed
- Oct 23 14:49:15 an-c05n02 rgmanager[7487]: #43: Service vm:vm02-cthulhu has failed; can not start.
- Oct 23 14:49:15 an-c05n02 rgmanager[7487]: #13: Service vm:vm02-cthulhu failed to stop cleanly
- Oct 23 14:52:00 an-c05n02 kernel: dlm: got connection from 1
- Oct 23 15:16:19 an-c05n02 kernel: block drbd0: conn( StandAlone -> Unconnected )
- Oct 23 15:16:19 an-c05n02 kernel: block drbd0: Starting receiver thread (from drbd0_worker [8965])
- Oct 23 15:16:19 an-c05n02 kernel: block drbd0: receiver (re)started
- Oct 23 15:16:19 an-c05n02 kernel: block drbd0: conn( Unconnected -> WFConnection )
- Oct 23 15:16:19 an-c05n02 kernel: block drbd0: Handshake successful: Agreed network protocol version 97
- Oct 23 15:16:19 an-c05n02 kernel: block drbd0: conn( WFConnection -> WFReportParams )
- Oct 23 15:16:19 an-c05n02 kernel: block drbd0: Starting asender thread (from drbd0_receiver [30109])
- Oct 23 15:16:19 an-c05n02 kernel: block drbd0: data-integrity-alg: <not-used>
- Oct 23 15:16:19 an-c05n02 kernel: block drbd0: drbd_sync_handshake:
- Oct 23 15:16:19 an-c05n02 kernel: block drbd0: self BDB851199DCABE3F:C9F77959FD4A71D3:282E11A241B8A05A:282D11A241B8A05B bits:427 flags:0
- Oct 23 15:16:19 an-c05n02 kernel: block drbd0: peer C9F77959FD4A71D2:0000000000000000:282E11A241B8A05B:282D11A241B8A05B bits:0 flags:2
- Oct 23 15:16:19 an-c05n02 kernel: block drbd0: uuid_compare()=1 by rule 70
- Oct 23 15:16:19 an-c05n02 kernel: block drbd0: peer( Unknown -> Secondary ) conn( WFReportParams -> WFBitMapS ) pdsk( Outdated -> Consistent )
- Oct 23 15:16:19 an-c05n02 kernel: block drbd0: helper command: /sbin/drbdadm before-resync-source minor-0
- Oct 23 15:16:19 an-c05n02 kernel: block drbd0: helper command: /sbin/drbdadm before-resync-source minor-0 exit code 0 (0x0)
- Oct 23 15:16:19 an-c05n02 kernel: block drbd0: conn( WFBitMapS -> SyncSource ) pdsk( Consistent -> Inconsistent )
- Oct 23 15:16:19 an-c05n02 kernel: block drbd0: Began resync as SyncSource (will sync 1708 KB [427 bits set]).
- Oct 23 15:16:19 an-c05n02 kernel: block drbd0: updated sync UUID BDB851199DCABE3F:C9F87959FD4A71D3:C9F77959FD4A71D3:282E11A241B8A05A
- Oct 23 15:16:22 an-c05n02 kernel: block drbd0: Resync done (total 2 sec; paused 0 sec; 852 K/sec)
- Oct 23 15:16:22 an-c05n02 kernel: block drbd0: updated UUIDs BDB851199DCABE3F:0000000000000000:C9F87959FD4A71D3:C9F77959FD4A71D3
- Oct 23 15:16:22 an-c05n02 kernel: block drbd0: conn( SyncSource -> Connected ) pdsk( Inconsistent -> UpToDate )
- Oct 23 15:16:22 an-c05n02 kernel: block drbd0: bitmap WRITE of 3153 pages took 10 jiffies
- Oct 23 15:16:22 an-c05n02 kernel: block drbd0: 0 KB (0 bits) marked out-of-sync by on disk bit-map.
- Oct 23 15:18:22 an-c05n02 kernel: block drbd1: conn( StandAlone -> Unconnected )
- Oct 23 15:18:22 an-c05n02 kernel: block drbd1: Starting receiver thread (from drbd1_worker [7142])
- Oct 23 15:18:22 an-c05n02 kernel: block drbd1: receiver (re)started
- Oct 23 15:18:22 an-c05n02 kernel: block drbd1: conn( Unconnected -> WFConnection )
- Oct 23 15:18:23 an-c05n02 kernel: block drbd1: Handshake successful: Agreed network protocol version 97
- Oct 23 15:18:23 an-c05n02 kernel: block drbd1: conn( WFConnection -> WFReportParams )
- Oct 23 15:18:23 an-c05n02 kernel: block drbd1: Starting asender thread (from drbd1_receiver [32137])
- Oct 23 15:18:23 an-c05n02 kernel: block drbd1: data-integrity-alg: <not-used>
- Oct 23 15:18:23 an-c05n02 kernel: block drbd1: drbd_sync_handshake:
- Oct 23 15:18:23 an-c05n02 kernel: block drbd1: self 8314C38D1738144B:0000000000000000:97927DD1274D9799:97917DD1274D9799 bits:0 flags:0
- Oct 23 15:18:23 an-c05n02 kernel: block drbd1: peer 8314C38D1738144A:0000000000000000:97927DD1274D9799:97917DD1274D9799 bits:0 flags:2
- Oct 23 15:18:23 an-c05n02 kernel: block drbd1: uuid_compare()=-1 by rule 40
- Oct 23 15:18:23 an-c05n02 kernel: block drbd1: I shall become SyncTarget, but I am primary!
- Oct 23 15:18:23 an-c05n02 kernel: block drbd1: conn( WFReportParams -> Disconnecting )
- Oct 23 15:18:23 an-c05n02 kernel: block drbd1: error receiving ReportState, l: 4!
- Oct 23 15:18:23 an-c05n02 kernel: block drbd1: asender terminated
- Oct 23 15:18:23 an-c05n02 kernel: block drbd1: Terminating drbd1_asender
- Oct 23 15:18:23 an-c05n02 kernel: block drbd1: Connection closed
- Oct 23 15:18:23 an-c05n02 kernel: block drbd1: helper command: /sbin/drbdadm fence-peer minor-1
- Oct 23 15:18:23 an-c05n02 kernel: block drbd1: conn( Disconnecting -> StandAlone )
- Oct 23 15:18:23 an-c05n02 kernel: block drbd1: receiver terminated
- Oct 23 15:18:23 an-c05n02 kernel: block drbd1: Terminating drbd1_receiver
- Oct 23 15:18:23 an-c05n02 kernel: block drbd0: peer( Secondary -> Primary )
- Oct 23 15:18:23 an-c05n02 rhcs_fence: Attempting to fence peer using RHCS from DRBD...
- Oct 23 15:18:24 an-c05n02 corosync[6754]: cman killed by node 1 because we were killed by cman_tool or other application
- Oct 23 15:18:24 an-c05n02 fenced[6807]: cluster is down, exiting
- Oct 23 15:18:24 an-c05n02 gfs_controld[6882]: cluster is down, exiting
- Oct 23 15:18:24 an-c05n02 dlm_controld[6833]: cluster is down, exiting
- Oct 23 15:18:24 an-c05n02 fenced[6807]: daemon cpg_dispatch error 2
- Oct 23 15:18:24 an-c05n02 gfs_controld[6882]: daemon cpg_dispatch error 2
- Oct 23 15:18:24 an-c05n02 dlm_controld[6833]: daemon cpg_dispatch error 2
- Oct 23 15:18:24 an-c05n02 rgmanager[7487]: #67: Shutting down uncleanly
- Oct 23 15:18:24 an-c05n02 rgmanager[32242]: [script] Executing /etc/init.d/libvirtd stop
- Oct 23 15:18:24 an-c05n02 rgmanager[32270]: [vm] Could not determine Hypervisor
- Oct 23 15:18:24 an-c05n02 rgmanager[7487]: stop on vm "vm01-dev" returned 2 (invalid argument(s))
- Oct 23 15:18:24 an-c05n02 rgmanager[32290]: [vm] Could not determine Hypervisor
- Oct 23 15:18:24 an-c05n02 rgmanager[7487]: stop on vm "vm02-cthulhu" returned 2 (invalid argument(s))
- Oct 23 15:18:24 an-c05n02 rgmanager[32307]: [script] Executing /etc/init.d/gfs2 stop
- Oct 23 15:18:32 an-c05n02 kernel: dlm: closing connection to node 1
- Oct 23 15:18:32 an-c05n02 kernel: dlm: closing connection to node 2
- Oct 23 15:18:32 an-c05n02 kernel: dlm: shared: no userland control daemon, stopping lockspace
- Oct 23 15:18:32 an-c05n02 kernel: dlm: clvmd: no userland control daemon, stopping lockspace
- Oct 23 15:18:32 an-c05n02 kernel: dlm: rgmanager: no userland control daemon, stopping lockspace
- Oct 23 15:18:32 an-c05n02 kernel: block drbd1: helper command: /sbin/drbdadm fence-peer minor-1 exit code 1 (0x100)
- Oct 23 15:18:32 an-c05n02 kernel: block drbd1: fence-peer helper broken, returned 1
- # This is where node 1 fenced this node when trying to 'drbdadm connect r1'.
- Write failed: Broken pipe
Advertisement
Add Comment
Please, Sign In to add comment
Advertisement