########## Logs from srv2-2 ########## // Online verification started Feb 16 04:00:01 srv2-2 kernel: block drbd1: conn( Connected -> VerifyT ) Feb 16 04:00:01 srv2-2 kernel: block drbd1: Online Verify start sector: 0 // Many many many errors during 3 days ! Feb 17 20:31:08 srv2-2 kernel: block drbd1: [drbd1_worker/30804] sock_sendmsg time expired, ko = 4294967295 Feb 17 20:31:14 srv2-2 kernel: block drbd1: [drbd1_worker/30804] sock_sendmsg time expired, ko = 4294967294 (...) Feb 19 19:20:50 srv2-2 kernel: block drbd1: [drbd1_worker/30804] sock_sendmsg time expired, ko = 4294939198 Feb 19 19:20:56 srv2-2 kernel: block drbd1: [drbd1_worker/30804] sock_sendmsg time expired, ko = 4294939197 // Close network communication between srv2-1 and srv2-2 // srv2-2 can now be reached ! // Change srv2-1 from Secondary to Primary Feb 19 19:20:56 srv2-2 kernel: block drbd1: PingAck did not arrive in time. Feb 19 19:20:56 srv2-2 kernel: block drbd1: peer( Secondary -> Unknown ) conn( VerifyT -> NetworkFailure ) pdsk( UpToDate -> DUnknown ) Feb 19 19:20:56 srv2-2 kernel: block drbd1: Online Verify reached sector 2983106184 Feb 19 19:20:56 srv2-2 kernel: block drbd1: drbd_pp_alloc interrupted! Feb 19 19:20:56 srv2-2 kernel: block drbd1: error receiving OVRequest, l: 24! Feb 19 19:20:56 srv2-2 kernel: block drbd1: asender terminated Feb 19 19:20:56 srv2-2 kernel: block drbd1: Terminating asender thread Feb 19 19:21:02 srv2-2 kernel: block drbd1: new current UUID 7033A3893BFC813F:78A03E42A1044DAD:1F43CD451DB9AFCB:1F42CD451DB9AFCB Feb 19 19:21:02 srv2-2 kernel: block drbd1: Connection closed Feb 19 19:21:02 srv2-2 kernel: block drbd1: conn( NetworkFailure -> Unconnected ) Feb 19 19:21:02 srv2-2 kernel: block drbd1: bitmap WRITE of 1 pages took 0 jiffies Feb 19 19:21:02 srv2-2 kernel: block drbd1: 12 MB (2977 bits) marked out-of-sync by on disk bit-map. Feb 19 19:21:02 srv2-2 kernel: block drbd1: receiver terminated Feb 19 19:21:02 srv2-2 kernel: block drbd1: Restarting receiver thread Feb 19 19:21:02 srv2-2 kernel: block drbd1: receiver (re)started Feb 19 19:21:02 srv2-2 kernel: block drbd1: conn( Unconnected -> WFConnection ) // Reboot of srv2-2 Feb 19 19:32:06 srv2-2 kernel: drbd: initialized. Version: 8.3.14 (api:88/proto:86-97) Feb 19 19:32:06 srv2-2 kernel: drbd: GIT-hash: 3fc56aae927694b31eb959d922c9eec3f491f75c build by root@srv2-2, 2012-11-06 15:28:52 Feb 19 19:32:06 srv2-2 kernel: drbd: registered as block device major 147 Feb 19 19:32:06 srv2-2 kernel: drbd: minor_table @ 0xedcac480 Feb 19 19:32:06 srv2-2 kernel: block drbd1: Starting worker thread (from cqueue [3107]) Feb 19 19:32:06 srv2-2 kernel: block drbd1: disk( Diskless -> Attaching ) Feb 19 19:32:06 srv2-2 kernel: block drbd1: Found 4 transactions (192 active extents) in activity log. Feb 19 19:32:06 srv2-2 kernel: block drbd1: Method to ensure write ordering: barrier Feb 19 19:32:06 srv2-2 kernel: block drbd1: max BIO size = 130560 Feb 19 19:32:06 srv2-2 kernel: block drbd1: drbd_bm_resize called with capacity == 3896416992 Feb 19 19:32:06 srv2-2 kernel: block drbd1: resync bitmap: bits=487052124 words=15220380 pages=14864 Feb 19 19:32:06 srv2-2 kernel: block drbd1: size = 1858 GB (1948208496 KB) Feb 19 19:32:06 srv2-2 kernel: block drbd1: bitmap READ of 14864 pages took 157 jiffies Feb 19 19:32:06 srv2-2 kernel: block drbd1: recounting of set bits took additional 13 jiffies Feb 19 19:32:06 srv2-2 kernel: block drbd1: 157 MB (40258 bits) marked out-of-sync by on disk bit-map. Feb 19 19:32:06 srv2-2 kernel: block drbd1: Marked additional 471 MB as out-of-sync based on AL. Feb 19 19:32:06 srv2-2 kernel: block drbd1: bitmap WRITE of 0 pages took 0 jiffies Feb 19 19:32:07 srv2-2 kernel: block drbd1: 628 MB (160811 bits) marked out-of-sync by on disk bit-map. Feb 19 19:32:07 srv2-2 kernel: block drbd1: disk( Attaching -> UpToDate ) Feb 19 19:32:07 srv2-2 kernel: block drbd1: attached to UUIDs 7033A3893BFC813F:78A03E42A1044DAD:1F43CD451DB9AFCB:1F42CD451DB9AFCB Feb 19 19:32:07 srv2-2 kernel: block drbd1: conn( StandAlone -> Unconnected ) Feb 19 19:32:07 srv2-2 kernel: block drbd1: Starting receiver thread (from drbd1_worker [3116]) Feb 19 19:32:07 srv2-2 kernel: block drbd1: receiver (re)started Feb 19 19:32:07 srv2-2 kernel: block drbd1: conn( Unconnected -> WFConnection ) // Open network communication between srv2-1 and srv2-2 Feb 19 19:34:29 srv2-2 kernel: block drbd1: Handshake successful: Agreed network protocol version 97 Feb 19 19:34:29 srv2-2 kernel: block drbd1: conn( WFConnection -> WFReportParams ) Feb 19 19:34:29 srv2-2 kernel: block drbd1: Starting asender thread (from drbd1_receiver [3125]) Feb 19 19:34:29 srv2-2 kernel: block drbd1: data-integrity-alg: Feb 19 19:34:29 srv2-2 kernel: block drbd1: drbd_sync_handshake: Feb 19 19:34:29 srv2-2 kernel: block drbd1: self 7033A3893BFC813E:78A03E42A1044DAD:1F43CD451DB9AFCB:1F42CD451DB9AFCB bits:160811 flags:0 Feb 19 19:34:29 srv2-2 kernel: block drbd1: peer 892066CDDBD054B9:78A03E42A1044DAC:1F43CD451DB9AFCA:1F42CD451DB9AFCB bits:38627 flags:0 Feb 19 19:34:29 srv2-2 kernel: block drbd1: uuid_compare()=100 by rule 90 Feb 19 19:34:29 srv2-2 kernel: block drbd1: helper command: /sbin/drbdadm initial-split-brain minor-1 Feb 19 19:34:29 srv2-2 kernel: block drbd1: helper command: /sbin/drbdadm initial-split-brain minor-1 exit code 0 (0x0) Feb 19 19:34:29 srv2-2 kernel: block drbd1: Split-Brain detected, 1 primaries, automatically solved. Sync from peer node Feb 19 19:34:29 srv2-2 kernel: block drbd1: peer( Unknown -> Primary ) conn( WFReportParams -> WFBitMapT ) disk( UpToDate -> Outdated ) pdsk( DUnknown -> UpToDate ) Feb 19 19:34:41 srv2-2 kernel: block drbd1: conn( WFBitMapT -> WFSyncUUID ) Feb 19 19:34:42 srv2-2 kernel: block drbd1: updated sync uuid 78A13E42A1044DAC:0000000000000000:1F43CD451DB9AFCB:1F42CD451DB9AFCB Feb 19 19:34:42 srv2-2 kernel: block drbd1: helper command: /sbin/drbdadm before-resync-target minor-1 Feb 19 19:34:42 srv2-2 kernel: block drbd1: helper command: /sbin/drbdadm before-resync-target minor-1 exit code 0 (0x0) Feb 19 19:34:42 srv2-2 kernel: block drbd1: conn( WFSyncUUID -> SyncTarget ) disk( Outdated -> Inconsistent ) Feb 19 19:34:42 srv2-2 kernel: block drbd1: Began resync as SyncTarget (will sync 786200 KB [196550 bits set]). Feb 19 19:35:59 srv2-2 kernel: block drbd1: Resync done (total 76 sec; paused 0 sec; 10344 K/sec) Feb 19 19:35:59 srv2-2 kernel: block drbd1: updated UUIDs 892066CDDBD054B8:0000000000000000:78A13E42A1044DAC:78A03E42A1044DAC Feb 19 19:35:59 srv2-2 kernel: block drbd1: conn( SyncTarget -> Connected ) disk( Inconsistent -> UpToDate ) Feb 19 19:35:59 srv2-2 kernel: block drbd1: helper command: /sbin/drbdadm after-resync-target minor-1 Feb 19 19:35:59 srv2-2 kernel: block drbd1: helper command: /sbin/drbdadm after-resync-target minor-1 exit code 0 (0x0) Feb 19 19:36:00 srv2-2 kernel: block drbd1: bitmap WRITE of 14340 pages took 448 jiffies Feb 19 19:36:01 srv2-2 kernel: block drbd1: 0 KB (0 bits) marked out-of-sync by on disk bit-map.