Not a member of Pastebin yet?
Sign Up,
it unlocks many cool features!
- #### FROM SECONDARY NODE ####
- # tail -n 100 /var/log/messages
- Jun 30 01:52:58 secondarynode kernel: block drbd0: Starting worker thread (from kworker/u:3 [4103])
- Jun 30 01:52:58 secondarynode kernel: block drbd0: disk( Diskless -> Attaching )
- Jun 30 01:52:58 secondarynode kernel: block drbd0: No usable activity log found.
- Jun 30 01:52:58 secondarynode kernel: block drbd0: Method to ensure write ordering: flush
- Jun 30 01:52:58 secondarynode kernel: block drbd0: max_segment_size ( = BIO size ) = 65536
- Jun 30 01:52:58 secondarynode kernel: block drbd0: drbd_bm_resize called with capacity == 1023896
- Jun 30 01:52:58 secondarynode kernel: block drbd0: resync bitmap: bits=127987 words=2000
- Jun 30 01:52:58 secondarynode kernel: block drbd0: size = 500 MB (511948 KB)
- Jun 30 01:52:58 secondarynode kernel: block drbd0: recounting of set bits took additional 0 jiffies
- Jun 30 01:52:58 secondarynode kernel: block drbd0: 0 KB (0 bits) marked out-of-sync by on disk bit-map.
- Jun 30 01:52:58 secondarynode kernel: block drbd0: disk( Attaching -> UpToDate )
- Jun 30 01:52:58 secondarynode kernel: block drbd1: Starting worker thread (from kworker/u:3 [4103])
- Jun 30 01:52:58 secondarynode kernel: block drbd1: disk( Diskless -> Attaching )
- Jun 30 01:52:58 secondarynode kernel: block drbd1: No usable activity log found.
- Jun 30 01:52:58 secondarynode kernel: block drbd1: Method to ensure write ordering: flush
- Jun 30 01:52:58 secondarynode kernel: block drbd1: max_segment_size ( = BIO size ) = 65536
- Jun 30 01:52:58 secondarynode kernel: block drbd1: drbd_bm_resize called with capacity == 611309264
- Jun 30 01:52:58 secondarynode kernel: block drbd1: resync bitmap: bits=76413658 words=1193964
- Jun 30 01:52:58 secondarynode kernel: block drbd1: size = 291 GB (305654632 KB)
- Jun 30 01:52:58 secondarynode kernel: block drbd1: recounting of set bits took additional 0 jiffies
- Jun 30 01:52:58 secondarynode kernel: block drbd1: 0 KB (0 bits) marked out-of-sync by on disk bit-map.
- Jun 30 01:52:58 secondarynode kernel: block drbd1: disk( Attaching -> UpToDate )
- Jun 30 01:52:58 secondarynode kernel: block drbd0: conn( StandAlone -> Unconnected )
- Jun 30 01:52:58 secondarynode kernel: block drbd0: Starting receiver thread (from drbd0_worker [3288])
- Jun 30 01:52:58 secondarynode kernel: block drbd0: receiver (re)started
- Jun 30 01:52:58 secondarynode kernel: block drbd0: conn( Unconnected -> WFConnection )
- Jun 30 01:52:58 secondarynode kernel: block drbd1: conn( StandAlone -> Unconnected )
- Jun 30 01:52:58 secondarynode kernel: block drbd1: Starting receiver thread (from drbd1_worker [3297])
- Jun 30 01:52:58 secondarynode kernel: block drbd1: receiver (re)started
- Jun 30 01:52:58 secondarynode kernel: block drbd1: conn( Unconnected -> WFConnection )
- Jun 30 01:52:58 secondarynode kernel: block drbd0: Handshake successful: Agreed network protocol version 95
- Jun 30 01:52:58 secondarynode kernel: block drbd0: conn( WFConnection -> WFReportParams )
- Jun 30 01:52:58 secondarynode kernel: block drbd0: Starting asender thread (from drbd0_receiver [3313])
- Jun 30 01:52:58 secondarynode kernel: block drbd0: data-integrity-alg: <not-used>
- Jun 30 01:52:58 secondarynode kernel: block drbd0: max_segment_size ( = BIO size ) = 65536
- Jun 30 01:52:58 secondarynode kernel: block drbd0: drbd_sync_handshake:
- Jun 30 01:52:58 secondarynode kernel: block drbd0: self C3C0A265E9277308:0000000000000000:173E3BD3C3CA5CD4:BBF4F1769486E305 bits:0 flags:0
- Jun 30 01:52:58 secondarynode kernel: block drbd0: peer 378FBD5A1F9BAB2D:C3C0A265E9277309:173E3BD3C3CA5CD4:BBF4F1769486E305 bits:0 flags:0
- Jun 30 01:52:58 secondarynode kernel: block drbd0: uuid_compare()=-1 by rule 50
- Jun 30 01:52:58 secondarynode kernel: block drbd0: peer( Unknown -> Primary ) conn( WFReportParams -> WFBitMapT ) pdsk( DUnknown -> UpToDate )
- Jun 30 01:52:58 secondarynode kernel: block drbd1: Handshake successful: Agreed network protocol version 95
- Jun 30 01:52:58 secondarynode kernel: block drbd1: conn( WFConnection -> WFReportParams )
- Jun 30 01:52:58 secondarynode kernel: block drbd1: Starting asender thread (from drbd1_receiver [3324])
- Jun 30 01:52:58 secondarynode kernel: block drbd1: data-integrity-alg: <not-used>
- Jun 30 01:52:58 secondarynode kernel: block drbd1: max_segment_size ( = BIO size ) = 65536
- Jun 30 01:52:58 secondarynode kernel: block drbd1: drbd_sync_handshake:
- Jun 30 01:52:58 secondarynode kernel: block drbd1: self FC433F9C35D4E19C:0000000000000000:EFD171D1BE6D85C4:305E2BE3E64F5FA9 bits:0 flags:0
- Jun 30 01:52:58 secondarynode kernel: block drbd1: peer BDB69235F03FB11B:FC433F9C35D4E19D:EFD171D1BE6D85C5:305E2BE3E64F5FA9 bits:28134237 flags:0
- Jun 30 01:52:58 secondarynode kernel: block drbd1: uuid_compare()=-1 by rule 50
- Jun 30 01:52:58 secondarynode kernel: block drbd1: peer( Unknown -> Primary ) conn( WFReportParams -> WFBitMapT ) pdsk( DUnknown -> UpToDate )
- ## I noticed the time was wrong so I changed it here ##
- Jun 30 22:08:22 secondarynode kernel: block drbd1: meta connection shut down by peer.
- Jun 30 22:08:22 secondarynode kernel: block drbd1: peer( Primary -> Unknown ) conn( WFBitMapT -> NetworkFailure ) pdsk( UpToDate -> DUnknown )
- Jun 30 22:08:22 secondarynode kernel: block drbd1: short read expecting header on sock: r=-512
- Jun 30 22:08:22 secondarynode kernel: block drbd0: meta connection shut down by peer.
- Jun 30 22:08:22 secondarynode kernel: block drbd0: peer( Primary -> Unknown ) conn( WFBitMapT -> NetworkFailure ) pdsk( UpToDate -> DUnknown )
- Jun 30 22:08:22 secondarynode kernel: block drbd0: short read expecting header on sock: r=-512
- Jun 30 22:08:22 secondarynode kernel: block drbd1: asender terminated
- Jun 30 22:08:22 secondarynode kernel: block drbd1: Terminating drbd1_asender
- Jun 30 22:08:22 secondarynode kernel: block drbd1: Connection closed
- Jun 30 22:08:22 secondarynode kernel: block drbd1: conn( NetworkFailure -> Unconnected )
- Jun 30 22:08:22 secondarynode kernel: block drbd1: receiver terminated
- Jun 30 22:08:22 secondarynode kernel: block drbd1: Restarting drbd1_receiver
- Jun 30 22:08:22 secondarynode kernel: block drbd1: receiver (re)started
- Jun 30 22:08:22 secondarynode kernel: block drbd1: conn( Unconnected -> WFConnection )
- Jun 30 22:08:22 secondarynode kernel: block drbd0: asender terminated
- Jun 30 22:08:22 secondarynode kernel: block drbd0: Terminating drbd0_asender
- Jun 30 22:08:22 secondarynode kernel: block drbd0: Connection closed
- Jun 30 22:08:22 secondarynode kernel: block drbd0: conn( NetworkFailure -> Unconnected )
- Jun 30 22:08:22 secondarynode kernel: block drbd0: receiver terminated
- Jun 30 22:08:22 secondarynode kernel: block drbd0: Restarting drbd0_receiver
- Jun 30 22:08:22 secondarynode kernel: block drbd0: receiver (re)started
- Jun 30 22:08:22 secondarynode kernel: block drbd0: conn( Unconnected -> WFConnection )
- Jun 30 22:08:22 secondarynode kernel: block drbd0: Handshake successful: Agreed network protocol version 95
- Jun 30 22:08:22 secondarynode kernel: block drbd0: conn( WFConnection -> WFReportParams )
- Jun 30 22:08:22 secondarynode kernel: block drbd1: Handshake successful: Agreed network protocol version 95
- Jun 30 22:08:22 secondarynode kernel: block drbd1: conn( WFConnection -> WFReportParams )
- Jun 30 22:08:22 secondarynode kernel: block drbd1: Starting asender thread (from drbd1_receiver [3324])
- Jun 30 22:08:22 secondarynode kernel: block drbd0: Starting asender thread (from drbd0_receiver [3313])
- Jun 30 22:08:22 secondarynode kernel: block drbd0: data-integrity-alg: <not-used>
- Jun 30 22:08:22 secondarynode kernel: block drbd0: max_segment_size ( = BIO size ) = 65536
- Jun 30 22:08:22 secondarynode kernel: block drbd1: data-integrity-alg: <not-used>
- Jun 30 22:08:22 secondarynode kernel: block drbd1: max_segment_size ( = BIO size ) = 65536
- Jun 30 22:08:22 secondarynode kernel: block drbd0: drbd_sync_handshake:
- Jun 30 22:08:22 secondarynode kernel: block drbd0: self C3C0A265E9277308:0000000000000000:173E3BD3C3CA5CD4:BBF4F1769486E305 bits:0 flags:0
- Jun 30 22:08:22 secondarynode kernel: block drbd0: peer 378FBD5A1F9BAB2D:C3C0A265E9277309:173E3BD3C3CA5CD4:BBF4F1769486E305 bits:0 flags:0
- Jun 30 22:08:22 secondarynode kernel: block drbd0: uuid_compare()=-1 by rule 50
- Jun 30 22:08:22 secondarynode kernel: block drbd0: peer( Unknown -> Primary ) conn( WFReportParams -> WFBitMapT ) pdsk( DUnknown -> UpToDate )
- # /etc/init.d/drbd status
- DRBD module version: 8.3.9
- userland version: 8.3.8
- you should upgrade your drbd tools!
- * drbd driver loaded OK; device status: ... [ ok ]
- version: 8.3.9 (api:88/proto:86-95)
- built-in
- 0: cs:WFBitMapT ro:Secondary/Primary ds:UpToDate/UpToDate C r-----
- ns:0 nr:0 dw:0 dr:0 al:0 bm:0 lo:0 pe:0 ua:0 ap:0 ep:1 wo:f oos:0
- 1: cs:WFReportParams ro:Secondary/Unknown ds:UpToDate/DUnknown C r-----
- ns:0 nr:0 dw:0 dr:0 al:0 bm:0 lo:0 pe:0 ua:0 ap:0 ep:1 wo:f oos:0
- # drbd-overview
- 0:meta WFBitMapT Secondary/Primary UpToDate/UpToDate C r-----
- 1:data WFReportParams Secondary/Unknown UpToDate/DUnknown C r-----
- # ps aux | grep drbd | grep -v grep
- root 3288 0.0 0.0 0 0 ? S 21:52 0:00 [drbd0_worker]
- root 3297 0.0 0.0 0 0 ? S 21:52 0:00 [drbd1_worker]
- root 3313 0.0 0.0 0 0 ? S 21:52 0:00 [drbd0_receiver]
- root 3324 0.0 0.0 0 0 ? S 21:52 0:00 [drbd1_receiver]
- root 7098 0.0 0.0 0 0 ? S 22:08 0:00 [drbd1_asender]
- root 7099 0.0 0.0 0 0 ? S 22:08 0:00 [drbd0_asender]
- root 14498 0.0 0.0 8356 992 tty1 S+ Jun28 2:12 watch cat /proc/drbd
- ##################### PRIMARY NODE #####################
- # tail -n 300 /var/log/messages
- Jun 30 17:52:33 primarynode kernel: block drbd0: Handshake successful: Agreed network protocol version 95
- Jun 30 17:52:33 primarynode kernel: block drbd0: conn( WFConnection -> WFReportParams )
- Jun 30 17:52:33 primarynode kernel: block drbd0: Starting asender thread (from drbd0_receiver [14311])
- Jun 30 17:52:33 primarynode kernel: block drbd0: data-integrity-alg: <not-used>
- Jun 30 17:52:33 primarynode kernel: block drbd0: max_segment_size ( = BIO size ) = 65536
- Jun 30 17:52:33 primarynode kernel: block drbd0: drbd_sync_handshake:
- Jun 30 17:52:33 primarynode kernel: block drbd0: self 378FBD5A1F9BAB2D:C3C0A265E9277309:173E3BD3C3CA5CD4:BBF4F1769486E305 bits:0 flags:0
- Jun 30 17:52:33 primarynode kernel: block drbd0: peer C3C0A265E9277308:0000000000000000:173E3BD3C3CA5CD4:BBF4F1769486E305 bits:0 flags:0
- Jun 30 17:52:33 primarynode kernel: block drbd0: uuid_compare()=1 by rule 70
- Jun 30 17:52:33 primarynode kernel: block drbd0: peer( Unknown -> Secondary ) conn( WFReportParams -> WFBitMapS ) pdsk( DUnknown -> UpToDate )
- Jun 30 17:52:33 primarynode kernel: block drbd1: Handshake successful: Agreed network protocol version 95
- Jun 30 17:52:33 primarynode kernel: block drbd1: conn( WFConnection -> WFReportParams )
- Jun 30 17:52:33 primarynode kernel: block drbd1: Starting asender thread (from drbd1_receiver [14322])
- Jun 30 17:52:33 primarynode kernel: block drbd1: data-integrity-alg: <not-used>
- Jun 30 17:52:33 primarynode kernel: block drbd1: max_segment_size ( = BIO size ) = 65536
- Jun 30 17:52:33 primarynode kernel: block drbd1: drbd_sync_handshake:
- Jun 30 17:52:33 primarynode kernel: block drbd1: self BDB69235F03FB11B:FC433F9C35D4E19D:EFD171D1BE6D85C5:305E2BE3E64F5FA9 bits:28134237 flags:0
- Jun 30 17:52:33 primarynode kernel: block drbd1: peer FC433F9C35D4E19C:0000000000000000:EFD171D1BE6D85C4:305E2BE3E64F5FA9 bits:0 flags:0
- Jun 30 17:52:33 primarynode kernel: block drbd1: uuid_compare()=1 by rule 70
- Jun 30 17:52:33 primarynode kernel: block drbd1: peer( Unknown -> Secondary ) conn( WFReportParams -> WFBitMapS ) pdsk( DUnknown -> UpToDate )
- Jun 30 17:52:45 primarynode kernel: block drbd1: [drbd1_worker/14277] sock_sendmsg time expired, ko = 4294967295
- Jun 30 17:52:51 primarynode kernel: block drbd1: [drbd1_worker/14277] sock_sendmsg time expired, ko = 4294967294
- Jun 30 17:52:57 primarynode kernel: block drbd1: [drbd1_worker/14277] sock_sendmsg time expired, ko = 4294967293
- Jun 30 17:53:03 primarynode kernel: block drbd1: [drbd1_worker/14277] sock_sendmsg time expired, ko = 4294967292
- Jun 30 17:53:09 primarynode kernel: block drbd1: [drbd1_worker/14277] sock_sendmsg time expired, ko = 4294967291
- Jun 30 17:53:15 primarynode kernel: block drbd1: [drbd1_worker/14277] sock_sendmsg time expired, ko = 4294967290
- Jun 30 17:53:21 primarynode kernel: block drbd1: [drbd1_worker/14277] sock_sendmsg time expired, ko = 4294967289
- Jun 30 17:53:27 primarynode kernel: block drbd1: [drbd1_worker/14277] sock_sendmsg time expired, ko = 4294967288
- Jun 30 17:53:33 primarynode kernel: block drbd1: [drbd1_worker/14277] sock_sendmsg time expired, ko = 4294967287
- Jun 30 17:53:39 primarynode kernel: block drbd1: [drbd1_worker/14277] sock_sendmsg time expired, ko = 4294967286
- Jun 30 17:53:45 primarynode kernel: block drbd1: [drbd1_worker/14277] sock_sendmsg time expired, ko = 4294967285
- Jun 30 17:53:51 primarynode kernel: block drbd1: [drbd1_worker/14277] sock_sendmsg time expired, ko = 4294967284
- Jun 30 17:53:57 primarynode kernel: block drbd1: [drbd1_worker/14277] sock_sendmsg time expired, ko = 4294967283
- Jun 30 17:54:03 primarynode kernel: block drbd1: [drbd1_worker/14277] sock_sendmsg time expired, ko = 4294967282
- Jun 30 17:54:09 primarynode kernel: block drbd1: [drbd1_worker/14277] sock_sendmsg time expired, ko = 4294967281
- Jun 30 17:54:15 primarynode kernel: block drbd1: [drbd1_worker/14277] sock_sendmsg time expired, ko = 4294967280
- Jun 30 17:54:21 primarynode kernel: block drbd1: [drbd1_worker/14277] sock_sendmsg time expired, ko = 4294967279
- Jun 30 17:54:27 primarynode kernel: block drbd1: [drbd1_worker/14277] sock_sendmsg time expired, ko = 4294967278
- Jun 30 17:54:33 primarynode kernel: block drbd1: [drbd1_worker/14277] sock_sendmsg time expired, ko = 4294967277
- Jun 30 17:54:39 primarynode kernel: block drbd1: [drbd1_worker/14277] sock_sendmsg time expired, ko = 4294967276
- Jun 30 17:54:45 primarynode kernel: block drbd1: [drbd1_worker/14277] sock_sendmsg time expired, ko = 4294967275
- Jun 30 17:54:51 primarynode kernel: block drbd1: [drbd1_worker/14277] sock_sendmsg time expired, ko = 4294967274
- Jun 30 17:54:57 primarynode kernel: block drbd1: [drbd1_worker/14277] sock_sendmsg time expired, ko = 4294967273
- Jun 30 17:55:03 primarynode kernel: block drbd1: [drbd1_worker/14277] sock_sendmsg time expired, ko = 4294967272
- Jun 30 17:55:09 primarynode kernel: block drbd1: [drbd1_worker/14277] sock_sendmsg time expired, ko = 4294967271
- Jun 30 17:55:15 primarynode kernel: block drbd1: [drbd1_worker/14277] sock_sendmsg time expired, ko = 4294967270
- Jun 30 17:55:21 primarynode kernel: block drbd1: [drbd1_worker/14277] sock_sendmsg time expired, ko = 4294967269
- Jun 30 17:55:27 primarynode kernel: block drbd1: [drbd1_worker/14277] sock_sendmsg time expired, ko = 4294967268
- Jun 30 17:55:33 primarynode kernel: block drbd1: [drbd1_worker/14277] sock_sendmsg time expired, ko = 4294967267
- Jun 30 17:55:39 primarynode kernel: block drbd1: [drbd1_worker/14277] sock_sendmsg time expired, ko = 4294967266
- Jun 30 17:55:45 primarynode kernel: block drbd1: [drbd1_worker/14277] sock_sendmsg time expired, ko = 4294967265
- Jun 30 17:55:51 primarynode kernel: block drbd1: [drbd1_worker/14277] sock_sendmsg time expired, ko = 4294967264
- Jun 30 17:55:57 primarynode kernel: block drbd1: [drbd1_worker/14277] sock_sendmsg time expired, ko = 4294967263
- Jun 30 17:56:03 primarynode kernel: block drbd1: [drbd1_worker/14277] sock_sendmsg time expired, ko = 4294967262
- Jun 30 17:56:09 primarynode kernel: block drbd1: [drbd1_worker/14277] sock_sendmsg time expired, ko = 4294967261
- Jun 30 17:56:15 primarynode kernel: block drbd1: [drbd1_worker/14277] sock_sendmsg time expired, ko = 4294967260
- Jun 30 17:56:21 primarynode kernel: block drbd1: [drbd1_worker/14277] sock_sendmsg time expired, ko = 4294967259
- Jun 30 17:56:27 primarynode kernel: block drbd1: [drbd1_worker/14277] sock_sendmsg time expired, ko = 4294967258
- Jun 30 17:56:33 primarynode kernel: block drbd1: [drbd1_worker/14277] sock_sendmsg time expired, ko = 4294967257
- Jun 30 17:56:39 primarynode kernel: block drbd1: [drbd1_worker/14277] sock_sendmsg time expired, ko = 4294967256
- Jun 30 17:56:45 primarynode kernel: block drbd1: [drbd1_worker/14277] sock_sendmsg time expired, ko = 4294967255
- Jun 30 17:56:51 primarynode kernel: block drbd1: [drbd1_worker/14277] sock_sendmsg time expired, ko = 4294967254
- Jun 30 17:56:57 primarynode kernel: block drbd1: [drbd1_worker/14277] sock_sendmsg time expired, ko = 4294967253
- Jun 30 17:57:03 primarynode kernel: block drbd1: [drbd1_worker/14277] sock_sendmsg time expired, ko = 4294967252
- Jun 30 17:57:09 primarynode kernel: block drbd1: [drbd1_worker/14277] sock_sendmsg time expired, ko = 4294967251
- Jun 30 17:57:15 primarynode kernel: block drbd1: [drbd1_worker/14277] sock_sendmsg time expired, ko = 4294967250
- Jun 30 17:57:21 primarynode kernel: block drbd1: [drbd1_worker/14277] sock_sendmsg time expired, ko = 4294967249
- Jun 30 17:57:27 primarynode kernel: block drbd1: [drbd1_worker/14277] sock_sendmsg time expired, ko = 4294967248
- Jun 30 17:57:33 primarynode kernel: block drbd1: [drbd1_worker/14277] sock_sendmsg time expired, ko = 4294967247
- Jun 30 17:57:39 primarynode kernel: block drbd1: [drbd1_worker/14277] sock_sendmsg time expired, ko = 4294967246
- Jun 30 21:58:02 primarynode kernel: block drbd1: [drbd1_worker/14277] sock_sendmsg time expired, ko = 4294967245
- Jun 30 21:58:08 primarynode kernel: block drbd1: [drbd1_worker/14277] sock_sendmsg time expired, ko = 4294967244
- Jun 30 21:58:14 primarynode kernel: block drbd1: [drbd1_worker/14277] sock_sendmsg time expired, ko = 4294967243
- Jun 30 21:58:20 primarynode kernel: block drbd1: [drbd1_worker/14277] sock_sendmsg time expired, ko = 4294967242
- Jun 30 21:58:26 primarynode kernel: block drbd1: [drbd1_worker/14277] sock_sendmsg time expired, ko = 4294967241
- Jun 30 21:58:32 primarynode kernel: block drbd1: [drbd1_worker/14277] sock_sendmsg time expired, ko = 4294967240
- Jun 30 21:58:38 primarynode kernel: block drbd1: [drbd1_worker/14277] sock_sendmsg time expired, ko = 4294967239
- Jun 30 21:58:44 primarynode kernel: block drbd1: [drbd1_worker/14277] sock_sendmsg time expired, ko = 4294967238
- Jun 30 21:58:50 primarynode kernel: block drbd1: [drbd1_worker/14277] sock_sendmsg time expired, ko = 4294967237
- Jun 30 21:58:56 primarynode kernel: block drbd1: [drbd1_worker/14277] sock_sendmsg time expired, ko = 4294967236
- Jun 30 21:59:02 primarynode kernel: block drbd1: [drbd1_worker/14277] sock_sendmsg time expired, ko = 4294967235
- Jun 30 21:59:08 primarynode kernel: block drbd1: [drbd1_worker/14277] sock_sendmsg time expired, ko = 4294967234
- Jun 30 21:59:14 primarynode kernel: block drbd1: [drbd1_worker/14277] sock_sendmsg time expired, ko = 4294967233
- Jun 30 21:59:20 primarynode kernel: block drbd1: [drbd1_worker/14277] sock_sendmsg time expired, ko = 4294967232
- Jun 30 21:59:26 primarynode kernel: block drbd1: [drbd1_worker/14277] sock_sendmsg time expired, ko = 4294967231
- Jun 30 21:59:32 primarynode kernel: block drbd1: [drbd1_worker/14277] sock_sendmsg time expired, ko = 4294967230
- Jun 30 21:59:38 primarynode kernel: block drbd1: [drbd1_worker/14277] sock_sendmsg time expired, ko = 4294967229
- Jun 30 21:59:44 primarynode kernel: block drbd1: [drbd1_worker/14277] sock_sendmsg time expired, ko = 4294967228
- Jun 30 21:59:50 primarynode kernel: block drbd1: [drbd1_worker/14277] sock_sendmsg time expired, ko = 4294967227
- Jun 30 21:59:56 primarynode kernel: block drbd1: [drbd1_worker/14277] sock_sendmsg time expired, ko = 4294967226
- Jun 30 22:00:02 primarynode kernel: block drbd1: [drbd1_worker/14277] sock_sendmsg time expired, ko = 4294967225
- Jun 30 22:00:08 primarynode kernel: block drbd1: [drbd1_worker/14277] sock_sendmsg time expired, ko = 4294967224
- Jun 30 22:00:14 primarynode kernel: block drbd1: [drbd1_worker/14277] sock_sendmsg time expired, ko = 4294967223
- Jun 30 22:00:20 primarynode kernel: block drbd1: [drbd1_worker/14277] sock_sendmsg time expired, ko = 4294967222
- Jun 30 22:00:26 primarynode kernel: block drbd1: [drbd1_worker/14277] sock_sendmsg time expired, ko = 4294967221
- Jun 30 22:00:32 primarynode kernel: block drbd1: [drbd1_worker/14277] sock_sendmsg time expired, ko = 4294967220
- Jun 30 22:00:38 primarynode kernel: block drbd1: [drbd1_worker/14277] sock_sendmsg time expired, ko = 4294967219
- Jun 30 22:00:44 primarynode kernel: block drbd1: [drbd1_worker/14277] sock_sendmsg time expired, ko = 4294967218
- Jun 30 22:00:50 primarynode kernel: block drbd1: [drbd1_worker/14277] sock_sendmsg time expired, ko = 4294967217
- Jun 30 22:00:56 primarynode kernel: block drbd1: [drbd1_worker/14277] sock_sendmsg time expired, ko = 4294967216
- Jun 30 22:01:02 primarynode kernel: block drbd1: [drbd1_worker/14277] sock_sendmsg time expired, ko = 4294967215
- Jun 30 22:01:08 primarynode kernel: block drbd1: [drbd1_worker/14277] sock_sendmsg time expired, ko = 4294967214
- Jun 30 22:01:14 primarynode kernel: block drbd1: [drbd1_worker/14277] sock_sendmsg time expired, ko = 4294967213
- Jun 30 22:01:20 primarynode kernel: block drbd1: [drbd1_worker/14277] sock_sendmsg time expired, ko = 4294967212
- Jun 30 22:01:26 primarynode kernel: block drbd1: [drbd1_worker/14277] sock_sendmsg time expired, ko = 4294967211
- Jun 30 22:01:32 primarynode kernel: block drbd1: [drbd1_worker/14277] sock_sendmsg time expired, ko = 4294967210
- Jun 30 22:01:38 primarynode kernel: block drbd1: [drbd1_worker/14277] sock_sendmsg time expired, ko = 4294967209
- Jun 30 22:01:44 primarynode kernel: block drbd1: [drbd1_worker/14277] sock_sendmsg time expired, ko = 4294967208
- Jun 30 22:01:50 primarynode kernel: block drbd1: [drbd1_worker/14277] sock_sendmsg time expired, ko = 4294967207
- Jun 30 22:01:56 primarynode kernel: block drbd1: [drbd1_worker/14277] sock_sendmsg time expired, ko = 4294967206
- Jun 30 22:02:02 primarynode kernel: block drbd1: [drbd1_worker/14277] sock_sendmsg time expired, ko = 4294967205
- Jun 30 22:02:08 primarynode kernel: block drbd1: [drbd1_worker/14277] sock_sendmsg time expired, ko = 4294967204
- Jun 30 22:02:14 primarynode kernel: block drbd1: [drbd1_worker/14277] sock_sendmsg time expired, ko = 4294967203
- Jun 30 22:02:20 primarynode kernel: block drbd1: [drbd1_worker/14277] sock_sendmsg time expired, ko = 4294967202
- Jun 30 22:02:26 primarynode kernel: block drbd1: [drbd1_worker/14277] sock_sendmsg time expired, ko = 4294967201
- Jun 30 22:02:32 primarynode kernel: block drbd1: [drbd1_worker/14277] sock_sendmsg time expired, ko = 4294967200
- Jun 30 22:02:38 primarynode kernel: block drbd1: [drbd1_worker/14277] sock_sendmsg time expired, ko = 4294967199
- Jun 30 22:02:44 primarynode kernel: block drbd1: [drbd1_worker/14277] sock_sendmsg time expired, ko = 4294967198
- Jun 30 22:02:50 primarynode kernel: block drbd1: [drbd1_worker/14277] sock_sendmsg time expired, ko = 4294967197
- Jun 30 22:02:56 primarynode kernel: block drbd1: [drbd1_worker/14277] sock_sendmsg time expired, ko = 4294967196
- Jun 30 22:03:02 primarynode kernel: block drbd1: [drbd1_worker/14277] sock_sendmsg time expired, ko = 4294967195
- Jun 30 22:03:08 primarynode kernel: block drbd1: [drbd1_worker/14277] sock_sendmsg time expired, ko = 4294967194
- Jun 30 22:03:14 primarynode kernel: block drbd1: [drbd1_worker/14277] sock_sendmsg time expired, ko = 4294967193
- Jun 30 22:03:20 primarynode kernel: block drbd1: [drbd1_worker/14277] sock_sendmsg time expired, ko = 4294967192
- Jun 30 22:03:26 primarynode kernel: block drbd1: [drbd1_worker/14277] sock_sendmsg time expired, ko = 4294967191
- Jun 30 22:03:32 primarynode kernel: block drbd1: [drbd1_worker/14277] sock_sendmsg time expired, ko = 4294967190
- Jun 30 22:03:38 primarynode kernel: block drbd1: [drbd1_worker/14277] sock_sendmsg time expired, ko = 4294967189
- Jun 30 22:03:44 primarynode kernel: block drbd1: [drbd1_worker/14277] sock_sendmsg time expired, ko = 4294967188
- Jun 30 22:03:50 primarynode kernel: block drbd1: [drbd1_worker/14277] sock_sendmsg time expired, ko = 4294967187
- Jun 30 22:03:56 primarynode kernel: block drbd1: [drbd1_worker/14277] sock_sendmsg time expired, ko = 4294967186
- Jun 30 22:04:02 primarynode kernel: block drbd1: [drbd1_worker/14277] sock_sendmsg time expired, ko = 4294967185
- Jun 30 22:04:08 primarynode kernel: block drbd1: [drbd1_worker/14277] sock_sendmsg time expired, ko = 4294967184
- Jun 30 22:04:14 primarynode kernel: block drbd1: [drbd1_worker/14277] sock_sendmsg time expired, ko = 4294967183
- Jun 30 22:04:20 primarynode kernel: block drbd1: [drbd1_worker/14277] sock_sendmsg time expired, ko = 4294967182
- Jun 30 22:04:26 primarynode kernel: block drbd1: [drbd1_worker/14277] sock_sendmsg time expired, ko = 4294967181
- Jun 30 22:04:32 primarynode kernel: block drbd1: [drbd1_worker/14277] sock_sendmsg time expired, ko = 4294967180
- Jun 30 22:04:38 primarynode kernel: block drbd1: [drbd1_worker/14277] sock_sendmsg time expired, ko = 4294967179
- Jun 30 22:04:44 primarynode kernel: block drbd1: [drbd1_worker/14277] sock_sendmsg time expired, ko = 4294967178
- Jun 30 22:04:50 primarynode kernel: block drbd1: [drbd1_worker/14277] sock_sendmsg time expired, ko = 4294967177
- Jun 30 22:04:56 primarynode kernel: block drbd1: [drbd1_worker/14277] sock_sendmsg time expired, ko = 4294967176
- Jun 30 22:05:02 primarynode kernel: block drbd1: [drbd1_worker/14277] sock_sendmsg time expired, ko = 4294967175
- Jun 30 22:05:08 primarynode kernel: block drbd1: [drbd1_worker/14277] sock_sendmsg time expired, ko = 4294967174
- Jun 30 22:05:14 primarynode kernel: block drbd1: [drbd1_worker/14277] sock_sendmsg time expired, ko = 4294967173
- Jun 30 22:05:20 primarynode kernel: block drbd1: [drbd1_worker/14277] sock_sendmsg time expired, ko = 4294967172
- Jun 30 22:05:26 primarynode kernel: block drbd1: [drbd1_worker/14277] sock_sendmsg time expired, ko = 4294967171
- Jun 30 22:05:32 primarynode kernel: block drbd1: [drbd1_worker/14277] sock_sendmsg time expired, ko = 4294967170
- Jun 30 22:05:38 primarynode kernel: block drbd1: [drbd1_worker/14277] sock_sendmsg time expired, ko = 4294967169
- Jun 30 22:05:44 primarynode kernel: block drbd1: [drbd1_worker/14277] sock_sendmsg time expired, ko = 4294967168
- Jun 30 22:05:50 primarynode kernel: block drbd1: [drbd1_worker/14277] sock_sendmsg time expired, ko = 4294967167
- Jun 30 22:05:56 primarynode kernel: block drbd1: [drbd1_worker/14277] sock_sendmsg time expired, ko = 4294967166
- Jun 30 22:06:02 primarynode kernel: block drbd1: [drbd1_worker/14277] sock_sendmsg time expired, ko = 4294967165
- Jun 30 22:06:08 primarynode kernel: block drbd1: [drbd1_worker/14277] sock_sendmsg time expired, ko = 4294967164
- Jun 30 22:06:14 primarynode kernel: block drbd1: [drbd1_worker/14277] sock_sendmsg time expired, ko = 4294967163
- Jun 30 22:06:20 primarynode kernel: block drbd1: [drbd1_worker/14277] sock_sendmsg time expired, ko = 4294967162
- Jun 30 22:06:26 primarynode kernel: block drbd1: [drbd1_worker/14277] sock_sendmsg time expired, ko = 4294967161
- Jun 30 22:06:32 primarynode kernel: block drbd1: [drbd1_worker/14277] sock_sendmsg time expired, ko = 4294967160
- Jun 30 22:06:38 primarynode kernel: block drbd1: [drbd1_worker/14277] sock_sendmsg time expired, ko = 4294967159
- Jun 30 22:06:44 primarynode kernel: block drbd1: [drbd1_worker/14277] sock_sendmsg time expired, ko = 4294967158
- Jun 30 22:06:50 primarynode kernel: block drbd1: [drbd1_worker/14277] sock_sendmsg time expired, ko = 4294967157
- Jun 30 22:06:56 primarynode kernel: block drbd1: [drbd1_worker/14277] sock_sendmsg time expired, ko = 4294967156
- Jun 30 22:07:02 primarynode kernel: block drbd1: [drbd1_worker/14277] sock_sendmsg time expired, ko = 4294967155
- Jun 30 22:07:08 primarynode kernel: block drbd1: [drbd1_worker/14277] sock_sendmsg time expired, ko = 4294967154
- Jun 30 22:07:14 primarynode kernel: block drbd1: [drbd1_worker/14277] sock_sendmsg time expired, ko = 4294967153
- Jun 30 22:07:20 primarynode kernel: block drbd1: [drbd1_worker/14277] sock_sendmsg time expired, ko = 4294967152
- Jun 30 22:07:26 primarynode kernel: block drbd1: [drbd1_worker/14277] sock_sendmsg time expired, ko = 4294967151
- Jun 30 22:07:32 primarynode kernel: block drbd1: [drbd1_worker/14277] sock_sendmsg time expired, ko = 4294967150
- Jun 30 22:07:38 primarynode kernel: block drbd1: [drbd1_worker/14277] sock_sendmsg time expired, ko = 4294967149
- Jun 30 22:07:44 primarynode kernel: block drbd1: [drbd1_worker/14277] sock_sendmsg time expired, ko = 4294967148
- Jun 30 22:07:50 primarynode kernel: block drbd1: [drbd1_worker/14277] sock_sendmsg time expired, ko = 4294967147
- Jun 30 22:07:56 primarynode kernel: block drbd1: [drbd1_worker/14277] sock_sendmsg time expired, ko = 4294967146
- Jun 30 22:08:02 primarynode kernel: block drbd1: [drbd1_worker/14277] sock_sendmsg time expired, ko = 4294967145
- Jun 30 22:08:08 primarynode kernel: block drbd1: [drbd1_worker/14277] sock_sendmsg time expired, ko = 4294967144
- Jun 30 22:08:14 primarynode kernel: block drbd1: [drbd1_worker/14277] sock_sendmsg time expired, ko = 4294967143
- Jun 30 22:08:20 primarynode kernel: block drbd1: [drbd1_worker/14277] sock_sendmsg time expired, ko = 4294967142
- Jun 30 22:08:26 primarynode kernel: block drbd1: [drbd1_worker/14277] sock_sendmsg time expired, ko = 4294967141
- Jun 30 22:08:27 primarynode kernel: block drbd1: sock_sendmsg returned -110
- Jun 30 22:08:27 primarynode kernel: block drbd0: sock_recvmsg returned -110
- Jun 30 22:08:27 primarynode kernel: block drbd0: peer( Secondary -> Unknown ) conn( WFBitMapS -> BrokenPipe ) pdsk( UpToDate -> DUnknown )
- Jun 30 22:08:27 primarynode kernel: block drbd1: peer( Secondary -> Unknown ) conn( WFBitMapS -> BrokenPipe ) pdsk( UpToDate -> DUnknown )
- Jun 30 22:08:27 primarynode kernel: block drbd0: short read expecting header on sock: r=-110
- Jun 30 22:08:27 primarynode kernel: block drbd1: short sent ReportBitMap size=4096 sent=2256
- Jun 30 22:08:27 primarynode kernel: block drbd1: sock was shut down by peer
- Jun 30 22:08:27 primarynode kernel: block drbd1: short read expecting header on sock: r=0
- Jun 30 22:08:27 primarynode kernel: block drbd1: asender terminated
- Jun 30 22:08:27 primarynode kernel: block drbd1: Terminating drbd1_asender
- Jun 30 22:08:27 primarynode kernel: block drbd1: Connection closed
- Jun 30 22:08:27 primarynode kernel: block drbd1: conn( BrokenPipe -> Unconnected )
- Jun 30 22:08:27 primarynode kernel: block drbd1: receiver terminated
- Jun 30 22:08:27 primarynode kernel: block drbd1: Restarting drbd1_receiver
- Jun 30 22:08:27 primarynode kernel: block drbd1: receiver (re)started
- Jun 30 22:08:27 primarynode kernel: block drbd1: conn( Unconnected -> WFConnection )
- Jun 30 22:08:27 primarynode kernel: block drbd0: asender terminated
- Jun 30 22:08:27 primarynode kernel: block drbd0: Terminating drbd0_asender
- Jun 30 22:08:27 primarynode kernel: block drbd0: Connection closed
- Jun 30 22:08:27 primarynode kernel: block drbd0: conn( BrokenPipe -> Unconnected )
- Jun 30 22:08:27 primarynode kernel: block drbd0: receiver terminated
- Jun 30 22:08:27 primarynode kernel: block drbd0: Restarting drbd0_receiver
- Jun 30 22:08:27 primarynode kernel: block drbd0: receiver (re)started
- Jun 30 22:08:27 primarynode kernel: block drbd0: conn( Unconnected -> WFConnection )
- Jun 30 22:08:27 primarynode kernel: block drbd0: Handshake successful: Agreed network protocol version 95
- Jun 30 22:08:27 primarynode kernel: block drbd0: conn( WFConnection -> WFReportParams )
- Jun 30 22:08:27 primarynode kernel: block drbd0: Starting asender thread (from drbd0_receiver [14311])
- Jun 30 22:08:27 primarynode kernel: block drbd1: Handshake successful: Agreed network protocol version 95
- Jun 30 22:08:27 primarynode kernel: block drbd1: conn( WFConnection -> WFReportParams )
- Jun 30 22:08:27 primarynode kernel: block drbd1: Starting asender thread (from drbd1_receiver [14322])
- Jun 30 22:08:27 primarynode kernel: block drbd1: data-integrity-alg: <not-used>
- Jun 30 22:08:27 primarynode kernel: block drbd0: data-integrity-alg: <not-used>
- Jun 30 22:08:27 primarynode kernel: block drbd1: max_segment_size ( = BIO size ) = 65536
- Jun 30 22:08:27 primarynode kernel: block drbd1: drbd_sync_handshake:
- Jun 30 22:08:27 primarynode kernel: block drbd1: self BDB69235F03FB11B:FC433F9C35D4E19D:EFD171D1BE6D85C5:305E2BE3E64F5FA9 bits:28134237 flags:0
- Jun 30 22:08:27 primarynode kernel: block drbd1: peer FC433F9C35D4E19C:0000000000000000:EFD171D1BE6D85C4:305E2BE3E64F5FA9 bits:0 flags:0
- Jun 30 22:08:27 primarynode kernel: block drbd1: uuid_compare()=1 by rule 70
- Jun 30 22:08:27 primarynode kernel: block drbd1: peer( Unknown -> Secondary ) conn( WFReportParams -> WFBitMapS ) pdsk( DUnknown -> UpToDate )
- Jun 30 22:08:27 primarynode kernel: block drbd0: max_segment_size ( = BIO size ) = 65536
- Jun 30 22:08:27 primarynode kernel: block drbd0: drbd_sync_handshake:
- Jun 30 22:08:27 primarynode kernel: block drbd0: self 378FBD5A1F9BAB2D:C3C0A265E9277309:173E3BD3C3CA5CD4:BBF4F1769486E305 bits:0 flags:0
- Jun 30 22:08:27 primarynode kernel: block drbd0: peer C3C0A265E9277308:0000000000000000:173E3BD3C3CA5CD4:BBF4F1769486E305 bits:0 flags:0
- Jun 30 22:08:27 primarynode kernel: block drbd0: uuid_compare()=1 by rule 70
- Jun 30 22:08:27 primarynode kernel: block drbd0: peer( Unknown -> Secondary ) conn( WFReportParams -> WFBitMapS ) pdsk( DUnknown -> UpToDate )
- Jun 30 22:08:39 primarynode kernel: block drbd1: [drbd1_worker/14277] sock_sendmsg time expired, ko = 4294967295
- Jun 30 22:08:45 primarynode kernel: block drbd1: [drbd1_worker/14277] sock_sendmsg time expired, ko = 4294967294
- Jun 30 22:08:51 primarynode kernel: block drbd1: [drbd1_worker/14277] sock_sendmsg time expired, ko = 4294967293
- Jun 30 22:08:57 primarynode kernel: block drbd1: [drbd1_worker/14277] sock_sendmsg time expired, ko = 4294967292
- Jun 30 22:09:03 primarynode kernel: block drbd1: [drbd1_worker/14277] sock_sendmsg time expired, ko = 4294967291
- Jun 30 22:09:09 primarynode kernel: block drbd1: [drbd1_worker/14277] sock_sendmsg time expired, ko = 4294967290
- Jun 30 22:09:15 primarynode kernel: block drbd1: [drbd1_worker/14277] sock_sendmsg time expired, ko = 4294967289
- Jun 30 22:09:21 primarynode kernel: block drbd1: [drbd1_worker/14277] sock_sendmsg time expired, ko = 4294967288
- Jun 30 22:09:27 primarynode kernel: block drbd1: [drbd1_worker/14277] sock_sendmsg time expired, ko = 4294967287
- Jun 30 22:09:33 primarynode kernel: block drbd1: [drbd1_worker/14277] sock_sendmsg time expired, ko = 4294967286
- Jun 30 22:09:39 primarynode kernel: block drbd1: [drbd1_worker/14277] sock_sendmsg time expired, ko = 4294967285
- Jun 30 22:09:45 primarynode kernel: block drbd1: [drbd1_worker/14277] sock_sendmsg time expired, ko = 4294967284
- Jun 30 22:09:51 primarynode kernel: block drbd1: [drbd1_worker/14277] sock_sendmsg time expired, ko = 4294967283
- Jun 30 22:09:57 primarynode kernel: block drbd1: [drbd1_worker/14277] sock_sendmsg time expired, ko = 4294967282
- Jun 30 22:10:03 primarynode kernel: block drbd1: [drbd1_worker/14277] sock_sendmsg time expired, ko = 4294967281
- Jun 30 22:10:09 primarynode kernel: block drbd1: [drbd1_worker/14277] sock_sendmsg time expired, ko = 4294967280
- Jun 30 22:10:15 primarynode kernel: block drbd1: [drbd1_worker/14277] sock_sendmsg time expired, ko = 4294967279
- Jun 30 22:10:21 primarynode kernel: block drbd1: [drbd1_worker/14277] sock_sendmsg time expired, ko = 4294967278
- Jun 30 22:10:27 primarynode kernel: block drbd1: [drbd1_worker/14277] sock_sendmsg time expired, ko = 4294967277
- Jun 30 22:10:33 primarynode kernel: block drbd1: [drbd1_worker/14277] sock_sendmsg time expired, ko = 4294967276
- Jun 30 22:10:39 primarynode kernel: block drbd1: [drbd1_worker/14277] sock_sendmsg time expired, ko = 4294967275
- Jun 30 22:10:45 primarynode kernel: block drbd1: [drbd1_worker/14277] sock_sendmsg time expired, ko = 4294967274
- Jun 30 22:10:51 primarynode kernel: block drbd1: [drbd1_worker/14277] sock_sendmsg time expired, ko = 4294967273
- Jun 30 22:10:57 primarynode kernel: block drbd1: [drbd1_worker/14277] sock_sendmsg time expired, ko = 4294967272
- Jun 30 22:11:03 primarynode kernel: block drbd1: [drbd1_worker/14277] sock_sendmsg time expired, ko = 4294967271
- Jun 30 22:11:09 primarynode kernel: block drbd1: [drbd1_worker/14277] sock_sendmsg time expired, ko = 4294967270
- Jun 30 22:11:15 primarynode kernel: block drbd1: [drbd1_worker/14277] sock_sendmsg time expired, ko = 4294967269
- Jun 30 22:11:21 primarynode kernel: block drbd1: [drbd1_worker/14277] sock_sendmsg time expired, ko = 4294967268
- Jun 30 22:11:27 primarynode kernel: block drbd1: [drbd1_worker/14277] sock_sendmsg time expired, ko = 4294967267
- Jun 30 22:11:33 primarynode kernel: block drbd1: [drbd1_worker/14277] sock_sendmsg time expired, ko = 4294967266
- Jun 30 22:11:39 primarynode kernel: block drbd1: [drbd1_worker/14277] sock_sendmsg time expired, ko = 4294967265
- Jun 30 22:11:45 primarynode kernel: block drbd1: [drbd1_worker/14277] sock_sendmsg time expired, ko = 4294967264
- Jun 30 22:11:51 primarynode kernel: block drbd1: [drbd1_worker/14277] sock_sendmsg time expired, ko = 4294967263
- Jun 30 22:11:57 primarynode kernel: block drbd1: [drbd1_worker/14277] sock_sendmsg time expired, ko = 4294967262
- Jun 30 22:12:03 primarynode kernel: block drbd1: [drbd1_worker/14277] sock_sendmsg time expired, ko = 4294967261
- Jun 30 22:12:09 primarynode kernel: block drbd1: [drbd1_worker/14277] sock_sendmsg time expired, ko = 4294967260
- Jun 30 22:12:15 primarynode kernel: block drbd1: [drbd1_worker/14277] sock_sendmsg time expired, ko = 4294967259
- Jun 30 22:12:21 primarynode kernel: block drbd1: [drbd1_worker/14277] sock_sendmsg time expired, ko = 4294967258
- Jun 30 22:12:27 primarynode kernel: block drbd1: [drbd1_worker/14277] sock_sendmsg time expired, ko = 4294967257
- Jun 30 22:12:33 primarynode kernel: block drbd1: [drbd1_worker/14277] sock_sendmsg time expired, ko = 4294967256
- Jun 30 22:12:39 primarynode kernel: block drbd1: [drbd1_worker/14277] sock_sendmsg time expired, ko = 4294967255
- Jun 30 22:12:45 primarynode kernel: block drbd1: [drbd1_worker/14277] sock_sendmsg time expired, ko = 4294967254
- Jun 30 22:12:51 primarynode kernel: block drbd1: [drbd1_worker/14277] sock_sendmsg time expired, ko = 4294967253
- Jun 30 22:12:57 primarynode kernel: block drbd1: [drbd1_worker/14277] sock_sendmsg time expired, ko = 4294967252
- Jun 30 22:13:03 primarynode kernel: block drbd1: [drbd1_worker/14277] sock_sendmsg time expired, ko = 4294967251
- Jun 30 22:13:09 primarynode kernel: block drbd1: [drbd1_worker/14277] sock_sendmsg time expired, ko = 4294967250
- Jun 30 22:13:15 primarynode kernel: block drbd1: [drbd1_worker/14277] sock_sendmsg time expired, ko = 4294967249
- Jun 30 22:13:21 primarynode kernel: block drbd1: [drbd1_worker/14277] sock_sendmsg time expired, ko = 4294967248
- Jun 30 22:13:27 primarynode kernel: block drbd1: [drbd1_worker/14277] sock_sendmsg time expired, ko = 4294967247
- Jun 30 22:13:33 primarynode kernel: block drbd1: [drbd1_worker/14277] sock_sendmsg time expired, ko = 4294967246
- Jun 30 22:13:39 primarynode kernel: block drbd1: [drbd1_worker/14277] sock_sendmsg time expired, ko = 4294967245
- Jun 30 22:13:45 primarynode kernel: block drbd1: [drbd1_worker/14277] sock_sendmsg time expired, ko = 4294967244
- Jun 30 22:13:51 primarynode kernel: block drbd1: [drbd1_worker/14277] sock_sendmsg time expired, ko = 4294967243
- Jun 30 22:13:57 primarynode kernel: block drbd1: [drbd1_worker/14277] sock_sendmsg time expired, ko = 4294967242
- Jun 30 22:14:03 primarynode kernel: block drbd1: [drbd1_worker/14277] sock_sendmsg time expired, ko = 4294967241
- Jun 30 22:14:09 primarynode kernel: block drbd1: [drbd1_worker/14277] sock_sendmsg time expired, ko = 4294967240
- Jun 30 22:14:15 primarynode kernel: block drbd1: [drbd1_worker/14277] sock_sendmsg time expired, ko = 4294967239
- Jun 30 22:14:21 primarynode kernel: block drbd1: [drbd1_worker/14277] sock_sendmsg time expired, ko = 4294967238
- Jun 30 22:14:27 primarynode kernel: block drbd1: [drbd1_worker/14277] sock_sendmsg time expired, ko = 4294967237
- Jun 30 22:14:33 primarynode kernel: block drbd1: [drbd1_worker/14277] sock_sendmsg time expired, ko = 4294967236
- Jun 30 22:14:39 primarynode kernel: block drbd1: [drbd1_worker/14277] sock_sendmsg time expired, ko = 4294967235
- Jun 30 22:14:45 primarynode kernel: block drbd1: [drbd1_worker/14277] sock_sendmsg time expired, ko = 4294967234
- Jun 30 22:14:51 primarynode kernel: block drbd1: [drbd1_worker/14277] sock_sendmsg time expired, ko = 4294967233
- Jun 30 22:14:57 primarynode kernel: block drbd1: [drbd1_worker/14277] sock_sendmsg time expired, ko = 4294967232
- Jun 30 22:15:03 primarynode kernel: block drbd1: [drbd1_worker/14277] sock_sendmsg time expired, ko = 4294967231
- Jun 30 22:15:09 primarynode kernel: block drbd1: [drbd1_worker/14277] sock_sendmsg time expired, ko = 4294967230
- Jun 30 22:15:15 primarynode kernel: block drbd1: [drbd1_worker/14277] sock_sendmsg time expired, ko = 4294967229
- # /etc/init.d/drbd status
- DRBD module version: 8.3.9
- userland version: 8.3.8
- you should upgrade your drbd tools!
- * drbd driver loaded OK; device status: ... [ ok ]
- version: 8.3.9 (api:88/proto:86-95)
- built-in
- 0: cs:WFBitMapS ro:Primary/Secondary ds:UpToDate/UpToDate C r-----
- ns:0 nr:0 dw:0 dr:700 al:0 bm:0 lo:0 pe:0 ua:0 ap:0 ep:1 wo:f oos:0
- 1: cs:WFBitMapS ro:Primary/Secondary ds:UpToDate/UpToDate C r-----
- ns:279805224 nr:0 dw:508694668 dr:1621564 al:61640 bm:28031 lo:0 pe:0 ua:0 ap:0 ep:1 wo:f oos:112536948
- # drbd-overview
- ## It never completed - just hung so I CTRL-C'd it ##
- # ps aux | grep drbd | grep -v grep
- root 4975 0.0 0.0 0 0 ? S 22:08 0:00 [drbd0_asender]
- root 4976 0.0 0.0 0 0 ? S 22:08 0:00 [drbd1_asender]
- root 14266 0.0 0.0 0 0 ? S Jun29 0:00 [drbd0_worker]
- root 14277 0.2 0.0 0 0 ? S Jun29 4:10 [drbd1_worker]
- root 14311 0.0 0.0 0 0 ? S Jun29 0:00 [drbd0_receiver]
- root 14322 0.9 0.0 0 0 ? S Jun29 13:54 [drbd1_receiver]
- # /etc/init.d/drbd stop
- * Caching service dependencies ...
- DRBD module version: 8.3.9
- userland version: 8.3.8
- you should upgrade your drbd tools! [ ok ]
- DRBD module version: 8.3.9
- userland version: 8.3.8
- you should upgrade your drbd tools!
- * Stopping all DRBD resources ...
- DRBD module version: 8.3.9
- userland version: 8.3.8
- you should upgrade your drbd tools!
- 0: State change failed: (-12) Device is held open by someone
- Command '/sbin/drbdsetup 0 down' terminated with exit code 11
- 1: State change failed: (-12) Device is held open by someone
- Command '/sbin/drbdsetup 1 down' terminated with exit code 11
- ## The service has stopped, so I stopped it on the secondary as well then started the primary before the secondary##
- # /var/log/messages on primary after restart
- Jun 30 22:24:34 itfof01 kernel: block drbd1: Handshake successful: Agreed network protocol version 95
- Jun 30 22:24:34 itfof01 kernel: block drbd1: conn( WFConnection -> WFReportParams )
- Jun 30 22:24:34 itfof01 kernel: block drbd1: Starting asender thread (from drbd1_receiver [14322])
- Jun 30 22:24:34 itfof01 kernel: block drbd1: data-integrity-alg: <not-used>
- Jun 30 22:24:34 itfof01 kernel: block drbd1: max_segment_size ( = BIO size ) = 65536
- Jun 30 22:24:34 itfof01 kernel: block drbd1: drbd_sync_handshake:
- Jun 30 22:24:34 itfof01 kernel: block drbd1: self BDB69235F03FB11B:FC433F9C35D4E19D:EFD171D1BE6D85C5:305E2BE3E64F5FA9 bits:28134237 flags:0
- Jun 30 22:24:34 itfof01 kernel: block drbd1: peer FC433F9C35D4E19C:0000000000000000:EFD171D1BE6D85C4:305E2BE3E64F5FA9 bits:0 flags:0
- Jun 30 22:24:34 itfof01 kernel: block drbd1: uuid_compare()=1 by rule 70
- Jun 30 22:24:34 itfof01 kernel: block drbd1: peer( Unknown -> Secondary ) conn( WFReportParams -> WFBitMapS ) pdsk( DUnknown -> UpToDate )
- Jun 30 22:24:34 itfof01 kernel: block drbd0: Handshake successful: Agreed network protocol version 95
- Jun 30 22:24:34 itfof01 kernel: block drbd0: conn( WFConnection -> WFReportParams )
- Jun 30 22:24:34 itfof01 kernel: block drbd0: Starting asender thread (from drbd0_receiver [14311])
- Jun 30 22:24:34 itfof01 kernel: block drbd0: data-integrity-alg: <not-used>
- Jun 30 22:24:34 itfof01 kernel: block drbd0: max_segment_size ( = BIO size ) = 65536
- Jun 30 22:24:34 itfof01 kernel: block drbd0: drbd_sync_handshake:
- Jun 30 22:24:34 itfof01 kernel: block drbd0: self 378FBD5A1F9BAB2D:C3C0A265E9277309:173E3BD3C3CA5CD4:BBF4F1769486E305 bits:0 flags:0
- Jun 30 22:24:34 itfof01 kernel: block drbd0: peer C3C0A265E9277308:0000000000000000:173E3BD3C3CA5CD4:BBF4F1769486E305 bits:0 flags:0
- Jun 30 22:24:34 itfof01 kernel: block drbd0: uuid_compare()=1 by rule 70
- Jun 30 22:24:34 itfof01 kernel: block drbd0: peer( Unknown -> Secondary ) conn( WFReportParams -> WFBitMapS ) pdsk( DUnknown -> UpToDate )
- Jun 30 22:24:46 itfof01 kernel: block drbd1: [drbd1_worker/14277] sock_sendmsg time expired, ko = 4294967295
- Jun 30 22:24:52 itfof01 kernel: block drbd1: [drbd1_worker/14277] sock_sendmsg time expired, ko = 4294967294
- Jun 30 22:24:58 itfof01 kernel: block drbd1: [drbd1_worker/14277] sock_sendmsg time expired, ko = 4294967293
- Jun 30 22:25:04 itfof01 kernel: block drbd1: [drbd1_worker/14277] sock_sendmsg time expired, ko = 4294967292
- Jun 30 22:25:10 itfof01 kernel: block drbd1: [drbd1_worker/14277] sock_sendmsg time expired, ko = 4294967291
- Jun 30 22:25:16 itfof01 kernel: block drbd1: [drbd1_worker/14277] sock_sendmsg time expired, ko = 4294967290
- # ETC
- # dmesg output (from primary) after stopping & starting the services on both nodes
- block drbd0: peer( Secondary -> Unknown ) conn( WFBitMapS -> TearDown ) pdsk( UpToDate -> DUnknown )
- block drbd0: meta connection shut down by peer.
- block drbd0: asender terminated
- block drbd0: Terminating drbd0_asender
- block drbd0: Connection closed
- block drbd0: conn( TearDown -> Unconnected )
- block drbd0: receiver terminated
- block drbd0: Restarting drbd0_receiver
- block drbd0: receiver (re)started
- block drbd0: conn( Unconnected -> WFConnection )
- block drbd1: peer( Secondary -> Unknown ) conn( WFBitMapS -> TearDown ) pdsk( UpToDate -> DUnknown )
- block drbd1: meta connection shut down by peer.
- block drbd1: asender terminated
- block drbd1: Terminating drbd1_asender
- block drbd1: short sent ReportBitMap size=4096 sent=2380
- block drbd1: Connection closed
- block drbd1: conn( TearDown -> Unconnected )
- block drbd1: receiver terminated
- block drbd1: Restarting drbd1_receiver
- block drbd1: receiver (re)started
- block drbd1: conn( Unconnected -> WFConnection )
- block drbd0: role( Primary -> Secondary )
- block drbd0: conn( WFConnection -> Disconnecting )
- block drbd0: Discarding network configuration.
- block drbd0: Connection closed
- block drbd0: conn( Disconnecting -> StandAlone )
- block drbd0: receiver terminated
- block drbd0: Terminating drbd0_receiver
- block drbd0: disk( UpToDate -> Diskless )
- block drbd0: Sending state for being diskless failed
- block drbd0: drbd_bm_resize called with capacity == 0
- block drbd0: worker terminated
- block drbd0: Terminating drbd0_worker
- block drbd1: State change failed: Device is held open by someone
- block drbd1: state = { cs:WFConnection ro:Primary/Unknown ds:UpToDate/DUnknown r--- }
- block drbd1: wanted = { cs:WFConnection ro:Secondary/Unknown ds:UpToDate/DUnknown r--- }
- block drbd0: Starting worker thread (from kworker/u:0 [5])
- block drbd0: disk( Diskless -> Attaching )
- block drbd0: No usable activity log found.
- block drbd0: Method to ensure write ordering: flush
- block drbd0: max_segment_size ( = BIO size ) = 65536
- block drbd0: drbd_bm_resize called with capacity == 1023896
- block drbd0: resync bitmap: bits=127987 words=2000
- block drbd0: size = 500 MB (511948 KB)
- block drbd0: recounting of set bits took additional 0 jiffies
- block drbd0: 0 KB (0 bits) marked out-of-sync by on disk bit-map.
- block drbd0: disk( Attaching -> UpToDate )
- block drbd0: conn( StandAlone -> Unconnected )
- block drbd0: Starting receiver thread (from drbd0_worker [9168])
- block drbd0: receiver (re)started
- block drbd0: conn( Unconnected -> WFConnection )
- block drbd0: Handshake successful: Agreed network protocol version 95
- block drbd0: conn( WFConnection -> WFReportParams )
- block drbd0: Starting asender thread (from drbd0_receiver [9180])
- block drbd0: data-integrity-alg: <not-used>
- block drbd0: max_segment_size ( = BIO size ) = 65536
- block drbd0: drbd_sync_handshake:
- block drbd0: self 378FBD5A1F9BAB2C:C3C0A265E9277309:173E3BD3C3CA5CD4:BBF4F1769486E305 bits:0 flags:0
- block drbd0: peer C3C0A265E9277308:0000000000000000:173E3BD3C3CA5CD4:BBF4F1769486E305 bits:0 flags:0
- block drbd0: uuid_compare()=1 by rule 70
- block drbd0: peer( Unknown -> Secondary ) conn( WFReportParams -> WFBitMapS ) pdsk( DUnknown -> UpToDate )
- block drbd1: Handshake successful: Agreed network protocol version 95
- block drbd1: conn( WFConnection -> WFReportParams )
- block drbd1: Starting asender thread (from drbd1_receiver [8012])
- block drbd1: data-integrity-alg: <not-used>
- block drbd1: max_segment_size ( = BIO size ) = 65536
- block drbd1: drbd_sync_handshake:
- block drbd1: self BDB69235F03FB11B:FC433F9C35D4E19D:EFD171D1BE6D85C5:305E2BE3E64F5FA9 bits:28134237 flags:0
- block drbd1: peer FC433F9C35D4E19C:0000000000000000:EFD171D1BE6D85C4:305E2BE3E64F5FA9 bits:0 flags:0
- block drbd1: uuid_compare()=1 by rule 70
- block drbd1: peer( Unknown -> Secondary ) conn( WFReportParams -> WFBitMapS ) pdsk( DUnknown -> UpToDate )
- #cat /etc/drbd.conf
- global {
- usage-count no;
- }
- common {
- # transfer protocol to use.
- # C: write IO is reported as completed, if we know it has
- # reached _both_ local and remote DISK.
- # * for critical transactional data.
- # B: write IO is reported as completed, if it has reached
- # local DISK and remote buffer cache.
- # * for most cases.
- # A: write IO is reported as completed, if it has reached
- # local DISK and local tcp send buffer. (see also sndbuf-size)
- # * for high latency networks
- #
- protocol C;
- handlers {
- # what should be done in case the cluster starts up in
- # degraded mode, but knows it has inconsistent data.
- #pri-on-incon-degr "echo '!DRBD! pri on incon-degr' | wall ; sleep 60 ; halt -f";
- pri-on-incon-degr "echo 'DRBD: primary requested but inconsistent!' | wall; /etc/init.d/heartbeat stop"; #"halt -f";
- pri-lost-after-sb "echo 'DRBD: primary requested but lost!' | wall; /etc/init.d/heartbeat stop"; #"halt -f";
- #pri-on-incon-degr "echo o > /proc/sysrq-trigger";
- #pri-lost-after-sb "echo o > /proc/sysrq-trigger";
- #local-io-error "echo o > /proc/sysrq-trigger";
- }
- startup {
- #The init script drbd(8) blocks the boot process until the DRBD resources are connected. When the cluster manager
- #starts later, it does not see a resource with internal split-brain. In case you want to limit the wait time, do it
- #here. Default is 0, which means unlimited. The unit is seconds.
- wfc-timeout 0; # 2 minutes
- # Wait for connection timeout if this node was a degraded cluster.
- # In case a degraded cluster (= cluster with only one node left)
- # is rebooted, this timeout value is used.
- #
- degr-wfc-timeout 120; # 2 minutes.
- }
- syncer {
- rate 13M;
- # This is now expressed with "after res-name"
- #group 1;
- al-extents 257;
- }
- net {
- # TODO: Should these timeouts be relative to some heartbeat settings?
- # timeout 60; # 6 seconds (unit = 0.1 seconds)
- # connect-int 10; # 10 seconds (unit = 1 second)
- # ping-int 10; # 10 seconds (unit = 1 second)
- # if the connection to the peer is lost you have the choice of
- # "reconnect" -> Try to reconnect (AKA WFConnection state)
- # "stand_alone" -> Do not reconnect (AKA StandAlone state)
- # "freeze_io" -> Try to reconnect but freeze all IO until
- # the connection is established again.
- # FIXME This appears to be obsoleate
- #on-disconnect reconnect;
- # FIXME Experemental Crap
- #cram-hmac-alg "sha256";
- #shared-secret "secretPassword555";
- #after-sb-0pri discard-younger-primary;
- #after-sb-1pri consensus;
- #after-sb-2pri disconnect;
- #rr-conflict disconnect;
- }
- disk {
- # if the lower level device reports io-error you have the choice of
- # "pass_on" -> Report the io-error to the upper layers.
- # Primary -> report it to the mounted file system.
- # Secondary -> ignore it.
- # "panic" -> The node leaves the cluster by doing a kernel panic.
- # "detach" -> The node drops its backing storage device, and
- # continues in disk less mode.
- #
- on-io-error pass_on;
- # Under fencing we understand preventive measures to avoid situations where both nodes are
- # primary and disconnected (AKA split brain).
- fencing dont-care;
- # In case you only want to use a fraction of the available space
- # you might use the "size" option here.
- #
- # size 10G;
- }
- }
- #
- # this need not be drbd#, you may use phony resource names,
- # like "resource web" or "resource mail", too
- #
- resource "meta" {
- device /dev/drbd0;
- meta-disk internal;
- on primary {
- address 192.168.50.51:7788;
- disk /dev/sdb1;
- }
- on secondary {
- address 192.168.50.52:7788;
- disk /dev/md0p1;
- }
- }
- resource "data" {
- device /dev/drbd1;
- meta-disk internal;
- on primary {
- address 192.168.50.51:7789;
- disk /dev/sdb2;
- }
- on secondary {
- address 192.168.50.52:7789;
- disk /dev/md0p2;
- }
- }
- ########### Notes
- DRBD wasn't built as a module; its baked into the kernel; lsmod doesn't show the drbd module loaded. (i.e.: find /lib/modules -name "drbd*" and find /lib/modules/2.6.*gentoo-*/ -type f -iname '*.o' -or -iname '*.ko' | grep drbd return no results)
- Linux primary 2.6.38-gentoo-r6 #1 SMP Mon Jun 27 07:35:00 EDT 2011 x86_64 Intel(R) Xeon(R) CPU E5504 @ 2.00GHz GenuineIntel GNU/Linux
- Linux secondary 2.6.38-gentoo-r6 #1 SMP Sun Jun 26 15:57:21 EDT 2011 x86_64 Intel(R) Core(TM)2 Quad CPU Q6600 @ 2.40GHz GenuineIntel GNU/Linux
- Using 'sys-cluster/drbd' emerge build (v 8.3.8.1) on both nodes.
- Did not emerge 'sys-cluster/drbd-kernel' emerge build (v 8.0.16)
- Other steps performed:
- `drbdadm {disconnect,connect} resource` on both nodes
- `drbdadm role resource` on both nodes and primary reports primary, secondary reports secondary
- `drbdadm -- --discard-my-data connect resource` on the secondary
- While most of the above commands work correctly on the secondary (except the --discard-my-data of course), on the primary I occasionally see things like
- No response from the DRBD driver! Is the module loaded?
- 1: State change failed: (-12) Device is held open by someone
- Command '/sbin/drbdsetup 1 down' terminated with exit code 11
- 0: Failure: (125) Device has a net-config (use disconnect first)
- Command 'drbdsetup 0 net 192.168.50.51:7788 192.168.50.52:7788 C --set-defaults --create-device' terminated with exit code 10
- Command 'drbdsetup 1 net 192.168.50.51:7789 192.168.50.52:7789 C --set-defaults --create-device' did not terminate within 5 seconds
- Some commands never seem to complete.
- The problem is closely related, if not identical, to the issue outlined here: http://copilotco.com/mail-archives/drbd.2009/msg00449.html. Instead of a bad link, or link with latency I simply stopped the DRBD service on the secondary to simulate an outage or disconnect scenario. I left it off for hours just to get an idea of how well it replicated & how long it would take but to my dismay it hasn't reconnected.
Advertisement
Add Comment
Please, Sign In to add comment