Advertisement
Not a member of Pastebin yet?
Sign Up,
it unlocks many cool features!
- Servers involved:-
- 103.25.214.60 - apollo.launtel.net.au - node crashed
- 122.201.85.240 - aphrodite.launtel.net.au - node crashed
- 103.25.214.61 - zeus.launtel.net.au - executed drop table on this node, didn't crash but went non-primary
- 54.66.156.123 - ares.launtel.net.au - didn't crash but went non-primary.
- 172.31.15.84 - ares.launtel.net.au - internal IP of above server - behind NAT gw (AWS)
- Galera section of mysql config file (all the same except IP addresses and names swapped around):-
- [galera]
- wsrep_provider=/usr/lib64/galera/libgalera_smm.so
- wsrep_cluster_address=gcomm://122.201.85.240,103.25.214.60,54.66.156.123
- binlog_format=row
- default_storage_engine=InnoDB
- innodb_autoinc_lock_mode=2
- bind-address=0.0.0.0
- wsrep_cluster_name='launtel'
- wsrep_node_address='103.25.214.61'
- wsrep_node_name='zeus'
- wsrep_sst_method=rsync
- wsrep_sst_auth=replicate:XXXXXXXXXXXX
- wsrep_on=ON
- ===========================================================================================
- /var/lib/mysql/apollo.launtel.net.au.err -
- 150523 13:28:51 [ERROR] mysqld got signal 11 ;
- This could be because you hit a bug. It is also possible that this binary
- or one of the libraries it was linked against is corrupt, improperly built,
- or misconfigured. This error can also be caused by malfunctioning hardware.
- To report this bug, see http://kb.askmonty.org/en/reporting-bugs
- We will try our best to scrape up some info that will hopefully help
- diagnose the problem, but since we have already crashed,
- something is definitely wrong and this may fail.
- Server version: 5.5.41-MariaDB-wsrep
- key_buffer_size=134217728
- read_buffer_size=131072
- max_used_connections=16
- max_threads=153
- thread_count=17
- It is possible that mysqld could use up to
- key_buffer_size + (read_buffer_size + sort_buffer_size)*max_threads = 466778 K bytes of memory
- Hope that's ok; if not, decrease some variables in the equation.
- Thread pointer: 0x0x7fc6e6c14000
- Attempting backtrace. You can use the following information to find out
- where mysqld died. If you see no messages after this, something went
- terribly wrong...
- stack_bottom = 0x7fc6f90c59b0 thread_stack 0x48000
- /usr/sbin/mysqld(my_print_stacktrace+0x2e)[0xae59ee]
- /usr/sbin/mysqld(handle_fatal_signal+0x390)[0x6fec00]
- /lib64/libpthread.so.0(+0xf130)[0x7fc6f8cfc130]
- /usr/sbin/mysqld(_ZN28Format_description_log_event14do_apply_eventEPK14Relay_log_info+0xc8)[0x7c45b8]
- /usr/sbin/mysqld(_Z14wsrep_apply_cbPvPKvmjPK14wsrep_trx_meta+0x6d0)[0x6b2380]
- /usr/lib64/galera/libgalera_smm.so(_ZNK6galera9TrxHandle5applyEPvPF15wsrep_cb_statusS1_PKvmjPK14wsrep_trx_metaERS6_+0x100)[0x7fc6f3d551a0]
- /usr/lib64/galera/libgalera_smm.so(+0x1b7330)[0x7fc6f3d8a330]
- /usr/lib64/galera/libgalera_smm.so(_ZN6galera13ReplicatorSMM9apply_trxEPvPNS_9TrxHandleE+0xc3)[0x7fc6f3d8ca53]
- /usr/lib64/galera/libgalera_smm.so(_ZN6galera13ReplicatorSMM11process_trxEPvPNS_9TrxHandleE+0x136)[0x7fc6f3d8f476]
- /usr/lib64/galera/libgalera_smm.so(_ZN6galera15GcsActionSource8dispatchEPvRK10gcs_actionRb+0x1d9)[0x7fc6f3d6e759]
- /usr/lib64/galera/libgalera_smm.so(_ZN6galera15GcsActionSource7processEPvRb+0x5c)[0x7fc6f3d6f55c]
- /usr/lib64/galera/libgalera_smm.so(_ZN6galera13ReplicatorSMM10async_recvEPv+0x83)[0x7fc6f3d8fa13]
- /usr/lib64/galera/libgalera_smm.so(galera_recv+0x2b)[0x7fc6f3d9eedb]
- /usr/sbin/mysqld[0x6b2c6f]
- /usr/sbin/mysqld(start_wsrep_THD+0x4f8)[0x522718]
- /lib64/libpthread.so.0(+0x7df3)[0x7fc6f8cf4df3]
- /lib64/libc.so.6(clone+0x6d)[0x7fc6f75721ad]
- Trying to get some variables.
- Some pointers may be invalid and cause the dump to abort.
- Query (0x0): is an invalid pointer
- Connection ID (thread ID): 2
- Status: NOT_KILLED
- Optimizer switch: index_merge=on,index_merge_union=on,index_merge_sort_union=on,index_merge_intersection=on,index_merge_sort_intersection=off,engine_condition_pushdown=off,index_condition_pushdown=on,derived_merge=on,derived_with_keys=on,firstmatch=on,loosescan=on,materialization=on,in_to_exists=on,semijoin=on,partial_match_rowid_merge=on,partial_match_table_scan=on,subquery_cache=on,mrr=off,mrr_cost_based=off,mrr_sort_keys=off,outer_join_with_cache=on,semijoin_with_cache=on,join_cache_incremental=on,join_cache_hashed=on,join_cache_bka=on,optimize_join_buffer_size=off,table_elimination=on,extended_keys=off
- The manual page at http://dev.mysql.com/doc/mysql/en/crashing.html contains
- information that should help you find out what is causing the crash.
- 150523 13:28:52 mysqld_safe Number of processes running now: 0
- 150523 13:28:52 mysqld_safe WSREP: not restarting wsrep node automatically
- 150523 13:28:52 mysqld_safe mysqld from pid file /var/lib/mysql/apollo.launtel.net.au.pid ended
- ===========================================================================================
- /var/lib/mysqld/zeus.launtel.net.au.err (node didn't crash):-
- 150523 13:28:52 [Note] WSREP: (a5225fff, 'tcp://0.0.0.0:4567') turning message relay requesting on, nonlive peers: tcp://103.25.214.60:4567
- 150523 13:28:53 [Note] WSREP: (a5225fff, 'tcp://0.0.0.0:4567') reconnecting to 9a8aa1a2 (tcp://103.25.214.60:4567), attempt 0
- 150523 13:28:53 [Note] WSREP: (a5225fff, 'tcp://0.0.0.0:4567') reconnecting to ed987f90 (tcp://122.201.85.240:4567), attempt 0
- 150523 13:28:57 [Note] WSREP: evs::proto(a5225fff, OPERATIONAL, view_id(REG,9a8aa1a2,4)) suspecting node: 9a8aa1a2
- 150523 13:28:57 [Note] WSREP: evs::proto(a5225fff, OPERATIONAL, view_id(REG,9a8aa1a2,4)) suspected node without join message, declaring inactive
- 150523 13:28:57 [Note] WSREP: evs::proto(a5225fff, OPERATIONAL, view_id(REG,9a8aa1a2,4)) suspecting node: ed987f90
- 150523 13:28:57 [Note] WSREP: evs::proto(a5225fff, OPERATIONAL, view_id(REG,9a8aa1a2,4)) suspected node without join message, declaring inactive
- 150523 13:29:04 [Warning] WSREP: evs::proto(a5225fff, GATHER, view_id(REG,9a8aa1a2,4)) install timer expired
- evs::proto(evs::proto(a5225fff, GATHER, view_id(REG,9a8aa1a2,4)), GATHER) {
- current_view=view(view_id(REG,9a8aa1a2,4) memb {
- 9a8aa1a2,0
- a5225fff,0
- cc1f173d,0
- ed987f90,0
- } joined {
- } left {
- } partitioned {
- }),
- input_map=evs::input_map: {aru_seq=67516,safe_seq=67516,node_index=node: {idx=0,range=[67517,67516],safe_seq=67516} node: {idx=1,range=[67531,67530],safe_seq=67516} node: {idx=2,range=[67531,67530],safe_seq=67516} node: {idx=3,range=[67519,67518],safe_seq=67516} },
- fifo_seq=153345,
- last_sent=67530,
- known:
- 9a8aa1a2 at tcp://103.25.214.60:4567
- {o=0,s=1,i=0,fs=147332,}
- a5225fff at
- {o=1,s=0,i=0,fs=-1,jm=
- {v=0,t=4,ut=255,o=1,s=67516,sr=-1,as=67516,f=0,src=a5225fff,srcvid=view_id(REG,9a8aa1a2,4),insvid=view_id(UNKNOWN,00000000,0),ru=00000000,r=[-1,-1],fs=153343,nl=(
- 9a8aa1a2, {o=0,s=1,e=0,ls=-1,vid=view_id(REG,9a8aa1a2,4),ss=67516,ir=[67517,67516],}
- a5225fff, {o=1,s=0,e=0,ls=-1,vid=view_id(REG,9a8aa1a2,4),ss=67516,ir=[67531,67530],}
- cc1f173d, {o=1,s=0,e=0,ls=-1,vid=view_id(REG,9a8aa1a2,4),ss=67516,ir=[67531,67530],}
- ed987f90, {o=0,s=1,e=0,ls=-1,vid=view_id(REG,9a8aa1a2,4),ss=67516,ir=[67519,67518],}
- )
- },
- }
- cc1f173d at tcp://54.66.156.123:4567
- {o=1,s=0,i=0,fs=135024,jm=
- {v=0,t=4,ut=255,o=1,s=67516,sr=-1,as=67516,f=4,src=cc1f173d,srcvid=view_id(REG,9a8aa1a2,4),insvid=view_id(UNKNOWN,00000000,0),ru=00000000,r=[-1,-1],fs=135024,nl=(
- 9a8aa1a2, {o=1,s=1,e=0,ls=-1,vid=view_id(REG,9a8aa1a2,4),ss=67516,ir=[67517,67516],}
- a5225fff, {o=1,s=0,e=0,ls=-1,vid=view_id(REG,9a8aa1a2,4),ss=67516,ir=[67531,67530],}
- cc1f173d, {o=1,s=0,e=0,ls=-1,vid=view_id(REG,9a8aa1a2,4),ss=67516,ir=[67531,67530],}
- ed987f90, {o=1,s=1,e=0,ls=-1,vid=view_id(REG,9a8aa1a2,4),ss=67516,ir=[67519,67518],}
- )
- },
- }
- ed987f90 at tcp://122.201.85.240:4567
- {o=0,s=1,i=0,fs=154192,}
- }
- 150523 13:29:04 [Note] WSREP: no install message received
- 150523 13:29:04 [Note] WSREP: view(view_id(NON_PRIM,9a8aa1a2,4) memb {
- a5225fff,0
- } joined {
- } left {
- } partitioned {
- 9a8aa1a2,0
- cc1f173d,0
- ed987f90,0
- })
- 150523 13:29:04 [Note] WSREP: New COMPONENT: primary = no, bootstrap = no, my_idx = 0, memb_num = 1
- 150523 13:29:04 [Note] WSREP: Flow-control interval: [16, 16]
- 150523 13:29:04 [Note] WSREP: Received NON-PRIMARY.
- 150523 13:29:04 [Note] WSREP: Shifting SYNCED -> OPEN (TO: 49731511)
- 150523 13:29:04 [Warning] WSREP: Last Applied Action message in non-primary configuration from member 0
- 150523 13:29:04 [Note] WSREP: view(view_id(NON_PRIM,a5225fff,5) memb {
- a5225fff,0
- } joined {
- } left {
- } partitioned {
- 9a8aa1a2,0
- cc1f173d,0
- ed987f90,0
- })
- 150523 13:29:04 [Note] WSREP: New COMPONENT: primary = no, bootstrap = no, my_idx = 0, memb_num = 1
- 150523 13:29:04 [Note] WSREP: New cluster view: global state: 1e90fb11-b059-11e4-a6ce-0663d9000b59:49731511, view# -1: non-Primary, number of nodes: 1, my index: 0, protocol version 3
- 150523 13:29:04 [Note] WSREP: Flow-control interval: [16, 16]
- 150523 13:29:04 [Note] WSREP: Received NON-PRIMARY.
- 150523 13:29:04 [Note] WSREP: wsrep_notify_cmd is not defined, skipping notification.
- 150523 13:29:04 [Note] WSREP: New cluster view: global state: 1e90fb11-b059-11e4-a6ce-0663d9000b59:49731511, view# -1: non-Primary, number of nodes: 1, my index: 0, protocol version 3
- 150523 13:29:04 [Note] WSREP: wsrep_notify_cmd is not defined, skipping notification.
- 150523 13:29:04 [Warning] WSREP: Send action {(nil), 439, TORDERED} returned -107 (Transport endpoint is not connected)
- 150523 13:29:05 [Note] WSREP: declaring cc1f173d at tcp://54.66.156.123:4567 stable
- 150523 13:29:05 [Note] WSREP: view(view_id(NON_PRIM,a5225fff,6) memb {
- a5225fff,0
- cc1f173d,0
- } joined {
- } left {
- } partitioned {
- 9a8aa1a2,0
- ed987f90,0
- })
- 150523 13:29:05 [Note] WSREP: New COMPONENT: primary = no, bootstrap = no, my_idx = 0, memb_num = 2
- 150523 13:29:05 [Note] WSREP: Flow-control interval: [23, 23]
- 150523 13:29:05 [Note] WSREP: Received NON-PRIMARY.
- 150523 13:29:05 [Note] WSREP: New cluster view: global state: 1e90fb11-b059-11e4-a6ce-0663d9000b59:49731511, view# -1: non-Primary, number of nodes: 2, my index: 0, protocol version 3
- 150523 13:29:05 [Note] WSREP: wsrep_notify_cmd is not defined, skipping notification.
- 150523 13:29:37 [Note] WSREP: (a5225fff, 'tcp://0.0.0.0:4567') reconnecting to 9a8aa1a2 (tcp://103.25.214.60:4567), attempt 30
- 150523 13:29:38 [Note] WSREP: (a5225fff, 'tcp://0.0.0.0:4567') reconnecting to ed987f90 (tcp://122.201.85.240:4567), attempt 30
- 150523 13:30:22 [Note] WSREP: (a5225fff, 'tcp://0.0.0.0:4567') reconnecting to 9a8aa1a2 (tcp://103.25.214.60:4567), attempt 60
- 150523 13:30:23 [Note] WSREP: (a5225fff, 'tcp://0.0.0.0:4567') reconnecting to ed987f90 (tcp://122.201.85.240:4567), attempt 60
- 150523 13:31:07 [Note] WSREP: (a5225fff, 'tcp://0.0.0.0:4567') reconnecting to 9a8aa1a2 (tcp://103.25.214.60:4567), attempt 90
- 150523 13:31:08 [Note] WSREP: (a5225fff, 'tcp://0.0.0.0:4567') reconnecting to ed987f90 (tcp://122.201.85.240:4567), attempt 90
- 150523 13:31:52 [Note] WSREP: (a5225fff, 'tcp://0.0.0.0:4567') reconnecting to 9a8aa1a2 (tcp://103.25.214.60:4567), attempt 120
- 150523 13:31:53 [Note] WSREP: (a5225fff, 'tcp://0.0.0.0:4567') reconnecting to ed987f90 (tcp://122.201.85.240:4567), attempt 120
- 150523 13:32:37 [Note] WSREP: (a5225fff, 'tcp://0.0.0.0:4567') reconnecting to 9a8aa1a2 (tcp://103.25.214.60:4567), attempt 150
- 150523 13:32:38 [Note] WSREP: (a5225fff, 'tcp://0.0.0.0:4567') reconnecting to ed987f90 (tcp://122.201.85.240:4567), attempt 150
- 150523 13:33:17 [Note] WSREP: view(view_id(PRIM,a5225fff,6) memb {
- a5225fff,0
- cc1f173d,0
- } joined {
- } left {
- } partitioned {
- 9a8aa1a2,0
- ed987f90,0
- })
- 150523 13:33:17 [Note] WSREP: save pc into disk
- 150523 13:33:17 [Note] WSREP: forgetting 9a8aa1a2 (tcp://103.25.214.60:4567)
- 150523 13:33:17 [Note] WSREP: New COMPONENT: primary = yes, bootstrap = yes, my_idx = 0, memb_num = 2
- 150523 13:33:17 [Note] WSREP: forgetting ed987f90 (tcp://122.201.85.240:4567)
- 150523 13:33:17 [Note] WSREP: (a5225fff, 'tcp://0.0.0.0:4567') turning message relay requesting off
- 150523 13:33:17 [Note] WSREP: STATE_EXCHANGE: sent state UUID: 73a3c47a-00fc-11e5-8ba9-b25b7ee5f569
- 150523 13:33:17 [Note] WSREP: STATE EXCHANGE: sent state msg: 73a3c47a-00fc-11e5-8ba9-b25b7ee5f569
- 150523 13:33:17 [Note] WSREP: STATE EXCHANGE: got state msg: 73a3c47a-00fc-11e5-8ba9-b25b7ee5f569 from 0 (zeus)
- 150523 13:33:17 [Note] WSREP: STATE EXCHANGE: got state msg: 73a3c47a-00fc-11e5-8ba9-b25b7ee5f569 from 1 (ares)
- 150523 13:33:17 [Warning] WSREP: Quorum: No node with complete state:
- Version : 3
- Flags : 0x7
- Protocols : 0 / 7 / 3
- State : NON-PRIMARY
- Prim state : SYNCED
- Prim UUID : cd99c074-00e2-11e5-addc-2bca0d0759b6
- Prim seqno : 4
- First seqno : 49703422
- Last seqno : 49731511
- Prim JOINED : 4
- State UUID : 73a3c47a-00fc-11e5-8ba9-b25b7ee5f569
- Group UUID : 1e90fb11-b059-11e4-a6ce-0663d9000b59
- Name : 'zeus'
- Incoming addr: '103.25.214.61:3306'
- Version : 3
- Flags : 0x6
- Protocols : 0 / 5 / 3
- State : NON-PRIMARY
- Prim state : SYNCED
- Prim UUID : cd99c074-00e2-11e5-addc-2bca0d0759b6
- Prim seqno : 4
- First seqno : 49705790
- Last seqno : 49731511
- Prim JOINED : 4
- State UUID : 73a3c47a-00fc-11e5-8ba9-b25b7ee5f569
- Group UUID : 1e90fb11-b059-11e4-a6ce-0663d9000b59
- Name : 'ares'
- Incoming addr: '172.31.15.84:3306'
- 150523 13:33:17 [Note] WSREP: Partial re-merge of primary cd99c074-00e2-11e5-addc-2bca0d0759b6 found: 2 of 4.
- 150523 13:33:17 [Note] WSREP: Quorum results:
- version = 3,
- component = PRIMARY,
- conf_id = 4,
- members = 2/2 (joined/total),
- act_id = 49731511,
- last_appl. = 49731384,
- protocols = 0/5/3 (gcs/repl/appl),
- group UUID = 1e90fb11-b059-11e4-a6ce-0663d9000b59
- 150523 13:33:17 [Note] WSREP: Flow-control interval: [23, 23]
- 150523 13:33:17 [Note] WSREP: Restored state OPEN -> SYNCED (49731511)
- 150523 13:33:17 [Note] WSREP: New cluster view: global state: 1e90fb11-b059-11e4-a6ce-0663d9000b59:49731511, view# 5: Primary, number of nodes: 2, my index: 0, protocol version 3
- 150523 13:33:17 [Note] WSREP: wsrep_notify_cmd is not defined, skipping notification.
- 150523 13:33:17 [Note] WSREP: REPL Protocols: 5 (3, 1)
- 150523 13:33:17 [Note] WSREP: Service thread queue flushed.
- 150523 13:33:17 [Note] WSREP: Assign initial position for certification: 49731511, protocol version: 3
- 150523 13:33:17 [Note] WSREP: Service thread queue flushed.
- 150523 13:33:17 [Note] WSREP: Synchronized with group, ready for connections
- 150523 13:33:17 [Note] WSREP: wsrep_notify_cmd is not defined, skipping notification.
- 150523 13:33:20 [Note] WSREP: cleaning up 9a8aa1a2 (tcp://103.25.214.60:4567)
- 150523 13:33:20 [Note] WSREP: cleaning up ed987f90 (tcp://122.201.85.240:4567)
Advertisement
Add Comment
Please, Sign In to add comment
Advertisement