Advertisement
Not a member of Pastebin yet?
Sign Up,
it unlocks many cool features!
- Jan 30 16:16:35 hpc-be028.cern.ch kernel: kworker/1:36: page allocation failure: order:5, mode:0x104050
- Jan 30 16:16:35 hpc-be028.cern.ch kernel: CPU: 1 PID: 78445 Comm: kworker/1:36 Kdump: loaded Tainted: P OE ------------ T 3.10.0-957.27.2.el7.x86_64 #1
- Jan 30 16:16:35 hpc-be028.cern.ch kernel: Hardware name: Intel Corporation S2600KPR/S2600KPR, BIOS SE5C610.86B.01.01.0020.122820161512 12/28/2016
- Jan 30 16:16:35 hpc-be028.cern.ch kernel: Workqueue: ceph-msgr ceph_con_workfn [libceph]
- Jan 30 16:16:35 hpc-be028.cern.ch kernel: Call Trace:
- Jan 30 16:16:35 hpc-be028.cern.ch kernel: [<ffffffffb6164147>] dump_stack+0x19/0x1b
- Jan 30 16:16:35 hpc-be028.cern.ch kernel: [<ffffffffb5bbdec0>] warn_alloc_failed+0x110/0x180
- Jan 30 16:16:35 hpc-be028.cern.ch kernel: [<ffffffffb615f74e>] __alloc_pages_slowpath+0x6b6/0x724
- Jan 30 16:16:35 hpc-be028.cern.ch kernel: [<ffffffffb5bc2524>] __alloc_pages_nodemask+0x404/0x420
- Jan 30 16:16:35 hpc-be028.cern.ch kernel: [<ffffffffb5c0f438>] alloc_pages_current+0x98/0x110
- Jan 30 16:16:35 hpc-be028.cern.ch kernel: [<ffffffffb5bbcc6e>] __get_free_pages+0xe/0x40
- Jan 30 16:16:35 hpc-be028.cern.ch kernel: [<ffffffffb5c1ab4e>] kmalloc_order_trace+0x2e/0xa0
- Jan 30 16:16:36 hpc-be028.cern.ch kernel: [<ffffffffb5c208d1>] __kmalloc_track_caller+0x221/0x240
- Jan 30 16:16:36 hpc-be028.cern.ch kernel: [<ffffffffc0d101e6>] ? osdmap_set_max_osd+0x76/0x1d0 [libceph]
- Jan 30 16:16:36 hpc-be028.cern.ch kernel: [<ffffffffb5bd73ef>] krealloc+0x4f/0xa0
- Jan 30 16:16:36 hpc-be028.cern.ch kernel: [<ffffffffc0d101e6>] osdmap_set_max_osd+0x76/0x1d0 [libceph]
- Jan 30 16:16:36 hpc-be028.cern.ch kernel: [<ffffffffc0d134d5>] ceph_osdmap_decode+0x195/0x860 [libceph]
- Jan 30 16:16:36 hpc-be028.cern.ch kernel: [<ffffffffc0d09c34>] handle_one_map+0x224/0x250 [libceph]
- Jan 30 16:16:36 hpc-be028.cern.ch kernel: [<ffffffffc0d0e69c>] ceph_osdc_handle_map+0x7dc/0x8c0 [libceph]
- Jan 30 16:16:36 hpc-be028.cern.ch kernel: [<ffffffffc0d0ea81>] dispatch+0x301/0xca0 [libceph]
- Jan 30 16:16:36 hpc-be028.cern.ch kernel: [<ffffffffc0cfcfb4>] try_read+0x514/0x12c0 [libceph]
- Jan 30 16:16:36 hpc-be028.cern.ch kernel: [<ffffffffb5addd9e>] ? account_entity_dequeue+0xae/0xd0
- Jan 30 16:16:36 hpc-be028.cern.ch kernel: [<ffffffffb5ae192c>] ? dequeue_entity+0x11c/0x5e0
- Jan 30 16:16:36 hpc-be028.cern.ch kernel: [<ffffffffb601b417>] ? kernel_sendmsg+0x37/0x50
- Jan 30 16:16:36 hpc-be028.cern.ch kernel: [<ffffffffc0cfdf64>] ceph_con_workfn+0xe4/0x1530 [libceph]
- Jan 30 16:16:36 hpc-be028.cern.ch kernel: [<ffffffffb6169aba>] ? __schedule+0x42a/0x860
- Jan 30 16:16:36 hpc-be028.cern.ch kernel: [<ffffffffb5abaf9f>] process_one_work+0x17f/0x440
- Jan 30 16:16:36 hpc-be028.cern.ch kernel: [<ffffffffb5abc036>] worker_thread+0x126/0x3c0
- Jan 30 16:16:36 hpc-be028.cern.ch kernel: [<ffffffffb5abbf10>] ? manage_workers.isra.25+0x2a0/0x2a0
- Jan 30 16:16:36 hpc-be028.cern.ch kernel: [<ffffffffb5ac2e81>] kthread+0xd1/0xe0
- Jan 30 16:16:36 hpc-be028.cern.ch kernel: [<ffffffffb5ac2db0>] ? insert_kthread_work+0x40/0x40
- Jan 30 16:16:36 hpc-be028.cern.ch kernel: [<ffffffffb6176c1d>] ret_from_fork_nospec_begin+0x7/0x21
- Jan 30 16:16:36 hpc-be028.cern.ch kernel: [<ffffffffb5ac2db0>] ? insert_kthread_work+0x40/0x40
- Jan 30 16:16:36 hpc-be028.cern.ch kernel: Mem-Info:
- Jan 30 16:16:36 hpc-be028.cern.ch kernel: active_anon:17199252 inactive_anon:1384796 isolated_anon:0
- active_file:6645205 inactive_file:6831015 isolated_file:64
- unevictable:592 dirty:2964882 writeback:201597 unstable:0
- slab_reclaimable:163441 slab_unreclaimable:80435
- mapped:61869 shmem:46608 pagetables:48720 bounce:0
- free:249748 free_pcp:1338 free_cma:0
- Jan 30 16:16:36 hpc-be028.cern.ch kernel: Node 0 DMA free:15892kB min:60kB low:72kB high:88kB active_anon:0kB inactive_anon:0kB active_file:0kB inactive_file:0kB unevictable:0kB isolated(anon):0kB isolated(file):0k
- Jan 30 16:16:36 hpc-be028.cern.ch kernel: lowmem_reserve[]: 0 1705 64136 64136
- Jan 30 16:16:36 hpc-be028.cern.ch kernel: Node 0 DMA32 free:256916kB min:6948kB low:8684kB high:10420kB active_anon:1059064kB inactive_anon:360964kB active_file:144kB inactive_file:6772kB unevictable:0kB isolated(a
- Jan 30 16:16:36 hpc-be028.cern.ch kernel: lowmem_reserve[]: 0 0 62430 62430
- Jan 30 16:16:36 hpc-be028.cern.ch kernel: Node 0 Normal free:686628kB min:254432kB low:318040kB high:381648kB active_anon:33449040kB inactive_anon:2586528kB active_file:12803548kB inactive_file:13007272kB unevictab
- Jan 30 16:16:36 hpc-be028.cern.ch kernel: lowmem_reserve[]: 0 0 0 0
- Jan 30 16:16:36 hpc-be028.cern.ch kernel: Node 1 Normal free:461520kB min:262840kB low:328548kB high:394260kB active_anon:34316120kB inactive_anon:2591692kB active_file:13766564kB inactive_file:13852284kB unevictab
- Jan 30 16:16:36 hpc-be028.cern.ch kernel: lowmem_reserve[]: 0 0 0 0
- Jan 30 16:16:36 hpc-be028.cern.ch kernel: Node 0 DMA: 1*4kB (U) 0*8kB 1*16kB (U) 0*32kB 2*64kB (U) 1*128kB (U) 1*256kB (U) 0*512kB 1*1024kB (U) 1*2048kB (M) 3*4096kB (M) = 15892kB
- Jan 30 16:16:36 hpc-be028.cern.ch kernel: Node 0 DMA32: 1213*4kB (UEM) 812*8kB (UEM) 420*16kB (UEM) 83*32kB (UEM) 233*64kB (UEM) 168*128kB (UEM) 122*256kB (UEM) 100*512kB (UEM) 115*1024kB (UEM) 0*2048kB 0*4096kB =
- Jan 30 16:16:36 hpc-be028.cern.ch kernel: Node 0 Normal: 96701*4kB (UEM) 34788*8kB (UEM) 4239*16kB (UEM) 57*32kB (UEM) 0*64kB 0*128kB 0*256kB 0*512kB 0*1024kB 0*2048kB 0*4096kB = 734756kB
- Jan 30 16:16:36 hpc-be028.cern.ch kernel: Node 1 Normal: 81934*4kB (UEM) 34559*8kB (UEM) 1082*16kB (UEM) 97*32kB (UE) 3*64kB (UE) 0*128kB 0*256kB 0*512kB 0*1024kB 0*2048kB 0*4096kB = 624816kB
- Jan 30 16:16:36 hpc-be028.cern.ch kernel: Node 0 hugepages_total=0 hugepages_free=0 hugepages_surp=0 hugepages_size=1048576kB
- Jan 30 16:16:36 hpc-be028.cern.ch kernel: Node 0 hugepages_total=0 hugepages_free=0 hugepages_surp=0 hugepages_size=2048kB
- Jan 30 16:16:36 hpc-be028.cern.ch kernel: Node 1 hugepages_total=0 hugepages_free=0 hugepages_surp=0 hugepages_size=1048576kB
- Jan 30 16:16:36 hpc-be028.cern.ch kernel: Node 1 hugepages_total=0 hugepages_free=0 hugepages_surp=0 hugepages_size=2048kB
- Jan 30 16:16:36 hpc-be028.cern.ch kernel: 13386727 total pagecache pages
- Jan 30 16:16:36 hpc-be028.cern.ch kernel: 35711 pages in swap cache
- Jan 30 16:16:36 hpc-be028.cern.ch kernel: Swap cache stats: add 6414255, delete 6385221, find 2627543/2678239
- Jan 30 16:16:36 hpc-be028.cern.ch kernel: Free swap = 65962236kB
- Jan 30 16:16:36 hpc-be028.cern.ch kernel: Total swap = 67108860kB
- Jan 30 16:16:36 hpc-be028.cern.ch kernel: 33530063 pages RAM
- Jan 30 16:16:36 hpc-be028.cern.ch kernel: 0 pages HighMem/MovableOnly
- Jan 30 16:16:36 hpc-be028.cern.ch kernel: 595264 pages reserved
- Jan 30 16:16:36 hpc-be028.cern.ch kernel: libceph: corrupt full osdmap (-12) epoch 676797 off 895 (ffffbcb316713d77 of ffffbcb3167139f8-ffffbcb316781502)
- Jan 30 16:16:36 hpc-be028.cern.ch kernel: osdmap: 00000000: ...
- ...
- then nothing relevant until 1 day later when the client got stuck.
- Now today the same node got stuck again:
- Feb 02 17:03:12 hpc-be028.cern.ch kernel: ceph: mds0 caps stale
- Feb 02 17:05:16 hpc-be028.cern.ch kernel: INFO: task cp:169181 blocked for more than 120 seconds.
- Feb 02 17:05:16 hpc-be028.cern.ch kernel: "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
- Feb 02 17:05:16 hpc-be028.cern.ch kernel: cp D ffff9a71612e30c0 0 169181 1 0x00000086
- Feb 02 17:05:16 hpc-be028.cern.ch kernel: Call Trace:
- Feb 02 17:05:16 hpc-be028.cern.ch kernel: [<ffffffffc0cfc7fa>] ? ceph_con_send+0xba/0x1c0 [libceph]
- Feb 02 17:05:16 hpc-be028.cern.ch kernel: [<ffffffffc0cf9416>] ? ceph_kvmalloc+0x26/0x60 [libceph]
- Feb 02 17:05:16 hpc-be028.cern.ch kernel: [<ffffffffb6168060>] ? bit_wait+0x50/0x50
- Feb 02 17:05:16 hpc-be028.cern.ch kernel: [<ffffffffb6169f19>] schedule+0x29/0x70
- Feb 02 17:05:16 hpc-be028.cern.ch kernel: [<ffffffffb6167a21>] schedule_timeout+0x221/0x2d0
- Feb 02 17:05:16 hpc-be028.cern.ch kernel: [<ffffffffc0ee522e>] ? send_cap_msg+0x28e/0x470 [ceph]
- Feb 02 17:05:16 hpc-be028.cern.ch kernel: [<ffffffffb5b02372>] ? ktime_get_ts64+0x52/0xf0
- Feb 02 17:05:16 hpc-be028.cern.ch kernel: [<ffffffffb6168060>] ? bit_wait+0x50/0x50
- Feb 02 17:05:17 hpc-be028.cern.ch kernel: [<ffffffffb61695ed>] io_schedule_timeout+0xad/0x130
- Feb 02 17:05:17 hpc-be028.cern.ch kernel: [<ffffffffb6169688>] io_schedule+0x18/0x20
- Feb 02 17:05:17 hpc-be028.cern.ch kernel: [<ffffffffb6168071>] bit_wait_io+0x11/0x50
- Feb 02 17:05:17 hpc-be028.cern.ch kernel: [<ffffffffb6167c21>] __wait_on_bit_lock+0x61/0xc0
- Feb 02 17:05:17 hpc-be028.cern.ch kernel: [<ffffffffb5bba6de>] ? __find_get_pages+0x11e/0x1c0
- Feb 02 17:05:17 hpc-be028.cern.ch kernel: [<ffffffffb5bb6f64>] __lock_page+0x74/0x90
- Feb 02 17:05:17 hpc-be028.cern.ch kernel: [<ffffffffb5ac4010>] ? wake_bit_function+0x40/0x40
- Feb 02 17:05:17 hpc-be028.cern.ch kernel: [<ffffffffb5bc859c>] truncate_inode_pages_range+0x6cc/0x700
- Feb 02 17:05:17 hpc-be028.cern.ch kernel: [<ffffffffc0ee6c82>] ? __ceph_caps_issued+0x82/0xe0 [ceph]
- Feb 02 17:05:17 hpc-be028.cern.ch kernel: [<ffffffffb5c70eca>] ? __inode_wait_for_writeback+0x7a/0xf0
- Feb 02 17:05:17 hpc-be028.cern.ch kernel: [<ffffffffb5ac4010>] ? wake_bit_function+0x40/0x40
- Feb 02 17:05:17 hpc-be028.cern.ch kernel: [<ffffffffb5bc863f>] truncate_inode_pages_final+0x4f/0x60
- Feb 02 17:05:17 hpc-be028.cern.ch kernel: [<ffffffffb5c5ffdc>] evict+0x16c/0x180
- Feb 02 17:05:17 hpc-be028.cern.ch kernel: [<ffffffffb5c6082c>] iput+0xfc/0x190
- Feb 02 17:05:17 hpc-be028.cern.ch kernel: [<ffffffffb5c5b1b0>] __dentry_kill+0x120/0x180
- Feb 02 17:05:17 hpc-be028.cern.ch kernel: [<ffffffffb5c5b2c0>] dput+0xb0/0x160
- Feb 02 17:05:17 hpc-be028.cern.ch kernel: [<ffffffffb5c44ace>] __fput+0x17e/0x260
- Feb 02 17:05:17 hpc-be028.cern.ch kernel: [<ffffffffb5c44c9e>] ____fput+0xe/0x10
- Feb 02 17:05:17 hpc-be028.cern.ch kernel: [<ffffffffb5abf9eb>] task_work_run+0xbb/0xe0
- Feb 02 17:05:17 hpc-be028.cern.ch kernel: [<ffffffffb5a9eeb1>] do_exit+0x2d1/0xa40
- Feb 02 17:05:17 hpc-be028.cern.ch kernel: [<ffffffffb5a9f69f>] do_group_exit+0x3f/0xa0
- Feb 02 17:05:17 hpc-be028.cern.ch kernel: [<ffffffffb5ab049e>] get_signal_to_deliver+0x1ce/0x5e0
- Feb 02 17:05:17 hpc-be028.cern.ch kernel: [<ffffffffb5a2b527>] do_signal+0x57/0x6f0
- Feb 02 17:05:17 hpc-be028.cern.ch kernel: [<ffffffffb5a2bc32>] do_notify_resume+0x72/0xc0
- Feb 02 17:05:17 hpc-be028.cern.ch kernel: [<ffffffffb6177134>] int_signal+0x12/0x17
- Feb 02 17:05:17 hpc-be028.cern.ch kernel: INFO: task kworker/5:2:300661 blocked for more than 120 seconds.
- Feb 02 17:05:17 hpc-be028.cern.ch kernel: "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
- Feb 02 17:05:17 hpc-be028.cern.ch kernel: kworker/5:2 D ffff9a7149f22080 0 300661 2 0x00000080
- Feb 02 17:05:17 hpc-be028.cern.ch kernel: Workqueue: ceph-msgr ceph_con_workfn [libceph]
- Feb 02 17:05:17 hpc-be028.cern.ch kernel: Call Trace:
- Feb 02 17:05:17 hpc-be028.cern.ch kernel: [<ffffffffc0ee45f0>] ? ceph_fh_to_parent+0x70/0x70 [ceph]
- Feb 02 17:05:17 hpc-be028.cern.ch kernel: [<ffffffffb6169f19>] schedule+0x29/0x70
- Feb 02 17:05:17 hpc-be028.cern.ch kernel: [<ffffffffb5c5f0c0>] __wait_on_freeing_inode+0xb0/0xf0
- Feb 02 17:05:17 hpc-be028.cern.ch kernel: [<ffffffffb5ac4010>] ? wake_bit_function+0x40/0x40
- Feb 02 17:05:17 hpc-be028.cern.ch kernel: [<ffffffffb5c5f199>] find_inode+0x99/0xc0
- Feb 02 17:05:17 hpc-be028.cern.ch kernel: [<ffffffffc0ee45f0>] ? ceph_fh_to_parent+0x70/0x70 [ceph]
- Feb 02 17:05:17 hpc-be028.cern.ch kernel: [<ffffffffb5c5f231>] ilookup5_nowait+0x71/0x90
- Feb 02 17:05:17 hpc-be028.cern.ch kernel: [<ffffffffb5c6004f>] ilookup5+0xf/0x60
- Feb 02 17:05:17 hpc-be028.cern.ch kernel: [<ffffffffc0eebb8a>] ceph_handle_caps+0x26a/0x1a00 [ceph]
- Feb 02 17:05:17 hpc-be028.cern.ch kernel: [<ffffffffc0ef28b0>] ? mds_check_message_signature+0x30/0x40 [ceph]
- Feb 02 17:05:17 hpc-be028.cern.ch kernel: [<ffffffffc0efadfa>] dispatch+0x7fa/0xb00 [ceph]
- Feb 02 17:05:17 hpc-be028.cern.ch kernel: [<ffffffffb601b56a>] ? kernel_recvmsg+0x3a/0x50
- Feb 02 17:05:17 hpc-be028.cern.ch kernel: [<ffffffffc0cfcfb4>] try_read+0x514/0x12c0 [libceph]
- Feb 02 17:05:17 hpc-be028.cern.ch kernel: [<ffffffffb5addd9e>] ? account_entity_dequeue+0xae/0xd0
- Feb 02 17:05:17 hpc-be028.cern.ch kernel: [<ffffffffb5ae192c>] ? dequeue_entity+0x11c/0x5e0
- Feb 02 17:05:17 hpc-be028.cern.ch kernel: [<ffffffffc0cfdf64>] ceph_con_workfn+0xe4/0x1530 [libceph]
- Feb 02 17:05:17 hpc-be028.cern.ch kernel: [<ffffffffb6169aba>] ? __schedule+0x42a/0x860
- Feb 02 17:05:17 hpc-be028.cern.ch kernel: [<ffffffffb5abaf9f>] process_one_work+0x17f/0x440
- Feb 02 17:05:17 hpc-be028.cern.ch kernel: [<ffffffffb5abc036>] worker_thread+0x126/0x3c0
- Feb 02 17:05:17 hpc-be028.cern.ch kernel: [<ffffffffb5abbf10>] ? manage_workers.isra.25+0x2a0/0x2a0
- Feb 02 17:05:17 hpc-be028.cern.ch kernel: [<ffffffffb5ac2e81>] kthread+0xd1/0xe0
- Feb 02 17:05:17 hpc-be028.cern.ch kernel: [<ffffffffb5ac2db0>] ? insert_kthread_work+0x40/0x40
- Feb 02 17:05:17 hpc-be028.cern.ch kernel: [<ffffffffb6176c1d>] ret_from_fork_nospec_begin+0x7/0x21
- Feb 02 17:05:17 hpc-be028.cern.ch kernel: [<ffffffffb5ac2db0>] ? insert_kthread_work+0x40/0x40
- Feb 02 17:07:17 hpc-be028.cern.ch kernel: INFO: task cp:169181 blocked for more than 120 seconds.
- Feb 02 17:07:17 hpc-be028.cern.ch kernel: "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
- Feb 02 17:07:17 hpc-be028.cern.ch kernel: cp D ffff9a71612e30c0 0 169181 1 0x00000086
- Feb 02 17:07:17 hpc-be028.cern.ch kernel: Call Trace:
- Feb 02 17:07:17 hpc-be028.cern.ch kernel: [<ffffffffc0cfc7fa>] ? ceph_con_send+0xba/0x1c0 [libceph]
- Feb 02 17:07:17 hpc-be028.cern.ch kernel: [<ffffffffc0cf9416>] ? ceph_kvmalloc+0x26/0x60 [libceph]
- Feb 02 17:07:17 hpc-be028.cern.ch kernel: [<ffffffffb6168060>] ? bit_wait+0x50/0x50
- Feb 02 17:07:17 hpc-be028.cern.ch kernel: [<ffffffffb6169f19>] schedule+0x29/0x70
- Feb 02 17:07:17 hpc-be028.cern.ch kernel: [<ffffffffb6167a21>] schedule_timeout+0x221/0x2d0
- Feb 02 17:07:17 hpc-be028.cern.ch kernel: [<ffffffffc0ee522e>] ? send_cap_msg+0x28e/0x470 [ceph]
- Feb 02 17:07:17 hpc-be028.cern.ch kernel: [<ffffffffb5b02372>] ? ktime_get_ts64+0x52/0xf0
- Feb 02 17:07:17 hpc-be028.cern.ch kernel: [<ffffffffb6168060>] ? bit_wait+0x50/0x50
- Feb 02 17:07:17 hpc-be028.cern.ch kernel: [<ffffffffb61695ed>] io_schedule_timeout+0xad/0x130
- Feb 02 17:07:17 hpc-be028.cern.ch kernel: [<ffffffffb6169688>] io_schedule+0x18/0x20
- Feb 02 17:07:17 hpc-be028.cern.ch kernel: [<ffffffffb6168071>] bit_wait_io+0x11/0x50
- Feb 02 17:07:17 hpc-be028.cern.ch kernel: [<ffffffffb6167c21>] __wait_on_bit_lock+0x61/0xc0
- Feb 02 17:07:17 hpc-be028.cern.ch kernel: [<ffffffffb5bba6de>] ? __find_get_pages+0x11e/0x1c0
- Feb 02 17:07:17 hpc-be028.cern.ch kernel: [<ffffffffb5bb6f64>] __lock_page+0x74/0x90
- Feb 02 17:07:17 hpc-be028.cern.ch kernel: [<ffffffffb5ac4010>] ? wake_bit_function+0x40/0x40
- Feb 02 17:07:17 hpc-be028.cern.ch kernel: [<ffffffffb5bc859c>] truncate_inode_pages_range+0x6cc/0x700
- Feb 02 17:07:17 hpc-be028.cern.ch kernel: [<ffffffffc0ee6c82>] ? __ceph_caps_issued+0x82/0xe0 [ceph]
- Feb 02 17:07:17 hpc-be028.cern.ch kernel: [<ffffffffb5c70eca>] ? __inode_wait_for_writeback+0x7a/0xf0
- Feb 02 17:07:17 hpc-be028.cern.ch kernel: [<ffffffffb5ac4010>] ? wake_bit_function+0x40/0x40
- Feb 02 17:05:17 hpc-be028.cern.ch kernel: [<ffffffffb5bc863f>] truncate_inode_pages_final+0x4f/0x60
- Feb 02 17:05:17 hpc-be028.cern.ch kernel: [<ffffffffb5c5ffdc>] evict+0x16c/0x180
- Feb 02 17:05:17 hpc-be028.cern.ch kernel: [<ffffffffb5c6082c>] iput+0xfc/0x190
- Feb 02 17:05:17 hpc-be028.cern.ch kernel: [<ffffffffb5c5b1b0>] __dentry_kill+0x120/0x180
- Feb 02 17:05:17 hpc-be028.cern.ch kernel: [<ffffffffb5c5b2c0>] dput+0xb0/0x160
- Feb 02 17:05:17 hpc-be028.cern.ch kernel: [<ffffffffb5c44ace>] __fput+0x17e/0x260
- Feb 02 17:05:17 hpc-be028.cern.ch kernel: [<ffffffffb5c44c9e>] ____fput+0xe/0x10
- Feb 02 17:05:17 hpc-be028.cern.ch kernel: [<ffffffffb5abf9eb>] task_work_run+0xbb/0xe0
- Feb 02 17:05:17 hpc-be028.cern.ch kernel: [<ffffffffb5a9eeb1>] do_exit+0x2d1/0xa40
- Feb 02 17:05:17 hpc-be028.cern.ch kernel: [<ffffffffb5a9f69f>] do_group_exit+0x3f/0xa0
- Feb 02 17:05:17 hpc-be028.cern.ch kernel: [<ffffffffb5ab049e>] get_signal_to_deliver+0x1ce/0x5e0
- Feb 02 17:05:17 hpc-be028.cern.ch kernel: [<ffffffffb5a2b527>] do_signal+0x57/0x6f0
- Feb 02 17:05:17 hpc-be028.cern.ch kernel: [<ffffffffb5a2bc32>] do_notify_resume+0x72/0xc0
- Feb 02 17:05:17 hpc-be028.cern.ch kernel: [<ffffffffb6177134>] int_signal+0x12/0x17
- Feb 02 17:05:17 hpc-be028.cern.ch kernel: INFO: task kworker/5:2:300661 blocked for more than 120 seconds.
- Feb 02 17:05:17 hpc-be028.cern.ch kernel: "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
- Feb 02 17:05:17 hpc-be028.cern.ch kernel: kworker/5:2 D ffff9a7149f22080 0 300661 2 0x00000080
- Feb 02 17:05:17 hpc-be028.cern.ch kernel: Workqueue: ceph-msgr ceph_con_workfn [libceph]
- Feb 02 17:05:17 hpc-be028.cern.ch kernel: Call Trace:
- Feb 02 17:05:17 hpc-be028.cern.ch kernel: [<ffffffffc0ee45f0>] ? ceph_fh_to_parent+0x70/0x70 [ceph]
- Feb 02 17:05:17 hpc-be028.cern.ch kernel: [<ffffffffb6169f19>] schedule+0x29/0x70
- Feb 02 17:05:17 hpc-be028.cern.ch kernel: [<ffffffffb5c5f0c0>] __wait_on_freeing_inode+0xb0/0xf0
- Feb 02 17:05:17 hpc-be028.cern.ch kernel: [<ffffffffb5ac4010>] ? wake_bit_function+0x40/0x40
- Feb 02 17:05:17 hpc-be028.cern.ch kernel: [<ffffffffb5c5f199>] find_inode+0x99/0xc0
- Feb 02 17:05:17 hpc-be028.cern.ch kernel: [<ffffffffc0ee45f0>] ? ceph_fh_to_parent+0x70/0x70 [ceph]
- Feb 02 17:05:17 hpc-be028.cern.ch kernel: [<ffffffffb5c5f231>] ilookup5_nowait+0x71/0x90
- Feb 02 17:05:17 hpc-be028.cern.ch kernel: [<ffffffffb5c6004f>] ilookup5+0xf/0x60
- Feb 02 17:05:17 hpc-be028.cern.ch kernel: [<ffffffffc0eebb8a>] ceph_handle_caps+0x26a/0x1a00 [ceph]
- Feb 02 17:05:17 hpc-be028.cern.ch kernel: [<ffffffffc0ef28b0>] ? mds_check_message_signature+0x30/0x40 [ceph]
- Feb 02 17:05:17 hpc-be028.cern.ch kernel: [<ffffffffc0efadfa>] dispatch+0x7fa/0xb00 [ceph]
- Feb 02 17:05:17 hpc-be028.cern.ch kernel: [<ffffffffb601b56a>] ? kernel_recvmsg+0x3a/0x50
- Feb 02 17:05:17 hpc-be028.cern.ch kernel: [<ffffffffc0cfcfb4>] try_read+0x514/0x12c0 [libceph]
- Feb 02 17:05:17 hpc-be028.cern.ch kernel: [<ffffffffb5addd9e>] ? account_entity_dequeue+0xae/0xd0
- Feb 02 17:05:17 hpc-be028.cern.ch kernel: [<ffffffffb5ae192c>] ? dequeue_entity+0x11c/0x5e0
- Feb 02 17:05:17 hpc-be028.cern.ch kernel: [<ffffffffc0cfdf64>] ceph_con_workfn+0xe4/0x1530 [libceph]
- Feb 02 17:05:17 hpc-be028.cern.ch kernel: [<ffffffffb6169aba>] ? __schedule+0x42a/0x860
- Feb 02 17:05:17 hpc-be028.cern.ch kernel: [<ffffffffb5abaf9f>] process_one_work+0x17f/0x440
- Feb 02 17:05:17 hpc-be028.cern.ch kernel: [<ffffffffb5abc036>] worker_thread+0x126/0x3c0
- Feb 02 17:05:17 hpc-be028.cern.ch kernel: [<ffffffffb5abbf10>] ? manage_workers.isra.25+0x2a0/0x2a0
- Feb 02 17:05:17 hpc-be028.cern.ch kernel: [<ffffffffb5ac2e81>] kthread+0xd1/0xe0
- Feb 02 17:05:17 hpc-be028.cern.ch kernel: [<ffffffffb5ac2db0>] ? insert_kthread_work+0x40/0x40
- Feb 02 17:05:17 hpc-be028.cern.ch kernel: [<ffffffffb6176c1d>] ret_from_fork_nospec_begin+0x7/0x21
- Feb 02 17:05:17 hpc-be028.cern.ch kernel: [<ffffffffb5ac2db0>] ? insert_kthread_work+0x40/0x40
- Feb 02 17:07:17 hpc-be028.cern.ch kernel: INFO: task cp:169181 blocked for more than 120 seconds.
- Feb 02 17:07:17 hpc-be028.cern.ch kernel: "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
- Feb 02 17:07:17 hpc-be028.cern.ch kernel: cp D ffff9a71612e30c0 0 169181 1 0x00000086
- Feb 02 17:07:17 hpc-be028.cern.ch kernel: Call Trace:
- Feb 02 17:07:17 hpc-be028.cern.ch kernel: [<ffffffffc0cfc7fa>] ? ceph_con_send+0xba/0x1c0 [libceph]
- Feb 02 17:07:17 hpc-be028.cern.ch kernel: [<ffffffffc0cf9416>] ? ceph_kvmalloc+0x26/0x60 [libceph]
- Feb 02 17:07:17 hpc-be028.cern.ch kernel: [<ffffffffb6168060>] ? bit_wait+0x50/0x50
- Feb 02 17:07:17 hpc-be028.cern.ch kernel: [<ffffffffb6169f19>] schedule+0x29/0x70
- Feb 02 17:07:17 hpc-be028.cern.ch kernel: [<ffffffffb6167a21>] schedule_timeout+0x221/0x2d0
- Feb 02 17:07:17 hpc-be028.cern.ch kernel: [<ffffffffc0ee522e>] ? send_cap_msg+0x28e/0x470 [ceph]
- Feb 02 17:07:17 hpc-be028.cern.ch kernel: [<ffffffffb5b02372>] ? ktime_get_ts64+0x52/0xf0
- Feb 02 17:07:17 hpc-be028.cern.ch kernel: [<ffffffffb6168060>] ? bit_wait+0x50/0x50
- Feb 02 17:07:17 hpc-be028.cern.ch kernel: [<ffffffffb61695ed>] io_schedule_timeout+0xad/0x130
- Feb 02 17:07:17 hpc-be028.cern.ch kernel: [<ffffffffb6169688>] io_schedule+0x18/0x20
- Feb 02 17:07:17 hpc-be028.cern.ch kernel: [<ffffffffb6168071>] bit_wait_io+0x11/0x50
- Feb 02 17:07:17 hpc-be028.cern.ch kernel: [<ffffffffb6167c21>] __wait_on_bit_lock+0x61/0xc0
- Feb 02 17:07:17 hpc-be028.cern.ch kernel: [<ffffffffb5bba6de>] ? __find_get_pages+0x11e/0x1c0
- Feb 02 17:07:17 hpc-be028.cern.ch kernel: [<ffffffffb5bb6f64>] __lock_page+0x74/0x90
- Feb 02 17:07:17 hpc-be028.cern.ch kernel: [<ffffffffb5ac4010>] ? wake_bit_function+0x40/0x40
- Feb 02 17:07:17 hpc-be028.cern.ch kernel: [<ffffffffb5bc859c>] truncate_inode_pages_range+0x6cc/0x700
- Feb 02 17:07:17 hpc-be028.cern.ch kernel: [<ffffffffc0ee6c82>] ? __ceph_caps_issued+0x82/0xe0 [ceph]
- Feb 02 17:07:17 hpc-be028.cern.ch kernel: [<ffffffffb5c70eca>] ? __inode_wait_for_writeback+0x7a/0xf0
- Feb 02 17:07:17 hpc-be028.cern.ch kernel: [<ffffffffb5ac4010>] ? wake_bit_function+0x40/0x40
- Feb 02 17:07:17 hpc-be028.cern.ch kernel: [<ffffffffb5bc863f>] truncate_inode_pages_final+0x4f/0x60
- Feb 02 17:07:17 hpc-be028.cern.ch kernel: [<ffffffffb5c5ffdc>] evict+0x16c/0x180
- Feb 02 17:07:17 hpc-be028.cern.ch kernel: [<ffffffffb5c6082c>] iput+0xfc/0x190
- Feb 02 17:07:17 hpc-be028.cern.ch kernel: [<ffffffffb5c5b1b0>] __dentry_kill+0x120/0x180
- Feb 02 17:07:17 hpc-be028.cern.ch kernel: [<ffffffffb5c5b2c0>] dput+0xb0/0x160
- Feb 02 17:07:17 hpc-be028.cern.ch kernel: [<ffffffffb5c44ace>] __fput+0x17e/0x260
- Feb 02 17:07:17 hpc-be028.cern.ch kernel: [<ffffffffb5c44c9e>] ____fput+0xe/0x10
- Feb 02 17:07:17 hpc-be028.cern.ch kernel: [<ffffffffb5abf9eb>] task_work_run+0xbb/0xe0
- Feb 02 17:07:17 hpc-be028.cern.ch kernel: [<ffffffffb5a9eeb1>] do_exit+0x2d1/0xa40
- Feb 02 17:07:17 hpc-be028.cern.ch kernel: [<ffffffffb5a9f69f>] do_group_exit+0x3f/0xa0
- Feb 02 17:07:17 hpc-be028.cern.ch kernel: [<ffffffffb5ab049e>] get_signal_to_deliver+0x1ce/0x5e0
- Feb 02 17:07:17 hpc-be028.cern.ch kernel: [<ffffffffb5a2b527>] do_signal+0x57/0x6f0
- Feb 02 17:07:17 hpc-be028.cern.ch kernel: [<ffffffffb5a2bc32>] do_notify_resume+0x72/0xc0
- Feb 02 17:07:17 hpc-be028.cern.ch kernel: [<ffffffffb6177134>] int_signal+0x12/0x17
- Feb 02 17:07:17 hpc-be028.cern.ch kernel: INFO: task kworker/5:2:300661 blocked for more than 120 seconds.
- Feb 02 17:07:17 hpc-be028.cern.ch kernel: "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
- Feb 02 17:07:17 hpc-be028.cern.ch kernel: kworker/5:2 D ffff9a7149f22080 0 300661 2 0x00000080
- Feb 02 17:07:17 hpc-be028.cern.ch kernel: Workqueue: ceph-msgr ceph_con_workfn [libceph]
- Feb 02 17:07:17 hpc-be028.cern.ch kernel: Call Trace:
- Feb 02 17:07:17 hpc-be028.cern.ch kernel: [<ffffffffc0ee45f0>] ? ceph_fh_to_parent+0x70/0x70 [ceph]
- Feb 02 17:07:17 hpc-be028.cern.ch kernel: [<ffffffffb6169f19>] schedule+0x29/0x70
- Feb 02 17:07:17 hpc-be028.cern.ch kernel: [<ffffffffb5c5f0c0>] __wait_on_freeing_inode+0xb0/0xf0
- Feb 02 17:07:17 hpc-be028.cern.ch kernel: [<ffffffffb5ac4010>] ? wake_bit_function+0x40/0x40
- Feb 02 17:07:17 hpc-be028.cern.ch kernel: [<ffffffffb5c5f199>] find_inode+0x99/0xc0
- Feb 02 17:07:17 hpc-be028.cern.ch kernel: [<ffffffffc0ee45f0>] ? ceph_fh_to_parent+0x70/0x70 [ceph]
- Feb 02 17:07:17 hpc-be028.cern.ch kernel: [<ffffffffb5c5f231>] ilookup5_nowait+0x71/0x90
- Feb 02 17:07:17 hpc-be028.cern.ch kernel: [<ffffffffb5c6004f>] ilookup5+0xf/0x60
- Feb 02 17:07:17 hpc-be028.cern.ch kernel: [<ffffffffc0eebb8a>] ceph_handle_caps+0x26a/0x1a00 [ceph]
- Feb 02 17:07:17 hpc-be028.cern.ch kernel: [<ffffffffc0ef28b0>] ? mds_check_message_signature+0x30/0x40 [ceph]
- Feb 02 17:07:17 hpc-be028.cern.ch kernel: [<ffffffffc0efadfa>] dispatch+0x7fa/0xb00 [ceph]
- Feb 02 17:07:17 hpc-be028.cern.ch kernel: [<ffffffffb601b56a>] ? kernel_recvmsg+0x3a/0x50
- Feb 02 17:07:17 hpc-be028.cern.ch kernel: [<ffffffffc0cfcfb4>] try_read+0x514/0x12c0 [libceph]
- Feb 02 17:07:17 hpc-be028.cern.ch kernel: [<ffffffffb5addd9e>] ? account_entity_dequeue+0xae/0xd0
- Feb 02 17:07:17 hpc-be028.cern.ch kernel: [<ffffffffb5ae192c>] ? dequeue_entity+0x11c/0x5e0
- Feb 02 17:07:17 hpc-be028.cern.ch kernel: [<ffffffffc0cfdf64>] ceph_con_workfn+0xe4/0x1530 [libceph]
- Feb 02 17:07:17 hpc-be028.cern.ch kernel: [<ffffffffb6169aba>] ? __schedule+0x42a/0x860
- Feb 02 17:07:17 hpc-be028.cern.ch kernel: [<ffffffffb5abaf9f>] process_one_work+0x17f/0x440
- Feb 02 17:07:17 hpc-be028.cern.ch kernel: [<ffffffffb5abc036>] worker_thread+0x126/0x3c0
- Feb 02 17:07:17 hpc-be028.cern.ch kernel: [<ffffffffb5abbf10>] ? manage_workers.isra.25+0x2a0/0x2a0
- Feb 02 17:07:17 hpc-be028.cern.ch kernel: [<ffffffffb5ac2e81>] kthread+0xd1/0xe0
- Feb 02 17:07:17 hpc-be028.cern.ch kernel: [<ffffffffb5ac2db0>] ? insert_kthread_work+0x40/0x40
- Feb 02 17:07:17 hpc-be028.cern.ch kernel: [<ffffffffb6176c1d>] ret_from_fork_nospec_begin+0x7/0x21
- Feb 02 17:07:17 hpc-be028.cern.ch kernel: [<ffffffffb5ac2db0>] ? insert_kthread_work+0x40/0x40
- Feb 02 17:07:17 hpc-be028.cern.ch kernel: ceph: mds0 hung
- Feb 02 17:09:17 hpc-be028.cern.ch kernel: INFO: task cp:169181 blocked for more than 120 seconds.
- Feb 02 17:09:17 hpc-be028.cern.ch kernel: "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
- Feb 02 17:09:17 hpc-be028.cern.ch kernel: cp D ffff9a71612e30c0 0 169181 1 0x00000086
- Feb 02 17:09:17 hpc-be028.cern.ch kernel: Call Trace:
- Feb 02 17:09:17 hpc-be028.cern.ch kernel: [<ffffffffc0cfc7fa>] ? ceph_con_send+0xba/0x1c0 [libceph]
- Feb 02 17:09:17 hpc-be028.cern.ch kernel: [<ffffffffc0cf9416>] ? ceph_kvmalloc+0x26/0x60 [libceph]
- Feb 02 17:09:17 hpc-be028.cern.ch kernel: [<ffffffffb6168060>] ? bit_wait+0x50/0x50
- Feb 02 17:09:17 hpc-be028.cern.ch kernel: [<ffffffffb6169f19>] schedule+0x29/0x70
- Feb 02 17:09:17 hpc-be028.cern.ch kernel: [<ffffffffb6167a21>] schedule_timeout+0x221/0x2d0
- Feb 02 17:09:17 hpc-be028.cern.ch kernel: [<ffffffffc0ee522e>] ? send_cap_msg+0x28e/0x470 [ceph]
- Feb 02 17:09:17 hpc-be028.cern.ch kernel: [<ffffffffb5b02372>] ? ktime_get_ts64+0x52/0xf0
- Feb 02 17:09:17 hpc-be028.cern.ch kernel: [<ffffffffb6168060>] ? bit_wait+0x50/0x50
- Feb 02 17:09:17 hpc-be028.cern.ch kernel: [<ffffffffb61695ed>] io_schedule_timeout+0xad/0x130
- Feb 02 17:09:18 hpc-be028.cern.ch kernel: [<ffffffffb6169688>] io_schedule+0x18/0x20
- Feb 02 17:09:18 hpc-be028.cern.ch kernel: [<ffffffffb6168071>] bit_wait_io+0x11/0x50
- Feb 02 17:09:18 hpc-be028.cern.ch kernel: [<ffffffffb6167c21>] __wait_on_bit_lock+0x61/0xc0
- Feb 02 17:09:18 hpc-be028.cern.ch kernel: [<ffffffffb5bba6de>] ? __find_get_pages+0x11e/0x1c0
- Feb 02 17:09:18 hpc-be028.cern.ch kernel: [<ffffffffb5bb6f64>] __lock_page+0x74/0x90
- Feb 02 17:09:18 hpc-be028.cern.ch kernel: [<ffffffffb5ac4010>] ? wake_bit_function+0x40/0x40
- Feb 02 17:09:18 hpc-be028.cern.ch kernel: [<ffffffffb5bc859c>] truncate_inode_pages_range+0x6cc/0x700
- Feb 02 17:09:18 hpc-be028.cern.ch kernel: [<ffffffffc0ee6c82>] ? __ceph_caps_issued+0x82/0xe0 [ceph]
- Feb 02 17:09:18 hpc-be028.cern.ch kernel: [<ffffffffb5c70eca>] ? __inode_wait_for_writeback+0x7a/0xf0
- Feb 02 17:09:18 hpc-be028.cern.ch kernel: [<ffffffffb5ac4010>] ? wake_bit_function+0x40/0x40
- Feb 02 17:09:18 hpc-be028.cern.ch kernel: [<ffffffffb5bc863f>] truncate_inode_pages_final+0x4f/0x60
- Feb 02 17:09:18 hpc-be028.cern.ch kernel: [<ffffffffb5c5ffdc>] evict+0x16c/0x180
- Feb 02 17:09:18 hpc-be028.cern.ch kernel: [<ffffffffb5c6082c>] iput+0xfc/0x190
- Feb 02 17:09:18 hpc-be028.cern.ch kernel: [<ffffffffb5c5b1b0>] __dentry_kill+0x120/0x180
- Feb 02 17:09:18 hpc-be028.cern.ch kernel: [<ffffffffb5c5b2c0>] dput+0xb0/0x160
- Feb 02 17:09:18 hpc-be028.cern.ch kernel: [<ffffffffb5c44ace>] __fput+0x17e/0x260
- Feb 02 17:09:18 hpc-be028.cern.ch kernel: [<ffffffffb5c44c9e>] ____fput+0xe/0x10
- Feb 02 17:09:18 hpc-be028.cern.ch kernel: [<ffffffffb5abf9eb>] task_work_run+0xbb/0xe0
- Feb 02 17:09:18 hpc-be028.cern.ch kernel: [<ffffffffb5a9eeb1>] do_exit+0x2d1/0xa40
- Feb 02 17:09:18 hpc-be028.cern.ch kernel: [<ffffffffb5a9f69f>] do_group_exit+0x3f/0xa0
- Feb 02 17:09:18 hpc-be028.cern.ch kernel: [<ffffffffb5ab049e>] get_signal_to_deliver+0x1ce/0x5e0
- Feb 02 17:09:18 hpc-be028.cern.ch kernel: [<ffffffffb5a2b527>] do_signal+0x57/0x6f0
- Feb 02 17:09:18 hpc-be028.cern.ch kernel: [<ffffffffb5a2bc32>] do_notify_resume+0x72/0xc0
- Feb 02 17:09:18 hpc-be028.cern.ch kernel: [<ffffffffb6177134>] int_signal+0x12/0x17
Advertisement
Add Comment
Please, Sign In to add comment
Advertisement