[Date Prev][Date Next] [Thread Prev][Thread Next] [Date Index] [Thread Index]

Re: ravirin down



The tool works by each server sending an HTTP PUT every 5 minutes, so it’s
possible it was dying and a login shell was too much to ask for; odd though.
Anyway, back up now.

James

> On 31 Oct 2016, at 14:20, rod <r.schnell@mythos.freeddns.org> wrote:
> 
> I tried to get a prompt, which I usually do when I see seg fault errors,
> and it wouldn't return one.  Maybe it's a ghost.  Today's the day for them.
> 
> Rod
> 
> On 10/31/2016 9:18 AM, James Clarke wrote:
>> Hi Rod,
>> Don’t worry; those are all down to me (a kernel module to dump vmap_lazy_nr
>> and friends, and the DEBUG ones are because I added a printk to see if a
>> particular condition is reached). Then of course the tst_qtgraphical segfault
>> is just a normal user-space SIGSEGV.
>> 
>> I’m curious as to whether raverin actually needed rebooting; my monitoring
>> tool says it was still up 7 minutes ago, but not 2 minutes later at the
>> next refresh (presumably because you’ve rebooted). Being idle is expected;
>> there are no packages queued for building.
>> 
>> Thanks,
>> James
>> 
>>> On 31 Oct 2016, at 14:12, rod <r.schnell@mythos.freeddns.org> wrote:
>>> 
>>> Good morning,
>>> 
>>> I saw ravirin was not building again so in the process of rebooting him
>>> I found this snipet.
>>> 
>>> [ 1208.832559] vln_init: lazy_max_pages() is 4096
>>> [ 1208.918323] vln_init: vmap_lazy_nr is caeb1c
>>> [ 1208.974405] vln_init: *vmap_lazy_nr is 145
>>> [ 1209.030340] vln_init: lazy_max_pages() is 4096
>>> [ 1209.294257] vln_init: vmap_lazy_nr is caeb1c
>>> [ 1209.350412] vln_init: *vmap_lazy_nr is 174
>>> [ 1209.406334] vln_init: lazy_max_pages() is 4096
>>> [ 1209.834246] vln_init: vmap_lazy_nr is caeb1c
>>> [ 1209.890347] vln_init: *vmap_lazy_nr is 203
>>> [ 1209.946274] vln_init: lazy_max_pages() is 4096
>>> [ 1210.553105] vln_init: vmap_lazy_nr is caeb1c
>>> [ 1210.609279] vln_init: *vmap_lazy_nr is 232
>>> [ 1210.665157] vln_init: lazy_max_pages() is 4096
>>> [ 1211.349159] vln_init: vmap_lazy_nr is caeb1c
>>> [ 1211.405310] vln_init: *vmap_lazy_nr is 261
>>> [ 1211.461203] vln_init: lazy_max_pages() is 4096
>>> [24537.258928] TSB[kworker/0:5:5958]: DEBUG flush_tsb_kernel_range
>>> start=000000001000c000 end=00000000f0000000 PAGE_SIZE=2000
>>> [24537.404378] TSB[kworker/0:5:5958]: DEBUG flush_tsb_kernel_range
>>> start=0000000100000000 end=00000001020d6000 PAGE_SIZE=2000
>>> [217632.451540] TSB[kworker/0:1:17055]: DEBUG flush_tsb_kernel_range
>>> start=0000000100004000 end=0000000102126000 PAGE_SIZE=2000
>>> [230284.512421] tst_qtgraphical[476]: segfault at 0 ip fff00001013e8644
>>> (rpc fff0000100841884) sp 000007feffc275e1 error 30001 in
>>> libc-2.24.so[fff000010135c000+15e000]
>>> 
>>> Not sure what it means nor what I can do to help fix it.  Thoughts.
>>> 
>>> Rod
>>> 
>>> 
>>> On 10/27/2016 7:07 AM, James Clarke wrote:
>>>> Hi Rod,
>>>> It seems ravirin has been down for at least the past few days. Could you please
>>>> give it some love? If it’s crashed/hung with CPU lockups, I’ve built a 4.9
>>>> kernel with a patch to fix this which I’d like to test.
>>>> 
>>>> Thanks,
>>>> James
>>>> 
>> 


Reply to: