Hi Galina, I have observed that some of my bots timeout for no apparent reason (no external load, no network mishaps, no issues), and when I go to check the buildslave process, I see that it has just being restarted. Do you know what could be causing this? Swap is untouched, plenty of disk left, CPUs a plenty. So, I'm contemplating the possibility that the master lost connectivity to it and didn't get it back until it was too late. I've seen this happen with my local master, when I reload the config, the bot doesn't die but doesn't do much either. What I do is then a full restart of all slaves and the master. This is impractical in the LLVM Lab, but would be good to at least understand what's going on before I get people accusing me of "unstable behaviour". :) cheers, --renato
Hi Renato, Could you point me please (what bots and when did this happen)? I will look. Thanks Galina On Tue, Oct 13, 2015 at 1:40 PM, Renato Golin <renato.golin at linaro.org> wrote:> Hi Galina, > > I have observed that some of my bots timeout for no apparent reason > (no external load, no network mishaps, no issues), and when I go to > check the buildslave process, I see that it has just being restarted. > > Do you know what could be causing this? > > Swap is untouched, plenty of disk left, CPUs a plenty. So, I'm > contemplating the possibility that the master lost connectivity to it > and didn't get it back until it was too late. > > I've seen this happen with my local master, when I reload the config, > the bot doesn't die but doesn't do much either. What I do is then a > full restart of all slaves and the master. > > This is impractical in the LLVM Lab, but would be good to at least > understand what's going on before I get people accusing me of > "unstable behaviour". :) > > cheers, > --renato >-------------- next part -------------- An HTML attachment was scrubbed... URL: <http://lists.llvm.org/pipermail/llvm-dev/attachments/20151013/12fa451c/attachment.html>
On 13 October 2015 at 21:51, Galina Kistanova <gkistanova at gmail.com> wrote:> Could you point me please (what bots and when did this happen)?http://lab.llvm.org:8011/builders/clang-cmake-armv7-a15/builds/6793> I will look.Thanks! I may be wrong, but this is not the first phantom issue I had to chase recently. :) Just want to make sure there wasn't anything from your side before I dwell into the hardware parts. --renato