Skip site navigation (1)Skip section navigation (2)
Date:      Fri, 23 Oct 2015 09:34:13 -0700
From:      "Russell L. Carter" <rcarter@pinyon.org>
To:        FreeBSD Ports ML <freebsd-ports@freebsd.org>
Subject:   hung poudriere bulk recovery
Message-ID:  <562A6185.5000305@pinyon.org>

next in thread | raw e-mail | index | archive | help

Greetings,

Recently my nightly cron poudriere builds have been occasionally
hanging.  For instance, here's last night's, with apparently no
progress for over 10 hours:

root@terpsichore> poudriere status
SET PORTS   JAIL            BUILD                STATUS         QUEUE 
BUILT FAIL SKIP IGNORE REMAIN TIME     LOGS
-   default 10-stable-amd64 2015-10-22_22h30m08s parallel_build   488 
  34    0    0      0    454 10:45:56 
/ssd1/poudriere/data/logs/bulk/10-stable-amd64-default/2015-10-22_22h30m08s
root@terpsichore>

htop now shows no significant activity for the specified 3 builders:

root@terpsichore> ps xa | grep poud
72482  -  Is       0:00.01 /bin/sh /root/poudriere/run-poudriere-bulk
73202  -  S        0:04.24 sh -e /usr/local/share/poudriere/bulk.sh -f 
/root/poudriere/ports -j 10-stable-amd64
73347  -  S        1:55.38 sh -e /usr/local/share/poudriere/bulk.sh -f 
/root/poudriere/ports -j 10-stable-amd64
73352  -  I        0:00.08 sh -e /usr/local/share/poudriere/bulk.sh -f 
/root/poudriere/ports -j 10-stable-amd64
  6119  1  S+       0:00.00 grep poud
root@terpsichore>

If I reboot, so that the tmp zfs filesystems are unmounted, and
manually rerun the exact same script as the previous cron'd, hung
instance, poudriere has (so far) run to completion.

I'm not sure how to debug this, but in the interim, I'm very curious
how I can stop the hung bulk run, and either restart it, or clean up
the various mounted zfs filesystems and manually restart from the
beginning w/o rebooting.  Studying the man page, it's not clear at all
the Right Way to do this, so any pointers here would be appreciated.

I'm leaving the system untouched for now so that I can try out any
suggestions for cleanup and restart.

Thanks,
Russell



Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?562A6185.5000305>