From: ng0
To: Ludovic Courtès
Cc: 24496@debbugs.gnu.org
Subject: Re: bug#24496: offloading should fall back to local build after n tries
Date: Tue, 04 Oct 2016 17:08:58 +0000

Ludovic Courtès writes:

> Hello!
>
> ng0 skribis:
>
>> When I forgot that my build machine was offline and did not pass
>> --no-build-hook, the offloading kept trying forever until I had to
>> cancel the build, boot the build machine, and start the build again.
>>
>> A solution could be a config option or default behavior which, after
>> failing to offload n times, gives up and uses the local builder.
>>
>> Is this desired at all? Setups like hydra could get problems, but for
>> small setups with the same architecture there could be a solution beyond
>> --no-build-hook?
>
> Like you say, on a Hydra-style setup this could be a problem: the
> front-end machine may have --max-jobs=0, meaning that it cannot perform
> builds on its own.
>
> So I guess we would need a command-line option to select a different
> behavior. I’m not sure how to do that because ‘guix offload’ is
> “hidden” behind ‘guix-daemon’, so there’s no obvious place for such an
> option.

Could the daemon run with --enable-hydra-style or --disable-hydra-style,
where --disable-hydra-style would allow falling back to a local build if
the machine did not reply within a defined time (keeping slow
connections in mind)?

> In the meantime, you could also hack up your machines.scm: it would
> return a list where unreachable machines have been filtered out.

How can I achieve this?

And to append to this bug: it seems to me that offloading requires one
lsh key for each build machine
(https://lists.gnu.org/archive/html/help-guix/2016-10/msg00007.html),
and that you cannot address the machines directly. Say I want to create
a system where I build on machine 1 AND machine 2. Having two x86_64
machines in machines.scm only selects one of them (if two were working,
see linked thread) and builds on the one that is accessible first. If,
however, the first machine is somehow blocked and fails, and therefore
terminates the lsh connection, the build does not happen at all.

Leaving the problems aside, what I want to do in short: how could I
build on both systems at the same time when I desire to do so?

> Ludo’.
> --
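
The behavior discussed above (filter out unreachable machines, give up
after n tries, fall back to a local build unless the front-end runs with
--max-jobs=0) can be sketched roughly as follows. This is plain Python,
not Guix code, and it is only an illustration of the logic. The names
ssh_reachable, choose_builder, max_tries and local_max_jobs are
hypothetical, and probing TCP port 22 is an assumed stand-in for
"the machine replies".

```python
import socket
from typing import Callable, List, Optional


def ssh_reachable(host: str, port: int = 22, timeout: float = 5.0) -> bool:
    """Return True if a TCP connection to host:port succeeds within timeout.

    Assumption: an open SSH port is a good enough sign that the build
    machine is up. The timeout leaves room for slow connections.
    """
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        return False


def choose_builder(
    machines: List[str],
    max_tries: int = 3,
    local_max_jobs: int = 1,
    probe: Callable[[str], bool] = ssh_reachable,
) -> Optional[str]:
    """Pick a reachable build machine, else "localhost", else None.

    All machines are probed up to max_tries times. If none answers and
    local builds are allowed (local_max_jobs > 0), fall back to a local
    build. With local_max_jobs == 0 (the Hydra-style front-end case)
    there is nothing to fall back to, so None is returned.
    """
    for _ in range(max_tries):
        # Keep only the machines that currently answer the probe.
        reachable = [m for m in machines if probe(m)]
        if reachable:
            return reachable[0]
    return "localhost" if local_max_jobs > 0 else None
```

With made-up host names, choose_builder(["build1.example.org",
"build2.example.org"]) would return the first host that accepts a
connection, or "localhost" once three rounds of probing have failed.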