installation of 0.8 stops at guile prompt.

  • Done
  • quality assurance status badge
Details
2 participants
  • Ludovic Courtès
  • th3kent
Owner
unassigned
Submitted by
th3kent
Severity
normal
T
T
th3kent wrote on 1 Dec 2014 23:01
(address . bug-guix@gnu.org)
877fybni67.fsf@gnulx000.localhost.localdomian
hello guix,

as requested by civodul on #guix, here is a bug report.

hardware: 8-year-old hp nx6125 with 512mb memory and 60gb h.d.d

first attempt at installation failed with the following:

Toggle snippet (9 lines)
unionfs invoked oom-killer: gfp_mask=0x...
unionfs cpuset=/ mems_allowed=0
[...]
Out of memory: Kill process 2068 (guix) score 526 or sacrifice child
Killed process 2068 (guix) total-vm:829080kB, anon-rss:201792kB, file-rss:0kB
guix system: error: build failed: unexpected end-of-file
2470 operations

i went past this by enabling a swap device.

second attempt at installation failed with "no permissions" when copying
pata_* kernel modules to /gnu/store/*-linux-libre-3.17.2/.

after adding write permissions for user on directory
/gnu/store/*-linux-libre-3.17.2/, the installation completed
successfully (no errors).

on restarting the laptop and booting from the h.d.d, the reboot stops
at the guile-user prompt. the backtrace shows the following:

Toggle snippet (6 lines)
In /gnu/store/*-module-import/gnu/build/linux-boot.scm:
385:7 2 (#<procedure 1e943c0 at /gnu/store/...
In srfi/srfi-1.scm:
619:17 1 (for-each #<procedure load-linux-module* (file)> ...

regards,
... kennedy tembo.
--
former club president
midrand toastmasters
/"\ . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
\ / "[He has] campaigned steadfastly for freedom.
X I admire Nelson Mandela ..."
/ \ - Richard M. Stallman (rms), President, Free Software Foundation.
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
L
L
Ludovic Courtès wrote on 3 Dec 2014 23:18
(name . th3kent)(address . tembokm@mweb.co.za)(address . 19248@debbugs.gnu.org)
87oark2x91.fsf@gnu.org
th3kent <tembokm@mweb.co.za> skribis:

Toggle quote (12 lines)
> hardware: 8-year-old hp nx6125 with 512mb memory and 60gb h.d.d
>
> first attempt at installation failed with the following:
>
> unionfs invoked oom-killer: gfp_mask=0x...
> unionfs cpuset=/ mems_allowed=0
> [...]
> Out of memory: Kill process 2068 (guix) score 526 or sacrifice child
> Killed process 2068 (guix) total-vm:829080kB, anon-rss:201792kB, file-rss:0kB
> guix system: error: build failed: unexpected end-of-file
> 2470 operations

This is a sign that too much data was written to the unionfs that sits
on the root file system.

Could it be because you forgot to run ‘deco start cow-store /mnt’?

Toggle quote (3 lines)
> second attempt at installation failed with "no permissions" when copying
> pata_* kernel modules to /gnu/store/*-linux-libre-3.17.2/.

That is weird. Do you remember at what point that happened? Was it at
the end of the ‘guix system init config.scm /mnt’ run?

Files and directories under /gnu/store are meant to be immutable.

Toggle quote (8 lines)
> on restarting the laptop and booting from the h.d.d, the reboot stops
> at the guile-user prompt. the backtrace shows the following:
>
> In /gnu/store/*-module-import/gnu/build/linux-boot.scm:
> 385:7 2 (#<procedure 1e943c0 at /gnu/store/...
> In srfi/srfi-1.scm:
> 619:17 1 (for-each #<procedure load-linux-module* (file)> ...

Does it show an actual error below that?

Could you post your OS config?

Thanks for the report!

Ludo’.
T
T
th3kent wrote on 5 Dec 2014 19:17
(name . Ludovic Courtès)(address . ludo@gnu.org)(address . 19248@debbugs.gnu.org)
871toeufk7.fsf@gnulx000.localhost.localdomian
ludo@gnu.org (Ludovic Courtès) writes:
Toggle quote (12 lines)
> th3kent <tembokm@mweb.co.za> skribis:
>> [...]
>> Out of memory: Kill process 2068 (guix) score 526 or sacrifice child
>> Killed process 2068 (guix) total-vm:829080kB, anon-rss:201792kB, file-rss:0kB
>> guix system: error: build failed: unexpected end-of-file
>> 2470 operations
>
> This is a sign that too much data was written to the unionfs that sits
> on the root file system.
>
> Could it be because you forgot to run ‘deco start cow-store /mnt’?

i definitely ran ‘deco start cow-store /mnt’ after i created my o.s
config.

Toggle quote (8 lines)
>> second attempt at installation failed with "no permissions" when copying
>> pata_* kernel modules to /gnu/store/*-linux-libre-3.17.2/.
>
> That is weird. Do you remember at what point that happened? Was it at
> the end of the ‘guix system init config.scm /mnt’ run?
>
> Files and directories under /gnu/store are meant to be immutable.

yes, the "no permission" error was after running a second 'guix system
init ...' command.

Toggle quote (12 lines)
>> on restarting the laptop and booting from the h.d.d, the reboot stops
>> at the guile-user prompt. the backtrace shows the following:
>>
>> In /gnu/store/*-module-import/gnu/build/linux-boot.scm:
>> 385:7 2 (#<procedure 1e943c0 at /gnu/store/...
>> In srfi/srfi-1.scm:
>> 619:17 1 (for-each #<procedure load-linux-module* (file)> ...
>
> Does it show an actual error below that?
>
> Could you post your OS config?

the last line of the backtrace starts with "In Unkown ...".

here is my o.s config:

Toggle snippet (31 lines)
(use-modules (gnu)
(srfi srfi-26))

(operating-system
(host-name "gnusys01")
(timezone "Africa/Johannesburg")
(locale "en_US.UTF-8")

(initrd (lambda (file-systems . rest)
(apply base-initrd file-systems
#:extra-modules '("pata_acpi.ko" "pata_atiixp.ko")
rest)))

(bootloader (grub-configuration (device "/dev/sda")))
(file-systems (cons (file-system
(device "root")
(title 'label)
(mount-point "/")
(type "ext4"))
%base-file-systems))

(users (list (user-account
(name "kent")
(comment "the wizard of gnu")
(group "users")

(supplementary-groups '("wheel"
"audio" "video"))
(home-directory "/home/kent")))))

regards,
... kennedy tembo.
--
former club president
midrand toastmasters
/"\ . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
\ / "[He has] campaigned steadfastly for freedom.
X I admire Nelson Mandela ..."
/ \ - Richard M. Stallman (rms), President, Free Software Foundation.
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
L
L
Ludovic Courtès wrote on 7 Dec 2014 23:17
(name . th3kent)(address . tembokm@mweb.co.za)(address . 19248@debbugs.gnu.org)
87388rksu8.fsf@gnu.org
th3kent <tembokm@mweb.co.za> skribis:

Toggle quote (3 lines)
> ludo@gnu.org (Ludovic Courtès) writes:
>> th3kent <tembokm@mweb.co.za> skribis:

[...]

Toggle quote (12 lines)
>>> on restarting the laptop and booting from the h.d.d, the reboot stops
>>> at the guile-user prompt. the backtrace shows the following:
>>>
>>> In /gnu/store/*-module-import/gnu/build/linux-boot.scm:
>>> 385:7 2 (#<procedure 1e943c0 at /gnu/store/...
>>> In srfi/srfi-1.scm:
>>> 619:17 1 (for-each #<procedure load-linux-module* (file)> ...
>>
>> Does it show an actual error below that?
>>
>> Could you post your OS config?

[...]

Toggle quote (5 lines)
> (initrd (lambda (file-systems . rest)
> (apply base-initrd file-systems
> #:extra-modules '("pata_acpi.ko" "pata_atiixp.ko")
> rest)))

This may be the culprit. First, it’s not needed, because 0.8 already
included these two modules by default. Second, 0.8 would fail when
trying to load the same module twice, which is exactly what would happen
here (this is fixed in ‘master’, around commit 0e704a2.)

Could you try to remove these ‘initrd’ lines and try again?

Thanks in advance,
Ludo’.
K
K
kennedy wrote on 12 Dec 2014 21:06
(name . Ludovic Courtès)(address . ludo@gnu.org)(address . 19248@debbugs.gnu.org)
87d27otyxq.fsf@gnulx000.localhost.localdomian
ludo@gnu.org (Ludovic Courtès) writes:
Toggle quote (16 lines)
> th3kent <tembokm@mweb.co.za> skribis:
>> ludo@gnu.org (Ludovic Courtès) writes:
>>> [...]
>>> Could you post your OS config?
>> (initrd (lambda (file-systems . rest)
>> (apply base-initrd file-systems
>> #:extra-modules '("pata_acpi.ko" "pata_atiixp.ko")
>> rest)))
>
> This may be the culprit. First, it’s not needed, because 0.8 already
> included these two modules by default. Second, 0.8 would fail when
> trying to load the same module twice, which is exactly what would happen
> here (this is fixed in ‘master’, around commit 0e704a2.)
>
> Could you try to remove these ‘initrd’ lines and try again?

i deleted the lines for "initrd" and the installation completed
successfully on first attempt.

however, the reboot after installation results in a kernel panic. the
messages just before the panic are the following,

Toggle snippet (6 lines)
groupadd: cannot lock /etc/group; try again later
usermod: no changes
adding user 'kent'
ERROR: In procedure getpw: entry not found

regards,
... kennedy tembo.
--
former club president
midrand toastmasters
/"\ . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
\ / "[He has] campaigned steadfastly for freedom.
X I admire Nelson Mandela ..."
/ \ - Richard M. Stallman (rms), President, Free Software Foundation.
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
L
L
Ludovic Courtès wrote on 12 Dec 2014 23:01
(name . kennedy)(address . tembokm@mweb.co.za)(address . 19248@debbugs.gnu.org)
8761dgsf2d.fsf@inria.fr
kennedy <tembokm@mweb.co.za> skribis:

Toggle quote (20 lines)
> ludo@gnu.org (Ludovic Courtès) writes:
>> th3kent <tembokm@mweb.co.za> skribis:
>>> ludo@gnu.org (Ludovic Courtès) writes:
>>>> [...]
>>>> Could you post your OS config?
>>> (initrd (lambda (file-systems . rest)
>>> (apply base-initrd file-systems
>>> #:extra-modules '("pata_acpi.ko" "pata_atiixp.ko")
>>> rest)))
>>
>> This may be the culprit. First, it’s not needed, because 0.8 already
>> included these two modules by default. Second, 0.8 would fail when
>> trying to load the same module twice, which is exactly what would happen
>> here (this is fixed in ‘master’, around commit 0e704a2.)
>>
>> Could you try to remove these ‘initrd’ lines and try again?
>
> i deleted the lines for "initrd" and the installation completed
> successfully on first attempt.

So you booted on the USB disk and re-run ‘guix system init’, is that
correct?

Did you reformat the root partition before doing that?

Toggle quote (5 lines)
> however, the reboot after installation results in a kernel panic. the
> messages just before the panic are the following,
>
> groupadd: cannot lock /etc/group; try again later

Could you try removing the /etc/.pwd.lock file on that partition? (You
can do that on a system booted from the USB stick, and then reboot in
the installed system.)

Thanks,
Ludo’.
T
T
th3kent wrote on 13 Dec 2014 20:19
(name . Ludovic Courtès)(address . ludo@gnu.org)(address . 19248@debbugs.gnu.org)
87oar7s6gr.fsf@gnulx000.localhost.localdomian
ludo@gnu.org (Ludovic Courtès) writes:
Toggle quote (9 lines)
> kennedy <tembokm@mweb.co.za> skribis:
>> i deleted the lines for "initrd" and the installation completed
>> successfully on first attempt.
>
> So you booted on the USB disk and re-run ‘guix system init’, is that
> correct?
>
> Did you reformat the root partition before doing that?

yes, i booted off my u.s.b disk, reformatted my root partition, and
re-installed.

Toggle quote (9 lines)
>> however, the reboot after installation results in a kernel panic. the
>> messages just before the panic are the following,
>>
>> groupadd: cannot lock /etc/group; try again later
>
> Could you try removing the /etc/.pwd.lock file on that partition? (You
> can do that on a system booted from the USB stick, and then reboot in
> the installed system.)

i removed /etc/.pwd.lock, as well as /etc/group.lock, and the reboot
tries to enter the guile prompt ("Entering new prompt ...") and i get a
kernel panic.

just f.y.i, i can't use shift-page.up to scrollback through the boot
messages.

regards,
... kennedy tembo.
--
former club president
midrand toastmasters
/"\ . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
\ / "[He has] campaigned steadfastly for freedom.
X I admire Nelson Mandela ..."
/ \ - Richard M. Stallman (rms), President, Free Software Foundation.
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
L
L
Ludovic Courtès wrote on 13 Dec 2014 21:15
(name . th3kent)(address . tembokm@mweb.co.za)(address . 19248@debbugs.gnu.org)
871to3joh4.fsf@gnu.org
th3kent <tembokm@mweb.co.za> skribis:

Toggle quote (26 lines)
> ludo@gnu.org (Ludovic Courtès) writes:
>> kennedy <tembokm@mweb.co.za> skribis:
>>> i deleted the lines for "initrd" and the installation completed
>>> successfully on first attempt.
>>
>> So you booted on the USB disk and re-run ‘guix system init’, is that
>> correct?
>>
>> Did you reformat the root partition before doing that?
>
> yes, i booted off my u.s.b disk, reformatted my root partition, and
> re-installed.
>
>>> however, the reboot after installation results in a kernel panic. the
>>> messages just before the panic are the following,
>>>
>>> groupadd: cannot lock /etc/group; try again later
>>
>> Could you try removing the /etc/.pwd.lock file on that partition? (You
>> can do that on a system booted from the USB stick, and then reboot in
>> the installed system.)
>
> i removed /etc/.pwd.lock, as well as /etc/group.lock, and the reboot
> tries to enter the guile prompt ("Entering new prompt ...") and i get a
> kernel panic.

But do you actually get the Guile prompt?

If you did and were brave enough, we could coordinate on IRC to try and
investigate, for instance to check whether / is writable, things like
that.

Toggle quote (3 lines)
> just f.y.i, i can't use shift-page.up to scrollback through the boot
> messages.

I’m running out of ideas, sorry.

If that is an option for you, could you try starting anew? That is,
boot on the USB image, erase the target partition with “mkfs.ext4
/dev/xyzN”, and proceed with installation. We must have missed
something, but I don’t see what.

Thanks,
Ludo’.
L
L
Ludovic Courtès wrote on 8 Feb 2015 19:18
(name . th3kent)(address . tembokm@mweb.co.za)
87oap4p8q5.fsf@gnu.org
tags 19248 moreinfo unreproducible
close 19248
thanks

Hello,

I’m closing this bug for now. I would be grateful if you could try
installing 0.8.1, released two weeks ago, and let us know if you can
reproduce this problem.

Thanks in advance,
Ludo’.
?