After upgrade 24.03 to 24.11 reboot hangs at start @ 0xffffff....
-
@jimp
Here is my loader.conf, but I don't have a loader.conf.local. Also, I'll try it again and copy the console contents to this post...
/root: cat /boot/loader.conf
opensolaris_load="YES"
zfs_load="YES"
kern.cam.boot_delay=10000
kern.geom.label.disk_ident.enable="0"
kern.geom.label.gptid.enable="0"
kern.ipc.nmbclusters="1000000"
kern.ipc.nmbjumbo9="524288"
kern.ipc.nmbjumbop="524288"
loader_conf_files="/boot/loader.conf.lua"
boot_serial="YES"
comconsole_speed="115200"
console="efi"
autoboot_delay="3"
debug.ddb.capture.bufsize="524288"
hw.e6000sw.default_disabled=1
hw.hn.vf_transparent="0"
hw.hn.use_if_start="1"
net.link.ifqmaxlen="128"
machdep.hwpstate_pkg_ctrl="0"
net.pf.states_hashsize="262144"
/root: cat /boot/loader.conf.local
cat: /boot/loader.conf.local: No such file or directory
-
@stephenw10
I have had the unit for about a year and have only done one upgrade to 24.03
-
@jimp
I re-ran the upgrade to get the console printout you wanted, and this time it completed successfully... I didn't do anything except uninstall the Acme package, because it's been crashing due to a PHP error, which I will list below in case you have any insights. And yes, I will create a new ticket in another channel for this issue.
16:16:02 PHP ERROR: Type: 1, File: /usr/local/www/acme/acme_accountkeys.php, Line: 158, Message: Uncaught TypeError: Cannot access offset of type string on string in /usr/local/www/acme/acme_accountkeys.php:158
Stack trace:
#0 {main}
thrown
-
Are you seeing that error in 24.11 if you reinstall the pkg? Or just in 24.03?
-
@stephenw10
Yes, and now I don't see the Acme menu option under Services even when I do a reinstall. The only way to get to it is using the URL. Once I'm at the acme.xml page I select account keys, and that's when it crashes.
-
Ok opened a bug to track that: https://redmine.pfsense.org/issues/15892
-
Anyone hitting this console issue, please try booting verbose. Escape the loader menu to the prompt (OK>) and enter:
boot -v
-
@jimp said in After upgrade 24.03 to 24.11 reboot hangs at start @ 0xffffff....:
@freph533 said in After upgrade 24.03 to 24.11 reboot hangs at start @ 0xffffff....:
Same issue observed just now on a VM that's had 0 issues with 24.03. Hard reboot on the VM booted successfully back into 24.03.
What type of VM? If it's Proxmox and running an EFI BIOS, it may be something we've hit before (adding a serial port to the VM hardware works around it) but that's different than the other problems reported in this thread.
Yes, it's in Proxmox on EFI BIOS. There's no serial port on the VM hardware so that might be part of the cause in my case, though given the feedback from others it seems like it may be something else.
@stephenw10 Given my setup with the Proxmox VM is it still useful for me to provide output on an attempted upgrade (and if so, with or without adding a serial port to the VM devices? FWIW I've never had issues through several upgrades without a serial device on this VM)?
-
Yes, if you can try booting verbose after the upgrade.
Also you could try adding a serial port to confirm if that has any effect since that might be a useful clue.
-
@stephenw10 said in After upgrade 24.03 to 24.11 reboot hangs at start @ 0xffffff....:
Yes, if you can try booting verbose after the upgrade.
Also you could try adding a serial port to confirm if that has any effect since that might be a useful clue.
Adding the serial device let me upgrade successfully. Upon removing the serial device and attempting multiple reboots it hangs at the same spot with the error. Adding the serial device back lets the VM boot again. So it seems like 24.11 has a hard dependency on a serial port existing whereas 24.03 does not.
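For anyone who prefers the Proxmox host shell over the web UI, the workaround above can be sketched as follows. This is only a sketch: add_serial_port is a hypothetical helper name, the VM ID 100 in the usage note is a placeholder, and it assumes a socket-backed serial port is acceptable for your VM.

```shell
# Hypothetical helper: add a socket-backed serial port (serial0) to a Proxmox VM.
# Set QM=echo to preview the command instead of running it.
add_serial_port() {
    vmid=$1
    ${QM:-qm} set "$vmid" -serial0 socket
}
```

Running QM=echo add_serial_port 100 only prints the qm arguments, which is a cheap way to check them before changing the VM. The VM most likely needs a full stop and start (not just a reboot from inside the guest) before the new device shows up.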
-
@freph533 said in After upgrade 24.03 to 24.11 reboot hangs at start @ 0xffffff....:
So it seems like 24.11 has a hard dependency on a serial port existing whereas 24.03 does not.
It's not new in 24.11, it just seems to hit some VMs differently than others, and we haven't nailed down why yet:
-
Happening to me with a 24.03 to 24.11 upgrade on a Netgate 4200 MAX
Verbose boot logs, which unfortunately don't offer anything new:
OK boot -v
Loading kernel...
/boot/kernel/kernel text=0x1a4c98 text=0xff3048 text=0x17ed568 data=0x180+0xe80 data=0x24c808+0x3b37f8 0x8+0x1d4108+0x8+0x1e9a19
Loading configured modules...
/boot/kernel/opensolaris.ko size 0x1e2a8 at 0x37be000
/boot/kernel/zfs.ko size 0x619a40 at 0x37dd000
can't find '/boot/entropy'
can't find '/etc/hostid'
staging 0x3fa00000-0x43f40000 (not copying) tramp 0x43f40000 PT4 0x43f41000
Start @ 0xffffffff803a5000 ...
/
-
I opened a ticket with TAC support and they're telling me to "just reinstall the software". No effort to try and figure out what's wrong.
I don't doubt a reinstall will help, but what worries me is if it doesn't I'll have to then reinstall to 24.03 and that's downtime I can't afford when I use this device for my work.
I'll have to sit on 24.03 until the issue is identified.
-
@jackyaz said in After upgrade 24.03 to 24.11 reboot hangs at start @ 0xffffff....:
Netgate 4200 MAX
I also experience the problem with a Netgate 4200 MAX (vice 6100 MAX I reported earlier).
-
This thread indicates two types of devices experience problems:
- pfSense on VM
- 4200 MAX
The VM folks got a solution/recommendation of attaching a serial port to the VM. Is there a recommendation for the 4200 MAX other than trying again? I've tried a couple times already, both resulting in failure.
-
Still unable to replicate this locally.
Is there anything about those devices that might be unique? Perhaps you made a BIOS setting change? Have something unusual connected to it?
-
@stephenw10 No BIOS changes on my side, never even been into the BIOS!
I do have a USB cable plugged into the USB A port on the side, but the other end is disconnected - I'll occasionally need to use my phone as a USB tethering modem when my fibre drops out. I can try unplugging it to see if that makes a difference? I'd be surprised if it does, considering it's not connected to anything!
-
Mmm, me too. It certainly shouldn't make any difference. Might be worth trying to rule it out though.
-
@stephenw10
My 4200 MAX is off the shelf with no special (e.g. BIOS) changes and nothing unusual (e.g. USB) plugged in.
-
@GripePotato said in After upgrade 24.03 to 24.11 reboot hangs at start @ 0xffffff....:
This thread indicates two types of devices experience problems:
- pfSense on VM
- 4200 MAX
The VM folks got a solution/recommendation of attaching a serial port to the VM. Is there a recommendation for the 4200 MAX other than trying again? I've tried a couple times already, both resulting in failure.
The VM issue was quite a bit different and unrelated to what's happening on the 4200, which still isn't clear.
As for the 4200, nobody has yet been able to capture the whole boot process on a system affected by the problem (See my request in https://forum.netgate.com/post/1197950 ) -- the other person who tried ended up having a successful upgrade on their next attempt.
So if you could, try the upgrade again and capture the whole next boot output from the serial console and attach it here (or send it via TAC).
It may also be worth checking to make sure you have /boot/loader.conf.lua present with appropriate content. Also make sure you have the pfSense, pfSense-plus, and pfSense-boot OS packages installed before upgrading. Check the output of:
pkg info -x pfSense
to see if they are in the list.
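The two checks above could be scripted roughly like this before starting the upgrade. Treat it as a sketch under stated assumptions: check_file and check_pkgs are hypothetical helper names, and the match assumes each package appears in the list as the name followed by a hyphen and a numeric version. It only tests that the file is non-empty, not that its content is appropriate.

```shell
#!/bin/sh
# Pre-upgrade sanity check (sketch). check_file and check_pkgs are
# hypothetical helpers, not part of pfSense.

# Succeeds only if the given file exists and is non-empty.
check_file() {
    if [ -s "$1" ]; then
        echo "ok: $1"
    else
        echo "missing or empty: $1"
        return 1
    fi
}

# Reads a package list on stdin and reports which expected packages appear.
# Assumption: entries look like "<name>-<numeric version>".
check_pkgs() {
    list=$(cat)
    rc=0
    for name in pfSense pfSense-plus pfSense-boot; do
        if printf '%s\n' "$list" | grep -Eq "^${name}-[0-9]"; then
            echo "ok: $name"
        else
            echo "not installed: $name"
            rc=1
        fi
    done
    return $rc
}

# Only touch the real system when invoked with "run" as the first argument.
if [ "${1:-}" = "run" ]; then
    check_file /boot/loader.conf.lua
    pkg info -x pfSense | check_pkgs
fi
```

Saved to a file and run as sh followed by the file name and the argument run from a shell on the firewall, it prints one line per check. Any "missing" or "not installed" line is worth sorting out before retrying the upgrade.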