James Ray
2007-Sep-07 09:06 UTC
[Xen-users] Xen failing to boot under Debian Etch. (FATAL TRAP)
Hi,

I am running Xen on a Debian Etch platform, and roughly 1 in 10 boots fails with the following error:

  (XEN) ************************************
  (XEN) CPU0 FATAL TRAP 6 (invalid opcode), ERROR_CODE 0000, IN INTERRUPT CONTEXT.
  (XEN) System shutting down -- need manual reset.
  (XEN) ************************************

(I have attached a file with the full output of both an OK boot and a failed one.)

This is a dual-CPU, quad-core system currently running with 4 GB of RAM on an Intel S5000PAL server motherboard. I have tried setting the APIC option to bigsmp, which did not help at all. I have also upgraded the BIOS to the latest revision, to no avail. I am at a loss right now as to how to progress.

Here are the Xen package versions from my system:

  # dpkg -l | egrep '(\||xen)'
  | Status=Not/Installed/Config-files/Unpacked/Failed-config/Half-installed
  |/ Err?=(none)/Hold/Reinst-required/X=both-problems (Status,Err: uppercase=bad)
  ||/ Name                                 Version                Description
  ii  linux-image-2.6-xen-amd64            2.6.18+6etch2          Linux kernel 2.6 image on AMD64
  ii  linux-image-2.6.18-4-xen-amd64       2.6.18.dfsg.1-12etch2  Linux 2.6.18 image on AMD64
  ii  linux-image-2.6.18-5-xen-amd64       2.6.18.dfsg.1-13etch2  Linux 2.6.18 image on AMD64
  ii  linux-modules-2.6.18-4-xen-amd64     2.6.18.dfsg.1-12etch2  Linux 2.6.18 modules on AMD64
  ii  linux-modules-2.6.18-5-xen-amd64     2.6.18.dfsg.1-13etch2  Linux 2.6.18 modules on AMD64
  ii  xen-hypervisor-3.0.3-1-amd64         3.0.3-0-2              The Xen Hypervisor on AMD64
  ii  xen-ioemu-3.0.3-1                    3.0.3-0-2              XEN administrative tools
  ii  xen-linux-system-2.6.18-4-xen-amd64  2.6.18.dfsg.1-12etch2  XEN system with Linux 2.6.18 image on AMD64
  ii  xen-tools                            2.8-2                  Tools to manage debian XEN virtual servers
  ii  xen-utils-3.0.3-1                    3.0.3-0-2              XEN administrative tools
  ii  xen-utils-common                     3.0.3-0-2              XEN administrative tools - common files

And here is my grub boot section:

  title   Xen 3.0.3-1-amd64 / Debian GNU/Linux, kernel 2.6.18-5-xen-amd64
  root    (hd0,0)
  kernel  /xen-3.0.3-1-amd64.gz com2=19200,8n1 console=com2
  module  /vmlinuz-2.6.18-5-xen-amd64 root=/dev/mapper/hornet-root ro console=ttyS1,19200 console=tty0
  module  /initrd.img-2.6.18-5-xen-amd64

If I can help with any further information, please let me know. Does anyone have any ideas on how to fix this problem?

(I will start running the Intel Platform Test Suite to check that it's not the hardware, but I doubt it is, since this happens in the same way on two identical boxes.)

--
James Ray. <j.ray@qmul.ac.uk>
Computing Services
Queen Mary, University of London

_______________________________________________
Xen-users mailing list
Xen-users@lists.xensource.com
http://lists.xensource.com/xen-users
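Since the failure is intermittent ("about every 1 in 10 boots"), it may help to capture the serial console output of many boots and count how often the trap appears before and after each change. The following is only a minimal sketch, assuming each boot's console output has been saved as a separate text string; the function names find_fatal_trap and failure_rate are hypothetical and not part of any Xen tool.

```python
import re

# Matches the hypervisor's fatal-trap banner line as quoted in the report, e.g.
# "(XEN) CPU0 FATAL TRAP 6 (invalid opcode), ERROR_CODE 0000, IN INTERRUPT CONTEXT."
FATAL_TRAP = re.compile(
    r"\(XEN\)\s+CPU(?P<cpu>\d+)\s+FATAL TRAP\s+(?P<trap>\d+)\s+\((?P<name>[^)]*)\)"
)


def find_fatal_trap(log_text):
    """Return (cpu, trap_number, trap_name) for the first fatal trap, or None."""
    match = FATAL_TRAP.search(log_text)
    if match is None:
        return None
    return int(match.group("cpu")), int(match.group("trap")), match.group("name")


def failure_rate(boot_logs):
    """Fraction of captured boot logs (hypothetical input) that contain a fatal trap."""
    if not boot_logs:
        return 0.0
    failed = sum(1 for log in boot_logs if find_fatal_trap(log) is not None)
    return failed / len(boot_logs)
```

With a few dozen captured boots, comparing failure_rate before and after a BIOS or hypervisor change gives a rough indication of whether the change made any difference, which is hard to judge from a handful of reboots.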
Kilburn, Preston
2007-Sep-07 14:24 UTC
RE: [Xen-users] Xen failing to boot under Debian Etch. (FATAL TRAP)
I'm not sure if this is helpful or not, but we are running basically the same setup:

  2x Quadcores
  16 Gig RAM
  Intel S5000PSL (S instead of A)

It could be that you have an older rev of the hardware that isn't able to upgrade properly. Not sure if you've checked it out, but there is a matrix on Intel's website:

http://www.intel.com/support/motherboards/server/sb/CS-023585.htm

-PK
James Ray
2007-Sep-07 15:18 UTC
Re: [Xen-users] Xen failing to boot under Debian Etch. (FATAL TRAP)
Kilburn, Preston wrote:
> I'm not sure if this is helpful or not, but we are running basically the
> same setup:
> 2x Quadcores
> 16 Gig RAM
> Intel S5000PSL (S instead of A)
>
> It could be that you have an older rev of the hardware that isn't able
> to upgrade properly. Not sure if you've checked it out - but there is a
> matrix on Intel's website:
> http://www.intel.com/support/motherboards/server/sb/CS-023585.htm
>

Yeah, I have just checked this against my machine and the PBA version is D13607-804 (higher than the required one), so it should be fine.

I have just upgraded the hypervisor to 3.1.1 (backported from Debian lenny). The problem still exists, but in a slightly different form. The full complement of processors still does not come up on every boot, but it no longer bails out in such a horrible way:

  (XEN) Booting processor 7/7 eip 90000
  (XEN) Not responding.
  (XEN) Inquiring remote APIC #7...
  (XEN) ... APIC #7 ID: failed
  (XEN) ... APIC #7 VERSION: failed
  (XEN) ... APIC #7 SPIV: failed
  (XEN) Unmapping cpu 7 from all nodes
  (XEN) CPU #7 not responding - cannot use it.
  (XEN) Total of 7 processors activated.

(There are meant to be 8, of course.) It then carries on to boot.

I wonder if other people are experiencing this problem and just not realizing it? Anyone have any bright ideas? :)

--
James Ray. <j.ray@qmul.ac.uk>
Computing Services
Queen Mary, University of London
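Because the 3.1.1 hypervisor carries on booting with a processor missing, the problem can easily go unnoticed. One way to catch it is to scan the hypervisor boot messages for the two lines quoted above. This is a rough sketch, assuming the boot messages are available as text and that the message wording matches the excerpt in this thread; check_cpus and its return layout are hypothetical.

```python
import re

# Patterns taken from the log excerpt quoted in the message above.
ACTIVATED = re.compile(r"\(XEN\)\s+Total of (\d+) processors activated")
NOT_RESPONDING = re.compile(r"\(XEN\)\s+CPU #(\d+) not responding")


def check_cpus(log_text, expected):
    """Compare the activated processor count in a Xen boot log with `expected`."""
    activated_match = ACTIVATED.search(log_text)
    activated = int(activated_match.group(1)) if activated_match else None
    # IDs of processors the hypervisor gave up on during bring-up.
    missing = [int(cpu) for cpu in NOT_RESPONDING.findall(log_text)]
    return {"activated": activated, "missing": missing, "ok": activated == expected}
```

Running this against each boot's messages with expected=8 would flag the boots where only 7 processors came up, and record which processor ID failed.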
David Morris
2007-Sep-07 18:48 UTC
RE: [Xen-users] Xen failing to boot under Debian Etch. (FATAL TRAP)
Perhaps I'm confused by obtuse naming, but it looks to me as though you are running AMD 64-bit kernels and hypervisors on an Intel system. Is this an AMD vs. Intel compatibility issue?
James Ray
2007-Sep-07 20:12 UTC
Re: [Xen-users] Xen failing to boot under Debian Etch. (FATAL TRAP)
David Morris wrote:
> Perhaps I'm confused by obtuse naming, but it looks to me that you are
> running AMD 64bit kernels and hypervisors on an Intel system. Is this a
> AMD vs Intel compatibility issue?

Yes, I think it is Debian's confusing naming getting in the way of your understanding. The AMD64 kernel package says the following:

  # aptitude show linux-image-2.6-amd64
  Package: linux-image-2.6-amd64
  Description: Linux kernel 2.6 image on AMD64
   This package depends on the latest binary image for Linux kernel 2.6 on
   all 64bit single- and multiprocessor AMD and Intel machines.

Confusingly, it supports both AMD64 and EM64T, so I don't think this is the issue here.

--
James Ray
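The point above can also be checked on the machine itself: on Linux, an x86 processor that can run an amd64 kernel lists the lm (long mode) flag in /proc/cpuinfo, whether the vendor is AMD or Intel. The following is a small sketch, assuming the contents of /proc/cpuinfo are passed in as text; the helper name supports_amd64 is hypothetical.

```python
def supports_amd64(cpuinfo_text):
    """True if any 'flags' line in /proc/cpuinfo-style text lists the 'lm' flag."""
    for line in cpuinfo_text.splitlines():
        key, sep, value = line.partition(":")
        # Split on whitespace so that 'lm' only matches as a whole flag name.
        if sep and key.strip() == "flags" and "lm" in value.split():
            return True
    return False
```

On a live system this could be called as supports_amd64(open("/proc/cpuinfo").read()). A True result on the Intel boxes in this thread would be consistent with the package description quoted above.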