All,

I seem to be experiencing the same problem that Michal Urbanski described in an email dated March 24th with the subject "hanging dell hardware". Looking through the archives, I cannot find any resolution to the problem. The specs on my machine are:

Dell PowerEdge 1850
Dual Xeon 2.8 GHz
1 GB RAM
PERC 4e/SI RAID controller in a mirror configuration with two 73 GB drives (uses the megaraid_mm/megaraid_mbox modules)

Any time the system is placed under high I/O (for example, extracting a bzip2-compressed tarball to create the filesystem for a xenU partition), the machine hangs. There are no kernel messages. The console becomes completely unresponsive (pressing Caps Lock does not change the state of the Caps Lock light), and I can't use the magic SysRq key. I've tried adding "watchdog" to the end of the grub line that loads xen.gz; that didn't change anything. I've also tried disabling hyperthreading, with no success. I'm using Xen 2.0.5 with a 2.6.10 xen0 kernel. My RAID controller BIOS is already at version 516a (the newest I can find on the Dell site). Does anyone have any more suggestions for me?

Thanks,
-Rob

_______________________________________________
Xen-users mailing list
Xen-users@lists.xensource.com
http://lists.xensource.com/xen-users
On Sun, May 08, 2005 at 10:15:45AM -0400, Rob See wrote:
> Any time the system is placed under high IO ( for example untar/bz2ing a
> tarball to create the fs for a xenU partition) the machine hangs.

Is this problem really Xen-related? Can you do the same actions booted from a "native" kernel without a crash? It looks like a memory or heating problem...

Mzzlz,
Arie Kraai
All,

Yes, it only happens with the xen0 kernel booted. I did the install from a native kernel (I'm using Gentoo 2005.0) and didn't have the same problems. The system is less than a month old and has passed the Dell diagnostics.

There are two tests I have run that are guaranteed to cause the problem:

1) running tar -jxPvf on the Gentoo stage3 tarball (the Gentoo base OS image)
2) running emerge (the Gentoo package manager) to install a new package

With the first test, the hang occurs around the same place in the file every time. I generally run these tests within the first 30 seconds after boot, but I've also tried letting the machine sit for an hour or two first. I have a 1 GB swap partition, and I've tried dom0_mem=192000 and 256000.

Thanks,
-Rob

Arie Kraai wrote:
> Is this problem really xen-related; can you do the same actions booted
> from a "native" kernel without a crash?
> It looks like a memory or heating problem...
> I seem to be experiencing the same problem that Michal
> Urbanski talked about in an email dated March 24th with a
> subject of "hanging dell hardware". Looking through the
> archives I can not find any resolution to the problem. The
> specs on my machine are
>
> Dell Poweredge 1850
> Dual Xeon 2.8s
> 1gig ram
> PERC 4e/SI raid controller in mirror configuration with 2 73
> gb drives (uses the megaraid_mm/megaraid_mbox modules)

Have other people had more success with this PERC 4e/SI card? I know that the basic 1850 works fine under Xen.

> Any time the system is placed under high IO ( for example
> untar/bz2ing a tarball to create the fs for a xenU partition)
> the machine hangs. There are no kernel messages. The console
> becomes completely unresponsive (pressing Caps lock does not
> change the status of the caps lock light) I can't use the
> magic sysreq key. I've tried adding "watchdog" to the end of
> the grub line that loads xen.gz. That didn't change anything.
> I've also tried disabling hyperthreading with no success. I'm
> using xen 2.0.5 with a 2.6.10 xen0 kernel. My raid controller
> bios is already at version 516a (the newest I can find on the
> dell site.) Does anyone have any more suggestions for me.

Get a serial line on the machine, and boot a debug=y build of Xen with 'watchdog com1=115200,8n1' on the Xen command line.

If you hit Ctrl-a three times and then 'h', you should get a help message. See if you still get this when the machine is in the wedged state.

Ian
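For reference, the suggestion above might translate into a GRUB (legacy) entry roughly like the following. This is only a sketch under stated assumptions: the title, the root device, the dom0 kernel file name and its root= argument are hypothetical and not taken from Rob's machine. Only xen.gz, the dom0_mem value, 'watchdog' and 'com1=115200,8n1' appear in this thread.

```
# Hypothetical GRUB legacy entry; adjust every path and device to the local system.
title Xen 2.0.5 (debug build, serial line on com1)
  # (hd0,0) is an assumed boot partition.
  root (hd0,0)
  # Xen command line: options quoted from the thread, appended after xen.gz.
  kernel /boot/xen.gz dom0_mem=256000 watchdog com1=115200,8n1
  # dom0 kernel: file name and root= device are assumptions.
  module /boot/vmlinuz-2.6.10-xen0 root=/dev/sda3 ro
```

The terminal program on the other end of the serial cable would need to match these settings (115200 baud, 8 data bits, no parity, 1 stop bit) for the Ctrl-a key sequence and the help output to come through.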
Ian,

Yes, I can hit Ctrl-a and press 'h', and I do see the help screen printed out when the machine is hung. What is the next step?

Thanks,
-Rob

Ian Pratt wrote:
> Get a serial line on the machine, and boot a debug=y build of Xen with
> 'watchdog com1=115200,8n1' on the Xen command line.
>
> If you hit ctrl-a 3 times, and then 'h' you should get a help message.
> See if you still get this when the machine is in the wedged state.
> Yes I can hit Ctrl-a, press h, and I do see the help
> screen printed out when the machine is hung. What is the next step ?

OK, that's interesting. It means Xen is still happy; it's just dom0 that's having problems. What do you see if you hit the 'q' key? Does the guest respond, or just Xen?

If the guest does respond, please look up the EIP. The other thing you could do would be to compile Xen with perf counters and see if the number of interrupts is still going up.

Also, please can you have a go using the latest unstable version of Xen? It's possible that the new ACPI code might make a difference.

Ian
Ian,

Ian Pratt wrote:
> OK, that's interesting. It means Xen is still happy, it's just the dom0
> that's having problems. What do you see if you hit the 'q' key? Does the
> guest respond, or just Xen?

Only Xen responds.

> If the guest does respond, please look up the EIP. The other thing you
> could do would be to compile Xen with perf counters and see if the
> number of interrupts is still going up.

Should I do this even if the guest doesn't respond?

> Also, please can you have a go using the latest unstable version of Xen.
> It's possible that the new ACPI code might make a difference.

Can I use my existing dom0 kernel (2.6.10), or should I compile one based on the patch in xen-unstable?

Thanks,
-Rob
On Sun, May 08, 2005 at 06:43:59PM +0200, Arie Kraai wrote:
> On Sun, May 08, 2005 at 10:15:45AM -0400, Rob See wrote:
> > Any time the system is placed under high IO ( for example untar/bz2ing a
> > tarball to create the fs for a xenU partition) the machine hangs.
>
> Is this problem really xen-related; can you do the same actions booted
> from a "native" kernel without a crash?
> It looks like a memory or heating problem...

I can state for certain that it is a Xen problem. I shelved the Xen work on our 2850 a few months ago, and the machine has been running non-stop since then on a non-Xen kernel.

Just a data point for you, FYI :)

-michal
> > If the guest does respond, please look up the EIP. The other thing you
> > could do would be to compile Xen with perf counters and see if the
> > number of interrupts is still going up.
>
> Should I do this even if the guest doesn't respond ?

It might be interesting to see what the perf counters say and where EIP is.

> > Also, please can you have a go using the latest unstable version of Xen.
> > It's possible that the new ACPI code might make a difference.
>
> Can I use my existing dom0 kernel (2.6.10), or should I compile one
> based on the patch in xen-unstable ?

You'll need to recompile with the 2.6.11 sparse tree in the unstable tree. The ACPI changes Ian mentioned (which *might* help you) result in a fairly crucial change in the Xen <-> dom0 API.

Cheers,
Mark
All,

> You'll need to recompile with the 2.6.11 sparse tree in the unstable tree.
> The ACPI changes Ian mentioned (which *might* help you) result in a fairly
> crucial change in the Xen <->dom0 API.

Is it OK to patch 2.6.11 up to 2.6.11.8 and use Xen with that one?

Thanks,
-Rob
> Is it ok to patch 2.6.11 up to .8 and use xen with that one ?

That patch should apply fine. I doubt it'll have any impact on Xen or on the problems you're seeing, so it's probably OK.

Cheers,
Mark