Hi,

Got a testbed Proliant DL380G2, internal SmartArray 5i disabled, have SmartArray 5300 with 6 disks.

Installed 6.1-REL last week, rebuilt to -STABLE three times since without a problem. However, when I try to add various (random) applications the box locks up solid during "configure" or "make". Apps tested are postfix, jdk15, OpenOffice-2.0 and more. When it hangs the only thing remaining is power cycling, no keyboard access possible.

What puzzles me here is that now I installed -CURRENT from yesterday on the same box without any other changes and it works like a charm; so far I'm through building close to 50 ports including kde with dependencies.

Next step I guess is to go back to RELENG and check again. Meanwhile, does anyone out there have an idea where I should look? This box is going into production and I'm not brave enough to run -CURRENT...

Thanks,

Daniel Bond wrote:
> On 22:22 Mon 24 Jul, Per olof Ljungmark wrote:
>> Hi,
>>
>> Got a testbed Proliant DL380G2, internal SmartArray 5i disabled, have SmartArray 5300 with 6 disks.
>>
>> Installed 6.1-REL last week, rebuilt to -STABLE three times since without a problem. However, when I try to
>> add various (random) applications the box locks up solid during "configure" or "make". Apps tested are
>> postfix, jdk15, OpenOffice-2.0 and more. When it hangs the only thing remaining is power cycling, no
>> keyboard access possible.
>>
>> What puzzles me here is that now I installed -CURRENT from yesterday on same box without any other changes
>> and it works like charm, so far I'm through building close to 50 ports including kde with dependencies.
>>
>> Next step I guess is to go back to RELENG and check again. Meanwhile, does anyone out there have an idea
>> where I should look? This box is going into production and I'm not brave enough to run -CURRENT...
>
> Hi,
>
> try to disable ACPI. It can be useful for reading CPU temperature and fan
> speeds, but has no real function on a server. It's mostly useful for power
> management, suspend and resume support. From the handbook:
>
> Most system hangs are a result of lost interrupts or an interrupt storm.
> Chipsets have a lot of problems based on how the BIOS configures interrupts
> before boot, correctness of the APIC (MADT) table, and routing of the System
> Control Interrupt (SCI).
>
> Interrupt storms can be distinguished from lost interrupts by checking the
> output of vmstat -i and looking at the line that has acpi0. If the counter is
> increasing at more than a couple per second, you have an interrupt storm. If
> the system appears hung, try breaking to DDB (CTRL+ALT+ESC on console) and
> type show interrupts.
>
> Your best hope when dealing with interrupt problems is to try disabling APIC
> support with hint.apic.0.disabled="1" in loader.conf
>

Sorry, I should have mentioned that I tried both ways with the same outcome.

The 5300 has a battery-backed cache; I guess I should try to run the box off the 5i to check. I have several 360/380 G1/2/3's here too and never saw this before.

I did:

* Installed hw; iLO card + one Intel em0 + the 5300
* Booted 6.1-REL CD, installed base system including ports tree and sources
* pkg_add -r cvsup-without-gui
* fetched 6-STABLE sources
* Edited the kernel config (took out 486/586, added SMP/APIC)
* rebuilt and installed world (I usually do this a few times over to check for hardware problems)
* Installed postfix, that worked ok.
* Next app (don't remember which one, sorry) hung the box
* fetched 6-STABLE sources again
* rebuilt and installed world, worked fine
* tried again to compile apps, no joy. Hangs at random places, no error messages, just locks.
* fetched 6-STABLE sources again
* rebuilt and installed world, worked fine

and finally, fetched -CURRENT, rebuilt and now everything is just great. ACPI is enabled.

The reason I'm running -STABLE is that I expect this one to go into production about the time 6.2 is released.

Thanks,

Mark Saad wrote:
> Hello
> I use many 380's here, G2, G3 and G4's, and I have not seen this yet. I
> am currently using a G3 w/o any issues with RELENG_6.
> This box is a jumpstart and buildmaster for my office and this is the DL
> I use 6.1 on the most. As for the G2, I moved away from them for the most
> part, but I have one here in my office I could check out if you have a
> list of what you did. I have two questions for you first: why are you
> using the SA5300, and are you running famd or gamin on the server, or an
> nfs client attached to the server?
>
>
> Per olof Ljungmark wrote:
>> Hi,
>>
>> Got a testbed Proliant DL380G2, internal SmartArray 5i disabled, have
>> SmartArray 5300 with 6 disks.
>>
>> Installed 6.1-REL last week, rebuilt to -STABLE three times since
>> without a problem. However, when I try to add various (random)
>> applications the box locks up solid during "configure" or "make". Apps
>> tested are postfix, jdk15, OpenOffice-2.0 and more. When it hangs the
>> only thing remaining is power cycling, no keyboard access possible.
>>
>> What puzzles me here is that now I installed -CURRENT from yesterday
>> on same box without any other changes and it works like charm, so far
>> I'm through building close to 50 ports including kde with dependencies.
>>
>> Next step I guess is to go back to RELENG and check again. Meanwhile,
>> does anyone out there have an idea where I should look? This box is
>> going into production and I'm not brave enough to run -CURRENT...
>>
>> Thanks,
>> _______________________________________________
>> freebsd-stable@freebsd.org mailing list
>> http://lists.freebsd.org/mailman/listinfo/freebsd-stable
>> To unsubscribe, send any mail to "freebsd-stable-unsubscribe@freebsd.org"
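
For anyone who wants to automate the handbook check quoted earlier in the thread (watch the acpi0 line of vmstat -i and see whether its counter climbs by more than a couple per second), here is a rough sketch in Python. It is not from the handbook or from anyone in this thread. The function names, the 2.0/second threshold (my reading of "a couple per second") and the sample text are all assumptions; in particular, the sample rows only imitate the usual "source: name total rate" layout of vmstat -i, so check them against real output from your own box before trusting the parser.

```python
def parse_vmstat_i(text):
    """Return {device name: total interrupt count} from `vmstat -i` style text.

    Assumes rows shaped like "irq9: acpi0   1234   0": a source ending
    in ':', one or more device names, then total and rate as integers.
    """
    totals = {}
    for line in text.splitlines():
        fields = line.split()
        if len(fields) < 4 or not fields[0].endswith(":"):
            continue  # header, "Total" row, or blank line
        total, rate = fields[-2], fields[-1]
        if not (total.isdigit() and rate.isdigit()):
            continue
        name = " ".join(fields[1:-2])
        totals[name] = int(total)
    return totals


def interrupt_rate(before, after, seconds, device="acpi0"):
    """Interrupts per second for `device` between two snapshots."""
    first = parse_vmstat_i(before)
    second = parse_vmstat_i(after)
    return (second[device] - first[device]) / seconds


def looks_like_storm(rate, threshold=2.0):
    """True if the rate exceeds the (assumed) 'couple per second' limit."""
    return rate > threshold


# Illustrative snapshots, taken 10 seconds apart (made-up numbers).
SAMPLE_BEFORE = """\
interrupt                          total       rate
irq1: atkbd0                           2          0
irq9: acpi0                          100          0
cpu0: timer                       400000       1999
Total                             400102       1999
"""

SAMPLE_AFTER = """\
interrupt                          total       rate
irq1: atkbd0                           2          0
irq9: acpi0                         5100         25
cpu0: timer                       420000       1999
Total                             425102       2024
"""

if __name__ == "__main__":
    rate = interrupt_rate(SAMPLE_BEFORE, SAMPLE_AFTER, seconds=10)
    print("acpi0 rate per second:", rate)
    print("interrupt storm suspected:", looks_like_storm(rate))
```

To use it on a live system you would capture the output of vmstat -i twice, a known number of seconds apart, and pass the two strings in. It obviously only helps while the box is still responsive; once it has locked up solid, breaking to DDB as the handbook suggests is the only route.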